Skip to main content
Log in

Analysis of textual images using the Hough transform

  • Published:
Machine Vision and Applications Aims and scope Submit manuscript

Abstract

The analysis of images of printed pages of text is considered. Since printed text can be viewed as textured line, the use of the Hough transform for detecting straight lines is proposed as an analysis tool. Methods for handling several discretization problems that arise in mapping the rectangular image space to the (ρ, Θ) accumulator array are described. Several applications of analyzing the accumulator array are proposed. They include detecting the text skew angle, determining the signature of a text line so as to accept or reject a block as containing only text, using profile analysis to segment text into lines, and determining whether a textual block is rightside-up or otherwise.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Baird HS (1987) The skew angle of printed documents. In: Proceedings of Society of Photographic Scientists and Engineers, Rochester, NY, vol. 40, pp 21–24

    Google Scholar 

  • Ballard DH (1981) Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognition

  • Brown CM (1983) Inherent bias and noise in the Hough transform. IEEE Transactions on Pattern Analysis and Machine Intelligence 5(5):493–505, September

    Google Scholar 

  • Doster W (1984) Different states of a documents content on its way from the Gutenbergian world to the electronic world. In: Proceedings of the Seventh International Conference on Pattern Recognition, Montreal, 2, pp 872–874

    Google Scholar 

  • Duda RO, Hart PE (1972) Use of the Hough transform to detect lines and curves in pictures. Communications of the ACM 15:11–15

    Google Scholar 

  • Feldman JA (1985) Connectionist models and parallelism in high-level vision. TR 146, Department of Computer Science, University of Rochester, NY, January

    Google Scholar 

  • Fletcher LA, Kasturi R (1988) A robust algorithm for text string separation from mixed text/graphics images. IEEE Transactions on Pattern Analysis and Machine Intelligence 10(6):910–918, November

    Google Scholar 

  • Gorman F, Clowes MB (1976) Finding picture edges through collinearity of feature points. IEEE Transactions on Computers C-25:449–456

    Google Scholar 

  • Hough PVC (1962) Method and means for recognizing complex patterns. US Patent 3,069,654, December 18

  • Nagy G, Seth SC, Stoddard SD (1985) Document analysis with an expert system. In: Proceedings of the Pattern Recognition in Practice II, Amsterdam, June 19–21

  • Rastogi A, Srihari SN (1986) Recognizing textual blocks in document images using the Hough transform. TR 87-01, Dept. of Computer Science, SUNY Buffalo, NY, January

    Google Scholar 

  • Shapiro SD, Iannino A (1979) Geometric constructions for predicting Hough transform performance. IEEE Transactions on Pattern Analysis and Machine Intelligence 1(3):310–316, July

    Google Scholar 

  • Srihari SN, Zack GM (1986) Document image analysis. In: Proceedings of the Eighth International Conference on Pattern Recognition, Paris, pp 434–436

  • Suen Y Ching (1979) N-gram statistics for natural language understanding and text processing. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1:164–172, April

    Google Scholar 

  • Wallace RS (1985) A modified Hough transform for lines. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp 665–667

  • Wahl FM, Wang KY, Casey RG (1982) Block segmentation and text extraction in mixed text/image documents. Computer Graphics and Image Processing 20:375–390

    Google Scholar 

  • Wang D, Srihari SN (in press) Classification of newspaper image blocks using texture analysis. Computer Vision, Graphics and Image Processing, in press

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Srihari, S.N., Govindaraju, V. Analysis of textual images using the Hough transform. Machine Vis. Apps. 2, 141–153 (1989). https://doi.org/10.1007/BF01212455

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01212455

Key words

Navigation