Abstract
The analysis of images of printed pages of text is considered. Since printed text can be viewed as textured line, the use of the Hough transform for detecting straight lines is proposed as an analysis tool. Methods for handling several discretization problems that arise in mapping the rectangular image space to the (ρ, Θ) accumulator array are described. Several applications of analyzing the accumulator array are proposed. They include detecting the text skew angle, determining the signature of a text line so as to accept or reject a block as containing only text, using profile analysis to segment text into lines, and determining whether a textual block is rightside-up or otherwise.
Similar content being viewed by others
References
Baird HS (1987) The skew angle of printed documents. In: Proceedings of Society of Photographic Scientists and Engineers, Rochester, NY, vol. 40, pp 21–24
Ballard DH (1981) Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognition
Brown CM (1983) Inherent bias and noise in the Hough transform. IEEE Transactions on Pattern Analysis and Machine Intelligence 5(5):493–505, September
Doster W (1984) Different states of a documents content on its way from the Gutenbergian world to the electronic world. In: Proceedings of the Seventh International Conference on Pattern Recognition, Montreal, 2, pp 872–874
Duda RO, Hart PE (1972) Use of the Hough transform to detect lines and curves in pictures. Communications of the ACM 15:11–15
Feldman JA (1985) Connectionist models and parallelism in high-level vision. TR 146, Department of Computer Science, University of Rochester, NY, January
Fletcher LA, Kasturi R (1988) A robust algorithm for text string separation from mixed text/graphics images. IEEE Transactions on Pattern Analysis and Machine Intelligence 10(6):910–918, November
Gorman F, Clowes MB (1976) Finding picture edges through collinearity of feature points. IEEE Transactions on Computers C-25:449–456
Hough PVC (1962) Method and means for recognizing complex patterns. US Patent 3,069,654, December 18
Nagy G, Seth SC, Stoddard SD (1985) Document analysis with an expert system. In: Proceedings of the Pattern Recognition in Practice II, Amsterdam, June 19–21
Rastogi A, Srihari SN (1986) Recognizing textual blocks in document images using the Hough transform. TR 87-01, Dept. of Computer Science, SUNY Buffalo, NY, January
Shapiro SD, Iannino A (1979) Geometric constructions for predicting Hough transform performance. IEEE Transactions on Pattern Analysis and Machine Intelligence 1(3):310–316, July
Srihari SN, Zack GM (1986) Document image analysis. In: Proceedings of the Eighth International Conference on Pattern Recognition, Paris, pp 434–436
Suen Y Ching (1979) N-gram statistics for natural language understanding and text processing. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1:164–172, April
Wallace RS (1985) A modified Hough transform for lines. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp 665–667
Wahl FM, Wang KY, Casey RG (1982) Block segmentation and text extraction in mixed text/image documents. Computer Graphics and Image Processing 20:375–390
Wang D, Srihari SN (in press) Classification of newspaper image blocks using texture analysis. Computer Vision, Graphics and Image Processing, in press
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Srihari, S.N., Govindaraju, V. Analysis of textual images using the Hough transform. Machine Vis. Apps. 2, 141–153 (1989). https://doi.org/10.1007/BF01212455
Issue Date:
DOI: https://doi.org/10.1007/BF01212455