Segmentation and classification of mixed text/graphics/image documents☆
References (7)
- et al.
Page segmentation and classification
CVGIP: Graphical Models and Image Processing
(1992) - et al.
Hierarchical representation of optically scanned documents
- et al.
Cited by (50)
Complex layout analysis based on contour classification and morphological operations
2017, Engineering Applications of Artificial IntelligenceCitation Excerpt :Tsujimoto and Asada use the same smearing algorithm in order to aggregate adjacent connected components into segments by connecting two black runs separated by a small gap (Tsujimoto and Asada, 1992). Fan et al. also perform a run-length smearing operation and then merge consecutive horizontal stripes into paragraphs (Fan et al., 1994). Sun proposed a modified smearing algorithm, called selective CRLA, capable of processing documents with non-Manhattan (containing non-rectangular blocks) layouts (Sun, 2005).
Tensor representation learning based image patch analysis for text identification and recognition
2015, Pattern RecognitionCitation Excerpt :Subsequently, text lines are combined into blocks. Based on the run-length smearing algorithm (RLSA) [19], Fan et al. [20] proposed a document analysis system. RLSA has the effect of linking together neighboring black areas that are separated by less than L pixels, where L is a predefined number.
Parameter free approach for segmenting complex manhattan layouts
2023, Multimedia Tools and ApplicationsBackground grid extraction from historical hand-drawn cadastral maps
2023, International Journal on Document Analysis and RecognitionCan Deep Learning Approaches Detect Complex Text? Case of Onomatopoeia in Comics Albums
2023, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)Document Region Segmentation
2023, SpringerBriefs in Computer Science
- ☆
This work is supported by National Science Council of Taiwan under grant NSC-83-0408-E-008-001.