Skip to main content

2018 | OriginalPaper | Buchkapitel

A Novel Text Localization Scheme for Camera Captured Document Images

verfasst von : Tauseef Khan, Ayatullah Faruk Mollah

Erschienen in: Proceedings of 2nd International Conference on Computer Vision & Image Processing

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, a hybrid model for detecting text regions from scene images as well as document image is presented. At first, background is suppressed to isolate foreground regions. Then, morphological operations are applied on isolated foreground regions to ensure appropriate region boundary of such objects. Statistical features are extracted from these objects to classify them as text or non-text using a multi-layer perceptron. Classified text components are localized, and non-text ones are ignored. Experimenting on a data set of 227 camera captured images, it is found that the object isolation accuracy is 0.8638 and text non-text classification accuracy is 0.9648. It may be stated that for images with near homogenous background, the present method yields reasonably satisfactory accuracy for practical applications.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhang, Z., Zhang, C., Shen, W., Yao, C., Liu, W., Bai, X.: Multi-oriented text detection with fully convolutional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4159–4167, (2016). Zhang, Z., Zhang, C., Shen, W., Yao, C., Liu, W., Bai, X.: Multi-oriented text detection with fully convolutional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4159–4167, (2016).
2.
Zurück zum Zitat Chen, X., Yuille, A. L.: Detecting and reading text in natural scenes. In. IEEE Conference on Computer Vision and Pattern Recognition, Vol. 2, pp. II-II. (2004). Chen, X., Yuille, A. L.: Detecting and reading text in natural scenes. In. IEEE Conference on Computer Vision and Pattern Recognition, Vol. 2, pp. II-II. (2004).
3.
Zurück zum Zitat Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In. IEEE Conference on Computer Vision and Pattern Recognition pp. 1083–1090, (2012). Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In. IEEE Conference on Computer Vision and Pattern Recognition pp. 1083–1090, (2012).
4.
Zurück zum Zitat Yi, C., Tian, Y.: Text string detection from natural scenes by structure-based partition and grouping. In. IEEE Transactions on Image Processing, pp. 2594–2605, (2011). Yi, C., Tian, Y.: Text string detection from natural scenes by structure-based partition and grouping. In. IEEE Transactions on Image Processing, pp. 2594–2605, (2011).
5.
Zurück zum Zitat Neumann, L., Matas, J.: Real-time scene text localization and recognition., In. IEEE Conference on Computer Vision and Pattern Recognition, pp. 3538–3545. IEEE, (2012). Neumann, L., Matas, J.: Real-time scene text localization and recognition., In. IEEE Conference on Computer Vision and Pattern Recognition, pp. 3538–3545. IEEE, (2012).
6.
Zurück zum Zitat Huang, W., Lin, Z., Yang, J., Wang, J.: Text localization in natural images using stroke feature transform and text covariance descriptors. In. IEEE International Conference on Computer Vision, pp. 1241–1248, (2013). Huang, W., Lin, Z., Yang, J., Wang, J.: Text localization in natural images using stroke feature transform and text covariance descriptors. In. IEEE International Conference on Computer Vision, pp. 1241–1248, (2013).
7.
Zurück zum Zitat Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2963–2970, (2010). Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2963–2970, (2010).
8.
Zurück zum Zitat Zhao, Y., Lu, T. and Liao, W.: A robust color-independent text detection method from complex videos. In International Conference on Document Analysis and Recognition (ICDAR), (pp. 374–378). IEEE, (2011). Zhao, Y., Lu, T. and Liao, W.: A robust color-independent text detection method from complex videos. In International Conference on Document Analysis and Recognition (ICDAR), (pp. 374–378). IEEE, (2011).
9.
Zurück zum Zitat Kim, K. I., Jung, K., Kim, J. H.: Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. In. IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1631–1639 (2003). Kim, K. I., Jung, K., Kim, J. H.: Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. In. IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1631–1639 (2003).
10.
Zurück zum Zitat Taravat, A., Del Frate, F., Cornaro, C., Vergari, S.: Neural networks and support vector machine algorithms for automatic cloud classification of whole-sky ground-based images. In. IEEE Geoscience and remote sensing letters, pp. 666–670 (2015). Taravat, A., Del Frate, F., Cornaro, C., Vergari, S.: Neural networks and support vector machine algorithms for automatic cloud classification of whole-sky ground-based images. In. IEEE Geoscience and remote sensing letters, pp. 666–670 (2015).
11.
Zurück zum Zitat Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Ng, A. Y.: Text detection and character recognition in scene images with unsupervised feature learning. In. IEEE International Conference on Document Analysis and recognition (ICDAR), pp. 440–445, (2011). Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Ng, A. Y.: Text detection and character recognition in scene images with unsupervised feature learning. In. IEEE International Conference on Document Analysis and recognition (ICDAR), pp. 440–445, (2011).
12.
Zurück zum Zitat Shi, Z., Setlur, S., Govindaraju, V.: A steerable directional local profile technique for extraction of handwritten arabic text lines. In. IEEE 10th International Conference on Document Analysis and Recognition (ICDAR), pp. 176–180, IEEE, (2009). Shi, Z., Setlur, S., Govindaraju, V.: A steerable directional local profile technique for extraction of handwritten arabic text lines. In. IEEE 10th International Conference on Document Analysis and Recognition (ICDAR), pp. 176–180, IEEE, (2009).
13.
Zurück zum Zitat Pan, Y. F., Hou, X., Liu, C. L.: A hybrid approach to detect and localize texts in natural scene images. In. IEEE Transactions on Image Processing, pp. 800–813, (2011). Pan, Y. F., Hou, X., Liu, C. L.: A hybrid approach to detect and localize texts in natural scene images. In. IEEE Transactions on Image Processing, pp. 800–813, (2011).
14.
Zurück zum Zitat Dalal, N. and Triggs, B.: Histograms of oriented gradients for human detection. In. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, (Vol. 1, pp. 886–893). IEEE, (2005). Dalal, N. and Triggs, B.: Histograms of oriented gradients for human detection. In. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, (Vol. 1, pp. 886–893). IEEE, (2005).
15.
Zurück zum Zitat Minetto, R., Thome, N., Cord, M., Leite, N.J. and Stolfi, J.: T-HOG: An effective gradient-based descriptor for single line text regions. Pattern recognition, 46(3), pp. 1078–1090, (2013). Minetto, R., Thome, N., Cord, M., Leite, N.J. and Stolfi, J.: T-HOG: An effective gradient-based descriptor for single line text regions. Pattern recognition, 46(3), pp. 1078–1090, (2013).
16.
Zurück zum Zitat Tian, S., Bhattacharya, U., Lu, S., Su, B., Wang, Q., Wei, X., Lu, Y. and Tan, C.L.: Multilingual scene character recognition with co-occurrence of histogram of oriented gradients. Pattern Recognition, 51, pp. 125–134, (2016). Tian, S., Bhattacharya, U., Lu, S., Su, B., Wang, Q., Wei, X., Lu, Y. and Tan, C.L.: Multilingual scene character recognition with co-occurrence of histogram of oriented gradients. Pattern Recognition, 51, pp. 125–134, (2016).
17.
Zurück zum Zitat Ojala, T., Pietikäinen, M. and Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern recognition, 29(1), pp. 51–59, (1996). Ojala, T., Pietikäinen, M. and Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern recognition, 29(1), pp. 51–59, (1996).
18.
Zurück zum Zitat Mäenpää, T. and Pietikäinen, M.: Multi-scale binary patterns for texture analysis. Image analysis, pp. 267–275, (2003). Mäenpää, T. and Pietikäinen, M.: Multi-scale binary patterns for texture analysis. Image analysis, pp. 267–275, (2003).
19.
Zurück zum Zitat Goto, H. and Tanaka, M.: Text-tracking wearable camera system for the blind. In 10th International Conference on Document Analysis and Recognition, ICDAR’09. (pp. 141–145). IEEE, (2009). Goto, H. and Tanaka, M.: Text-tracking wearable camera system for the blind. In 10th International Conference on Document Analysis and Recognition, ICDAR’09. (pp. 141–145). IEEE, (2009).
20.
Zurück zum Zitat Ye, Q., Huang, Q., Gao, W. and Zhao, D.: Fast and robust text detection in images and video frames. Image and Vision Computing, 23(6), pp. 565–576, (2005). Ye, Q., Huang, Q., Gao, W. and Zhao, D.: Fast and robust text detection in images and video frames. Image and Vision Computing, 23(6), pp. 565–576, (2005).
21.
Zurück zum Zitat Ye, Q. and Doermann, D.: Text detection and recognition in imagery: A survey. IEEE transactions on pattern analysis and machine intelligence, 37(7), pp. 1480–1500, (2015). Ye, Q. and Doermann, D.: Text detection and recognition in imagery: A survey. IEEE transactions on pattern analysis and machine intelligence, 37(7), pp. 1480–1500, (2015).
22.
Zurück zum Zitat Liang, J., Doermann, D. and Li, H.: Camera-based analysis of text and documents: a survey. International journal on document analysis and recognition, 7(2), pp. 84–104, (2005). Liang, J., Doermann, D. and Li, H.: Camera-based analysis of text and documents: a survey. International journal on document analysis and recognition, 7(2), pp. 84–104, (2005).
23.
Zurück zum Zitat Song, Y., Liu, A., Pang, L., Lin, S., Zhang, Y., Tang, S.: A novel image text extraction method based on k-means clustering. In. 7th IEEE/ACIS International Conference on Computer and Information Science, pp. 185–190, IEEE, (2008). Song, Y., Liu, A., Pang, L., Lin, S., Zhang, Y., Tang, S.: A novel image text extraction method based on k-means clustering. In. 7th IEEE/ACIS International Conference on Computer and Information Science, pp. 185–190, IEEE, (2008).
24.
Zurück zum Zitat Lu, S., Chen, T., Tian, S., Lim, J. H., Tan, C. L.: Scene text extraction based on edges and support vector regression. In. International Journal on Document Analysis and Recognition (IJDAR), pp. 125–135, (2015). Lu, S., Chen, T., Tian, S., Lim, J. H., Tan, C. L.: Scene text extraction based on edges and support vector regression. In. International Journal on Document Analysis and Recognition (IJDAR), pp. 125–135, (2015).
25.
Zurück zum Zitat Hsieh, J. W., Yu, S. H., Chen, Y. S.: Morphology-based license plate detection from complex scenes. In. 16th IEEE International Conference on Pattern Recognition, Vol. 3, pp. 176–179, (2002). Hsieh, J. W., Yu, S. H., Chen, Y. S.: Morphology-based license plate detection from complex scenes. In. 16th IEEE International Conference on Pattern Recognition, Vol. 3, pp. 176–179, (2002).
26.
Zurück zum Zitat Mollah, A. F., Basu, S., Nasipuri, M.: Text detection from camera captured images using a novel fuzzy-based technique. In. 3rd IEEE International Conference on Emerging Applications of Information Technology (EAIT), pp. 291–294, (2012). Mollah, A. F., Basu, S., Nasipuri, M.: Text detection from camera captured images using a novel fuzzy-based technique. In. 3rd IEEE International Conference on Emerging Applications of Information Technology (EAIT), pp. 291–294, (2012).
27.
Zurück zum Zitat Otsu, N.: A threshold selection method from gray-level histograms. Automatica, pp. 23–27, (1979). Otsu, N.: A threshold selection method from gray-level histograms. Automatica, pp. 23–27, (1979).
Metadaten
Titel
A Novel Text Localization Scheme for Camera Captured Document Images
verfasst von
Tauseef Khan
Ayatullah Faruk Mollah
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-7895-8_20

Neuer Inhalt