Skip to main content

2014 | OriginalPaper | Buchkapitel

A Robust Approach to Extraction of Texts from Camera Captured Images

verfasst von : Sudipto Banerjee, Koustav Mullick, Ujjwal Bhattacharya

Erschienen in: Camera-Based Document Analysis and Recognition

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Here, we present our recent study of a robust but simple approach to extraction of texts from camera-captured images. In the proposed approach, we first identify pixels which are highly specular. Connected components of this set of specular pixels are obtained. Pixels belonging to each such component are separately binarized using the well-known Otsu’s approach. We next apply smoothing on the whole image before obtaining its Canny edge representation. Bounding rectangle of each connected component of the Canny edge image is obtained and multiple components with pairwise overlapping bounding boxes are merged. Otsu’s thresholding technique is applied separately on different parts of input image defined by the resulting bounding boxes. Although Otsu’s thresholding approach does not generally provide acceptable performance on camera captured images, we observed its suitability when applied severally as in the above. The binarized specular components obtained at the initial stage replace the corresponding regions of the latter binarized image. Finally, a set of postprocessing operations is used to remove certain non-text components of the binarized image.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst., Man Cybern. 9(1), 62–66 (1979)CrossRefMathSciNet Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst., Man Cybern. 9(1), 62–66 (1979)CrossRefMathSciNet
2.
Zurück zum Zitat Kittler, J., Illingworth, J., Foglein, J.: Threshold selection based on a simple image statistic. Comp. Vision Graph. Image Proc. 30(2), 125–147 (1985) Kittler, J., Illingworth, J., Foglein, J.: Threshold selection based on a simple image statistic. Comp. Vision Graph. Image Proc. 30(2), 125–147 (1985)
3.
Zurück zum Zitat Sauvola, J.J., Pietikainen, M.: Adaptive document image binarization. Patt. Recog. 33(2), 225–236 (2000) Sauvola, J.J., Pietikainen, M.: Adaptive document image binarization. Patt. Recog. 33(2), 225–236 (2000)
4.
Zurück zum Zitat Niblack, W.: An Introduction to Digital Image Processing. Prentice Hall, New York (1986) Niblack, W.: An Introduction to Digital Image Processing. Prentice Hall, New York (1986)
5.
Zurück zum Zitat Stathis, P., Kavallieratou, E., Papamarkos, N.: An evaluation technique for binarization algorithms. J. Univ. Comp. Sci. 14(18), 3011–3030 (2008) Stathis, P., Kavallieratou, E., Papamarkos, N.: An evaluation technique for binarization algorithms. J. Univ. Comp. Sci. 14(18), 3011–3030 (2008)
6.
Zurück zum Zitat Peng, X., Setlur, S., Govindaraju, V., Sitaram, R.: Markov random field based binarization for hand-held devices captured document images. In: Proceedings of Indian Conference on Comp. Vision Graph. Image Proceedings, pp. 71–76 (2010) Peng, X., Setlur, S., Govindaraju, V., Sitaram, R.: Markov random field based binarization for hand-held devices captured document images. In: Proceedings of Indian Conference on Comp. Vision Graph. Image Proceedings, pp. 71–76 (2010)
7.
Zurück zum Zitat Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: Proceedings of the 7th Internationl Conference on Document Analysis and Recognition, pp. 682–687 (2003) Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: Proceedings of the 7th Internationl Conference on Document Analysis and Recognition, pp. 682–687 (2003)
8.
Zurück zum Zitat Shafer, S.A.: Using color to separate reflection components. Color Res. Appl. 10, 210–218 (1985)CrossRef Shafer, S.A.: Using color to separate reflection components. Color Res. Appl. 10, 210–218 (1985)CrossRef
9.
Zurück zum Zitat He, Y., et al.: Enhancement of camera-based whiteboard images. In: XVII-DRR (SPIE Proceedings Series, vol. 7534, pp. 1–10 (2010) He, Y., et al.: Enhancement of camera-based whiteboard images. In: XVII-DRR (SPIE Proceedings Series, vol. 7534, pp. 1–10 (2010)
10.
Zurück zum Zitat Canny, J.: A computational approach to edge detection. IEEE Trans. Patt. Anal. Mach. Intell. 8(6), 679–698 (1986)CrossRef Canny, J.: A computational approach to edge detection. IEEE Trans. Patt. Anal. Mach. Intell. 8(6), 679–698 (1986)CrossRef
11.
Zurück zum Zitat Roy Chowdhury, A., Bhattacharya, U., Parui, S.K.: Text detection of two major Indian scripts in natural scene images. In: Iwamura, M., Shafait, F. (eds.) CBDAR 2011. LNCS, vol. 7139, pp. 42–57. Springer, Heidelberg (2012) Roy Chowdhury, A., Bhattacharya, U., Parui, S.K.: Text detection of two major Indian scripts in natural scene images. In: Iwamura, M., Shafait, F. (eds.) CBDAR 2011. LNCS, vol. 7139, pp. 42–57. Springer, Heidelberg (2012)
12.
Zurück zum Zitat Roy Chowdhury, A., Bhattacharya, U., Parui, S.K.: Scene text detection using sparse stroke information and MLP. In: Proceedings of International Conference on Pattern Recognition, pp. 294–297 (2012) Roy Chowdhury, A., Bhattacharya, U., Parui, S.K.: Scene text detection using sparse stroke information and MLP. In: Proceedings of International Conference on Pattern Recognition, pp. 294–297 (2012)
13.
Zurück zum Zitat Kasar, T. et al.: Font and background color independent text binarization. In: Proceedings of CBDAR, pp. 3–9 (2007) Kasar, T. et al.: Font and background color independent text binarization. In: Proceedings of CBDAR, pp. 3–9 (2007)
14.
Zurück zum Zitat Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proceedings of CVPR, pp. 2963–2970 (2010) Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proceedings of CVPR, pp. 2963–2970 (2010)
15.
Zurück zum Zitat Borgefors, G.: Distance transformations in digital images. Comp. Vis. Graph. Image Proc. 34, 344–371 (1986)CrossRef Borgefors, G.: Distance transformations in digital images. Comp. Vis. Graph. Image Proc. 34, 344–371 (1986)CrossRef
16.
Zurück zum Zitat Chen, H., et al.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: Proceedings of ICIP (2011) Chen, H., et al.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: Proceedings of ICIP (2011)
17.
Zurück zum Zitat Merino-Gracia, C., Lenc, K., Mirmehdi, M: A head-mounted device for recognizing text in natural scenes. In: Proceedings of CBDAR, pp. 27–32 (2011) Merino-Gracia, C., Lenc, K., Mirmehdi, M: A head-mounted device for recognizing text in natural scenes. In: Proceedings of CBDAR, pp. 27–32 (2011)
18.
Zurück zum Zitat Zhang, J., Kasturi, R.: Text detection using edge gradient and graph spectrum. In: Proceedings of ICPR, pp. 3979–3982 (2010) Zhang, J., Kasturi, R.: Text detection using edge gradient and graph spectrum. In: Proceedings of ICPR, pp. 3979–3982 (2010)
Metadaten
Titel
A Robust Approach to Extraction of Texts from Camera Captured Images
verfasst von
Sudipto Banerjee
Koustav Mullick
Ujjwal Bhattacharya
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-05167-3_3

Premium Partner