Skip to main content

2015 | OriginalPaper | Buchkapitel

Text Localization Based on Fast Feature Pyramids and Multi-Resolution Maximally Stable Extremal Regions

verfasst von : Alessandro Zamberletti, Lucia Noce, Ignazio Gallo

Erschienen in: Computer Vision - ACCV 2014 Workshops

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Text localization from scene images is a challenging task that finds application in many areas. In this work, we propose a novel hybrid text localization approach that exploits Multi-resolution Maximally Stable Extremal Regions to discard false-positive detections from the text confidence maps generated by a Fast Feature Pyramid based sliding window classifier. The use of a multi-scale approach during both feature computation and connected component extraction allows our method to identify uncommon text elements that are usually not detected by competing algorithms, while the adoption of approximated features and appropriately filtered connected components assures a low overall computational complexity of the proposed system.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Pan, Y.F., Hou, X., Liu, C.L.: Text localization in natural scene images based on conditional random field. In: Proceedings of the ICDAR (2009) Pan, Y.F., Hou, X., Liu, C.L.: Text localization in natural scene images based on conditional random field. In: Proceedings of the ICDAR (2009)
2.
Zurück zum Zitat Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Wu, D.J., Ng, A.Y.: Text detection and character recognition in scene images with unsupervised feature learning. In: Proceedings of the ICDAR (2011) Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Wu, D.J., Ng, A.Y.: Text detection and character recognition in scene images with unsupervised feature learning. In: Proceedings of the ICDAR (2011)
3.
Zurück zum Zitat Mishra, A., Alahari, K., Jawahar, C.: Scene text recognition using higher order language priors. In: Proceedings of the BVMC (2012) Mishra, A., Alahari, K., Jawahar, C.: Scene text recognition using higher order language priors. In: Proceedings of the BVMC (2012)
4.
Zurück zum Zitat Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Proceedings of the ICCV (2011) Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Proceedings of the ICCV (2011)
5.
Zurück zum Zitat Koo, H.I., Kim, D.H.: Scene text detection via connected component clustering and non-text filtering. IEEE Trans. IP 22, 2296–2305 (2013)MathSciNet Koo, H.I., Kim, D.H.: Scene text detection via connected component clustering and non-text filtering. IEEE Trans. IP 22, 2296–2305 (2013)MathSciNet
6.
Zurück zum Zitat Li, Y., Jia, W., Shen, C., Hengel, A.: Characterness: an indicator of text in the wild. IEEE Trans. IP 23, 1666–1677 (2014) Li, Y., Jia, W., Shen, C., Hengel, A.: Characterness: an indicator of text in the wild. IEEE Trans. IP 23, 1666–1677 (2014)
7.
Zurück zum Zitat Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011) CrossRef Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011) CrossRef
8.
Zurück zum Zitat Shi, C., Wang, C., Xiao, B., Zhang, Y., Gao, S.: Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn. Lett. 34, 107–116 (2013)CrossRef Shi, C., Wang, C., Xiao, B., Zhang, Y., Gao, S.: Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn. Lett. 34, 107–116 (2013)CrossRef
9.
Zurück zum Zitat Yin, X.C., Yin, X., Huang, K.: Robust text detection in natural scene images. IEEE Trans. PAMI 36, 970–983 (2013)MathSciNet Yin, X.C., Yin, X., Huang, K.: Robust text detection in natural scene images. IEEE Trans. PAMI 36, 970–983 (2013)MathSciNet
10.
Zurück zum Zitat Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proceedings of the CVPR (2010) Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proceedings of the CVPR (2010)
11.
Zurück zum Zitat Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: Proceedings of the BMVC (2002) Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: Proceedings of the BMVC (2002)
12.
Zurück zum Zitat Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Bigorda, L., Mestre, S., Mas, J., Mota, D., Almaz, J., Heras, L.: ICDAR 2013 robust reading competition. In: Proceedings of the ICDAR (2013) Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Bigorda, L., Mestre, S., Mas, J., Mota, D., Almaz, J., Heras, L.: ICDAR 2013 robust reading competition. In: Proceedings of the ICDAR (2013)
13.
Zurück zum Zitat Dollár, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. PAMI 36, 1532–1545 (2014)CrossRef Dollár, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. PAMI 36, 1532–1545 (2014)CrossRef
14.
Zurück zum Zitat Forssén, P.E., Lowe, D.G.: Shape descriptors for maximally stable extremal regions. In: Proceedings of the ICCV (2007) Forssén, P.E., Lowe, D.G.: Shape descriptors for maximally stable extremal regions. In: Proceedings of the ICCV (2007)
15.
Zurück zum Zitat Crimisi, A.: Microsoft Research Cambridge Object Recognition Image Database (2004) Crimisi, A.: Microsoft Research Cambridge Object Recognition Image Database (2004)
16.
Zurück zum Zitat Yao, C., Bai, X., Liu, W., Ma, Y.: Detecting texts of arbitrary orientations in natural images. In: Proceedings of the CVPR (2010) Yao, C., Bai, X., Liu, W., Ma, Y.: Detecting texts of arbitrary orientations in natural images. In: Proceedings of the CVPR (2010)
17.
Zurück zum Zitat Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Proceedings of the CVPR (2012) Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Proceedings of the CVPR (2012)
18.
Zurück zum Zitat Mathias, M., Timofte, R., Benenson, R., Gool, L.V.: Traffic sign recognition: how far are we from the solution? In: Proceedings of the IJCNN (2013) Mathias, M., Timofte, R., Benenson, R., Gool, L.V.: Traffic sign recognition: how far are we from the solution? In: Proceedings of the IJCNN (2013)
19.
Zurück zum Zitat Benenson, R., Mathias, M., Tuytelaars, T., Gool, L.V.: Seeking the strongest rigid detector. In: Proceedings of the CVPR (2013) Benenson, R., Mathias, M., Tuytelaars, T., Gool, L.V.: Seeking the strongest rigid detector. In: Proceedings of the CVPR (2013)
20.
Zurück zum Zitat Appeal, R., Fuchs, T., Dollár, P., Perona, P.: Quickly boosting decision trees pruning underachieving features early. In: Proceedings of the ICML (2013) Appeal, R., Fuchs, T., Dollár, P., Perona, P.: Quickly boosting decision trees pruning underachieving features early. In: Proceedings of the ICML (2013)
21.
Zurück zum Zitat Villamizar, M., Andrade-Cetto, J., Sanfeliu, A., Moreno-Noguer, F.: Bootstrapping boosted random ferns for discriminative and efficient object classification. Pattern Recogn. 45, 3141–3153 (2012)CrossRef Villamizar, M., Andrade-Cetto, J., Sanfeliu, A., Moreno-Noguer, F.: Bootstrapping boosted random ferns for discriminative and efficient object classification. Pattern Recogn. 45, 3141–3153 (2012)CrossRef
22.
Zurück zum Zitat de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the VISAPP (2009) de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the VISAPP (2009)
23.
Zurück zum Zitat Alexe, B., Deselaers, T., Ferrari, V.: What is an object? In: Proceedings of the CVPR (2010) Alexe, B., Deselaers, T., Ferrari, V.: What is an object? In: Proceedings of the CVPR (2010)
24.
Zurück zum Zitat Manen, S., Guillaumin, M., Gool, L.V.: Prime object proposals with randomized prims algorithm. In: Proceedings of the ICCV (2013) Manen, S., Guillaumin, M., Gool, L.V.: Prime object proposals with randomized prims algorithm. In: Proceedings of the ICCV (2013)
25.
Zurück zum Zitat Yi, C., Tian, Y.: Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification. IEEE Trans. IP 21, 4256–4268 (2012)MathSciNet Yi, C., Tian, Y.: Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification. IEEE Trans. IP 21, 4256–4268 (2012)MathSciNet
26.
Zurück zum Zitat Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: Proceedings of the ICDAR (2013) Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: Proceedings of the ICDAR (2013)
27.
Zurück zum Zitat Bai, B., Yin, F., Liu, C.L.: Scene text localization using gradient local correlation. In: Proceedings of the ICDAR (2013) Bai, B., Yin, F., Liu, C.L.: Scene text localization using gradient local correlation. In: Proceedings of the ICDAR (2013)
28.
Zurück zum Zitat Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competition. In: Proceedings of the ICDAR (2003) Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competition. In: Proceedings of the ICDAR (2003)
29.
Zurück zum Zitat Wolf, C., Jolion, J.M.: Object count/area graphs for the evaluation of object detection and segmentation algorithms. IJDAR 8, 280–296 (2006)CrossRef Wolf, C., Jolion, J.M.: Object count/area graphs for the evaluation of object detection and segmentation algorithms. IJDAR 8, 280–296 (2006)CrossRef
Metadaten
Titel
Text Localization Based on Fast Feature Pyramids and Multi-Resolution Maximally Stable Extremal Regions
verfasst von
Alessandro Zamberletti
Lucia Noce
Ignazio Gallo
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-16631-5_7