Skip to main content

2017 | OriginalPaper | Buchkapitel

An Efficient Detection Method for Text of Arbitrary Orientations in Natural Images

verfasst von : Lanfang Dong, Zhongdi Chao, Jianfu Wang

Erschienen in: Wearable Sensors and Robots

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Due to the high complexity of natural scenes, text detection is always a critical yet challenging task. On the basis of existing character detection method, a novel text line detection method is proposed in this paper, which can localize text of arbitrary orientation by using related information of character regions in candidate text line. First, inspired by the Hough transform, text line detection problem is regarded as line detection problem in candidate characters set obtained by Most Stable Extremal Regions (MSERs). Second, in order to find out the relationship of adjacent candidate regions, a graph model is built based on some constraints and adjacent candidates are linked into pairs to obtain search domain. Then, to avoid repeated calculation of the same line, some strategies need to be used. Finally, as some of the potential text lines are incorrect, we use a new text line descriptor to exclude the non-text areas. Experimental results on the ICDAR 2013 competition dataset and MSRA-TD500 show that the proposed approach is favorable no matter for non-horizontal text or horizontal text.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Chen X, Yuille AL (2004) Detecting and reading text in natural scenes. In: Computer vision and pattern recognition. CVPR 2004. Conference on Proceedings of the 2004 IEEE computer society. IEEE 2004, vol 2, pp II-366-II-373. doi:10.1109/CVPR.2004.1315187 Chen X, Yuille AL (2004) Detecting and reading text in natural scenes. In: Computer vision and pattern recognition. CVPR 2004. Conference on Proceedings of the 2004 IEEE computer society. IEEE 2004, vol 2, pp II-366-II-373. doi:10.​1109/​CVPR.​2004.​1315187
Zurück zum Zitat Du Y, Ai H, Lao S (2011) Dot text detection based on fast points. In: International conference on document analysis and recognition (ICDAR). IEEE 2011, pp 435–439. doi:10.1109/ICDAR.2011.94 Du Y, Ai H, Lao S (2011) Dot text detection based on fast points. In: International conference on document analysis and recognition (ICDAR). IEEE 2011, pp 435–439. doi:10.​1109/​ICDAR.​2011.​94
Zurück zum Zitat Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: IEEE Conference on computer vision and pattern recognition (CVPR). IEEE 2010, pp 2963–2970. doi:10.1109/CVPR.2010.5540041 Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: IEEE Conference on computer vision and pattern recognition (CVPR). IEEE 2010, pp 2963–2970. doi:10.​1109/​CVPR.​2010.​5540041
Zurück zum Zitat Huang W, Lin Z, Yang J, Wang J (2013). Text localization in natural images using stroke feature transform and text covariance descriptors. In: IEEE international conference on computer vision (ICCV). IEEE 2013, pp 1241–1248. doi:10.1109/ICCV.2013.157 Huang W, Lin Z, Yang J, Wang J (2013). Text localization in natural images using stroke feature transform and text covariance descriptors. In: IEEE international conference on computer vision (ICCV). IEEE 2013, pp 1241–1248. doi:10.​1109/​ICCV.​2013.​157
Zurück zum Zitat Huang X (2012) Automatic video text detection and localization based on coarseness texture. In: Fifth international conference on intelligent computation technology and automation (ICICTA). IEEE 2012, pp 398–401. doi:10.1109/ICICTA.2012.106 Huang X (2012) Automatic video text detection and localization based on coarseness texture. In: Fifth international conference on intelligent computation technology and automation (ICICTA). IEEE 2012, pp 398–401. doi:10.​1109/​ICICTA.​2012.​106
Zurück zum Zitat Iqbal K, Yin XC, Hao HW, Asghar S, Ali H (2014) Bayesian network scores based text localization in scene images. In: International joint conference on neural networks (IJCNN). IEEE 2014, pp 2218–2225. doi:10.1109/IJCNN.2014.6889731 Iqbal K, Yin XC, Hao HW, Asghar S, Ali H (2014) Bayesian network scores based text localization in scene images. In: International joint conference on neural networks (IJCNN). IEEE 2014, pp 2218–2225. doi:10.​1109/​IJCNN.​2014.​6889731
Zurück zum Zitat Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767CrossRef Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767CrossRef
Zurück zum Zitat Mancas-Thillou C, Gosselin B (2006) Natural scene text understanding. na, Ann Arbor Mancas-Thillou C, Gosselin B (2006) Natural scene text understanding. na, Ann Arbor
Zurück zum Zitat Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE 2012, pp 3538–3545. doi:10.1109/CVPR.2012.6248097 Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE 2012, pp 3538–3545. doi:10.​1109/​CVPR.​2012.​6248097
Zurück zum Zitat Neumann L, Matas J (2013) Scene text localization and recognition with oriented stroke detection. In: IEEE international conference on computer vision (ICCV). IEEE 2013, pp 97–104. doi:10.1109/ICCV.2013.19 Neumann L, Matas J (2013) Scene text localization and recognition with oriented stroke detection. In: IEEE international conference on computer vision (ICCV). IEEE 2013, pp 97–104. doi:10.​1109/​ICCV.​2013.​19
Zurück zum Zitat Shekar BH, Smitha ML, Shivakumara P (2014) Discrete wavelet transform and gradient difference based approach for text localization in videos. In: 2014 fifth international conference on signal and image processing (ICSIP). IEEE 2014, pp 280–284. doi:10.1109/ICSIP.2014.50 Shekar BH, Smitha ML, Shivakumara P (2014) Discrete wavelet transform and gradient difference based approach for text localization in videos. In: 2014 fifth international conference on signal and image processing (ICSIP). IEEE 2014, pp 280–284. doi:10.​1109/​ICSIP.​2014.​50
Zurück zum Zitat Wan M, Zhang F, Cheng H, Liu Q (2008) Text localization in spam image using edge features. In: International conference on communications, circuits and systems. ICCCAS 2008. IEEE 2008, pp 838–842. doi:10.1109/ICCCAS.2008.4657900 Wan M, Zhang F, Cheng H, Liu Q (2008) Text localization in spam image using edge features. In: International conference on communications, circuits and systems. ICCCAS 2008. IEEE 2008, pp 838–842. doi:10.​1109/​ICCCAS.​2008.​4657900
Zurück zum Zitat Wang K, Babenko B, Belongie S (2011) End-to-end scene text recognition. In: IEEE International Conference on Computer Vision (ICCV). IEEE 2011, pp 1457–1464. doi:10.1109/ICCV.2011.6126402 Wang K, Babenko B, Belongie S (2011) End-to-end scene text recognition. In: IEEE International Conference on Computer Vision (ICCV). IEEE 2011, pp 1457–1464. doi:10.​1109/​ICCV.​2011.​6126402
Zurück zum Zitat Wen W, Huang X, Yang L, Yang Z, Zhang P (2009) An efficient method for text location and segmentation. In:. WRI world congress on software engineering. WCSE’09. IEEE 2009, vol 3, pp 3–7. doi:10.1109/WCSE.2009.292 Wen W, Huang X, Yang L, Yang Z, Zhang P (2009) An efficient method for text location and segmentation. In:. WRI world congress on software engineering. WCSE’09. IEEE 2009, vol 3, pp 3–7. doi:10.​1109/​WCSE.​2009.​292
Zurück zum Zitat Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: IEEE Conference on computer vision and pattern recognition (CVPR). IEEE 2012, pp 1083–1090. doi:10.1109/CVPR.2012.6247787 Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: IEEE Conference on computer vision and pattern recognition (CVPR). IEEE 2012, pp 1083–1090. doi:10.​1109/​CVPR.​2012.​6247787
Metadaten
Titel
An Efficient Detection Method for Text of Arbitrary Orientations in Natural Images
verfasst von
Lanfang Dong
Zhongdi Chao
Jianfu Wang
Copyright-Jahr
2017
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-2404-7_35

Neuer Inhalt