2017 | OriginalPaper | Chapter

An Efficient Detection Method for Text of Arbitrary Orientations in Natural Images

Authors: Lanfang Dong, Zhongdi Chao, Jianfu Wang

Published in: Wearable Sensors and Robots

Publisher: Springer Singapore

Abstract

Because natural scenes are highly complex, text detection remains a critical yet challenging task. Building on existing character detection methods, this paper proposes a novel text line detection method that localizes text of arbitrary orientation by using the relationships among character regions in a candidate text line. First, inspired by the Hough transform, text line detection is treated as a line detection problem over the set of candidate characters obtained with Maximally Stable Extremal Regions (MSERs). Second, to capture the relationship between adjacent candidate regions, a graph model is built under a set of constraints, and adjacent candidates are linked into pairs to form the search domain. Third, strategies are applied to avoid recomputing the same line. Finally, because some of the potential text lines are incorrect, a new text line descriptor is used to exclude non-text areas. Experimental results on the ICDAR 2013 competition dataset and MSRA-TD500 show that the proposed approach performs favorably on both horizontal and non-horizontal text.
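
The first three steps of the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not state the actual constraints, thresholds, or de-duplication strategies, so the function names (`link_pairs`, `line_through`, `detect_lines`), the parameters (`max_gap`, `max_height_ratio`, `dist_tol`, `min_members`), and the use of region centers and heights as the only features are all assumptions made for illustration. MSER extraction and the final text line descriptor are omitted.

```python
import math
from itertools import combinations


def link_pairs(centers, heights, max_gap=3.0, max_height_ratio=2.0):
    """Link candidate regions into adjacent pairs.

    The two constraints (distance relative to height, similar height)
    are hypothetical stand-ins for the paper's graph-model constraints.
    """
    pairs = []
    for i, j in combinations(range(len(centers)), 2):
        (x1, y1), (x2, y2) = centers[i], centers[j]
        dist = math.hypot(x2 - x1, y2 - y1)
        h_small, h_large = sorted((heights[i], heights[j]))
        if dist <= max_gap * h_large and h_large <= max_height_ratio * h_small:
            pairs.append((i, j))
    return pairs


def line_through(p, q):
    """Hough-style normal form of the line through p and q.

    Returns (theta, rho) with x*cos(theta) + y*sin(theta) = rho
    and theta folded into [0, pi).
    """
    (x1, y1), (x2, y2) = p, q
    theta = math.atan2(x2 - x1, -(y2 - y1)) % math.pi
    rho = x1 * math.cos(theta) + y1 * math.sin(theta)
    return theta, rho


def detect_lines(centers, heights, dist_tol=0.5, min_members=3):
    """Search only the lines defined by linked pairs, skipping pairs
    already explained by an accepted line (assumed strategy for
    avoiding repeated calculation of the same line)."""
    done = set()
    lines = []
    for i, j in link_pairs(centers, heights):
        if (i, j) in done:
            continue
        theta, rho = line_through(centers[i], centers[j])
        c, s = math.cos(theta), math.sin(theta)
        # Collect every candidate whose center lies close to this line.
        members = [k for k, (x, y) in enumerate(centers)
                   if abs(x * c + y * s - rho) <= dist_tol * heights[k]]
        if len(members) >= min_members:
            lines.append((theta, rho, members))
            done.update(combinations(members, 2))
    return lines


# Four candidates on a diagonal plus one stray region.
centers = [(0, 0), (10, 10), (20, 20), (30, 30), (15, 40)]
heights = [10, 10, 10, 10, 10]
lines = detect_lines(centers, heights)
```

In this toy input the four diagonal candidates end up in a single line hypothesis and the stray region is left out. Restricting the search to lines defined by linked pairs, rather than voting over a full (theta, rho) accumulator, keeps the search domain small, which is the kind of saving the abstract attributes to the pair-linking step.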

Metadata

Title: An Efficient Detection Method for Text of Arbitrary Orientations in Natural Images
Authors: Lanfang Dong, Zhongdi Chao, Jianfu Wang
Copyright Year: 2017
Publisher: Springer Singapore
DOI: https://doi.org/10.1007/978-981-10-2404-7_35