Top

Published in:

2017 | OriginalPaper | Chapter

Text Detection in Low Resolution Scene Images Using Convolutional Neural Network

Authors : Anhar Risnumawan, Indra Adji Sulistijono, Jemal Abawajy

Published in: Recent Advances on Soft Computing and Data Mining

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Text detection on scene images has increasingly gained a lot of interests, especially due to the increase of wearable devices. However, the devices often acquire low resolution images, thus making it difficult to detect text due to noise. Notable method for detection in low resolution images generally utilizes many features which are cleverly integrated and cascaded classifiers to form better discriminative system. Those methods however require a lot of hand-crafted features and manually tuned, which are difficult to achieve in practice. In this paper, we show that the notable cascaded method is equivalent to a Convolutional Neural Network (CNN) framework to deal with text detection in low resolution scene images. The CNN framework however has interesting mutual interaction between layers from which the parameters are jointly learned without requiring manual design, thus its parameters can be better optimized from training data. Experiment results show the efficiency of the method for detecting text in low resolution scene images.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

previous chapter Does Number of Clusters Effect the Purity and Entropy of Clustering?

next chapter Handling Imbalanced Data in Churn Prediction Using RUSBoost and Feature Selection (Case Study: PT.Telekomunikasi Indonesia Regional 7)

http://www.google.com/mobile/goggles/#text.

http://www.artificialvision.com/android.htm.

http://tcts.fpms.ac.be/projects/sypole/index.php?lang=en.

Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, vol. 1, pp. 886–893. IEEE (2005)

Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recogn. 37(5), 977–997 (2004)CrossRef

LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989)CrossRef

Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Intl. J. Doc. Anal. Recogn. (IJDAR) 7(2–3), 84–104 (2005)CrossRef

Mählisch, M., Oberländer, M., Löhlein, O., Gavrila, D., Ritter, W.: A multiple detector approach to low-resolution fir pedestrian recognition. In: Proceedings of the IEEE Intelligent Vehicles Symposium (IV2005), Las Vegas, NV, USA (2005)

Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: CVPR, pp. 1–8. IEEE (2008)

Mirmehdi, M., Clark, P., Lam, J.: Extracting low resolution text with an active camera for OCR. In: Spanish Symposium on Pattern Recognition and Image Processing IX, pp. 43–48 (2001)

Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: CVPR, pp. 3538–3545. IEEE (2012)

Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: ICDAR (2013)

10.

Nguyen, M.H., Kim, S.-H., Lee, G.: Recognizing text in low resolution born-digital images. In: Jeong, Y.-S., Park, Y.-H., Hsu, C.-H.R., Park, J.J.J.H. (eds.) Ubiquitous Information Technologies and Applications. LNEE, vol. 280, pp. 85–92. Springer, Heidelberg (2014). doi:10.1007/978-3-642-41671-2_12CrossRef

11.

Risnumawan, A., Chan, C.S.: Text detection via edgeless stroke width transform. In: ISPACS, pp. 336–340. IEEE (2014)

12.

Risnumawan, A., Shivakumara, P., Chan, C.S., Tan, C.L.: A robust arbitrary text detection system for natural scene images. Expert Syst. Appl. 41(18), 8027–8048 (2014)CrossRef

13.

Sahli, S., Ouyang, Y., Sheng, Y., Lavigne, D.A.: Robust vehicle detection in low-resolution aerial imagery. In: SPIE Defense, Security, and Sensing, p. 76680G. International Society for Optics and Photonics (2010)

14.

Sanketi, P., Shen, H., Coughlan, J.M.: Localizing blurry and low-resolution text in natural images. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 503–510. IEEE (2011)

15.

Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: ICCV, pp. 1457–1464. IEEE (2011)

16.

Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: ICPR, pp. 3304–3308. IEEE (2012)

17.

Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2014)CrossRef

18.

Zhang, J., Gong, S.: People detection in low-resolution video with non-stationary background. Image Vis. Comput. 27(4), 437–443 (2009)CrossRef

19.

Zhao, T., Nevatia, R.: Car detection in low resolution aerial images. Image Vis. Comput. 21(8), 693–703 (2003)CrossRef

20.

Zhu, J., Javed, O., Liu, J., Yu, Q., Cheng, H., Sawhney, H.: Pedestrian detection in low-resolution imagery by learning multi-scale intrinsic motion structures (mims). In: CVPR, pp. 3510–3517 (2014)

Title: Text Detection in Low Resolution Scene Images Using Convolutional Neural Network
Authors: Anhar Risnumawan
Indra Adji Sulistijono
Jemal Abawajy
Publisher: Springer International Publishing
Book: Recent Advances on Soft Computing and Data Mining
Print ISBN: 978-3-319-51279-2

Electronic ISBN: 978-3-319-51281-5

Copyright Year: 2017
DOI: https://doi.org/10.1007/978-3-319-51281-5_37

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner