nach oben

Erschienen in:

2017 | OriginalPaper | Buchkapitel

Text Detection in Low Resolution Scene Images Using Convolutional Neural Network

verfasst von : Anhar Risnumawan, Indra Adji Sulistijono, Jemal Abawajy

Erschienen in: Recent Advances on Soft Computing and Data Mining

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Text detection on scene images has increasingly gained a lot of interests, especially due to the increase of wearable devices. However, the devices often acquire low resolution images, thus making it difficult to detect text due to noise. Notable method for detection in low resolution images generally utilizes many features which are cleverly integrated and cascaded classifiers to form better discriminative system. Those methods however require a lot of hand-crafted features and manually tuned, which are difficult to achieve in practice. In this paper, we show that the notable cascaded method is equivalent to a Convolutional Neural Network (CNN) framework to deal with text detection in low resolution scene images. The CNN framework however has interesting mutual interaction between layers from which the parameters are jointly learned without requiring manual design, thus its parameters can be better optimized from training data. Experiment results show the efficiency of the method for detecting text in low resolution scene images.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Does Number of Clusters Effect the Purity and Entropy of Clustering?

Nächstes Kapitel Handling Imbalanced Data in Churn Prediction Using RUSBoost and Feature Selection (Case Study: PT.Telekomunikasi Indonesia Regional 7)

http://www.google.com/mobile/goggles/#text.

http://www.artificialvision.com/android.htm.

http://tcts.fpms.ac.be/projects/sypole/index.php?lang=en.

Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, vol. 1, pp. 886–893. IEEE (2005)

Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recogn. 37(5), 977–997 (2004)CrossRef

LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989)CrossRef

Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Intl. J. Doc. Anal. Recogn. (IJDAR) 7(2–3), 84–104 (2005)CrossRef

Mählisch, M., Oberländer, M., Löhlein, O., Gavrila, D., Ritter, W.: A multiple detector approach to low-resolution fir pedestrian recognition. In: Proceedings of the IEEE Intelligent Vehicles Symposium (IV2005), Las Vegas, NV, USA (2005)

Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: CVPR, pp. 1–8. IEEE (2008)

Mirmehdi, M., Clark, P., Lam, J.: Extracting low resolution text with an active camera for OCR. In: Spanish Symposium on Pattern Recognition and Image Processing IX, pp. 43–48 (2001)

Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: CVPR, pp. 3538–3545. IEEE (2012)

Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: ICDAR (2013)

10.

Nguyen, M.H., Kim, S.-H., Lee, G.: Recognizing text in low resolution born-digital images. In: Jeong, Y.-S., Park, Y.-H., Hsu, C.-H.R., Park, J.J.J.H. (eds.) Ubiquitous Information Technologies and Applications. LNEE, vol. 280, pp. 85–92. Springer, Heidelberg (2014). doi:10.1007/978-3-642-41671-2_12CrossRef

11.

Risnumawan, A., Chan, C.S.: Text detection via edgeless stroke width transform. In: ISPACS, pp. 336–340. IEEE (2014)

12.

Risnumawan, A., Shivakumara, P., Chan, C.S., Tan, C.L.: A robust arbitrary text detection system for natural scene images. Expert Syst. Appl. 41(18), 8027–8048 (2014)CrossRef

13.

Sahli, S., Ouyang, Y., Sheng, Y., Lavigne, D.A.: Robust vehicle detection in low-resolution aerial imagery. In: SPIE Defense, Security, and Sensing, p. 76680G. International Society for Optics and Photonics (2010)

14.

Sanketi, P., Shen, H., Coughlan, J.M.: Localizing blurry and low-resolution text in natural images. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 503–510. IEEE (2011)

15.

Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: ICCV, pp. 1457–1464. IEEE (2011)

16.

Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: ICPR, pp. 3304–3308. IEEE (2012)

17.

Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2014)CrossRef

18.

Zhang, J., Gong, S.: People detection in low-resolution video with non-stationary background. Image Vis. Comput. 27(4), 437–443 (2009)CrossRef

19.

Zhao, T., Nevatia, R.: Car detection in low resolution aerial images. Image Vis. Comput. 21(8), 693–703 (2003)CrossRef

20.

Zhu, J., Javed, O., Liu, J., Yu, Q., Cheng, H., Sawhney, H.: Pedestrian detection in low-resolution imagery by learning multi-scale intrinsic motion structures (mims). In: CVPR, pp. 3510–3517 (2014)

Titel: Text Detection in Low Resolution Scene Images Using Convolutional Neural Network
verfasst von: Anhar Risnumawan
Indra Adji Sulistijono
Jemal Abawajy
Verlag: Springer International Publishing
Buch: Recent Advances on Soft Computing and Data Mining
Print ISBN: 978-3-319-51279-2

Electronic ISBN: 978-3-319-51281-5

Copyright-Jahr: 2017
DOI: https://doi.org/10.1007/978-3-319-51281-5_37

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner