Skip to main content
Top

2017 | OriginalPaper | Chapter

Text Detection in Low Resolution Scene Images Using Convolutional Neural Network

Authors : Anhar Risnumawan, Indra Adji Sulistijono, Jemal Abawajy

Published in: Recent Advances on Soft Computing and Data Mining

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Text detection on scene images has increasingly gained a lot of interests, especially due to the increase of wearable devices. However, the devices often acquire low resolution images, thus making it difficult to detect text due to noise. Notable method for detection in low resolution images generally utilizes many features which are cleverly integrated and cascaded classifiers to form better discriminative system. Those methods however require a lot of hand-crafted features and manually tuned, which are difficult to achieve in practice. In this paper, we show that the notable cascaded method is equivalent to a Convolutional Neural Network (CNN) framework to deal with text detection in low resolution scene images. The CNN framework however has interesting mutual interaction between layers from which the parameters are jointly learned without requiring manual design, thus its parameters can be better optimized from training data. Experiment results show the efficiency of the method for detecting text in low resolution scene images.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, vol. 1, pp. 886–893. IEEE (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, vol. 1, pp. 886–893. IEEE (2005)
2.
go back to reference Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recogn. 37(5), 977–997 (2004)CrossRef Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recogn. 37(5), 977–997 (2004)CrossRef
3.
go back to reference LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989)CrossRef LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989)CrossRef
4.
go back to reference Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Intl. J. Doc. Anal. Recogn. (IJDAR) 7(2–3), 84–104 (2005)CrossRef Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Intl. J. Doc. Anal. Recogn. (IJDAR) 7(2–3), 84–104 (2005)CrossRef
5.
go back to reference Mählisch, M., Oberländer, M., Löhlein, O., Gavrila, D., Ritter, W.: A multiple detector approach to low-resolution fir pedestrian recognition. In: Proceedings of the IEEE Intelligent Vehicles Symposium (IV2005), Las Vegas, NV, USA (2005) Mählisch, M., Oberländer, M., Löhlein, O., Gavrila, D., Ritter, W.: A multiple detector approach to low-resolution fir pedestrian recognition. In: Proceedings of the IEEE Intelligent Vehicles Symposium (IV2005), Las Vegas, NV, USA (2005)
6.
go back to reference Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: CVPR, pp. 1–8. IEEE (2008) Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: CVPR, pp. 1–8. IEEE (2008)
7.
go back to reference Mirmehdi, M., Clark, P., Lam, J.: Extracting low resolution text with an active camera for OCR. In: Spanish Symposium on Pattern Recognition and Image Processing IX, pp. 43–48 (2001) Mirmehdi, M., Clark, P., Lam, J.: Extracting low resolution text with an active camera for OCR. In: Spanish Symposium on Pattern Recognition and Image Processing IX, pp. 43–48 (2001)
8.
go back to reference Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: CVPR, pp. 3538–3545. IEEE (2012) Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: CVPR, pp. 3538–3545. IEEE (2012)
9.
go back to reference Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: ICDAR (2013) Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: ICDAR (2013)
10.
go back to reference Nguyen, M.H., Kim, S.-H., Lee, G.: Recognizing text in low resolution born-digital images. In: Jeong, Y.-S., Park, Y.-H., Hsu, C.-H.R., Park, J.J.J.H. (eds.) Ubiquitous Information Technologies and Applications. LNEE, vol. 280, pp. 85–92. Springer, Heidelberg (2014). doi:10.1007/978-3-642-41671-2_12CrossRef Nguyen, M.H., Kim, S.-H., Lee, G.: Recognizing text in low resolution born-digital images. In: Jeong, Y.-S., Park, Y.-H., Hsu, C.-H.R., Park, J.J.J.H. (eds.) Ubiquitous Information Technologies and Applications. LNEE, vol. 280, pp. 85–92. Springer, Heidelberg (2014). doi:10.​1007/​978-3-642-41671-2_​12CrossRef
11.
go back to reference Risnumawan, A., Chan, C.S.: Text detection via edgeless stroke width transform. In: ISPACS, pp. 336–340. IEEE (2014) Risnumawan, A., Chan, C.S.: Text detection via edgeless stroke width transform. In: ISPACS, pp. 336–340. IEEE (2014)
12.
go back to reference Risnumawan, A., Shivakumara, P., Chan, C.S., Tan, C.L.: A robust arbitrary text detection system for natural scene images. Expert Syst. Appl. 41(18), 8027–8048 (2014)CrossRef Risnumawan, A., Shivakumara, P., Chan, C.S., Tan, C.L.: A robust arbitrary text detection system for natural scene images. Expert Syst. Appl. 41(18), 8027–8048 (2014)CrossRef
13.
go back to reference Sahli, S., Ouyang, Y., Sheng, Y., Lavigne, D.A.: Robust vehicle detection in low-resolution aerial imagery. In: SPIE Defense, Security, and Sensing, p. 76680G. International Society for Optics and Photonics (2010) Sahli, S., Ouyang, Y., Sheng, Y., Lavigne, D.A.: Robust vehicle detection in low-resolution aerial imagery. In: SPIE Defense, Security, and Sensing, p. 76680G. International Society for Optics and Photonics (2010)
14.
go back to reference Sanketi, P., Shen, H., Coughlan, J.M.: Localizing blurry and low-resolution text in natural images. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 503–510. IEEE (2011) Sanketi, P., Shen, H., Coughlan, J.M.: Localizing blurry and low-resolution text in natural images. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 503–510. IEEE (2011)
15.
go back to reference Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: ICCV, pp. 1457–1464. IEEE (2011) Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: ICCV, pp. 1457–1464. IEEE (2011)
16.
go back to reference Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: ICPR, pp. 3304–3308. IEEE (2012) Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: ICPR, pp. 3304–3308. IEEE (2012)
17.
go back to reference Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2014)CrossRef Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2014)CrossRef
18.
go back to reference Zhang, J., Gong, S.: People detection in low-resolution video with non-stationary background. Image Vis. Comput. 27(4), 437–443 (2009)CrossRef Zhang, J., Gong, S.: People detection in low-resolution video with non-stationary background. Image Vis. Comput. 27(4), 437–443 (2009)CrossRef
19.
go back to reference Zhao, T., Nevatia, R.: Car detection in low resolution aerial images. Image Vis. Comput. 21(8), 693–703 (2003)CrossRef Zhao, T., Nevatia, R.: Car detection in low resolution aerial images. Image Vis. Comput. 21(8), 693–703 (2003)CrossRef
20.
go back to reference Zhu, J., Javed, O., Liu, J., Yu, Q., Cheng, H., Sawhney, H.: Pedestrian detection in low-resolution imagery by learning multi-scale intrinsic motion structures (mims). In: CVPR, pp. 3510–3517 (2014) Zhu, J., Javed, O., Liu, J., Yu, Q., Cheng, H., Sawhney, H.: Pedestrian detection in low-resolution imagery by learning multi-scale intrinsic motion structures (mims). In: CVPR, pp. 3510–3517 (2014)
Metadata
Title
Text Detection in Low Resolution Scene Images Using Convolutional Neural Network
Authors
Anhar Risnumawan
Indra Adji Sulistijono
Jemal Abawajy
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-51281-5_37

Premium Partner