Skip to main content

2017 | OriginalPaper | Buchkapitel

Text Detection in Low Resolution Scene Images Using Convolutional Neural Network

verfasst von : Anhar Risnumawan, Indra Adji Sulistijono, Jemal Abawajy

Erschienen in: Recent Advances on Soft Computing and Data Mining

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Text detection on scene images has increasingly gained a lot of interests, especially due to the increase of wearable devices. However, the devices often acquire low resolution images, thus making it difficult to detect text due to noise. Notable method for detection in low resolution images generally utilizes many features which are cleverly integrated and cascaded classifiers to form better discriminative system. Those methods however require a lot of hand-crafted features and manually tuned, which are difficult to achieve in practice. In this paper, we show that the notable cascaded method is equivalent to a Convolutional Neural Network (CNN) framework to deal with text detection in low resolution scene images. The CNN framework however has interesting mutual interaction between layers from which the parameters are jointly learned without requiring manual design, thus its parameters can be better optimized from training data. Experiment results show the efficiency of the method for detecting text in low resolution scene images.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, vol. 1, pp. 886–893. IEEE (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, vol. 1, pp. 886–893. IEEE (2005)
2.
Zurück zum Zitat Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recogn. 37(5), 977–997 (2004)CrossRef Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recogn. 37(5), 977–997 (2004)CrossRef
3.
Zurück zum Zitat LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989)CrossRef LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989)CrossRef
4.
Zurück zum Zitat Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Intl. J. Doc. Anal. Recogn. (IJDAR) 7(2–3), 84–104 (2005)CrossRef Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Intl. J. Doc. Anal. Recogn. (IJDAR) 7(2–3), 84–104 (2005)CrossRef
5.
Zurück zum Zitat Mählisch, M., Oberländer, M., Löhlein, O., Gavrila, D., Ritter, W.: A multiple detector approach to low-resolution fir pedestrian recognition. In: Proceedings of the IEEE Intelligent Vehicles Symposium (IV2005), Las Vegas, NV, USA (2005) Mählisch, M., Oberländer, M., Löhlein, O., Gavrila, D., Ritter, W.: A multiple detector approach to low-resolution fir pedestrian recognition. In: Proceedings of the IEEE Intelligent Vehicles Symposium (IV2005), Las Vegas, NV, USA (2005)
6.
Zurück zum Zitat Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: CVPR, pp. 1–8. IEEE (2008) Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: CVPR, pp. 1–8. IEEE (2008)
7.
Zurück zum Zitat Mirmehdi, M., Clark, P., Lam, J.: Extracting low resolution text with an active camera for OCR. In: Spanish Symposium on Pattern Recognition and Image Processing IX, pp. 43–48 (2001) Mirmehdi, M., Clark, P., Lam, J.: Extracting low resolution text with an active camera for OCR. In: Spanish Symposium on Pattern Recognition and Image Processing IX, pp. 43–48 (2001)
8.
Zurück zum Zitat Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: CVPR, pp. 3538–3545. IEEE (2012) Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: CVPR, pp. 3538–3545. IEEE (2012)
9.
Zurück zum Zitat Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: ICDAR (2013) Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: ICDAR (2013)
10.
Zurück zum Zitat Nguyen, M.H., Kim, S.-H., Lee, G.: Recognizing text in low resolution born-digital images. In: Jeong, Y.-S., Park, Y.-H., Hsu, C.-H.R., Park, J.J.J.H. (eds.) Ubiquitous Information Technologies and Applications. LNEE, vol. 280, pp. 85–92. Springer, Heidelberg (2014). doi:10.1007/978-3-642-41671-2_12CrossRef Nguyen, M.H., Kim, S.-H., Lee, G.: Recognizing text in low resolution born-digital images. In: Jeong, Y.-S., Park, Y.-H., Hsu, C.-H.R., Park, J.J.J.H. (eds.) Ubiquitous Information Technologies and Applications. LNEE, vol. 280, pp. 85–92. Springer, Heidelberg (2014). doi:10.​1007/​978-3-642-41671-2_​12CrossRef
11.
Zurück zum Zitat Risnumawan, A., Chan, C.S.: Text detection via edgeless stroke width transform. In: ISPACS, pp. 336–340. IEEE (2014) Risnumawan, A., Chan, C.S.: Text detection via edgeless stroke width transform. In: ISPACS, pp. 336–340. IEEE (2014)
12.
Zurück zum Zitat Risnumawan, A., Shivakumara, P., Chan, C.S., Tan, C.L.: A robust arbitrary text detection system for natural scene images. Expert Syst. Appl. 41(18), 8027–8048 (2014)CrossRef Risnumawan, A., Shivakumara, P., Chan, C.S., Tan, C.L.: A robust arbitrary text detection system for natural scene images. Expert Syst. Appl. 41(18), 8027–8048 (2014)CrossRef
13.
Zurück zum Zitat Sahli, S., Ouyang, Y., Sheng, Y., Lavigne, D.A.: Robust vehicle detection in low-resolution aerial imagery. In: SPIE Defense, Security, and Sensing, p. 76680G. International Society for Optics and Photonics (2010) Sahli, S., Ouyang, Y., Sheng, Y., Lavigne, D.A.: Robust vehicle detection in low-resolution aerial imagery. In: SPIE Defense, Security, and Sensing, p. 76680G. International Society for Optics and Photonics (2010)
14.
Zurück zum Zitat Sanketi, P., Shen, H., Coughlan, J.M.: Localizing blurry and low-resolution text in natural images. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 503–510. IEEE (2011) Sanketi, P., Shen, H., Coughlan, J.M.: Localizing blurry and low-resolution text in natural images. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 503–510. IEEE (2011)
15.
Zurück zum Zitat Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: ICCV, pp. 1457–1464. IEEE (2011) Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: ICCV, pp. 1457–1464. IEEE (2011)
16.
Zurück zum Zitat Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: ICPR, pp. 3304–3308. IEEE (2012) Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: ICPR, pp. 3304–3308. IEEE (2012)
17.
Zurück zum Zitat Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2014)CrossRef Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2014)CrossRef
18.
Zurück zum Zitat Zhang, J., Gong, S.: People detection in low-resolution video with non-stationary background. Image Vis. Comput. 27(4), 437–443 (2009)CrossRef Zhang, J., Gong, S.: People detection in low-resolution video with non-stationary background. Image Vis. Comput. 27(4), 437–443 (2009)CrossRef
19.
Zurück zum Zitat Zhao, T., Nevatia, R.: Car detection in low resolution aerial images. Image Vis. Comput. 21(8), 693–703 (2003)CrossRef Zhao, T., Nevatia, R.: Car detection in low resolution aerial images. Image Vis. Comput. 21(8), 693–703 (2003)CrossRef
20.
Zurück zum Zitat Zhu, J., Javed, O., Liu, J., Yu, Q., Cheng, H., Sawhney, H.: Pedestrian detection in low-resolution imagery by learning multi-scale intrinsic motion structures (mims). In: CVPR, pp. 3510–3517 (2014) Zhu, J., Javed, O., Liu, J., Yu, Q., Cheng, H., Sawhney, H.: Pedestrian detection in low-resolution imagery by learning multi-scale intrinsic motion structures (mims). In: CVPR, pp. 3510–3517 (2014)
Metadaten
Titel
Text Detection in Low Resolution Scene Images Using Convolutional Neural Network
verfasst von
Anhar Risnumawan
Indra Adji Sulistijono
Jemal Abawajy
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-51281-5_37

Premium Partner