2020 | OriginalPaper | Chapter

Extraction and Recognition of Bangla Texts from Natural Scene Images Using CNN

Authors: Rashedul Islam, Md Rafiqul Islam, Kamrul Hasan Talukder

Published in: Image and Signal Processing

Publisher: Springer International Publishing

Abstract

The semantic information present in scene images can be useful to viewers who are searching for a specific location, shop, or address. This type of information can also be useful in license plate detection, vehicle control on the road, robot navigation, and assistance for visually impaired persons. This paper presents an efficient method to detect and extract Bangla texts from scene images based on a connected component approach combined with rule-based filtering and a vertical scanning scheme. The extracted characters are then recognized using a Convolutional Neural Network (CNN). The method consists of four consecutive steps: detection and extraction of the Region of Interest (ROI), segmentation of words, extraction of characters, and recognition of the extracted characters. After the ROI is extracted from the input image, connected component (CC) analysis and a bounding box technique are used to segment Bangla words. To separate and extract Bangla characters from the segmented words, a vertical scanning method with a dynamic threshold value is applied. Finally, character recognition is carried out using the CNN. The proposed algorithm was applied to 600 scene images of different writing styles and colors, achieving 89.25% accuracy in text detection and 94.50% accuracy in character extraction. Recognition accuracy is 99.30% for Bangla digits and 95.76% for Bangla characters; for digits and characters combined, it is 95.39%.
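
The word segmentation and character extraction steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the size thresholds in the rule-based filter, and the choice of the thinnest column as the dynamic threshold (a stand-in for the thickness of the headline stroke that joins Bangla characters) are all assumptions made for illustration. The ROI detection and CNN recognition steps are omitted.

```python
from collections import deque

# Tiny hand-made binary "word": a headline row joining two character blobs,
# plus one isolated noise pixel. 1 = ink, 0 = background. (Illustrative data.)
SAMPLE = [
    "0000000000000",
    "0111111111110",
    "0111100011100",
    "0100100010100",
    "0111100011100",
    "0000000000000",
    "0000000000000",
    "0000000000001",
]


def to_grid(rows):
    """Convert rows of '0'/'1' characters into a list of integer lists."""
    return [[int(ch) for ch in row] for row in rows]


def connected_components(img):
    """Return bounding boxes (top, left, bottom, right) of 8-connected ink regions."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for r in range(h):
        for c in range(w):
            if img[r][c] != 1 or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:  # breadth-first flood fill of one component
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def filter_boxes(boxes, min_height=3, min_width=3):
    """Rule-based filter: drop boxes too small to be text (assumed thresholds)."""
    return [b for b in boxes
            if b[2] - b[0] + 1 >= min_height and b[3] - b[1] + 1 >= min_width]


def split_characters(img, box):
    """Vertical scan of a word box; return (first_col, last_col) per character."""
    top, left, bottom, right = box
    counts = [sum(img[y][x] for y in range(top, bottom + 1))
              for x in range(left, right + 1)]
    # Assumed dynamic threshold: the thinnest column approximates the headline.
    threshold = min(counts)
    spans, start = [], None
    for i, n in enumerate(counts):
        if n > threshold and start is None:
            start = i  # a character begins
        elif n <= threshold and start is not None:
            spans.append((left + start, left + i - 1))  # a character ends
            start = None
    if start is not None:
        spans.append((left + start, right))
    return spans or [(left, right)]  # no gap found: treat box as one character


if __name__ == "__main__":
    grid = to_grid(SAMPLE)
    for word_box in filter_boxes(connected_components(grid)):
        print(word_box, split_characters(grid, word_box))
```

On the sample grid, the noise pixel is discarded by the size filter, and the remaining word box is split into two character spans covering columns 1 to 4 and 8 to 10. In a real pipeline each span would be cropped, resized, and passed to the CNN classifier.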

Metadata

Title: Extraction and Recognition of Bangla Texts from Natural Scene Images Using CNN
Authors: Rashedul Islam, Md Rafiqul Islam, Kamrul Hasan Talukder
Copyright Year: 2020
DOI: https://doi.org/10.1007/978-3-030-51935-3_26
