Skip to main content
Erschienen in: Pattern Recognition and Image Analysis 1/2021

01.01.2021 | APPLIED PROBLEMS

Performance Analysis of State of the Art Convolutional Neural Network Architectures in Bangla Handwritten Character Recognition

verfasst von: Tapotosh Ghosh, Min-Ha-Zul Abedin, Hasan Al Banna, Nasirul Mumenin, Mohammad Abu Yousuf

Erschienen in: Pattern Recognition and Image Analysis | Ausgabe 1/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Bangla handwritten character recognition is a popular research topic as its difficulty is higher than the recognition of other languages because of multiple formats of compound characters. State of the art Convolutional neural network (CNN) architectures are very much useful in computer vision applications. Some works have been carried out in Bangla handwritten character recognition but most of them either not very efficient or they can not classify a lot of characters. In this work, state of art pre-trained CNN architectures is used to classify 231 different Bangla handwritten characters using CMATERdb dataset. The images were first converted to B&W form with white as the foreground color. The size of the images is reduced to 28 × 28 form. These images are used as input to the CNN architectures. The weights of the state-of-the-art CNN models are kept as it was. The training learning rate was set to 0.001 and categorical cross-entropy as the error function. After 50 epochs, InceptionResNetV2 achieved the best accuracy (96.99%). DenseNet121 and InceptionNetV3 also provided remarkable recognition accuracy (96.55 and 96.20%, respectively). We also considered combination of trained InceptionResNetV2, InceptionNetV3 and DenseNet121 architectures which provided better recognition accuracy (97.69%) than other single CNN architectures but it is not feasible for using as it requires a lot of computation power and memory. The models were tested in the cases where characters look confusing to humans, but all the architectures showed equal capability in recognizing these images. Considering computational complexity, memory and capability of recognizing confused characters, InceptionResNetV2 can be said as the best performing model.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, “Going deeper with convolutions,” in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015). C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, “Going deeper with convolutions,” in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015).
2.
Zurück zum Zitat K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016). K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016).
3.
Zurück zum Zitat K. He, X. Zhang, S. Ren, and J. Sun, “Identity mappings in deep residual networks,” in European Conference on Computer Vision(ECCV) (2016). K. He, X. Zhang, S. Ren, and J. Sun, “Identity mappings in deep residual networks,” in European Conference on Computer Vision(ECCV) (2016).
4.
Zurück zum Zitat G. Huang, Z. Liu, L. V. D. Maaten, and K. Q. Weinberger, “Densely connected convolutional networks,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017). G. Huang, Z. Liu, L. V. D. Maaten, and K. Q. Weinberger, “Densely connected convolutional networks,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017).
5.
Zurück zum Zitat K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition” (2014). arXiv:1409.1556 [cs.CV] K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition” (2014). arXiv:1409.1556 [cs.CV]
6.
Zurück zum Zitat M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, and L.-C. Chen, “MobileNetV2: Inverted residuals and linear bottlenecks,” in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (2018). M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, and L.-C. Chen, “MobileNetV2: Inverted residuals and linear bottlenecks,” in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (2018).
7.
Zurück zum Zitat C. Szegedy, S. Ioffe, V. Vanhoucke, and A. A. Alemi, “InceptionV4, InceptionResNet, and the impact of residual connections on learning,” in 31st AAAI Conf. Artif. Intell. (AAAI 2017) (2017), pp. 4278–4284. C. Szegedy, S. Ioffe, V. Vanhoucke, and A. A. Alemi, “InceptionV4, InceptionResNet, and the impact of residual connections on learning,” in 31st AAAI Conf. Artif. Intell. (AAAI 2017) (2017), pp. 4278–4284.
8.
Zurück zum Zitat M. Tan and Q. V. Le, “EfficientNet: Rethinking model scaling for convolutional neural networks,” in 36th Int. Conf. Mach. Learn. (ICML 2019) (2019), pp. 10691–10700. M. Tan and Q. V. Le, “EfficientNet: Rethinking model scaling for convolutional neural networks,” in 36th Int. Conf. Mach. Learn. (ICML 2019) (2019), pp. 10691–10700.
9.
Zurück zum Zitat B. Zoph, V. Vasudevan, J. Shlens, and Q. V. Le, “Learning transferable architectures for scalable image recognition,” in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. (2018), pp. 8697–8710. B. Zoph, V. Vasudevan, J. Shlens, and Q. V. Le, “Learning transferable architectures for scalable image recognition,” in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. (2018), pp. 8697–8710.
10.
Zurück zum Zitat N. Das, K. Acharya, R. Sarkar, S. Basu, M. Kundu, and M. Nasipuri, “A benchmark data base of isolated Bangla handwritten compound characters,” IJDAR 17, 413–431 (2014).CrossRef N. Das, K. Acharya, R. Sarkar, S. Basu, M. Kundu, and M. Nasipuri, “A benchmark data base of isolated Bangla handwritten compound characters,” IJDAR 17, 413–431 (2014).CrossRef
11.
Zurück zum Zitat N. Das, B. Das, R. Sarkar, S. Basu, M. Kundu, and M. Nasipuri, “Handwritten Bangla basic and compound character recognition using MLP and SVM classifier,” J. Comput. 2 (2), 109–115 (2010). N. Das, B. Das, R. Sarkar, S. Basu, M. Kundu, and M. Nasipuri, “Handwritten Bangla basic and compound character recognition using MLP and SVM classifier,” J. Comput. 2 (2), 109–115 (2010).
12.
Zurück zum Zitat R. Sarkhel, A. K. Saha, and N. Das, “An enhanced harmony search method for Bangla handwritten character recognition using region sampling,” in Proc. 2015 IEEE 2nd Int. Conf. Recent Trends Inf. Syst. (ReTIS 2015) (2015), pp. 325–330. R. Sarkhel, A. K. Saha, and N. Das, “An enhanced harmony search method for Bangla handwritten character recognition using region sampling,” in Proc. 2015 IEEE 2nd Int. Conf. Recent Trends Inf. Syst. (ReTIS 2015) (2015), pp. 325–330.
13.
Zurück zum Zitat R. Pramanik and S. Bag, “Shape decomposition-based handwritten compound character recognition for Bangla OCR,” J. Vis. Commun. Image Represent. 50, 123–134 (2018).CrossRef R. Pramanik and S. Bag, “Shape decomposition-based handwritten compound character recognition for Bangla OCR,” J. Vis. Commun. Image Represent. 50, 123–134 (2018).CrossRef
14.
Zurück zum Zitat N. Das et al., “Recognition of handwritten Bangla basic characters and digits using convex hull-based feature set,” in Int. Conf. Artif. Intell. Pattern Recognit. 2009 (AIPR 2009) (2009), pp. 380–386. N. Das et al., “Recognition of handwritten Bangla basic characters and digits using convex hull-based feature set,” in Int. Conf. Artif. Intell. Pattern Recognit. 2009 (AIPR 2009) (2009), pp. 380–386.
15.
Zurück zum Zitat N. Das, S. Basu, R. Sarkar, M. Kundu, M. Nasipuri, and D. Kumar Basu, “An improved feature descriptor for recognition of handwritten Bangla alphabet” (2015). arXiv:1501.05497 [cs.CV] N. Das, S. Basu, R. Sarkar, M. Kundu, M. Nasipuri, and D. Kumar Basu, “An improved feature descriptor for recognition of handwritten Bangla alphabet” (2015). arXiv:1501.05497 [cs.CV]
16.
Zurück zum Zitat S. Basu, N. Das, R. Sarkar, M. Kundu, M. Nasipuri, and D. K. Basu, “A hierarchical approach to recognition of handwritten Bangla characters,” Pattern Recognit. 42 (7), 1467–1484 (2009).CrossRef S. Basu, N. Das, R. Sarkar, M. Kundu, M. Nasipuri, and D. K. Basu, “A hierarchical approach to recognition of handwritten Bangla characters,” Pattern Recognit. 42 (7), 1467–1484 (2009).CrossRef
17.
Zurück zum Zitat T. Bhowmik, P. Ghanty, A. Roy, and S. Parui, “SVM-based hierarchical architectures for handwritten Bangla character recognition,” Doc. Anal. Recognit. 12, 97–108 (2009).CrossRef T. Bhowmik, P. Ghanty, A. Roy, and S. Parui, “SVM-based hierarchical architectures for handwritten Bangla character recognition,” Doc. Anal. Recognit. 12, 97–108 (2009).CrossRef
18.
Zurück zum Zitat S. K. Parui, K. Guin, U. Bhattacharya, and B. B. Chaudhuri, “Online handwritten Bangla character recognition using HMM,” in 2008 19th International Conference on Pattern Recognition (2008), pp. 1–4. S. K. Parui, K. Guin, U. Bhattacharya, and B. B. Chaudhuri, “Online handwritten Bangla character recognition using HMM,” in 2008 19th International Conference on Pattern Recognition (2008), pp. 1–4.
19.
Zurück zum Zitat K. Roy, “Stroke-database design for online handwriting recognition in Bangla,” Int. J. Mod. Eng. Res. 2 (4), 2534–2540 (2012). K. Roy, “Stroke-database design for online handwriting recognition in Bangla,” Int. J. Mod. Eng. Res. 2 (4), 2534–2540 (2012).
20.
Zurück zum Zitat M. M. R. Sazal, S. K. Biswas, M. F. Amin, and K. Murase, “Bangla handwritten character recognition using deep belief network,” in 2013 Int. Conf. Electr. Inf. Commun. Technol. (EICT 2013) (2013), pp. 1–5. M. M. R. Sazal, S. K. Biswas, M. F. Amin, and K. Murase, “Bangla handwritten character recognition using deep belief network,” in 2013 Int. Conf. Electr. Inf. Commun. Technol. (EICT 2013) (2013), pp. 1–5.
21.
Zurück zum Zitat S. Roy, N. Das, M. Kundu, and M. Nasipuri, “Handwritten isolated Bangla compound character recognition: A new benchmark using a novel deep learning approach,” Pattern Recognit. Lett. 90, 15–21 (2017).CrossRef S. Roy, N. Das, M. Kundu, and M. Nasipuri, “Handwritten isolated Bangla compound character recognition: A new benchmark using a novel deep learning approach,” Pattern Recognit. Lett. 90, 15–21 (2017).CrossRef
22.
Zurück zum Zitat Ashiquzzaman, A. K. Tushar, S. Dutta, and F. Mohsin, “An efficient method for improving classification accuracy of handwritten Bangla compound characters using DCNN with dropout and ELU,” in Proc. 2017 3rd IEEE Int. Conf. Res. Comput. Intell. Commun. Networks (ICRCICN 2017) (2017), pp. 147–152. Ashiquzzaman, A. K. Tushar, S. Dutta, and F. Mohsin, “An efficient method for improving classification accuracy of handwritten Bangla compound characters using DCNN with dropout and ELU,” in Proc. 2017 3rd IEEE Int. Conf. Res. Comput. Intell. Commun. Networks (ICRCICN 2017) (2017), pp. 147–152.
23.
Zurück zum Zitat A. Fardous and S. Afroge, “Handwritten isolated Bangla compound character recognition,” in 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE) (Cox’s Bazar, Bangladesh, 2019), pp. 1–5. A. Fardous and S. Afroge, “Handwritten isolated Bangla compound character recognition,” in 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE) (Cox’s Bazar, Bangladesh, 2019), pp. 1–5.
24.
Zurück zum Zitat S. Saha and N. Saha, “A lightning fast approach to classify Bangla handwritten characters and numerals using newly structured deep neural network,” Procedia Comput. Sci. 132, 1760–1770 (2018).CrossRef S. Saha and N. Saha, “A lightning fast approach to classify Bangla handwritten characters and numerals using newly structured deep neural network,” Procedia Comput. Sci. 132, 1760–1770 (2018).CrossRef
25.
Zurück zum Zitat A. K. M. S. Azad Rabby, S. Haque, S. Abujar, and S. A. Hossain, “Ekushnet: Using convolutional neural network for Bangla handwritten recognition,” Procedia Comput. Sci. 143, 603–610 (2018).CrossRef A. K. M. S. Azad Rabby, S. Haque, S. Abujar, and S. A. Hossain, “Ekushnet: Using convolutional neural network for Bangla handwritten recognition,” Procedia Comput. Sci. 143, 603–610 (2018).CrossRef
26.
Zurück zum Zitat M. A. R. Alif, S. Ahmed, and M. A. Hasan, “Isolated Bangla handwritten character recognition with convolutional neural network,” in 20th Int. Conf. Comput. Inf. Technol. (ICCIT 2017) (2018), pp. 1–6. M. A. R. Alif, S. Ahmed, and M. A. Hasan, “Isolated Bangla handwritten character recognition with convolutional neural network,” in 20th Int. Conf. Comput. Inf. Technol. (ICCIT 2017) (2018), pp. 1–6.
27.
Zurück zum Zitat T. Ghosh et al., “Bangla handwritten character recognition using MobileNet V1 architecture,” Bull. Electr. Eng. Inf. 9 (6), 2547–2554 (2020). T. Ghosh et al., “Bangla handwritten character recognition using MobileNet V1 architecture,” Bull. Electr. Eng. Inf. 9 (6), 2547–2554 (2020).
28.
Zurück zum Zitat T. Ghosh, S. M. Chowdhury, M. A. Yousuf, et al., “A comprehensive review on recognition techniques for Bangla handwritten characters,” in 2019 International Conference on Bangla Speech and Language Processing (ICBSLP) (2019), pp. 1–6. T. Ghosh, S. M. Chowdhury, M. A. Yousuf, et al., “A comprehensive review on recognition techniques for Bangla handwritten characters,” in 2019 International Conference on Bangla Speech and Language Processing (ICBSLP) (2019), pp. 1–6.
Metadaten
Titel
Performance Analysis of State of the Art Convolutional Neural Network Architectures in Bangla Handwritten Character Recognition
verfasst von
Tapotosh Ghosh
Min-Ha-Zul Abedin
Hasan Al Banna
Nasirul Mumenin
Mohammad Abu Yousuf
Publikationsdatum
01.01.2021
Verlag
Pleiades Publishing
Erschienen in
Pattern Recognition and Image Analysis / Ausgabe 1/2021
Print ISSN: 1054-6618
Elektronische ISSN: 1555-6212
DOI
https://doi.org/10.1134/S1054661821010089

Weitere Artikel der Ausgabe 1/2021

Pattern Recognition and Image Analysis 1/2021 Zur Ausgabe

MATHEMATICAL THEORY OF IMAGES AND SIGNALS REPRESENTING, PROCESSING, ANALYSIS, RECOGNITION, AND UNDERSTANDING

A Hybrid Single Image Super-Resolution Technique Using Fractal Interpolation and Convolutional Neural Network