ABSTRACT
Handwritten Bangla digit recognition is one of the most challenging computer vision problems due to its diverse shapes and writing style. Recently deep learning based convolutional neural network known as deep CNN finds wide-spread applications in recognizing different objects due to its high accuracy. This paper investigates the performance of some state-of-the-art deep CNN techniques for the recognition of handwritten digits. It considers four deep CNN architectures, such as AlexNet, MobileNet, GoogLeNet (Inception V3), and CapsuleNet models. These four deep CNNs have been experimented on a large, unbiased and highly augmented standard dataset, NumtaDB and confirmed that the AlexNet showed the best performance on the basis of accuracy and computation time.
- M. M. Rahman, M. A. H. Akhand, S. Islam and P. C. Shill (2015). Bangla Handwritten Character Recognition using Convolutional Neural Network. I. J. Image, Graphics and Signal Processing, 8, 42--49.Google ScholarCross Ref
- Y. Qiao. The Mnist Database of handwritten digits. Available online: http://www. gavo.t.u-tokyo.ac.jp/~qiao/database.htmlGoogle Scholar
- U. Bhattacharya, M. Shridhar, S. K. Parui, P. K. Sen and B. B. Chaudhuri (2012). Offline recognition of handwritten Bangla characters: an efficient two-stage approach. Pattern Analysis and Applications, 15(4), 445--458.Google ScholarDigital Library
- B. B. Chaudhuri and U. Pal (1998). A complete printed Bangla OCR system. Pattern Recognition, 31(5), 531--549.Google ScholarCross Ref
- U. Pal (1997). On the development of an optical character recognition (OCR) system for printed Bangla script. Ph.D. Thesis. Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, India.Google ScholarDigital Library
- U. Pal and B. B. Chaudhuri (2004). Indian script character recognition: a survey. Pattern Recognition, 37(9), 1887--1899.Google ScholarCross Ref
- U. Pal and B. B. Chaudhuri (2000). Automatic recognition of unconstrained offline Bangla hand-written numerals. In: T. Tan, Y. Shi, W. Gao (eds) Advances in Multimodal Interfaces, ICMI 2000. Lecture Notes in Computer Science, 1948, 371--378.Google Scholar
- K. Roy, S. Vajda, U. Pal and B. B. Chaudhuri (2004). A system towards Indian postal automation. 9th International Workshop on Frontiers in Handwritten Recognition (IWFHR-9), Kokubunji, Tokyo, Japan, 580--585.Google ScholarDigital Library
- U. Pal, A. Belaid and C. Choisy (2003). Touching numeral segmentation using water reservoir concept. Pattern Recognition Letters, 24(1-3), 261--272.Google ScholarDigital Library
- S. M. A. Sharif, N. Mohammed, N. Mansoor and S. Momen (2016). A hybrid deep model with HOG features for Bangla handwritten numeral classification. 9th International Conference on Electrical and Computer Engineering (ICECE), Dhaka, Bangladesh, 463--466.Google ScholarCross Ref
- U. Bhattacharya. Off-Line Handwritten Bangla Numeral Database. Available online: http://www.isical.ac.in/~ujjwalGoogle Scholar
- N. Das, R. Sarkar, S. Basu, M. Kundu, M. Nasipuri and D. K. Basu (2012). A genetic algorithm based region sampling for selection of local features in handwritten digit recognition application. Applied Soft Computing, 12(5), 1592--1606.Google ScholarDigital Library
- M. Shopon, N. Mohammed and M. A. Abedin (2016). Bangla handwritten digit recognition using auto encoder and deep convolutional neural network. International Workshop on Computational Intelligence (IWCI), Dhaka, Bangladesh, 64--68.Google Scholar
- A. S. A. Rabby, S. Abujar, S. Haque and S. A. Hossain (2019). Bangla Handwritten Digit Recognition Using Convolutional Neural Network. In: A. Abraham, P. Dutta, J. Mandal, A. Bhattacharya, S. Dutta (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, 755, 111--122.Google ScholarCross Ref
- M. Biswas, R. Islam, G. K. Shom, M. Shopon, N. Mohammed, S. Momen and A. Abedin (2017). BanglaLekha-Isolated: A multi-purpose comprehensive dataset of Handwritten Bangla Isolated characters. Data in Brief, 12, 103--107.Google ScholarCross Ref
- M. Z. Alom, P. Sidike, T. M. Taha and V. K. Asari (2017). Handwritten Bangla Digit Recognition Using Deep Learning. Available online: http://arxiv.org/abs/1705.02680Google Scholar
- M. Z. Alom, P. Sidike, M. Hasan, T. M. Taha and V. K. Asari (2018). Handwritten Bangla Character Recognition Using the State-of-the-Art Deep Convolutional Neural Networks. Computational Intelligence and Neuroscience, 2018, Article ID 6747098, 13 pages.Google ScholarDigital Library
- M. Z. Alom, T. M. Taha, C. Yakopcic, S. Westberg, P. Sidike, M.S. Nasrin, B. C. V. Esesn, A. A. S. Awwal and V. K. Asari (2018). The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches. Available online: http://arxiv.org/abs/1803.01164Google Scholar
- M. A. H. Akhand, M. Ahmed and M. M. H. Rahman (2016). Convolutional neural network training with artificial pattern for Bangla handwritten numeral recognition. 5th International Conference on Informatics, Electronics and Vision (ICIEV), Dhaka, Bangladesh, 625--630.Google ScholarCross Ref
- S. Saha and N. Saha (2018). A Lightning fast approach to classify Bangla Handwritten Characters and Numerals using newly structured Deep Neural Network. Procedia Computer Science, 132, 1760--1770.Google ScholarDigital Library
- S. Alam, T. Reasat, R. M. Doha and A. I. Humayun (2018). NumtaDB - Assembled Bengali Handwritten Digits. Available online: http://arxiv.org/abs/1806.02452Google Scholar
- A. Shawon, M. J. Rahman, F. Mahmud and M. M. A. Zaman (2018). Bangla Handwritten Digit Recognition Using Deep CNN for Large and Unbiased Dataset. International Conference on Bangla Speech and Language Processing (ICBSLP). Sylhet, Bangladesh, 1--6.Google Scholar
- H. Zunair, N. Mohammed and S. Momen (2018). Unconventional Wisdom: A New Transfer Learning Approach Applied to Bengali Numeral Classification. International Conference on Bangla Speech and Language Processing (ICBSLP), Sylhet, Bangladesh, 1--6.Google ScholarCross Ref
- J. Deng, W. Dong, R. Socher, L. Li, Kai. Li and Li. Fei-Fei (2009). ImageNet: A large-scale hierarchical image database. IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 248--255.Google Scholar
- A. Gelbukh (2005). Natural language processing. 5th International Conference on Hybrid Intelligent Systems (HIS'05), Rio de Janeiro, Brazil, 1.Google ScholarDigital Library
- K. Fukushima (1980). A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics, 36(4), 193--202.Google ScholarCross Ref
- Y. Lecun, L. Bottou, Y. Bengio and P. Haffner (1998). Gradient-based learning applied to document recognition. In Proceedings of the IEEE, 86(11), 2278--2324.Google ScholarCross Ref
- K. O'Shea and R. Nash (2015). An Introduction to Convolutional Neural Networks. Available online: http://arxiv.org/abs/1511.08458Google Scholar
- S. Albawi, T. A. Mohammed and S. Al-Zawi (2017). Understanding of a convolutional neural network. International Conference on Engineering and Technology (ICET), Antalya, Turkey, 1--6.Google ScholarCross Ref
- R. B. Arif, M. A. B. Siddique, M. M. R. Khan and M. R. Oishe (2018). Study and Observation of the Variations of Accuracies for Handwritten Digits Recognition with Various Hidden Layers and Epochs using Convolutional Neural Network. 4th International Conference on Electrical Engineering and Information & Communication Technology (iCEEiCT), Dhaka, Bangladesh, 112--117.Google ScholarCross Ref
- V. Ayumi, L. M. R. Rere, M. I. Fanany and A. M. Arymurthy (2016). Optimization of Convolutional Neural network using Microcanonical Annealing Algorithm. International Conference on Advanced Computer Science and Information Systems (ICACSIS), Malang, Indonesia, 506--511.Google ScholarCross Ref
- A. Krizhevsky, I. Sutskever and G. E. Hinton (2012). ImageNet Classification with Deep Convolutional Neural Networks. Advances in Neural Information Processing Systems, 25, 1097--1105.Google ScholarDigital Library
- A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto and H. Adam (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. Available online: http://arxiv.org/abs/1704.04861Google Scholar
- C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke and A. Rabinovich (2015). Going Deeper With Convolutions. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 1--9.Google Scholar
- B. Mandal, S. Dubey, S. Ghosh, R. Sarkhel and N. Das (2018). Handwritten Indic Character Recognition using Capsule Networks. IEEE Applied Signal Processing Conference (ASPCON), Kolkata, India, 304--308.Google ScholarCross Ref
Index Terms
- Bangla Handwritten Digit Recognition Using Deep Convolutional Neural Network
Recommendations
A Novel Deep Convolutional Neural Network Structure for Off-line Handwritten Digit Recognition
ICBDT '19: Proceedings of the 2nd International Conference on Big Data TechnologiesHandwritten digit recognition is widely used nowadays because traditional methods require the extraction of feature vectors from the original data by hand. The deep convolutional neural network (DCNN) can solve this problem efficiently, and the ...
Isolated Bangla Handwritten Character Classification using Transfer Learning
ICCA '22: Proceedings of the 2nd International Conference on Computing AdvancementsBangla language consists of fifty distinct characters and many compound characters to be named. Several notable studies have been performed to recognize the Bangla characters both as handwritten and optical characters. Our approach is to use transfer ...
Occluded offline handwritten Chinese character recognition using deep convolutional generative adversarial network and improved GoogLeNet
AbstractIn this paper, we propose a novel method for recognizing occluded offline handwritten Chinese characters based on deep convolutional generative adversarial network (DCGAN) and improved GoogLeNet. Different from previous methods, our proposed ...
Comments