Published in: Artificial Intelligence Review 3/2020

11-06-2019

Performance evaluation of classifiers for the recognition of offline handwritten Gurmukhi characters and numerals: a study

Authors: Munish Kumar, M. K. Jindal, R. K. Sharma, Simpel Rani Jindal

Abstract

Classification assigns patterns to one of a number of classes using statistical properties and artificial intelligence techniques. It is considered one of the important problems in application development and efficient data analysis. Because they adapt through learning and can handle complex computations, classifiers are well suited to pattern recognition problems. This paper presents a comparative study of several classifiers and the results they achieve in recognizing offline handwritten Gurmukhi characters and numerals. The classifiers evaluated are k-nearest neighbors, linear support vector machine (SVM), RBF-SVM, Naive Bayes, decision tree, convolutional neural network and random forest. For the experimental work, the authors used a balanced data set of 13,000 samples comprising 7000 characters and 6000 numerals. To assess the performance of the classifiers, the authors used the Waikato Environment for Knowledge Analysis (WEKA), an open source tool for machine learning. Performance is assessed on several parameters: accuracy rate, size of the data set, time taken to train the model, false acceptance rate, false rejection rate and area under the receiver operating characteristic curve. The paper also compares the correctness of the test results obtained with the selected classifiers. The experimental results indicate that the classifiers considered in this study have complementary strengths and should be combined in a hybrid manner to achieve higher accuracy rates. From the comparison and analysis of the experiments, it is concluded that the random forest classifier performs better than the other recently used classifiers for recognizing offline handwritten Gurmukhi characters and numerals, with a recognition accuracy of 87.9% on the 13,000 samples.
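
The abstract names accuracy rate, false acceptance rate (FAR) and false rejection rate (FRR) among the evaluation parameters but does not define them on this page. As a minimal sketch, assuming the common one-vs-rest definitions FAR = FP / (FP + TN) and FRR = FN / (FN + TP), which may differ from what the authors computed in WEKA, the Python function below derives these values from lists of true and predicted labels. The function name `evaluate` and the toy labels are illustrative and are not taken from the paper.

```python
from collections import Counter


def evaluate(y_true, y_pred):
    """Return (accuracy, far, frr) for a multi-class prediction.

    far and frr are dicts keyed by class label, computed one-vs-rest.
    Assumed definitions (not confirmed by the paper):
      FAR = FP / (FP + TN), FRR = FN / (FN + TP).
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equally long")

    n = len(y_true)
    labels = set(y_true) | set(y_pred)
    pairs = Counter(zip(y_true, y_pred))  # (true, predicted) -> count
    true_count = Counter(y_true)          # samples per true class
    pred_count = Counter(y_pred)          # samples per predicted class

    correct = sum(pairs[(c, c)] for c in labels)
    far, frr = {}, {}
    for c in labels:
        tp = pairs[(c, c)]
        fn = true_count[c] - tp           # class c samples that were rejected
        fp = pred_count[c] - tp           # other samples accepted as class c
        tn = n - tp - fn - fp
        far[c] = fp / (fp + tn) if fp + tn else 0.0
        frr[c] = fn / (fn + tp) if fn + tp else 0.0
    return correct / n, far, frr


if __name__ == "__main__":
    # Toy labels for illustration only; not data from the study.
    truth = ["a", "a", "b", "b", "c", "c"]
    guess = ["a", "b", "b", "b", "c", "a"]
    accuracy, far, frr = evaluate(truth, guess)
    print("accuracy:", accuracy)
    print("FAR per class:", far)
    print("FRR per class:", frr)
```

Running the same function over the predictions of each classifier on a shared test set would give per-classifier figures that can be compared side by side, in the spirit of the comparison the abstract describes.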

Metadata
Title
Performance evaluation of classifiers for the recognition of offline handwritten Gurmukhi characters and numerals: a study
Authors
Munish Kumar
M. K. Jindal
R. K. Sharma
Simpel Rani Jindal
Publication date
11-06-2019
Publisher
Springer Netherlands
Published in
Artificial Intelligence Review / Issue 3/2020
Print ISSN: 0269-2821
Electronic ISSN: 1573-7462
DOI
https://doi.org/10.1007/s10462-019-09727-2
