Skip to main content
Erschienen in: Soft Computing 24/2019

08.03.2019 | Methodologies and Application

Devanagari ancient character recognition using DCT features with adaptive boosting and bootstrap aggregating

Erschienen in: Soft Computing | Ausgabe 24/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Devanagari ancient manuscript recognition framework is drawing a lot of considerations from researchers nowadays. Devanagari ancient manuscripts are rare and delicate documents. To exploit the priceless information included in these documents, these documents are being digitized. Optical character recognition process is being used for the recognition of these documents. This paper presents a system for improvement in recognition of Devanagari ancient manuscripts using AdaBoost and Bagging methodologies. Discrete cosine transform (DCT) zigzag is used for feature extraction. Decision tree, Naïve Bayes and support vector machine classifiers are used for the recognition of basic characters segmented from Devanagari ancient manuscripts. A dataset of 5484 pre-segmented characters of Devanagari ancient documents is considered for experimental work. Maximum recognition accuracy of 90.70% has been achieved using DCT zigzag features and RBF-SVM classifier. AdaBoost and Bagging ensemble methods are used with the base classifiers to improve the accuracy. Maximum accuracy of 91.70% is achieved for adaptive boosting (AdaBoost) with RBF-SVM. Various parameters for performance measures such as precision, recall, F-measure, false acceptance rate, false rejection rate and RMSE are used for assessing the quality of the ensemble methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abualigah LMQ, Hanandeh ES (2015) Applying genetic algorithms to information retrieval using vector space model. Int J Comput Sci Eng Appl 5(1):19–28 Abualigah LMQ, Hanandeh ES (2015) Applying genetic algorithms to information retrieval using vector space model. Int J Comput Sci Eng Appl 5(1):19–28
Zurück zum Zitat Abualigah LM, Khader AT (2017) Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering. J Supercomput 73(11):4773–4795CrossRef Abualigah LM, Khader AT (2017) Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering. J Supercomput 73(11):4773–4795CrossRef
Zurück zum Zitat Abualigah LM, Khader AT, Al-Betar MA, Alomari OA (2017) Text feature selection with a robust weight scheme and dynamic dimension reduction to text document clustering. Expert Syst Appl 84:24–36CrossRef Abualigah LM, Khader AT, Al-Betar MA, Alomari OA (2017) Text feature selection with a robust weight scheme and dynamic dimension reduction to text document clustering. Expert Syst Appl 84:24–36CrossRef
Zurück zum Zitat Abualigah LM, Khader AT, Hanandeh ES (2018) Hybrid clustering analysis using improved krill herd algorithm. Appl Intell 48(11):4047–4071CrossRef Abualigah LM, Khader AT, Hanandeh ES (2018) Hybrid clustering analysis using improved krill herd algorithm. Appl Intell 48(11):4047–4071CrossRef
Zurück zum Zitat Alkhateeb J, Ren J, Jiang J, Ipson SS, Abed HE (2008) Word based handwritten Arabic scripts recognition using DCT features and neural network classifier. In: Proceedings of the 5th international multi-conference on systems, signals and devices, pp 1–5 Alkhateeb J, Ren J, Jiang J, Ipson SS, Abed HE (2008) Word based handwritten Arabic scripts recognition using DCT features and neural network classifier. In: Proceedings of the 5th international multi-conference on systems, signals and devices, pp 1–5
Zurück zum Zitat Ameta D (2017) Ensemble classifier approach in breast cancer detection and malignancy grading: a review. Int J Manag Public Sect Inf Commun Technol (IJMPICT) 8(1):17–26 Ameta D (2017) Ensemble classifier approach in breast cancer detection and malignancy grading: a review. Int J Manag Public Sect Inf Commun Technol (IJMPICT) 8(1):17–26
Zurück zum Zitat Bansal S, Paliwal K (2018) Handwritten character recognition system using Gabor filter and SVM classifier. Int J Digit Appl Contemp Res 6(9):1–5 Bansal S, Paliwal K (2018) Handwritten character recognition system using Gabor filter and SVM classifier. Int J Digit Appl Contemp Res 6(9):1–5
Zurück zum Zitat Chung Y, Kim N, Park C, Lee JH (2018) Improved neighborhood search for collaborative filtering. Int J Fuzzy Log Intell Syst 18(1):29–40CrossRef Chung Y, Kim N, Park C, Lee JH (2018) Improved neighborhood search for collaborative filtering. Int J Fuzzy Log Intell Syst 18(1):29–40CrossRef
Zurück zum Zitat Dabbaghchian S, Ghaemmaghami MP, Aghagolzadeh A (2010) Feature extraction using discrete cosine transform with discrimination power analysis with a face recognition technology. Pattern Recogn 43(4):1431–1440CrossRef Dabbaghchian S, Ghaemmaghami MP, Aghagolzadeh A (2010) Feature extraction using discrete cosine transform with discrimination power analysis with a face recognition technology. Pattern Recogn 43(4):1431–1440CrossRef
Zurück zum Zitat Dattatray VJ, Raghunath SH (2008) Radon and discrete cosine transforms based feature extraction and dimensionality reduction approach for face recognition. Sig Process 88(10):2604–2609CrossRef Dattatray VJ, Raghunath SH (2008) Radon and discrete cosine transforms based feature extraction and dimensionality reduction approach for face recognition. Sig Process 88(10):2604–2609CrossRef
Zurück zum Zitat Dietterich T (2000) Ensemble methods in machine learning. In: Proceedings of first international workshop on multiple classifier systems, pp 1–15 Dietterich T (2000) Ensemble methods in machine learning. In: Proceedings of first international workshop on multiple classifier systems, pp 1–15
Zurück zum Zitat Jiang S, Frigui H, Calhoun AW (2014) Text-independent speaker identification using soft bag-of-words feature representation. Int J Fuzzy Log Intell Syst 14(4):240–248CrossRef Jiang S, Frigui H, Calhoun AW (2014) Text-independent speaker identification using soft bag-of-words feature representation. Int J Fuzzy Log Intell Syst 14(4):240–248CrossRef
Zurück zum Zitat Khodadad I, Sid-Ahmed M, Abdel-Raheem E (2011) Online Arabic/Persian character recognition using neural network classifier and DCT features. In: Proceedings of the 54th international midwest symposium on circuits and systems, pp 1–4 Khodadad I, Sid-Ahmed M, Abdel-Raheem E (2011) Online Arabic/Persian character recognition using neural network classifier and DCT features. In: Proceedings of the 54th international midwest symposium on circuits and systems, pp 1–4
Zurück zum Zitat Kim JS, Jeong JS (2015) Pattern recognition of ship navigational data using support vector machine. Int J Fuzzy Log Intell Syst 15(4):268–276CrossRef Kim JS, Jeong JS (2015) Pattern recognition of ship navigational data using support vector machine. Int J Fuzzy Log Intell Syst 15(4):268–276CrossRef
Zurück zum Zitat Kleber F, Sablatnig R, Gau M, and Miklas H (2008) Ancient document analysis based on text line extraction. In: Proceedings of the 19th international conference on pattern recognition, pp 1–4 Kleber F, Sablatnig R, Gau M, and Miklas H (2008) Ancient document analysis based on text line extraction. In: Proceedings of the 19th international conference on pattern recognition, pp 1–4
Zurück zum Zitat Kumar M, Jindal MK, Sharma RK (2014) A novel hierarchical techniques for offline handwritten Gurmukhi character recognition. Natl Acad Sci Lett 37(6):567–572CrossRef Kumar M, Jindal MK, Sharma RK (2014) A novel hierarchical techniques for offline handwritten Gurmukhi character recognition. Natl Acad Sci Lett 37(6):567–572CrossRef
Zurück zum Zitat Kuncheva LI (2005) Combining Pattern Classifiers: Methods and Algorithms. Wiley, New YorkMATH Kuncheva LI (2005) Combining Pattern Classifiers: Methods and Algorithms. Wiley, New YorkMATH
Zurück zum Zitat Lawgali A, Bouridane A, Angelova M, Ghassemlooy Z (2011) Handwritten Arabic character recognition: which feature extraction method. Int J Adv Sci Technol 34:1–8 Lawgali A, Bouridane A, Angelova M, Ghassemlooy Z (2011) Handwritten Arabic character recognition: which feature extraction method. Int J Adv Sci Technol 34:1–8
Zurück zum Zitat Lee H, Kim S (2016) Black-box classifier interpretation using decision tree and fuzzy logic-based classifier implementation. Int J Fuzzy Log Intell Syst 16(1):27–35CrossRef Lee H, Kim S (2016) Black-box classifier interpretation using decision tree and fuzzy logic-based classifier implementation. Int J Fuzzy Log Intell Syst 16(1):27–35CrossRef
Zurück zum Zitat Ling CX, Huang J, Zhang H (2003) AUC: a statistically consistent and more discriminating measure than accuracy. In: Proceedings of the 18th international joint conference on artificial intelligence (IJCAI’03), pp 329–341 Ling CX, Huang J, Zhang H (2003) AUC: a statistically consistent and more discriminating measure than accuracy. In: Proceedings of the 18th international joint conference on artificial intelligence (IJCAI’03), pp 329–341
Zurück zum Zitat Liu N, Han W (2007) Recognition of human faces using discrete cosine transform filtered trace feature. In: Proceedings of the 6th international conference on information, communications and signal processing (ICICS), pp 1–5 Liu N, Han W (2007) Recognition of human faces using discrete cosine transform filtered trace feature. In: Proceedings of the 6th international conference on information, communications and signal processing (ICICS), pp 1–5
Zurück zum Zitat Mitchell T (1997) Machine learning. McGraw-Hill, New York CityMATH Mitchell T (1997) Machine learning. McGraw-Hill, New York CityMATH
Zurück zum Zitat Monro DM, Rakshit S, Zhang D (2007) DCT-based iris recognition. IEEE Trans Pattern Anal Mach Intell 29(4):586–595CrossRef Monro DM, Rakshit S, Zhang D (2007) DCT-based iris recognition. IEEE Trans Pattern Anal Mach Intell 29(4):586–595CrossRef
Zurück zum Zitat Ngo CW, Chan CK (2005) Video text detection and segmentation for optical character recognition. Multimed Syst 10(3):261–272CrossRef Ngo CW, Chan CK (2005) Video text detection and segmentation for optical character recognition. Multimed Syst 10(3):261–272CrossRef
Zurück zum Zitat Parisi R, Claudio ED, Lucarelli G, Orlandi G (1998) Car plate recognition by neural networks and image processing. Proc IEEE Int Symp Circuits Syst 3:195–198 Parisi R, Claudio ED, Lucarelli G, Orlandi G (1998) Car plate recognition by neural networks and image processing. Proc IEEE Int Symp Circuits Syst 3:195–198
Zurück zum Zitat Quacimy BE, Kerroum MA, Hammouch A (2014) Feature extraction based on DCT for handwritten digit recognition. Int J Comput Sci Issues 11(6):27–33 Quacimy BE, Kerroum MA, Hammouch A (2014) Feature extraction based on DCT for handwritten digit recognition. Int J Comput Sci Issues 11(6):27–33
Zurück zum Zitat Quo L, Boukir S (2014) Ensemble margin framework for image classification. In: Proceedings of the IEEE international conference on image processing, France, pp 4231–4235 Quo L, Boukir S (2014) Ensemble margin framework for image classification. In: Proceedings of the IEEE international conference on image processing, France, pp 4231–4235
Zurück zum Zitat Quo L, Boukir S (2017) Building an ensemble classifier using ensemble margin. Application to image classification. In: Proceedings of the 2017 IEEE international conference on image processing, Beijing, pp 4492–4496 Quo L, Boukir S (2017) Building an ensemble classifier using ensemble margin. Application to image classification. In: Proceedings of the 2017 IEEE international conference on image processing, Beijing, pp 4492–4496
Zurück zum Zitat Ramteke SP, Gurjar AA, Deshmukh DS (2018) A streamlined OCR system for handwritten Marathi text document classification and recognition using SVM-ACS algorithm. Int J Intell Eng Syst 11(3):186–195 Ramteke SP, Gurjar AA, Deshmukh DS (2018) A streamlined OCR system for handwritten Marathi text document classification and recognition using SVM-ACS algorithm. Int J Intell Eng Syst 11(3):186–195
Zurück zum Zitat Santana LEA, Silva L, Canuto AM, Pintro F, Vale KO (2010) A comparative analysis of genetic algorithm and ant colony optimization to select attributes for a heterogeneous ensemble of classifiers. In: IEEE congress evolutionary computation (CEC), pp 1–8 Santana LEA, Silva L, Canuto AM, Pintro F, Vale KO (2010) A comparative analysis of genetic algorithm and ant colony optimization to select attributes for a heterogeneous ensemble of classifiers. In: IEEE congress evolutionary computation (CEC), pp 1–8
Zurück zum Zitat Wang S, Yao X (2013) Relationships between diversity of classification ensembles and single-class performance measures. Knowl Data Eng 25(1):206–219CrossRef Wang S, Yao X (2013) Relationships between diversity of classification ensembles and single-class performance measures. Knowl Data Eng 25(1):206–219CrossRef
Metadaten
Titel
Devanagari ancient character recognition using DCT features with adaptive boosting and bootstrap aggregating
Publikationsdatum
08.03.2019
Erschienen in
Soft Computing / Ausgabe 24/2019
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-03897-5

Weitere Artikel der Ausgabe 24/2019

Soft Computing 24/2019 Zur Ausgabe

Premium Partner