Skip to main content
Erschienen in: Neural Computing and Applications 7/2020

06.03.2019 | Original Article

Improved word-level handwritten Indic script identification by integrating small convolutional neural networks

verfasst von: Soumya Ukil, Swarnendu Ghosh, Sk Md Obaidullah, K. C. Santosh, Kaushik Roy, Nibaran Das

Erschienen in: Neural Computing and Applications | Ausgabe 7/2020

Einloggen

Aktivieren Sie unsere intelligente Suche um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Handwritten document recognition has been an active domain of research in the field of computer vision for several years since 1914 with the development of handheld scanner for reading printed texts called “optophone”. In India, which has several different scripts in one document page, identifying them is a must to automate process: document understanding. We propose a novel technique in integrating convolutional neural networks (CNNs) for script identification. We combined small individually trainable small CNNs, and used several different levels of variation in the architectures of the individual CNNs. Such a collection of individually trainable modules vary with respect to the input image size, CNN’s depth and wavelet transformation. In our test, we used publicly available dataset of size 11K words (1K per script) from 11 different Indic Scripts: Bangla, Devanagari, Gujarati, Gurumukhi, Kannada, Malayalam, Oriya, Roman, Tamil, Telugu and Urdu. Several ensemble strategies were implemented such as max-voting and probabilistic voting are used in addition to other conventional approaches like feature concatenation. We achieved a maximum accuracy of 95.04%, and it outperforms the accuracy of the state-of-the-art techniques like AlexNet by 2.9% and more importantly, benchmark techniques as (for script identification) on the dataset by more than 4%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ahmed SB, Naz S, Razzak MI, Rashid SF, Afzal MZ, Breuel TM (2016) Evaluation of cursive and non-cursive scripts using recurrent neural networks. Neural Comput Appl 27(3):603–613CrossRef Ahmed SB, Naz S, Razzak MI, Rashid SF, Afzal MZ, Breuel TM (2016) Evaluation of cursive and non-cursive scripts using recurrent neural networks. Neural Comput Appl 27(3):603–613CrossRef
2.
Zurück zum Zitat Anil R, Manjusha K, Kumar SS, Soman K (2015) Convolutional neural networks for the recognition of Malayalam characters. In: Proceedings of the 3rd international conference on frontiers of intelligent computing: theory and applications (FICTA) 2014. Springer, pp 493–500 Anil R, Manjusha K, Kumar SS, Soman K (2015) Convolutional neural networks for the recognition of Malayalam characters. In: Proceedings of the 3rd international conference on frontiers of intelligent computing: theory and applications (FICTA) 2014. Springer, pp 493–500
3.
Zurück zum Zitat Basu S, Das N, Sarkar R, Kundu M, Nasipuri M, Basu DK (2009) A hierarchical approach to recognition of handwritten bangla characters. Pattern Recognit 42(7):1467–1484CrossRef Basu S, Das N, Sarkar R, Kundu M, Nasipuri M, Basu DK (2009) A hierarchical approach to recognition of handwritten bangla characters. Pattern Recognit 42(7):1467–1484CrossRef
4.
Zurück zum Zitat Bhattacharya U, Chaudhuri B (2005) Databases for research on recognition of handwritten characters of Indian scripts. In: Eighth international conference on document analysis and recognition, 2005. Proceedings. IEEE, pp 789–793 Bhattacharya U, Chaudhuri B (2005) Databases for research on recognition of handwritten characters of Indian scripts. In: Eighth international conference on document analysis and recognition, 2005. Proceedings. IEEE, pp 789–793
5.
Zurück zum Zitat Bhattacharya U, Chaudhuri BB (2009) Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans Pattern Anal Mach Intell 31(3):444–457CrossRef Bhattacharya U, Chaudhuri BB (2009) Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans Pattern Anal Mach Intell 31(3):444–457CrossRef
6.
Zurück zum Zitat Bracewell RN, Bracewell RN (1986) The Fourier transform and its applications, vol 31999. McGraw-Hill, New YorkMATH Bracewell RN, Bracewell RN (1986) The Fourier transform and its applications, vol 31999. McGraw-Hill, New YorkMATH
7.
Zurück zum Zitat Brodić D, Amelio A, Milivojević ZN (2016) Language discrimination by texture analysis of the image corresponding to the text. Neural Comput Appl 29:1–22 Brodić D, Amelio A, Milivojević ZN (2016) Language discrimination by texture analysis of the image corresponding to the text. Neural Comput Appl 29:1–22
8.
Zurück zum Zitat Busch A, Boles WW, Sridharan S (2005) Texture for script identification. IEEE Trans Pattern Anal Mach Intell 27(11):1720–1732CrossRef Busch A, Boles WW, Sridharan S (2005) Texture for script identification. IEEE Trans Pattern Anal Mach Intell 27(11):1720–1732CrossRef
9.
Zurück zum Zitat Das N, Sarkar R, Basu S, Saha PK, Kundu M, Nasipuri M (2015) Handwritten bangla character recognition using a soft computing paradigm embedded in two pass approach. Pattern Recognit 48(6):2054–2071CrossRef Das N, Sarkar R, Basu S, Saha PK, Kundu M, Nasipuri M (2015) Handwritten bangla character recognition using a soft computing paradigm embedded in two pass approach. Pattern Recognit 48(6):2054–2071CrossRef
10.
Zurück zum Zitat Daubechies I (1990) The wavelet transform, time-frequency localization and signal analysis. IEEE Trans Inf Theory 36(5):961–1005MathSciNetCrossRef Daubechies I (1990) The wavelet transform, time-frequency localization and signal analysis. IEEE Trans Inf Theory 36(5):961–1005MathSciNetCrossRef
11.
Zurück zum Zitat Dhanya D, Ramakrishnan A, Pati PB (2002) Script identification in printed bilingual documents. Sadhana 27(1):73–82CrossRef Dhanya D, Ramakrishnan A, Pati PB (2002) Script identification in printed bilingual documents. Sadhana 27(1):73–82CrossRef
12.
Zurück zum Zitat Garain U, Chakraborty M, Dasgupta D (2006) Recognition of handwritten indic script using clonal selection algorithm. In: Artificial immune systems, pp 256–266 Garain U, Chakraborty M, Dasgupta D (2006) Recognition of handwritten indic script using clonal selection algorithm. In: Artificial immune systems, pp 256–266
13.
Zurück zum Zitat Ghosh D, Dube T, Shivaprasad A (2010) Script recognition a review. IEEE Trans Pattern Anal Mach Intell 32(12):2142–2161CrossRef Ghosh D, Dube T, Shivaprasad A (2010) Script recognition a review. IEEE Trans Pattern Anal Mach Intell 32(12):2142–2161CrossRef
14.
Zurück zum Zitat Govindaraju V, Setlur S (2009) Guide to OCR for indic scripts. Springer, Berlin Govindaraju V, Setlur S (2009) Guide to OCR for indic scripts. Springer, Berlin
15.
Zurück zum Zitat Hangarge M, Santosh K, Pardeshi R (2013) Directional discrete cosine transform for handwritten script identification. In: 2013 12th International conference on document analysis and recognition (ICDAR). IEEE, pp 344–348 Hangarge M, Santosh K, Pardeshi R (2013) Directional discrete cosine transform for handwritten script identification. In: 2013 12th International conference on document analysis and recognition (ICDAR). IEEE, pp 344–348
16.
Zurück zum Zitat John J, Pramod K, Balakrishnan K (2012) Unconstrained handwritten Malayalam character recognition using wavelet transform and support vector machine classifier. Procedia Eng 30:598–605CrossRef John J, Pramod K, Balakrishnan K (2012) Unconstrained handwritten Malayalam character recognition using wavelet transform and support vector machine classifier. Procedia Eng 30:598–605CrossRef
18.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton G.E (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105 Krizhevsky A, Sutskever I, Hinton G.E (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
20.
Zurück zum Zitat LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef
21.
Zurück zum Zitat Mehrotra K, Jetley S, Deshmukh A, Belhe S (2013) Unconstrained handwritten devanagari character recognition using convolutional neural networks. In: Proceedings of the 4th international workshop on multilingual OCR. ACM, p 15 Mehrotra K, Jetley S, Deshmukh A, Belhe S (2013) Unconstrained handwritten devanagari character recognition using convolutional neural networks. In: Proceedings of the 4th international workshop on multilingual OCR. ACM, p 15
22.
Zurück zum Zitat Neeba N, Jawahar C (2009) Empirical evaluation of character classification schemes. In: Seventh international conference on advances in pattern recognition, 2009. ICAPR’09. IEEE, pp 310–313 Neeba N, Jawahar C (2009) Empirical evaluation of character classification schemes. In: Seventh international conference on advances in pattern recognition, 2009. ICAPR’09. IEEE, pp 310–313
23.
Zurück zum Zitat Obaidullah SM, Das N, Halder C, Roy K (2015) Indic script identification from handwritten document images an unconstrained block-level approach. In: 2015 IEEE 2nd international conference on recent trends in information systems (ReTIS). IEEE, pp 213–218 Obaidullah SM, Das N, Halder C, Roy K (2015) Indic script identification from handwritten document images an unconstrained block-level approach. In: 2015 IEEE 2nd international conference on recent trends in information systems (ReTIS). IEEE, pp 213–218
24.
Zurück zum Zitat Obaidullah SM, Halder C, Santosh K, Das N, Roy K (2017) Phdindic\(\_\)11: page-level handwritten document image dataset of 11 official indic scripts for script identification. Multimedia Tools Appl 77:1–36 Obaidullah SM, Halder C, Santosh K, Das N, Roy K (2017) Phdindic\(\_\)11: page-level handwritten document image dataset of 11 official indic scripts for script identification. Multimedia Tools Appl 77:1–36
25.
Zurück zum Zitat Obaidullah SM, Santosh K, Halder C, Das N, Roy K (2017) Automatic indic script identification from handwritten documents: page, block, line and word-level approach. Int J Mach Learn Cybern 10:1–20 Obaidullah SM, Santosh K, Halder C, Das N, Roy K (2017) Automatic indic script identification from handwritten documents: page, block, line and word-level approach. Int J Mach Learn Cybern 10:1–20
26.
Zurück zum Zitat Pal U, Chaudhuri B (2004) Indian script character recognition: a survey. Pattern Recognit 37(9):1887–1899CrossRef Pal U, Chaudhuri B (2004) Indian script character recognition: a survey. Pattern Recognit 37(9):1887–1899CrossRef
27.
Zurück zum Zitat Pal U, Jayadevan R, Sharma N (2012) Handwriting recognition in indian regional scripts: a survey of offline techniques. ACM Trans Asian Lang Inf Process (TALIP) 11(1):1CrossRef Pal U, Jayadevan R, Sharma N (2012) Handwriting recognition in indian regional scripts: a survey of offline techniques. ACM Trans Asian Lang Inf Process (TALIP) 11(1):1CrossRef
28.
Zurück zum Zitat Pal U, Sinha S, Chaudhuri B (2003) Multi-script line identification from indian documents. In: Seventh international conference on document analysis and recognition, 2003. Proceedings. IEEE, pp 880–884 Pal U, Sinha S, Chaudhuri B (2003) Multi-script line identification from indian documents. In: Seventh international conference on document analysis and recognition, 2003. Proceedings. IEEE, pp 880–884
29.
Zurück zum Zitat Pati PB, Ramakrishnan A (2008) Word level multi-script identification. Pattern Recogn Lett 29(9):1218–1229CrossRef Pati PB, Ramakrishnan A (2008) Word level multi-script identification. Pattern Recogn Lett 29(9):1218–1229CrossRef
30.
Zurück zum Zitat Portnoff M (1980) Time-frequency representation of digital signals and systems based on short-time fourier analysis. IEEE Trans Acoust Speech Signal Process 28(1):55–69CrossRef Portnoff M (1980) Time-frequency representation of digital signals and systems based on short-time fourier analysis. IEEE Trans Acoust Speech Signal Process 28(1):55–69CrossRef
31.
Zurück zum Zitat Porwik P, Lisowska A (2004) The haar-wavelet transform in digital image processing: its status and achievements. Mach Graph Vis 13(1/2):79–98MATH Porwik P, Lisowska A (2004) The haar-wavelet transform in digital image processing: its status and achievements. Mach Graph Vis 13(1/2):79–98MATH
32.
Zurück zum Zitat Rajput G, Anita H (2013) Handwritten script recognition at line level-a multiple feature based approach. Int J Eng Innovative Technol 3(4):90–95 Rajput G, Anita H (2013) Handwritten script recognition at line level-a multiple feature based approach. Int J Eng Innovative Technol 3(4):90–95
33.
Zurück zum Zitat Rani R, Dhir R, Lehal GS (2013) Script identification of pre-segmented multi-font characters and digits. In: 2013 12th International conference on document analysis and recognition (ICDAR). IEEE, pp 1150–1154 Rani R, Dhir R, Lehal GS (2013) Script identification of pre-segmented multi-font characters and digits. In: 2013 12th International conference on document analysis and recognition (ICDAR). IEEE, pp 1150–1154
34.
Zurück zum Zitat Roy K, Das S.K, Obaidullah SM (2011) Script identification from handwritten document. In: 2011 Third national conference on computer vision, pattern recognition, image processing and graphics (NCVPRIPG). IEEE, pp 66–69 Roy K, Das S.K, Obaidullah SM (2011) Script identification from handwritten document. In: 2011 Third national conference on computer vision, pattern recognition, image processing and graphics (NCVPRIPG). IEEE, pp 66–69
35.
Zurück zum Zitat Roy S, Das N, Kundu M, Nasipuri M (2017) Handwritten isolated bangla compound character recognition: a new benchmark using a novel deep learning approach. Pattern Recogn Lett 90:15–21CrossRef Roy S, Das N, Kundu M, Nasipuri M (2017) Handwritten isolated bangla compound character recognition: a new benchmark using a novel deep learning approach. Pattern Recogn Lett 90:15–21CrossRef
36.
Zurück zum Zitat Sarkhel R, Das N, Das A, Kundu M, Nasipuri M (2017) A multi-scale deep quad tree based feature extraction method for the recognition of isolated handwritten characters of popular indic scripts. Pattern Recognit 71:78–93CrossRef Sarkhel R, Das N, Das A, Kundu M, Nasipuri M (2017) A multi-scale deep quad tree based feature extraction method for the recognition of isolated handwritten characters of popular indic scripts. Pattern Recognit 71:78–93CrossRef
37.
Zurück zum Zitat Schenkel M, Guyon I, Henderson D (1995) On-line cursive script recognition using time-delay neural networks and hidden markov models. Mach Vis Appl 8(4):215–223CrossRef Schenkel M, Guyon I, Henderson D (1995) On-line cursive script recognition using time-delay neural networks and hidden markov models. Mach Vis Appl 8(4):215–223CrossRef
38.
Zurück zum Zitat Sharma MK, Dhaka VP (2016) Pixel plot and trace based segmentation method for bilingual handwritten scripts using feedforward neural network. Neural Comput Appl 27(7):1817–1829CrossRef Sharma MK, Dhaka VP (2016) Pixel plot and trace based segmentation method for bilingual handwritten scripts using feedforward neural network. Neural Comput Appl 27(7):1817–1829CrossRef
39.
Zurück zum Zitat Singh PK, Mondal A, Bhowmik S, Sarkar R, Nasipuri M (2015) Word-level script identification from handwritten multi-script documents. In: Proceedings of the 3rd international conference on frontiers of intelligent computing: theory and applications (FICTA) 2014. Springer, pp 551–558 Singh PK, Mondal A, Bhowmik S, Sarkar R, Nasipuri M (2015) Word-level script identification from handwritten multi-script documents. In: Proceedings of the 3rd international conference on frontiers of intelligent computing: theory and applications (FICTA) 2014. Springer, pp 551–558
40.
Zurück zum Zitat Singh PK, Sarkar R, Nasipuri M (2015) Offline script identification from multilingual indic-script documents: a state-of-the-art. Comput Sci Rev 15:1–28MathSciNetCrossRef Singh PK, Sarkar R, Nasipuri M (2015) Offline script identification from multilingual indic-script documents: a state-of-the-art. Comput Sci Rev 15:1–28MathSciNetCrossRef
41.
Zurück zum Zitat Singh PK, Sarkar R, Nasipuri M, Doermann D (2015) Word-level script identification for handwritten indic scripts. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 1106–1110 Singh PK, Sarkar R, Nasipuri M, Doermann D (2015) Word-level script identification for handwritten indic scripts. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 1106–1110
42.
Zurück zum Zitat Smith S (1997) Fourier transform properties. The scientist and engineers guide to digital signal processing. California Technical Publishing, San Diego, pp 185–208 Smith S (1997) Fourier transform properties. The scientist and engineers guide to digital signal processing. California Technical Publishing, San Diego, pp 185–208
43.
Zurück zum Zitat Stanković RS, Falkowski BJ (2003) The haar wavelet transform: its status and achievements. Comput Electr Eng 29(1):25–44CrossRef Stanković RS, Falkowski BJ (2003) The haar wavelet transform: its status and achievements. Comput Electr Eng 29(1):25–44CrossRef
44.
Zurück zum Zitat Ubul K, Tursun G, Aysa A, Impedovo D, Pirlo G, Yibulayin T (2017) Script identification of multi-script documents: a survey. IEEE Access 5:6546–6559 Ubul K, Tursun G, Aysa A, Impedovo D, Pirlo G, Yibulayin T (2017) Script identification of multi-script documents: a survey. IEEE Access 5:6546–6559
45.
Zurück zum Zitat Verma K, Sharma RK (2016) Comparison of HMM-and SVM-based stroke classifiers for Gurmukhi script. Neural Comput Appl 28:1–13CrossRef Verma K, Sharma RK (2016) Comparison of HMM-and SVM-based stroke classifiers for Gurmukhi script. Neural Comput Appl 28:1–13CrossRef
Metadaten
Titel
Improved word-level handwritten Indic script identification by integrating small convolutional neural networks
verfasst von
Soumya Ukil
Swarnendu Ghosh
Sk Md Obaidullah
K. C. Santosh
Kaushik Roy
Nibaran Das
Publikationsdatum
06.03.2019
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 7/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-019-04111-1

Weitere Artikel der Ausgabe 7/2020

Neural Computing and Applications 7/2020 Zur Ausgabe

Deep Learning & Neural Computing for Intelligent Sensing and Control

Image object detection and semantic segmentation based on convolutional neural network

Deep Learning & Neural Computing for Intelligent Sensing and Control

Even faster retinal vessel segmentation via accelerated singular value decomposition

Deep Learning & Neural Computing for Intelligent Sensing and Control

Prediction of air quality in Shenzhen based on neural network algorithm

Deep Learning & Neural Computing for Intelligent Sensing and Control

Research on radar signal recognition based on automatic machine learning