Skip to main content
Erschienen in: Neural Computing and Applications 12/2020

18.05.2019 | Hybrid Artificial Intelligence and Machine Learning Technologies

Understanding NFC-Net: a deep learning approach to word-level handwritten Indic script recognition

verfasst von: Soumyadeep Kundu, Sayantan Paul, Pawan Kumar Singh, Ram Sarkar, Mita Nasipuri

Erschienen in: Neural Computing and Applications | Ausgabe 12/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper presents a deep learning architecture modified for resource-constrained environments, called Non-Fully-Connected Network or NFC-Net, based on convolutional neural network architecture in order to solve the problem of Indic script recognition from handwritten word images. NFC-Net mainly targets resource constraint environment where there is a limited computation power or inadequate training samples or restricted training time. Previous approaches to handwritten script recognition included handcrafted features such as structure-based features and texture-based features. In contrast, here our model learns relatively different features from raw input pixels using NFC-Net. Various parameters of the NFC-Net are adjusted to present a vast and comprehensive study of the neural net in the domain of handwritten script recognition. In order to evaluate the performance of the NFC-Net with suitable parameter estimation, a dataset of 18,000 handwritten multiscript word images consisting of 1500 text words from each of the 12 officially recognized Indic scripts has been considered and a maximum script recognition accuracy of 96.30% is noted. Our proposed model also performs better than some of the recently published script recognition methods in bi-script, tri-script, tetra-script and 12-script scenarios. It has been additionally tested on the RaFD and BHCCD datasets with improved results to prove dataset independency of our model.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Singh PK, Sarkar R, Nasipuri M (2015) Offline script identification from multilingual Indic-script documents: a state-of-the-art. Comput Sci Rev 15(C):1–28MathSciNetCrossRef Singh PK, Sarkar R, Nasipuri M (2015) Offline script identification from multilingual Indic-script documents: a state-of-the-art. Comput Sci Rev 15(C):1–28MathSciNetCrossRef
2.
Zurück zum Zitat Sangame SK, Ramteke RJ, Andure S, Gundge Y (2012) V: script identification of text words from a bilingual document using voting techniques. World J Sci Technol 2:114–119CrossRef Sangame SK, Ramteke RJ, Andure S, Gundge Y (2012) V: script identification of text words from a bilingual document using voting techniques. World J Sci Technol 2:114–119CrossRef
3.
Zurück zum Zitat Roy K, Pal U (2006) Word-wise handwritten script separation for Indian postal automation. In: Proceedings of the international workshop on frontiers in handwriting recognition, La Baule, pp 521–526 Roy K, Pal U (2006) Word-wise handwritten script separation for Indian postal automation. In: Proceedings of the international workshop on frontiers in handwriting recognition, La Baule, pp 521–526
4.
Zurück zum Zitat Roy K, Pal U, Chaudhuri BB (2005) Neural network based word-wise handwritten script identification system for Indian postal automation. In: Proceedings of the international conference on intelligent sensing and information processing, Chennai, pp 581–586. https://doi.org/10.1109/icisip.2005.1529455 Roy K, Pal U, Chaudhuri BB (2005) Neural network based word-wise handwritten script identification system for Indian postal automation. In: Proceedings of the international conference on intelligent sensing and information processing, Chennai, pp 581–586. https://​doi.​org/​10.​1109/​icisip.​2005.​1529455
5.
Zurück zum Zitat Sarkar R, Das N, Basu S, Kundu M, Nasipuri M, Basu DK (2010) Word level script identification from Bangla and Devanagari handwritten texts mixed with Roman scripts. J Comput 2(2):103–108 Sarkar R, Das N, Basu S, Kundu M, Nasipuri M, Basu DK (2010) Word level script identification from Bangla and Devanagari handwritten texts mixed with Roman scripts. J Comput 2(2):103–108
6.
Zurück zum Zitat Memon MH, Li JP, Memon I, Arain QA, Memon MH (2017) Region based localized matching image retrieval system using color-size features for image retrieval. In: 2017 14th International computer conference on wavelet active media technology and information processing (ICCWAMTIP). https://doi.org/10.1109/iccwamtip.2017.8301481 Memon MH, Li JP, Memon I, Arain QA, Memon MH (2017) Region based localized matching image retrieval system using color-size features for image retrieval. In: 2017 14th International computer conference on wavelet active media technology and information processing (ICCWAMTIP). https://​doi.​org/​10.​1109/​iccwamtip.​2017.​8301481
8.
Zurück zum Zitat Memon MH, Li JP, Memon I, Arain QA (2017) GEO matching regions: multiple regions of interests using content based image retrieval based on relative locations. Multimed Tools Appl 76:15377–15411CrossRef Memon MH, Li JP, Memon I, Arain QA (2017) GEO matching regions: multiple regions of interests using content based image retrieval based on relative locations. Multimed Tools Appl 76:15377–15411CrossRef
11.
Zurück zum Zitat Singh PK, Sarkar R, Das N, Basu S, Nasipuri M (2014) Statistical comparison of classifiers for script identification from multi-script handwritten documents. Int J Appl Pattern Recognit 1(2):152–172CrossRef Singh PK, Sarkar R, Das N, Basu S, Nasipuri M (2014) Statistical comparison of classifiers for script identification from multi-script handwritten documents. Int J Appl Pattern Recognit 1(2):152–172CrossRef
12.
Zurück zum Zitat Patil SB, Subbareddy NV (2002) Neural network-based system for script identification in Indian documents. Sadhana 27(1):83–97CrossRef Patil SB, Subbareddy NV (2002) Neural network-based system for script identification in Indian documents. Sadhana 27(1):83–97CrossRef
13.
Zurück zum Zitat Khandelwal A, Choudhury P, Sarkar R, Basu S, Nasipuri M, Das N (2009) Text line segmentation for unconstrained handwritten document images using neighborhood connected component analysis. In: International conference on pattern recognition and machine intelligence, LNCS 5909. Springer, Berlin, pp 369–374 Khandelwal A, Choudhury P, Sarkar R, Basu S, Nasipuri M, Das N (2009) Text line segmentation for unconstrained handwritten document images using neighborhood connected component analysis. In: International conference on pattern recognition and machine intelligence, LNCS 5909. Springer, Berlin, pp 369–374
14.
Zurück zum Zitat Wahl FM, Wong KY, Casey RG (1982) Block segmentation and text extraction in mixed text/image documents. Comput Graph Image Process 20(4):375–390CrossRef Wahl FM, Wong KY, Casey RG (1982) Block segmentation and text extraction in mixed text/image documents. Comput Graph Image Process 20(4):375–390CrossRef
16.
Zurück zum Zitat Ma H, Doermann D (2004) Word level script identification on scanned document images. In: Proceedings of the SPIE conference on document recognition and retrieval, San Jose, CA, USA, pp 124–135 Ma H, Doermann D (2004) Word level script identification on scanned document images. In: Proceedings of the SPIE conference on document recognition and retrieval, San Jose, CA, USA, pp 124–135
17.
Zurück zum Zitat Peake GS, Tan TN (1998) Script and language identification from document images. In: Proceedings of the Asian conference computer vision, LNCS, vol 1352, pp 97–104 Peake GS, Tan TN (1998) Script and language identification from document images. In: Proceedings of the Asian conference computer vision, LNCS, vol 1352, pp 97–104
18.
Zurück zum Zitat Padma MC, Vijaya PA (2010) Global approach for script identification using wavelet packet based features. Int J Signal Process Image Process Pattern Recognit 3:29–40 Padma MC, Vijaya PA (2010) Global approach for script identification using wavelet packet based features. Int J Signal Process Image Process Pattern Recognit 3:29–40
19.
Zurück zum Zitat Singh PK, Mondal A, Bhowmik S, Sarkar R, Nasipuri M (2014) Word-level script identification from multi-script handwritten documents. In: Proceedings of the 3rd international conference on frontiers in intelligent computing theory and applications (FICTA), pp 551–558 Singh PK, Mondal A, Bhowmik S, Sarkar R, Nasipuri M (2014) Word-level script identification from multi-script handwritten documents. In: Proceedings of the 3rd international conference on frontiers in intelligent computing theory and applications (FICTA), pp 551–558
21.
Zurück zum Zitat Hangarge M, Santosh KC, Pardeshi R (2013) Directional discrete Cosine transform for handwritten script identification. In: Proceedings of 12th IEEE international conference on document analysis and recognition (ICDAR), 2013, pp 344–348 Hangarge M, Santosh KC, Pardeshi R (2013) Directional discrete Cosine transform for handwritten script identification. In: Proceedings of 12th IEEE international conference on document analysis and recognition (ICDAR), 2013, pp 344–348
22.
Zurück zum Zitat Pardeshi R, Chaudhuri BB, Hangarge M, Santosh KC (2014) Automatic handwritten Indian scripts identification. In: Proceedings of 14th IEEE international conference on frontiers in handwriting recognition (ICFHR), 2014, pp 375–380 Pardeshi R, Chaudhuri BB, Hangarge M, Santosh KC (2014) Automatic handwritten Indian scripts identification. In: Proceedings of 14th IEEE international conference on frontiers in handwriting recognition (ICFHR), 2014, pp 375–380
23.
Zurück zum Zitat Chanda S, Pal S, Pal U (2008) Word-wise Sinhala, Tamil and English script identification using Gaussian kernel SVM. In: Proceedings of 19th IEEE international conference on pattern recognition, pp 1–4 Chanda S, Pal S, Pal U (2008) Word-wise Sinhala, Tamil and English script identification using Gaussian kernel SVM. In: Proceedings of 19th IEEE international conference on pattern recognition, pp 1–4
24.
Zurück zum Zitat Chanda S, Pal S, Franke K, Pal U (2009) Two-stage approach for word-wise script identification. In: Proceedings of 10th IEEE International Conference on Document Analysis and Recognition (ICDAR), pp 926–930 Chanda S, Pal S, Franke K, Pal U (2009) Two-stage approach for word-wise script identification. In: Proceedings of 10th IEEE International Conference on Document Analysis and Recognition (ICDAR), pp 926–930
25.
Zurück zum Zitat Swamy Das M, Sandhya Rani D, Reddy CRK (2012) Heuristic based script identification from multilingual text documents. In: Proceedings of 1st conference on recent advances in information technology (RAIT), pp 487–492 Swamy Das M, Sandhya Rani D, Reddy CRK (2012) Heuristic based script identification from multilingual text documents. In: Proceedings of 1st conference on recent advances in information technology (RAIT), pp 487–492
26.
Zurück zum Zitat Swamy Das M, Sandhya Rani D, Reddy CRK, Govadhan A (2011) Script identification from multilingual Telugu, Hindi and English text documents. Int J Wisdom Based Comput 1(3):79–85 Swamy Das M, Sandhya Rani D, Reddy CRK, Govadhan A (2011) Script identification from multilingual Telugu, Hindi and English text documents. Int J Wisdom Based Comput 1(3):79–85
27.
Zurück zum Zitat Singh PK, Sarkar R, Nasipuri M, Doermann D (2015) Word-level script identification for handwritten Indic scripts. In: Proceedings of 13th IEEE international conference on document analysis and recognition (ICDAR), pp 1106–1110 Singh PK, Sarkar R, Nasipuri M, Doermann D (2015) Word-level script identification for handwritten Indic scripts. In: Proceedings of 13th IEEE international conference on document analysis and recognition (ICDAR), pp 1106–1110
28.
Zurück zum Zitat Obaidullah SM, Santosh KC, Halder C, Das N, Roy K (2017) Automatic Indic script identification from handwritten documents: page, block, line and word-level approach. Int J Mach Learn Cybern 10(1):87–106CrossRef Obaidullah SM, Santosh KC, Halder C, Das N, Roy K (2017) Automatic Indic script identification from handwritten documents: page, block, line and word-level approach. Int J Mach Learn Cybern 10(1):87–106CrossRef
29.
Zurück zum Zitat Singh PK, Sarkar R, Das N, Basu S, Kundu M, Nasipuri M (2018) Benchmark databases of handwritten Bangla-Roman and Devanagari-Roman mixed-script document images. Multimed Tools Appl 77(7):8441–8473CrossRef Singh PK, Sarkar R, Das N, Basu S, Kundu M, Nasipuri M (2018) Benchmark databases of handwritten Bangla-Roman and Devanagari-Roman mixed-script document images. Multimed Tools Appl 77(7):8441–8473CrossRef
30.
Zurück zum Zitat Obaidullah SM, Goswami C, Santosh KC, Das N, Halder C, Roy K (2017) Separating Indic scripts with matra for effective handwritten script identification in multi-script documents. Int J Pattern Recognit Artif Intell 31(05):1753003CrossRef Obaidullah SM, Goswami C, Santosh KC, Das N, Halder C, Roy K (2017) Separating Indic scripts with matra for effective handwritten script identification in multi-script documents. Int J Pattern Recognit Artif Intell 31(05):1753003CrossRef
31.
Zurück zum Zitat Bhunia AK, Mukherjee S, Sain A, Bhattacharyya A, Bhunia AK, Roy PP, Pal U (2018) Indic handwritten script identification using offline-online multimodal deep network. arXiv preprint arXiv:1802.08568 Bhunia AK, Mukherjee S, Sain A, Bhattacharyya A, Bhunia AK, Roy PP, Pal U (2018) Indic handwritten script identification using offline-online multimodal deep network. arXiv preprint arXiv:​1802.​08568
32.
Zurück zum Zitat Ukil S, Ghosh S, Obaidullah SM, Santosh KC, Roy K, Das N (2018) Deep learning for word-level handwritten Indic script identification. arXiv preprint arXiv:1801.01627 Ukil S, Ghosh S, Obaidullah SM, Santosh KC, Roy K, Das N (2018) Deep learning for word-level handwritten Indic script identification. arXiv preprint arXiv:​1801.​01627
33.
Zurück zum Zitat Pati PB, Ramakrishnan AG (2006) HVS inspired system for script identification in Indian multi-script documents. In: Lecture notes in computer science: international workshop document analysis systems, vol 3872, Nelson, 2006, pp 380–389 Pati PB, Ramakrishnan AG (2006) HVS inspired system for script identification in Indian multi-script documents. In: Lecture notes in computer science: international workshop document analysis systems, vol 3872, Nelson, 2006, pp 380–389
35.
Zurück zum Zitat LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef
36.
Zurück zum Zitat Akhand MAH, Rahman MM, Shill PC, Islam S, Hafizur Rahman MM (2015) Bangla handwritten numeral recognition using convolutional neural network. In: 2015 IEEE international conference on electrical engineering and information communication technology (ICEEICT), 1–5 Akhand MAH, Rahman MM, Shill PC, Islam S, Hafizur Rahman MM (2015) Bangla handwritten numeral recognition using convolutional neural network. In: 2015 IEEE international conference on electrical engineering and information communication technology (ICEEICT), 1–5
37.
Zurück zum Zitat Zhao H, Hu Y, Zhang J (2017) Character recognition via a compact convolutional neural network. In: 2017 International conference on digital image computing: techniques and applications (DICTA), 1–6 Zhao H, Hu Y, Zhang J (2017) Character recognition via a compact convolutional neural network. In: 2017 International conference on digital image computing: techniques and applications (DICTA), 1–6
39.
Zurück zum Zitat LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436CrossRef LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436CrossRef
41.
Zurück zum Zitat Langner O, Dotsch R, Bijlstra G, Wigboldus DH, Hawk ST, Van Knippenberg AD (2010) Presentation and validation of the Radboud faces database. Cogn Emot 24(8):1377–1388CrossRef Langner O, Dotsch R, Bijlstra G, Wigboldus DH, Hawk ST, Van Knippenberg AD (2010) Presentation and validation of the Radboud faces database. Cogn Emot 24(8):1377–1388CrossRef
43.
Zurück zum Zitat Das N, Acharya K, Sarkar R, Basu S, Kundu M, Nasipuri M (2014) A benchmark image database of isolated Bangla handwritten compound characters. Int J Doc Anal Recognit (IJDAR) 17(4):413–431CrossRef Das N, Acharya K, Sarkar R, Basu S, Kundu M, Nasipuri M (2014) A benchmark image database of isolated Bangla handwritten compound characters. Int J Doc Anal Recognit (IJDAR) 17(4):413–431CrossRef
44.
Zurück zum Zitat Roy S, Das N, Kundu M, Nasipuri M (2017) Handwritten isolated Bangla compound character recognition: a new benchmark using a novel deep learning approach. Pattern Recogn Lett 90:15–21CrossRef Roy S, Das N, Kundu M, Nasipuri M (2017) Handwritten isolated Bangla compound character recognition: a new benchmark using a novel deep learning approach. Pattern Recogn Lett 90:15–21CrossRef
45.
Zurück zum Zitat Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Rabinovich A (2015) Going deeper with convolutions. In: CVPR Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Rabinovich A (2015) Going deeper with convolutions. In: CVPR
46.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton GE (2012). Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012). Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
47.
Zurück zum Zitat He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
48.
Zurück zum Zitat Obaidullah SM, Karim R, Shaikh S, Halder C, Das N, Roy K (2015) Transform based approach for Indic script identification from handwritten document images. In: 2015 3rd International conference on signal processing, communication and networking (ICSCN). IEEE, pp 1–7 Obaidullah SM, Karim R, Shaikh S, Halder C, Das N, Roy K (2015) Transform based approach for Indic script identification from handwritten document images. In: 2015 3rd International conference on signal processing, communication and networking (ICSCN). IEEE, pp 1–7
49.
Zurück zum Zitat Singh PK, Das S, Sarkar R, Nasipuri M (2016) Line parameter based word-level Indic script identification system. Int J Comput Vis Image Process (IJCVIP) 6(2):18–41CrossRef Singh PK, Das S, Sarkar R, Nasipuri M (2016) Line parameter based word-level Indic script identification system. Int J Comput Vis Image Process (IJCVIP) 6(2):18–41CrossRef
Metadaten
Titel
Understanding NFC-Net: a deep learning approach to word-level handwritten Indic script recognition
verfasst von
Soumyadeep Kundu
Sayantan Paul
Pawan Kumar Singh
Ram Sarkar
Mita Nasipuri
Publikationsdatum
18.05.2019
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 12/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-019-04235-4

Weitere Artikel der Ausgabe 12/2020

Neural Computing and Applications 12/2020 Zur Ausgabe

Hybrid Artificial Intelligence and Machine Learning Technologies

Improved prediction of daily pan evaporation using Deep-LSTM model

Premium Partner