Skip to main content

2019 | OriginalPaper | Buchkapitel

A Study on the Effect of CNN-Based Transfer Learning on Handwritten Indic and Mixed Numeral Recognition

verfasst von : Rahul Pramanik, Prabhat Dansena, Soumen Bag

Erschienen in: Document Analysis and Recognition

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Filling up forms at post offices, railway counters, and for application of jobs has become a routine for modern people, especially in a developing country like India. Research on automation for the recognition of such handwritten forms has become mandatory. This applies more for a multilingual country like India. In the present work, we use readily available pre-trained Convolutional Neural Network (CNN) architectures on four different Indic scripts, viz. Bangla, Devanagari, Oriya, and Telugu to achieve a satisfactory recognition rate for handwritten Indic numerals. Furthermore, we have mixed Bangla and Oriya numerals and applied transfer learning for recognition. The main objective of this study is to realize how good a CNN model trained on an entire different dataset (of natural images) works for small and unrelated datasets. As a part of practical application, we have applied the proposed approach to recognize Bangla handwritten pin codes after their extraction from postal letters.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Mahalat, M.H., Mollah, A.F., Basu, S., Nasipuri, M.: Design of novel post-processing algorithms for handwritten Arabic numerals classification. Int. J. Appl. Pattern Recognit. 4(4), 342–357 (2017)CrossRef Mahalat, M.H., Mollah, A.F., Basu, S., Nasipuri, M.: Design of novel post-processing algorithms for handwritten Arabic numerals classification. Int. J. Appl. Pattern Recognit. 4(4), 342–357 (2017)CrossRef
2.
Zurück zum Zitat Prasad, B.K., Sanyal, G.: Novel features and a cascaded classifier based Arabic numerals recognition system. Multidimension. Syst. Signal Process. 29(1), 321–338 (2018)MathSciNetCrossRef Prasad, B.K., Sanyal, G.: Novel features and a cascaded classifier based Arabic numerals recognition system. Multidimension. Syst. Signal Process. 29(1), 321–338 (2018)MathSciNetCrossRef
3.
Zurück zum Zitat Zhang, X.Y., Bengio, Y., Liu, C.L.: Online and offline handwritten Chinese character recognition: a comprehensive study and new benchmark. Pattern Recognit. 61, 348–360 (2017)CrossRef Zhang, X.Y., Bengio, Y., Liu, C.L.: Online and offline handwritten Chinese character recognition: a comprehensive study and new benchmark. Pattern Recognit. 61, 348–360 (2017)CrossRef
4.
Zurück zum Zitat Niu, X.X., Suen, C.Y.: A novel hybrid CNN-SVM classifier for recognizing handwritten digits. Pattern Recognit. 45(4), 1318–1325 (2012)CrossRef Niu, X.X., Suen, C.Y.: A novel hybrid CNN-SVM classifier for recognizing handwritten digits. Pattern Recognit. 45(4), 1318–1325 (2012)CrossRef
5.
Zurück zum Zitat Ouchtati, S., Redjimi, M., Bedda, M.: Realization of an offline system for the recognition of the handwritten numeric chains. In: Proceedings of the Iberian Conference on Information Systems and Technologies, pp. 1–6 (2014) Ouchtati, S., Redjimi, M., Bedda, M.: Realization of an offline system for the recognition of the handwritten numeric chains. In: Proceedings of the Iberian Conference on Information Systems and Technologies, pp. 1–6 (2014)
6.
Zurück zum Zitat Chakraborty, D., Pramanik, R., Bag, S.: A novel approach towards segmentation of connected handwritten numerals. In: Proceedings of the International Conference on Image Information Processing, pp. 1–5 (2017) Chakraborty, D., Pramanik, R., Bag, S.: A novel approach towards segmentation of connected handwritten numerals. In: Proceedings of the International Conference on Image Information Processing, pp. 1–5 (2017)
7.
Zurück zum Zitat Singh, P.K., Sarkar, R., Nasipuri, M.: Offline script identification from multilingual Indic-script documents: a state-of-the-art. Comput. Sci. Rev. 15, 1–28 (2015)MathSciNetCrossRef Singh, P.K., Sarkar, R., Nasipuri, M.: Offline script identification from multilingual Indic-script documents: a state-of-the-art. Comput. Sci. Rev. 15, 1–28 (2015)MathSciNetCrossRef
8.
Zurück zum Zitat Pramanik, R., Bag, S.: Shape decomposition-based handwritten compound character recognition for Bangla OCR. J. Vis. Commun. Image Represent. 50, 123–134 (2018)CrossRef Pramanik, R., Bag, S.: Shape decomposition-based handwritten compound character recognition for Bangla OCR. J. Vis. Commun. Image Represent. 50, 123–134 (2018)CrossRef
9.
Zurück zum Zitat Khan, H.A., Al Helal, A., Ahmed, K.I.: Handwritten Bangla digit recognition using sparse representation classifier. In: Proceedings of the International Conference on Informatics, Electronics and Vision, pp. 1–6 (2014) Khan, H.A., Al Helal, A., Ahmed, K.I.: Handwritten Bangla digit recognition using sparse representation classifier. In: Proceedings of the International Conference on Informatics, Electronics and Vision, pp. 1–6 (2014)
10.
Zurück zum Zitat Hassan, T., Khan, H.A.: Handwritten Bangla numeral recognition using local binary pattern. In: Proceedings of the International Conference on Electrical Engineering and Information Communication Technology, pp. 1–4 (2015) Hassan, T., Khan, H.A.: Handwritten Bangla numeral recognition using local binary pattern. In: Proceedings of the International Conference on Electrical Engineering and Information Communication Technology, pp. 1–4 (2015)
11.
Zurück zum Zitat Sarkhel, R., Das, N., Saha, A.K., Nasipuri, M.: A multi-objective approach towards cost effective isolated handwritten Bangla character and digit recognition. Pattern Recognit. 58, 172–189 (2016)CrossRef Sarkhel, R., Das, N., Saha, A.K., Nasipuri, M.: A multi-objective approach towards cost effective isolated handwritten Bangla character and digit recognition. Pattern Recognit. 58, 172–189 (2016)CrossRef
12.
Zurück zum Zitat Singh, P., Verma, A., Chaudhari, N.S.: Feature selection based classifier combination approach for handwritten Devanagari numeral recognition. Sadhana 40(6), 1701–1714 (2015)MathSciNetCrossRef Singh, P., Verma, A., Chaudhari, N.S.: Feature selection based classifier combination approach for handwritten Devanagari numeral recognition. Sadhana 40(6), 1701–1714 (2015)MathSciNetCrossRef
13.
Zurück zum Zitat Prabhanjan, S., Dinesh, R.: Handwritten Devanagari numeral recognition by fusion of classifiers. Int. J. Signal Process. Image Process. Pattern Recognit. 8(7), 41–50 (2015) Prabhanjan, S., Dinesh, R.: Handwritten Devanagari numeral recognition by fusion of classifiers. Int. J. Signal Process. Image Process. Pattern Recognit. 8(7), 41–50 (2015)
14.
Zurück zum Zitat Roy, K., Pal, T., Pal, U., Kimura, F.: Oriya handwritten numeral recognition system. In: Proceedings of the International Conference on Document Analysis and Recognition, pp. 770–774 (2005) Roy, K., Pal, T., Pal, U., Kimura, F.: Oriya handwritten numeral recognition system. In: Proceedings of the International Conference on Document Analysis and Recognition, pp. 770–774 (2005)
15.
Zurück zum Zitat Bhowmik, T.K., Parui, S.K., Bhattacharya, U., Shaw, B.: An HMM based recognition scheme for handwritten Oriya numerals. In: Proceedings of the International Conference on Information Technology, pp. 105–110 (2006) Bhowmik, T.K., Parui, S.K., Bhattacharya, U., Shaw, B.: An HMM based recognition scheme for handwritten Oriya numerals. In: Proceedings of the International Conference on Information Technology, pp. 105–110 (2006)
16.
Zurück zum Zitat Shopon, M., Mohammed, N., Abedin, M.A.: Bangla handwritten digit recognition using autoencoder and deep convolutional neural network. In: Proceedings of the International Workshop on Computational Intelligence, pp. 64–68 (2016) Shopon, M., Mohammed, N., Abedin, M.A.: Bangla handwritten digit recognition using autoencoder and deep convolutional neural network. In: Proceedings of the International Workshop on Computational Intelligence, pp. 64–68 (2016)
17.
Zurück zum Zitat Alom, M.Z., Sidike, P., Taha, T.M., Asari, V.K.: Handwritten Bangla digit recognition using deep learning. arXiv preprint arXiv:1705.02680 (2017) Alom, M.Z., Sidike, P., Taha, T.M., Asari, V.K.: Handwritten Bangla digit recognition using deep learning. arXiv preprint arXiv:​1705.​02680 (2017)
18.
Zurück zum Zitat Bhattacharya, U., Chaudhuri, B.B.: Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans. Pattern Anal. Mach. Intell. 31(3), 444–457 (2009)CrossRef Bhattacharya, U., Chaudhuri, B.B.: Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans. Pattern Anal. Mach. Intell. 31(3), 444–457 (2009)CrossRef
19.
Zurück zum Zitat Singh, P.K., Sarkar, R., Nasipuri, M.: A study of moment based features on handwritten digit recognition. Appl. Comput. Intell. Soft Comput. 1–17 (2016)CrossRef Singh, P.K., Sarkar, R., Nasipuri, M.: A study of moment based features on handwritten digit recognition. Appl. Comput. Intell. Soft Comput. 1–17 (2016)CrossRef
20.
Zurück zum Zitat Maitra, D.S., Bhattacharya, U., Parui, S.K.: CNN based common approach to handwritten character recognition of multiple scripts. In: Proceedings of the International Conference on Document Analysis and Recognition, pp. 1021–1025 (2015) Maitra, D.S., Bhattacharya, U., Parui, S.K.: CNN based common approach to handwritten character recognition of multiple scripts. In: Proceedings of the International Conference on Document Analysis and Recognition, pp. 1021–1025 (2015)
21.
Zurück zum Zitat Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef
22.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
23.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556 (2014)
25.
Zurück zum Zitat Bhattacharya, U., Chaudhuri, B.B.: Databases for research on recognition of handwritten characters of Indian scripts. In: Proceedings of the International Conference on Document Analysis and Recognition, pp. 789–793 (2005) Bhattacharya, U., Chaudhuri, B.B.: Databases for research on recognition of handwritten characters of Indian scripts. In: Proceedings of the International Conference on Document Analysis and Recognition, pp. 789–793 (2005)
26.
Zurück zum Zitat Das, N., Sarkar, R., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: A genetic algorithm based region sampling for selection of local features in handwritten digit recognition application. Appl. Soft Comput. 12(5), 1592–1606 (2012)CrossRef Das, N., Sarkar, R., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: A genetic algorithm based region sampling for selection of local features in handwritten digit recognition application. Appl. Soft Comput. 12(5), 1592–1606 (2012)CrossRef
27.
Zurück zum Zitat Das, N., Reddy, J.M., Sarkar, R., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: A statistical-topological feature combination for recognition of handwritten numerals. Appl. Soft Comput. 12(8), 2486–2495 (2012)CrossRef Das, N., Reddy, J.M., Sarkar, R., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: A statistical-topological feature combination for recognition of handwritten numerals. Appl. Soft Comput. 12(8), 2486–2495 (2012)CrossRef
28.
Zurück zum Zitat Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: A novel framework for automatic sorting of postal documents with multi-script address blocks. Pattern Recognit. 43(10), 3507–3521 (2010)CrossRef Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: A novel framework for automatic sorting of postal documents with multi-script address blocks. Pattern Recognit. 43(10), 3507–3521 (2010)CrossRef
Metadaten
Titel
A Study on the Effect of CNN-Based Transfer Learning on Handwritten Indic and Mixed Numeral Recognition
verfasst von
Rahul Pramanik
Prabhat Dansena
Soumen Bag
Copyright-Jahr
2019
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-9361-7_4