
2015 | OriginalPaper | Chapter

Odia Running Text Recognition Using Moment-Based Feature Extraction and Mean Distance Classification Technique

Authors: Mamata Nayak, Ajit Kumar Nayak

Published in: Intelligent Computing, Communication and Devices

Publisher: Springer India


Abstract

Optical character recognition (OCR) is the automatic recognition of characters from optically scanned documents for the purposes of editing, indexing, and searching, as well as reducing storage space. Developing OCR for Indian scripts is an active area of research because the large number of letters in the alphabet, their sophisticated combinations, and the complicated graphemes they form pose a great challenge to an OCR designer. We are developing an OCR system for the Odia language, the official language of Odisha (formerly known as Orissa). In this paper, we attempt to recognize the vowels, consonants, matras, and compound characters of running Odia script. First, the scanned text is segmented into individual Odia symbols. Feature vectors are then extracted from each symbol using two-dimensional moments and the Hough transform (based on topological and geometrical properties), and these vectors are used to classify and recognize the symbol. We found that the proposed model can recognize up to 100 % of running text that contains no touching characters.
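The pipeline described in the abstract (moment features followed by mean distance classification) can be illustrated with a minimal sketch. This is not the authors' implementation: the full paper is not reproduced here, so the sketch assumes that "mean distance classification" means assigning a symbol to the class whose mean feature vector is nearest in Euclidean distance, it uses normalized central moments of order 2 and 3 as the two-dimensional moment features, and it leaves out the Hough transform features entirely. The names `moment_features`, `MeanDistanceClassifier`, and `make_bar` are hypothetical, and the toy bar images stand in for segmented Odia symbols.

```python
import numpy as np


def moment_features(glyph, max_order=3):
    """Normalized central moments (orders 2..max_order) of a binary glyph.

    Assumption: the glyph is a 2-D array with foreground pixels set to 1.
    Central moments are translation invariant; dividing by a power of the
    zeroth moment also makes them scale invariant.
    """
    img = np.asarray(glyph, dtype=float)
    m00 = img.sum()
    if m00 == 0:
        raise ValueError("glyph has no foreground pixels")
    ys, xs = np.mgrid[: img.shape[0], : img.shape[1]]
    x_bar = (xs * img).sum() / m00
    y_bar = (ys * img).sum() / m00
    feats = []
    for p in range(max_order + 1):
        for q in range(max_order + 1 - p):
            if p + q < 2:
                continue  # orders 0 and 1 carry no shape information
            mu = (((xs - x_bar) ** p) * ((ys - y_bar) ** q) * img).sum()
            feats.append(mu / m00 ** (1 + (p + q) / 2))
    return np.array(feats)


class MeanDistanceClassifier:
    """Nearest class mean classifier (an assumed reading of the paper's term)."""

    def fit(self, features, labels):
        features = np.asarray(features, dtype=float)
        labels = np.asarray(labels)
        self.classes_ = np.unique(labels)
        # One mean feature vector per class.
        self.means_ = np.stack(
            [features[labels == c].mean(axis=0) for c in self.classes_]
        )
        return self

    def predict(self, features):
        features = np.atleast_2d(np.asarray(features, dtype=float))
        # Euclidean distance from every sample to every class mean.
        dists = np.linalg.norm(
            features[:, None, :] - self.means_[None, :, :], axis=2
        )
        return self.classes_[dists.argmin(axis=1)]


def make_bar(vertical, shift):
    """Hypothetical toy glyph: a 12x2 bar on a 16x16 canvas, shifted by `shift`."""
    img = np.zeros((16, 16), dtype=int)
    if vertical:
        img[2:14, 4 + shift : 6 + shift] = 1
    else:
        img[4 + shift : 6 + shift, 2:14] = 1
    return img


if __name__ == "__main__":
    train = [make_bar(True, 0), make_bar(True, 2), make_bar(False, 0), make_bar(False, 2)]
    labels = ["V", "V", "H", "H"]
    clf = MeanDistanceClassifier().fit([moment_features(g) for g in train], labels)
    print(clf.predict([moment_features(make_bar(True, 5))]))
```

A nearest-mean rule needs only one stored vector per symbol class, which keeps classification cheap for a large symbol set, but it depends on the features separating the classes well; real Odia symbols would likely need the additional structural features the abstract mentions.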


Metadata
Title: Odia Running Text Recognition Using Moment-Based Feature Extraction and Mean Distance Classification Technique
Authors: Mamata Nayak, Ajit Kumar Nayak
Copyright Year: 2015
Publisher: Springer India
DOI: https://doi.org/10.1007/978-81-322-2009-1_56
