
2017 | Original Paper | Book Chapter

8. Optical Character Recognition Systems for Hindi Language

Written by: Arindam Chaudhuri, Krupa Mandaviya, Pratixa Badelia, Soumya K. Ghosh

Published in: Optical Character Recognition Systems for Different Languages with Soft Computing

Publisher: Springer International Publishing


Abstract

Optical character recognition (OCR) systems for the Hindi language were the most primitive ones and occupy a significant place in pattern recognition. Hindi OCR systems have been used successfully in a wide array of commercial applications. The challenges involved in OCR systems for the Hindi language are investigated in this chapter. Pre-processing activities such as binarization, noise removal, skew detection, character segmentation and thinning are performed on the datasets considered. Feature extraction is performed through the fuzzy Hough transform. Feature-based classification is performed through important soft computing techniques, viz. rough fuzzy multilayer perceptron (RFMLP), fuzzy support vector machine (FSVM), fuzzy rough support vector machine (FRSVM) and fuzzy Markov random fields (FMRF). The superiority of soft computing techniques is demonstrated through the experimental results.
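
The first two pre-processing steps named in the abstract, binarization and noise removal, can be illustrated with a minimal sketch. The abstract does not say which algorithms the chapter uses, so the choices here are assumptions made only for illustration: Otsu's threshold for binarization and an isolated-pixel filter for noise removal. The function names and the tiny test page are hypothetical.

```python
def otsu_threshold(pixels):
    """Return the grey level (0-255) that maximises between-class variance."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w_bg = 0    # number of pixels at or below the candidate threshold
    sum_bg = 0  # grey-level sum of those pixels
    for t in range(256):
        w_bg += hist[t]
        if w_bg == 0:
            continue
        w_fg = total - w_bg
        if w_fg == 0:
            break
        sum_bg += t * hist[t]
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image):
    """Map a grey image (list of rows) to 1 = ink, 0 = paper.

    Assumes dark ink on light paper, so pixels at or below the
    threshold count as ink.
    """
    threshold = otsu_threshold([p for row in image for p in row])
    return [[1 if p <= threshold else 0 for p in row] for row in image]


def remove_isolated_pixels(binary):
    """Clear ink pixels that have no ink among their 8 neighbours."""
    h, w = len(binary), len(binary[0])
    out = [row[:] for row in binary]
    for y in range(h):
        for x in range(w):
            if not binary[y][x]:
                continue
            neighbours = sum(
                binary[ny][nx]
                for ny in range(max(0, y - 1), min(h, y + 2))
                for nx in range(max(0, x - 1), min(w, x + 2))
                if (ny, nx) != (y, x)
            )
            if neighbours == 0:
                out[y][x] = 0
    return out


# Hypothetical 5x6 page: light paper, one short stroke, one noise speck.
page = [[230] * 6 for _ in range(5)]
for x in range(1, 5):
    page[2][x] = 20
page[0][5] = 25

cleaned = remove_isolated_pixels(binarize(page))
```

In this toy page the four stroke pixels survive and the single speck is cleared. Skew detection, character segmentation and thinning would follow in the pipeline the abstract describes, and are not sketched here.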

Metadata
Title
Optical Character Recognition Systems for Hindi Language
Written by
Arindam Chaudhuri
Krupa Mandaviya
Pratixa Badelia
Soumya K. Ghosh
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-50252-6_8