Skip to main content
Top

2015 | OriginalPaper | Chapter

A Neural Approach to Cursive Handwritten Character Recognition Using Features Extracted from Binarization Technique

Authors : Amit Choudhary, Savita Ahlawat, Rahul Rishi

Published in: Complex System Modelling and Control Through Intelligent Soft Computations

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The feature extraction is one of the most crucial steps for an Optical Character Recognition (OCR) System. The efficiency and accuracy of the OCR System, in recognizing the off-line printed characters, mainly depends on the selection of feature extraction technique and the classification algorithm employed. This chapter focuses on the recognition of handwritten characters of Roman Script by using features which are obtained by using binarization technique. The goal of binarization is to minimize the unwanted information present in the image while protecting the useful information. Various preprocessing techniques such as thinning, foreground and background noise removal, cropping and size normalization etc. are also employed to preprocess the character images before their classification. A multi-layered feed forward neural network is proposed for classification of handwritten character images. The difference between the desired and actual output is calculated for each cycle and the weights are adjusted during error back-propagation. This process continues till the network converges to the allowable or acceptable error. This method involves the back propagation-learning rule based on the principle of gradient descent along the error surface in the negative direction. Very promising results are achieved when binarization features and the multilayer feed forward neural network classifier is used to recognize the off-line cursive handwritten characters.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Aburas, A. A., & Rehiel, S. A. (2008). New promising off-line tool for Arabic handwritten character recognition based on JPEG2000 image compression. In Proceedings of the 3rd International Conference on Introduction and Communication Technology—from Theory to Applications (ICTTA) (pp. 1–5), April 7–11, 2008. doi:10.1109/ICTTA.2008.4530087. Aburas, A. A., & Rehiel, S. A. (2008). New promising off-line tool for Arabic handwritten character recognition based on JPEG2000 image compression. In Proceedings of the 3rd International Conference on Introduction and Communication Technology—from Theory to Applications (ICTTA) (pp. 1–5), April 7–11, 2008. doi:10.​1109/​ICTTA.​2008.​4530087.
go back to reference Alginahi, Y. (2010). Preprocessing techniques in character recognition. Character Recognition. In M. Mori (Ed.) (pp. 1–20). In Techopen Publishers. ISBN 978-953-307-105-3. doi:10.5772/9776. Alginahi, Y. (2010). Preprocessing techniques in character recognition. Character Recognition. In M. Mori (Ed.) (pp. 1–20). In Techopen Publishers. ISBN 978-953-307-105-3. doi:10.​5772/​9776.
go back to reference Banashree, N. P., Andhre, D., Vasanta, R., & Satyanarayana, P. S. (2007). OCR for script identification of Hindi (Devanagari) numerals using error diffusion Halftoning Algorithm with neural classifier. International Journal of Computer, Information Science and Engineering, 1(2), 281–285. Banashree, N. P., Andhre, D., Vasanta, R., & Satyanarayana, P. S. (2007). OCR for script identification of Hindi (Devanagari) numerals using error diffusion Halftoning Algorithm with neural classifier. International Journal of Computer, Information Science and Engineering, 1(2), 281–285.
go back to reference Bernsen, J. (1986). Dynamic thresholding of grey-level images. In Proceedings of 8th International Conference on Pattern Recognition (pp. 1251–1255), Paris, France. Bernsen, J. (1986). Dynamic thresholding of grey-level images. In Proceedings of 8th International Conference on Pattern Recognition (pp. 1251–1255), Paris, France.
go back to reference Bharath, A. & Madhvanath, S. (2008). FreePad: A novel handwriting-based text input for pen and touch interfaces. In Proceedings of the 13th International Conference on Intelligent User Interfaces (pp. 297–300), New York, NY, USA. doi:10.1145/1378773.1378814. Bharath, A. & Madhvanath, S. (2008). FreePad: A novel handwriting-based text input for pen and touch interfaces. In Proceedings of the 13th International Conference on Intelligent User Interfaces (pp. 297–300), New York, NY, USA. doi:10.​1145/​1378773.​1378814.
go back to reference Blumenstein, M., Verma, B. & Basli, H. (2003). A novel feature extraction technique for the recognition of segmented handwritten characters. In Proceedings of the 7th International Conference on Document Analysis and Recognition (Vol. 1, pp. 137–141). Edinburgh, UK: IEEE Computer Society Press. doi:10.1109/ICDAR.2003.1227647. Blumenstein, M., Verma, B. & Basli, H. (2003). A novel feature extraction technique for the recognition of segmented handwritten characters. In Proceedings of the 7th International Conference on Document Analysis and Recognition (Vol. 1, pp. 137–141). Edinburgh, UK: IEEE Computer Society Press. doi:10.​1109/​ICDAR.​2003.​1227647.
go back to reference Cavalin, P. R., Britto, A. S., Bortolozzi, F., Sabourin, R. & Oliveira, L. S. (2006). An implicit segmentation based method for recognition of handwritten strings of characters. In Proceedings of ACM Symposium on Applied Computing (SAC) (pp. 836–840), New York, NY, USA. doi:10.1145/1141277.1141468. Cavalin, P. R., Britto, A. S., Bortolozzi, F., Sabourin, R. & Oliveira, L. S. (2006). An implicit segmentation based method for recognition of handwritten strings of characters. In Proceedings of ACM Symposium on Applied Computing (SAC) (pp. 836–840), New York, NY, USA. doi:10.​1145/​1141277.​1141468.
go back to reference Cheng, H. D., Chen, J. R., & Li, J. (1998). Threshold selection based on fuzzy c-partition entropy approach. Pattern Recognition, 31(7), 857–870.CrossRef Cheng, H. D., Chen, J. R., & Li, J. (1998). Threshold selection based on fuzzy c-partition entropy approach. Pattern Recognition, 31(7), 857–870.CrossRef
go back to reference Chiang, J.-H. (1998). A hybrid neural network model in handwritten word recognition. Neural Networks, 11(2), 337–346.CrossRef Chiang, J.-H. (1998). A hybrid neural network model in handwritten word recognition. Neural Networks, 11(2), 337–346.CrossRef
go back to reference Davies, E. (2005). Machine vision—Theory algorithms practicalities (3rd ed.). San Francisco, CA, USA: Morgan Kaufmann Publishers. ISBN 13: 978-0-12-206093-9. Davies, E. (2005). Machine vision—Theory algorithms practicalities (3rd ed.). San Francisco, CA, USA: Morgan Kaufmann Publishers. ISBN 13: 978-0-12-206093-9.
go back to reference Farooq, F., Bhardwaj, A., Cao, H. & Govindaraju, V. (2008). Topic based language models for OCR correction. In Proceedings of the 2nd Workshop on Analytics for Noisy Unstructured Text Data (pp. 107–112), New York, NY, USA. doi:10.1145/1390749.1390766 Farooq, F., Bhardwaj, A., Cao, H. & Govindaraju, V. (2008). Topic based language models for OCR correction. In Proceedings of the 2nd Workshop on Analytics for Noisy Unstructured Text Data (pp. 107–112), New York, NY, USA. doi:10.​1145/​1390749.​1390766
go back to reference Gatos, B., Pratikakis, I. & Perantonis, S. J. (2006a). Hybrid off-line cursive handwriting word recognition. In Proceedings of 18th International Conference on Pattern Recognition (ICPR’06) (Vol. 2, pp. 998–1002), Hong Kong. doi:10.1109/ICPR.2006.644. Gatos, B., Pratikakis, I. & Perantonis, S. J. (2006a). Hybrid off-line cursive handwriting word recognition. In Proceedings of 18th International Conference on Pattern Recognition (ICPR’06) (Vol. 2, pp. 998–1002), Hong Kong. doi:10.​1109/​ICPR.​2006.​644.
go back to reference Gatos, B., Pratikakis, I., Kesidis, A. L. & Perantonis, S. J. (2006b). Efficient off-line cursive handwriting word recognition. In Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition. October 2006, La Baule. Gatos, B., Pratikakis, I., Kesidis, A. L. & Perantonis, S. J. (2006b). Efficient off-line cursive handwriting word recognition. In Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition. October 2006, La Baule.
go back to reference Guillevic, D., Suen, C. Y. (1998). HMM-KNN word recognition engine for bank check processing. In Proceedings of International Conference on Pattern Recognition, Brisbane. Washington DC, USA: IEEE Computer Society, pp. 1526–1529. DOI: 10.1109/ICPR.1998.711998. Guillevic, D., Suen, C. Y. (1998). HMM-KNN word recognition engine for bank check processing. In Proceedings of International Conference on Pattern Recognition, Brisbane. Washington DC, USA: IEEE Computer Society, pp. 1526–1529. DOI: 10.​1109/​ICPR.​1998.​711998.
go back to reference Günter, S., & Bunke, H. (2004). Feature selection algorithms for the generation of multiple classifier systems and their application to handwritten word recognition. Pattern Recognition Letters, 25(11), 1323–1336.CrossRef Günter, S., & Bunke, H. (2004). Feature selection algorithms for the generation of multiple classifier systems and their application to handwritten word recognition. Pattern Recognition Letters, 25(11), 1323–1336.CrossRef
go back to reference Günter, S., & Bunke, H. (2005). Off-line cursive handwriting recognition using multiple classifier systems—On the influence of vocabulary, ensemble, and training set size. Optics and Lasers in Engineering, 43(3–5), 437–454.CrossRef Günter, S., & Bunke, H. (2005). Off-line cursive handwriting recognition using multiple classifier systems—On the influence of vocabulary, ensemble, and training set size. Optics and Lasers in Engineering, 43(3–5), 437–454.CrossRef
go back to reference Kapur, J. N., Sahoo, P. K., & Wong, A. K. C. (1985). A New method for gray-level picture threshold using the entropy of the histogram. Computer Vision, Graphics, and Image Processing, 29, 273–285.CrossRef Kapur, J. N., Sahoo, P. K., & Wong, A. K. C. (1985). A New method for gray-level picture threshold using the entropy of the histogram. Computer Vision, Graphics, and Image Processing, 29, 273–285.CrossRef
go back to reference Kim, J. H., Kim, K. K., & Suen, C. Y. (2000). An HMM-MLP hybrid model for cursive script recognition. Pattern Analysis and Application, 3, 314–324.CrossRefMATHMathSciNet Kim, J. H., Kim, K. K., & Suen, C. Y. (2000). An HMM-MLP hybrid model for cursive script recognition. Pattern Analysis and Application, 3, 314–324.CrossRefMATHMathSciNet
go back to reference Koch, G, Paquet, T., Heutte, L. (2004). Combination of contextual information for handwritten word recognition. In Proceedings of 9th International Workshop on Frontiers in Handwriting Recognition, Kokubunji (pp. 468–473). doi:10.1109/IWFHR.2004.27. Koch, G, Paquet, T., Heutte, L. (2004). Combination of contextual information for handwritten word recognition. In Proceedings of 9th International Workshop on Frontiers in Handwriting Recognition, Kokubunji (pp. 468–473). doi:10.​1109/​IWFHR.​2004.​27.
go back to reference Kundu, Y. H., & Chen, M. (2002). Alternatives to variable duration HMM in handwriting recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(11), 1275–1280. doi:10.1109/34.730561.CrossRef Kundu, Y. H., & Chen, M. (2002). Alternatives to variable duration HMM in handwriting recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(11), 1275–1280. doi:10.​1109/​34.​730561.CrossRef
go back to reference Li, C. H., & Lee, C. K. (1993). Minimum cross entropy thresholding. Pattern Recognition, 26(4), 617–625.CrossRef Li, C. H., & Lee, C. K. (1993). Minimum cross entropy thresholding. Pattern Recognition, 26(4), 617–625.CrossRef
go back to reference Niblack, W. (1986). An introduction to digital image processing. Englewood Cliffs: Prentice Hall. Niblack, W. (1986). An introduction to digital image processing. Englewood Cliffs: Prentice Hall.
go back to reference O’Gorman, L., Sammon, M., & Seul, M. (2008). Practical algorithms for image analysis. New York, NY, USA: Cambridge University Press. ISBN 978-0 = 521-88411-2.MATH O’Gorman, L., Sammon, M., & Seul, M. (2008). Practical algorithms for image analysis. New York, NY, USA: Cambridge University Press. ISBN 978-0 = 521-88411-2.MATH
go back to reference Oliveira, L. S., Sabourin, R., Bortolozzi, F., & Suen, C. Y. (2002). Automatic recognition of handwritten numerical strings: A recognition and verification strategy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(11), 1438–1454.CrossRef Oliveira, L. S., Sabourin, R., Bortolozzi, F., & Suen, C. Y. (2002). Automatic recognition of handwritten numerical strings: A recognition and verification strategy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(11), 1438–1454.CrossRef
go back to reference Otsu, N. (1979). A threshold selection method from gray level histogram. IEEE Transaction on System, Man, Cybernetics, 9(1), 62–66.CrossRefMathSciNet Otsu, N. (1979). A threshold selection method from gray level histogram. IEEE Transaction on System, Man, Cybernetics, 9(1), 62–66.CrossRefMathSciNet
go back to reference Rajashekararadhya, S. V., & Ranjan, P. V. (2009). Efficient zone based feature extraction algorithm for handwritten numeral recognition of four popular south Indian scripts. Journal of Theoretical and Applied Information Technology, 4(12), 1171–1180. Rajashekararadhya, S. V., & Ranjan, P. V. (2009). Efficient zone based feature extraction algorithm for handwritten numeral recognition of four popular south Indian scripts. Journal of Theoretical and Applied Information Technology, 4(12), 1171–1180.
go back to reference Russ, J. (2007). The image processing handbook (5th ed.). Boca Raton, FL, USA: CRC Press. ISBN 0849372542.MATH Russ, J. (2007). The image processing handbook (5th ed.). Boca Raton, FL, USA: CRC Press. ISBN 0849372542.MATH
go back to reference Saba, T., Sulong, G., & Rehman, A. (2011). Document image analysis: Issues, comparison of methods and remaining problems. Artificial Intelligence Review, 35(2), 101–118.CrossRef Saba, T., Sulong, G., & Rehman, A. (2011). Document image analysis: Issues, comparison of methods and remaining problems. Artificial Intelligence Review, 35(2), 101–118.CrossRef
go back to reference Sadri, J., Cheriet, M. (2009). A new approach for skew correction of documents based on particle swarm optimization. In Proceedings of 10th International Conference on Document Analysis and Recognition ICDAR ’09 (pp. 1066–1070). IEEE Computer Society, Washington, DC, USA .doi:10.1109/ICDAR.2009.268. Sadri, J., Cheriet, M. (2009). A new approach for skew correction of documents based on particle swarm optimization. In Proceedings of 10th International Conference on Document Analysis and Recognition ICDAR ’09 (pp. 1066–1070). IEEE Computer Society, Washington, DC, USA .doi:10.​1109/​ICDAR.​2009.​268.
go back to reference Sarfraz, M., Rasheed, Z. (2008). Skew estimation and correction of text using bounding box. In Proceedings of 5th International Conference on Computer Graphics, Imaging and Visualization (CGIV ‘08) (pp. 259–264). Washington, DC, USA: IEEE Computer Society. doi: 10.1109/CGIV.2008.10. Sarfraz, M., Rasheed, Z. (2008). Skew estimation and correction of text using bounding box. In Proceedings of 5th International Conference on Computer Graphics, Imaging and Visualization (CGIV ‘08) (pp. 259–264). Washington, DC, USA: IEEE Computer Society. doi: 10.​1109/​CGIV.​2008.​10.
go back to reference Sauvola, J., Seppänen. T., Haapakoski, S. & Pietikänen, M. (1997). Adaptive document binarization. In Fourth International Conference Document Analysis and Recognition (ICDAR) (pp. 147–152), Ulm, Germany. Sauvola, J., Seppänen. T., Haapakoski, S. & Pietikänen, M. (1997). Adaptive document binarization. In Fourth International Conference Document Analysis and Recognition (ICDAR) (pp. 147–152), Ulm, Germany.
go back to reference Sivanandam, S. N., & Deepa, S. N. (2008). Principals of soft computing (pp. 71–83). New Delhi, India: Wiley-India. ISBN 978812652741. Sivanandam, S. N., & Deepa, S. N. (2008). Principals of soft computing (pp. 71–83). New Delhi, India: Wiley-India. ISBN 978812652741.
go back to reference Tomoyuki, H., Takuma, A. & Bunpei, I. (2007). An analytic word recognition algorithm using a posteriori probability. In Proceedings of the 9th International Conference on Document Analysis and Recognition (Vol. 2, pp. 669–673), September 23–26, 2007, Tokyo. doi:10.1109/ICDAR.2007.4376999. Tomoyuki, H., Takuma, A. & Bunpei, I. (2007). An analytic word recognition algorithm using a posteriori probability. In Proceedings of the 9th International Conference on Document Analysis and Recognition (Vol. 2, pp. 669–673), September 23–26, 2007, Tokyo. doi:10.​1109/​ICDAR.​2007.​4376999.
go back to reference Verma, B., Blumenstein, M. (2008). Pattern recognition technologies and applications: Recent advances pp. 1–16. Hershey, New York: Information Science Reference (An Imprint of IGI Global Publications). Verma, B., Blumenstein, M. (2008). Pattern recognition technologies and applications: Recent advances pp. 1–16. Hershey, New York: Information Science Reference (An Imprint of IGI Global Publications).
go back to reference Verma, B., Blumenstein, M., & Ghosh, M. (2004). A novel approach for structural feature extraction: Contour vs direction. Pattern Recognition Letter, 25(9), 975–988.CrossRef Verma, B., Blumenstein, M., & Ghosh, M. (2004). A novel approach for structural feature extraction: Contour vs direction. Pattern Recognition Letter, 25(9), 975–988.CrossRef
go back to reference Wang, X., Ding, X., & Liu, C. (2005). Gabor filters-based feature extraction for character recognition. Pattern Recognition, 38(3), 369–379.CrossRefMATHMathSciNet Wang, X., Ding, X., & Liu, C. (2005). Gabor filters-based feature extraction for character recognition. Pattern Recognition, 38(3), 369–379.CrossRefMATHMathSciNet
Metadata
Title
A Neural Approach to Cursive Handwritten Character Recognition Using Features Extracted from Binarization Technique
Authors
Amit Choudhary
Savita Ahlawat
Rahul Rishi
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-12883-2_26

Premium Partner