Skip to main content
Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) 1/2015

01.03.2015 | Original Paper

Handwritten Hangul recognition using deep convolutional neural networks

verfasst von: In-Jung Kim, Xiaohui Xie

Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) | Ausgabe 1/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In spite of the advances in recognition technology, handwritten Hangul recognition (HHR) remains largely unsolved due to the presence of many confusing characters and excessive cursiveness in Hangul handwritings. Even the best existing recognizers do not lead to satisfactory performance for practical applications and have much lower performance than those developed for Chinese or alphanumeric characters. To improve the performance of HHR, here we developed a new type of recognizers based on deep neural networks (DNNs). DNN has recently shown excellent performance in many pattern recognition and machine learning problems, but have not been attempted for HHR. We built our Hangul recognizers based on deep convolutional neural networks and proposed several novel techniques to improve the performance and training speed of the networks. We systematically evaluated the performance of our recognizers on two public Hangul image databases, SERI95a and PE92. Using our framework, we achieved a recognition rate of 95.96 % on SERI95a and 92.92 % on PE92. Compared with the previous best records of 93.71 % on SERI95a and 87.70 % on PE92, our results yielded improvements of 2.25 and 5.22 %, respectively. These improvements lead to error reduction rates of 35.71 % on SERI95a and 42.44 % on PE92, relative to the previous lowest error rates. Such improvement fills a significant portion of the large gap between practical requirement and the actual performance of Hangul recognizers.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
SERI95a is also known as KU-1.
 
Literatur
1.
Zurück zum Zitat Park, G.-R., Kim, I.-J., Liu, C.-L.: An evaluation of statistical methods in handwritten Hangul recognition. Int. J. Doc. Anal. Recognit. 16(3), 273–283 (2013)CrossRef Park, G.-R., Kim, I.-J., Liu, C.-L.: An evaluation of statistical methods in handwritten Hangul recognition. Int. J. Doc. Anal. Recognit. 16(3), 273–283 (2013)CrossRef
2.
Zurück zum Zitat Kang, K.-W., Kim, J.H.: Utilization of hierarchical, stochastic relationship modeling for Hangul character recognition. IEEE Trans. Pattern Anal. Mach. Intell. 26(9), 1185–1196 (2004)CrossRefMathSciNet Kang, K.-W., Kim, J.H.: Utilization of hierarchical, stochastic relationship modeling for Hangul character recognition. IEEE Trans. Pattern Anal. Mach. Intell. 26(9), 1185–1196 (2004)CrossRefMathSciNet
5.
Zurück zum Zitat Cireşan, D.C., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition (2012) Cireşan, D.C., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition (2012)
6.
Zurück zum Zitat Cireşan, D.C., Schmidhuber, J.: Multi-column Deep Neural Networks for Offline Handwritten Chinese Character Classification. IDSIA Technical Report No. IDSIA-05-13 (2013) Cireşan, D.C., Schmidhuber, J.: Multi-column Deep Neural Networks for Offline Handwritten Chinese Character Classification. IDSIA Technical Report No. IDSIA-05-13 (2013)
8.
Zurück zum Zitat Kim, H.Y., Kim, J.H.: Hierarchical random graph representation of handwritten characters and its application to Hangul recognition. Pattern Recognit. 34(2), 187–201 (2001)CrossRefMATH Kim, H.Y., Kim, J.H.: Hierarchical random graph representation of handwritten characters and its application to Hangul recognition. Pattern Recognit. 34(2), 187–201 (2001)CrossRefMATH
9.
Zurück zum Zitat Jang, S.I.: Post-Processing of Handwritten Hangul Recognition Using Pair-Wise Grapheme Discrimination, Master Thesis, KAIST (2002) Jang, S.I.: Post-Processing of Handwritten Hangul Recognition Using Pair-Wise Grapheme Discrimination, Master Thesis, KAIST (2002)
10.
Zurück zum Zitat Kim, D.-I., Lee, S.-W.: Automatic evaluation of handwriting qualities of handwritten hangul image database, KU-1. In: Proceedings of the 6th IWFHR, pp. 455–464, Taejon, Korea (1998) Kim, D.-I., Lee, S.-W.: Automatic evaluation of handwriting qualities of handwritten hangul image database, KU-1. In: Proceedings of the 6th IWFHR, pp. 455–464, Taejon, Korea (1998)
11.
Zurück zum Zitat Kim, D.H., Hwang, Y.S., Park, S.T., Kim, E.J., Paek, S.H., Bang, S.Y.: Handwritten Korean character image database PE92. In: Proceedings of the 2nd ICDAR, pp. 470–473 (1993) Kim, D.H., Hwang, Y.S., Park, S.T., Kim, E.J., Paek, S.H., Bang, S.Y.: Handwritten Korean character image database PE92. In: Proceedings of the 2nd ICDAR, pp. 470–473 (1993)
12.
Zurück zum Zitat Bae, H.J., Yun, J.M., Cha, E.Y.: Neural network for hand-written character recognition using dynamic bar method. In: Proceedings of the Korea Information Science Autumn Conference, vol. 17, issue 2, pp. 251–254 (1990) Bae, H.J., Yun, J.M., Cha, E.Y.: Neural network for hand-written character recognition using dynamic bar method. In: Proceedings of the Korea Information Science Autumn Conference, vol. 17, issue 2, pp. 251–254 (1990)
13.
Zurück zum Zitat Kim, M.W., Jang, J.S., Lim, C.D., Song, Y.S., Kim, J.H.: Improvements to a Hierarchical Interaction Neural Network for Context-Dependent Pattern Recognition and Its Experimentation with Handwritten Korean Character Recognition. Technical Report, Electronics and Telecommunication Research Institute, Taejon, Korea (1992) Kim, M.W., Jang, J.S., Lim, C.D., Song, Y.S., Kim, J.H.: Improvements to a Hierarchical Interaction Neural Network for Context-Dependent Pattern Recognition and Its Experimentation with Handwritten Korean Character Recognition. Technical Report, Electronics and Telecommunication Research Institute, Taejon, Korea (1992)
14.
Zurück zum Zitat Jeong, S.H.: Handwritten Hangul Recognition Based on Character Cluster Segmentation, Technical Memo. Electronics and Telecommunication Research Institute, Taejon, Korea (2002) Jeong, S.H.: Handwritten Hangul Recognition Based on Character Cluster Segmentation, Technical Memo. Electronics and Telecommunication Research Institute, Taejon, Korea (2002)
15.
Zurück zum Zitat Torralba, A., Fergus, R., Freeman, W.: 80 million tiny images: a large dataset for non-parametric object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 30(11), 1958–1970 (2008)CrossRef Torralba, A., Fergus, R., Freeman, W.: 80 million tiny images: a large dataset for non-parametric object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 30(11), 1958–1970 (2008)CrossRef
16.
Zurück zum Zitat Hinton, G.E., Osindero, S., Teh, Y.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)CrossRefMATHMathSciNet Hinton, G.E., Osindero, S., Teh, Y.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)CrossRefMATHMathSciNet
17.
Zurück zum Zitat Hinton, G.E., Dayan, P., Frey, B.J., Neal, R.: The wake-sleep algorithm for self-organizing neural networks. Science 268, 1158–1161 (1995)CrossRef Hinton, G.E., Dayan, P., Frey, B.J., Neal, R.: The wake-sleep algorithm for self-organizing neural networks. Science 268, 1158–1161 (1995)CrossRef
18.
Zurück zum Zitat Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.-A.: Extracting and composing robust features with denoising autoencoders. In: Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) Proceedings of the Twenty-Fifth International Conference on Machine Learning (ICML’08), pp. 1096–1103, ACM (2008) Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.-A.: Extracting and composing robust features with denoising autoencoders. In: Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) Proceedings of the Twenty-Fifth International Conference on Machine Learning (ICML’08), pp. 1096–1103, ACM (2008)
19.
Zurück zum Zitat Fukushima, K.: Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol. Cybern. 36(4), 193–202 (1980)CrossRefMATH Fukushima, K.: Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol. Cybern. 36(4), 193–202 (1980)CrossRefMATH
20.
Zurück zum Zitat LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef
21.
Zurück zum Zitat Simard, D., Steinkraus, P.Y., Platt, J.C.: Best practices for convolutional neural networks. In: Proceedings of the International Conference on Document Analysis and Recognition (ICDAR’03), p. 958. IEEE Computer Society, Washington, DC, USA (2003) Simard, D., Steinkraus, P.Y., Platt, J.C.: Best practices for convolutional neural networks. In: Proceedings of the International Conference on Document Analysis and Recognition (ICDAR’03), p. 958. IEEE Computer Society, Washington, DC, USA (2003)
22.
Zurück zum Zitat Boureau, Y., Ponce, J., LeCun, Y.: A theoretical analysis of feature pooling in vision algorithms. In: Proceedings of the International Conference on Machine Learning (ICML’10) (2010) Boureau, Y., Ponce, J., LeCun, Y.: A theoretical analysis of feature pooling in vision algorithms. In: Proceedings of the International Conference on Machine Learning (ICML’10) (2010)
24.
Zurück zum Zitat Ozkan, C., Erbek, F.: A comparison of activation functions for multispectral Landsat TM image classification. Photogramm. Eng. Remote Sens. 69(11), 1225–1234 (2003)CrossRef Ozkan, C., Erbek, F.: A comparison of activation functions for multispectral Landsat TM image classification. Photogramm. Eng. Remote Sens. 69(11), 1225–1234 (2003)CrossRef
26.
Zurück zum Zitat Nair, V., Hinton, G.: Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning (ICML-10) (2010) Nair, V., Hinton, G.: Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning (ICML-10) (2010)
27.
Zurück zum Zitat Glorot, X., Antoine, B., Bengio, Y.: Deep Sparse Rectifier Networks. In: Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. JMLR W&CP, vol. 15 (2011) Glorot, X., Antoine, B., Bengio, Y.: Deep Sparse Rectifier Networks. In: Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. JMLR W&CP, vol. 15 (2011)
28.
Zurück zum Zitat Goodhill, G., Barrow, H.: The role of weight normalization in competitive learning. Neural Comput. 6(2), 255–269 (1994)CrossRef Goodhill, G., Barrow, H.: The role of weight normalization in competitive learning. Neural Comput. 6(2), 255–269 (1994)CrossRef
29.
Zurück zum Zitat Liu, H., Ding, X.: Handwritten character recognition using gradient feature and quadratic classifier with multiple discrimination schemes. In: Proceedings. 8th International Conference on Document Analysis and Recognition, IEEE (2005) Liu, H., Ding, X.: Handwritten character recognition using gradient feature and quadratic classifier with multiple discrimination schemes. In: Proceedings. 8th International Conference on Document Analysis and Recognition, IEEE (2005)
30.
Zurück zum Zitat Cireşan, D., et al.: Deep Big Multilayer Perceptrons for Digit Recognition, Neural Networks: Tricks of the Trade, pp. 581--598. Springer, Berlin Heidelberg (2012) Cireşan, D., et al.: Deep Big Multilayer Perceptrons for Digit Recognition, Neural Networks: Tricks of the Trade, pp. 581--598. Springer, Berlin Heidelberg (2012)
32.
Zurück zum Zitat Everitt, B.: The Analysis of Contingency Tables, 2nd edn. Chapman and Hall/CRC, London (1992) Everitt, B.: The Analysis of Contingency Tables, 2nd edn. Chapman and Hall/CRC, London (1992)
Metadaten
Titel
Handwritten Hangul recognition using deep convolutional neural networks
verfasst von
In-Jung Kim
Xiaohui Xie
Publikationsdatum
01.03.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Document Analysis and Recognition (IJDAR) / Ausgabe 1/2015
Print ISSN: 1433-2833
Elektronische ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-014-0229-4

Weitere Artikel der Ausgabe 1/2015

International Journal on Document Analysis and Recognition (IJDAR) 1/2015 Zur Ausgabe