Skip to main content
Top
Published in: Soft Computing 11/2020

30-05-2019 | Focus

A high-performance CNN method for offline handwritten Chinese character recognition and visualization

Authors: Pavlo Melnyk, Zhiqiang You, Keqin Li

Published in: Soft Computing | Issue 11/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Recent researches introduced fast, compact and efficient convolutional neural networks (CNNs) for offline handwritten Chinese character recognition (HCCR). However, many of them did not address the problem of network interpretability. We propose a new architecture of a deep CNN with high recognition performance which is capable of learning deep features for visualization. A special characteristic of our model is the bottleneck layers which enable us to retain its expressiveness while reducing the number of multiply-accumulate operations and the required storage. We introduce a modification of global weighted average pooling (GWAP)—global weighted output average pooling (GWOAP). This paper demonstrates how they allow us to calculate class activation maps (CAMs) in order to indicate the most relevant input character image regions used by our CNN to identify a certain class. Evaluating on the ICDAR-2013 offline HCCR competition dataset, we show that our model enables a relative 0.83% error reduction while having 49% fewer parameters and the same computational cost compared to the current state-of-the-art single-network method trained only on handwritten data. Our solution outperforms even recent residual learning approaches.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
go back to reference Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M et al (2016) Tensorflow: a system for large-scale machine learning. OSDI 16:265–283 Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M et al (2016) Tensorflow: a system for large-scale machine learning. OSDI 16:265–283
go back to reference Al-Janabi S, Alkaim AF (2019) A nifty collaborative analysis to predicting a novel tool (DRFLLS) for missing values estimation. Soft Comput 1–15 Al-Janabi S, Alkaim AF (2019) A nifty collaborative analysis to predicting a novel tool (DRFLLS) for missing values estimation. Soft Comput 1–15
go back to reference Al-Janabi S, Salman M, Fanfakh A (2018a) Recommendation system to improve time management for people in education environments. J Eng Appl Sci 13:10182–10193 Al-Janabi S, Salman M, Fanfakh A (2018a) Recommendation system to improve time management for people in education environments. J Eng Appl Sci 13:10182–10193
go back to reference Al-Janabi S, Salman MA, Mohammad M (2018b) Multi-level network construction based on intelligent big data analysis. In: International conference on bigdata and smart digital environment. Springer, pp 102–118 Al-Janabi S, Salman MA, Mohammad M (2018b) Multi-level network construction based on intelligent big data analysis. In: International conference on bigdata and smart digital environment. Springer, pp 102–118
go back to reference Ali SH (2012) A novel tool (FP-KC) for handle the three main dimensions reduction and association rule mining. In: 2012 6th International conference on sciences of electronics. Technologies of information and telecommunications (SETIT). IEEE, pp 951–961 Ali SH (2012) A novel tool (FP-KC) for handle the three main dimensions reduction and association rule mining. In: 2012 6th International conference on sciences of electronics. Technologies of information and telecommunications (SETIT). IEEE, pp 951–961
go back to reference Arqub OA, Mohammed AS, Momani S, Hayat T (2016) Numerical solutions of fuzzy differential equations using reproducing kernel Hilbert space method. Soft Comput 20(8):3283–3302CrossRef Arqub OA, Mohammed AS, Momani S, Hayat T (2016) Numerical solutions of fuzzy differential equations using reproducing kernel Hilbert space method. Soft Comput 20(8):3283–3302CrossRef
go back to reference Arqub OA, Al-Smadi M, Momani S, Hayat T (2017) Application of reproducing kernel algorithm for solving second-order, two-point fuzzy boundary value problems. Soft Comput 21(23):7191–7206CrossRef Arqub OA, Al-Smadi M, Momani S, Hayat T (2017) Application of reproducing kernel algorithm for solving second-order, two-point fuzzy boundary value problems. Soft Comput 21(23):7191–7206CrossRef
go back to reference Chen TQ, Rubanova Y, Bettencourt J, Duvenaud DK (2018) Neural ordinary differential equations. In: Advances in neural information processing systems, pp 6572–6583 Chen TQ, Rubanova Y, Bettencourt J, Duvenaud DK (2018) Neural ordinary differential equations. In: Advances in neural information processing systems, pp 6572–6583
go back to reference Cheng C, Zhang XY, Shao XH, Zhou XD (2016) Handwritten Chinese character recognition by joint classification and similarity ranking. In: 2016 15th International conference on frontiers in handwriting recognition (ICFHR). IEEE, pp 507–511 Cheng C, Zhang XY, Shao XH, Zhou XD (2016) Handwritten Chinese character recognition by joint classification and similarity ranking. In: 2016 15th International conference on frontiers in handwriting recognition (ICFHR). IEEE, pp 507–511
go back to reference Cireşan D, Meier U, Schmidhuber J (2012) Multi-column deep neural networks for image classification. arXiv preprint arXiv:1202.2745 Cireşan D, Meier U, Schmidhuber J (2012) Multi-column deep neural networks for image classification. arXiv preprint arXiv:​1202.​2745
go back to reference He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, pp 1026–1034 He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, pp 1026–1034
go back to reference He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
go back to reference Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167 Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:​1502.​03167
go back to reference Kimura F, Takashina K, Tsuruoka S, Miyake Y (1987) Modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans Pattern Anal Mach Intell 1:149–153CrossRef Kimura F, Takashina K, Tsuruoka S, Miyake Y (1987) Modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans Pattern Anal Mach Intell 1:149–153CrossRef
go back to reference Li F, Shen Q, Li Y, Mac Parthaláin N (2016) Handwritten Chinese character recognition using fuzzy image alignment. Soft Comput 20(8):2939–2949CrossRef Li F, Shen Q, Li Y, Mac Parthaláin N (2016) Handwritten Chinese character recognition using fuzzy image alignment. Soft Comput 20(8):2939–2949CrossRef
go back to reference Li Z, Teng N, Jin M, Lu H (2018) Building efficient CNN architecture for offline handwritten Chinese character recognition. Int J Doc Anal Recognit (IJDAR) 21(4):233–240CrossRef Li Z, Teng N, Jin M, Lu H (2018) Building efficient CNN architecture for offline handwritten Chinese character recognition. Int J Doc Anal Recognit (IJDAR) 21(4):233–240CrossRef
go back to reference Liu CL, Yin F, Wang DH, Wang QF (2011) CASIA online and offline Chinese handwriting databases. In: 2011 International conference on document analysis and recognition (ICDAR). IEEE, pp 37–41 Liu CL, Yin F, Wang DH, Wang QF (2011) CASIA online and offline Chinese handwriting databases. In: 2011 International conference on document analysis and recognition (ICDAR). IEEE, pp 37–41
go back to reference Liu CL, Yin F, Wang DH, Wang QF (2013) Online and offline handwritten Chinese character recognition: benchmarking on new databases. Pattern Recognit 46(1):155–162CrossRef Liu CL, Yin F, Wang DH, Wang QF (2013) Online and offline handwritten Chinese character recognition: benchmarking on new databases. Pattern Recognit 46(1):155–162CrossRef
go back to reference Lu S, Wei X, Lu Y (2015) Cost-sensitive MQDF classifier for handwritten Chinese address recognition. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 76–80 Lu S, Wei X, Lu Y (2015) Cost-sensitive MQDF classifier for handwritten Chinese address recognition. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 76–80
go back to reference Qin Z, Yu F, Liu C, Chen X (2018) How convolutional neural networks see the world—a survey of convolutional neural network visualization methods. Math Found Comput 1(2):149–180CrossRef Qin Z, Yu F, Liu C, Chen X (2018) How convolutional neural networks see the world—a survey of convolutional neural network visualization methods. Math Found Comput 1(2):149–180CrossRef
go back to reference Saravanan B, Mohanraj V, Senthilkumar J (2019) A fuzzy entropy technique for dimensionality reduction in recommender systems using deep learning. Soft Comput 23(8):2575–2583CrossRef Saravanan B, Mohanraj V, Senthilkumar J (2019) A fuzzy entropy technique for dimensionality reduction in recommender systems using deep learning. Soft Comput 23(8):2575–2583CrossRef
go back to reference Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, Hassabis D (2016) Mastering the game of Go with deep neural networks and tree search. Nature 529:484–503CrossRef Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, Hassabis D (2016) Mastering the game of Go with deep neural networks and tree search. Nature 529:484–503CrossRef
go back to reference Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958MathSciNetMATH Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958MathSciNetMATH
go back to reference Xiao X, Jin L, Yang Y, Yang W, Sun J, Chang T (2017) Building fast and compact convolutional neural networks for offline handwritten Chinese character recognition. Pattern Recognit 72:72–81CrossRef Xiao X, Jin L, Yang Y, Yang W, Sun J, Chang T (2017) Building fast and compact convolutional neural networks for offline handwritten Chinese character recognition. Pattern Recognit 72:72–81CrossRef
go back to reference Yang X, He D, Zhou Z, Kifer D, Giles CL (2017) Improving offline handwritten Chinese character recognition by iterative refinement. In: 2017 14th IAPR international conference on document analysis and recognition (ICDAR). IEEE, pp 5–10 Yang X, He D, Zhou Z, Kifer D, Giles CL (2017) Improving offline handwritten Chinese character recognition by iterative refinement. In: 2017 14th IAPR international conference on document analysis and recognition (ICDAR). IEEE, pp 5–10
go back to reference Yin F, Wang QF, Zhang XY, Liu CL (2013) ICDAR 2013 Chinese handwriting recognition competition. In: 2013 12th International conference on document analysis and recognition (ICDAR). IEEE, pp 1464–1470 Yin F, Wang QF, Zhang XY, Liu CL (2013) ICDAR 2013 Chinese handwriting recognition competition. In: 2013 12th International conference on document analysis and recognition (ICDAR). IEEE, pp 1464–1470
go back to reference Zhang Y (2015) Deep convolutional network for handwritten Chinese character recognition. Computer Science Department, Stanford University Zhang Y (2015) Deep convolutional network for handwritten Chinese character recognition. Computer Science Department, Stanford University
go back to reference Zhang XY, Bengio Y, Liu CL (2017) Online and offline handwritten Chinese character recognition: a comprehensive study and new benchmark. Pattern Recognit 61:348–360CrossRef Zhang XY, Bengio Y, Liu CL (2017) Online and offline handwritten Chinese character recognition: a comprehensive study and new benchmark. Pattern Recognit 61:348–360CrossRef
go back to reference Zhang Y, Liang S, Nie S, Liu W, Peng S (2018) Robust offline handwritten character recognition through exploring writer-independent features under the guidance of printed data. Pattern Recognit Lett 106:20–26CrossRef Zhang Y, Liang S, Nie S, Liu W, Peng S (2018) Robust offline handwritten character recognition through exploring writer-independent features under the guidance of printed data. Pattern Recognit Lett 106:20–26CrossRef
go back to reference Zhong Z, Jin L, Xie Z (2015) High performance offline handwritten Chinese character recognition using googlenet and directional feature maps. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 846–850 Zhong Z, Jin L, Xie Z (2015) High performance offline handwritten Chinese character recognition using googlenet and directional feature maps. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 846–850
go back to reference Zhong Z, Zhang XY, Yin F, Liu CL (2016) Handwritten Chinese character recognition with spatial transformer and deep residual networks. In: 2016 23rd International conference on pattern recognition (ICPR). IEEE, pp 3440–3445 Zhong Z, Zhang XY, Yin F, Liu CL (2016) Handwritten Chinese character recognition with spatial transformer and deep residual networks. In: 2016 23rd International conference on pattern recognition (ICPR). IEEE, pp 3440–3445
go back to reference Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba A (2016) Learning deep features for discriminative localization. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2921–2929 Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba A (2016) Learning deep features for discriminative localization. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2921–2929
Metadata
Title
A high-performance CNN method for offline handwritten Chinese character recognition and visualization
Authors
Pavlo Melnyk
Zhiqiang You
Keqin Li
Publication date
30-05-2019
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 11/2020
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-04083-3

Other articles of this Issue 11/2020

Soft Computing 11/2020 Go to the issue

Premium Partner