Skip to main content
Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) 4/2018

29.08.2018 | Original Paper

Building efficient CNN architecture for offline handwritten Chinese character recognition

verfasst von: Zhiyuan Li, Nanjun Teng, Min Jin, Huaxiang Lu

Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) | Ausgabe 4/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deep convolutional neural networks-based methods have brought great breakthrough in image classification, which provides an end-to-end solution for handwritten Chinese character recognition (HCCR) problem through learning discriminative features automatically. Nevertheless, state-of-the-art CNNs appear to incur huge computational cost and require the storage of a large number of parameters especially in fully connected layers, which is difficult to deploy such networks into alternative hardware devices with limited computation capacity. To solve the storage problem, we propose a novel technique called weighted average pooling for reducing the parameters in fully connected layer without loss in accuracy. Besides, we implement a cascaded model in single CNN by adding mid output to complete recognition as early as possible, which reduces average inference time significantly. Experiments are performed on the ICDAR-2013 offline HCCR dataset. It is found that our proposed approach only needs 6.9 ms for classifying a character image on average and achieves the state-of-the-art accuracy of 97.1% while requires only 3.3 MB for storage.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Kimura, F., Takashina, K., Tsuruoka, S., Miyake, Y.: Modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans. Pattern Anal. Mach. Intell. 9(1), 149–153 (1987)CrossRef Kimura, F., Takashina, K., Tsuruoka, S., Miyake, Y.: Modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans. Pattern Anal. Mach. Intell. 9(1), 149–153 (1987)CrossRef
2.
Zurück zum Zitat Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., Keutzer, K.: Squeezenet: Alexnet-level accuracy with 50x fewer parameters and \(<0.5\) mb model size. In: Computer Vision and Pattern Recognition (2016). arXiv:1602.07360 Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., Keutzer, K.: Squeezenet: Alexnet-level accuracy with 50x fewer parameters and \(<0.5\) mb model size. In: Computer Vision and Pattern Recognition (2016). arXiv:​1602.​07360
3.
Zurück zum Zitat Liu, C., Yin, F., Wang, D., Wang, Q.: Online and offline handwritten Chinese character recognition: benchmarking on new databases. Pattern Recognit. 46(1), 155–162 (2013)CrossRef Liu, C., Yin, F., Wang, D., Wang, Q.: Online and offline handwritten Chinese character recognition: benchmarking on new databases. Pattern Recognit. 46(1), 155–162 (2013)CrossRef
4.
Zurück zum Zitat Yong, G., Qiang, H., Zhidan, F.: Chinese character recognition: history, status, and prospects. In: Proceedings of the 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, USA. IEEE (2002) Yong, G., Qiang, H., Zhidan, F.: Chinese character recognition: history, status, and prospects. In: Proceedings of the 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, USA. IEEE (2002)
5.
Zurück zum Zitat Liu, C.: Normalization-cooperated gradient feature extraction for handwritten character recognition. IEEE Trans. Pattern Anal. Mach. Intell. 29(8), 1465–1469 (2007)CrossRef Liu, C.: Normalization-cooperated gradient feature extraction for handwritten character recognition. IEEE Trans. Pattern Anal. Mach. Intell. 29(8), 1465–1469 (2007)CrossRef
6.
Zurück zum Zitat Mangasarian, O.L., Musicant, D.R.: Data discrimination via nonlinear generalized support vector machines. In: Ferris, M.C., Mangasarian, O.L., Pang, J.S. (eds.) Complementarity: Applications, Algorithms and Extensions, pp. 233–251. Springer, Boston (2001)CrossRef Mangasarian, O.L., Musicant, D.R.: Data discrimination via nonlinear generalized support vector machines. In: Ferris, M.C., Mangasarian, O.L., Pang, J.S. (eds.) Complementarity: Applications, Algorithms and Extensions, pp. 233–251. Springer, Boston (2001)CrossRef
7.
Zurück zum Zitat Liu, C., Sako, H., Fujisawa, H.: Discriminative learning quadratic discriminant function for handwriting recognition. IEEE Trans. Neural Netw. 15(2), 430–444 (2004)CrossRef Liu, C., Sako, H., Fujisawa, H.: Discriminative learning quadratic discriminant function for handwriting recognition. IEEE Trans. Neural Netw. 15(2), 430–444 (2004)CrossRef
8.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Neural. Inf. Proc. Syst. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Neural. Inf. Proc. Syst. 1097–1105 (2012)
9.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)
10.
Zurück zum Zitat Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.E., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Computer Vision and Pattern Recognition, pp. 1–9 (2015) Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.E., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Computer Vision and Pattern Recognition, pp. 1–9 (2015)
11.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Computer Vision and Pattern Recognition, pp. 770–778 (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Computer Vision and Pattern Recognition, pp. 770–778 (2016)
12.
Zurück zum Zitat Ciregan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: Computer Vision and Pattern Recognition, pp. 3642–3649 (2012) Ciregan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: Computer Vision and Pattern Recognition, pp. 3642–3649 (2012)
13.
Zurück zum Zitat Ciresan, D.C., Meier, U.: Multi-column deep neural networks for offline handwritten Chinese character classification. In: International Symposium on Neural Networks, pp. 1–6 (2015) Ciresan, D.C., Meier, U.: Multi-column deep neural networks for offline handwritten Chinese character classification. In: International Symposium on Neural Networks, pp. 1–6 (2015)
14.
Zurück zum Zitat Yin, F., Wang, Q., Zhang, X., Liu, C.: ICDAR 2013 Chinese handwriting recognition competition. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), pp. 1095–1101 (2013) Yin, F., Wang, Q., Zhang, X., Liu, C.: ICDAR 2013 Chinese handwriting recognition competition. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), pp. 1095–1101 (2013)
15.
Zurück zum Zitat Wu, C., Fan, W., He, Y., Sun, J., Naoi, S.: Handwritten character recognition by alternately trained relaxation convolutional neural network. Int. Conf. Front. Handwriting. Recogn. 291–296 (2014) Wu, C., Fan, W., He, Y., Sun, J., Naoi, S.: Handwritten character recognition by alternately trained relaxation convolutional neural network. Int. Conf. Front. Handwriting. Recogn. 291–296 (2014)
16.
Zurück zum Zitat Zhong, Z., Jin, L., Xie, Z.: High performance offline handwritten Chinese character recognition using googlenet and directional feature maps. In: International Conference on Document Analysis and Recognition, pp. 846–850 (2015) Zhong, Z., Jin, L., Xie, Z.: High performance offline handwritten Chinese character recognition using googlenet and directional feature maps. In: International Conference on Document Analysis and Recognition, pp. 846–850 (2015)
17.
Zurück zum Zitat Zhang, X., Bengio, Y., Liu, C.: Online and offline handwritten Chinese character recognition: a comprehensive study and new benchmark. Pattern Recognit. 61, 348–360 (2017)CrossRef Zhang, X., Bengio, Y., Liu, C.: Online and offline handwritten Chinese character recognition: a comprehensive study and new benchmark. Pattern Recognit. 61, 348–360 (2017)CrossRef
18.
Zurück zum Zitat Chen, W., Wilson, J.T., Tyree, S., Weinberger, K.Q., Chen, Y.: Compressing neural networks with the hashing trick. In: International Conference on Machine Learning, pp. 2285–2294 (2015) Chen, W., Wilson, J.T., Tyree, S., Weinberger, K.Q., Chen, Y.: Compressing neural networks with the hashing trick. In: International Conference on Machine Learning, pp. 2285–2294 (2015)
19.
Zurück zum Zitat Xue, J., Li, J., Gong, Y.: Restructuring of deep neural network acoustic models with singular value decomposition. Conf. Int. Speech. Commun. Assoc. 2365–2369 (2013) Xue, J., Li, J., Gong, Y.: Restructuring of deep neural network acoustic models with singular value decomposition. Conf. Int. Speech. Commun. Assoc. 2365–2369 (2013)
20.
Zurück zum Zitat Jaderberg, M., Vedaldi, A., Zisserman, A.: Speeding up convolutional neural networks with low rank expansions. In: British Machine Vision Conference (2014) Jaderberg, M., Vedaldi, A., Zisserman, A.: Speeding up convolutional neural networks with low rank expansions. In: British Machine Vision Conference (2014)
21.
Zurück zum Zitat Lebedev, V., Ganin, Y., Rakhuba, M., Oseledets, I.V., Lempitsky, V.S.: Speeding-up convolutional neural networks using fine-tuned cp-decomposition. In: International Conference on Learning Representations (2015) Lebedev, V., Ganin, Y., Rakhuba, M., Oseledets, I.V., Lempitsky, V.S.: Speeding-up convolutional neural networks using fine-tuned cp-decomposition. In: International Conference on Learning Representations (2015)
22.
Zurück zum Zitat Han, S., Mao, H., Dally, W.J.: Deep compression: compressing deep neural networks with pruning, trained quantization and huffman coding. In: International Conference on Learning Representations (2016) Han, S., Mao, H., Dally, W.J.: Deep compression: compressing deep neural networks with pruning, trained quantization and huffman coding. In: International Conference on Learning Representations (2016)
23.
Zurück zum Zitat Li, H., Kadav, A., Durdanovic, I., Samet, H., Graf, H.P.: Pruning filters for efficient convnets. In: Computer Vision and Pattern Recognition (2016). arXiv:1608.08710 Li, H., Kadav, A., Durdanovic, I., Samet, H., Graf, H.P.: Pruning filters for efficient convnets. In: Computer Vision and Pattern Recognition (2016). arXiv:​1608.​08710
24.
Zurück zum Zitat He, Y., Zhang, X., Sun, J.: Channel pruning for accelerating very deep neural networks. In: The IEEE International Conference on Computer Vision (2017) He, Y., Zhang, X., Sun, J.: Channel pruning for accelerating very deep neural networks. In: The IEEE International Conference on Computer Vision (2017)
25.
Zurück zum Zitat Lin, M., Chen, Q., Yan, S.: Network in network. In: International Conference on Learning Representations (2014) Lin, M., Chen, Q., Yan, S.: Network in network. In: International Conference on Learning Representations (2014)
26.
Zurück zum Zitat Andrew, H.G., Menglong, Z., Bo, C., Dmitry, K., Weijun, W., Tobias, W., Marco, A., Adam, H.: Mobilenets: efficient convolutional neural networks for mobile vision applications. In: Computer Vision and Pattern Recognition (2017). arXiv:1704.04861 Andrew, H.G., Menglong, Z., Bo, C., Dmitry, K., Weijun, W., Tobias, W., Marco, A., Adam, H.: Mobilenets: efficient convolutional neural networks for mobile vision applications. In: Computer Vision and Pattern Recognition (2017). arXiv:​1704.​04861
27.
Zurück zum Zitat Xiangyu, Z., Xinyu, Z., Mengxiao, L., Jian, S.: Shufflenet: an extremely efficient convolutional neural network for mobile devices. In: Computer Vision and Pattern Recognition (2017). arXiv:1707.01083 Xiangyu, Z., Xinyu, Z., Mengxiao, L., Jian, S.: Shufflenet: an extremely efficient convolutional neural network for mobile devices. In: Computer Vision and Pattern Recognition (2017). arXiv:​1707.​01083
28.
Zurück zum Zitat Courbariaux, M., Hubara, I., Soudry, D., Elyaniv, R., Bengio, Y.: Binarized neural networks: training deep neural networks with weights and activations constrained to \(+1\) or \(-1\). In: Learning (2016). arXiv:1602.02830 Courbariaux, M., Hubara, I., Soudry, D., Elyaniv, R., Bengio, Y.: Binarized neural networks: training deep neural networks with weights and activations constrained to \(+1\) or \(-1\). In: Learning (2016). arXiv:​1602.​02830
29.
Zurück zum Zitat Rastegari, M., Ordonez, V., Redmon, J., Farhadi, A.: Xnor-net: imagenet classification using binary convolutional neural networks. In: European Conference on Computer Vision, pp. 525–542 (2016)CrossRef Rastegari, M., Ordonez, V., Redmon, J., Farhadi, A.: Xnor-net: imagenet classification using binary convolutional neural networks. In: European Conference on Computer Vision, pp. 525–542 (2016)CrossRef
30.
Zurück zum Zitat Xiao, X., Jin, L., Yang, Y., Yang, W., Sun, J., Chang, T.: Building fast and compact convolutional neural networks for offline handwritten Chinese character recognition. Pattern Recognit. 72, 72–81 (2017)CrossRef Xiao, X., Jin, L., Yang, Y., Yang, W., Sun, J., Chang, T.: Building fast and compact convolutional neural networks for offline handwritten Chinese character recognition. Pattern Recognit. 72, 72–81 (2017)CrossRef
31.
Zurück zum Zitat Liu, C., Yin, F., Wang, D., Wang, Q.: CASIA online and offline Chinese handwriting databases. Int. Conf. Doc. Anal. Recogn. 37–41 (2011) Liu, C., Yin, F., Wang, D., Wang, Q.: CASIA online and offline Chinese handwriting databases. Int. Conf. Doc. Anal. Recogn. 37–41 (2011)
32.
Zurück zum Zitat Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., et al.: Tensorflow: a system for large-scale machine learning. In: Operating Systems Design and Implementation, pp. 265–283 (2016) Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., et al.: Tensorflow: a system for large-scale machine learning. In: Operating Systems Design and Implementation, pp. 265–283 (2016)
Metadaten
Titel
Building efficient CNN architecture for offline handwritten Chinese character recognition
verfasst von
Zhiyuan Li
Nanjun Teng
Min Jin
Huaxiang Lu
Publikationsdatum
29.08.2018
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Document Analysis and Recognition (IJDAR) / Ausgabe 4/2018
Print ISSN: 1433-2833
Elektronische ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-018-0311-4

Weitere Artikel der Ausgabe 4/2018

International Journal on Document Analysis and Recognition (IJDAR) 4/2018 Zur Ausgabe