Published in: International Journal on Document Analysis and Recognition (IJDAR) 3/2014

01-09-2014 | Original Paper

Learning confidence transformation for handwritten Chinese text recognition

Authors: Da-Han Wang, Cheng-Lin Liu

Abstract

Handwritten text recognition systems commonly combine character classification confidence scores with context models to evaluate candidate segmentation-recognition paths, and the classification confidence is usually optimized at the character level. In this paper, we investigate different confidence-learning methods for handwritten Chinese text recognition and propose a string-level confidence-learning method, which estimates confidence parameters by directly optimizing character string recognition performance. We first compare the performance of parametric (class-dependent and class-independent parameters) and nonparametric (isotonic regression) confidence-learning methods. We then propose two regularized confidence estimation methods and, in particular, a string-level confidence-learning method under the minimum classification error criterion. In experiments on online handwritten Chinese text recognition, the string-level confidence-learning method effectively improves string recognition performance. Using three character classifiers, the character correct rates improve from 92.39, 90.24 and 88.69% to 92.76, 90.91 and 89.93%, respectively.
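
The abstract contrasts parametric and nonparametric confidence learning without giving the functional forms, so the following is only an illustrative sketch under stated assumptions: a two-parameter sigmoid stands in for a class-independent parametric transform, and the pool-adjacent-violators procedure stands in for isotonic regression. The function names (`sigmoid_confidence`, `isotonic_fit`) and the parameters `alpha` and `beta` are hypothetical and are not taken from the paper.

```python
import math


def sigmoid_confidence(score, alpha, beta):
    """Assumed parametric form: map a raw classifier score into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-(alpha * score + beta)))


def isotonic_fit(scores, labels):
    """Fit a non-decreasing step function to 0/1 labels ordered by score.

    Uses pool-adjacent-violators: neighbouring blocks whose means are out
    of order are merged until the block means are non-decreasing.
    Returns (sorted_scores, fitted_confidences).
    """
    pairs = sorted(zip(scores, labels))
    blocks = []  # each block holds [sum_of_labels, sample_count]
    for _, y in pairs:
        blocks.append([float(y), 1])
        # Merge backwards while the previous block mean exceeds the last one.
        while (len(blocks) > 1
               and blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            total, count = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
    fitted = []
    for total, count in blocks:
        fitted.extend([total / count] * count)
    return [s for s, _ in pairs], fitted


# Toy data: label 1 means the classifier's top candidate was correct.
xs, conf = isotonic_fit([1.0, 2.0, 3.0, 4.0], [0, 1, 0, 1])
```

On the toy data, the two middle samples violate monotonicity and are pooled into one block with a shared confidence. The sigmoid has only two parameters to estimate, whereas the isotonic fit makes no shape assumption beyond monotonicity; this sketch does not cover the string-level training the paper proposes.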

Metadata
Title
Learning confidence transformation for handwritten Chinese text recognition
Authors
Da-Han Wang
Cheng-Lin Liu
Publication date
01-09-2014
Publisher
Springer Berlin Heidelberg
Published in
International Journal on Document Analysis and Recognition (IJDAR) / Issue 3/2014
Print ISSN: 1433-2833
Electronic ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-013-0214-3
