Skip to main content
Top
Published in: International Journal on Document Analysis and Recognition (IJDAR) 2/2023

27-10-2022 | Original Paper

Retrieval-based language model adaptation for handwritten Chinese text recognition

Authors: Shuying Hu, Qiufeng Wang, Kaizhu Huang, Min Wen, Frans Coenen

Published in: International Journal on Document Analysis and Recognition (IJDAR) | Issue 2/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In handwritten text recognition, compared to human, computers are far short of linguistic context knowledge, especially domain-matched knowledge. In this paper, we present a novel retrieval-based method to obtain an adaptive language model for offline recognition of unconstrained handwritten Chinese texts. The content of handwritten texts to be recognized is varied and usually unknown a priori. Therefore we adopt a two-pass recognition strategy. In the first pass, we utilize a common language model to obtain initial recognition results, which are used to retrieve the related contents from Internet. In the content retrieval, we evaluate different types of semantic representation from BERT output and the traditional TF–IDF representation. Then, we dynamically generate an adaptive language model from these related contents, which will consequently be combined with the common language model and applied in the second-pass recognition. We evaluate the proposed method on two benchmark unconstrained handwriting datasets, namely CASIA-HWDB and ICDAR-2013. Experimental results show that the proposed retrieval-based language model adaptation yields improvements in recognition performance, despite the reduced Internet contents hereby employed.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Nagy, G.: Disruptive developments in document recognition. Pattern Recogn. Lett. 79, 106–112 (2016)CrossRef Nagy, G.: Disruptive developments in document recognition. Pattern Recogn. Lett. 79, 106–112 (2016)CrossRef
2.
go back to reference Fujisawa, H.: Forty years of research in character and document recognition–an industrial perspective. Pattern Recogn. 41(8), 2435–2446 (2008)CrossRef Fujisawa, H.: Forty years of research in character and document recognition–an industrial perspective. Pattern Recogn. 41(8), 2435–2446 (2008)CrossRef
3.
go back to reference Dai, R.-W., Liu, C.-L., Xiao, B.-H.: Chinese character recognition: history, status and prospects. Front. Comput. Sci. China 1(2), 126–136 (2007)CrossRef Dai, R.-W., Liu, C.-L., Xiao, B.-H.: Chinese character recognition: history, status and prospects. Front. Comput. Sci. China 1(2), 126–136 (2007)CrossRef
4.
go back to reference Liu, C.-L., Lu, Y. (eds.): Advances in Chinese Document and Text Processing, book in Series on Language Processing, Pattern Recognition, and Intelligent Systems, vol. 2. World Scientific (2017) Liu, C.-L., Lu, Y. (eds.): Advances in Chinese Document and Text Processing, book in Series on Language Processing, Pattern Recognition, and Intelligent Systems, vol. 2. World Scientific (2017)
5.
go back to reference Liu, C.-L., Yin, F., Wang, Q.-F., Wang, D.-H.: ICDAR 2011 Chinese Handwriting Recognition Competition. Proc. ICDAR, pp.1464–1469 (2011) Liu, C.-L., Yin, F., Wang, Q.-F., Wang, D.-H.: ICDAR 2011 Chinese Handwriting Recognition Competition. Proc. ICDAR, pp.1464–1469 (2011)
6.
go back to reference Yin, F., Wang, Q.-F., Zhang, X.-Y., Liu, C.-L.: ICDAR 2013 Chinese Handwriting Recognition Competition. Proc. ICDAR, pp. 1464–1470 (2013) Yin, F., Wang, Q.-F., Zhang, X.-Y., Liu, C.-L.: ICDAR 2013 Chinese Handwriting Recognition Competition. Proc. ICDAR, pp. 1464–1470 (2013)
7.
go back to reference Cheng, C., Zhang, X.Y., Shao, X.H., Zhou, X.D.: Handwritten Chinese Character Recognition by Joint Classification and Similarity Ranking. Proc. Int’l Conf. on Frontiers in Handwriting Recognition (ICFHR), pp. 507-511 (2016) Cheng, C., Zhang, X.Y., Shao, X.H., Zhou, X.D.: Handwritten Chinese Character Recognition by Joint Classification and Similarity Ranking. Proc. Int’l Conf. on Frontiers in Handwriting Recognition (ICFHR), pp. 507-511 (2016)
8.
go back to reference Zhang, X.-Y., Bengio, Y., Liu, C.-L.: Online and offline handwritten Chinese character recognition: a comprehensive study and new benchmark. Pattern Recogn. 61, 348–360 (2017)CrossRef Zhang, X.-Y., Bengio, Y., Liu, C.-L.: Online and offline handwritten Chinese character recognition: a comprehensive study and new benchmark. Pattern Recogn. 61, 348–360 (2017)CrossRef
9.
go back to reference Wang, Q.-F., Yin, F., Liu, C.-L.: Handwritten Chinese text recognition by integrating multiple contexts. IEEE Trans. Pattern Anal. Mach. Intell. (PAMI) 34(8), 1469–1481 (2012)CrossRef Wang, Q.-F., Yin, F., Liu, C.-L.: Handwritten Chinese text recognition by integrating multiple contexts. IEEE Trans. Pattern Anal. Mach. Intell. (PAMI) 34(8), 1469–1481 (2012)CrossRef
10.
go back to reference Wu, Y.-C., Yin, F., Liu, C.-L.: Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models. Pattern Recogn. 65, 251–264 (2017)CrossRef Wu, Y.-C., Yin, F., Liu, C.-L.: Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models. Pattern Recogn. 65, 251–264 (2017)CrossRef
11.
go back to reference Wang, Q.-F., Cambria, E., Liu, C.-L., Hussain, A.: Common sense knowledge for handwritten Chinese text recognition. Cogn. Comput. 5(2), 234–242 (2013)CrossRef Wang, Q.-F., Cambria, E., Liu, C.-L., Hussain, A.: Common sense knowledge for handwritten Chinese text recognition. Cogn. Comput. 5(2), 234–242 (2013)CrossRef
12.
go back to reference Wang, Q.-F., Yin, F., Liu, C.-L.: Unsupervised language model adaptation for handwritten Chinese text recognition. Pattern Recogn. 47(3), 1202–1216 (2014)CrossRef Wang, Q.-F., Yin, F., Liu, C.-L.: Unsupervised language model adaptation for handwritten Chinese text recognition. Pattern Recogn. 47(3), 1202–1216 (2014)CrossRef
13.
go back to reference Li, Y.X., Tan, C.L., Ding, X.Q.: A hybrid postprocessing system for offline handwritten Chinese Script recognition. Pattern Anal. Appl. 8, 272–286 (2005)MathSciNetCrossRef Li, Y.X., Tan, C.L., Ding, X.Q.: A hybrid postprocessing system for offline handwritten Chinese Script recognition. Pattern Anal. Appl. 8, 272–286 (2005)MathSciNetCrossRef
14.
go back to reference Xu, R.F., Yeung, D.S., Shi, D.M.: A hybrid postprocessing system for offline handwritten Chinese character recognition based on a statistical language model. Int. J. Pattern Recognit. Artif. Intell. 19(3), 415–428 (2005)CrossRef Xu, R.F., Yeung, D.S., Shi, D.M.: A hybrid postprocessing system for offline handwritten Chinese character recognition based on a statistical language model. Int. J. Pattern Recognit. Artif. Intell. 19(3), 415–428 (2005)CrossRef
15.
go back to reference Wang, Q.-F., Yin, F., Liu, C.-L.: Integrating language model in handwriting Chinese text recognition. Proc. 10th ICDAR, pp. 1036-1040 (2009) Wang, Q.-F., Yin, F., Liu, C.-L.: Integrating language model in handwriting Chinese text recognition. Proc. 10th ICDAR, pp. 1036-1040 (2009)
16.
go back to reference Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka, E.R. Jr., Mitchell, T.M.: Toward an Architecture for Never-Ending Language Learning. In Proceedings of the Conference on Artificial Intelligence (AAAI) (2010) Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka, E.R. Jr., Mitchell, T.M.: Toward an Architecture for Never-Ending Language Learning. In Proceedings of the Conference on Artificial Intelligence (AAAI) (2010)
17.
go back to reference Mitchell, T., Cohen, W., Hruschka, E., et al.: Never-ending learning. In Proceedings of the Conference on Artificial Intelligence (AAAI) (2015) Mitchell, T., Cohen, W., Hruschka, E., et al.: Never-ending learning. In Proceedings of the Conference on Artificial Intelligence (AAAI) (2015)
18.
go back to reference Fergus, R., Fei-Fei, L., Perona, P., et al.: Learning object categories from Google’s image search. Tenth IEEE International Conference on Computer Vision. IEEE, pp. 1816-1823 (2005) Fergus, R., Fei-Fei, L., Perona, P., et al.: Learning object categories from Google’s image search. Tenth IEEE International Conference on Computer Vision. IEEE, pp. 1816-1823 (2005)
19.
go back to reference Nishizaki, H., Sekiguchi, Y.: Word Error Correction of Continuous Speech Recognition Using WEB Documents for Spoken Document Indexing. In: Matsumoto, Y., Sproat, R.W., Wong, K.F., Zhang, M. (eds.) Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, ICCPOL (2006) Nishizaki, H., Sekiguchi, Y.: Word Error Correction of Continuous Speech Recognition Using WEB Documents for Spoken Document Indexing. In: Matsumoto, Y., Sproat, R.W., Wong, K.F., Zhang, M. (eds.) Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, ICCPOL (2006)
20.
go back to reference Oertel, C., O’Shea, S., Bodnar, A., Blostein, D.: Using the web to validate document recognition results: experiments with business cards. Proc. SPIE, Document Recognition and Retrieval XII (2005) Oertel, C., O’Shea, S., Bodnar, A., Blostein, D.: Using the web to validate document recognition results: experiments with business cards. Proc. SPIE, Document Recognition and Retrieval XII (2005)
21.
go back to reference Donoser, M., Bischof, H., Wagner, S.: Using web search engines to improve text recognition. International Conference on Pattern Recognition, pp. 1–4 (2008) Donoser, M., Bischof, H., Wagner, S.: Using web search engines to improve text recognition. International Conference on Pattern Recognition, pp. 1–4 (2008)
22.
go back to reference Donoser, M., Wagner, S., Bischof, H.: Context information from search engines for document recognition. Pattern Recogn. Lett. 31, 750–754 (2010)CrossRef Donoser, M., Wagner, S., Bischof, H.: Context information from search engines for document recognition. Pattern Recogn. Lett. 31, 750–754 (2010)CrossRef
23.
go back to reference Bellegarda, J.R.: Statistical language model adaptation: review and perspectives. Speech Commun. 42(1), 93–108 (2004)CrossRef Bellegarda, J.R.: Statistical language model adaptation: review and perspectives. Speech Commun. 42(1), 93–108 (2004)CrossRef
24.
go back to reference Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805 (2019) Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv:​1810.​04805 (2019)
25.
go back to reference Russell, B.C., Torralba, A., Murphy, K.P., et al.: LabelMe: a database and web-based tool for image annotation. Int. J. Comput. Vision 77(1–3), 157–173 (2008)CrossRef Russell, B.C., Torralba, A., Murphy, K.P., et al.: LabelMe: a database and web-based tool for image annotation. Int. J. Comput. Vision 77(1–3), 157–173 (2008)CrossRef
26.
go back to reference Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. IEEE Computer Vision and Pattern Recognition (CVPR) (2009) Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. IEEE Computer Vision and Pattern Recognition (CVPR) (2009)
27.
go back to reference Li, F.-F. ImageNet: crowdsourcing, benchmarking & other cool things, CMU VASC Seminar, March (2010) Li, F.-F. ImageNet: crowdsourcing, benchmarking & other cool things, CMU VASC Seminar, March (2010)
28.
go back to reference Zhuo, H.H., Yang, Q., Pan, R., Li, L.: Cross-domain action-model acquisition for planning via web search. Twenty-First International Conference on Automated Planning and Scheduling (2011) Zhuo, H.H., Yang, Q., Pan, R., Li, L.: Cross-domain action-model acquisition for planning via web search. Twenty-First International Conference on Automated Planning and Scheduling (2011)
29.
go back to reference Chen, L., Lamel, L., Gauvain, J.L., et al.: Dynamic language modeling for broadcast news. International Conference on Spoken Language Processing (2004) Chen, L., Lamel, L., Gauvain, J.L., et al.: Dynamic language modeling for broadcast news. International Conference on Spoken Language Processing (2004)
30.
go back to reference Whitelaw, C., Hutchinson, B., Chung, G.Y., et al.: Using the web for language independent spellchecking and autocorrection. Conference on Empirical Methods in Natural Language Processing: Volume. Association for Computational Linguistics, pp. 890-899 (2009) Whitelaw, C., Hutchinson, B., Chung, G.Y., et al.: Using the web for language independent spellchecking and autocorrection. Conference on Empirical Methods in Natural Language Processing: Volume. Association for Computational Linguistics, pp. 890-899 (2009)
31.
go back to reference Bassil, Y., Alwani, M.: OCR post-processing error correction algorithm using Google’s online spelling suggestion. Emerg. Trends Comput. Inf. Sci. 3(1), 90–99 (2012) Bassil, Y., Alwani, M.: OCR post-processing error correction algorithm using Google’s online spelling suggestion. Emerg. Trends Comput. Inf. Sci. 3(1), 90–99 (2012)
32.
go back to reference Oprean, C., Likforman-Sulem, L., Popescu, A., et al.: Using the Web to Create Dynamic Dictionaries in Handwritten Out-of-Vocabulary Word Recognition. International Conference on Document Analysis and Recognition, pp. 989-993 (2013) Oprean, C., Likforman-Sulem, L., Popescu, A., et al.: Using the Web to Create Dynamic Dictionaries in Handwritten Out-of-Vocabulary Word Recognition. International Conference on Document Analysis and Recognition, pp. 989-993 (2013)
33.
go back to reference Oprean, C., Popescu, A., Popescu, A., et al.: Handwritten word recognition using Web resources and recurrent neural networks. IJDAR 18(4), 287–301 (2015)CrossRef Oprean, C., Popescu, A., Popescu, A., et al.: Handwritten word recognition using Web resources and recurrent neural networks. IJDAR 18(4), 287–301 (2015)CrossRef
34.
go back to reference Oprean, C., Likformansulem, L., Mokbel, C., et al.: BLSTM-based handwritten text recognition using Web resources. International Conference on Document Analysis and Recognition, pp. 466-470 (2015) Oprean, C., Likformansulem, L., Mokbel, C., et al.: BLSTM-based handwritten text recognition using Web resources. International Conference on Document Analysis and Recognition, pp. 466-470 (2015)
35.
go back to reference Rosenfeld, R.: Two decades of statistical language modeling: Where do we go from here? Proc. IEEE 88(8), 1270–8 (2000)CrossRef Rosenfeld, R.: Two decades of statistical language modeling: Where do we go from here? Proc. IEEE 88(8), 1270–8 (2000)CrossRef
36.
go back to reference Marti, U.V., Bunke, H.: Using a statistical language model to improve the performance of an HMM-based cursive handwriting recognition systems. Int. J. Pattern Recognit. Artif. Intell. 15(01), 65–90 (2001)CrossRef Marti, U.V., Bunke, H.: Using a statistical language model to improve the performance of an HMM-based cursive handwriting recognition systems. Int. J. Pattern Recognit. Artif. Intell. 15(01), 65–90 (2001)CrossRef
37.
go back to reference Li, N.-X., Jin, L.-W.: A Bayesian-based probabilistic model for unconstrained handwritten offline Chinese text line recognition. Proc. IEEE Int’l Conf. Systems, Man, and Cybernetics, pp. 3664 - 3668 (2010) Li, N.-X., Jin, L.-W.: A Bayesian-based probabilistic model for unconstrained handwritten offline Chinese text line recognition. Proc. IEEE Int’l Conf. Systems, Man, and Cybernetics, pp. 3664 - 3668 (2010)
38.
go back to reference Zhou, X.-D., Wang, D.-H., Tian, F., Liu, C.-L., Nakagawa, M.: Handwritten Chinese/Japanese text recognition using semi-markov conditional random fields. IEEE Trans. Pattern Anal. Mach. Intell. 35(10), 2413–2426 (2013)CrossRef Zhou, X.-D., Wang, D.-H., Tian, F., Liu, C.-L., Nakagawa, M.: Handwritten Chinese/Japanese text recognition using semi-markov conditional random fields. IEEE Trans. Pattern Anal. Mach. Intell. 35(10), 2413–2426 (2013)CrossRef
39.
go back to reference Bengio, Y., Ducharme, R., Vincent, P., Jauvin, C.: A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)MATH Bengio, Y., Ducharme, R., Vincent, P., Jauvin, C.: A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)MATH
40.
go back to reference Mikolov, T., Karafiat, M., Burget, L., Cernocky, J. H., Khudanpur, S.: Recurrent neural network based language model. Proc. Interspeech, pp. 1045-1048 (2010) Mikolov, T., Karafiat, M., Burget, L., Cernocky, J. H., Khudanpur, S.: Recurrent neural network based language model. Proc. Interspeech, pp. 1045-1048 (2010)
42.
go back to reference Irie, K., Tüske, Z., Alkhouli, T., et al.: LSTM, GRU, highway and a bit of attention: an empirical overview for language modeling in speech recognition. INTERSPEECH, 519-3523 (2016) Irie, K., Tüske, Z., Alkhouli, T., et al.: LSTM, GRU, highway and a bit of attention: an empirical overview for language modeling in speech recognition. INTERSPEECH, 519-3523 (2016)
43.
go back to reference Luong, T., Kayser, M., Manning, C.D.: Deep neural language models for machine translation. Nineteenth Conference on Computational Natural Language Learning, 305-309 (2015) Luong, T., Kayser, M., Manning, C.D.: Deep neural language models for machine translation. Nineteenth Conference on Computational Natural Language Learning, 305-309 (2015)
44.
go back to reference Bellegarda, J.R.: Exploiting latent semantic information in statistical language modeling. Proc. IEEE 88(8), 1279–1296 (2000)CrossRef Bellegarda, J.R.: Exploiting latent semantic information in statistical language modeling. Proc. IEEE 88(8), 1279–1296 (2000)CrossRef
45.
go back to reference Hofmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Mach. Learn. 42(1–2), 177–196 (2001)CrossRefMATH Hofmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Mach. Learn. 42(1–2), 177–196 (2001)CrossRefMATH
46.
go back to reference Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. JMLR 3, 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. JMLR 3, 993–1022 (2003)MATH
48.
go back to reference Gao, J., Suzuki, H., Yuan, W.: An empirical study on language model adaptation. ACM Trans. Asian Lang. Inf. Process. 5(3), 209–227 (2006)CrossRef Gao, J., Suzuki, H., Yuan, W.: An empirical study on language model adaptation. ACM Trans. Asian Lang. Inf. Process. 5(3), 209–227 (2006)CrossRef
49.
go back to reference Liu, X., Gales, M.J.F., Woodland, P.C.: Use of contexts in language model interpolation and adaptation. Comput. Speech Lang. 27(1), 301–321 (2013)CrossRef Liu, X., Gales, M.J.F., Woodland, P.C.: Use of contexts in language model interpolation and adaptation. Comput. Speech Lang. 27(1), 301–321 (2013)CrossRef
50.
go back to reference Su, T.-H., Zhang, T.-W., Guan, D.-J.: Corpus-based HIT-MW database for offline recognition of general-purpose Chinese handwritten text. Int’l J. Doc. Anal. Recognit. 10(1), 27–38 (2007)CrossRef Su, T.-H., Zhang, T.-W., Guan, D.-J.: Corpus-based HIT-MW database for offline recognition of general-purpose Chinese handwritten text. Int’l J. Doc. Anal. Recognit. 10(1), 27–38 (2007)CrossRef
51.
go back to reference Liu, C.-L., Yin, F., Wang, D.-H., Wang, Q.-F.: CASIA Online and offline Chinese handwriting databases. Proc. 11th Int’l Conf. Document Analysis and Recognition, pp. 37-41, (2011) Liu, C.-L., Yin, F., Wang, D.-H., Wang, Q.-F.: CASIA Online and offline Chinese handwriting databases. Proc. 11th Int’l Conf. Document Analysis and Recognition, pp. 37-41, (2011)
52.
go back to reference Yin, F., Wang, Q.-F., Zhang, X.-Y., Liu, C.-L.: ICDAR 2013 Chinese handwriting recognition competition. Proc. ICDAR, pp. 1464-1470 (2013) Yin, F., Wang, Q.-F., Zhang, X.-Y., Liu, C.-L.: ICDAR 2013 Chinese handwriting recognition competition. Proc. ICDAR, pp. 1464-1470 (2013)
53.
go back to reference Su, T.-H., Zhang, T., Guan, D.-J., Huang, H.-J.: Off-line recognition of realistic Chinese handwriting using segmentation-free strategy. Pattern Recognit. 42(1), 167–182 (2009) Su, T.-H., Zhang, T., Guan, D.-J., Huang, H.-J.: Off-line recognition of realistic Chinese handwriting using segmentation-free strategy. Pattern Recognit. 42(1), 167–182 (2009)
54.
go back to reference Wang, Z.-R., Du, Jun, Hu, J.-S., Hu, Yu-Long: Deep convolutional neural network based hidden markov model for offline handwritten Chinese text recognition. Proc. ACPR (2017) Wang, Z.-R., Du, Jun, Hu, J.-S., Hu, Yu-Long: Deep convolutional neural network based hidden markov model for offline handwritten Chinese text recognition. Proc. ACPR (2017)
55.
go back to reference Peng, D., Jin, L., Ma, W., Xie, C., Zhang, H., Zhu, S., Li, J.: Recognition of handwritten Chinese text by segmentation: A segment-annotation-free approach. IEEE Trans. Multimed. (2022) Peng, D., Jin, L., Ma, W., Xie, C., Zhang, H., Zhu, S., Li, J.: Recognition of handwritten Chinese text by segmentation: A segment-annotation-free approach. IEEE Trans. Multimed. (2022)
56.
go back to reference Wang, Z.-R., Du, J., Wang, J.-M.: Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition. Pattern Recogn. 100, 107102 (2020)CrossRef Wang, Z.-R., Du, J., Wang, J.-M.: Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition. Pattern Recogn. 100, 107102 (2020)CrossRef
57.
go back to reference Xu, L., Yin, F., Wang, Q.-F., Liu, C.-L.: An over-segmentation method for single touching Chinese handwriting with learning-based filtering. Int. J. Doc. Anal. Recognit. 17(1), 91–104 (2014)CrossRef Xu, L., Yin, F., Wang, Q.-F., Liu, C.-L.: An over-segmentation method for single touching Chinese handwriting with learning-based filtering. Int. J. Doc. Anal. Recognit. 17(1), 91–104 (2014)CrossRef
58.
go back to reference Wang, Z.X., Wang, Q.F., Yin, F., Liu, C.L.: Weakly supervised learning for over-segmentation based handwritten Chinese text recognition. 17th International Conference on Frontiers in Handwriting Recognition (ICFHR) (2020) Wang, Z.X., Wang, Q.F., Yin, F., Liu, C.L.: Weakly supervised learning for over-segmentation based handwritten Chinese text recognition. 17th International Conference on Frontiers in Handwriting Recognition (ICFHR) (2020)
59.
go back to reference Peng, D.-Z., Jin, L.-W., Wu, Y.-Q., Wang, Z.-P., Cai, M.-X.: A Fast and Accurate Fully Convolutional Network for End-to-End Handwritten Chinese Text Segmentation and Recognition. In Proc. 15th International Conference on Document Analysis and Recognition, pp. 25-30 (2019) Peng, D.-Z., Jin, L.-W., Wu, Y.-Q., Wang, Z.-P., Cai, M.-X.: A Fast and Accurate Fully Convolutional Network for End-to-End Handwritten Chinese Text Segmentation and Recognition. In Proc. 15th International Conference on Document Analysis and Recognition, pp. 25-30 (2019)
60.
go back to reference Fink, G.A.: Markov models for offline handwriting recognition: a survey. Springer-Verlag (2009) Fink, G.A.: Markov models for offline handwriting recognition: a survey. Springer-Verlag (2009)
61.
go back to reference Messina, R., Louradour, J.: Segmentation-free handwritten Chinese text recognition with LSTM-RNN. Proc. Int’l Conf. on Document Analysis and Recognition (ICDAR), pp.171-175 (2015) Messina, R., Louradour, J.: Segmentation-free handwritten Chinese text recognition with LSTM-RNN. Proc. Int’l Conf. on Document Analysis and Recognition (ICDAR), pp.171-175 (2015)
62.
go back to reference Stolcke, A.: SRILM—An extensible language modeling toolkit. In: Proceedings of the 7th international conference on spoken language processing (ICSLP 2002) 901-904 (2002) Stolcke, A.: SRILM—An extensible language modeling toolkit. In: Proceedings of the 7th international conference on spoken language processing (ICSLP 2002) 901-904 (2002)
63.
go back to reference Wang, S., Chen, L., Xu, L., Fan, W., Sun, J., Naoi, S.: Deep knowledge training and heterogeneous cnn for handwritten chinese text recognition. Proc. 15th International Conference on Frontiers of Handwriting Recognition, pp. 84-89 (2016) Wang, S., Chen, L., Xu, L., Fan, W., Sun, J., Naoi, S.: Deep knowledge training and heterogeneous cnn for handwritten chinese text recognition. Proc. 15th International Conference on Frontiers of Handwriting Recognition, pp. 84-89 (2016)
64.
go back to reference Xie, Z.-C., Huang, Y.-X., Zhu, Y.-Z., Jin, L.-W., Liu, Y.-L., Xie, L.-L.: Aggregation cross-entropy for sequence recognition. In Proc. the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6538-6547 (2019) Xie, Z.-C., Huang, Y.-X., Zhu, Y.-Z., Jin, L.-W., Liu, Y.-L., Xie, L.-L.: Aggregation cross-entropy for sequence recognition. In Proc. the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6538-6547 (2019)
Metadata
Title
Retrieval-based language model adaptation for handwritten Chinese text recognition
Authors
Shuying Hu
Qiufeng Wang
Kaizhu Huang
Min Wen
Frans Coenen
Publication date
27-10-2022
Publisher
Springer Berlin Heidelberg
Published in
International Journal on Document Analysis and Recognition (IJDAR) / Issue 2/2023
Print ISSN: 1433-2833
Electronic ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-022-00419-2

Other articles of this Issue 2/2023

International Journal on Document Analysis and Recognition (IJDAR) 2/2023 Go to the issue

Premium Partner