Skip to main content
Erschienen in: Cognitive Computation 2/2013

01.06.2013

Common Sense Knowledge for Handwritten Chinese Text Recognition

verfasst von: Qiu-Feng Wang, Erik Cambria, Cheng-Lin Liu, Amir Hussain

Erschienen in: Cognitive Computation | Ausgabe 2/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Compared to human intelligence, computers are far short of common sense knowledge which people normally acquire during the formative years of their lives. This paper investigates the effects of employing common sense knowledge as a new linguistic context in handwritten Chinese text recognition. Three methods are introduced to supplement the standard n-gram language model: embedding model, direct model, and an ensemble of these two. The embedding model uses semantic similarities from common sense knowledge to make the n-gram probabilities estimation more reliable, especially for the unseen n-grams in the training text corpus. The direct model, in turn, considers the linguistic context of the whole document to make up for the short context limit of the n-gram model. The three models are evaluated on a large unconstrained handwriting database, CASIA-HWDB, and the results show that the adoption of common sense knowledge yields improvements in recognition performance, despite the reduced concept list hereby employed.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
In the case of unconstrained texts, no corpus is wide enough to contain all possible n-grams.
 
2
In Chinese, a word can comprises one or multiple characters, which can explore both syntactic and semantic meaning better than a character.
 
3
High-order n-gram models need much larger training corpus and higher cost of computation and memory, n usually takes no more than 5 in practice.
 
Literatur
1.
Zurück zum Zitat Liu H, Singh P. ConceptNet—a practical commonsense reasoning tool-kit. BT Tech J (2004); 22(4):211–26.CrossRef Liu H, Singh P. ConceptNet—a practical commonsense reasoning tool-kit. BT Tech J (2004); 22(4):211–26.CrossRef
2.
Zurück zum Zitat Lenat D, Guha R. Building large knowledge-based systems: representation and inference in the Cyc project. Reading: Addison-Wesley; (1989). Lenat D, Guha R. Building large knowledge-based systems: representation and inference in the Cyc project. Reading: Addison-Wesley; (1989).
3.
Zurück zum Zitat Speer R, Havasi C. Representing general relational knowledge in ConceptNet 5. In: International conference on language resources and evaluation (LREC); 2012. p. 3679–86. Speer R, Havasi C. Representing general relational knowledge in ConceptNet 5. In: International conference on language resources and evaluation (LREC); 2012. p. 3679–86.
4.
Zurück zum Zitat Cambria E, Havasi C, Hussain A. SenticNet 2: a semantic and affective resource for opinion mining and sentiment analysis. In: Florida artificial intelligence research society conference (FLAIRS); 2012. p. 202–7. Cambria E, Havasi C, Hussain A. SenticNet 2: a semantic and affective resource for opinion mining and sentiment analysis. In: Florida artificial intelligence research society conference (FLAIRS); 2012. p. 202–7.
5.
Zurück zum Zitat Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J. Freebase: A collaboratively created graph database for structuring human knowledge. In: SIGMOD Conference; 2008. pp. 1247–50. Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J. Freebase: A collaboratively created graph database for structuring human knowledge. In: SIGMOD Conference; 2008. pp. 1247–50.
6.
Zurück zum Zitat Cambria E, Song Y, Wang H, Hussain A. Isanette: a common and common sense knowledge base for opinon mining. In: International conference on data mining; 2011. p. 315–22. Cambria E, Song Y, Wang H, Hussain A. Isanette: a common and common sense knowledge base for opinon mining. In: International conference on data mining; 2011. p. 315–22.
7.
Zurück zum Zitat Hsu MH, Chen HH. Information retrieval with commonsense knowledge. In: ACM SIGIR conference on research and development in information retrieval; 2006. p. 651–2. Hsu MH, Chen HH. Information retrieval with commonsense knowledge. In: ACM SIGIR conference on research and development in information retrieval; 2006. p. 651–2.
8.
Zurück zum Zitat Cambria E, Hussain A. Sentic computing: techniques, tools, and applications. Berlin: SpringerBriefs in Cognitive Computation, Springer (2012).CrossRef Cambria E, Hussain A. Sentic computing: techniques, tools, and applications. Berlin: SpringerBriefs in Cognitive Computation, Springer (2012).CrossRef
9.
Zurück zum Zitat Lieberman H, Faaborg A, Daher W, Espinosa J. How to wreck a nice beach you sing calm incense. In: International conference on intelligent user interfaces; 2005. Lieberman H, Faaborg A, Daher W, Espinosa J. How to wreck a nice beach you sing calm incense. In: International conference on intelligent user interfaces; 2005.
10.
Zurück zum Zitat Cambria E, Hussain A. Sentic Album: content-, concept-, and context-based online personal photo management system. In: Cognitive Computation, Springer, Berlin/Heidelberg; 2012 (in press). Cambria E, Hussain A. Sentic Album: content-, concept-, and context-based online personal photo management system. In: Cognitive Computation, Springer, Berlin/Heidelberg; 2012 (in press).
11.
Zurück zum Zitat Stocky T, Faaborg A, Lieberman H. A commonsense approach to predictive text entry. In: International conference on human factors in computing systems; 2004. Stocky T, Faaborg A, Lieberman H. A commonsense approach to predictive text entry. In: International conference on human factors in computing systems; 2004.
12.
Zurück zum Zitat Cambria E, Benson T, Eckl C, Hussain A. Sentic PROMs: application of sentic computing to the development of a novel unified framework for measuring health-care quality. Expert Syst Appl (2012); 39(12):10533–43.CrossRef Cambria E, Benson T, Eckl C, Hussain A. Sentic PROMs: application of sentic computing to the development of a novel unified framework for measuring health-care quality. Expert Syst Appl (2012); 39(12):10533–43.CrossRef
13.
Zurück zum Zitat Dai RW, Liu CL, Xiao BH. Chinese character recognition: history, status and prospects. Front Comput Sci China (2007); 1(2):126–36.CrossRef Dai RW, Liu CL, Xiao BH. Chinese character recognition: history, status and prospects. Front Comput Sci China (2007); 1(2):126–36.CrossRef
14.
Zurück zum Zitat Tang HS, Augustin E, Suen CY, Baret O, Cheriet M. Spiral recognition methodology and its application for recognition of Chinese bank checks. In: International workshop on frontiers in handwriting recognition (IWFHR); 2004. p. 263–8. Tang HS, Augustin E, Suen CY, Baret O, Cheriet M. Spiral recognition methodology and its application for recognition of Chinese bank checks. In: International workshop on frontiers in handwriting recognition (IWFHR); 2004. p. 263–8.
15.
Zurück zum Zitat Wang CH, Hotta Y, Suwa M, Naoi S. Handwritten Chinese address recognition. In: International workshop on frontiers in handwriting recognition (IWFHR); 2004. p. 539–44. Wang CH, Hotta Y, Suwa M, Naoi S. Handwritten Chinese address recognition. In: International workshop on frontiers in handwriting recognition (IWFHR); 2004. p. 539–44.
16.
Zurück zum Zitat Su TH, Zhang TW, Guan DJ, Huang HJ. Off-line recognition of realistic chinese handwriting using segmentation-free strategy. Pattern Recogn (2009); 42(1):167–82.CrossRef Su TH, Zhang TW, Guan DJ, Huang HJ. Off-line recognition of realistic chinese handwriting using segmentation-free strategy. Pattern Recogn (2009); 42(1):167–82.CrossRef
17.
Zurück zum Zitat Li NX, Jin LW. A Bayesian-based probabilistic model for unconstrained handwritten offline Chinese text line recognition. In: International conference on systems man and cybernetics (SMC), 2010. p. 3664–68. Li NX, Jin LW. A Bayesian-based probabilistic model for unconstrained handwritten offline Chinese text line recognition. In: International conference on systems man and cybernetics (SMC), 2010. p. 3664–68.
18.
Zurück zum Zitat Wang QF, Yin F, Liu CL. Handwritten Chinese text recognition by integrating multiple contexts. IEEE Trans Pattern Anal Mach Intell (2012); 34(8):1469–81.PubMedCrossRef Wang QF, Yin F, Liu CL. Handwritten Chinese text recognition by integrating multiple contexts. IEEE Trans Pattern Anal Mach Intell (2012); 34(8):1469–81.PubMedCrossRef
19.
Zurück zum Zitat Li YX, Tan CL, Ding XQ. A hybrid post-processing system for offline handwritten Chinese script recognition. pattern analysis and applications (2005); 8(3):272–86.CrossRef Li YX, Tan CL, Ding XQ. A hybrid post-processing system for offline handwritten Chinese script recognition. pattern analysis and applications (2005); 8(3):272–86.CrossRef
20.
Zurück zum Zitat Rosenfeld R. Two decades of statistical language modeling: where do we go from here? Proc IEEE (2000); 88(8):1270–8.CrossRef Rosenfeld R. Two decades of statistical language modeling: where do we go from here? Proc IEEE (2000); 88(8):1270–8.CrossRef
21.
Zurück zum Zitat Goodman JT. A bit of progress in language modeling: extended version. Technical Report MSR-TR-2001-72, Microsoft Research 2001. Goodman JT. A bit of progress in language modeling: extended version. Technical Report MSR-TR-2001-72, Microsoft Research 2001.
22.
Zurück zum Zitat Liu CL, Yin F, Wang DH, Wang QF. CASIA online and offline Chinese handwriting databases. In: International conference on document analysis and recognition (ICDAR); 2011. p. 37–41. Liu CL, Yin F, Wang DH, Wang QF. CASIA online and offline Chinese handwriting databases. In: International conference on document analysis and recognition (ICDAR); 2011. p. 37–41.
23.
Zurück zum Zitat Wang QF, Yin F, Liu CL. Integrating language model in handwritten Chinese text recognition. In: International conference on document analysis and recognition (ICDAR); 2009. p. 1036–40. Wang QF, Yin F, Liu CL. Integrating language model in handwritten Chinese text recognition. In: International conference on document analysis and recognition (ICDAR); 2009. p. 1036–40.
24.
Zurück zum Zitat Wang QF, Yin F, Liu CL. Improving handwritten Chinese text recognition by unsupervised language model adaptation. In: International workshop on document analysis systems (DAS); 2012. p. 110–4. Wang QF, Yin F, Liu CL. Improving handwritten Chinese text recognition by unsupervised language model adaptation. In: International workshop on document analysis systems (DAS); 2012. p. 110–4.
25.
Zurück zum Zitat Siu M, Mari O. Variable n-grams and extensions for conversational speech language modeling. IEEE Trans Speech Audio Process (2000); 8:63–75.CrossRef Siu M, Mari O. Variable n-grams and extensions for conversational speech language modeling. IEEE Trans Speech Audio Process (2000); 8:63–75.CrossRef
26.
Zurück zum Zitat Kuhn R, De Mori R. A cache-based natural language model for speech reproduction. IEEE Trans Pattern Anal Mach Intell (1990); 12(6):570–83.CrossRef Kuhn R, De Mori R. A cache-based natural language model for speech reproduction. IEEE Trans Pattern Anal Mach Intell (1990); 12(6):570–83.CrossRef
27.
Zurück zum Zitat Kuhn R, De Mori R. Correction to a cache-based natural language model for speech reproduction. IEEE Trans Pattern Anal Mach Intell (1992); 14(6):691–2. Kuhn R, De Mori R. Correction to a cache-based natural language model for speech reproduction. IEEE Trans Pattern Anal Mach Intell (1992); 14(6):691–2.
28.
Zurück zum Zitat Coccaro N, Jurafsky D. Towards better integration of semantic predictors in statistical language modeling. In: International conference on spoken language processing (ICSLP), 1998. p. 2403–6. Coccaro N, Jurafsky D. Towards better integration of semantic predictors in statistical language modeling. In: International conference on spoken language processing (ICSLP), 1998. p. 2403–6.
29.
Zurück zum Zitat Bellegarda JR. A multispan language modeling framework for large vocabulary speech recognition. IEEE Trans Speech Audio Process (1998); 6(5):456–467.CrossRef Bellegarda JR. A multispan language modeling framework for large vocabulary speech recognition. IEEE Trans Speech Audio Process (1998); 6(5):456–467.CrossRef
30.
Zurück zum Zitat Bellegarda JR. Exploiting latent semantic information in statistical language modeling. Proc IEEE (2000); 88(8):1279–96.CrossRef Bellegarda JR. Exploiting latent semantic information in statistical language modeling. Proc IEEE (2000); 88(8):1279–96.CrossRef
31.
Zurück zum Zitat Mrva D, Woodland PC. Unsupervised Language Model Adaptation for Mandarin Broadcast Conversation Transcription. In: Interspeech, 2006. pp. 2206–9. Mrva D, Woodland PC. Unsupervised Language Model Adaptation for Mandarin Broadcast Conversation Transcription. In: Interspeech, 2006. pp. 2206–9.
32.
Zurück zum Zitat Cambria E, Grassi M, Hussain A, Havasi C. Sentic computing for social media marketing. Multimed Tools Appl (2012); 59(2):557–77.CrossRef Cambria E, Grassi M, Hussain A, Havasi C. Sentic computing for social media marketing. Multimed Tools Appl (2012); 59(2):557–77.CrossRef
33.
Zurück zum Zitat Cambria E, Olsher D, Kwok K. Sentic activation: a two-level affective common sense reasoning framework. In: Association for the Advancement of Artificial Intelligence (AAAI); 2012. Cambria E, Olsher D, Kwok K. Sentic activation: a two-level affective common sense reasoning framework. In: Association for the Advancement of Artificial Intelligence (AAAI); 2012.
34.
Zurück zum Zitat Martin S, Liermann J, Ney H. Algorithms for bigram and trigram word clustering. Speech Commun (1998); 24(1):19–37.CrossRef Martin S, Liermann J, Ney H. Algorithms for bigram and trigram word clustering. Speech Commun (1998); 24(1):19–37.CrossRef
35.
Zurück zum Zitat Manning CD, Schutze H. Foundations of statistical natural language processing. 2nd ed. The MIT Press, Cambridge (1999). Manning CD, Schutze H. Foundations of statistical natural language processing. 2nd ed. The MIT Press, Cambridge (1999).
36.
Zurück zum Zitat Lieberman H, Liu H, Singh H, Barry B. Beating common sense into interactive applications. AI Mag (2004); 25(4):63–76. Lieberman H, Liu H, Singh H, Barry B. Beating common sense into interactive applications. AI Mag (2004); 25(4):63–76.
37.
Zurück zum Zitat Speer R, Havasi C, Lieberman H. AnalogySpace: reducing the dimensionality of common sense knowledge. In: Association for the Advancement of Artificial Intelligence (AAAI). 2008. p. 548–53. Speer R, Havasi C, Lieberman H. AnalogySpace: reducing the dimensionality of common sense knowledge. In: Association for the Advancement of Artificial Intelligence (AAAI). 2008. p. 548–53.
38.
Zurück zum Zitat Chen SF, Goodman J (1998) An empirical study of smoothing techniques for language modeling. Cambridge, Massachusetts: Computer Science Group, Harvard University, TR-10-98. Chen SF, Goodman J (1998) An empirical study of smoothing techniques for language modeling. Cambridge, Massachusetts: Computer Science Group, Harvard University, TR-10-98.
39.
Zurück zum Zitat Katz SM. Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Trans Acoustics Speech Signal Process (1987); 35(3):400–1.CrossRef Katz SM. Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Trans Acoustics Speech Signal Process (1987); 35(3):400–1.CrossRef
40.
Zurück zum Zitat Kimura F, Takashina K, Tsuruoka S, Miyake Y. modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans Pattern Anal Mach Intell (1987); 9(1):149–53.PubMedCrossRef Kimura F, Takashina K, Tsuruoka S, Miyake Y. modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans Pattern Anal Mach Intell (1987); 9(1):149–53.PubMedCrossRef
Metadaten
Titel
Common Sense Knowledge for Handwritten Chinese Text Recognition
verfasst von
Qiu-Feng Wang
Erik Cambria
Cheng-Lin Liu
Amir Hussain
Publikationsdatum
01.06.2013
Verlag
Springer-Verlag
Erschienen in
Cognitive Computation / Ausgabe 2/2013
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-012-9183-y

Weitere Artikel der Ausgabe 2/2013

Cognitive Computation 2/2013 Zur Ausgabe