Skip to main content

2016 | OriginalPaper | Buchkapitel

Chinese Word Similarity Computing Based on Combination Strategy

verfasst von : Shaoru Guo, Yong Guan, Ru Li, Qi Zhang

Erschienen in: Natural Language Understanding and Intelligent Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Chinese word similarity computing is a fundamental task for natural language processing. This paper presents a method to calculate the similarity between Chinese words based on combination strategy. We apply Baidubaike to train Word2Vector model, and then integrate different methods, semantic Dictionary-based method, Word2Vector-based method and Chinese FrameNet (CFN)-based method, to calculate the semantic similarity between Chinese words. The semantic Dictionary-based method includes dictionaries such as HowNet, DaCilin, Tongyici Cilin (Extended) and Antonym. The experiments are performed on 500 pairs of words and the Spearman correlation coefficient of test data is 0.524, which shows that the proposed method is feasible and effective.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013) Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Zurück zum Zitat Liu, K.: Research on Chinese FrameNet construction and application technologies. J. Chin. Inf. Process. 6(006), 47 (2011) Liu, K.: Research on Chinese FrameNet construction and application technologies. J. Chin. Inf. Process. 6(006), 47 (2011)
Zurück zum Zitat Dong, Z., Dong, Q., Hao, C.: Hownet and its computation of meaning. In: Proceedings of 23rd International Conference on Computational Linguistics: Demonstrations, pp. 53–56. Association for Computational Linguistics, August 2010 Dong, Z., Dong, Q., Hao, C.: Hownet and its computation of meaning. In: Proceedings of 23rd International Conference on Computational Linguistics: Demonstrations, pp. 53–56. Association for Computational Linguistics, August 2010
Zurück zum Zitat Liu, Q., Li, S.: Word similarity computing based on How-net. Comput. Linguist. Chin. Lang. Process. 7(2), 59–76 (2002) Liu, Q., Li, S.: Word similarity computing based on How-net. Comput. Linguist. Chin. Lang. Process. 7(2), 59–76 (2002)
Zurück zum Zitat Xue, B., Fu, C., Shaobin, Z.: A study on sentiment computing and classification of Sina Weibo with Word2vec. In: 2014 IEEE International Congress on Big Data, pp. 358–363. IEEE, June 2014 Xue, B., Fu, C., Shaobin, Z.: A study on sentiment computing and classification of Sina Weibo with Word2vec. In: 2014 IEEE International Congress on Big Data, pp. 358–363. IEEE, June 2014
Zurück zum Zitat Fillmore, C.J.: Frame semantics and the nature of language. Ann. N.Y. Acad. Sci. 280(1), 20–32 (1976)CrossRef Fillmore, C.J.: Frame semantics and the nature of language. Ann. N.Y. Acad. Sci. 280(1), 20–32 (1976)CrossRef
Zurück zum Zitat Fillmore, C.: Frame semantics. In: Linguistics in the Morning Calm, pp. 111–137 (1982) Fillmore, C.: Frame semantics. In: Linguistics in the Morning Calm, pp. 111–137 (1982)
Zurück zum Zitat Fillmore, C.J., Wooters, C., Baker, C.F.: Building a large lexical databank which provides deep semantics. publisher not identified (2001) Fillmore, C.J., Wooters, C., Baker, C.F.: Building a large lexical databank which provides deep semantics. publisher not identified (2001)
Zurück zum Zitat Hao, X., Wei, L., Ru, L., Kaiying, L.: Description systems of the Chinese FrameNet database and software tools. J. Chin. Inf. Process. 21(5), 96–100 (2007) Hao, X., Wei, L., Ru, L., Kaiying, L.: Description systems of the Chinese FrameNet database and software tools. J. Chin. Inf. Process. 21(5), 96–100 (2007)
Zurück zum Zitat Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Proceedings of 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, vol. 1, pp. 86–90. Association for Computational Linguistics, August 1998 Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Proceedings of 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, vol. 1, pp. 86–90. Association for Computational Linguistics, August 1998
Zurück zum Zitat Wu, Y., Li, W.: NLPCC-ICCPOL 2016 Shared Task 3: Chinese word similarity measurement. In: Proceedings of NLPCC 2016 (2016) Wu, Y., Li, W.: NLPCC-ICCPOL 2016 Shared Task 3: Chinese word similarity measurement. In: Proceedings of NLPCC 2016 (2016)
Zurück zum Zitat Petruck, M.R.L.: Frame semantics. Handbook of Pragmatics, pp. 1–13 (1996) Petruck, M.R.L.: Frame semantics. Handbook of Pragmatics, pp. 1–13 (1996)
Metadaten
Titel
Chinese Word Similarity Computing Based on Combination Strategy
verfasst von
Shaoru Guo
Yong Guan
Ru Li
Qi Zhang
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-50496-4_67

Premium Partner