Skip to main content
Erschienen in:
Buchtitelbild

2018 | OriginalPaper | Buchkapitel

A New Approach to the Supervised Word Sense Disambiguation

verfasst von : Gennady Agre, Daniel Petrov, Simona Keskinova

Erschienen in: Artificial Intelligence: Methodology, Systems, and Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The paper presents a new supervised approach for solving the all-words sense disambiguation (WSD) task, which allows avoiding the necessity to construct different specialized classifiers for disambiguating different target words. In the core of the approach lies a new interpretation of the notion ‘class’, which relates each possible meaning of a word to a frequency with which it occurs in some corpora. In such a way all possible senses of different words can be classified in a unified way into a restricted set of classes starting from the most frequent, and ending with the least frequent class. For representing target and context words the approach uses word embeddings and information about their part-of-speech (POS) categories. The experiments have shown that classifiers trained on examples created by means of the approach outperform the standard baselines for measuring the behavior of all-words WSD classifiers.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Mallery, J.C.: Thinking about foreign policy: finding an appropriate role for artificial intelligence computers. Ph.D. dissertation. MIT Political Science Department, Cambridge, MA (1988) Mallery, J.C.: Thinking about foreign policy: finding an appropriate role for artificial intelligence computers. Ph.D. dissertation. MIT Political Science Department, Cambridge, MA (1988)
2.
Zurück zum Zitat Navigli, R. Word sense disambiguation: a survey. ACM Comput. Surv. 41(2) (2009). Article 10 Navigli, R. Word sense disambiguation: a survey. ACM Comput. Surv. 41(2) (2009). Article 10
3.
Zurück zum Zitat Fellbaum, C.: WordNet and wordnets. In: Brown, K., et al. (eds.) Encyclopedia of Language and Linguistics, 2nd edn., pp. 665–670. Elsevier, Oxford (2005) Fellbaum, C.: WordNet and wordnets. In: Brown, K., et al. (eds.) Encyclopedia of Language and Linguistics, 2nd edn., pp. 665–670. Elsevier, Oxford (2005)
4.
Zurück zum Zitat Miller, G.A., Leacock, C., Tengi, R., Bunker, R.T.: A semantic concordance. In: Proceedings of the ARPA Workshop on Human Language Technology, pp. 303–308 (1993) Miller, G.A., Leacock, C., Tengi, R., Bunker, R.T.: A semantic concordance. In: Proceedings of the ARPA Workshop on Human Language Technology, pp. 303–308 (1993)
5.
Zurück zum Zitat Kuchera, H., Francis, W.N.: Computational Analysis of Present-Day American English. Brown University Press, Providence (1967) Kuchera, H., Francis, W.N.: Computational Analysis of Present-Day American English. Brown University Press, Providence (1967)
6.
Zurück zum Zitat Pilehvar, M.T., Navigli, R.: A large-scale pseudoword-based evaluation framework for state-of-the-art Word Sense Disambiguation. Comput. Linguist. 40(4), 837–881 (2014)CrossRef Pilehvar, M.T., Navigli, R.: A large-scale pseudoword-based evaluation framework for state-of-the-art Word Sense Disambiguation. Comput. Linguist. 40(4), 837–881 (2014)CrossRef
7.
Zurück zum Zitat Raganato, A., Camacho-Collados, J., Navigli, R.: Word sense disambiguation: a unified evaluation framework and empirical comparison. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, pp. 99–110 (2017) Raganato, A., Camacho-Collados, J., Navigli, R.: Word sense disambiguation: a unified evaluation framework and empirical comparison. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, pp. 99–110 (2017)
8.
Zurück zum Zitat Lesk, M. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In Proceedings of the 5th SIGDOC, New York, NY, pp. 24–26 (1986) Lesk, M. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In Proceedings of the 5th SIGDOC, New York, NY, pp. 24–26 (1986)
9.
Zurück zum Zitat Chen, X., Liu, Z., Sun, M.: A unified model for word sense representation and disambiguation. In: Proceedings of EMNLP, pp. 1025–1035 (2014) Chen, X., Liu, Z., Sun, M.: A unified model for word sense representation and disambiguation. In: Proceedings of EMNLP, pp. 1025–1035 (2014)
10.
Zurück zum Zitat Camacho-Collados, J., Pilehvar, M.H., Navigli, R.: Nasari: integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities. Artif. Intell. 240, 36–64 (2016)MathSciNetCrossRef Camacho-Collados, J., Pilehvar, M.H., Navigli, R.: Nasari: integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities. Artif. Intell. 240, 36–64 (2016)MathSciNetCrossRef
11.
Zurück zum Zitat Agirre, E., Soroa, A.: Personalizing Pagerank for word sense disambiguation. In: Proceedings of 12th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 33–41 (2009) Agirre, E., Soroa, A.: Personalizing Pagerank for word sense disambiguation. In: Proceedings of 12th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 33–41 (2009)
12.
13.
Zurück zum Zitat Zhong, Z., Ng, H.T.: It Makes Sense: a wide-coverage Word Sense Disambiguation system for free text. In: Proceedings of the ACL System Demonstrations, pp. 78–83 (2010) Zhong, Z., Ng, H.T.: It Makes Sense: a wide-coverage Word Sense Disambiguation system for free text. In: Proceedings of the ACL System Demonstrations, pp. 78–83 (2010)
14.
Zurück zum Zitat Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representations in Vector Space. arXiv preprint arXiv:1301.3781 (2013) Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representations in Vector Space. arXiv preprint arXiv:​1301.​3781 (2013)
15.
Zurück zum Zitat Taghipour, K., Ng, H. T. Semisupervised word sense disambiguation using word embeddings in general and specific domains. In: Proceedings of NAACL HLT, pp. 314–323 (2015) Taghipour, K., Ng, H. T. Semisupervised word sense disambiguation using word embeddings in general and specific domains. In: Proceedings of NAACL HLT, pp. 314–323 (2015)
16.
Zurück zum Zitat Rothe, S., Schutze, H.: AutoExtend: extending word embeddings to embeddings for synsets and lexemes. In: Proceedings of ACL, Beijing, China, pp. 1793–1803 (2015) Rothe, S., Schutze, H.: AutoExtend: extending word embeddings to embeddings for synsets and lexemes. In: Proceedings of ACL, Beijing, China, pp. 1793–1803 (2015)
17.
Zurück zum Zitat Iacobacci, I., Pilehvar, M.H., Navigli, R.: Embeddings for word sense disambiguation: An evaluation study. In: Proceedings of ACL, Berlin, Germany, pp. 897–907 (2016) Iacobacci, I., Pilehvar, M.H., Navigli, R.: Embeddings for word sense disambiguation: An evaluation study. In: Proceedings of ACL, Berlin, Germany, pp. 897–907 (2016)
18.
Zurück zum Zitat Melamud, O., Goldberger, J., Dagan, I.: Learning generic context embedding with bidirectional LSTM. In: Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning (CoNLL), pp. 51–61 (2016) Melamud, O., Goldberger, J., Dagan, I.: Learning generic context embedding with bidirectional LSTM. In: Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning (CoNLL), pp. 51–61 (2016)
19.
20.
Zurück zum Zitat Yuan, D., Richardson, J., Doherty, R., Evans, C., Altendorf, E.: Semi-supervised word sense disambiguation with neural models. In: Proceedings of COLING, pp. 1374–1385 (2016) Yuan, D., Richardson, J., Doherty, R., Evans, C., Altendorf, E.: Semi-supervised word sense disambiguation with neural models. In: Proceedings of COLING, pp. 1374–1385 (2016)
22.
Zurück zum Zitat Zhou, Z-H., Feng, J.: Deep Forest: towards an alternative to deep neural networks. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-2017), pp. 3553–3559 (2017) Zhou, Z-H., Feng, J.: Deep Forest: towards an alternative to deep neural networks. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-2017), pp. 3553–3559 (2017)
Metadaten
Titel
A New Approach to the Supervised Word Sense Disambiguation
verfasst von
Gennady Agre
Daniel Petrov
Simona Keskinova
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-99344-7_1