Skip to main content

2016 | OriginalPaper | Buchkapitel

WSD-TIC: Word Sense Disambiguation Using Taxonomic Information Content

verfasst von : Mohamed Ben Aouicha, Mohamed Ali Hadj Taieb, Hania Ibn Marai

Erschienen in: Computational Collective Intelligence

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Word sense disambiguation (WSD) is the ability to identify the meaning of words in context in a computational manner. WSD is considered as an AI-complete problem, that is, a task whose solution is at least as hard as the most difficult problems in artificial intelligence. This is basically used in application like information retrieval, machine translation, information extraction because of its semantics understanding. This paper describes the proposed approach (WSD-TIC) which is based on the words surrounding the polysemous word in a context. Each meaning of these words is represented by a vector composed of weighted nouns using taxonomic information content. The main emphasis of this paper is feature selection for disambiguation purpose. The assessment of WSD systems is discussed in the context of the Senseval campaign, aiming at the objective evaluation of our proposal to the systems participating in several different disambiguation tasks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Weaver, W.: Translation. In: Locke, W.N., Boothe, A.D. (eds.) Machine Translation of Languages, pp. 15–23. MIT Press, Cambridge (1955) Weaver, W.: Translation. In: Locke, W.N., Boothe, A.D. (eds.) Machine Translation of Languages, pp. 15–23. MIT Press, Cambridge (1955)
2.
Zurück zum Zitat Brill, E.: Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging. Comput. Linguist. 21, 543–565 (1995) Brill, E.: Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging. Comput. Linguist. 21, 543–565 (1995)
3.
Zurück zum Zitat Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database (Language, Speech, and Communication). The MIT Press, Cambridge (1998). Illustrated editionMATH Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database (Language, Speech, and Communication). The MIT Press, Cambridge (1998). Illustrated editionMATH
4.
Zurück zum Zitat Véronis, J.: HyperLex: lexical cartography for information retrieval. Comput. Speech Lang. 18, 223–252 (2004)CrossRef Véronis, J.: HyperLex: lexical cartography for information retrieval. Comput. Speech Lang. 18, 223–252 (2004)CrossRef
5.
Zurück zum Zitat Nameh, M., Fakhrahmad, S.M., Jahromi, M.Z.: A new approach to word sense disambiguation based on context similarity. In: Proceedings of the Workshop of the World Congress on Engineering (2011) Nameh, M., Fakhrahmad, S.M., Jahromi, M.Z.: A new approach to word sense disambiguation based on context similarity. In: Proceedings of the Workshop of the World Congress on Engineering (2011)
6.
Zurück zum Zitat Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: Proceedings of the 5th Annual International Conference on Systems Documentation, pp. 24–26. ACM, Toronto (1986) Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: Proceedings of the 5th Annual International Conference on Systems Documentation, pp. 24–26. ACM, Toronto (1986)
7.
Zurück zum Zitat Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence, pp. 805–810. Morgan Kaufmann Publishers Inc., Acapulco (2003) Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence, pp. 805–810. Morgan Kaufmann Publishers Inc., Acapulco (2003)
8.
Zurück zum Zitat Sinha, R., Mihalcea, R.: Unsupervised graph-basedword sense disambiguation using measures of word semantic similarity. In: Proceedings of the International Conference on Semantic Computing, pp. 363–369. IEEE Computer Society, Washington (2007) Sinha, R., Mihalcea, R.: Unsupervised graph-basedword sense disambiguation using measures of word semantic similarity. In: Proceedings of the International Conference on Semantic Computing, pp. 363–369. IEEE Computer Society, Washington (2007)
9.
Zurück zum Zitat Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. CoRR cmp-lg/9709008 (1997) Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. CoRR cmp-lg/9709008 (1997)
10.
Zurück zum Zitat Leacock, C., Chodorow, M.: Combining local context and WordNet similarity for word sense identification. WordNet Electron. Lexical Database 49(2), 265–283 (1998). Fellfaum, C. (ed.) MIT Press, Cambridge Leacock, C., Chodorow, M.: Combining local context and WordNet similarity for word sense identification. WordNet Electron. Lexical Database 49(2), 265–283 (1998). Fellfaum, C. (ed.) MIT Press, Cambridge
11.
Zurück zum Zitat Hessami, F.E., Mahmoudi, Jadidinejad, A.: Unsupervised graph-based word sense disambiguation using lexical relation of WordNet. Int. J. Comput. Sci. Issues (IJCSI) (2011) Hessami, F.E., Mahmoudi, Jadidinejad, A.: Unsupervised graph-based word sense disambiguation using lexical relation of WordNet. Int. J. Comput. Sci. Issues (IJCSI) (2011)
12.
Zurück zum Zitat Basile, P., Caputo, A., Semeraro, G.: An enhanced lesk word sense disambiguation algorithm through a distributional semantic model. In: 25th International Conference on Computational Linguistics, Proceedings of the Conference: COLING 2014, Technical papers, 23–29 August 2014, pp. 1591–1600. Dublin, Ireland (2014) Basile, P., Caputo, A., Semeraro, G.: An enhanced lesk word sense disambiguation algorithm through a distributional semantic model. In: 25th International Conference on Computational Linguistics, Proceedings of the Conference: COLING 2014, Technical papers, 23–29 August 2014, pp. 1591–1600. Dublin, Ireland (2014)
13.
Zurück zum Zitat Navigli, R., Ponzetto, S.P.: BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artif. Intell. 193, 217–250 (2012)MathSciNetCrossRefMATH Navigli, R., Ponzetto, S.P.: BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artif. Intell. 193, 217–250 (2012)MathSciNetCrossRefMATH
14.
Zurück zum Zitat Hadj Taieb, M.A., Ben Aouicha, M., Ben Hamadou, A.: A new semantic relatedness measurement using WordNet features. Knowl. Inf. Syst. 41, 467–497 (2014)CrossRef Hadj Taieb, M.A., Ben Aouicha, M., Ben Hamadou, A.: A new semantic relatedness measurement using WordNet features. Knowl. Inf. Syst. 41, 467–497 (2014)CrossRef
15.
Zurück zum Zitat Navigli, R.: Word sense disambiguation: a survey. ACM Comput. Surv. 41, 10:1–10:69 (2009)CrossRef Navigli, R.: Word sense disambiguation: a survey. ACM Comput. Surv. 41, 10:1–10:69 (2009)CrossRef
16.
Zurück zum Zitat Palmer, M., Fellbaum, C., Cotton, S., Delfs, L., Dang, H.T.: English tasks: all-words and verb lexical sample. In: Proceedings of SENSEVAL-2 Second International Workshop on Evaluating Word Sense Disambiguation Systems. Association for Computational Linguistics, pp. 21–24, Toulouse, France (2001) Palmer, M., Fellbaum, C., Cotton, S., Delfs, L., Dang, H.T.: English tasks: all-words and verb lexical sample. In: Proceedings of SENSEVAL-2 Second International Workshop on Evaluating Word Sense Disambiguation Systems. Association for Computational Linguistics, pp. 21–24, Toulouse, France (2001)
17.
Zurück zum Zitat Snyder, B., Palmer, M.: The English all-words task. In: Mihalcea, R., Edmonds, P. (eds.) Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text. Association for Computational Linguistics, pp. 41–43, Barcelona, Spain (2004) Snyder, B., Palmer, M.: The English all-words task. In: Mihalcea, R., Edmonds, P. (eds.) Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text. Association for Computational Linguistics, pp. 41–43, Barcelona, Spain (2004)
Metadaten
Titel
WSD-TIC: Word Sense Disambiguation Using Taxonomic Information Content
verfasst von
Mohamed Ben Aouicha
Mohamed Ali Hadj Taieb
Hania Ibn Marai
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-45243-2_12