Skip to main content
Erschienen in: Knowledge and Information Systems 2/2022

03.01.2022 | Regular Paper

Named entity disambiguation in short texts over knowledge graphs

verfasst von: Wissem Bouarroudj, Zizette Boufaida, Ladjel Bellatreche

Erschienen in: Knowledge and Information Systems | Ausgabe 2/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The ever-growing usage of knowledge graphs (KGs) positions named entity disambiguation (NED) at the heart of designing accurate KG-driven systems such as query answering systems (QAS). According to the current research, most studies dealing with NED on KGs involve long texts, which is not the case of short text fragments, identified by their limited contexts. The accuracy of QASs strongly depends on the management of such short text. This limitation motivates this paper, which studies the NED problem on KGs, involving only short texts. First, we propose a NED approach including the following steps: (i) context expansion using WordNet to measure its similarity to the resource context. (ii) Exploiting coherence between entities in queries that contain more than one entity, such as “Is Michelle Obama the wife of Barack Obama?”. (iii) Taking into account the relations between words to calculate their similarity with the properties of a resource. (iv) the use of syntactic features. The NED solution approach is compared to state-of-the-art approaches using five datasets. The experimental results show that our approach outperforms these systems by 27% in the F-measure. A system called Welink, implementing our proposal, is available on GitHub, and it is also accessible via a REST API.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
16.
Zurück zum Zitat Dredze M, McNamee P, Rao D, Gerber A, Finin T (2010) Entity disambiguation for knowledge base population. In: Proceedings of the 23rd international conference on computational linguistics. Association for Computational Linguistics, pp 277–285 Dredze M, McNamee P, Rao D, Gerber A, Finin T (2010) Entity disambiguation for knowledge base population. In: Proceedings of the 23rd international conference on computational linguistics. Association for Computational Linguistics, pp 277–285
22.
Zurück zum Zitat Guo S, Chang MW, Kiciman E (2013) To link or not to link? A study on end-to-end tweet entity linking. In: Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 1020–1030 Guo S, Chang MW, Kiciman E (2013) To link or not to link? A study on end-to-end tweet entity linking. In: Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 1020–1030
29.
Zurück zum Zitat Hoffart J, Yosef MA, Bordino I, Fürstenau H, Pinkal M, Spaniol M, Taneva B, Thater S, Weikum G (2011) Robust disambiguation of named entities in text. In: Proceedings of the 2011 conference on empirical methods in natural language processing, pp 782–792 Hoffart J, Yosef MA, Bordino I, Fürstenau H, Pinkal M, Spaniol M, Taneva B, Thater S, Weikum G (2011) Robust disambiguation of named entities in text. In: Proceedings of the 2011 conference on empirical methods in natural language processing, pp 782–792
31.
Zurück zum Zitat Hogan A, Blomqvist E, Cochez M, d’Amato C, de Melo G, Gutierrez C, Gayo JEL, Kirrane S, Neumaier S, Polleres A, Navigli R, Ngomo ACN, Rashid SM, Rula A, Schmelzeisen L, Sequeda J, Staab S, Zimmermann A (2021) Knowledge graphs. Synth Lect Data Semantics Knowledge 12(2):1–257 Hogan A, Blomqvist E, Cochez M, d’Amato C, de Melo G, Gutierrez C, Gayo JEL, Kirrane S, Neumaier S, Polleres A, Navigli R, Ngomo ACN, Rashid SM, Rula A, Schmelzeisen L, Sequeda J, Staab S, Zimmermann A (2021) Knowledge graphs. Synth Lect Data Semantics Knowledge 12(2):1–257
37.
Zurück zum Zitat Logeswaran L, Chang MW, Lee K, Toutanova K, Devlin J, Lee H (2019) Zero-shot entity linking by reading entity descriptions. ArXiv preprint arXiv:1906.07348 Logeswaran L, Chang MW, Lee K, Toutanova K, Devlin J, Lee H (2019) Zero-shot entity linking by reading entity descriptions. ArXiv preprint arXiv:​1906.​07348
42.
Zurück zum Zitat Michel F, Gandon F, Ah-Kane V, Bobasheva A, Cabrio E, Corby O, Gazzotti R, Giboin A, Marro S, Mayer T, et al. (2020) Covid-on-the-web: knowledge graph and services to advance covid-19 research. In: International semantic web conference. Springer, pp 294–310. https://doi.org/10.1007/978-3-030-62466-8_19 Michel F, Gandon F, Ah-Kane V, Bobasheva A, Cabrio E, Corby O, Gazzotti R, Giboin A, Marro S, Mayer T, et al. (2020) Covid-on-the-web: knowledge graph and services to advance covid-19 research. In: International semantic web conference. Springer, pp 294–310. https://​doi.​org/​10.​1007/​978-3-030-62466-8_​19
47.
Zurück zum Zitat Parravicini A, Patra R, Bartolini DB, Santambrogio MD (2019) Fast and accurate entity linking via graph embedding. In: Proceedings of the 2nd joint international workshop on graph data management experiences and systems (GRADES) and network data analytics (NDA), pp 1–9. https://doi.org/10.1145/3327964.3328499 Parravicini A, Patra R, Bartolini DB, Santambrogio MD (2019) Fast and accurate entity linking via graph embedding. In: Proceedings of the 2nd joint international workshop on graph data management experiences and systems (GRADES) and network data analytics (NDA), pp 1–9. https://​doi.​org/​10.​1145/​3327964.​3328499
48.
Zurück zum Zitat Pohl K, Böckle G, van Der Linden FJ (2005) Software product line engineering: foundations, principles and techniques. Springer Pohl K, Böckle G, van Der Linden FJ (2005) Software product line engineering: foundations, principles and techniques. Springer
50.
Zurück zum Zitat Sakor A, Mulang IO, Singh K, Shekarpour S, Vidal ME, Lehmann J, Auer S (2019) Old is gold: linguistic driven approach for entity and relation linking of short text. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1, pp 2336–2346. https://doi.org/10.18653/v1/N19-1243 Sakor A, Mulang IO, Singh K, Shekarpour S, Vidal ME, Lehmann J, Auer S (2019) Old is gold: linguistic driven approach for entity and relation linking of short text. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1, pp 2336–2346. https://​doi.​org/​10.​18653/​v1/​N19-1243
51.
Zurück zum Zitat Sevgili Ö, Panchenko A, Biemann C (2019) Improving neural entity disambiguation with graph embeddings. In: Proceedings of the 57th annual meeting of the association for computational linguistics: student research workshop, pp 315–322. https://doi.org/10.18653/v1/P19-2044 Sevgili Ö, Panchenko A, Biemann C (2019) Improving neural entity disambiguation with graph embeddings. In: Proceedings of the 57th annual meeting of the association for computational linguistics: student research workshop, pp 315–322. https://​doi.​org/​10.​18653/​v1/​P19-2044
52.
Zurück zum Zitat Sevgili O, Shelmanov A, Arkhipov M, Panchenko A, Biemann C (2020) Neural entity linking: a survey of models based on deep learning. ArXiv preprint arXiv:2006.00575 Sevgili O, Shelmanov A, Arkhipov M, Panchenko A, Biemann C (2020) Neural entity linking: a survey of models based on deep learning. ArXiv preprint arXiv:​2006.​00575
55.
Zurück zum Zitat Singh K, Radhakrishna AS, Both A, Shekarpour S, Lytra I, Usbeck R, Vyas A, Khikmatullaev A, Punjani D, Lange C, et al. (2018) Why reinvent the wheel: let’s build question answering systems together. In: Proceedings of the 2018 world wide web conference, pp 1247–1256. https://doi.org/10.1145/3178876.3186023 Singh K, Radhakrishna AS, Both A, Shekarpour S, Lytra I, Usbeck R, Vyas A, Khikmatullaev A, Punjani D, Lange C, et al. (2018) Why reinvent the wheel: let’s build question answering systems together. In: Proceedings of the 2018 world wide web conference, pp 1247–1256. https://​doi.​org/​10.​1145/​3178876.​3186023
57.
Zurück zum Zitat Usbeck R, Gusmita RH, Ngomo AN, Saleem M (2018) 9th challenge on question answering over linked data (QALD-9). In: SemDeep-4 and NLIWOD-4 and ISWC vol 2241, pp 58-64. CEUR-WS.org Usbeck R, Gusmita RH, Ngomo AN, Saleem M (2018) 9th challenge on question answering over linked data (QALD-9). In: SemDeep-4 and NLIWOD-4 and ISWC vol 2241, pp 58-64. CEUR-WS.org
58.
Zurück zum Zitat Usbeck R, Ngomo ACN, Conrads F, Röder M, Napolitano G (2018) 8th challenge on question answering over linked data (qald-8). Language 7:1 Usbeck R, Ngomo ACN, Conrads F, Röder M, Napolitano G (2018) 8th challenge on question answering over linked data (qald-8). Language 7:1
Metadaten
Titel
Named entity disambiguation in short texts over knowledge graphs
verfasst von
Wissem Bouarroudj
Zizette Boufaida
Ladjel Bellatreche
Publikationsdatum
03.01.2022
Verlag
Springer London
Erschienen in
Knowledge and Information Systems / Ausgabe 2/2022
Print ISSN: 0219-1377
Elektronische ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-021-01642-9

Weitere Artikel der Ausgabe 2/2022

Knowledge and Information Systems 2/2022 Zur Ausgabe

Premium Partner