skip to main content
10.1145/3342558.3345393acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
research-article

Using Knowledge Base Semantics in Context-Aware Entity Linking

Published:23 September 2019Publication History

ABSTRACT

Entity linking is a core task in textual document processing, which consists in identifying the entities of a knowledge base (KB) that are mentioned in a text. Approaches in the literature consider either independent linking of individual mentions or collective linking of all mentions. Regardless of this distinction, most approaches rely on the Wikipedia encyclopedic KB in order to improve the linking quality, by exploiting its entity descriptions (web pages) or its entity interconnections (hyperlink graph of web pages). In this paper, we devise a novel collective linking technique which departs from most approaches in the literature by relying on a structured RDF KB. This allows exploiting the semantics of the interrelationships that candidate entities may have at disambiguation time rather than relying on raw structural approximation based on Wikipedia's hyperlink graph. The few approaches that also use an RDF KB simply rely on the existence of a relation between the candidate entities to which mentions may be linked. Instead, we weight such relations based on the RDF KB structure and propose an efficient decoding strategy for collective linking. Experiments on standard benchmarks show significant improvement over the state of the art.

References

  1. Ayman Alhelbawy and Robert Gaizauskas. 2014. Graph ranking for collective named entity disambiguation. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Vol. 2. 75--80.Google ScholarGoogle ScholarCross RefCross Ref
  2. Roi Blanco, Giuseppe Ottaviano, and Edgar Meij. 2015. Fast and space-efficient entity linking for queries. In Proceedings of the 8th ACM International Conference on Web Search and Data Mining. 179--188.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5 (2017), 135--146.Google ScholarGoogle ScholarCross RefCross Ref
  4. Kurt Bollacker, Colin Evans, Praveen Paritosh, Tim Sturge, and Jamie Taylor. 2008. Freebase: a collaboratively created graph database for structuring human knowledge. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. 1247--1250.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Razvan Bunescu and Marius Paşca. 2006. Using encyclopedic knowledge for named entity disambiguation. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics. 9--16.Google ScholarGoogle Scholar
  6. Yixin Cao, Lei Hou, Juanzi Li, and Zhiyuan Liu. 2018. Neural collective entity linking. In Proceedings of the 27th International Conference on Computational Linguistics. 675--686.Google ScholarGoogle Scholar
  7. Diego Ceccarelli, Claudio Lucchese, Salvatore Orlando, Raffaele Perego, and Salvatore Trani. 2013. Learning relatedness measures for entity linking. In Proceedings of the 22nd ACM International Conference on Information & Knowledge Management. 139--148.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Silviu Cucerzan. 2007. Large-scale named entity disambiguation based on Wikipedia data. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 708--716.Google ScholarGoogle Scholar
  9. Mark Dredze, Paul McNamee, Delip Rao, Adam Gerber, and Tim Finin. 2010. Entity disambiguation for knowledge base population. In Proceedings of the 23rd International Conference on Computational Linguistics. 277--285.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Greg Durrett and Dan Klein. 2014. A joint model for entity analysis: coreference, typing, and linking. Transactions of the Association for Computational Linguistics 2 (2014), 477--490.Google ScholarGoogle ScholarCross RefCross Ref
  11. John R Firth. 1957. A synopsis of linguistic theory, 1930--1955. Studies in linguistic analysis (1957).Google ScholarGoogle Scholar
  12. Matthew Francis-Landau, Greg Durrett, and Dan Klein. 2016. Capturing semantic similarity for entity linking with convolutional neural networks. In Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 1256--1261.Google ScholarGoogle ScholarCross RefCross Ref
  13. Octavian-Eugen Ganea, Marina Ganea, Aurelien Lucchi, Carsten Eickhoff, and Thomas Hofmann. 2016. Probabilistic bag-of-hyperlinks model for entity linking. In Proceedings of the 25th International Conference on World Wide Web. 927--938.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Octavian-Eugen Ganea and Thomas Hofmann. 2017. Deep joint entity disambiguation with local neural attention. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2619--2629.Google ScholarGoogle ScholarCross RefCross Ref
  15. Nitish Gupta, Sameer Singh, and Dan Roth. 2017. Entity linking via joint encoding of types, descriptions, and context. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2681--2690.Google ScholarGoogle ScholarCross RefCross Ref
  16. Johannes Hoffart, Mohamed Amir Yosef, Ilaria Bordino, Hagen Fürstenau, Manfred Pinkal, Marc Spaniol, Bilyana Taneva, Stefan Thater, and Gerhard Weikum. 2011. Robust disambiguation of named entities in text. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. 782--792.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Hongzhao Huang, Larry Heck, and Heng Ji. 2015. Leveraging deep neural networks and knowledge graphs for entity disambiguation. arXiv preprint arXiv:1504.07678 (2015).Google ScholarGoogle Scholar
  18. Heng Ji and Nothman. 2016. Overview of TAC-KBP2016 tri-lingual EDL and its impact on end-to-end cold-start KBP. Proceedings of the 2016 Text Analysis Conference (2016).Google ScholarGoogle Scholar
  19. Heng Ji, Joel Nothman, Ben Hachey, and Radu Florian. 2015. Overview of TAC-KBP2015 tri-lingual entity discovery and linking. In Proceedings of the 2015 Text Analysis Conference.Google ScholarGoogle Scholar
  20. Heng Ji, Xiaoman Pan, Boliang Zhang, Joel Nothman, James Mayfield, Paul McNamee, and Cash Costello. 2017. Overview of TAC-KBP2017 13 languages entity discovery and linking. In Proceedings of the 2017 Text Analysis Conference.Google ScholarGoogle Scholar
  21. Nikolaos Kolitsas, Octavian-Eugen Ganea, and Thomas Hofmann. 2018. End-to-end neural entity linking. In Proceedings of the 22nd Conference on Computational Natural Language Learning. 519--529.Google ScholarGoogle ScholarCross RefCross Ref
  22. Phong Le and Ivan Titov. 2018. Improving entity linking by modeling latent relations between mentions. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 1595--1604.Google ScholarGoogle ScholarCross RefCross Ref
  23. Xiao Ling, Sameer Singh, and Daniel S Weld. 2015. Design challenges for entity linking. Transactions of the Association for Computational Linguistics 3 (2015), 315--328.Google ScholarGoogle ScholarCross RefCross Ref
  24. Ming Liu, Gu Gong, Bing Qin, and Ting Liu. 2019. A multi-view-based collective entity linking method. ACM Transactions on Information Systems. 37, 2, Article 23, 29 pages. https://doi.org/10.1145/3300197Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Weiming Lu, Yangfan Zhou, Haijiao Lu, Pengkun Ma, Zhenyu Zhang, and Baogang Wei. 2017. Boosting collective entity linking via type-guided semantic embedding. In Proceedings of the National CCF Conference on Natural Language Processing and Chinese Computing. 541--553.Google ScholarGoogle Scholar
  26. Pablo N. Mendes, Max Jakob, Andrés García-Silva, and Christian Bizer. 2011. DBpedia spotlight: shedding light on the web of documents. In Proceedings of the 7th International Conference on Semantic Systems. 1--8.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of Advances in Neural Information Processing Systems. 3111--3119.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. David Milne and Ian H Witten. 2008. Learning to link with Wikipedia. In Proceedings of the 17th ACM Conference on Information and Knowledge Management. 509--518.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Jose G. Moreno, Romaric Besançon, Romain Beaumont, Eva D'hondt, Anne-Laure Ligozat, Sophie Rosset, Xavier Tannier, and Brigitte Grau. 2017. Combining word and entity embeddings for entity linking. In Proceedings of European Semantic Web Conference. 337--352.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Andrea Moro, Alessandro Raganato, and Roberto Navigli. 2014. Entity linking meets word sense disambiguation: a unified approach. Transactions of the Association for Computational Linguistics 2 (2014), 231--244.Google ScholarGoogle ScholarCross RefCross Ref
  31. Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 1532--1543.Google ScholarGoogle ScholarCross RefCross Ref
  32. Minh C. Phan, Aixin Sun, Yi Tay, Jialong Han, and Chenliang Li. 2018. Pair-linking for collective entity disambiguation: Two could be better than all. IEEE Transactions on Knowledge and Data Engineering.Google ScholarGoogle Scholar
  33. Michael Röder, Ricardo Usbeck, Sebastian Hellmann, Daniel Gerber, and Andreas Both. 2014. N3- A collection of datasets for named entity recognition and disambiguation in the NLP interchange format. In Proceedings of the 9th International Conference on Language Resources and Evaluation. 3529--3533.Google ScholarGoogle Scholar
  34. Wei Shen, Jianyong Wang, and Jiawei Han. 2015. Entity linking with a knowledge base: Issues, techniques, and solutions. IEEE Transactions on Knowledge and Data Engineering 27, 2 (2015), 443--460.Google ScholarGoogle ScholarCross RefCross Ref
  35. Valentin I. Spitkovsky and Angel X. Chang. 2012. A cross-lingual dictionary for English Wikipedia concepts. In Proceedings of the 8th International Conference on Language Resources and Evaluation. 3168--3175.Google ScholarGoogle Scholar
  36. Ricardo Usbeck, Axel-Cyrille Ngonga Ngomo, Michael Röder, Daniel Gerber, Sandro Athaide Coelho, Sören Auer, and Andreas Both. 2014. AGDISTIS - Graph-based disambiguation of named entities using linked data. In Proceedings of the International Semantic Web Conference. 457--471.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Ricardo Usbeck, Michael Röder, Axel-Cyrille Ngonga Ngomo, Ciro Baron, Andreas Both, Martin Brümmer, Diego Ceccarelli, Marco Cornolti, Didier Cherix, Bernd Eickmann, et al. 2015. GERBIL: General entity annotator benchmarking framework. In Proceedings of the 24th International Conference on World Wide Web. 1133--1143.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. W3C. 2014. RDF 1.1 Concepts and Abstract Syntax. https://www.w3.org/TR/rdf11-concepts.Google ScholarGoogle Scholar
  39. W3C. 2014. RDF 1.1 Primer. https://www.w3.org/TR/rdf11-primer.Google ScholarGoogle Scholar
  40. W3C. 2014. RDF 1.1 Semantics. https://www.w3.org/TR/rdf11-mt.Google ScholarGoogle Scholar
  41. Han Wang, Jin Guang Zheng, Xiaogang Ma, Peter Fox, and Heng Ji. 2015. Language and domain independent entity linking with quantified collective validation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 695--704.Google ScholarGoogle ScholarCross RefCross Ref
  42. Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, and Yoshiyasu Takefuji. 2016. Joint learning of the embedding of words and entities for named entity disambiguation. In Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning. 250--259. https://doi.org/10.18653/v1/k16-1025Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Using Knowledge Base Semantics in Context-Aware Entity Linking

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          DocEng '19: Proceedings of the ACM Symposium on Document Engineering 2019
          September 2019
          254 pages
          ISBN:9781450368872
          DOI:10.1145/3342558

          Copyright © 2019 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 23 September 2019

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          DocEng '19 Paper Acceptance Rate30of77submissions,39%Overall Acceptance Rate178of537submissions,33%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader