skip to main content
10.1145/3384544.3384592acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicscaConference Proceedingsconference-collections
research-article

Wikidata based Location Entity Linking

Authors Info & Claims
Published:17 April 2020Publication History

ABSTRACT

Online news reading has become general among people and suggesting relevant news articles to readers is a non-trivial task. News recommender systems (NRS) are built to provide appropriate stories to readers based on their interest. News articles usually contain mentions of persons, locations and other named entities which are excellent resources for making sense of readers' news interest. However, entity mentions are often ambiguous. It can make readers retrieve stories that are not relevant to them, impacting the performance of NRS. Entity linking (EL) is a task to extract mentions in documents, and then link them to their corresponding entities in a knowledge base (KB). This task is challenging due to name variations, high ambiguity of entity mentions and incompleteness of the KB. Several approaches have been proposed to tackle these challenges. However, current systems do not focus on improving the performance of EL on location entity mentions which are identified as far more informative entities in news article for user interest profiling. The goal of this paper is to present the design of location entity linking algorithms based on Wikidata KB. We propose new approaches to candidate entity generation and candidate entity ranking of the location EL task. We extensively evaluate the performance of our EL algorithms over a manually annotated AIDA-CoNLL testb news corpus. Experimental results show that our location EL method achieves top-1 precision of 95.58% which is much higher than the state-of-the-art results obtained on the same dataset by collective EL methods.

References

  1. Abel, F., Gao, Q., Houben, G.-J. and Tao, K. 2011. Analyzing User Modeling on Twitter for Personalized News Recommendations. In International Conference on User Modeling, Adaptation and Personalization (UMAP). Springer.Google ScholarGoogle Scholar
  2. Cucerzan, S. 2007. Large-Scale Named Entity Disambiguation Based on Wikipedia Data. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), 708--716.Google ScholarGoogle Scholar
  3. Dredze, M., Mcnamee, P., Rao, D., Gerber, A. and Finin, T. 2010. Entity Disambiguation for Knowledge Base Population. 23rd International Conference on Computational Linguistics, 277--285.Google ScholarGoogle Scholar
  4. van Erp, M., Mendes, P.N., Paulheim, H., Ilievski, F., Plu, J. and Rizzo, G. 2016. Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), 4373--4379.Google ScholarGoogle Scholar
  5. Ganea, O.-E., Ganea, M., Lucchi, A., Eickhoff, C. and Hofmann, T. 2016. Probabilistic Bag-Of-Hyperlinks Model for Entity Linking. In Proceedings of the 25th International Conference on World Wide Web - WWW '16. ACM Press, New York, 927--938.Google ScholarGoogle Scholar
  6. Geiß, J., Spitz, A. and Gertz, M. 2018. NECKAr: A Named Entity Classifier for Wikidata. International Conference of the German Society for Computational Linguistics and Language Technology. Springer, Cham, 115--129.Google ScholarGoogle Scholar
  7. Hoffart, J., Yosef, M.A., Bordino, I., Fürstenau, H., Pinkal, M., Spaniol, M., et al. 2011. Robust Disambiguation of Named Entities in Text. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 782--792.Google ScholarGoogle Scholar
  8. Inan, E. and Dikenelli, O. 2018. A Sequence Learning Method for Domain-Specific Entity Linking. In Proceedings of the Seventh Named Entities Workshop. Association for Computational Linguistics, Stroudsburg, PA, USA, 14--21.Google ScholarGoogle Scholar
  9. Loper, E. and Bird, S. 2002. Nltk: The Natural Language Toolkit. In Proceedings of the ACL-02 Workshop on Effective tools and methodologies for teaching natural language processing and computational linguistics.Google ScholarGoogle Scholar
  10. McKinney, W. 2011. pandas: a Foundational Python Library for Data Analysis and Statistics. Python for High Performance and Scientific Computing, 19, 583--591.Google ScholarGoogle Scholar
  11. Nguyen, D.B., Hoffart, J., Theobald, M. and Weikum, G. 2014. AIDA-light: High-throughput named-entity disambiguation. In Proceedings of the Workshop on Linked Data on the Web (LDOW 2014), 1184.Google ScholarGoogle Scholar
  12. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. 2011. Scikitlearn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825--2830.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Pershina, M., He, Y. and Grishman, R. 2015. Personalized Page Rank for Named Entity Disambiguation. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 238--243.Google ScholarGoogle Scholar
  14. Řehůřek, R. and Sojka, P. 2010. Software framework for topic modelling with large corpora. In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, 45--50.Google ScholarGoogle Scholar
  15. Shen, W., Wang, J. and Han, J. 2015. Entity linking with a knowledge base: Issues, techniques, and solutions. IEEE Transactions on Knowledge and Data Engineering, 27, 443--460.Google ScholarGoogle ScholarCross RefCross Ref
  16. Shen, W., Wang, J., Luo, P. and Wang, M. 2012. LINDEN: Linking Named Entities with Knowledge Base via Semantic Knowledge. In Proceedings of the 21st international conference on World Wide Web - WWW '12. ACM Press, New York, 449--458.Google ScholarGoogle Scholar
  17. Taufer, B.P. and Straka, R.M. 2017. Named Entity Recognition and Linking. Master Thesis. Faculty of Mathematics and Physics, Charles University.Google ScholarGoogle Scholar
  18. Wu, G., He, Y. and Hu, X. 2018. Entity Linking: An Issue to Extract Corresponding Entity with Knowledge Base. IEEE Access, 6, 6220--6231.Google ScholarGoogle ScholarCross RefCross Ref
  19. Xiong, C., Liu, Z., Callan, J. and Hovy, E. 2017. JointSem: Combining Query Entity Linking and Entity based Document Ranking. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management - CIKM '1. ACM Press, New York, 2391--2394.Google ScholarGoogle Scholar
  20. Zhang, W., Su, J., Tan, C.L. and Wang, W.T. 2010. Entity Linking Leveraging Automatically Generated Annotation. In Proceedings of the 23rd International Conference on Computational Linguistics, 1290--1298.Google ScholarGoogle Scholar
  21. Zheng, Z., Li, F., Huang, M. and Zhu, X. 2010. Learning to Link Entities with Knowledge Base. The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, 483--491.Google ScholarGoogle Scholar

Index Terms

  1. Wikidata based Location Entity Linking

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICSCA '20: Proceedings of the 2020 9th International Conference on Software and Computer Applications
      February 2020
      382 pages
      ISBN:9781450376655
      DOI:10.1145/3384544

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 April 2020

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)13
      • Downloads (Last 6 weeks)3

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader