research-article

Wikidata based Location Entity Linking

Authors:
Fathima Shanaz

Department of Computer Science and Engineering, South Eastern University of Sri Lanka, Oluvil, Sri Lanka

Department of Computer Science and Engineering, South Eastern University of Sri Lanka, Oluvil, Sri Lanka
View Profile

,
Roshan G. Ragel

Department of Computer Engineering, University of Peradeniya, Peradeniya, Sri Lanka

Department of Computer Engineering, University of Peradeniya, Peradeniya, Sri Lanka
View Profile

ICSCA '20: Proceedings of the 2020 9th International Conference on Software and Computer ApplicationsFebruary 2020Pages 307–312https://doi.org/10.1145/3384544.3384592

Published:17 April 2020Publication History

ICSCA '20: Proceedings of the 2020 9th International Conference on Software and Computer Applications

Pages 307–312

ABSTRACT

Online news reading has become general among people and suggesting relevant news articles to readers is a non-trivial task. News recommender systems (NRS) are built to provide appropriate stories to readers based on their interest. News articles usually contain mentions of persons, locations and other named entities which are excellent resources for making sense of readers' news interest. However, entity mentions are often ambiguous. It can make readers retrieve stories that are not relevant to them, impacting the performance of NRS. Entity linking (EL) is a task to extract mentions in documents, and then link them to their corresponding entities in a knowledge base (KB). This task is challenging due to name variations, high ambiguity of entity mentions and incompleteness of the KB. Several approaches have been proposed to tackle these challenges. However, current systems do not focus on improving the performance of EL on location entity mentions which are identified as far more informative entities in news article for user interest profiling. The goal of this paper is to present the design of location entity linking algorithms based on Wikidata KB. We propose new approaches to candidate entity generation and candidate entity ranking of the location EL task. We extensively evaluate the performance of our EL algorithms over a manually annotated AIDA-CoNLL testb news corpus. Experimental results show that our location EL method achieves top-1 precision of 95.58% which is much higher than the state-of-the-art results obtained on the same dataset by collective EL methods.

References

Abel, F., Gao, Q., Houben, G.-J. and Tao, K. 2011. Analyzing User Modeling on Twitter for Personalized News Recommendations. In International Conference on User Modeling, Adaptation and Personalization (UMAP). Springer.Google Scholar
Cucerzan, S. 2007. Large-Scale Named Entity Disambiguation Based on Wikipedia Data. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), 708--716.Google Scholar
Dredze, M., Mcnamee, P., Rao, D., Gerber, A. and Finin, T. 2010. Entity Disambiguation for Knowledge Base Population. 23rd International Conference on Computational Linguistics, 277--285.Google Scholar
van Erp, M., Mendes, P.N., Paulheim, H., Ilievski, F., Plu, J. and Rizzo, G. 2016. Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), 4373--4379.Google Scholar
Ganea, O.-E., Ganea, M., Lucchi, A., Eickhoff, C. and Hofmann, T. 2016. Probabilistic Bag-Of-Hyperlinks Model for Entity Linking. In Proceedings of the 25th International Conference on World Wide Web - WWW '16. ACM Press, New York, 927--938.Google Scholar
Geiß, J., Spitz, A. and Gertz, M. 2018. NECKAr: A Named Entity Classifier for Wikidata. International Conference of the German Society for Computational Linguistics and Language Technology. Springer, Cham, 115--129.Google Scholar
Hoffart, J., Yosef, M.A., Bordino, I., Fürstenau, H., Pinkal, M., Spaniol, M., et al. 2011. Robust Disambiguation of Named Entities in Text. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 782--792.Google Scholar
Inan, E. and Dikenelli, O. 2018. A Sequence Learning Method for Domain-Specific Entity Linking. In Proceedings of the Seventh Named Entities Workshop. Association for Computational Linguistics, Stroudsburg, PA, USA, 14--21.Google Scholar
Loper, E. and Bird, S. 2002. Nltk: The Natural Language Toolkit. In Proceedings of the ACL-02 Workshop on Effective tools and methodologies for teaching natural language processing and computational linguistics.Google Scholar
McKinney, W. 2011. pandas: a Foundational Python Library for Data Analysis and Statistics. Python for High Performance and Scientific Computing, 19, 583--591.Google Scholar
Nguyen, D.B., Hoffart, J., Theobald, M. and Weikum, G. 2014. AIDA-light: High-throughput named-entity disambiguation. In Proceedings of the Workshop on Linked Data on the Web (LDOW 2014), 1184.Google Scholar
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. 2011. Scikitlearn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825--2830.Google ScholarDigital Library
Pershina, M., He, Y. and Grishman, R. 2015. Personalized Page Rank for Named Entity Disambiguation. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 238--243.Google Scholar
Řehůřek, R. and Sojka, P. 2010. Software framework for topic modelling with large corpora. In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, 45--50.Google Scholar
Shen, W., Wang, J. and Han, J. 2015. Entity linking with a knowledge base: Issues, techniques, and solutions. IEEE Transactions on Knowledge and Data Engineering, 27, 443--460.Google ScholarCross Ref
Shen, W., Wang, J., Luo, P. and Wang, M. 2012. LINDEN: Linking Named Entities with Knowledge Base via Semantic Knowledge. In Proceedings of the 21st international conference on World Wide Web - WWW '12. ACM Press, New York, 449--458.Google Scholar
Taufer, B.P. and Straka, R.M. 2017. Named Entity Recognition and Linking. Master Thesis. Faculty of Mathematics and Physics, Charles University.Google Scholar
Wu, G., He, Y. and Hu, X. 2018. Entity Linking: An Issue to Extract Corresponding Entity with Knowledge Base. IEEE Access, 6, 6220--6231.Google ScholarCross Ref
Xiong, C., Liu, Z., Callan, J. and Hovy, E. 2017. JointSem: Combining Query Entity Linking and Entity based Document Ranking. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management - CIKM '1. ACM Press, New York, 2391--2394.Google Scholar
Zhang, W., Su, J., Tan, C.L. and Wang, W.T. 2010. Entity Linking Leveraging Automatically Generated Annotation. In Proceedings of the 23rd International Conference on Computational Linguistics, 1290--1298.Google Scholar
Zheng, Z., Li, F., Huang, M. and Zhu, X. 2010. Learning to Link Entities with Knowledge Base. The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, 483--491.Google Scholar

Index Terms

Wikidata based Location Entity Linking
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction

Recommendations

Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation
WWW '21: Companion Proceedings of the Web Conference 2021

Mathematical information retrieval (MathIR) applications such as semantic formula search and question answering systems rely on knowledge-bases that link mathematical expressions to their natural language names. For database population, mathematical ...
Read More
NILK: Entity Linking Dataset Targeting NIL-linking Cases
CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

The NIL-linking task in Entity Linking deals with cases where the text mentions do not have a corresponding entity in the associated knowledge base. NIL-linking has two sub-tasks: NIL-detection and NIL-disambiguation. NIL-detection identifies NIL-...
Read More
Re-ranking for joint named-entity recognition and linking
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge Management

Recognizing names and linking them to structured data is a fundamental task in text analysis. Existing approaches typically perform these two steps using a pipeline architecture: they use a Named-Entity Recognition (NER) system to find the boundaries of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ICSCA '20: Proceedings of the 2020 9th International Conference on Software and Computer Applications
February 2020
382 pages
ISBN:9781450376655
DOI:10.1145/3384544

Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 17 April 2020
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Entity Linking
Entity Relatedness
Wikidata
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 206
  Total Downloads
- Downloads (Last 12 months)13
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Wikidata based Location Entity Linking

ICSCA '20: Proceedings of the 2020 9th International Conference on Software and Computer Applications

ABSTRACT

References

Cited By

Index Terms

Recommendations

Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation

NILK: Entity Linking Dataset Targeting NIL-linking Cases

Re-ranking for joint named-entity recognition and linking

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Wikidata based Location Entity Linking

ICSCA '20: Proceedings of the 2020 9th International Conference on Software and Computer Applications

ABSTRACT

References

Cited By

Index Terms

Recommendations

Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation

NILK: Entity Linking Dataset Targeting NIL-linking Cases

Re-ranking for joint named-entity recognition and linking

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media