ABSTRACT
Online news reading has become general among people and suggesting relevant news articles to readers is a non-trivial task. News recommender systems (NRS) are built to provide appropriate stories to readers based on their interest. News articles usually contain mentions of persons, locations and other named entities which are excellent resources for making sense of readers' news interest. However, entity mentions are often ambiguous. It can make readers retrieve stories that are not relevant to them, impacting the performance of NRS. Entity linking (EL) is a task to extract mentions in documents, and then link them to their corresponding entities in a knowledge base (KB). This task is challenging due to name variations, high ambiguity of entity mentions and incompleteness of the KB. Several approaches have been proposed to tackle these challenges. However, current systems do not focus on improving the performance of EL on location entity mentions which are identified as far more informative entities in news article for user interest profiling. The goal of this paper is to present the design of location entity linking algorithms based on Wikidata KB. We propose new approaches to candidate entity generation and candidate entity ranking of the location EL task. We extensively evaluate the performance of our EL algorithms over a manually annotated AIDA-CoNLL testb news corpus. Experimental results show that our location EL method achieves top-1 precision of 95.58% which is much higher than the state-of-the-art results obtained on the same dataset by collective EL methods.
- Abel, F., Gao, Q., Houben, G.-J. and Tao, K. 2011. Analyzing User Modeling on Twitter for Personalized News Recommendations. In International Conference on User Modeling, Adaptation and Personalization (UMAP). Springer.Google Scholar
- Cucerzan, S. 2007. Large-Scale Named Entity Disambiguation Based on Wikipedia Data. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), 708--716.Google Scholar
- Dredze, M., Mcnamee, P., Rao, D., Gerber, A. and Finin, T. 2010. Entity Disambiguation for Knowledge Base Population. 23rd International Conference on Computational Linguistics, 277--285.Google Scholar
- van Erp, M., Mendes, P.N., Paulheim, H., Ilievski, F., Plu, J. and Rizzo, G. 2016. Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), 4373--4379.Google Scholar
- Ganea, O.-E., Ganea, M., Lucchi, A., Eickhoff, C. and Hofmann, T. 2016. Probabilistic Bag-Of-Hyperlinks Model for Entity Linking. In Proceedings of the 25th International Conference on World Wide Web - WWW '16. ACM Press, New York, 927--938.Google Scholar
- Geiß, J., Spitz, A. and Gertz, M. 2018. NECKAr: A Named Entity Classifier for Wikidata. International Conference of the German Society for Computational Linguistics and Language Technology. Springer, Cham, 115--129.Google Scholar
- Hoffart, J., Yosef, M.A., Bordino, I., Fürstenau, H., Pinkal, M., Spaniol, M., et al. 2011. Robust Disambiguation of Named Entities in Text. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 782--792.Google Scholar
- Inan, E. and Dikenelli, O. 2018. A Sequence Learning Method for Domain-Specific Entity Linking. In Proceedings of the Seventh Named Entities Workshop. Association for Computational Linguistics, Stroudsburg, PA, USA, 14--21.Google Scholar
- Loper, E. and Bird, S. 2002. Nltk: The Natural Language Toolkit. In Proceedings of the ACL-02 Workshop on Effective tools and methodologies for teaching natural language processing and computational linguistics.Google Scholar
- McKinney, W. 2011. pandas: a Foundational Python Library for Data Analysis and Statistics. Python for High Performance and Scientific Computing, 19, 583--591.Google Scholar
- Nguyen, D.B., Hoffart, J., Theobald, M. and Weikum, G. 2014. AIDA-light: High-throughput named-entity disambiguation. In Proceedings of the Workshop on Linked Data on the Web (LDOW 2014), 1184.Google Scholar
- Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. 2011. Scikitlearn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825--2830.Google ScholarDigital Library
- Pershina, M., He, Y. and Grishman, R. 2015. Personalized Page Rank for Named Entity Disambiguation. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 238--243.Google Scholar
- Řehůřek, R. and Sojka, P. 2010. Software framework for topic modelling with large corpora. In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, 45--50.Google Scholar
- Shen, W., Wang, J. and Han, J. 2015. Entity linking with a knowledge base: Issues, techniques, and solutions. IEEE Transactions on Knowledge and Data Engineering, 27, 443--460.Google ScholarCross Ref
- Shen, W., Wang, J., Luo, P. and Wang, M. 2012. LINDEN: Linking Named Entities with Knowledge Base via Semantic Knowledge. In Proceedings of the 21st international conference on World Wide Web - WWW '12. ACM Press, New York, 449--458.Google Scholar
- Taufer, B.P. and Straka, R.M. 2017. Named Entity Recognition and Linking. Master Thesis. Faculty of Mathematics and Physics, Charles University.Google Scholar
- Wu, G., He, Y. and Hu, X. 2018. Entity Linking: An Issue to Extract Corresponding Entity with Knowledge Base. IEEE Access, 6, 6220--6231.Google ScholarCross Ref
- Xiong, C., Liu, Z., Callan, J. and Hovy, E. 2017. JointSem: Combining Query Entity Linking and Entity based Document Ranking. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management - CIKM '1. ACM Press, New York, 2391--2394.Google Scholar
- Zhang, W., Su, J., Tan, C.L. and Wang, W.T. 2010. Entity Linking Leveraging Automatically Generated Annotation. In Proceedings of the 23rd International Conference on Computational Linguistics, 1290--1298.Google Scholar
- Zheng, Z., Li, F., Huang, M. and Zhu, X. 2010. Learning to Link Entities with Knowledge Base. The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, 483--491.Google Scholar
Index Terms
- Wikidata based Location Entity Linking
Recommendations
Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation
WWW '21: Companion Proceedings of the Web Conference 2021Mathematical information retrieval (MathIR) applications such as semantic formula search and question answering systems rely on knowledge-bases that link mathematical expressions to their natural language names. For database population, mathematical ...
NILK: Entity Linking Dataset Targeting NIL-linking Cases
CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge ManagementThe NIL-linking task in Entity Linking deals with cases where the text mentions do not have a corresponding entity in the associated knowledge base. NIL-linking has two sub-tasks: NIL-detection and NIL-disambiguation. NIL-detection identifies NIL-...
Re-ranking for joint named-entity recognition and linking
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge ManagementRecognizing names and linking them to structured data is a fundamental task in text analysis. Existing approaches typically perform these two steps using a pipeline architecture: they use a Named-Entity Recognition (NER) system to find the boundaries of ...
Comments