ABSTRACT
Wikification is the task of linking textual mentions in a document to articles in Wikipedia. It comprises three main steps: mention recognition, candidate generation, and entity linking. For candidate generation, existing methods use hyperlinks in Wikipedia or match a mention in the text against Wikipedia article titles. These methods may miss the correct target entity and thus fail to link the mention to Wikipedia. In this paper, we propose to use a mention as a query to Wikipedia's own search engine to find additional candidate articles. Moreover, for entity linking, we introduce new coreference heuristics and apply the incremental linking approach. Experiments show that the proposed method outperforms, or achieves results competitive with, some state-of-the-art systems, while being simpler and using fewer features.
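The candidate generation idea can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the `title_index` dictionary, and the injected `search_fn` callable are all hypothetical, and the URL builder assumes the public MediaWiki Action API (`action=query`, `list=search`) as one possible way to reach Wikipedia's search engine. The sketch only shows how search results could be appended to candidates obtained by title matching.

```python
from typing import Callable, Dict, Iterable, List
from urllib.parse import urlencode


def build_search_url(mention: str, limit: int = 10) -> str:
    """Build a MediaWiki full-text search request that uses the mention as the query.

    Assumption: English Wikipedia's Action API endpoint is the search backend.
    """
    params = {
        "action": "query",
        "list": "search",
        "srsearch": mention,
        "srlimit": limit,
        "format": "json",
    }
    return "https://en.wikipedia.org/w/api.php?" + urlencode(params)


def generate_candidates(
    mention: str,
    title_index: Dict[str, List[str]],
    search_fn: Callable[[str], Iterable[str]],
    limit: int = 10,
) -> List[str]:
    """Merge title-matched candidates with search-engine candidates.

    title_index: hypothetical map from a lowercased surface form to article titles
                 (standing in for title matching or hyperlink anchors).
    search_fn:   hypothetical callable returning article titles for a query,
                 e.g. a wrapper that fetches build_search_url(mention).
    limit:       maximum number of search results to consider.
    """
    matched = list(title_index.get(mention.lower(), []))
    searched = list(search_fn(mention))[:limit]

    candidates: List[str] = []
    seen = set()
    # Title matches come first; search results only add titles not seen yet.
    for title in matched + searched:
        if title not in seen:
            seen.add(title)
            candidates.append(title)
    return candidates
```

Passing the search function in as a parameter keeps the merging logic testable offline; in practice `search_fn` would wrap an HTTP request and read the titles from the JSON response.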
Index Terms
- Candidate Searching and Key Coreference Resolution for Wikification