ABSTRACT
Real-world factoid or list questions often have a simple structure, yet are hard to match to facts in a given knowledge base due to high representational and linguistic variability. For example, to answer "who is the ceo of apple" on Freebase requires a match to an abstract "leadership" entity with three relations "role", "organization" and "person", and two other entities "apple inc" and "managing director". Recent years have seen a surge of research activity on learning-based solutions for this method. We further advance the state of the art by adopting learning-to-rank methodology and by fully addressing the inherent entity recognition problem, which was neglected in recent works.
We evaluate our system, called Aqqu, on two standard benchmarks, Free917 and WebQuestions, improving the previous best result for each benchmark considerably. These two benchmarks exhibit quite different challenges, and many of the existing approaches were evaluated (and work well) only for one of them. We also consider efficiency aspects and take care that all questions can be answered interactively (that is, within a second). Materials for full reproducibility are available on our website: http://ad.informatik.uni-freiburg.de/publications.
- H. Bast, F. Bäurle, B. Buchhold, and E. Haussmann. Broccoli: Semantic full-text search at your fingertips. CoRR, abs/1207.2615, 2012.Google Scholar
- J. Berant, A. Chou, R. Frostig, and P. Liang. Semantic Parsing on Freebase from Question-Answer Pairs. In EMNLP, pages 1533--1544, 2013.Google Scholar
- J. Berant and P. Liang. Semantic Parsing via Paraphrasing. In ACL, pages 1415--1425, 2014.Google ScholarCross Ref
- A. Bordes, S. Chopra, and J. Weston. Question Answering with Subgraph Embeddings. CoRR, abs/1406.3676, 2014.Google Scholar
- L. Breiman. Random forests. Machine Learning, 45(1):5--32, 2001. Google ScholarDigital Library
- C. J. C. Burges, R. Ragno, and Q. V. Le. Learning to rank with nonsmooth cost functions. In NIPS, pages 193--200, 2006.Google ScholarDigital Library
- Q. Cai and A. Yates. Large-scale Semantic Parsing via Schema Matching and Lexicon Extension. In ACL, pages 423--433, 2013.Google Scholar
- A. X. Chang and C. D. Manning. Sutime: A library for recognizing and normalizing time expressions. In LREC, pages 3735--3740, 2012.Google Scholar
- ClueWeb, 2012. The Lemur Projekt.Google Scholar
- W. W. Cohen, R. E. Schapire, and Y. Singer. Learning to order things. JAIR, 10:243--270, 1999. Google ScholarDigital Library
- C. Fellbaum. WordNet. Wiley Online Library, 1998.Google Scholar
- Y. Freund, R. D. Iyer, R. E. Schapire, and Y. Singer. An efficient boosting algorithm for combining preferences. JMLR, 4:933--969, 2003. Google ScholarDigital Library
- E. Gabrilovich, M. Ringgaard, and A. Subramanya. FACC1: Freebase annotation of ClueWeb corpora, Version 1.Google Scholar
- T. Joachims. Optimizing search engines using clickthrough data. In KDD, pages 133--142, 2002. Google ScholarDigital Library
- T. Kwiatkowski, E. Choi, Y. Artzi, and L. S. Zettlemoyer. Scaling Semantic Parsers with On-the-Fly Ontology Matching. In EMNLP, pages 1545--1556, 2013.Google Scholar
- T. Liu. Learning to rank for information retrieval. Foundations and Trends in Information Retrieval, 3(3):225--331, 2009. Google ScholarDigital Library
- C. D. Manning, M. Surdeanu, J. Bauer, J. R. Finkel, S. Bethard, and D. McClosky. The stanford corenlp natural language processing toolkit. In ACL, pages 55--60, 2014.Google ScholarCross Ref
- M. Mintz, S. Bills, R. Snow, and D. Jurafsky. Distant supervision for relation extraction without labeled data. In ACL, pages 1003--1011, 2009. Google ScholarDigital Library
- S. Reddy, M. Lapata, and M. Steedman. Large-scale Semantic Parsing without Question-Answer Pairs. TACL, 2:377--392, 2014.Google ScholarCross Ref
- V. I. Spitkovsky and A. X. Chang. A Cross-Lingual Dictionary for English Wikipedia Concepts. In LREC, pages 3168--3175, 2012.Google Scholar
- M. Steedman. The syntactic process, volume 35. MIT Press, 2000. Google ScholarDigital Library
- C. Unger, C. Forascu, V. Lopez, A. N. Ngomo, E. Cabrio, P. Cimiano, and S. Walter. Question answering over linked data (QALD-4). In CLEF 2014, pages 1172--1180, 2014.Google Scholar
- J. Xu and H. Li. Adarank: a boosting algorithm for information retrieval. In SIGIR, pages 391--398, 2007. Google ScholarDigital Library
- X. Yao, J. Berant, and B. V. Durme. Freebase QA: Information Extraction or Semantic Parsing? In ACL, Workshop on Semantic Parsing, 2014.Google ScholarCross Ref
- X. Yao and B. V. Durme. Information Extraction over Structured Data: Question Answering with Freebase. In ACL, pages 956--966, 2014.Google ScholarCross Ref
- C. Zhu, R. H. Byrd, P. Lu, and J. Nocedal. Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization. ACM Trans. Math. Softw., 23(4):550--560, 1997. Google ScholarDigital Library
Index Terms
- More Accurate Question Answering on Freebase
Recommendations
Entity Disambiguation with Freebase
WI-IAT '12: Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01Entity disambiguation with a knowledge base becomes increasingly popular in the NLP community. In this paper, we employ Freebase as the knowledge base, which contains significantly more entities than Wikipedia and others. While huge in size, Freebase ...
The neofonie NERD system at the ERD challenge 2014
ERD '14: Proceedings of the first international workshop on Entity recognition & disambiguationThis paper describes Neofonie NERD, our Named Entity Recognition and Disambiguation system submitted to the ERD Challenge 2014. The system uses a vector space model approach for disambiguation, based on the link structure of Freebase, in combination ...
Easy access to the freebase dataset
WWW '14 Companion: Proceedings of the 23rd International Conference on World Wide WebWe demonstrate a system for fast and intuitive exploration of the Freebase dataset. This required solving several non-trivial problems, including: entity scores for proper ranking and name disambiguation, a unique meaningful name for every entity and ...
Comments