skip to main content
10.1145/2348283.2348383acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Structural relationships for large-scale learning of answer re-ranking

Published:12 August 2012Publication History

ABSTRACT

Supervised learning applied to answer re-ranking can highly improve on the overall accuracy of question answering (QA) systems. The key aspect is that the relationships and properties of the question/answer pair composed of a question and the supporting passage of an answer candidate, can be efficiently compared with those captured by the learnt model.

In this paper, we define novel supervised approaches that exploit structural relationships between a question and their candidate answer passages to learn a re-ranking model. We model structural representations of both questions and answers and their mutual relationships by just using an off-the-shelf shallow syntactic parser. We encode structures in Support Vector Machines (SVMs) by means of sequence and tree kernels, which can implicitly represent question and answer pairs in huge feature spaces. Such models together with the latest approach to fast kernel-based learning enabled the training of our rerankers on hundreds of thousands of instances, which previously rendered intractable for kernelized SVMs. The results on two different QA datasets, e.g., Answerbag and Jeopardy! data, show that our models deliver large improvement on passage re-ranking tasks, reducing the error in Recall of BM25 baseline by about 18%. One of the key findings of this work is that, despite its simplicity, shallow syntactic trees allow for learning complex relational structures, which exhibits a steep learning curve with the increase in the training size.

References

  1. M. Bilotti, P. Ogilvie, J. Callan, and E. Nyberg. Structured retrieval for Question Answering. In Proceedings of ACM SIGIR, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. W. Bilotti, J. L. Elsas, J. Carbonell, and E. Nyberg. Rank learning for factoid Question Answering with linguistic and semantic constraints. In Proc. of CIKM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. W. Bilotti and E. Nyberg. Improving text retrieval precision and answer accuracy in question answering systems. In Proc. of IR4QA at COLING, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Blair-Goldensohn, K. R. McKeown, and A. H. Schlaikjer. Answering definitional questions: A hybrid approach. In Proc. of AAAI, 2004.Google ScholarGoogle Scholar
  5. N. Cancedda, E. Gaussier, C. Goutte, and J. M. Renders. Word sequence kernels. JMLR, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Y. Chen, M. Zhou, and S. Wang. Reranking answers from definitional QA using language models. In Proc. of ACL, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Ciaramita and Y. Altun. Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger. In Proc. EMNLP, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Collins and N. Duffy. New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron. In Proc. of ACL, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. H. Cui, M. Kan, and T. Chua. Generic soft pattern models for definitional QA. In Proc. of SIGIR, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Echihabi and D. Marcu. A noisy-channel approach to question answering. In Proc. of ACL, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. Ferrucci. Build watson: an overview of deepqa for the Jeopardy! challenge. In Proc. of PACT, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A.-M. Giuglea and A. Moschitti. Knowledge Discovering using FrameNet, VerbNet and PropBank. In Proc. of Ontology and Knowledge Discovering at ECML 2004, Pisa, Italy, 2004.Google ScholarGoogle Scholar
  13. A.-M. Giuglea and A. Moschitti. Semantic Role Labeling via Framenet, Verbnet and Propbank. In Proc. of ACL, Sydney, Australia, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Hickl, J. Williams, J. Bensley, K. Roberts, Y. Shi, and B. Rink. Question answering with LCC chaucer at trec 2006. In Proc. of TREC, 2006.Google ScholarGoogle Scholar
  15. J. Jeon, W. B. Croft, and J. H. Lee. Finding similar questions in large question and answer archives. In Proc. of CIKM, NY, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. T. Joachims. Making large-scale SVM learning practical. In B. Sch$\ddoto$lkopf, C. Burges, and A. Smola, editors, Advances in Kernel Methods, 1999.Google ScholarGoogle Scholar
  17. T. Kudo and Y. Matsumoto. Fast methods for kernel-based text analysis. In Proc. of ACL, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Y. Mehdad, A. Moschitti, and F. M. Zanzotto. Syntactic/semantic structures for textual entailment recognition. In HLT-NAACL, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Moschitti. A study on convolution kernels for shallow semantic parsing. In Proc. of ACL, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. Moschitti. Efficient convolution kernels for dependency and constituent syntactic trees. In Proceedings of ECML, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A. Moschitti. Kernel methods, syntax and semantics for relational text categorization. In Proc. of CIKM, NY, USA, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Moschitti and C. Bejan. A semantic kernel for predicate argument classification. In Proc. of CoNLL, Boston, MA, USA, 2004.Google ScholarGoogle Scholar
  23. A. Moschitti, S. Quarteroni, R. Basili, and S. Manandhar. Exploiting syntactic and shallow semantic kernels for question/answer classification. In Proc. of ACL, 2007.Google ScholarGoogle Scholar
  24. T.-V. T. Nguyen and A. Moschitti. Joint distant and direct supervision for relation extraction. In Proc. IJCNLP, Chiang Mai, Thailand, 2011.Google ScholarGoogle Scholar
  25. F. Radlinski and T. Joachims. Query chains: Learning to rank from implicit feedback. CoRR, 2006.Google ScholarGoogle Scholar
  26. Y. Sasaki. Question answering as question-biased term extraction: A new approach toward multilingual QA. In Proc. of ACL, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. A. Severyn and A. Moschitti. Large-scale support vector learning with structural kernels. In ECML/PKDD, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. Shawe-Taylor and N. Cristianini. Kernel Methods for Pattern Analysis. Cambridge Univ. Press, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. D. Shen and M. Lapata. Using semantic roles to improve question answering. In Proc. of EMNLP-CoNLL, 2007.Google ScholarGoogle Scholar
  30. L. Shen, A. Sarkar, and A. Joshi. Using LTAG Based Features in Parse Reranking. In EMNLP, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. S. Small, T. Strzalkowski, T. Liu, S. Ryan, R. Salkin, N. Shimizu, P. Kantor, D. Kelly, and N. Wacholder. Hitiqa: Towards analytical question answering. In Proc. of COLING, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. A. F. Smeaton. Using NLP or NLP resources for Information Retrieval tasks. In T. Strzalkowski, editor, Natural Language Information Retrieval. Kluwer Ac. Pub., Dordrecht, NL, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  33. T. Strzalkowski, J. P. Carballo, J. Karlgren, A. H. P. Tapanainen, and T. Jarvinen. Natural Language Information Retrieval: TREC-8 report. In Proc. of TREC, 1999.Google ScholarGoogle Scholar
  34. T. Strzalkowski, G. C. Stein, G. B. Wise, J. P. Carballo, P. Tapanainen, T. Jarvinen, A. Voutilainen, and J. Karlgren. Natural Language Information Retrieval: TREC-7 report. In Proc. of TREC, 1998.Google ScholarGoogle Scholar
  35. M. Surdeanu, M. Ciaramita, and H. Zaragoza. Learning to rank answers on large online QA collections. In Proc. of ACL-HLT, 2008.Google ScholarGoogle Scholar
  36. J. Suzuki, Y. Sasaki, and E. Maeda. SVM answer selection for open-domain Question Answering. In Proc. of Coling, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Y. Versley, A. Moschitti, M. Poesio, and X. Yang. Coreference systems based on kernels methods. In Coling, Manchester, England, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. E. M. Voorhees. Using Wordnet to disambiguate word senses for text retrieval. In Proc. of SIGIR, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. E. M. Voorhees. Query expansion using lexical-semantic relations. In Proc. of SIGIR, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. E. M. Voorhees. Overview of the trec 2004 question answering track. In Proc. of TREC 2004, 2004.Google ScholarGoogle Scholar
  41. C.-N. J. Yu and T. Joachims. Training structural SVMs with kernels using sampled cuts. In KDD, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. D. Zelenko, C. Aone, and A. Richardella. Kernel methods for relation extraction. JMLR, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Structural relationships for large-scale learning of answer re-ranking

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
      August 2012
      1236 pages
      ISBN:9781450314725
      DOI:10.1145/2348283

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 12 August 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader