skip to main content
10.1145/2983323.2983872acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper
Public Access

Noise-Contrastive Estimation for Answer Selection with Deep Neural Networks

Authors Info & Claims
Published:24 October 2016Publication History

ABSTRACT

We study answer selection for question answering, in which given a question and a set of candidate answer sentences, the goal is to identify the subset that contains the answer. Unlike previous work which treats this task as a straightforward pointwise classification problem, we model this problem as a ranking task and propose a pairwise ranking approach that can directly exploit existing pointwise neural network models as base components. We extend the Noise-Contrastive Estimation approach with a triplet ranking loss function to exploit interactions in triplet inputs over the question paired with positive and negative examples. Experiments on TrecQA and WikiQA datasets show that our approach achieves state-of-the-art effectiveness without the need for external knowledge sources or feature engineering.

References

  1. ACL. Question answering (state of the art), http://aclweb.org/aclwiki/index.php?title=Question_Answering_(State_of_the_art), accessed Aug., 18, 2016.Google ScholarGoogle Scholar
  2. J. Bromley, J. W. Bentz, L. Bottou, I. Guyon, Y. LeCun, C. Moore, E. Sackinger, and R. Shah. Signature verification using a "siamese" time delay neural network. IJPRAI, 1993.Google ScholarGoogle ScholarCross RefCross Ref
  3. H. He, K. Gimpel, and J. Lin. Multi-perspective sentence similarity modeling with convolutional neural networks. EMNLP, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  4. H. He and J. Lin. Pairwise word interaction modeling with deep neural networks for semantic similarity measurement. NAACL, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  5. H. He, J. Wieting, K. Gimpel, J. Rao, and J. Lin. UMD-TTIC-UW at SemEval-2016 task 1: Attention-based multi-perspective convolutional neural networks for textual similarity measurement. SemEval, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  6. Y. Miao, L. Yu, and P. Blunsom. Neural variational inference for text processing. arXiv:1511.06038, 2015.Google ScholarGoogle Scholar
  7. T. Mikolov and J. Dean. Distributed representations of words and phrases and their compositionality. NIPS, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Pennington, R. Socher, and C. D. Manning. GloVe: Global vectors for word representation. EMNLP, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  9. C. d. Santos, M. Tan, B. Xiang, and B. Zhou. Attentive pooling networks. arXiv:1602.03609, 2016.Google ScholarGoogle Scholar
  10. A. Severyn and A. Moschitti. Learning to rank short text pairs with convolutional deep neural networks. SIGIR, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. Tellex, B. Katz, J. Lin, G. Marton, and A. Fernandes. Quantitative evaluation of passage retrieval algorithms for question answering. SIGIR, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. E. M. Voorhees. Overview of the TREC 2002 question answering track. TREC, 2002.Google ScholarGoogle Scholar
  13. D. Wang and E. Nyberg. Carnegie Mellon University OAQA at TREC 2015 LiveQA: Discovering the right answer with clues. TREC, 2015.Google ScholarGoogle Scholar
  14. D. Wang and E. Nyberg. A long short-term memory model for answer sentence selection in question answering. ACL, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  15. M. Wang, N. A. Smith, and T. Mitamura. What is the Jeopardy model? A quasi-synchronous grammar for QA. EMNLP-CoNLL, 2007.Google ScholarGoogle Scholar
  16. Z. Wang, H. Mi, and A. Ittycheriah. Sentence similarity learning by lexical decomposition and composition. arXiv:1602.07019, 2016.Google ScholarGoogle Scholar
  17. J. Wieting, M. Bansal, K. Gimpel, and K. Livescu. Towards universal paraphrastic sentence embeddings. ICLR, 2016.Google ScholarGoogle Scholar
  18. Y. Yang, W.-t. Yih, and C. Meek. WikiQA: A challenge dataset for open-domain question answering. EMNLP, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  19. X. Yao, B. Van Durme, C. Callison-Burch, and P. Clark. Answer extraction as sequence tagging with tree edit distance. HLT-NAACL, 2013.Google ScholarGoogle Scholar
  20. W.-t. Yih, M.-W. Chang, C. Meek, and A. Pastusiak. Question answering using enhanced lexical semantic models. ACL, 2013.Google ScholarGoogle Scholar
  21. W. Yin, H. Schütze, B. Xiang, and B. Zhou. ABCNN: Attention-based convolutional neural network for modeling sentence pairs. arXiv:1512.05193, 2015.Google ScholarGoogle Scholar
  22. L. Yu, K. M. Hermann, P. Blunsom, and S. Pulman. Deep learning for answer sentence selection. NIPS deep learning workshop, 2014.Google ScholarGoogle Scholar

Index Terms

  1. Noise-Contrastive Estimation for Answer Selection with Deep Neural Networks

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
          October 2016
          2566 pages
          ISBN:9781450340731
          DOI:10.1145/2983323

          Copyright © 2016 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 24 October 2016

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • short-paper

          Acceptance Rates

          CIKM '16 Paper Acceptance Rate160of701submissions,23%Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader