ABSTRACT
We study answer selection for question answering: given a question and a set of candidate answer sentences, the goal is to identify the subset of sentences that contains the answer. Unlike previous work, which treats this task as a straightforward pointwise classification problem, we model it as a ranking task and propose a pairwise ranking approach that can directly use existing pointwise neural network models as base components. We extend the Noise-Contrastive Estimation approach with a triplet ranking loss function to exploit interactions in triplet inputs, each consisting of a question paired with a positive and a negative example. Experiments on the TrecQA and WikiQA datasets show that our approach achieves state-of-the-art effectiveness without the need for external knowledge sources or feature engineering.
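To make the triplet idea concrete, the following is a minimal sketch of a hinge-style triplet ranking loss over (question, positive answer, negative answer) inputs. The abstract does not state the exact loss formula, so the hinge form, the margin value, and the cosine scorer used here are assumptions chosen for illustration; in the approach described above, the score would come from a pointwise neural network model rather than from raw cosine similarity. The function names are hypothetical.

```python
import math
from typing import Callable, Sequence

Vector = Sequence[float]


def cosine(u: Vector, v: Vector) -> float:
    """Stand-in scoring function (assumption): cosine similarity of two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def triplet_hinge_loss(score_pos: float, score_neg: float, margin: float = 1.0) -> float:
    """Loss is zero once the positive outscores the negative by at least `margin`."""
    return max(0.0, margin - score_pos + score_neg)


def ranking_loss(
    question: Vector,
    positives: Sequence[Vector],
    negatives: Sequence[Vector],
    score: Callable[[Vector, Vector], float] = cosine,
    margin: float = 1.0,
) -> float:
    """Sum the triplet loss over every (question, positive, negative) combination."""
    total = 0.0
    for pos in positives:
        s_pos = score(question, pos)
        for neg in negatives:
            total += triplet_hinge_loss(s_pos, score(question, neg), margin)
    return total


if __name__ == "__main__":
    q = [1.0, 0.0]
    good = [[1.0, 0.0]]   # same direction as the question
    bad = [[0.0, 1.0]]    # orthogonal to the question
    print(ranking_loss(q, good, bad))   # well-separated triplet
    print(ranking_loss(q, good, good))  # negative scores as high as the positive
```

Because the loss depends only on the two scores, any pointwise model that maps a question and a sentence to a score can be passed in as `score` without changing the loss itself, which reflects the "base component" framing in the abstract.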
Index Terms
- Noise-Contrastive Estimation for Answer Selection with Deep Neural Networks