Article

Structured retrieval for question answering

Authors:
Matthew W. Bilotti

Carnegie Mellon University

Carnegie Mellon University
View Profile

,
Paul Ogilvie

Carnegie Mellon University

Carnegie Mellon University
View Profile

,
Jamie Callan

Carnegie Mellon University

Carnegie Mellon University
View Profile

,
Eric Nyberg

Carnegie Mellon University

Carnegie Mellon University
View Profile

SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrievalJuly 2007Pages 351–358https://doi.org/10.1145/1277741.1277802

Published:23 July 2007Publication History

SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

Pages 351–358

ABSTRACT

Bag-of-words retrieval is popular among Question Answering (QA) system developers, but it does not support constraint checking and ranking on the linguistic and semantic information of interest to the QA system. We present anapproach to retrieval for QA, applying structured retrieval techniques to the types of text annotations that QA systems use. We demonstrate that the structured approach can retrieve more relevant results, more highly ranked, compared with bag-of-words, on a sentence retrieval task. We also characterize the extent to which structured retrieval effectiveness depends on the quality of the annotations.

References

D. Bikel, et al. An algorithm that learns what's in a name. ML, 34(1-3):211--231, 1999. Google ScholarDigital Library
M. Bilotti, et al. What works better for question answering: Stemming or morphological query expansion? In Proc. of IR4QA at SIGIR, 2004.Google Scholar
D. Carmel, et al. Searching XML documents via XML fragments. In Proc. of SIGIR, 2003. Google ScholarDigital Library
H. Cui, et al. Question answering passage retrieval using dependency relations. In Proc. of SIGIR, 2005. Google ScholarDigital Library
D. Graff. The AQUAINT Corpus of English News Text. LDC, 2002. Cat. No. LDC2002T31.Google Scholar
S. Harabagiu, et al. Employing two question answering systems in TREC-2005. In Proc. of TREC-14, 2005.Google Scholar
P. Kingsbury, et al. Adding semantic annotation to the penn treebank. In Proc. of HLT, 2002.Google Scholar
C. Lin and E. Hovy. The automated acquisition of topic signatures for text summarization. In Proc. of COLING, 2000. Google ScholarDigital Library
J. Lin and B. Katz. Building a reusable test collection for question answering. JASIST, 57(7):851--861, 2006. Google ScholarDigital Library
M. Marcus, et al. Building a large annotated corpus of english: the penn treebank. CL, 19(2):313--330, 1993. Google ScholarDigital Library
M. Montague and J. Aslam. Relevance score normalization for metasearch. In Proc. of CIKM, 2001. Google ScholarDigital Library
P. Ogilvie and J. Callan. Combining document representations for known-item search. In Proc. of SIGIR, 2003. Google ScholarDigital Library
S. Pradhan, et al. Shallow semantic parsing using support vector machines. In Proc. of HLT, 2004.Google Scholar
J. Prager, et al. Question-answering by predictive annotation. In Proc. of SIGIR, 2000. Google ScholarDigital Library
T. Rajashekar and W. B. Croft. Combining automatic and manual index representations in probabilistic retrieval. JASIS, 46(4):272--283, 1995. Google ScholarDigital Library
J. Reynar and A. Ratnaparkhi. A maximum entropy approach to identifying sentence boundaries. In Proc. of ANLP, 1997. Google ScholarDigital Library
J. Shaw and E. Fox. Combination of multiple searches. In Proc. of TREC-3, 1994.Google Scholar
T. Strohman, et al. Indri: A language model-based search engine for complex queries. In Proc. of ICIA, 2005.Google Scholar
S. Tellex, et al. Quantitative evaluation of passage retrieval algorithms for question answering. In Proc. of SIGIR, 2003. Google ScholarDigital Library
H. Turtle and W. Croft. Evaluation of an inference network-based retrieval model. ACM TOIS, 9(3):187--222, 1991. Google ScholarDigital Library
E. Voorhees, et al. The collection fusion problem. In Proc. of TREC-3, 1994.Google Scholar
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proc. of SIGIR, 2001. Google ScholarDigital Library

Index Terms

Structured retrieval for question answering
1. Information systems
  1. Information retrieval
    1. Document representation

Recommendations

A generative retrieval model for structured documents
CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge management

Structured documents contain elements defined by the author(s) and annotations assigned by other people or processes. Structured documents pose challenges for probabilistic retrieval models when there are mismatches between the structured query and the ...
Read More
Human question answering performance using an interactive document retrieval system
IIIX '12: Proceedings of the 4th Information Interaction in Context Symposium

Every day, people answer their questions by using document retrieval systems. Compared to document retrieval systems, question answering (QA) systems aim to speed the rate at which users find answers by retrieving answers rather than documents. To ...
Read More
Dynamic element retrieval in a structured environment

This research examines the feasibility of dynamic element retrieval in a structured environment. Structured documents and queries are represented in extended vector form, based on a modification of the basic vector space model suggested by Fox [1983]. A ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
July 2007
946 pages
ISBN:9781595935977
DOI:10.1145/1277741
General Chairs:
Wessel Kraaij
TNO, The Netherlands
,
Arjen P. de Vries
CWI, The Netherlands
,
Program Chairs:
Charles L. A. Clarke
University of Waterloo, Canada
,
Norbert Fuhr
University of Duisburg-Essen, Germany
,
Noriko Kando
National Institute of Informatics, Japan
Copyright © 2007 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 July 2007
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
question answering
structured retrieval
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate792of3,983submissions,20%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 59
  Total Citations
  View Citations
- 1,191
  Total Downloads
- Downloads (Last 12 months)9
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Structured retrieval for question answering

SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

A generative retrieval model for structured documents

Human question answering performance using an interactive document retrieval system

Dynamic element retrieval in a structured environment