Article

Query performance prediction in web search environments

Authors:
Yun Zhou

University of Massachusetts: Amherst, Amherst, MA

University of Massachusetts: Amherst, Amherst, MA
View Profile

,
W. Bruce Croft

University of Massachusetts: Amherst, Amherst, MA

University of Massachusetts: Amherst, Amherst, MA
View Profile

SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrievalJuly 2007Pages 543–550https://doi.org/10.1145/1277741.1277835

Published:23 July 2007Publication History

SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

Pages 543–550

ABSTRACT

Current prediction techniques, which are generally designed for content-based queries and are typically evaluated on relatively homogenous test collections of small sizes, face serious challenges in web search environments where collections are significantly more heterogeneous and different types of retrieval tasks exist. In this paper, we present three techniques to address these challenges. We focus on performance prediction for two types of queries in web search environments: content-based and Named-Page finding. Our evaluation is mainly performed on the GOV2 collection. In addition to evaluating our models for the two types of queries separately, we consider a more challenging and realistic situation that the two types of queries are mixed together without prior information on query types. To assist prediction under the mixed-query situation, a novel query classifier is adopted. Results show that our prediction of web query performance is substantially more accurate than the current state-of-the-art prediction techniques. Consequently, our paper provides a practical approach to performance prediction in real-world web settings.

References

Y. Zhou, W.B. Croft, Ranking Robustness: A Novel Framework to Predict Query Performance, in Proceedings of CIKM 2006. Google ScholarDigital Library
D. Carmel, E. Yom-Tov, A. Darlow, D. Pelleg, What Makes a Query Difficult?, in Proceedings of SIGIR 2006. Google ScholarDigital Library
C.L.A. Clarke, F. Scholer, I. Soboroff, The TREC 2005 Terabyte Track, In the Online Proceedings of 2005 TREC.Google Scholar
B. He and I. Ounis. Inferring query performance using pre-retrieval predictors. In proceedings of the SPIRE 2004.Google ScholarCross Ref
S. Tomlinson. Robust, Web and Terabyte Retrieval with Hummingbird SearchServer at TREC 2004. In the Online Proceedings of 2004 TREC.Google Scholar
S. Cronen-Townsend, Y. Zhou, W.B. Croft, Predicting Query Performance, in Proceedings of SIGIR 2002. Google ScholarDigital Library
V. Vinay, I. J.Cox, N. Mill-Frayling, K. Wood, On Ranking the Effectiveness of Searcher, in Proceedings of SIGIR 2006. Google ScholarDigital Library
D. Metzler, W.B. Croft, A Markov Random Filed Model for Term Dependencies, in Proceedings of SIGIR 2005. Google ScholarDigital Library
D. Metzler, T. Strohman, Y. Zhou, W.B. Croft, Indri at TREC 2005: Terabyte Track, In the Online Proceedings of 2004 TREC.Google Scholar
P. Ogilvie and J. Callan, Combining document representations for known-item search, in Proceedings of SIGIR 2003. Google ScholarDigital Library
A. Berger, J. Lafferty, Information retrieval as statistical translation, in Proceedings of SIGIR 1999. Google ScholarDigital Library
Indri search engine: http://www.lemurproject.org/indri/Google Scholar
I.J. Taneja: On Generalized Information Measures and Their Applications, Advances in Electronics and Electron Physics, Academic Press (USA), 76, 1989, 327--413.Google Scholar
S. Cronen-Townsend, Y. Zhou and Croft, W.B., "A Framework for Selective Query Expansion," in Proceedings of CIKM 2004. Google ScholarDigital Library
F. Song, W.B. Croft, A general language model for information retrieval, in Proceedings of SIGIR 1999. Google ScholarDigital Library
Personal email contact with Vishwa Vinay and our own experiments.Google Scholar
E. Yom-Tov, S. Fine, D. Carmel, A. Darlow, Learning to Estimate Query Difficulty Including Applications to Missing Content Detection and Distributed Information retrieval, in Proceedings of SIGIR 2005. Google ScholarDigital Library

Index Terms

Query performance prediction in web search environments
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing

Recommendations

Information Needs, Queries, and Query Performance Prediction
SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

The query performance prediction (QPP) task is to estimate the effectiveness of a search performed in response to a query with no relevance judgments. Existing QPP methods do not account for the effectiveness of a query in representing the underlying ...
Read More
BERT-QPP: Contextualized Pre-trained transformers for Query Performance Prediction
CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management

Query Performance Prediction (QPP) is focused on estimating the difficulty of satisfying a user query for a certain retrieval method. While most state of the art QPP methods are based on term frequency and corpus statistics, more recent work in this ...
Read More
Predicting query performance
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval

We develop a method for predicting query performance by computing the relative entropy between a query language model and the corresponding collection language model. The resulting clarity score measures the coherence of the language usage in documents ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
July 2007
946 pages
ISBN:9781595935977
DOI:10.1145/1277741
General Chairs:
Wessel Kraaij
TNO, The Netherlands
,
Arjen P. de Vries
CWI, The Netherlands
,
Program Chairs:
Charles L. A. Clarke
University of Waterloo, Canada
,
Norbert Fuhr
University of Duisburg-Essen, Germany
,
Noriko Kando
National Institute of Informatics, Japan
Copyright © 2007 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 July 2007
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
query classification
query performance prediction
web search
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate792of3,983submissions,20%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 176
  Total Citations
  View Citations
- 1,306
  Total Downloads
- Downloads (Last 12 months)56
- Downloads (Last 6 weeks)8
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Query performance prediction in web search environments

SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Information Needs, Queries, and Query Performance Prediction

BERT-QPP: Contextualized Pre-trained transformers for Query Performance Prediction

Predicting query performance