Reranking search results for sparse queries

Authors:
Elif Aktolga, University of Massachusetts Amherst, Amherst, MA, USA
James Allan, University of Massachusetts Amherst, Amherst, MA, USA

CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management, October 2011, Pages 173–182, https://doi.org/10.1145/2063576.2063606

Published: 24 October 2011

ABSTRACT

It is well known that clickthrough data can be used to improve the effectiveness of search results: broadly speaking, a query's past clicks are a predictor of future clicks on documents. However, when a new or unusual query appears, or when a system is not as widely used as a mainstream web search system, there may be little to no click data available to improve the results. Existing methods to boost query performance for sparse queries extend the query-document click relationship to more documents or queries, but require substantial clickthrough data from other queries. In this work we describe a way to boost rarely clicked queries in a system where limited clickthrough data is available for all queries. We utilize information from co-click queries, subset queries, and synonym queries to estimate the clickthrough for a sparse query, and we describe a probabilistic approach for carrying out that estimation and using it to rerank retrieved documents. Our experiments on a query log from a medical informatics company demonstrate that when overall clickthrough data is sparse, reranking search results using clickthrough information from related queries significantly outperforms reranking that employs clickthrough information from the query alone.
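
The general idea of borrowing click evidence from related queries can be illustrated with a minimal sketch. This is not the probabilistic model from the paper, whose details the abstract does not give. It assumes a simple linear interpolation between the sparse query's own click distribution and the averaged click distributions of its related queries (co-click, subset, or synonym queries). The function names, the `own_weight` parameter, and the toy click counts are all hypothetical.

```python
def click_distribution(clicks):
    """Normalize raw click counts {doc_id: count} into a probability distribution."""
    total = sum(clicks.values())
    return {doc: n / total for doc, n in clicks.items()} if total else {}


def estimate_click_probs(own_clicks, related_clicks, own_weight=0.5):
    """Estimate click probabilities for a sparse query.

    Interpolates the query's own click distribution with the average
    distribution of its related queries. own_weight is an assumed
    interpolation weight, not a value taken from the paper.
    """
    own = click_distribution(own_clicks)
    related = [click_distribution(c) for c in related_clicks]
    related = [dist for dist in related if dist]  # drop queries with no clicks

    # Fall back to whichever source has data when the other is empty.
    if not related:
        lam = 1.0
    elif not own:
        lam = 0.0
    else:
        lam = own_weight

    docs = set(own)
    for dist in related:
        docs.update(dist)

    probs = {}
    for doc in docs:
        rel = sum(d.get(doc, 0.0) for d in related) / len(related) if related else 0.0
        probs[doc] = lam * own.get(doc, 0.0) + (1 - lam) * rel
    return probs


def rerank(ranked_docs, probs):
    """Sort by estimated click probability; ties keep the original retrieval order."""
    return sorted(ranked_docs, key=lambda doc: -probs.get(doc, 0.0))


# Toy data: the sparse query has one click; two related queries have a few more.
ranked = ["d1", "d2", "d3", "d4"]
own = {"d2": 1}
related = [{"d3": 3, "d2": 1}, {"d3": 2}]
probs = estimate_click_probs(own, related)
reranked = rerank(ranked, probs)
```

With this toy data, `d2` and `d3` move ahead of `d1` and `d4`, which received no click evidence from any source. Because Python's sort is stable, documents without click evidence keep their original retrieval order.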

Index Terms

Reranking search results for sparse queries
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing

Published in
CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
October 2011
2712 pages
ISBN:9781450307178
DOI:10.1145/2063576
Editors: Bettina Berendt, Arjen de Vries, Wenfei Fan, Craig Macdonald (University of Glasgow, UK), Iadh Ounis (University of Glasgow, UK), Ian Ruthven (University of Strathclyde, UK)
Copyright © 2011 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
query log mining
query selection
reranking
sparse clickthrough data
sparse queries
Qualifiers
- research-article
Acceptance Rates
Overall Acceptance Rate: 1,861 of 8,427 submissions, 22%