research-article

Query Suggestion and Data Fusion in Contextual Disambiguation

Authors:
Milad Shokouhi

Microsoft, Cambridge, United Kingdom

Microsoft, Cambridge, United Kingdom
View Profile

,
Marc Sloan

University College London, London, United Kingdom

University College London, London, United Kingdom
View Profile

,
Paul N. Bennett

Microsoft, Redmond, USA

Microsoft, Redmond, USA
View Profile

,
Kevyn Collins-Thompson

University of Michigan, Ann Arbor, USA

University of Michigan, Ann Arbor, USA
View Profile

,
Siranush Sarkizova

Harvard University, Cambridge, USA

Harvard University, Cambridge, USA
View Profile

WWW '15: Proceedings of the 24th International Conference on World Wide WebMay 2015Pages 971–980https://doi.org/10.1145/2736277.2741646

Published:18 May 2015Publication History

WWW '15: Proceedings of the 24th International Conference on World Wide Web

Pages 971–980

ABSTRACT

Queries issued to a search engine are often under-specified or ambiguous. The user's search context or background may provide information that disambiguates their information need in order to automatically predict and issue a more effective query. The disambiguation can take place at different stages of the retrieval process. For instance, contextual query suggestions may be computed and recommended to users on the result page when appropriate, an approach that does not require modifying the original query's results. Alternatively, the search engine can attempt to provide efficient access to new relevant documents by injecting these documents directly into search results based on the user's context.

In this paper, we explore these complementary approaches and how they might be combined. We first develop a general framework for mining context-sensitive query reformulations for query suggestion. We evaluate our context-sensitive suggestions against a state-of-the-art baseline using a click-based metric. The resulting query suggestions generated by our approach outperform the baseline by 13% overall and by 16% on an ambiguous query subset.

While the query suggestions generated by our approach have higher quality than the existing baselines, we demonstrate that using them naively for injecting new documents into search results can lead to inferior rankings. To remedy this issue, we develop a classifier that decides when to inject new search results using features based on suggestion quality and user context. We show that our context-sensitive result fusion approach (Corfu) improves retrieval quality for ambiguous queries by up to 2.92%. Our approaches can efficiently scale to massive search logs, enabling a data-driven strategy that benefits from observing how users issue and reformulate queries in different contexts.

References

R. Baeza-Yates, C. Hurtado, and M. Mendoza. Query recommendation using query logs in search engines. EDBT'04, pages 588--596. Springer-Verlag, 2004. Google ScholarDigital Library
Z. Bar-Yossef and N. Kraus. Context-sensitive query auto-completion. WWW '11, pages 107--116. ACM, 2011. Google ScholarDigital Library
N. Belkin, P. Kantor, E. Fox, and J. Shaw. Combining the evidence of multiple query representations for information retrieval. Information Processing and Management, 31:431--448, 1995. Google ScholarDigital Library
P. N. Bennett, K. Svore, and S. T. Dumais. Classification-enhanced ranking. WWW '10, pages 111--120. ACM, 2010. Google ScholarDigital Library
P. N. Bennett, R. W. White, W. Chu, S. T. Dumais, P. Bailey, F. Borisyuk, and X. Cui. Modeling the impact of short- and long-term behavior on search personalization. SIGIR '12, pages 185--194. ACM, 2012. Google ScholarDigital Library
S. Bhatia, D. Majumdar, and P. Mitra. Query suggestions in the absence of query logs. SIGIR '11, pages 795--804. ACM, 2011. Google ScholarDigital Library
P. Boldi, F. Bonchi, C. Castillo, D. Donato, A. Gionis, and S. Vigna. The query-flow graph: Model and applications. CIKM '08, pages 609--618. ACM, 2008. Google ScholarDigital Library
H. Cao, D. H. Hu, D. Shen, D. Jiang, J.-T. Sun, E. Chen, and Q. Yang. Context-aware query classification. SIGIR '09, pages 3--10. ACM, 2009. Google ScholarDigital Library
H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. Context-aware query suggestion by mining click-through and session data. KDD '08, pages 875--883. ACM, 2008. Google ScholarDigital Library
K. Collins-Thompson, P. N. Bennett, R. W. White, S. de la Chica, and D. Sontag. Personalizing web search results by reading level. In Proceedings of CIKM 2011, pages 403--412. ACM, 2011. Google ScholarDigital Library
N. Craswell and M. Szummer. Random walks on the click graph. SIGIR '07, pages 239--246. ACM, 2007. Google ScholarDigital Library
S. Cucerzan and E. Brill. Spelling correction as an iterative process that exploits the collective knowledge of web users. In D. Lin and D. Wu, editors, EMNLP 2004, pages 293--300, Barcelona, Spain, 2004. Association for Computational Linguistics.Google Scholar
H. Cui, J.-R. Wen, J.-Y. Nie, and W.-Y. Ma. Query expansion by mining user logs. IEEE Trans. on Knowl. and Data Eng., 15(4):829-- 839, July 2003. Google ScholarDigital Library
D. Downey, S. Dumais, D. Liebling, and E. Horvitz. Understanding the relationship between searchers' queries and information goals. In CIKM '08, pages 449--458, Napa Valley, CA, 2008. Google ScholarDigital Library
C. Eickhoff, K. Collins-Thompson, P. N. Bennett, and S. Dumais. Personalizing atypical web search sessions. WSDM '13, pages 285--294. ACM, 2013. Google ScholarDigital Library
S. Fox, K. Karnawat, M. Mydland, S. Dumais, and T. White. Evaluating implicit measures to improve web search. ACM Trans. Inf. Syst., 23(2):147--168, Apr. 2005. Google ScholarDigital Library
J. H. Friedman. Greedy function approximation: a gradient boosting machine. Annals of Statistics, pages 1189--1232, 2001.Google ScholarCross Ref
J. Huang and E. N. Efthimiadis. Analyzing and evaluating query reformulation strategies in web search logs. CIKM '09, pages 77--86. ACM, 2009. Google ScholarDigital Library
B. J. Jansen, A. Spink, and J. O. Pedersen. A Temporal Comparison of AltaVista Web Searching. Journal of The American Society for Information Science and Technology, 56:559--570, 2005. Google ScholarDigital Library
E. C. Jensen, S. M. Beitzel, A. Chowdhury, and O. Frieder. Query phrase suggestion from topically tagged session logs. FQAS'06, pages 185--196. Springer-Verlag, 2006. Google ScholarDigital Library
M. P. Kato, T. Sakai, and K. Tanaka. When do people use query suggestion? a query suggestion log analysis. Inf. Retr., 16(6):725--746, Dec. 2013. Google ScholarDigital Library
M. P. Kato, T. Yamamoto, H. Ohshima, and K. Tanaka. Investigating users' query formulations for cognitive search intents. SIGIR '14, pages 577--586. ACM, 2014. Google ScholarDigital Library
F. Liu, C. Yu, and W. Meng. Personalized web search by mapping user queries to categories. CIKM '02, pages 558--565. ACM, 2002. Google ScholarDigital Library
C. Makris, Y. Plegas, and S. Stamou. Web query disambiguation using pagerank. J. Am. Soc. Inf. Sci. Technol., 63(8):1581--1592, Aug. 2012. Google ScholarDigital Library
L. Mihalkova and R. Mooney. Search query disambiguation from short sessions. In Beyond Search: Computational Intelligence for the Web Workshop at NIPS, 2008.Google Scholar
J. Minker. An evaluation of query expansion by the addition of clustered terms for a document retrieval system. Information Storage and Retrieval, 8(6):329--348, Dec. 1972.Google ScholarCross Ref
U. Ozertem, O. Chapelle, P. Donmez, and E. Velipasaoglu. Learning to suggest: A machine learning framework for ranking query suggestions. SIGIR '12, pages 25--34, Portland, OR, 2012. Google ScholarDigital Library
K. Raman, P. N. Bennett, and K. Collins-Thompson. Toward whole session relevance: Exploring intrinsic diversity in web search. SIGIR '13, New York, NY, USA, 2013. ACM. Google ScholarDigital Library
T. Sakai. On the reliability of information retrieval metrics based on graded relevance. Inf. Process. Manage., 43(2):531--548, Mar. 2007. Google ScholarDigital Library
T. Sakai and Z. Dou. Summaries, ranked retrieval and sessions: A unified framework for information access evaluation. SIGIR '13, pages 473--482, Dublin, Ireland, 2013. Google ScholarDigital Library
M. Sanderson. Ambiguous queries: Test collections need more sense. SIGIR '08, pages 499--506. ACM, 2008. Google ScholarDigital Library
J. A. Shaw and E. A. Fox. Combination of Multiple Searches. In Text REtrieval Conference, 1994.Google Scholar
D. Sheldon, M. Shokouhi, M. Szummer, and N. Craswell. Lambdamerge: Merging the results of query reformulations. WSDM '11, New York, NY, USA, 2011. ACM. Google ScholarDigital Library
M. Shokouhi. Learning to personalize query auto-completion. SIGIR '13, New York, NY, USA, 2013. ACM. Google ScholarDigital Library
M. Shokouhi and L. Si. Federated search. Foundations and Trends in Information Retrieval, 5(1):1--102, 2011. Google ScholarDigital Library
R. Song, Z. Luo, J.-R. Wen, Y. Yu, and H.-W. Hon. Identifying ambiguous queries in web search. WWW '07, pages 1169--1170. ACM, 2007. Google ScholarDigital Library
J. Teevan, S. T. Dumais, and E. Horvitz. Personalizing search via automated analysis of interests and activities. SIGIR '05, pages 449--456. ACM, 2005. Google ScholarDigital Library
G. Tsatsaronis, I. Varlamis, and K. Nørvåg. An experimental study on unsupervised graph-based word sense disambiguation. In A. Gelbukh, editor, Computational Linguistics and Intelligent Text Processing, volume 6008 of Lecture Notes in Computer Science, pages 184--198. Springer Berlin Heidelberg, 2010. Google ScholarDigital Library
X. Wang and C. Zhai. Mining term association patterns from search logs for effective query reformulation. CIKM '08, pages 479--488. ACM, 2008. Google ScholarDigital Library
X. Wei, F. Peng, and B. Dumoulin. Analyzing web text association to disambiguate abbreviation in queries. SIGIR '08, pages 751--752. ACM, 2008. Google ScholarDigital Library
R. White and S. Drucker. Investigating behavioral variability in web search. In WWW '07, 2007. Google ScholarDigital Library
R. W. White, M. Bilenko, and S. Cucerzan. Studying the use of popular destinations to enhance web search interaction. In SIGIR '07, pages 159--166, Amsterdam, The Netherlands, 2007. Google ScholarDigital Library
J. Xu, W. Wu, H. Li, and G. Xu. A kernel approach to addressing term mismatch. WWW '11, pages 153--154. ACM, 2011. Google ScholarDigital Library

Index Terms

Query Suggestion and Data Fusion in Contextual Disambiguation
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing

Recommendations

Visual query suggestion: Towards capturing user intent in internet image search

Query suggestion is an effective approach to bridge the Intention Gap between the users' search intents and queries. Most existing search engines are able to automatically suggest a list of textual query terms based on users' current query input, which ...
Read More
Visual query suggestion
MM '09: Proceedings of the 17th ACM international conference on Multimedia

Query suggestion is an effective approach to improve the usability of image search. Most existing search engines are able to automatically suggest a list of textual query terms based on users' current query input, which can be called Textual Query ...
Read More
Personalized Query Suggestion Diversification
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

Query suggestions help users refine their queries after they input an initial query. We consider the task of generating query suggestions that are personalized and diversified. We propose a personalized query suggestion diversification model (PQSD), ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WWW '15: Proceedings of the 24th International Conference on World Wide Web
May 2015
1460 pages
ISBN:9781450334693
General Chairs:
Aldo Gangemi
National Research Council, Italy & Paris 13 University-CNRS, France
,
Stefano Leonardi
Sapienza University of Rome, Italy
,
Alessandro Panconesi
Sapienza University of Rome, Italy
Copyright © 2015 Copyright is held by the International World Wide Web Conference Committee (IW3C2)
Sponsors
In-Cooperation
Publisher
International World Wide Web Conferences Steering Committee
Republic and Canton of Geneva, Switzerland
Publication History
- Published: 18 May 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
blending
contextual disambiguation
contextual fusion
personalized search
Qualifiers
- research-article
Conference

Acceptance Rates
WWW '15 Paper Acceptance Rate131of929submissions,14%Overall Acceptance Rate1,899of8,196submissions,23%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 10
  Total Citations
  View Citations
- 247
  Total Downloads
- Downloads (Last 12 months)5
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Query Suggestion and Data Fusion in Contextual Disambiguation

WWW '15: Proceedings of the 24th International Conference on World Wide Web

ABSTRACT

References

Cited By

Index Terms

Recommendations

Visual query suggestion: Towards capturing user intent in internet image search

Visual query suggestion

Personalized Query Suggestion Diversification