ABSTRACT
Queries issued to a search engine are often under-specified or ambiguous. The user's search context or background may provide information that disambiguates their information need in order to automatically predict and issue a more effective query. The disambiguation can take place at different stages of the retrieval process. For instance, contextual query suggestions may be computed and recommended to users on the result page when appropriate, an approach that does not require modifying the original query's results. Alternatively, the search engine can attempt to provide efficient access to new relevant documents by injecting these documents directly into search results based on the user's context.
In this paper, we explore these complementary approaches and how they might be combined. We first develop a general framework for mining context-sensitive query reformulations for query suggestion. We evaluate our context-sensitive suggestions against a state-of-the-art baseline using a click-based metric. The resulting query suggestions generated by our approach outperform the baseline by 13% overall and by 16% on an ambiguous query subset.
While the query suggestions generated by our approach have higher quality than the existing baselines, we demonstrate that using them naively for injecting new documents into search results can lead to inferior rankings. To remedy this issue, we develop a classifier that decides when to inject new search results using features based on suggestion quality and user context. We show that our context-sensitive result fusion approach (Corfu) improves retrieval quality for ambiguous queries by up to 2.92%. Our approaches can efficiently scale to massive search logs, enabling a data-driven strategy that benefits from observing how users issue and reformulate queries in different contexts.
- R. Baeza-Yates, C. Hurtado, and M. Mendoza. Query recommendation using query logs in search engines. EDBT'04, pages 588--596. Springer-Verlag, 2004. Google ScholarDigital Library
- Z. Bar-Yossef and N. Kraus. Context-sensitive query auto-completion. WWW '11, pages 107--116. ACM, 2011. Google ScholarDigital Library
- N. Belkin, P. Kantor, E. Fox, and J. Shaw. Combining the evidence of multiple query representations for information retrieval. Information Processing and Management, 31:431--448, 1995. Google ScholarDigital Library
- P. N. Bennett, K. Svore, and S. T. Dumais. Classification-enhanced ranking. WWW '10, pages 111--120. ACM, 2010. Google ScholarDigital Library
- P. N. Bennett, R. W. White, W. Chu, S. T. Dumais, P. Bailey, F. Borisyuk, and X. Cui. Modeling the impact of short- and long-term behavior on search personalization. SIGIR '12, pages 185--194. ACM, 2012. Google ScholarDigital Library
- S. Bhatia, D. Majumdar, and P. Mitra. Query suggestions in the absence of query logs. SIGIR '11, pages 795--804. ACM, 2011. Google ScholarDigital Library
- P. Boldi, F. Bonchi, C. Castillo, D. Donato, A. Gionis, and S. Vigna. The query-flow graph: Model and applications. CIKM '08, pages 609--618. ACM, 2008. Google ScholarDigital Library
- H. Cao, D. H. Hu, D. Shen, D. Jiang, J.-T. Sun, E. Chen, and Q. Yang. Context-aware query classification. SIGIR '09, pages 3--10. ACM, 2009. Google ScholarDigital Library
- H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. Context-aware query suggestion by mining click-through and session data. KDD '08, pages 875--883. ACM, 2008. Google ScholarDigital Library
- K. Collins-Thompson, P. N. Bennett, R. W. White, S. de la Chica, and D. Sontag. Personalizing web search results by reading level. In Proceedings of CIKM 2011, pages 403--412. ACM, 2011. Google ScholarDigital Library
- N. Craswell and M. Szummer. Random walks on the click graph. SIGIR '07, pages 239--246. ACM, 2007. Google ScholarDigital Library
- S. Cucerzan and E. Brill. Spelling correction as an iterative process that exploits the collective knowledge of web users. In D. Lin and D. Wu, editors, EMNLP 2004, pages 293--300, Barcelona, Spain, 2004. Association for Computational Linguistics.Google Scholar
- H. Cui, J.-R. Wen, J.-Y. Nie, and W.-Y. Ma. Query expansion by mining user logs. IEEE Trans. on Knowl. and Data Eng., 15(4):829-- 839, July 2003. Google ScholarDigital Library
- D. Downey, S. Dumais, D. Liebling, and E. Horvitz. Understanding the relationship between searchers' queries and information goals. In CIKM '08, pages 449--458, Napa Valley, CA, 2008. Google ScholarDigital Library
- C. Eickhoff, K. Collins-Thompson, P. N. Bennett, and S. Dumais. Personalizing atypical web search sessions. WSDM '13, pages 285--294. ACM, 2013. Google ScholarDigital Library
- S. Fox, K. Karnawat, M. Mydland, S. Dumais, and T. White. Evaluating implicit measures to improve web search. ACM Trans. Inf. Syst., 23(2):147--168, Apr. 2005. Google ScholarDigital Library
- J. H. Friedman. Greedy function approximation: a gradient boosting machine. Annals of Statistics, pages 1189--1232, 2001.Google ScholarCross Ref
- J. Huang and E. N. Efthimiadis. Analyzing and evaluating query reformulation strategies in web search logs. CIKM '09, pages 77--86. ACM, 2009. Google ScholarDigital Library
- B. J. Jansen, A. Spink, and J. O. Pedersen. A Temporal Comparison of AltaVista Web Searching. Journal of The American Society for Information Science and Technology, 56:559--570, 2005. Google ScholarDigital Library
- E. C. Jensen, S. M. Beitzel, A. Chowdhury, and O. Frieder. Query phrase suggestion from topically tagged session logs. FQAS'06, pages 185--196. Springer-Verlag, 2006. Google ScholarDigital Library
- M. P. Kato, T. Sakai, and K. Tanaka. When do people use query suggestion? a query suggestion log analysis. Inf. Retr., 16(6):725--746, Dec. 2013. Google ScholarDigital Library
- M. P. Kato, T. Yamamoto, H. Ohshima, and K. Tanaka. Investigating users' query formulations for cognitive search intents. SIGIR '14, pages 577--586. ACM, 2014. Google ScholarDigital Library
- F. Liu, C. Yu, and W. Meng. Personalized web search by mapping user queries to categories. CIKM '02, pages 558--565. ACM, 2002. Google ScholarDigital Library
- C. Makris, Y. Plegas, and S. Stamou. Web query disambiguation using pagerank. J. Am. Soc. Inf. Sci. Technol., 63(8):1581--1592, Aug. 2012. Google ScholarDigital Library
- L. Mihalkova and R. Mooney. Search query disambiguation from short sessions. In Beyond Search: Computational Intelligence for the Web Workshop at NIPS, 2008.Google Scholar
- J. Minker. An evaluation of query expansion by the addition of clustered terms for a document retrieval system. Information Storage and Retrieval, 8(6):329--348, Dec. 1972.Google ScholarCross Ref
- U. Ozertem, O. Chapelle, P. Donmez, and E. Velipasaoglu. Learning to suggest: A machine learning framework for ranking query suggestions. SIGIR '12, pages 25--34, Portland, OR, 2012. Google ScholarDigital Library
- K. Raman, P. N. Bennett, and K. Collins-Thompson. Toward whole session relevance: Exploring intrinsic diversity in web search. SIGIR '13, New York, NY, USA, 2013. ACM. Google ScholarDigital Library
- T. Sakai. On the reliability of information retrieval metrics based on graded relevance. Inf. Process. Manage., 43(2):531--548, Mar. 2007. Google ScholarDigital Library
- T. Sakai and Z. Dou. Summaries, ranked retrieval and sessions: A unified framework for information access evaluation. SIGIR '13, pages 473--482, Dublin, Ireland, 2013. Google ScholarDigital Library
- M. Sanderson. Ambiguous queries: Test collections need more sense. SIGIR '08, pages 499--506. ACM, 2008. Google ScholarDigital Library
- J. A. Shaw and E. A. Fox. Combination of Multiple Searches. In Text REtrieval Conference, 1994.Google Scholar
- D. Sheldon, M. Shokouhi, M. Szummer, and N. Craswell. Lambdamerge: Merging the results of query reformulations. WSDM '11, New York, NY, USA, 2011. ACM. Google ScholarDigital Library
- M. Shokouhi. Learning to personalize query auto-completion. SIGIR '13, New York, NY, USA, 2013. ACM. Google ScholarDigital Library
- M. Shokouhi and L. Si. Federated search. Foundations and Trends in Information Retrieval, 5(1):1--102, 2011. Google ScholarDigital Library
- R. Song, Z. Luo, J.-R. Wen, Y. Yu, and H.-W. Hon. Identifying ambiguous queries in web search. WWW '07, pages 1169--1170. ACM, 2007. Google ScholarDigital Library
- J. Teevan, S. T. Dumais, and E. Horvitz. Personalizing search via automated analysis of interests and activities. SIGIR '05, pages 449--456. ACM, 2005. Google ScholarDigital Library
- G. Tsatsaronis, I. Varlamis, and K. Nørvåg. An experimental study on unsupervised graph-based word sense disambiguation. In A. Gelbukh, editor, Computational Linguistics and Intelligent Text Processing, volume 6008 of Lecture Notes in Computer Science, pages 184--198. Springer Berlin Heidelberg, 2010. Google ScholarDigital Library
- X. Wang and C. Zhai. Mining term association patterns from search logs for effective query reformulation. CIKM '08, pages 479--488. ACM, 2008. Google ScholarDigital Library
- X. Wei, F. Peng, and B. Dumoulin. Analyzing web text association to disambiguate abbreviation in queries. SIGIR '08, pages 751--752. ACM, 2008. Google ScholarDigital Library
- R. White and S. Drucker. Investigating behavioral variability in web search. In WWW '07, 2007. Google ScholarDigital Library
- R. W. White, M. Bilenko, and S. Cucerzan. Studying the use of popular destinations to enhance web search interaction. In SIGIR '07, pages 159--166, Amsterdam, The Netherlands, 2007. Google ScholarDigital Library
- J. Xu, W. Wu, H. Li, and G. Xu. A kernel approach to addressing term mismatch. WWW '11, pages 153--154. ACM, 2011. Google ScholarDigital Library
Index Terms
- Query Suggestion and Data Fusion in Contextual Disambiguation
Recommendations
Visual query suggestion: Towards capturing user intent in internet image search
Query suggestion is an effective approach to bridge the Intention Gap between the users' search intents and queries. Most existing search engines are able to automatically suggest a list of textual query terms based on users' current query input, which ...
Visual query suggestion
MM '09: Proceedings of the 17th ACM international conference on MultimediaQuery suggestion is an effective approach to improve the usability of image search. Most existing search engines are able to automatically suggest a list of textual query terms based on users' current query input, which can be called Textual Query ...
Personalized Query Suggestion Diversification
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information RetrievalQuery suggestions help users refine their queries after they input an initial query. We consider the task of generating query suggestions that are personalized and diversified. We propose a personalized query suggestion diversification model (PQSD), ...
Comments