skip to main content
10.1145/2736277.2741646acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Query Suggestion and Data Fusion in Contextual Disambiguation

Published:18 May 2015Publication History

ABSTRACT

Queries issued to a search engine are often under-specified or ambiguous. The user's search context or background may provide information that disambiguates their information need in order to automatically predict and issue a more effective query. The disambiguation can take place at different stages of the retrieval process. For instance, contextual query suggestions may be computed and recommended to users on the result page when appropriate, an approach that does not require modifying the original query's results. Alternatively, the search engine can attempt to provide efficient access to new relevant documents by injecting these documents directly into search results based on the user's context.

In this paper, we explore these complementary approaches and how they might be combined. We first develop a general framework for mining context-sensitive query reformulations for query suggestion. We evaluate our context-sensitive suggestions against a state-of-the-art baseline using a click-based metric. The resulting query suggestions generated by our approach outperform the baseline by 13% overall and by 16% on an ambiguous query subset.

While the query suggestions generated by our approach have higher quality than the existing baselines, we demonstrate that using them naively for injecting new documents into search results can lead to inferior rankings. To remedy this issue, we develop a classifier that decides when to inject new search results using features based on suggestion quality and user context. We show that our context-sensitive result fusion approach (Corfu) improves retrieval quality for ambiguous queries by up to 2.92%. Our approaches can efficiently scale to massive search logs, enabling a data-driven strategy that benefits from observing how users issue and reformulate queries in different contexts.

References

  1. R. Baeza-Yates, C. Hurtado, and M. Mendoza. Query recommendation using query logs in search engines. EDBT'04, pages 588--596. Springer-Verlag, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Z. Bar-Yossef and N. Kraus. Context-sensitive query auto-completion. WWW '11, pages 107--116. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. N. Belkin, P. Kantor, E. Fox, and J. Shaw. Combining the evidence of multiple query representations for information retrieval. Information Processing and Management, 31:431--448, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. N. Bennett, K. Svore, and S. T. Dumais. Classification-enhanced ranking. WWW '10, pages 111--120. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. N. Bennett, R. W. White, W. Chu, S. T. Dumais, P. Bailey, F. Borisyuk, and X. Cui. Modeling the impact of short- and long-term behavior on search personalization. SIGIR '12, pages 185--194. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Bhatia, D. Majumdar, and P. Mitra. Query suggestions in the absence of query logs. SIGIR '11, pages 795--804. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Boldi, F. Bonchi, C. Castillo, D. Donato, A. Gionis, and S. Vigna. The query-flow graph: Model and applications. CIKM '08, pages 609--618. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. H. Cao, D. H. Hu, D. Shen, D. Jiang, J.-T. Sun, E. Chen, and Q. Yang. Context-aware query classification. SIGIR '09, pages 3--10. ACM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. Context-aware query suggestion by mining click-through and session data. KDD '08, pages 875--883. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. K. Collins-Thompson, P. N. Bennett, R. W. White, S. de la Chica, and D. Sontag. Personalizing web search results by reading level. In Proceedings of CIKM 2011, pages 403--412. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. N. Craswell and M. Szummer. Random walks on the click graph. SIGIR '07, pages 239--246. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Cucerzan and E. Brill. Spelling correction as an iterative process that exploits the collective knowledge of web users. In D. Lin and D. Wu, editors, EMNLP 2004, pages 293--300, Barcelona, Spain, 2004. Association for Computational Linguistics.Google ScholarGoogle Scholar
  13. H. Cui, J.-R. Wen, J.-Y. Nie, and W.-Y. Ma. Query expansion by mining user logs. IEEE Trans. on Knowl. and Data Eng., 15(4):829-- 839, July 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Downey, S. Dumais, D. Liebling, and E. Horvitz. Understanding the relationship between searchers' queries and information goals. In CIKM '08, pages 449--458, Napa Valley, CA, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Eickhoff, K. Collins-Thompson, P. N. Bennett, and S. Dumais. Personalizing atypical web search sessions. WSDM '13, pages 285--294. ACM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Fox, K. Karnawat, M. Mydland, S. Dumais, and T. White. Evaluating implicit measures to improve web search. ACM Trans. Inf. Syst., 23(2):147--168, Apr. 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. H. Friedman. Greedy function approximation: a gradient boosting machine. Annals of Statistics, pages 1189--1232, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  18. J. Huang and E. N. Efthimiadis. Analyzing and evaluating query reformulation strategies in web search logs. CIKM '09, pages 77--86. ACM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. B. J. Jansen, A. Spink, and J. O. Pedersen. A Temporal Comparison of AltaVista Web Searching. Journal of The American Society for Information Science and Technology, 56:559--570, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. E. C. Jensen, S. M. Beitzel, A. Chowdhury, and O. Frieder. Query phrase suggestion from topically tagged session logs. FQAS'06, pages 185--196. Springer-Verlag, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. P. Kato, T. Sakai, and K. Tanaka. When do people use query suggestion? a query suggestion log analysis. Inf. Retr., 16(6):725--746, Dec. 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. M. P. Kato, T. Yamamoto, H. Ohshima, and K. Tanaka. Investigating users' query formulations for cognitive search intents. SIGIR '14, pages 577--586. ACM, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. F. Liu, C. Yu, and W. Meng. Personalized web search by mapping user queries to categories. CIKM '02, pages 558--565. ACM, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. C. Makris, Y. Plegas, and S. Stamou. Web query disambiguation using pagerank. J. Am. Soc. Inf. Sci. Technol., 63(8):1581--1592, Aug. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. L. Mihalkova and R. Mooney. Search query disambiguation from short sessions. In Beyond Search: Computational Intelligence for the Web Workshop at NIPS, 2008.Google ScholarGoogle Scholar
  26. J. Minker. An evaluation of query expansion by the addition of clustered terms for a document retrieval system. Information Storage and Retrieval, 8(6):329--348, Dec. 1972.Google ScholarGoogle ScholarCross RefCross Ref
  27. U. Ozertem, O. Chapelle, P. Donmez, and E. Velipasaoglu. Learning to suggest: A machine learning framework for ranking query suggestions. SIGIR '12, pages 25--34, Portland, OR, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. K. Raman, P. N. Bennett, and K. Collins-Thompson. Toward whole session relevance: Exploring intrinsic diversity in web search. SIGIR '13, New York, NY, USA, 2013. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. T. Sakai. On the reliability of information retrieval metrics based on graded relevance. Inf. Process. Manage., 43(2):531--548, Mar. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. T. Sakai and Z. Dou. Summaries, ranked retrieval and sessions: A unified framework for information access evaluation. SIGIR '13, pages 473--482, Dublin, Ireland, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. M. Sanderson. Ambiguous queries: Test collections need more sense. SIGIR '08, pages 499--506. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. J. A. Shaw and E. A. Fox. Combination of Multiple Searches. In Text REtrieval Conference, 1994.Google ScholarGoogle Scholar
  33. D. Sheldon, M. Shokouhi, M. Szummer, and N. Craswell. Lambdamerge: Merging the results of query reformulations. WSDM '11, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. M. Shokouhi. Learning to personalize query auto-completion. SIGIR '13, New York, NY, USA, 2013. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. M. Shokouhi and L. Si. Federated search. Foundations and Trends in Information Retrieval, 5(1):1--102, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. R. Song, Z. Luo, J.-R. Wen, Y. Yu, and H.-W. Hon. Identifying ambiguous queries in web search. WWW '07, pages 1169--1170. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. J. Teevan, S. T. Dumais, and E. Horvitz. Personalizing search via automated analysis of interests and activities. SIGIR '05, pages 449--456. ACM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. G. Tsatsaronis, I. Varlamis, and K. Nørvåg. An experimental study on unsupervised graph-based word sense disambiguation. In A. Gelbukh, editor, Computational Linguistics and Intelligent Text Processing, volume 6008 of Lecture Notes in Computer Science, pages 184--198. Springer Berlin Heidelberg, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. X. Wang and C. Zhai. Mining term association patterns from search logs for effective query reformulation. CIKM '08, pages 479--488. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. X. Wei, F. Peng, and B. Dumoulin. Analyzing web text association to disambiguate abbreviation in queries. SIGIR '08, pages 751--752. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. R. White and S. Drucker. Investigating behavioral variability in web search. In WWW '07, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. R. W. White, M. Bilenko, and S. Cucerzan. Studying the use of popular destinations to enhance web search interaction. In SIGIR '07, pages 159--166, Amsterdam, The Netherlands, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. J. Xu, W. Wu, H. Li, and G. Xu. A kernel approach to addressing term mismatch. WWW '11, pages 153--154. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Query Suggestion and Data Fusion in Contextual Disambiguation

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      WWW '15: Proceedings of the 24th International Conference on World Wide Web
      May 2015
      1460 pages
      ISBN:9781450334693

      Copyright © 2015 Copyright is held by the International World Wide Web Conference Committee (IW3C2)

      Publisher

      International World Wide Web Conferences Steering Committee

      Republic and Canton of Geneva, Switzerland

      Publication History

      • Published: 18 May 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      WWW '15 Paper Acceptance Rate131of929submissions,14%Overall Acceptance Rate1,899of8,196submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader