skip to main content
research-article

Simrank++: query rewriting through link analysis of the click graph

Published:01 August 2008Publication History
Skip Abstract Section

Abstract

We focus on the problem of query rewriting for sponsored search. We base rewrites on a historical click graph that records the ads that have been clicked on in response to past user queries. Given a query q, we first consider Simrank [7] as a way to identify queries similar to q, i.e., queries whose ads a user may be interested in. We argue that Simrank fails to properly identify query similarities in our application, and we present two enhanced versions of Simrank: one that exploits weights on click graph edges and another that exploits "evidence." We experimentally evaluate our new schemes against Simrank, using actual click graphs and queries from Yahoo!, and using a variety of metrics. Our results show that the enhanced methods can yield more and better query rewrites.

References

  1. Reid Andersen, Fan Chung, and Kevin Lang. Local graph partitioning using pagerank vectors. In FOCS '06. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. I. Antonellis, H. Garcia-Molina, and C. Chang. Simrank++: Query rewriting through link analysis of the click graph. In Technical Report, url: http://dbpubs.stanford.edu/pub/2007--32, 2007.Google ScholarGoogle Scholar
  3. Doug Beeferman and Adam Berger. Agglomerative clustering of a search engine query log. In KDD '00. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Nick Craswell and Martin Szummer. Random walks on the click graph. In Proc. SIGIR '07. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Jeffrey Dean and Sanjay Ghemawat. MapReduce: Simplified data processing on large clusters. In OSDI 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. 1990.Google ScholarGoogle Scholar
  7. Glen Jeh and Jennifer Widom. Simrank: a measure of structural-context similarity. In KDD '02. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Rosie Jones and Daniel C. Fain. Query word deletion prediction. In SIGIR '03. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Rosie Jones, Benjamin Rey, Omid Madani, and Wiley Greiner. Generating query substitutions. In WWW '07. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Christos H. Papadimitriou, Hisao Tamaki, Prabhakar Raghavan, and Santosh Vempala. Latent semantic indexing: a probabilistic analysis. In PODS '98. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Regelson and D. Fain. Predicting click-through rate using keyword clusters. In Proc. 2nd Workshop on Sponsored Search Auctions.Google ScholarGoogle Scholar
  12. Matthew Richardson, Ewa Dominowska, and Robert Ragno. Predicting clicks: Estimating the click-through rate for new ads. In WWW '07. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Ian Ruthven. Re-examining the potential effectiveness of interactive query expansion. In SIGIR '03, pages 213--220. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Sinclair. Algorithms for random generation and counting: A markov chain approach. In Birkhauser, Boston-Basel-Berlin, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Egidio Terra and Charles L. A. Clarke. Scoring missing terms in information retrieval tasks. In CIKM '04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Ji-Rong Wen, Jian-Yun Nie, and Hong-Jiang Zhang. Query clustering using user logs. ACM Trans. Inf. Syst., 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Wei Vivian Zhang, Xiaofei He, Benjamin Rey, and Rosie Jones. Query rewriting using active learning for sponsored search. In SIGIR '07. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Wei Vivian Zhang and Rosie Jones. Comparing click logs and editorial labels for training query rewriting. In Query Log Analysis Workshop, WWW '07.Google ScholarGoogle Scholar

Index Terms

  1. Simrank++: query rewriting through link analysis of the click graph

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader