skip to main content
10.1145/1390334.1390438acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

A general optimization framework for smoothing language models on graph structures

Authors Info & Claims
Published:20 July 2008Publication History

ABSTRACT

Recent work on language models for information retrieval has shown that smoothing language models is crucial for achieving good retrieval performance. Many different effective smoothing methods have been proposed, which mostly implement various heuristics to exploit corpus structures. In this paper, we propose a general and unified optimization framework for smoothing language models on graph structures. This framework not only provides a unified formulation of the existing smoothing heuristics, but also serves as a road map for systematically exploring smoothing methods for language models. We follow this road map and derive several different instantiations of the framework. Some of the instantiations lead to novel smoothing methods. Empirical results show that all such instantiations are effective with some outperforming the state of the art smoothing methods.

References

  1. M. Belkin and P. Niyogi. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput., 15(6):1373--1396, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst., 30(1-7):107--117, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. F. Chen and J. Goodman. An empirical study of smoothing techniques for language modeling. Technical Report TR-10-98, Harvard University, 1998.Google ScholarGoogle Scholar
  4. K. W. Church and P. Hanks. Word association norms, mutual information, and lexicography. Comput. Linguist., 16(1):22--29, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. W. B. Croft and J. Lafferty, editors. Language Modeling and Information Retrieval. Kluwer Academic Publishers, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. F. Diaz. Regularizing ad hoc retrieval scores. In Proceedings of CIKM'05, pages 672--679, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. Hiemstra and W. Kraaij. Twenty-one at TREC-7: Ad-hoc and cross-language track. In Proceedings of TREC 7, pages 227--238, 1998.Google ScholarGoogle Scholar
  8. J. M. Kleinberg. Authoritative sources in a hyperlinked environment. J. ACM, 46(5):604--632. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. O. Kurland and L. Lee. Corpus structure, language models, and ad hoc information retrieval. In Proceedings of SIGIR'04, pages 194--201. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. O. Kurland and L. Lee. Pagerank without hyperlinks: structural re-ranking using links induced by language models. In Proceedings of SIGIR '05, pages 306--313. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. O. Kurland and L. Lee. Respect my authority!: Hits without hyperlinks, utilizing cluster-based language models. In Proceedings of SIGIR '06, pages 83--90. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Lafferty and C. Zhai. Document language models, query models, and risk minimization for information retrieval. In Proceedings of SIGIR'01, pages 111--119. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. V. Lavrenko and B. Croft. Relevance-based language models. In Proceedings of SIGIR'01, pages 120--127. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. X. Liu and W. B. Croft. Cluster-based retrieval using language models. In Proceedings of SIGIR'04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Mihalcea and D. R. Radev, editors. Textgraphs: Graph-based methods for NLP, 2006.Google ScholarGoogle Scholar
  16. D. H. Miller, T. Leek, and R. Schwartz. A hidden Markov model information retrieval system. In Proceedings of SIGIR 1999, pages 214--221, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. In Proceedings of SIGIR 1998, pages 275--281, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. T. Qin, T.-Y. Liu, X.-D. Zhang, Z. Chen, and W.-Y. Ma. A study of relevance propagation for web search. In Proceedings of SIGIR 2005, pages 408--415, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Shakery and C. Zhai. Smoothing document language models with probabilistic term count propagation. Information Retrieval, 11(2):139--164, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. T. Tao, X. Wang, Q. Mei, and C. Zhai. Language model information retrieval with document expansion. In Proceedings of HLT/NAACL 2006, pages 407--414. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Xu and W. B. Croft. Cluster-based language models for distributed retrieval. In Proceedings of SIGIR'99, pages 254--261, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. C. Zhai and J. Lafferty. Model-based feedback in the language modeling approach to information retrieval. In Proceedings of CIKM'01, pages 403--410, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of ACM SIGIR'01, pages 334--342, Sept 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Schölkopf. Learning with local and global consistency. In NIPS, 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. D. Zhou and B. Schölkopf. Discrete regularization. Semi-supervised learning, pages 221--232, 2006.Google ScholarGoogle Scholar
  27. X. Zhu, Z. Ghahramani, and J. D. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In ICML, pages 912--919, 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A general optimization framework for smoothing language models on graph structures

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
      July 2008
      934 pages
      ISBN:9781605581644
      DOI:10.1145/1390334

      Copyright © 2008 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 July 2008

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader