skip to main content
10.1145/3077136.3080786acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article
Honorable Mention

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models

Published:07 August 2017Publication History

ABSTRACT

This paper provides a unified account of two schools of thinking in information retrieval modelling: the generative retrieval focusing on predicting relevant documents given a query, and the discriminative retrieval focusing on predicting relevancy given a query-document pair. We propose a game theoretical minimax game to iteratively optimise both models. On one hand, the discriminative model, aiming to mine signals from labelled and unlabelled data, provides guidance to train the generative model towards fitting the underlying relevance distribution over documents given the query. On the other hand, the generative model, acting as an attacker to the current discriminative model, generates difficult examples for the discriminative model in an adversarial way by minimising its discrimination objective. With the competition between these two models, we show that the unified framework takes advantage of both schools of thinking: (i) the generative model learns to fit the relevance distribution over documents via the signals from the discriminative model, and (ii) the discriminative model is able to exploit the unlabelled data selected by the generative model to achieve a better estimation for document ranking. Our experimental results have demonstrated significant performance gains as much as 23.96% on Precision@5 and 15.50% on MAP over strong baselines in a variety of applications including web search, item recommendation, and question answering.

References

  1. Ricardo Baeza-Yates, Berthier Ribeiro-Neto, and others. 1999. Modern information retrieval.Google ScholarGoogle Scholar
  2. Oren Barkan and Noam Koenigstein 2016. Item2vec: neural item embedding for collaborative filtering MLSP Workshop.Google ScholarGoogle Scholar
  3. Chris Burges, Tal Shaked, Erin Renshaw, Ari Lazier, Matt Deeds, Nicole Hamilton, and Greg Hullender 2005. Learning to Rank Using Gradient Descent. In ICML. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Christopher J. C. Burges. 2010. From RankNet to LambdaRank to LambdaMART: An Overview. Learning (2010).Google ScholarGoogle Scholar
  5. Christopher J. C. Burges, Robert Ragno, and Quoc Viet Le. 2006. Learning to Rank with Nonsmooth Cost Functions. In NIPS.Google ScholarGoogle Scholar
  6. Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li 2007. Learning to rank: from pairwise approach to listwise approach ICML.Google ScholarGoogle Scholar
  7. Wei Chen, Tie-Yan Liu, Yanyan Lan, Zhi-Ming Ma, and Hang Li 2009. Ranking Measures and Loss Functions in Learning to Rank NIPS. 315--323.Google ScholarGoogle Scholar
  8. Paul Covington, Jay Adams, and Emre Sargin. 2016. Deep neural networks for youtube recommendations. RecSys. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Cícero Nogueira dos Santos, Ming Tan, Bing Xiang, and Bowen Zhou 2016. Attentive Pooling Networks. CoRR (2016).Google ScholarGoogle Scholar
  10. Minwei Feng, Bing Xiang, Michael R Glass, Lidan Wang, and Bowen Zhou 2015. Applying deep learning to answer selection: A study and an open task ASRU Workshop.Google ScholarGoogle Scholar
  11. Yoav Freund, Raj Iyer, Robert E Schapire, and Yoram Singer. 2003. An Efficient Boosting Algorithm for Combining Preferences. JMLR (2003).Google ScholarGoogle Scholar
  12. Ian Goodfellow. 2016. NIPS 2016 Tutorial: Generative Adversarial Networks. arXiv preprint arXiv:1701.00160 (2016).Google ScholarGoogle Scholar
  13. Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In NIPS.Google ScholarGoogle Scholar
  14. Ian J. Goodfellow. 2014. On Distinguishability Criteria for Estimating Generative Models. arXiv:1412.6515 (2014).Google ScholarGoogle Scholar
  15. Michael Gutmann and Aapo Hyvärinen 2010. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models AISTATS.Google ScholarGoogle Scholar
  16. Ralf Herbrich, Thore Graepel, and Klaus Obermayer. 2000. Large Margin Rank Boundaries for Ordinal Regression. Advances in Large Margin Classifiers.Google ScholarGoogle Scholar
  17. Thomas Hofmann. 1999. Probabilistic Latent Semantic Indexing. In SIGIR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Ferenc Huszár. 2015. How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary? arXiv preprint arXiv:1511.05101 (2015).Google ScholarGoogle Scholar
  19. Thorsten Joachims. 2002. Optimizing search engines using clickthrough data. KDD. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Yoon Kim 2014. Convolutional Neural Networks for Sentence Classification. arXiv preprint arXiv:1408.5882 (2014).Google ScholarGoogle Scholar
  21. Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer (2009).Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. John Lafferty and Chengxiang Zhai 2002. Probabilistic Relevance Models Based on Document and Query Generation Language Modeling and Information Retrieval.Google ScholarGoogle Scholar
  23. Ping Li, Christopher J. C. Burges, Qiang Wu, J. C. Platt, D. Koller, Y. Singer, and S. Roweis 2007. McRank: Learning to Rank Using Multiple Classification and Gradient Boosting. NIPS.Google ScholarGoogle Scholar
  24. Tie-Yan Liu. 2009. Learning to rank for information retrieval. Foundations and Trends in Information Retrieval (2009).Google ScholarGoogle Scholar
  25. Tie-Yan Liu, Jun Xu, Tao Qin, Wenying Xiong, and Hang Li 2007. LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval Proceedings of SIGIR 2007 Workshop on Learning to Rank for Information Retrieval.Google ScholarGoogle Scholar
  26. Jiyun Luo, Sicong Zhang, and Hui Yang 2014. Win-win search: dual-agent stochastic game in session search SIGIR.Google ScholarGoogle Scholar
  27. H. Brendan McMahan, Gary Holt, David Sculley, Michael Young, Dietmar Ebner, Julian Grady, Lan Nie, Todd Phillips, Eugene Davydov, Daniel Golovin, and others 2013. Ad click prediction: a view from the trenches. In KDD. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean 2013. Distributed representations of words and phrases and their compositionality NIPS.Google ScholarGoogle Scholar
  29. Mehdi Mirza and Simon Osindero 2014. Conditional Generative Adversarial Nets. arXiv preprint arXiv:1411.1784 (2014).Google ScholarGoogle Scholar
  30. In Jae Myung. 2003. Tutorial on maximum likelihood estimation. Journal of mathematical Psychology (2003).Google ScholarGoogle Scholar
  31. Ramesh Nallapati. 2004. Discriminative Models for Information Retrieval. SIGIR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Jay M. Ponte and W. Bruce Croft 1998. A language modeling approach to information retrieval SIGIR.Google ScholarGoogle Scholar
  33. Steffen Rendle. 2010. Factorization machines. In ICDM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme 2009. BPR: Bayesian personalized ranking from implicit feedback UAI.Google ScholarGoogle Scholar
  35. Stephen E. Robertson and K. Sparck Jones 1976. Relevance weighting of search terms. Journal of the American Society for Information science (1976).Google ScholarGoogle Scholar
  36. Tim Salimans, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, and Xi Chen. 2016. Improved Techniques for Training GANs. In NIPS.Google ScholarGoogle Scholar
  37. Aliaksei Severyn and Alessandro Moschitti 2015. Learning to rank short text pairs with convolutional deep neural networks SIGIR.Google ScholarGoogle Scholar
  38. Richard S. Sutton, David A. McAllester, Satinder P. Singh, Yishay Mansour, and others 1999. Policy Gradient Methods for Reinforcement Learning with Function Approximation NIPS.Google ScholarGoogle Scholar
  39. Tao Tao and ChengXiang Zhai 2006. Regularized estimation of mixture models for robust pseudo-relevance feedback SIGIR. ACM, 162--169.Google ScholarGoogle Scholar
  40. Di Wang and Eric Nyberg 2015. A Long Short-Term Memory Model for Answer Sentence Selection in Question Answering ACL.Google ScholarGoogle Scholar
  41. Jun Wang, Arjen P. De Vries, and Marcel J. T. Reinders. 2006. Unifying user-based and item-based collaborative filtering approaches by similarity fusion SIGIR.Google ScholarGoogle Scholar
  42. Ronald J. Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning (1992).Google ScholarGoogle Scholar
  43. Yingce Xia, Di He, Tao Qin, Liwei Wang, Nenghai Yu, Tie-Yan Liu, and Wei-Ying Ma 2016. Dual Learning for Machine Translation. In NIPS.Google ScholarGoogle Scholar
  44. Lantao Yu, Weinan Zhang, Jun Wang, and Yong Yu. 2017. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient AAAI.Google ScholarGoogle Scholar
  45. Shipeng Yu, Deng Cai, Ji-Rong Wen, and Wei-Ying Ma. 2003. Improving pseudo-relevance feedback in web information retrieval using web page segmentation. In WWW. ACM, 11--18. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Fajie Yuan, Guibing Guo, Joemon M. Jose, Long Chen, Haitao Yu, and Weinan Zhang. 2016. Lambdafm: learning optimal ranking with factorization machines using lambda surrogates CIKM.Google ScholarGoogle Scholar
  47. ChengXiang Zhai. 2016. Towards a game-theoretic framework for text data retrieval. IEEE Data Eng. Bull. (2016).Google ScholarGoogle Scholar
  48. Chengxiang Zhai and John Lafferty 2004. A study of smoothing methods for language models applied to information retrieval. TOIS (2004).Google ScholarGoogle Scholar
  49. ChengXiang Zhai and John D. Lafferty 2001. Model-based Feedback in the Language Modeling Approach to Information Retrieval CIKM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Peng Zhang, Qian Yu, Yuexian Hou, Dawei Song, Jingfei Li, and Bin Hu. 2017. A Distribution Separation Method Using Irrelevance Feedback Data for Information Retrieval. ACM TIST (2017).Google ScholarGoogle Scholar
  51. Weinan Zhang, Tianqi Chen, Jun Wang, and Yong Yu. 2013. Optimizing top-n collaborative filtering via dynamic negative item sampling SIGIR.Google ScholarGoogle Scholar

Index Terms

  1. IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
      August 2017
      1476 pages
      ISBN:9781450350228
      DOI:10.1145/3077136

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 7 August 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      SIGIR '17 Paper Acceptance Rate78of362submissions,22%Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader