skip to main content
10.1145/1526709.1526725acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Matchbox: large scale online bayesian recommendations

Published:20 April 2009Publication History

ABSTRACT

We present a probabilistic model for generating personalised recommendations of items to users of a web service. The Matchbox system makes use of content information in the form of user and item meta data in combination with collaborative filtering information from previous user behavior in order to predict the value of an item for a user. Users and items are represented by feature vectors which are mapped into a low-dimensional `trait space' in which similarity is measured in terms of inner products. The model can be trained from different types of feedback in order to learn user-item preferences. Here we present three alternatives: direct observation of an absolute rating each user gives to some items, observation of a binary preference (like/ don't like) and observation of a set of ordinal ratings on a user-specific scale. Efficient inference is achieved by approximate message passing involving a combination of Expectation Propagation (EP) and Variational Message Passing. We also include a dynamics model which allows an item's popularity, a user's taste or a user's personal rating scale to drift over time. By using Assumed-Density Filtering (ADF) for training, the model requires only a single pass through the training data. This is an on-line learning algorithm capable of incrementally taking account of new data so the system can immediately reflect the latest user preferences. We evaluate the performance of the algorithm on the MovieLens and Netflix data sets consisting of approximately 1,000,000 and 100,000,000 ratings respectively. This demonstrates that training the model using the on-line ADF approach yields state-of-the-art performance with the option of improving performance further if computational resources are available by performing multiple EP passes over the training data.

References

  1. Netflix Cinematch: http://www.netflix.com.Google ScholarGoogle Scholar
  2. R. M. Bell and Y. Koren. Lessons from the Netflix prize challenge. ACM SIGKDD Explorations Newsletter, 9:75--79, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th ACM Conference on Uncertainty in Artificial Intelligence, pages 34--52, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. W. Chu and Z. Ghahramani. Gaussian processes for ordinal regression. pages 1019--1041, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. Dangauthier, R. Herbrich, T. Minka, and T. Graepel. Trueskill through time: Revisiting the history of chess. In Advances in Neural Information Processing Systems 20, pages 337--344, 2008.Google ScholarGoogle Scholar
  6. Abhinandan S. Das, Mayur Datar, Ashutosh Garg, and Shyam Rajaram. Google news personalization: scalable online collaborative filtering. In WWW '07: Proceedings of the 16th international conference on World Wide Web, pages 271--280, New York, NY, USA, 2007. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. N. Goldberg, B. M. Oki, and D. Terry. Using collaborative filtering to weave an information tapestry. Communications of the ACM, 35:61--70, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Herbrich, T. Minka, and T. Graepel. TrueSkill(TM): A Bayesian skill rating system. In Advances in Neural Information Processing Systems 20, pages 569--576, 2007.Google ScholarGoogle Scholar
  9. M. K. Hughey and M. W. Berry. Improved query matching using kd--trees: A latent semantic indexing enhancement. Information Retrieval, 2:287--302, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. F. R. Kschischang, B. Frey, and H.-A. Loeliger. Factor graphs and the sum--product algorithm. IEEE Trans. Inform. Theory, 47(2):498--519, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. X. N. Lam, T. Vu, T. D. Le, and A. D. Duong. Addressing cold-start problem in recommendation systems. In Proceedings of the 2nd international conference on Ubiquitous information management and communication, pages 208--211, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Y. J. Lim and Y. W. Teh. Variational Bayesian approach to movie rating prediction. In Proceedings of KDD Cup and Workshop, 2007.Google ScholarGoogle Scholar
  13. Benjamin Marlin. Collaborative filtering: A machine learning perspective. Master's thesis, University of Toronto, 2004.Google ScholarGoogle Scholar
  14. T. Minka. Divergence measures and message passing. Technical Report MSR-TR-2007-173, Microsoft Research Ltd., 2005.Google ScholarGoogle Scholar
  15. T. Minka, J.M. Winn, J.P. Guiver, and A. Kannan. Infer.NET 2.2, 2009. Microsoft Research Cambridge. http://research.microsoft.com/infernet.Google ScholarGoogle Scholar
  16. Thomas Minka. A family of algorithms for approximate Bayesian inference. PhD thesis, MIT, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A. Mnih R. Salakhutdinov and G. Hinton. Restricted Boltzmann machines for collaborative filtering. In Proceedings of the 24th Annual International Conference on Machine Learning, pages 791--798, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. B. Schafer, J. Konstan, and J. Riedi. Recommender systems in E-commerce. In Proceedings of the 1st ACM conference on Electronic commerce, pages 158--166, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. H. R. Varian and P. Resnick. Recommender systems. Communications of the ACM, 40:56--58, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. M. Winn. Variational message passing and its application. PhD thesis, Department of Physics, University of Cambridge, 2003.Google ScholarGoogle Scholar

Index Terms

  1. Matchbox: large scale online bayesian recommendations

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader