skip to main content
10.1145/2645710.2645754acmconferencesArticle/Chapter ViewAbstractPublication PagesrecsysConference Proceedingsconference-collections
research-article

Bayesian binomial mixture model for collaborative prediction with non-random missing data

Authors Info & Claims
Published:06 October 2014Publication History

ABSTRACT

Collaborative prediction involves filling in missing entries of a user-item matrix to predict preferences of users based on their observed preferences. Most of existing models assume that the data is missing at random (MAR), which is often violated in recommender systems in practice. Incorrect assumption on missing data ignores the missing data mechanism, leading to biased inferences and prediction. In this paper we present a Bayesian binomial mixture model for collaborative prediction, where the generative process for data and missing data mechanism are jointly modeled to handle non-random missing data. Missing data mechanism is modeled by three factors, each of which is related to users, items, and rating values. Each factor is modeled by Bernoulli random variable, and the observation of rating value is determined by the Boolean OR operation of three binary variables. We develop computationally-efficient variational inference algorithms, where variational parameters have closed-form update rules and the computational complexity depends on the number of observed ratings, instead of the size of the rating data matrix. We also discuss implementation issues on hyperparameter tuning and estimation based on empirical Bayes. Experiments on Yahoo! Music and MovieLens datasets confirm the useful behavior of our model by demonstrating that: (1) it outperforms state-of-the-art methods in yielding higher predictive performance; (2) it finds meaningful solutions instead of undesirable boundary solutions; (3) it provides rating trend analysis on why ratings are observed.

Skip Supplemental Material Section

Supplemental Material

p201-sidebyside.mp4

mp4

45.4 MB

References

  1. E. Candès and T. Tao. The power of convex relaxation: Near-optimal matrix completion. IEEE Transactions on Information Theory, 56(5):2053--2080, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. G. Dror, N. Koenigstein, Y. Koren, and M. Weimer. The yahoo! music dataset and KDD-Cup'11. In Proceedings of KDD Cup and Workshop, 2011.Google ScholarGoogle Scholar
  3. Y.-D. Kim and S. Choi. Variational Bayesian view of weighted trace norm regularization for matrix factorization. IEEE Signal Processing Letters, 20(3):261--264, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  4. Y.-D. Kim and S. Choi. Scalable variational Bayesian matrix factorization with side information. In Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS), Reykjavik, Iceland, 2014.Google ScholarGoogle Scholar
  5. Y. J. Lim and Y. W. Teh. Variational Bayesian approach to movie rating prediction. In Proceedings of KDD Cup and Workshop, San Jose, CA, 2007.Google ScholarGoogle Scholar
  6. G. Ling, H. Yang, M. R. Lyu, and I. King. Response aware model-based collaborative filtering. In Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence (UAI), Catalina Island, California, USA, 2012.Google ScholarGoogle Scholar
  7. R. J. A. Little and D. B. Rubin. Statistical Analysis with Missing Data. John Willey & Sons, Inc., 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. B. M. Marlin. Modeling user rating profiles for collaborative filtering. In Advances in Neural Information Processing Systems (NIPS), volume 16, 2004.Google ScholarGoogle Scholar
  9. B. M. Marlin. Missing Data Problems in Machine Learning. PhD thesis, University of Toronto, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. M. Marlin and R. S. Zemel. Collaborative prediction and ranking with non-random missing data. In Proceedings of the ACM International Conference on Recommender Systems (RecSys), New York, New York, USA, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. B. M. Marlin, R. S. Zemel, S. T. Roweis, and M. Slaney. Collaborative filtering and the missing at random assumption. In Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence (UAI), Vancouver, Canada, 2007.Google ScholarGoogle Scholar
  12. T. Minka. Estimating a Dirichlet distribution. Technical report, Microsoft Research, 2000.Google ScholarGoogle Scholar
  13. U. Paquet and N. Koenigstein. One-class collaborative filtering with random graphs. In Proceedings of the International Conference on World Wide Web (WWW), Rio de Janeiro, Brazil, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. Park, Y.-D. Kim, and S. Choi. Hierarchical Bayesian matrix factorization with side information. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. Raiko, A. Ilin, and J. Karhunen. Principal component analysis for large scale problems with lots of missing values. In Proceedings of the European Conference on Machine Learning (ECML), pages 691--698, Warsaw, Poland, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Salakhutdinov and A. Mnih. Bayesian probablistic matrix factorization using MCMC. In Proceedings of the International Conference on Machine Learning (ICML), Helsinki, Finland, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Salakhutdinov, A. Mnih, and G. Hinton. Restricted Boltzmann machines for collaborative filtering. In Proceedings of the International Conference on Machine Learning (ICML), Corvallis, OR, USA, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. R. Salakhutdinov and N.Srebro. Collaborative filtering in a non-uniform world: Learning with weighted trace norm. In Advances in Neural Information Processing Systems (NIPS), volume 23. MIT Press, 2010.Google ScholarGoogle Scholar
  19. H. Steck. Training and testing of recommender systems on data missing not at random. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Washington, DC, USA, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Wu. Binomial matrix factorization for discrete collaborative filtering. In Proceedings of the IEEE International Conference on Data Mining (ICDM), Miami, Florida, USA, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Yoo and S. Choi. Bayesian matrix co-factorization: Variational algorithm and Cramér-Rao bound. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), Athens, Greece, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Bayesian binomial mixture model for collaborative prediction with non-random missing data

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              RecSys '14: Proceedings of the 8th ACM Conference on Recommender systems
              October 2014
              458 pages
              ISBN:9781450326681
              DOI:10.1145/2645710

              Copyright © 2014 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 6 October 2014

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article

              Acceptance Rates

              RecSys '14 Paper Acceptance Rate35of234submissions,15%Overall Acceptance Rate254of1,295submissions,20%

              Upcoming Conference

              RecSys '24
              18th ACM Conference on Recommender Systems
              October 14 - 18, 2024
              Bari , Italy

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader