ABSTRACT
Most existing content-based filtering approaches including Rocchio, Language Models, SVM, Logistic Regression, Neural Networks, etc. learn user profiles independently without capturing the similarity among users. The Bayesian hierarchical models learn user profiles jointly and have the advantage of being able to borrow information from other users through a Bayesian prior. The standard Bayesian hierarchical model assumes all user profiles are generated from the same prior. However, considering the diversity of user interests, this assumption might not be optimal. Besides, most existing content-based filtering approaches implicitly assume that each user profile corresponds to exactly one user interest and fail to capture a user's multiple interests (information needs).
In this paper, we present a flexible Bayesian hierarchical modeling approach to model both commonality and diversity among users as well as individual users' multiple interests. We propose two models each with different assumptions, and the proposed models are called Discriminative Factored Prior Models (DFPM). In our models, each user profile is modeled as a discriminative classifier with a factored model as its prior, and different factors contribute in different levels to each user profile. Compared with existing content-based filtering models, DFPM are interesting because they can 1) borrow discriminative criteria of other users while learning a particular user profile through the factored prior; 2) trade off well between diversity and commonality among users; and 3) handle the challenging classification situation where each class contains multiple concepts. The experimental results on a dataset collected from real users on digg.com show that our models significantly outperform the baseline models of L-2 regularized logistic regression and the standard Bayesian hierarchical model with logistic regression
- The digg website. https://www.digg.com.Google Scholar
- Empirical bayes method. http://en.wikipedia.org/wiki/Empirical_Bayes_method.Google Scholar
- Hierarchical modeling in bbr. http://www.stat.rutgers.edu/~madigan/BBR/hier.html.Google Scholar
- K. Yu, V. Tresp, and S. Yu. A nonparametric hierarchical bayesian framework for information filtering. In SIGIR, 2004. Google ScholarDigital Library
- J. Zhang, Z. Ghahramani, and Y. Yang. Flexible latent variable models for multi-task learning. Mach. Learn., 73(3):221--242, 2008. Google ScholarDigital Library
- Y. Zhang and J. Koren. Efficient bayesian hierarchical user modeling for recommendation system. In SIGIR ’07, pages 47--54, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
Index Terms
- Discriminative factored prior models for personalized content-based recommendation
Recommendations
Personalized rough-set-based recommendation by integrating multiple contents and collaborative information
In recent years, explosively-growing information makes the users confused in making decisions among various kinds of products such as music, movies, books, etc. As a result, it is a challenging issue to help the user identify what she/he prefers. To ...
Content-based filtering for recommendation systems using multiattribute networks
We propose a content-based filtering algorithm based on a multiattribute network.Network analysis can consider similarities among indirectly-connected items.The proposed method addresses the data sparsity and over-specialization problems.The experiment ...
Personalized Recommendation Algorithm Using User Demography Information
WKDD '09: Proceedings of the 2009 Second International Workshop on Knowledge Discovery and Data MiningPersonalized recommendation systems are web-based systems that aim at predicting a user’s interest on available products and services by relying on previously rated items and dealing with the problem of information and product overload. User demography ...
Comments