Skip to main content
Erschienen in: Journal of Intelligent Information Systems 3/2015

01.06.2015

A hybrid approach of topic model and matrix factorization based on two-step recommendation framework

verfasst von: Xiangyu Zhao, Zhendong Niu, Wei Chen, Chongyang Shi, Ke Niu, Donglei Liu

Erschienen in: Journal of Intelligent Information Systems | Ausgabe 3/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Recommender systems become increasingly significant in solving the information explosion problem. Two typical kinds of techniques treat the recommendation problem as either a rating prediction or a ranking prediction one. In contrast, we propose a two-step framework that considers recommendation as a simulation of users’ behaviors to generate ratings. The first step is to predict the probability that a user rates an item, and the second step is to predict rating values. After that, the predicted results from both steps are combined to compute the expectations of users’ ratings on items, which are used to generate recommendations. Based on this framework, we propose a hybrid approach which uses topic model in the first step and matrix factorization in the second to solve the recommendation problem. Experiments with MovieLens and EachMovie datasets demonstrate the effectiveness of the proposed framework and the recommendation approach.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
0 is a typical value out of the range of rating scale, which can be used to distinguish the rating value and the rating behavior.
 
2
The effectiveness is demonstrated in the experiment section, in which HTMMF using rating behaviors in the first step gets better results than the one (L&S) with same model but using rating values in the first step.
 
3
Types of implicit feedback include rating behaviors, purchase history, browsing history, and search patterns.
 
6
The original ratings from EM1 are in 0-to-1 scale. We convert it to 0-to-5 scale, and then exclude the ratings with value 0 in order to fit the proposed recommendation framework.
 
7
Since the restriction of the column width, we only report some typical performances for different K T .
 
Literatur
Zurück zum Zitat Adomavicius, G., & Kwon, Y. (2012). Improving aggregate recommendation diversity using ranking-based techniques. IEEE Transactions on Knowledge and Data Engineering, 24(5), 896–911.CrossRef Adomavicius, G., & Kwon, Y. (2012). Improving aggregate recommendation diversity using ranking-based techniques. IEEE Transactions on Knowledge and Data Engineering, 24(5), 896–911.CrossRef
Zurück zum Zitat Aytekin, T., Karakaya, M.Ö. (2014). Clustering-based diversity improvement in top-n recommendation. Journal of Intelligent Information Systems, 42(1), 1–18.CrossRef Aytekin, T., Karakaya, M.Ö. (2014). Clustering-based diversity improvement in top-n recommendation. Journal of Intelligent Information Systems, 42(1), 1–18.CrossRef
Zurück zum Zitat Blei, D.M., Ng, A.Y., Jordan, M.I. (2003). Latent dirichlet allocation. The Journal of Machine Learning Research, 3, 993–1022.MATH Blei, D.M., Ng, A.Y., Jordan, M.I. (2003). Latent dirichlet allocation. The Journal of Machine Learning Research, 3, 993–1022.MATH
Zurück zum Zitat Breese, J.S., Heckerman, D., Kadie, C. (1998). Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the fourteenth conference on uncertainty in artificial intelligence (pp. 43–52). Morgan Kaufmann. Breese, J.S., Heckerman, D., Kadie, C. (1998). Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the fourteenth conference on uncertainty in artificial intelligence (pp. 43–52). Morgan Kaufmann.
Zurück zum Zitat Chen, W., Niu, Z., Zhao, X., Li, Y. (2014). A hybrid recommendation algorithm adapted in e-learning environments. World Wide Web, 17(2), 271–284.CrossRef Chen, W., Niu, Z., Zhao, X., Li, Y. (2014). A hybrid recommendation algorithm adapted in e-learning environments. World Wide Web, 17(2), 271–284.CrossRef
Zurück zum Zitat Cremonesi, P., Koren, Y., Turrin, R. (2010). Performance of recommender algorithms on top-n recommendation tasks. In Proceedings of the fourth ACM conference on recommender systems (pp. 39–46). ACM. Cremonesi, P., Koren, Y., Turrin, R. (2010). Performance of recommender algorithms on top-n recommendation tasks. In Proceedings of the fourth ACM conference on recommender systems (pp. 39–46). ACM.
Zurück zum Zitat Delgado, J., & Ishii, N. (1999). Memory-based weighted majority prediction. In ACM SIGIR’99 workshop on recommender systems. Citeseer. Delgado, J., & Ishii, N. (1999). Memory-based weighted majority prediction. In ACM SIGIR’99 workshop on recommender systems. Citeseer.
Zurück zum Zitat Griffiths, T.L., & Steyvers, M. (2004). Finding scientific topics. In Proceedings of the National Academy of Sciences of the United States of America (Vol. 101, Suppl 1, pp. 5228–5235). Griffiths, T.L., & Steyvers, M. (2004). Finding scientific topics. In Proceedings of the National Academy of Sciences of the United States of America (Vol. 101, Suppl 1, pp. 5228–5235).
Zurück zum Zitat Hofmann, T. (1999). Probabilistic latent semantic analysis. In Proceedings of the fifteenth conference on uncertainty in artificial intelligence (pp. 289–296). Morgan Kaufmann Hofmann, T. (1999). Probabilistic latent semantic analysis. In Proceedings of the fifteenth conference on uncertainty in artificial intelligence (pp. 289–296). Morgan Kaufmann
Zurück zum Zitat Hofmann, T. (2004). Latent semantic models for collaborative filtering. ACM Transactions on Information Systems (TOIS), 22(1), 89–115.CrossRef Hofmann, T. (2004). Latent semantic models for collaborative filtering. ACM Transactions on Information Systems (TOIS), 22(1), 89–115.CrossRef
Zurück zum Zitat Järvelin, K., & Kekäläinen, J. (2002). Cumulated gain-based evaluation of ir techniques. ACM Transactions on Information Systems (TOIS), 20(4), 422–446.CrossRef Järvelin, K., & Kekäläinen, J. (2002). Cumulated gain-based evaluation of ir techniques. ACM Transactions on Information Systems (TOIS), 20(4), 422–446.CrossRef
Zurück zum Zitat Kantor, P.B., Rokach, L., Ricci, F., Shapira, B. (2011). Recommender systems handbook. Springer. Kantor, P.B., Rokach, L., Ricci, F., Shapira, B. (2011). Recommender systems handbook. Springer.
Zurück zum Zitat Koren, Y. (2008). Factorization meets the neighborhood: A multifaceted collaborative filtering model. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 426–434). ACM. Koren, Y. (2008). Factorization meets the neighborhood: A multifaceted collaborative filtering model. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 426–434). ACM.
Zurück zum Zitat Koren, Y., & Sill, J. (2011). Ordrec: An ordinal model for predicting personalized item rating distributions. In Proceedings of the fifth ACM conference on recommender systems (pp. 117–124). ACM. Koren, Y., & Sill, J. (2011). Ordrec: An ordinal model for predicting personalized item rating distributions. In Proceedings of the fifth ACM conference on recommender systems (pp. 117–124). ACM.
Zurück zum Zitat Li, Y., Hu, J., Zhai, C., Chen, Y. (2010). Improving one-class collaborative filtering by incorporating rich user information. In Proceedings of the 19th ACM international conference on information and knowledge management (pp. 959–968). ACM. Li, Y., Hu, J., Zhai, C., Chen, Y. (2010). Improving one-class collaborative filtering by incorporating rich user information. In Proceedings of the 19th ACM international conference on information and knowledge management (pp. 959–968). ACM.
Zurück zum Zitat Liu, N.N., Xiang, E.W., Zhao, M., Yang, Q. (2010). Unifying explicit and implicit feedback for collaborative filtering. In Proceedings of the 19th ACM international conference on Information and knowledge management (pp. 1445–1448). ACM. Liu, N.N., Xiang, E.W., Zhao, M., Yang, Q. (2010). Unifying explicit and implicit feedback for collaborative filtering. In Proceedings of the 19th ACM international conference on Information and knowledge management (pp. 1445–1448). ACM.
Zurück zum Zitat Liu, N.N., Zhao, M., Yang, Q. (2009). Probabilistic latent preference analysis for collaborative filtering. In Proceedings of the 18th ACM conference on information and knowledge management (pp. 759–766). ACM. Liu, N.N., Zhao, M., Yang, Q. (2009). Probabilistic latent preference analysis for collaborative filtering. In Proceedings of the 18th ACM conference on information and knowledge management (pp. 759–766). ACM.
Zurück zum Zitat Liu, Q., Chen, E., Xiong, H., Ding, C.H., Chen, J. (2012). Enhancing collaborative filtering by user interest expansion via personalized ranking. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 42 (1), 218–233.CrossRefMATH Liu, Q., Chen, E., Xiong, H., Ding, C.H., Chen, J. (2012). Enhancing collaborative filtering by user interest expansion via personalized ranking. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 42 (1), 218–233.CrossRefMATH
Zurück zum Zitat Park, Y.J., & Tuzhilin, A. (2008). The long tail of recommender systems and how to leverage it. In Proceedings of the 2008 ACM conference on recommender systems(pp. 11–18). ACM. Park, Y.J., & Tuzhilin, A. (2008). The long tail of recommender systems and how to leverage it. In Proceedings of the 2008 ACM conference on recommender systems(pp. 11–18). ACM.
Zurück zum Zitat Perugini, S., Gonçalves, M.A., Fox, E.A (2004). Recommender systems research: A connection-centric survey. Journal of Intelligent Information Systems, 23(2), 107–143.CrossRefMATH Perugini, S., Gonçalves, M.A., Fox, E.A (2004). Recommender systems research: A connection-centric survey. Journal of Intelligent Information Systems, 23(2), 107–143.CrossRefMATH
Zurück zum Zitat Sarwar, B., Karypis, G., Konstan, J., Riedl, J. (2001). Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th international conference on World Wide Web (pp. 285–295). ACM. Sarwar, B., Karypis, G., Konstan, J., Riedl, J. (2001). Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th international conference on World Wide Web (pp. 285–295). ACM.
Zurück zum Zitat Shang, M.S., Lü, L., Zeng, W., Zhang, Y.C., Zhou, T. (2009). Relevance is more significant than correlation: Information filtering on sparse data. EPL (Europhysics Letters), 88 (6), 68, 008.CrossRef Shang, M.S., Lü, L., Zeng, W., Zhang, Y.C., Zhou, T. (2009). Relevance is more significant than correlation: Information filtering on sparse data. EPL (Europhysics Letters), 88 (6), 68, 008.CrossRef
Zurück zum Zitat Wei, X., & Croft, W.B. (2006). Lda-based document models for ad-hoc retrieval. In Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval (pp. 178–185). ACM. Wei, X., & Croft, W.B. (2006). Lda-based document models for ad-hoc retrieval. In Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval (pp. 178–185). ACM.
Zurück zum Zitat Weimer, M., Karatzoglou, A., Le, Q.V., Smola, A.J. (2007). Cofi rank-maximum margin matrix factorization for collaborative ranking. In Advances in neural information processing systems (pp. 1593–1600). Weimer, M., Karatzoglou, A., Le, Q.V., Smola, A.J. (2007). Cofi rank-maximum margin matrix factorization for collaborative ranking. In Advances in neural information processing systems (pp. 1593–1600).
Zurück zum Zitat Zhao, X., Niu, Z., Chen, W. (2013). Interest before liking: Two-step recommendation approaches. Knowledge-Based Systems, 48, 46–56.CrossRef Zhao, X., Niu, Z., Chen, W. (2013). Interest before liking: Two-step recommendation approaches. Knowledge-Based Systems, 48, 46–56.CrossRef
Zurück zum Zitat Zhou, Y., Wilkinson, D., Schreiber, R., Pan, R. (2008). Large-scale parallel collaborative filtering for the netflix prize. In Algorithmic aspects in information and management (pp. 337–348). Springer. Zhou, Y., Wilkinson, D., Schreiber, R., Pan, R. (2008). Large-scale parallel collaborative filtering for the netflix prize. In Algorithmic aspects in information and management (pp. 337–348). Springer.
Metadaten
Titel
A hybrid approach of topic model and matrix factorization based on two-step recommendation framework
verfasst von
Xiangyu Zhao
Zhendong Niu
Wei Chen
Chongyang Shi
Ke Niu
Donglei Liu
Publikationsdatum
01.06.2015
Verlag
Springer US
Erschienen in
Journal of Intelligent Information Systems / Ausgabe 3/2015
Print ISSN: 0925-9902
Elektronische ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-014-0334-3

Weitere Artikel der Ausgabe 3/2015

Journal of Intelligent Information Systems 3/2015 Zur Ausgabe

Premium Partner