Published in: Data Mining and Knowledge Discovery 5/2017

24-03-2017

Social regularized von Mises–Fisher mixture model for item recommendation

Authors: Aghiles Salah, Mohamed Nadif



Abstract

Collaborative filtering (CF) is a widely used technique for guiding users of web applications towards items that might interest them. CF approaches are severely challenged by the characteristics of user-item preference matrices, which are often high dimensional and extremely sparse. Recently, several works have shown that incorporating information from social networks, such as friendship and trust relationships, into traditional CF alleviates sparsity-related issues and, in most cases, yields better recommendation quality. More interestingly, even when performance is comparable, social-based CF is more beneficial than traditional CF because it makes it possible to provide recommendations for cold start users. In this paper, we propose a novel model that leverages information from social networks to improve recommendations. While existing social CF models rely on popular modelling assumptions such as the Gaussian or Multinomial, our model builds on the von Mises–Fisher assumption, which turns out to be better suited to high dimensional sparse data. Estimating the model parameters under the maximum likelihood approach, we derive a scalable learning algorithm for analyzing data with our model. Empirical results on several real-world datasets provide strong support for the advantages of the proposed model.
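
For readers unfamiliar with the von Mises–Fisher (vMF) assumption named in the abstract, the block below recalls the textbook vMF density and a generic finite mixture built from it. This is a sketch of the standard definitions only, not the paper's social regularized objective. The symbols \(d\) (dimension), \(\boldsymbol{\mu}\) (mean direction), \(\kappa\) (concentration), \(\alpha_h\) (mixing proportions), \(g\) (number of components) and \(I_\nu\) (modified Bessel function of the first kind) are notation assumed here and do not appear on this page.

```latex
% Standard vMF density on the unit hypersphere (textbook form, not copied from the paper)
f(\mathbf{x} \mid \boldsymbol{\mu}, \kappa)
  = c_d(\kappa)\, \exp\!\left(\kappa\, \boldsymbol{\mu}^{\top} \mathbf{x}\right),
\qquad \|\mathbf{x}\| = \|\boldsymbol{\mu}\| = 1,\ \kappa > 0,
\qquad
c_d(\kappa) = \frac{\kappa^{d/2-1}}{(2\pi)^{d/2}\, I_{d/2-1}(\kappa)}

% Generic finite mixture with g vMF components
f(\mathbf{x} \mid \Theta)
  = \sum_{h=1}^{g} \alpha_h\, f(\mathbf{x} \mid \boldsymbol{\mu}_h, \kappa_h),
\qquad \alpha_h \ge 0,\ \sum_{h=1}^{g} \alpha_h = 1
```

Since \(\mathbf{x}\) and \(\boldsymbol{\mu}\) have unit length, the exponent \(\boldsymbol{\mu}^{\top}\mathbf{x}\) is their cosine similarity, so the density depends only on the direction of \(\mathbf{x}\).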


Footnotes
1
In the rest of this paper we treat “direction data” and “\(L_2\) normalized data” as synonyms.
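
As a minimal illustration of what "\(L_2\) normalized data" means, the sketch below scales a rating vector to unit Euclidean length so that only its direction remains. The function name `l2_normalize` and the example ratings are hypothetical and not taken from the paper.

```python
import math

def l2_normalize(row):
    """Scale a vector to unit Euclidean (L2) length.

    All-zero vectors have no direction and are returned unchanged.
    """
    norm = math.sqrt(sum(v * v for v in row))
    if norm == 0.0:
        return list(row)
    return [v / norm for v in row]

# Hypothetical user with ratings on three items (0.0 = unrated).
ratings = [3.0, 0.0, 4.0]
direction = l2_normalize(ratings)
print(direction)  # [0.6, 0.0, 0.8]
```

After normalization, two users whose ratings differ only by a positive scale factor map to the same point on the unit hypersphere.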
 
6
Several variants of nDCG exist; here we adopt the same variant as LibRec, for fairness.
 
7
Cold start users are users who have expressed only a few ratings or social interactions. Following previous works (Jamali and Ester 2010; Guo et al. 2015), we consider users who have expressed fewer than five ratings to be cold start users in the preference matrix. Similarly, users who have fewer than five social relations are considered cold start users in the social network.
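
The threshold rule in this footnote can be sketched as follows. This is an illustrative helper, not the authors' code: the name `cold_start_users`, the pair-based input format, and the choice to count relations per source user are all assumptions made here.

```python
from collections import Counter

# Threshold from the footnote: fewer than five ratings (or social relations).
COLD_START_THRESHOLD = 5

def cold_start_users(all_users, interactions, threshold=COLD_START_THRESHOLD):
    """Return the users with fewer than `threshold` interactions.

    `interactions` is an iterable of (user, other) pairs, e.g. (user, item)
    for ratings or (user, friend) for social relations. Counting per first
    element of the pair is an assumption of this sketch.
    """
    counts = Counter(user for user, _ in interactions)
    # Counter returns 0 for users that never appear, so they count as cold start.
    return {u for u in all_users if counts[u] < threshold}

# Hypothetical toy data: u1 has five ratings, u2 has one, u3 has none.
users = {"u1", "u2", "u3"}
ratings = [("u1", f"i{k}") for k in range(5)] + [("u2", "i0")]
cold = cold_start_users(users, ratings)
print(sorted(cold))  # ['u2', 'u3']
```

The same function can be applied to the social network by passing the list of relations in place of the ratings.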
 
8
We observed the same behaviour on the Flixster dataset; these results are not reported here, to keep the presentation concise.
 
Literature
Amatriain X, Castells P, de Vries A, Posse C (2012) Workshop on recommendation utility evaluation: beyond RMSE–RUE 2012. In: ACM conference on recommender systems (RecSys), pp 351–352
Banerjee A, Dhillon IS, Ghosh J, Sra S (2005) Clustering on the unit hypersphere using von Mises–Fisher distributions. J Mach Learn Res 6:1345–1382
Barbieri N, Manco G, Ritacco E (2014) Probabilistic approaches to recommendations. Synth Lect Data Min Knowl Discov 5(2):1–197
Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: a geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7:2399–2434
Bobadilla J, Ortega F, Hernando A, Gutiérrez A (2013) Recommender systems survey. Knowl Based Syst 46:109–132
Cai D, Mei Q, Han J, Zhai C (2008) Modeling hidden topics on document manifold. In: Proceedings of the ACM conference on information and knowledge management, pp 911–920
Chaney AJ, Blei DM, Eliassi-Rad T (2015) A probabilistic model for using social networks in personalized item recommendation. In: ACM conference on recommender systems (RecSys), pp 43–50
Cremonesi P, Koren Y, Turrin R (2010) Performance of recommender algorithms on top-n recommendation tasks. In: ACM conference on recommender systems (RecSys), pp 39–46
Delporte J, Karatzoglou A, Matuszczyk T, Canu S (2013) Socially enabled preference learning from implicit feedback data. In: Joint European conference on machine learning and knowledge discovery in databases (ECML PKDD), Springer, Berlin, pp 145–160
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B (Methodol) 39:1–38
Dhillon IS, Modha DS (2001) Concept decompositions for large sparse text data using clustering. Mach Learn 42(1–2):143–175
Dhillon IS, Mallela S, Modha DS (2003) Information-theoretic co-clustering. In: Proceedings of the ACM SIGKDD international conference on Knowledge discovery and data mining, pp 89–98
Gopal S, Yang Y (2014) Von Mises–Fisher clustering models. In: Proceedings of the international conference on machine learning (ICML), pp 154–162
Guo G, Zhang J, Yorke-Smith N (2013) A novel Bayesian similarity measure for recommender systems. In: Proceedings of the international joint conference on artificial intelligence (IJCAI), pp 2619–2625
Guo G, Zhang J, Thalmann D, Yorke-Smith N (2014) ETAF: an extended trust antecedents framework for trust prediction. In: IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM), pp 540–547
Guo G, Zhang J, Yorke-Smith N (2015) TrustSVD: collaborative filtering with both the explicit and implicit influence of user trust and of item ratings. In: Proceedings of the international joint conference on artificial intelligence (AAAI), pp 123–129
He X, Cai D, Shao Y, Bao H, Han J (2011) Laplacian regularized gaussian mixture model for data clustering. IEEE Trans Knowl Data Eng (TKDE) 23(9):1406–1418
Jamali M, Ester M (2010) A matrix factorization technique with trust propagation for recommendation in social networks. In: ACM conference on recommender systems (RecSys), pp 135–142
Koren Y (2008) Factorization meets the neighborhood: a multifaceted collaborative filtering model. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, ACM, pp 426–434
Koren Y, Bell R, Volinsky C (2009) Matrix factorization techniques for recommender systems. Computer 42(8):30–37
Le T, Lauw HW (2014) Semantic visualization for spherical representation. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, pp 1007–1016
Linden G, Smith B, York J (2003) Amazon.com recommendations: item-to-item collaborative filtering. IEEE Internet Comput 7(1):76–80
Liu H, Hu Z, Mian A, Tian H, Zhu X (2014) A new user similarity model to improve the accuracy of collaborative filtering. Knowl Based Syst 56:156–166
Loiacono D, Lommatzsch A, Turrin R (2014) An analysis of the 2014 RecSys challenge. In: ACM conference on recommender systems (RecSys), p 1
Ma H, Yang H, Lyu MR, King I (2008) Sorec: social recommendation using probabilistic matrix factorization. In: Proceedings of the ACM international on conference on information and knowledge management (CIKM), pp 931–940
Ma H, King I, Lyu MR (2009) Learning to recommend with social trust ensemble. In: Proceedings of the international ACM SIGIR conference on research and development in information retrieval, ACM, pp 203–210
Ma H, Zhou D, Liu C, Lyu MR, King I (2011) Recommender systems with social regularization. In: Proceedings of the ACM WSDM international conference on web search and data mining, pp 287–296
Mardia K, Jupp P (2009) Directional statistics. Wiley Series in Probability and Statistics. Wiley, New York
McLachlan G, Krishnan T (2007) The EM algorithm and extensions, vol 382. Wiley, New York
McLachlan G, Peel D (2004) Finite mixture models. Wiley, New York
Mei Q, Cai D, Zhang D, Zhai C (2008) Topic modeling with network regularization. In: Proceedings of the international conference on world wide web (WWW), pp 101–110
Nadif M, Govaert G (2010) Model-based co-clustering for continuous data. In: Proceedings of international conference on machine learning and applications (ICMLA), pp 175–180
Reisinger J, Waters A, Silverthorn B, Mooney RJ (2010) Spherical topic models. In: Proceedings of the international conference on machine learning (ICML), pp 903–910
Salah A, Rogovschi N, Nadif M (2016a) A dynamic collaborative filtering system via a weighted clustering approach. Neurocomputing 175:206–215
Salah A, Rogovschi N, Nadif M (2016b) Model-based co-clustering for high dimensional sparse data. In: Proceedings of the 19th international conference on artificial intelligence and statistics (AISTATS), pp 866–874
Salah A, Rogovschi N, Nadif M (2016c) Stochastic co-clustering for document-term data. In: Proceedings of the SIAM SDM international conference on data mining, pp 306–314
Salakhutdinov R, Mnih A (2008) Probabilistic matrix factorization. Adv Neural Inf Process Syst (NIPS) 20:1257–1264
Sarwar B, Karypis G, Konstan J, Riedl J (2000) Application of dimensionality reduction in recommender system-a case study. Technical Report, DTIC Document
Sarwar B, Karypis G, Konstan J, Riedl J (2001) Item-based collaborative filtering recommendation algorithms. In: Proceedings of the international conference on world wide web (WWW), ACM, pp 285–295
Sra S (2012) A short note on parameter approximation for von Mises–Fisher distributions: and a fast implementation of \(I_s(x)\). Comput Stat 27(1):177–190
Tanabe A, Fukumizu K, Oba S, Takenouchi T, Ishii S (2007) Parameter estimation for von Mises–Fisher distributions. Comput Stat 22(1):145–157
Tang J, Gao H, Liu H (2012) mTrust: discerning multi-faceted trust in a connected world. In: Proceedings of the ACM WSDM international conference on web search and data mining, pp 93–102
Ungar LH, Foster DP (1998) Clustering methods for collaborative filtering. AAAI workshop on recommendation systems, vol 1, pp 114–129
Yang B, Lei Y, Liu D, Liu J (2013) Social collaborative filtering by trust. In: Proceedings of the international joint conference on artificial intelligence (AAAI), pp 2747–2753
Zhu X, Lafferty J (2005) Harmonic mixtures: combining mixture models and graph-based methods for inductive and scalable semi-supervised learning. In: Proceedings of the international conference on machine learning (ICML), pp 1052–1059
Metadata
Title
Social regularized von Mises–Fisher mixture model for item recommendation
Authors
Aghiles Salah
Mohamed Nadif
Publication date
24-03-2017
Publisher
Springer US
Published in
Data Mining and Knowledge Discovery / Issue 5/2017
Print ISSN: 1384-5810
Electronic ISSN: 1573-756X
DOI
https://doi.org/10.1007/s10618-017-0499-9
