ABSTRACT
In this paper, we introduce a supervised machine learning framework for the link prediction problem. The social network we conducted our empirical evaluation on originates from the restaurant review portal, yelp.com. The proposed framework not only uses the structure of the social network to predict non-existing edges in it, but also makes use of further graphs that were constructed based on implicit information provided in the dataset. The implicit information we relied on includes the language use of the members of the social network and their ratings with respect the businesses they reviewed. Here, we also investigate the possibility of building supervised learning models to predict social links without relying on features derived from the structure of the social network itself, but based on such implicit information alone. Our empirical results not only revealed that the features derived from different sources of implicit information can be useful on their own, but also that incorporating them in a unified framework has the potential to improve classification results, as the different sources of implicit information can provide independent and useful views about the connectedness of users.
- N. Benchettara, R. Kanawati, and C. Rouveirol. Supervised machine learning applied to link prediction in bipartite social networks. In Proceedings of the 2010 International Conference on Advances in Social Networks Analysis and Mining, ASONAM '10, pages 326--330, Washington, DC, USA, 2010. IEEE Computer Society. Google ScholarDigital Library
- K. Buza and I. Galambos. An application of link prediction in bipartite graphs: Personalized blog feedback prediction. In 8th Japanese-Hungarian Symposium on Discrete Mathematics and Its Applications June 4--7, 2013, Veszprém, Hungary, page 89.Google Scholar
- M. Hasan and M. Zaki. A survey of link prediction in social networks. In C. C. Aggarwal, editor, Social Network Data Analytics, pages 243--275. Springer US, 2011.Google ScholarCross Ref
- M. A. Hasan, V. Chaoji, S. Salem, and M. Zaki. Link prediction using supervised learning. In In Proc. of SDM 06 workshop on Link Analysis, Counterterrorism and Security, 2006.Google Scholar
- Y. Koren and R. Bell. Advances in collaborative filtering. In Recommender systems handbook, pages 145--186. Springer, 2011.Google ScholarCross Ref
- Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. Computer, (8):30--37, 2009. Google ScholarDigital Library
- V. Krebs. Mapping networks of terrorist cells. CONNECTIONS, 24(3):43--52, 2002.Google Scholar
- D. Liben-Nowell and J. Kleinberg. The link prediction problem for social networks. In Proceedings of the Twelfth International Conference on Information and Knowledge Management, CIKM '03, pages 556--559, New York, NY, USA, 2003. ACM. Google ScholarDigital Library
- C.-J. Lin. Projected gradient methods for nonnegative matrix factorization. Neural Comput., 19(10):2756--2779, Oct. 2007. Google ScholarDigital Library
- C. D. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. J. Bethard, and D. McClosky. The Stanford CoreNLP natural language processing toolkit. In Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 55--60, 2014.Google ScholarCross Ref
- A. K. McCallum. MALLET: A machine learning for language toolkit. http://mallet.cs.umass.edu, 2002.Google Scholar
- A. K. Menon and C. Elkan. Link prediction via matrix factorization. In Machine Learning and Knowledge Discovery in Databases, pages 437--452. Springer, 2011. Google ScholarDigital Library
- A. Mnih and R. Salakhutdinov. Probabilistic matrix factorization. In Advances in neural information processing systems, pages 1257--1264, 2007.Google ScholarDigital Library
- L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical Report 1999--66, Stanford InfoLab, November 1999.Google Scholar
- R. Schifanella, A. Barrat, C. Cattuto, B. Markines, and F. Menczer. Folks in folksonomies: Social link prediction from shared metadata. In Proceedings of the Third ACM International Conference on Web Search and Data Mining, WSDM '10, pages 271--280, New York, NY, USA, 2010. ACM. Google ScholarDigital Library
- Y. Sun, R. Barber, M. Gupta, C. C. Aggarwal, and J. Han. Co-author relationship prediction in heterogeneous bibliographic networks. In Proceedings of the 2011 International Conference on Advances in Social Networks Analysis and Mining, ASONAM '11, pages 121--128, Washington, DC, USA, 2011. IEEE Computer Society. Google ScholarDigital Library
- H. Tong, C. Faloutsos, and J.-Y. Pan. Fast random walk with restart and its applications. In Proceedings of the Sixth International Conference on Data Mining, ICDM '06, pages 613--622, Washington, DC, USA, 2006. IEEE Computer Society. Google ScholarDigital Library
Index Terms
- Supervised Prediction of Social Network Links Using Implicit Sources of Information
Recommendations
Extracting Implicit Social Relation for Social Recommendation Techniques in User Rating Prediction
WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web CompanionRecommendation plays an increasingly important role in our daily lives. Recommender systems automatically suggest items to users that might be interesting for them. Recent studies illustrate that incorporating social trust in Matrix Factorization ...
Link Prediction in Multi-modal Social Networks
Machine Learning and Knowledge Discovery in DatabasesAbstractOnline social networks like Facebook recommend new friends to users based on an explicit social network that users build by adding each other as friends. The majority of earlier work in link prediction infers new interactions between users by ...
Adaptive Implicit Friends Identification over Heterogeneous Network for Social Recommendation
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge ManagementThe explicitly observed social relations from online social platforms have been widely incorporated into recommender systems to mitigate the data sparsity issue. However, the direct usage of explicit social relations may lead to an inferior performance ...
Comments