ABSTRACT
As a tremendous number of mobile applications (apps) are readily available, users have difficulty in identifying apps that are relevant to their interests. Recommender systems that depend on previous user ratings (i.e., collaborative filtering, or CF) can address this problem for apps that have sufficient ratings from past users. But for apps that are newly released, CF does not have any user ratings to base recommendations on, which leads to the cold-start problem.
In this paper, we describe a method that accounts for nascent information culled from Twitter to provide relevant recommendation in such cold-start situations. We use Twitter handles to access an app's Twitter account and extract the IDs of their Twitter-followers. We create pseudo-documents that contain the IDs of Twitter users interested in an app and then apply latent Dirichlet allocation to generate latent groups. At test time, a target user seeking recommendations is mapped to these latent groups. By using the transitive relationship of latent groups to apps, we estimate the probability of the user liking the app. We show that by incorporating information from Twitter, our approach overcomes the difficulty of cold-start app recommendation and significantly outperforms other state-of-the-art recommendation techniques by up to 33%.
- C. C. Aggarwal, J. L. Wolf, K.-L. Wu, and P. S. Yu. Horting Hatches an Egg: A New Graph-theoretic Approach to Collaborative Filtering. In Proc. of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'99), pages 201--212, 1999. Google ScholarDigital Library
- R. Ayalew. Consumer Behaviour in Apple's App Store. PhD thesis, Uppsala University, 2011.Google Scholar
- E. Bakshy, J. M. Hofman, W. A. Mason, and D. J. Watts. Everyone's an Influencer: Quantifying Influence on Twitter. In Proc. of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11), pages 65--74, 2011. Google ScholarDigital Library
- R. Bell, Y. Koren, and C. Volinsky. Modeling Relationships at Multiple Scales to Improve Accuracy of Large Recommender Systems. In Proc. of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07), pages 95--104, 2007. Google ScholarDigital Library
- D. M. Blei and J. D. Lafferty. Topic Models. Text mining: Classification, Clustering, and Applications, 10:71, 2009.Google ScholarCross Ref
- D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3:993--1022, 2003. Google ScholarDigital Library
- J. Boyd-Graber, J. Chang, S. Gerrish, C. Wang, and D. M. Blei. Reading Tea Leaves: How Humans Interpret Topic Models. In Proc. of the 23rd Annual Conference on Neural Information Processing Systems (NIPS'09), 2009.Google Scholar
- J. S. Breese, D. Heckerman, and C. Kadie. Empirical Analysis of Predictive Algorithms for Collaborative Filtering. In Proc. of the 14th Conference on Uncertainty in Artificial Intelligence (UAI'98), pages 43--52, 1998. Google ScholarDigital Library
- M. Cha, H. Haddadi, F. Benevenuto, and K. P. Gummadi. Measuring User Influence in Twitter: The Million Follower Fallacy. In Proc. of the 4th International AAAI Conference on Weblogs and Social Media (ICWSM'10), pages 10--17, 2010.Google Scholar
- J. H. Friedman. Greedy Function Approximation: A Gradient Boosting Machine. The Annals of Statistics, 29:1189--1232, 2001.Google ScholarCross Ref
- J. L. Herlocker, J. A. Konstan, A. Borchers, and J. Riedl. An Algorithmic Framework for Performing Collaborative Filtering. In Proc. of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'99), pages 230--237, 1999. Google ScholarDigital Library
- T. Hofmann. Latent Semantic Models for Collaborative Filtering. ACM Transactions on Information Systems (TOIS), 22:89--115, 2004. Google ScholarDigital Library
- T. Hofmann and J. Puzicha. Latent Class Models for Collaborative Filtering. In Proc. of the 16th International Joint Conference on Artificial Intelligence (IJCAI'99), pages 688--693, 1999. Google ScholarDigital Library
- M. Jamali and M. Ester. A Matrix Factorization Technique with Trust Propagation for Recommendation in Social Networks. In Proc. of the 4th ACM Conference on Recommender Systems (RecSys'10), pages 135--142, 2010. Google ScholarDigital Library
- D. Kim and B.-J. Yum. Collaborative Filtering Based on Iterative Principal Component Analysis. Expert Systems with Applications, 28(4):823--830, 2005. Google ScholarDigital Library
- Y. Koren. Factorization Meets the Neighborhood: A Multifaceted Collaborative Filtering Model. In Proc. of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), pages 426--434, 2008. Google ScholarDigital Library
- Y. Koren and R. Bell. Advances in Collaborative Filtering. Recommender Systems Handbook, pages 145--186, 2011.Google ScholarCross Ref
- H. Kwak, C. Lee, H. Park, and S. Moon. What is Twitter, a Social Network or a News Media? In Proc. of the 19th International Conference on World Wide Web (WWW'10), pages 591--600, 2010. Google ScholarDigital Library
- N. N. Liu, X. Meng, C. Liu, and Q. Yang. Wisdom of the Better Few: Cold-Start Recommendation via Representative based Rating Elicitation. In Proc. of the 5th ACM Conference on Recommender Systems (RecSys'11), pages 37--44, 2011. Google ScholarDigital Library
- H. Misra, O. Cappé, and F. Yvon. Using LDA to Detect Semantically Incoherent Documents. In Proc. of the 12th Conference on Computational Natural Language Learning (CoNLL'08), pages 41--48, 2008. Google ScholarDigital Library
- Y. Moshfeghi, B. Piwowarski, and J. M. Jose. Handling Data Sparsity in Collaborative Filtering using Emotion and Semantic-based Features. In Proc. of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'11), pages 625--634, 2011. Google ScholarDigital Library
- L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report SIDL-WP-1999-0120, Stanford Digital Library Technologies Project, 1998.Google Scholar
- S.-T. Park and W. Chu. Pairwise Preference Regression for Cold-Start Recommendation. In Proc. of the 3rd ACM Conference on Recommender Systems (RecSys'09), pages 21--28, 2009. Google ScholarDigital Library
- D. Ramage, S. Dumais, and D. Liebling. Characterizing Microblogs with Topic Models. In Proc. of the 4th International AAAI Conference on Weblogs and Social Media (ICWSM'10), pages 130--137, 2010.Google Scholar
- R. Recuero, R. Araujo, and G. Zago. How Does Social Capital Affect Retweets. In Proc. of the 5th International AAAI Conference on Weblogs and Social Media (ICWSM'11), pages 305--312, 2011.Google Scholar
- A. Said, E. W. De Luca, and S. Albayrak. How Social Relationships Affect User Similarities. In Proc. of the 2010 Workshop on Social Recommender Systems, pages 1--4, 2010.Google Scholar
- R. Salakhutdinov, A. Mnih, and G. Hinton. Restricted Boltzmann Machines for Collaborative Filtering. In Proc. of the 24th International Conference on Machine Learning (ICML'07), pages 791--798, 2007. Google ScholarDigital Library
- B. Sarwar, G. Karypis, J. Konstan, and J. Riedl. Item-based Collaborative Filtering Recommendation Algorithms. In Proc. of the 10th International Conference on World Wide Web (WWW'01), pages 285--295, 2001. Google ScholarDigital Library
- A. I. Schein, A. Popescul, L. H. Ungar, and D. M. Pennock. Methods and Metrics for Cold-Start Recommendations. In Proc. of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'02), pages 253--260, 2002. Google ScholarDigital Library
- G. Takács, I. Pilászy, B. Németh, and D. Tikk. Matrix Factorization and Neighbor based Algorithms for the Netflix Prize Problem. In Proc. of the 2nd ACM Conference on Recommender Systems (RecSys'08), pages 267--274, 2008. Google ScholarDigital Library
- C. Wang and D. M. Blei. Collaborative Topic Modeling for Recommending Scientific Articles. In Proc. of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), pages 448--456, 2011. Google ScholarDigital Library
- J. Weng, E.-P. Lim, J. Jiang, and Q. He. TwitterRank: Finding Topic-sensitive Influential Twitterers. In Proc. of the 3rd ACM International Conference on Web Search and Data Mining (WSDM'10), pages 261--270, 2010. Google ScholarDigital Library
- S. Wu, J. M. Hofman, W. A. Mason, and D. J. Watts. Who Says What to Whom on Twitter. In Proc. of the 20th International Conference on World Wide Web (WWW'11), pages 705--714, 2011. Google ScholarDigital Library
- G.-R. Xue, C. Lin, Q. Yang, W. Xi, H.-J. Zeng, Y. Yu, and Z. Chen. Scalable Collaborative Filtering using Cluster-based Smoothing. In Proc. of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'05), pages 114--121, 2005. Google ScholarDigital Library
- K. Zhou, S.-H. Yang, and H. Zha. Functional Matrix Factorizations for Cold-Start Recommendation. In Proc. of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'11), pages 315--324, 2011. Google ScholarDigital Library
Index Terms
- Addressing cold-start in app recommendation: latent user models constructed from twitter followers
Recommendations
Addressing cold-start problem in recommendation systems
ICUIMC '08: Proceedings of the 2nd international conference on Ubiquitous information management and communicationRecommender systems for automatically suggested items of interest to users have become increasingly essential in fields where mass personalization is highly valued. The popular core techniques of such systems are collaborative filtering, content-based ...
Naïve filterbots for robust cold-start recommendations
KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data miningThe goal of a recommender system is to suggest items of interest to a user based on historical behavior of a community of users. Given detailed enough history, item-based collaborative filtering (CF) often performs as well or better than almost any ...
An empirical study of a cross-level association rule mining approach to cold-start recommendations
We propose a novel hybrid recommendation approach to address the well-known cold-start problem in Collaborative Filtering (CF). Our approach makes use of Cross-Level Association RulEs (CLARE) to integrate content information about domain items into ...
Comments