Abstract
People's personal social networks are big and cluttered, and currently there is no good way to automatically organize them. Social networking sites allow users to manually categorize their friends into social circles (e.g., “circles” on Google+, and “lists” on Facebook and Twitter). However, circles are laborious to construct and must be manually updated whenever a user's network grows. In this article, we study the novel task of automatically identifying users' social circles. We pose this task as a multimembership node clustering problem on a user's ego network, a network of connections between her friends. We develop a model for detecting circles that combines network structure as well as user profile information. For each circle, we learn its members and the circle-specific user profile similarity metric. Modeling node membership to multiple circles allows us to detect overlapping as well as hierarchically nested circles. Experiments show that our model accurately identifies circles on a diverse set of data from Facebook, Google+, and Twitter, for all of which we obtain hand-labeled ground truth.
- D. Agarwal, B.-C. Chen, P. Elango, N. Motgi, S.-T. Park, R. Ramakrishnan, S. Roy, and J. Zachariah. 2008. Online models for content optimization. In Neural Information Processing Systems.Google Scholar
- Y.-Y. Ahn, J. Bagrow, and S. Lehmann. 2010. Link communities reveal multiscale complexity in networks. Nature.Google Scholar
- E. Airoldi, D. Blei, S. Fienberg, and E. Xing. 2008. Mixed membership stochastic blockmodels. Journal of Machine Learning Research. Google ScholarDigital Library
- R. Andersen and K. Lang. 2006. Communities from seed sets. In WWW. Google ScholarDigital Library
- R. Balasubramanyan and W. Cohen. 2011. Block-LDA: Jointly modeling entity-annotated text and entity-entity links. In SIAM International Conference on Data Mining.Google Scholar
- E. Boros and P. Hammer. 2002. Pseudo-boolean optimization. Discrete Applied Mathematics. Google ScholarDigital Library
- J. Chang and D. Blei. 2009. Relational topic models for document networks. In International Conference on Artificial Intelligence and Statistics.Google Scholar
- J. Chang, J. Boyd-Graber, and D. Blei. 2009. Connections between the lines: Augmenting social networks with text. In Knowledge Discovery and Data Mining. Google ScholarDigital Library
- H. Chen and R. Karger. 2006. Less is more: Probabilistic models for retrieving fewer relevant documents. In Special Interest Group on Information Retrieval. Google ScholarDigital Library
- Y. Chen and C. Lin. 2006. Combining SVMs with Various Feature Selection Strategies. Springer.Google Scholar
- K. El-Arini, G. Veda, D. Shahaf, and C. Guestrin. 2009. Turning down the noise in the blogosphere. In Knowledge Discovery and Data Mining. Google ScholarDigital Library
- Scott L. Feld. 1981. The focused organization of social ties. American Journal of Sociology.Google Scholar
- Mario Frank, Andreas P. Streich, David Basin, and Joachim M. Buhmann. 2012. Multi-assignment clustering for Boolean data. Journal of Machine Learning Research. Google ScholarDigital Library
- Steve Gregory. 2010a. Finding overlapping communities in networks by label propagation. New Journal of Physics.Google Scholar
- Steve Gregory. 2010b. Fuzzy overlapping communities in networks. CoRR abs/1010.1523.Google Scholar
- P. Hammer, P. Hansen, and B. Simeone. 1984. Roof duality, complementation and persistency in quadratic 0-1 optimization. Mathematical Programming.Google Scholar
- M. Handcock, A. Raftery, and J. Tantrum. 2007a. Model-based clustering for social networks. Journal of the Royal Statistical Society Series A.Google Scholar
- Mark S. Handcock, Adrian E. Raftery, and Jeremy M. Tantrum. 2007b. Model-based clustering for social networks. Journal of the Royal Statistical Society.Google Scholar
- M. B. Hastings. 2006. Community detection as an inference problem. Physical Review E.Google Scholar
- David Haussler. 1999. Convolution Kernels on Discrete Structures. Technical Report. University of California at Santa Cruz.Google Scholar
- Peter D. Hoff, Adrian E. Raftery, and Mark S. Handcock. 2002. Latent space approaches to social network analysis. Journal of the American Statistical Association.Google Scholar
- S. Johnson. 1967. Hierarchical clustering schemes. Psychometrika.Google Scholar
- D. Kim, Y. Jo, L.-C. Moon, and A. Oh. 2010. Analysis of Twitter lists as a potential source for discovering latent characteristics of users. In CHI.Google Scholar
- P. Kohli and P. Torr. 2005. Efficiently solving dynamic Markov random fields using graph cuts. In International Conference on Computer Vision. Google ScholarDigital Library
- Vladimir Kolmogorov and Carsten Rother. 2007. Minimizing nonsubmodular functions with graph cuts: A review. IEEE Transactions on Pattern Analysis and Machine Intelligence. Google ScholarDigital Library
- P. Krivitsky, M. Handcock, A. Raftery, and P. Hoff. 2009. Representing degree distributions, clustering, and homophily in social networks with latent cluster random effects models. Social Networks.Google Scholar
- Andrea Lancichinetti and Santo Fortunato. 2009a. Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Physical Review E.Google Scholar
- A. Lancichinetti and S. Fortunato. 2009b. Community detection algorithms: A comparative analysis. arXiv:0908.1062.Google Scholar
- Andrea Lancichinetti, Santo Fortunato, and Janos Kertesz. 2009. Detecting the overlapping and hierarchical community structure in complex networks. New Journal of Physics.Google Scholar
- P. Lazarsfeld and R. Merton. 1954. Friendship as a social process: A substantive and methodological analysis. In Freedom and Control in Modern Society.Google Scholar
- J. Leskovec, K. Lang, and M. Mahoney. 2010. Empirical comparison of algorithms for network community detection. In WWW. Google ScholarDigital Library
- Y. Liu, A. Niculescu-Mizil, and W. Gryc. 2009. Topic-link LDA: Joint models of topic and author community. In International Conference on Machine Learning. Google ScholarDigital Library
- D. MacKay. 2003. Information Theory, Inference and Learning Algorithms. Cambridge University Press. Google ScholarDigital Library
- J. McAuley and J. Leskovec. 2012. Learning to discover social circles in ego networks. In Neural Information Processing Systems.Google Scholar
- M. McPherson. 1983. An ecology of affiliation. American Sociological Review.Google Scholar
- Miller McPherson, Lynn Smith-Lovin, and James M. Cook. 2001. Birds of a feather: Homophily in social networks. Annual Review of Sociology.Google Scholar
- Aditya Menon and Charles Elkan. 2010. A log-linear model with latent features for dyadic prediction. In International Conference on Data Mining. Google ScholarDigital Library
- A. Menon and C. Elkan. 2011. Link prediction via matrix factorization. In European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. Google ScholarDigital Library
- A. Mislove, B. Viswanath, K. Gummadi, and P. Druschel. 2010. You are who you know: Inferring user profiles in online social networks. In International Conference on Web Search and Data Mining. Google ScholarDigital Library
- P. Nasirifard and C. Hayes. 2011. Tadvise: A Twitter assistant based on Twitter lists. In SocInfo. Google ScholarDigital Library
- M. Newman. 2006. Modularity and community structure in networks. In Proceedings of the National Academy of Sciences.Google ScholarCross Ref
- M. E. J. Newman. 2003. Fast algorithm for detecting community structure in networks. Physical Review E.Google Scholar
- M. E. J. Newman. 2004. Detecting community structure in networks. European Physical Journal B.Google Scholar
- M. E. J. Newman and G. T. Barkema. 1999. Monte Carlo Methods in Statistical Physics. Oxford University Press.Google Scholar
- Jorge Nocedal. 1980. Updating quasi-Newton matrices with limited storage. Mathematics of Computation.Google Scholar
- G. Palla, I. Derenyi, I. Farkas, and T. Vicsek. 2005. Uncovering the overlapping community structure of complex networks in nature and society. Nature.Google Scholar
- M. A. Porter, J.-P. Onnela, and P. J. Mucha. 2009. Communities in networks.Google Scholar
- E. Ravasz and A.-L. Barabási. 2003. Hierarchical organization in complex networks. Physical Review E.Google Scholar
- C. Rother, V. Kolmogorov, V. Lempitsky, and M. Szummer. 2007. Optimizing binary MRFs via extended roof duality. In Computer Vision and Pattern Recognition.Google Scholar
- S. E. Schaeffer. 2007. Graph clustering. Computer Science Review. Google ScholarDigital Library
- Georg Simmel. 1964. Conflict and the Web of Group Affiliations. Simon and Schuster.Google Scholar
- J. Ugander, B. Karrer, L. Backstrom, and C. Marlow. 2011. The Anatomy of the Facebook Social Graph. Preprint.Google Scholar
- S. V. N. Vishwanathan and Alexander J. Smola. 2002. Fast kernels for string and tree matching. In Neural Information Processing Systems.Google Scholar
- C. Volinsky and A. Raftery. 2000. Bayesian information criterion for censored survival models. Biometrics.Google Scholar
- D. Vu, A. Asuncion, D. Hunter, and P. Smyth. 2011. Dynamic egocentric models for citation networks. In International Conference on Machine Learning.Google Scholar
- S. Wu, J. Hofman, W. Mason, and D. Watts. 2011. Who says what to whom on Twitter. In WWW. Google ScholarDigital Library
- J. Yang and J. Leskovec. 2012. Community-affiliation graph model for overlapping community detection. In International Conference on Data Mining. Google ScholarDigital Library
- T. Yoshida. 2010. Toward finding hidden communities based on user profiles. In ICDM Workshops. Google ScholarDigital Library
- J. Zhao. 2011. Examining the evolution of networks based on lists in Twitter. In International Conference on Internet Multimedia System Architectures and Applications.Google ScholarCross Ref
Index Terms
- Discovering social circles in ego networks
Recommendations
Learning to discover social circles in ego networks
NIPS'12: Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1Our personal social networks are big and cluttered, and currently there is no good way to organize them. Social networking sites allow users to manually categorize their friends into social circles (e.g. 'circles' on Google+, and 'lists' on Facebook and ...
Ego-net digger: a new way to study ego networks in online social networks
HotSocial '12: Proceedings of the First ACM International Workshop on Hot Topics on Interdisciplinary Social Networks ResearchThe vast proliferation of Online Social Networks (OSN) is generating many new ways to interact and create social relationships with others. While substantial results have been obtained in anthropology literature describing the properties of human social ...
Common Features against Similarity for Discovering Social Circles in Networks
ENIC '15: Proceedings of the 2015 Second European Network Intelligence ConferenceThis paper presents a new method for automatically identifying circles of friends in ego networks. To detect these circles the users' network is investigated in two different ways: (1) based on features that the users own, and (2) based on similarity ...
Comments