skip to main content
research-article
Public Access

User Modeling on Demographic Attributes in Big Mobile Social Networks

Published:11 July 2017Publication History
Skip Abstract Section

Abstract

Users with demographic profiles in social networks offer the potential to understand the social principles that underpin our highly connected world, from individuals, to groups, to societies. In this article, we harness the power of network and data sciences to model the interplay between user demographics and social behavior and further study to what extent users’ demographic profiles can be inferred from their mobile communication patterns. By modeling over 7 million users and 1 billion mobile communication records, we find that during the active dating period (i.e., 18--35 years old), users are active in broadening social connections with males and females alike, while after reaching 35 years of age people tend to keep small, closed, and same-gender social circles. Further, we formalize the demographic prediction problem of inferring users’ gender and age simultaneously. We propose a factor graph-based WhoAmI method to address the problem by leveraging not only the correlations between network features and users’ gender/age, but also the interrelations between gender and age. In addition, we identify a new problem—coupled network demographic prediction across multiple mobile operators—and present a coupled variant of the WhoAmI method to address its unique challenges. Our extensive experiments demonstrate the effectiveness, scalability, and applicability of the WhoAmI methods. Finally, our study finds a greater than 80% potential predictability for inferring users’ gender from phone call behavior and 73% for users’ age from text messaging interactions.

References

  1. Talayeh Aledavood, Eduardo López, Sam G. B. Roberts, Felix Reed-Tsochas, Esteban Moro Egido, Robin I. M. Dunbar, and Jari Saramäki. 2015. Channel-specific daily patterns in mobile phone communication. CoRR, abs/1507.04596 (2015).Google ScholarGoogle Scholar
  2. Bin Bi, Milad Shokouhi, Michal Kosinski, and Thore Graepel. 2013. Inferring the demographics of search users: Social data meets search queries. In WWW’13. 131--140. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Vincent D. Blondel, Adeline Decuyper, and Gautier Krings. 2015. A survey of results on mobile phone datasets analysis. arXiv:1502.03406 (2015).Google ScholarGoogle Scholar
  4. F. Calabrese, L. Ferrari, and V. Blondel. 2014. Urban sensing using mobile phones network data: A survey of research. ACM Comput. Surv. (2014). Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Deepayan Chakrabarti, Stanislav Funiak, Jonathan Chang, and Sofus A. Macskassy. 2014. Joint inference of multiple label types in large networks. In ICML’14. 874--882. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Benjamin Cornwell. 2011. Age trends in daily social contact patterns. Res. Aging 33, 5 (2011), 598--631.Google ScholarGoogle ScholarCross RefCross Ref
  7. Yuxiao Dong, Fabio Pinelli, Yiannis Gkoufas, Zubair Nabi, Francesco Calabrese, and Nitesh V. Chawla. 2015a. Inferring unusual crowd events from mobile phone call detail records. In Machine Learning and Knowledge Discovery in Databases. Springer International Publishing, 474--492.Google ScholarGoogle Scholar
  8. Yuxiao Dong, Jie Tang, Nitesh V. Chawla, Tiancheng Lou, Yang Yang, and Bai Wang. 2015b. Inferring social status and rich club effects in enterprise communication networks. PLoS ONE 10 (03 2015), e0119446.Google ScholarGoogle Scholar
  9. Yuxiao Dong, Jie Tang, Tiancheng Lou, Bin Wu, and Nitesh V. Chawla. 2013. How long will she call me? Distribution, social theory and duration prediction. In Machine Learning and Knowledge Discovery in Databases. Springer, 16--31.Google ScholarGoogle Scholar
  10. Yuxiao Dong, Yang Yang, Jie Tang, Yang Yang, and Nitesh V. Chawla. 2014. Inferring user demographics and social strategies in mobile social networks. In KDD’14. ACM, 15--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Yuxiao Dong, Jing Zhang, Jie Tang, Nitesh V. Chawla, and Bai Wang. 2015. CoupledLP: Link prediction in coupled networks. In KDD’15. ACM, 199--208. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Nan Du, Christos Faloutsos, Bai Wang, and Leman Akoglu. 2009. Large human communication networks: Patterns and a utility-driven generator. In KDD’09. ACM, 269--278. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Nathan Eagle, Alex (Sandy) Pentland, and David Lazer. 2009. Inferring social network structure using mobile phone data. Proc. Natl. Acad. Sci. U.S.A. 106, 36 (2009).Google ScholarGoogle Scholar
  14. David Easley and Jon Kleinberg. 2010. Networks, Crowds, and Markets: Reasoning about a Highly Connected World. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Mária Ercsey-Ravasz, Ryan N. Lichtenwalter, Nitesh V. Chawla, and Zoltán Toroczkai. 2012. Range-limited centrality measures in complex networks. Phys. Rev. E 85, 6 (Jun 2012), 066103.Google ScholarGoogle ScholarCross RefCross Ref
  16. Linton C. Freeman. 1982. Centered graphs and the structure of ego networks. Math. Soc. Sci. 3, 3 (1982), 291--304.Google ScholarGoogle ScholarCross RefCross Ref
  17. Huiji Gao, Jiliang Tang, Xia Hu, and Huan Liu. 2013. Modeling temporal effects of human mobile behavior on location-based social networks. In CIKM’13. 1673--1678. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Andrew Gelman and Jennifer Hill. 2006. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge University Press.Google ScholarGoogle Scholar
  19. Marta C. Gonzalez, Cesar A. Hidalgo, and Albert-Laszlo Barabasi. 2008. Understanding individual human mobility patterns. Nature 453, 7196 (2008), 779--782.Google ScholarGoogle Scholar
  20. Mark Granovetter. 1973. The strength of weak ties. Am. J. Sociology 78, 6 (1973), 1360--1380.Google ScholarGoogle ScholarCross RefCross Ref
  21. Mark Granovetter. 1985. Economic action and social structure: The problem of embeddedness. Amer. J. Sociol. (1985).Google ScholarGoogle Scholar
  22. Susan C. Herring. 2003. Gender and power in on-line communication. In Handbook of Language and Gender. Wiley-Blackwell, 202.Google ScholarGoogle Scholar
  23. Cesar A. Hidalgo and C. Rodriguez-Sickert. 2008. The dynamics of a mobile phone network. Physica A: Stat. Mech. Appl. 387, 12 (2008), 3017--3024.Google ScholarGoogle ScholarCross RefCross Ref
  24. Jian Hu, Hua-Jun Zeng, Hua Li, Cheng Niu, and Zheng Chen. 2007. Demographic prediction based on user’s browsing behavior. In WWW’07. 151--160. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Xia Hu and Huan Liu. 2012. Social status and role analysis of Palin’s email network. In WWW’12 Companion. ACM, 531--532. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Lauri Kovanen, Kimmo Kaski, János Kertész, and Jari Saramäki. 2013. Temporal motifs reveal homophily, gender-specific patterns, and group talk in call sequences. PNAS 110, 45 (2013), 18070--18075.Google ScholarGoogle ScholarCross RefCross Ref
  27. David Krackhardt. 1992. The Strength of Strong Ties. Cambridge, Harvard Business School Press, Hershey, USA.Google ScholarGoogle Scholar
  28. Frank R. Kschischang, Brendan J. Frey, and Hans A. Loeliger. 2001. Factor graphs and the sum-product algorithm. IEEE Trans. Internet Technol. 47 (2001), 498--519. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. P. F. Lazarsfeld and R. K. Merton. 1954. Friendship as a social process: A substantive and methodological analysis. Freedom and Control in Modern Society,Van Nostrand, New York (1954), 8--66.Google ScholarGoogle Scholar
  30. Jure Leskovec and Eric Horvitz. 2008. Planetary-scale views on a large instant-messaging network. In WWW’08. ACM, 915--924. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Quim Llimona, Jordi Luque, Xavier Anguera, Zoraida Hidalgo, Souneil Park, and Nuria Oliver. 2015. Effect of gender and call duration on customer satisfaction in call center big data. In INTERSPEECH’15.Google ScholarGoogle Scholar
  32. Tiancheng Lou, Jie Tang, John Hopcroft, Zhanpeng Fang, and Xiaowen Ding. 2013. Learning to predict reciprocity and triadic closure in social networks. ACM Trans. Knowl. Discov. Data 7, 2 (2013), 5:1--5:25. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Peter V. Marsden. 1987. Core discussion networks of americans. Amer. Sociol. Rev. (1987), 122--131.Google ScholarGoogle Scholar
  34. Manfred Max-Neef, Antonio Elizalde, and Martin Hopenhayn. 1992. Development and human needs. Real-life Economics: Understanding Wealth Creation (1992), 197--213.Google ScholarGoogle Scholar
  35. M. Mead. 1970. Culture and Commitment: A Study of the Generation Gap. Natural History Press.Google ScholarGoogle Scholar
  36. L. Meng, Y. Hulovatyy, A. Striegel, and T. Milenković. 2016. On the interplay between individuals’ evolving interaction patterns and traits in dynamic multiplex social networks. IEEE Trans. Netw. Sci. Eng. 3, 1 (2016), 32--43.Google ScholarGoogle ScholarCross RefCross Ref
  37. Matthew Michelson and Sofus A. Macskassy. 2011. What blogs tell us about websites: A demographics study. In WSDM’11. ACM, 365--374. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Giovanna Miritello, Rubén Lara, Manuel Cebrian, and Esteban Moro. 2013. Limited communication capacity unveils strategies for human interaction. Sci. Rep. 3 (2013).Google ScholarGoogle Scholar
  39. Alan Mislove, Sune Lehmann, Yong-Yeol Ahn, Jukka-Pekka Onnela, and J. Niels Rosenquist. 2011. Understanding the demographics of twitter users. In ICWSM’11.Google ScholarGoogle Scholar
  40. Kaixiang Mo, Ben Tan, Erheng Zhong, and Qiang Yang. 2012. Your phone understands you. In Nokia MDC’12.Google ScholarGoogle Scholar
  41. Kevin P. Murphy, Yair Weiss, and Michael I. Jordan. 1999. Loopy belief propagation for approximate inference: An empirical study. In UAI’99. 467--475. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. J. P. Onnela, J. Saramäki, J. Hyvönen, G. Szabó, D. Lazer, K. Kaski, J. Kertész, and A.-L. Barabási. 2007. Structure and tie strengths in mobile communication networks. Proc. Natl. Acad. Sci. U.S.A. (2007).Google ScholarGoogle ScholarCross RefCross Ref
  43. Vasyl Palchykov, Kimmo Kaski, János Kertész, Albert-László Barabási, and Robin I. M. Dunbar. 2012. Sex differences in intimate relationships. Sci. Rep. 2:370 (2012).Google ScholarGoogle Scholar
  44. Stephen W. Raudenbush and Anthony S. Bryk. 2002. Hierarchical Linear Models: Applications and Data Analysis Methods. Vol. 1. Sage.Google ScholarGoogle Scholar
  45. Jari Saramaki and Esteban Moro. 2015. From seconds to months: multi-scale dynamics of mobile telephone calls. arXiv:1504.01479 (2015).Google ScholarGoogle Scholar
  46. Mukund Seshadri, Sridhar Machiraju, Ashwin Sridharan, Jean Bolot, Christos Faloutsos, and Jure Leskovec. 2008. Mobile call graphs: Beyond power-law and lognormal distributions. In KDD’08. ACM, 596--604. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Xiaolin Shi, Lada A. Adamic, and Martin J. Strauss. 2007. Networks of strong ties. Physica A: Stat. Mech. Appl. 378, 1 (2007), 33--47.Google ScholarGoogle ScholarCross RefCross Ref
  48. Bai-En Shie, S. Yu Philip, and Vincent S. Tseng. 2013. Mining interesting user behavior patterns in mobile commerce environments. Appl. Intell. 38, 3 (2013), 418--435. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Zbigniew Smoreda and Christian Licoppe. 2000. Gender-specific use of the domestic telephone. Soc. Psych. Quart. 63, 3 (2000), 238--252.Google ScholarGoogle ScholarCross RefCross Ref
  50. Richard C. Sprinthall. 2011. Basic Statistical Analysis. Pearson.Google ScholarGoogle Scholar
  51. Arkadiusz Stopczynski, Vedran Sekara, Piotr Sapiezynski, Andrea Cuttone, Jakob Eg Larsen, and Sune Lehmann. 2014. Measuring large-scale social networks with high resolution. PLOS One 9, 4 (2014), e95978.Google ScholarGoogle ScholarCross RefCross Ref
  52. Michael Szell and Stefan Thurner. 2013. How women organize social networks different from men. Sci. Rep. 3 (July 2013).Google ScholarGoogle Scholar
  53. Jie Tang, Tiancheng Lou, Jon Kleinberg, and Sen Wu. 2016. Transfer learning to infer social ties across heterogeneous networks. ACM Trans. Inf. Syst. 34, 2, Article 7 (April 2016). Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Jie Tang, Sen Wu, and Jimeng Sun. 2013. Confluence: Conformity influence in large social networks. In KDD’13. ACM, 347--355. Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. 2008. ArnetMiner: Extraction and mining of academic social networks. In KDD’08. 990--998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Dashun Wang, Dino Pedreschi, Chaoming Song, Fosca Giannotti, and Albert-Laszlo Barabasi. 2011. Human mobility, social ties, and link prediction. In KDD’11. ACM, 1100--1108. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Josh Ying, Yao-Jen Chang, Chi-Min Huang, and Vincent S. Tseng. 2012. Demographic prediction based on user’s mobile behaviors. In Nokia MDC’12.Google ScholarGoogle Scholar
  58. Yuchen Zhao, Guan Wang, Philip S. Yu, Shaobo Liu, and Simon Zhang. 2013. Inferring social roles and statuses in social networks. In KDD’13. 695--703. Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Yu Zheng. 2015. Trajectory data mining: An overview. ACM Trans. Intell. Syst. Technol. (TIST) 6, 3 (2015), 29. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. User Modeling on Demographic Attributes in Big Mobile Social Networks

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in

              Full Access

              • Published in

                cover image ACM Transactions on Information Systems
                ACM Transactions on Information Systems  Volume 35, Issue 4
                Special issue: Search, Mining and their Applications on Mobile Devices
                October 2017
                461 pages
                ISSN:1046-8188
                EISSN:1558-2868
                DOI:10.1145/3112649
                Issue’s Table of Contents

                Copyright © 2017 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 11 July 2017
                • Accepted: 1 December 2016
                • Revised: 1 October 2016
                • Received: 1 June 2016
                Published in tois Volume 35, Issue 4

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • research-article
                • Research
                • Refereed

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader