skip to main content
10.1145/2124295.2124380acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Finding your friends and following them to where you are

Published:08 February 2012Publication History

ABSTRACT

Location plays an essential role in our lives, bridging our online and offline worlds. This paper explores the interplay between people's location, interactions, and their social ties within a large real-world dataset. We present and evaluate Flap, a system that solves two intimately related tasks: link and location prediction in online social networks. For link prediction, Flap infers social ties by considering patterns in friendship formation, the content of people's messages, and user location. We show that while each component is a weak predictor of friendship alone, combining them results in a strong model, accurately identifying the majority of friendships. For location prediction, Flap implements a scalable probabilistic model of human mobility, where we treat users with known GPS positions as noisy sensors of the location of their friends. We explore supervised and unsupervised learning scenarios, and focus on the efficiency of both learning and inference. We evaluate Flap on a large sample of highly active users from two distinct geographical areas and show that it (1) reconstructs the entire friendship graph with high accuracy even when no edges are given; and (2) infers people's fine-grained location, even when they keep their data private and we can only access the location of their friends. Our models significantly outperform current comparable approaches to either task.

Skip Supplemental Material Section

Supplemental Material

wsdm_day3_session2_3.mp4

mp4

108.9 MB

References

  1. L. Backstrom and J. Leskovec. Supervised random walks: predicting and recommending links in social networks. In Proceedings of the fourth ACM international conference on Web search and data mining, pages 635--644. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Backstrom, E. Sun, and C. Marlow. Find me if you can: improving geographical prediction with social and spatial proximity. In Proceedings of the 19th international conference on World wide web, pages 61--70. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Bell, Y. Koren, and C. Volinsky. Modeling relationships at multiple scales to improve accuracy of large recommender systems. In KDD, pages 95--104, New York, NY, USA, 2007. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. L. Breiman et al. Classification and Regression Trees. Chapman & Hall, New York, 1984.Google ScholarGoogle Scholar
  5. A. B. Brush, J. Krumm, and J. Scott. Exploring end user preferences for location obfuscation, location-based services, and the value of location. In Ubicomp, Ubicomp '10, pages 95--104, New York, NY, USA, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. E. Cho, S. A. Myers, and J. Leskovec. Friendship and mobility: User movement in location-based social networks. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. Crandall, L. Backstrom, D. Cosley, S. Suri, D. Huttenlocher, and J. Kleinberg. Inferring social ties from geographic coincidences. Proceedings of the National Academy of Sciences, 107(52):22436, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  8. M. Datar, N. Immorlica, P. Indyk, and V. Mirrokni. Locality-sensitive hashing scheme based on p-stable distributions. In Proceedings of the twentieth annual symposium on Computational geometry, pages 253--262. ACM, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, 39(1):1--38, 1977.Google ScholarGoogle Scholar
  10. N. Eagle and A. Pentland. Reality mining: sensing complex social systems. Personal and Ubiquitous Computing, 10(4):255--268, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. Easley and J. Kleinberg. Networks, Crowds, and Markets: Reasoning About a Highly Connected World. Cambridge University Press, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Eubank, H. Guclu, V. Anil Kumar, M. Marathe, A. Srinivasan, Z. Toroczkai, and N. Wang. Modelling disease outbreaks in realistic urban social networks. Nature, 429(6988):180--184, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  13. Z. Ghahramani. Learning dynamic Bayesian networks. In Adaptive Processing of Sequences and Data Structures, page 168. Springer, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Gruzd, B. Wellman, and Y. Takhteyev. Imagining twitter as an imagined community. In American Behavioral Scientist, Special issue on Imagined Communities, 2011.Google ScholarGoogle Scholar
  15. L. Hong, B. Suh, and E. H. Chi. Tweets from justin bieber's heart: the dynamics of the "location" field in user profiles. In ACM CHI, 2011.Google ScholarGoogle Scholar
  16. A. Java, X. Song, T. Finin, and B. Tseng. Why we twitter: understanding microblogging usage and communities. In WebKDD/SNA-KDD, pages 56--65, New York, NY, USA, 2007. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Jordan. Learning in graphical models. Kluwer Academic Publishers, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. H. Kwak, C. Lee, H. Park, and S. Moon. What is Twitter, a Social Network or a News Media? In WWW, April 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. Liben-Nowell and J. Kleinberg. The link-prediction problem for social networks. J. Am. Soc. Inf. Sci. Technol., 58:1019--1031, May 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Liben-Nowell, J. Novak, R. Kumar, P. Raghavan, and A. Tomkins. Geographic routing in social networks. Proceedings of the National Academy of Sciences of the United States of America, 102(33):11623, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  21. K. Murphy. Dynamic bayesian networks: representation, inference and learning. PhD thesis, University of California, Berkeley, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. K. P. Murphy, Y. Weiss, and M. I. Jordan. Loopy belief propagation for approximate inference: An empirical study. In In Proceedings of Uncertainty in AI, pages 467--475, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Pearl. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. A. S. Pentland. Honest Signals: How They Shape Our World. The MIT Press, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Scellato, A. Noulas, R. Lambiotte, and C. Mascolo. Socio-spatial properties of online location-based social networks. Proceedings of ICWSM, 11, 2011.Google ScholarGoogle Scholar
  26. D. Smith and J. Eisner. Dependency parsing by belief propagation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 145--156. Association for Computational Linguistics, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. C. Song, Z. Qu, N. Blumm, and A. Barabási. Limits of predictability in human mobility. Science, 327(5968):1018, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  28. B. Taskar, M.-F. Wong, P. Abbeel, and D. Koller. Link prediction in relational data. In in Neural Information Processing Systems, 2003.Google ScholarGoogle Scholar
  29. N. Ueda and R. Nakano. Deterministic annealing em algorithm. Neural Networks, 11(2):271--282, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Finding your friends and following them to where you are

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      WSDM '12: Proceedings of the fifth ACM international conference on Web search and data mining
      February 2012
      792 pages
      ISBN:9781450307475
      DOI:10.1145/2124295

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 February 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate498of2,863submissions,17%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader