Abstract
Call detail records (CDRs) have recently been used in studying different aspects of human mobility. While CDRs provide a means of sampling user locations at large population scales, they may not sample all locations proportionate to the visitation frequency of a user, owing to sparsity in time and space of voice-calls, thereby introducing a bias. Also, as the rate of sampling is inherently dependent on the calling frequencies of an individual, high voice-call activity users are often chosen for conducting a meaningful study. Such a selection process can, inadvertently, lead to a biased view as high frequency callers may not always be representative of an entire population. With the advent of 3G technology and wide adoption of smart-phones, cellular devices have become versatile end-hosts. As the data accessed on these devices does not always require human initiation, it affords us with an unprecedented opportunity to validate the utility of CDRs for studying human mobility. In this work, we investigate various metrics for human mobility studied in literature for over a million cellular users in the San Francisco bay-area, for over a month. Our findings reveal that although the voice-call process does well to sample significant locations, such as home and work, it may in some cases incur biases in capturing the overall spatio-temporal characteristics of individual human mobility. Additionally, we motivate an "artificially" imposed sampling process, vis-a-vis the voice-call process with the same average intensity. We observe that in many cases such an imposed sampling process yields better performance results based on the usual metrics like entropies and marginal distributions used often in literature.
- A.-L. Barabasi, The origin of bursts and heavy tails in human dynamics, Nature 435 (2005), 207--2011.Google ScholarCross Ref
- R. Becker, R. Cáceres, K. Hanson, J. M. Loh, S. Urbanek, A. Varshavsky, and C. Volinsky, Classifying routes using cellular handoff patterns, Proc. of Netmob 2011 (2011).Google Scholar
- D. Borckmann, L. Hufnagel, and T. Geisel, The scaling laws of human travel, Nature 439 (2006), 462--465.Google ScholarCross Ref
- A. Chaintreau, P. Hui, J. Crowcroft, C. Diot, R. Gass, and J. Scott, Impact of human mobility on the design of opportunistic forwarding algorithms, Proc. IEEE Infocom'06, Barcelona, Spain, Apr. 2006.Google ScholarCross Ref
- T. Couronné, Z. Smoreda, and A.-M. Olteanu, Chatty mobiles: Individual mobility and communication patterns, Proc. of Netmob 2011 (2011).Google Scholar
- T. M. Cover and J. A. Thomas, Elements of information theory, Wiley-Interscience, 1991. Google ScholarDigital Library
- M. C. Gonzalez, C. A. Hidalgo, and A.-L. Barabasi, Understanding individual human mobility patterns, Nature 435 (2008), 779--782.Google ScholarCross Ref
- J. A. Hartigan, Clustering algorithms, John Wiley & Sons, New York (1975). Google ScholarDigital Library
- http://crawdad.cs.dartmouth.edu/.Google Scholar
- S. Isaacman, R. Becker, R. Cáceres, S. Kobourov, M. Martonosi, J. Rowland, and A. Varshavsky, Identifying important places in people's lives from cellular network data, 9th International Conference on Pervasive Computing Pervasive (2011). Google ScholarDigital Library
- M. Kim and D. Kotz, Periodic properties of user mobility and access-point popularity, Journal of Personal and Ubiquitous Computing 11 (2007), no. 6. Google ScholarDigital Library
- M. Kim, D. Kotz, and S. Kim, Extracting a mobility model from real user traces, Proc. IEEE Infocom'06 (2006).Google ScholarCross Ref
- J. Lin, Divergence measures based on the Shannon entropy, IEEE transactions on information theory 37, no. 1, 145--151. Google ScholarDigital Library
- A. Nanavati, S. Gurumurthy, G. Das, D. Chakraborty, K. Dasgupta, S. Mukherjea, and A. Joshi, On the structural properties of massive telecom call graphs: findings and implications, Proc. of 15th ACM Conference on Information and Knowledge Management, 2006, pp. 435--444. Google ScholarDigital Library
- A. Noulas, S. Scellato, R. Lambiotte, M. Pontil, and C. Mascolo, A tale of many cities: universal patterns in human urban mobility, PLoS One 7 (2012).Google Scholar
- G. Ranjan, Z.-L. Zhang, S. Ranjan, R. Keralapura, and J. Robinson, Un-zipping cellular infrastructure locations via user geo-intent, Proc. of Infocom (2011).Google ScholarCross Ref
- I. Rhee,M. Shin, S. Hong, K. Lee, and S. Chong, On the levy-walk nature of human mobility: do humans walk like monkeys?, Proc. IEEE Infocom'08, 2008.Google Scholar
- C. Song, Z. Qu, N. Blumm, and A.-L. Barabasi, Limits of predictability in human mobility, Science 327 (2010), 1018--1021.Google ScholarCross Ref
- L. Song, D. Kotz, R. Jain, and X. He, Evaluating next-cell predictors with extensive WiFi mobility data, IEEE Transactions on Mobile Computing 5 (December 2006), no. 12. Google ScholarDigital Library
- I. Trestian, S. Ranjan, A. Kuzmanovic, and A. Nucci, Measuring serendipity: Connecting people, locations and interest in a mobile 3G network, Proc. of ACM Internet Measurement Conference (2009). Google ScholarDigital Library
Index Terms
- Are call detail records biased for sampling human mobility?
Recommendations
Call Detail Records for Human Mobility Studies: Taking Stock of the Situation in the "Always Connected Era"
Big-DAMA '17: Proceedings of the Workshop on Big Data Analytics and Machine Learning for Data Communication NetworksThe exploitation of cellular network data for studying human mobility has been a popular research topic in the last decade. Indeed, mobile terminals could be considered ubiquitous sensors that allow the observation of human movements on large scale ...
Understanding the bias of call detail records in human mobility research
Human Dynamics in the Mobile and Big Data EraIn recent years, call detail records CDRs have been widely used in human mobility research. Although CDRs are originally collected for billing purposes, the vast amount of digital footprints generated by calling and texting activities provide useful ...
A collective human mobility analysis method based on data usage detail records
Human mobility patterns have been widely investigated due to their application in a wide variety of fields, for example urban planning and epidemiology. Many studies have introduced spatial networks into human mobility analyses at the collective level. ...
Comments