Skip to main content
Erschienen in: Knowledge and Information Systems 2/2015

01.08.2015 | Regular Paper

Reconstructing individual mobility from smart card transactions: a collaborative space alignment approach

verfasst von: Fuzheng Zhang, Nicholas Jing Yuan, Yingzi Wang, Xing Xie

Erschienen in: Knowledge and Information Systems | Ausgabe 2/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Smart card transactions capture rich information of human mobility and urban dynamics and therefore are of particular interest to urban planners and location-based service providers. However, since most transaction systems are only designated for billing purpose, typically, fine-grained location information, such as the exact boarding and alighting stops of a bus trip, is only partially or not available at all, which blocks deep exploitation of this rich and valuable data at individual level. This paper presents a collaborative space alignment framework to reconstruct individual mobility history from a metropolitan-scale smart card transaction dataset. Specifically, we show that by delicately aligning the monetary space and geospatial space with the temporal space, we are able to extrapolate a series of critical domain-specific constraints. Later, these constraints are naturally incorporated into a semi-supervised conditional random field (CRF) to infer the exact boarding and alighting stops of all transit routes, where the features of the CRF model consist of not only pre-defined indicator features extracted from individual trips but also latent features crafted from different users’ trips using collaborative filtering. Here, we consider two types of collaborative features: (1) the similarity in terms of users’ choices of bus lines and (2) latent temporal patterns of users’ commuting behaviors. Extensive experimental results show that our approach achieves a high accuracy, e.g., given only 10 % trips with known alighting/boarding stops, and we successfully inferred more than 79 % alighting and boarding stops from all unlabeled trips. In particular, we validated that the extracted collaborative features significantly contribute to the accuracy of our model. In addition, we have demonstrated that by applying our approach to enrich the data, the performance of a conventional method for identifying users’ home and work places can be dramatically improved (with 83 % improvement on home detection and 38 % improvement on work place detection). The proposed method offers the possibility to mine individual mobility from common public transit transactions, and showcases how uncertain data can be leveraged with domain knowledge and constraints, to support cross-application data-mining tasks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
6
We take the right limit here since \(b(t)\) is a step function as depicted on the top part of Fig. 3.
 
7
We employed the square distance as the score function in our implementation, following [9].
 
8
We adopt the term “interest” following the idioms used in the recommender system community.
 
10
According to our privacy agreement with the participants, we cannot show the density distribution of their home and work places (as Fig. 12) here.
 
Literatur
1.
Zurück zum Zitat Agard B, Morency C, Trépanier M (2006) Mining public transport user behaviour from smart card data. In: 12th IFAC symposium on information control problems in manufacturing-INCOM, pp 17–19 Agard B, Morency C, Trépanier M (2006) Mining public transport user behaviour from smart card data. In: 12th IFAC symposium on information control problems in manufacturing-INCOM, pp 17–19
2.
Zurück zum Zitat Barry JJ, Freimer R, Slavin H (2009) Use of entry-only automatic fare collection data to estimate linked transit trips in New York city. Transp Res Rec J Transp Res Board 2112(1):53–61CrossRef Barry JJ, Freimer R, Slavin H (2009) Use of entry-only automatic fare collection data to estimate linked transit trips in New York city. Transp Res Rec J Transp Res Board 2112(1):53–61CrossRef
3.
Zurück zum Zitat Bassett DR Jr, Wyatt HR, Thompson H, Peters JC, Hill JO (2010) Pedometer-measured physical activity and health behaviors in United States adults. Med Sci Sports Exerc 42(10):1819CrossRef Bassett DR Jr, Wyatt HR, Thompson H, Peters JC, Hill JO (2010) Pedometer-measured physical activity and health behaviors in United States adults. Med Sci Sports Exerc 42(10):1819CrossRef
4.
Zurück zum Zitat Bin M (2009) The spatial organization of the separation between jobs and residential locations in beijing. Acta Geogr Sinica 12:009 Bin M (2009) The spatial organization of the separation between jobs and residential locations in beijing. Acta Geogr Sinica 12:009
5.
Zurück zum Zitat Ceapa I, Smith C, Capra L (2012) Avoiding the crowds: understanding tube station congestion patterns from trip data. In: Proceedings of the ACM SIGKDD international workshop on urban computing, pp 134–141 Ceapa I, Smith C, Capra L (2012) Avoiding the crowds: understanding tube station congestion patterns from trip data. In: Proceedings of the ACM SIGKDD international workshop on urban computing, pp 134–141
6.
Zurück zum Zitat Cranshaw J, Toch E, Hong J, Kittur A, Sadeh N (2010) Bridging the gap between physical location and online social networks. In: Ubicomp, pp 119–128 Cranshaw J, Toch E, Hong J, Kittur A, Sadeh N (2010) Bridging the gap between physical location and online social networks. In: Ubicomp, pp 119–128
7.
Zurück zum Zitat Cui A (2006) Bus passenger origin-destination matrix estimation using automated data collection systems. Master’s thesis, Massachusetts Institute of Technology Cui A (2006) Bus passenger origin-destination matrix estimation using automated data collection systems. Master’s thesis, Massachusetts Institute of Technology
8.
Zurück zum Zitat Daniels R, Mulley C (2011) Explaining walking distance to public transport: the dominance of public transport supply. World 28:30 Daniels R, Mulley C (2011) Explaining walking distance to public transport: the dominance of public transport supply. World 28:30
9.
Zurück zum Zitat Druck G, Mann G, McCallum A (2009) Semi-supervised learning of dependency parsers using generalized expectation criteria. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP, vol 1, pp 360–368 Druck G, Mann G, McCallum A (2009) Semi-supervised learning of dependency parsers using generalized expectation criteria. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP, vol 1, pp 360–368
10.
Zurück zum Zitat Ge Y, Xiong H, Tuzhilin A, Xiao K, Gruteser M, Pazzani M (2010) An energy-efficient mobile recommender system. In: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 899–908 Ge Y, Xiong H, Tuzhilin A, Xiao K, Gruteser M, Pazzani M (2010) An energy-efficient mobile recommender system. In: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 899–908
11.
Zurück zum Zitat Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L (2008) Detecting influenza epidemics using search engine query data. Nature 457(7232):1012–1014CrossRef Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L (2008) Detecting influenza epidemics using search engine query data. Nature 457(7232):1012–1014CrossRef
12.
Zurück zum Zitat Gonzalez MC, Hidalgo CA, Barabasi AL (2008) Understanding individual human mobility patterns. Nature 453(7196):779–782CrossRef Gonzalez MC, Hidalgo CA, Barabasi AL (2008) Understanding individual human mobility patterns. Nature 453(7196):779–782CrossRef
13.
Zurück zum Zitat Hoh B, Gruteser M, Xiong H, Alrabady A (2010) Achieving guaranteed anonymity in GPS traces via uncertainty-aware path cloaking. IEEE Trans Mob Comput 9(8):1089–1107CrossRef Hoh B, Gruteser M, Xiong H, Alrabady A (2010) Achieving guaranteed anonymity in GPS traces via uncertainty-aware path cloaking. IEEE Trans Mob Comput 9(8):1089–1107CrossRef
14.
Zurück zum Zitat Isaacman S, Becker R, Cáceres R, Kobourov S, Martonosi M, Rowland J, Varshavsky A (2011) Identifying important places in peoples lives from cellular network data. In: Pervasive computing, pp 133–151 Isaacman S, Becker R, Cáceres R, Kobourov S, Martonosi M, Rowland J, Varshavsky A (2011) Identifying important places in peoples lives from cellular network data. In: Pervasive computing, pp 133–151
15.
Zurück zum Zitat Lafferty JD, McCallum A, Pereira FCN (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of the eighteenth international conference on machine learning, ICML ’01, pp 282–289 Lafferty JD, McCallum A, Pereira FCN (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of the eighteenth international conference on machine learning, ICML ’01, pp 282–289
16.
Zurück zum Zitat Lathia N, Capra L (2011) Mining mobility data to minimise travellers’ spending on public transport. In: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp 1181–1189 Lathia N, Capra L (2011) Mining mobility data to minimise travellers’ spending on public transport. In: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp 1181–1189
17.
Zurück zum Zitat Lathia N, Froehlich J, Capra L (2010) Mining public transport usage for personalised intelligent transport systems. In: 2010 IEEE 10th international conference on data mining (ICDM), IEEE, pp 887–892 Lathia N, Froehlich J, Capra L (2010) Mining public transport usage for personalised intelligent transport systems. In: 2010 IEEE 10th international conference on data mining (ICDM), IEEE, pp 887–892
18.
Zurück zum Zitat Liu L, Hou A, Biderman A, Ratti C, Chen J (2009) Understanding individual and collective mobility patterns from smart card records: a case study in shenzhen. In: Intelligent transportation systems, 2009. ITSC’09, IEEE, pp 1–6 Liu L, Hou A, Biderman A, Ratti C, Chen J (2009) Understanding individual and collective mobility patterns from smart card records: a case study in shenzhen. In: Intelligent transportation systems, 2009. ITSC’09, IEEE, pp 1–6
19.
Zurück zum Zitat Mann G, McCallum A (2008) Generalized expectation criteria for semi-supervised learning of conditional random fields. In: Proceedings of ACL, pp 870–878 Mann G, McCallum A (2008) Generalized expectation criteria for semi-supervised learning of conditional random fields. In: Proceedings of ACL, pp 870–878
20.
Zurück zum Zitat Mann GS, McCallum A (2010) Generalized expectation criteria for semi-supervised learning with weakly labeled data. J Mach Learn Res 11:955–984 Mann GS, McCallum A (2010) Generalized expectation criteria for semi-supervised learning with weakly labeled data. J Mach Learn Res 11:955–984
21.
Zurück zum Zitat de Montjoye YA, Hidalgo CA, Verleysen M, Blondel VD (2013) Unique in the crowd: the privacy bounds of human mobility. Scientific Reports 3 de Montjoye YA, Hidalgo CA, Verleysen M, Blondel VD (2013) Unique in the crowd: the privacy bounds of human mobility. Scientific Reports 3
22.
Zurück zum Zitat Pelletier MP, Trépanier M, Morency C (2011) Smart card data use in public transit: a literature review. Transp Res Part C Emerg Technol 19(4):557–568CrossRef Pelletier MP, Trépanier M, Morency C (2011) Smart card data use in public transit: a literature review. Transp Res Part C Emerg Technol 19(4):557–568CrossRef
23.
Zurück zum Zitat Rendle S, Freudenthaler C, Gantner Z, Schmidt-Thieme L (2009) Bpr: Bayesian personalized ranking from implicit feedback. In: Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence, AUAI Press, pp 452–461 Rendle S, Freudenthaler C, Gantner Z, Schmidt-Thieme L (2009) Bpr: Bayesian personalized ranking from implicit feedback. In: Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence, AUAI Press, pp 452–461
24.
Zurück zum Zitat Sarawagi S, Cohen WW (2004) Semi-markov conditional random fields for information extraction. Adv Neural Inf Process Syst 17:1185–1192 Sarawagi S, Cohen WW (2004) Semi-markov conditional random fields for information extraction. Adv Neural Inf Process Syst 17:1185–1192
25.
Zurück zum Zitat Song C, Qu Z, Blumm N, Barabási AL (2010) Limits of predictability in human mobility. Science 327(5968):1018–1021CrossRef Song C, Qu Z, Blumm N, Barabási AL (2010) Limits of predictability in human mobility. Science 327(5968):1018–1021CrossRef
26.
Zurück zum Zitat Trépanier M, Tranchant N, Chapleau R (2007) Individual trip destination estimation in a transit smart card automated fare collection system. J Intell Transp Syst 11(1):1–14CrossRef Trépanier M, Tranchant N, Chapleau R (2007) Individual trip destination estimation in a transit smart card automated fare collection system. J Intell Transp Syst 11(1):1–14CrossRef
27.
Zurück zum Zitat Utsunomiya M, Attanucci J, Wilson N (2006) Potential uses of transit smart card registration and transaction data to improve transit planning. Transp Res Rec J Transp Res Board 1971(1):119–126CrossRef Utsunomiya M, Attanucci J, Wilson N (2006) Potential uses of transit smart card registration and transaction data to improve transit planning. Transp Res Rec J Transp Res Board 1971(1):119–126CrossRef
28.
Zurück zum Zitat Wang D, Pedreschi D, Song C, Giannotti F, Barabasi AL (2011a) Human mobility, social ties, and link prediction. In: Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1100–1108 Wang D, Pedreschi D, Song C, Giannotti F, Barabasi AL (2011a) Human mobility, social ties, and link prediction. In: Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1100–1108
29.
Zurück zum Zitat Wang W, Attanucci JP, Wilson NH (2011b) Bus passenger origin-destination estimation and related analyses using automated data collection systems. J Public Transp 14(4) Wang W, Attanucci JP, Wilson NH (2011b) Bus passenger origin-destination estimation and related analyses using automated data collection systems. J Public Transp 14(4)
30.
Zurück zum Zitat Yuan J, Zheng Y, Xie X, Sun G (2013) T-drive: enhancing driving directions with taxi drivers’ intelligence. IEEE Trans Knowl Data Eng 25(1):220–232CrossRef Yuan J, Zheng Y, Xie X, Sun G (2013) T-drive: enhancing driving directions with taxi drivers’ intelligence. IEEE Trans Knowl Data Eng 25(1):220–232CrossRef
31.
Zurück zum Zitat Zhao J, Rahbee A, Wilson NH (2007) Estimating a rail passenger trip origin-destination matrix using automatic data collection systems. Comput Aid Civil Infrastruct Eng 22(5):376–387CrossRef Zhao J, Rahbee A, Wilson NH (2007) Estimating a rail passenger trip origin-destination matrix using automatic data collection systems. Comput Aid Civil Infrastruct Eng 22(5):376–387CrossRef
Metadaten
Titel
Reconstructing individual mobility from smart card transactions: a collaborative space alignment approach
verfasst von
Fuzheng Zhang
Nicholas Jing Yuan
Yingzi Wang
Xing Xie
Publikationsdatum
01.08.2015
Verlag
Springer London
Erschienen in
Knowledge and Information Systems / Ausgabe 2/2015
Print ISSN: 0219-1377
Elektronische ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-014-0763-x

Weitere Artikel der Ausgabe 2/2015

Knowledge and Information Systems 2/2015 Zur Ausgabe