Skip to main content
Erschienen in: Data Mining and Knowledge Discovery 5/2017

04.04.2017

On temporal-constrained sub-trajectory cluster analysis

verfasst von: Nikos Pelekis, Panagiotis Tampakis, Marios Vodas, Christos Doulkeridis, Yannis Theodoridis

Erschienen in: Data Mining and Knowledge Discovery | Ausgabe 5/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Cluster analysis over Moving Object Databases (MODs) is a challenging research topic that has attracted the attention of the mobility data mining community. In this paper, we study the temporal-constrained sub-trajectory cluster analysis problem, where the aim is to discover clusters of sub-trajectories given an ad-hoc, user-specified temporal constraint within the dataset’s lifetime. The problem is challenging because: (a) the time window is not known in advance, instead it is specified at query time, and (b) the MOD is continuously updated with new trajectories. Existing solutions first filter the trajectory database according to the temporal constraint, and then apply a clustering algorithm from scratch on the filtered data. However, this approach is extremely inefficient, when considering explorative data analysis where multiple clustering tasks need to be performed over different temporal subsets of the database, while the database is updated with new trajectories. To address this problem, we propose an incremental and scalable solution to the problem, which is built upon a novel indexing structure, called Representative Trajectory Tree (ReTraTree). ReTraTree acts as an effective spatio-temporal partitioning technique; partitions in ReTraTree correspond to groupings of sub-trajectories, which are incrementally maintained and assigned to representative (sub-)trajectories. Due to the proposed organization of sub-trajectories, the problem under study can be efficiently solved as simply as executing a query operator on ReTraTree, while insertion of new trajectories is supported. Our extensive experimental study performed on real and synthetic datasets shows that our approach outperforms a state-of-the-art in-DBMS solution supported by PostgreSQL by orders of magnitude.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Hermes@PostgreSQL MOD engine. URL: http://​infolab.​cs.​unipi.​gr/​hermes/​.
 
2
The GeoLife dataset is publicly available. The other two datasets are publicly available at chorochronos.datastories.org repository under the names ‘imis7days’ and ‘smod’, respectively.
 
Literatur
Zurück zum Zitat Ankerst M, Breunig MM, Kriegel H-P, Sander J (1999) OPTICS: ordering points to identify the clustering structure. In: Proceedings of ACM SIGMOD international conference on management of data, pp 49–60 Ankerst M, Breunig MM, Kriegel H-P, Sander J (1999) OPTICS: ordering points to identify the clustering structure. In: Proceedings of ACM SIGMOD international conference on management of data, pp 49–60
Zurück zum Zitat Benkert M, Gudmundsson J, Hubner F, Wolle T (2006) Reporting flock patterns. In: Proceedings of 14th annual european symposium (ESA), pp 660–671 Benkert M, Gudmundsson J, Hubner F, Wolle T (2006) Reporting flock patterns. In: Proceedings of 14th annual european symposium (ESA), pp 660–671
Zurück zum Zitat Buchin M, Driemel A, van Kreveld M, Sacristán V (2010) An algorithmic framework for segmenting trajectories based on spatio-temporal criteria. In: Proceedings of of the 18th SIGSPATIAL international conference on advances in geographic information systems, pp 202–211 Buchin M, Driemel A, van Kreveld M, Sacristán V (2010) An algorithmic framework for segmenting trajectories based on spatio-temporal criteria. In: Proceedings of of the 18th SIGSPATIAL international conference on advances in geographic information systems, pp 202–211
Zurück zum Zitat Cudre-Mauroux P, Wu E, Madden S (2010) Trajstore: an adaptive storage system for very large trajectory data sets. In: Proceedings of IEEE 26th international conference on data engineering (ICDE) Cudre-Mauroux P, Wu E, Madden S (2010) Trajstore: an adaptive storage system for very large trajectory data sets. In: Proceedings of IEEE 26th international conference on data engineering (ICDE)
Zurück zum Zitat Ester M, Kriegel H-P, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd international conference on knowledge discovery and data mining (KDD), pp 226–231 Ester M, Kriegel H-P, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd international conference on knowledge discovery and data mining (KDD), pp 226–231
Zurück zum Zitat Ferreira N, Klosowski JT, Scheidegger CE, Silva CT (2013) Vector field k-means: clustering trajectories by fitting multiple vector fields. In: Proceedings of EuroVis, pp 201–210 Ferreira N, Klosowski JT, Scheidegger CE, Silva CT (2013) Vector field k-means: clustering trajectories by fitting multiple vector fields. In: Proceedings of EuroVis, pp 201–210
Zurück zum Zitat Frentzos E, Gratsias K, Theodoridis Y (2007) Index-based most similar trajectory search. In: Proceedings of IEEE 23rd international conference on data engineering (ICDE) Frentzos E, Gratsias K, Theodoridis Y (2007) Index-based most similar trajectory search. In: Proceedings of IEEE 23rd international conference on data engineering (ICDE)
Zurück zum Zitat Gaffney S, Smyth P (1999) Trajectory clustering with mixtures of regression models. In: Proceedings of the 5th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 63–72 Gaffney S, Smyth P (1999) Trajectory clustering with mixtures of regression models. In: Proceedings of the 5th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 63–72
Zurück zum Zitat Giannotti F, Nanni M, Pedreschi D, Pinelli F, Renso C, Rinzivillo S, Trasarti R (2011) Unveiling the complexity of human mobility by querying and mining massive trajectory data. VLDB J Int J Very Large Data Bases 20(5):695–719CrossRef Giannotti F, Nanni M, Pedreschi D, Pinelli F, Renso C, Rinzivillo S, Trasarti R (2011) Unveiling the complexity of human mobility by querying and mining massive trajectory data. VLDB J Int J Very Large Data Bases 20(5):695–719CrossRef
Zurück zum Zitat Guha S, Rastigi R, Shim K (1998) CURE: an efficient clustering algorithm for large databases. In: Proceedings of ACM SIGMOD international conference on management of data, pp 73–84 Guha S, Rastigi R, Shim K (1998) CURE: an efficient clustering algorithm for large databases. In: Proceedings of ACM SIGMOD international conference on management of data, pp 73–84
Zurück zum Zitat Hadjieleftheriou M, Kollios G, Gunopulos D, Tsotras VJ (2006) Indexing spatio-temporal archives. VLDB J Int J Very Large Data Bases 15(2):143–164CrossRefMATH Hadjieleftheriou M, Kollios G, Gunopulos D, Tsotras VJ (2006) Indexing spatio-temporal archives. VLDB J Int J Very Large Data Bases 15(2):143–164CrossRefMATH
Zurück zum Zitat Hung CC, Peng WC, Lee WC (2015) Clustering and aggregating clues of trajectories for mining trajectory patterns and routes. VLDB J Int J Very Large Data Bases 24(2):169–192CrossRef Hung CC, Peng WC, Lee WC (2015) Clustering and aggregating clues of trajectories for mining trajectory patterns and routes. VLDB J Int J Very Large Data Bases 24(2):169–192CrossRef
Zurück zum Zitat Jeung H, Yiu ML, Zhou X, Jensen CS, Shen HT (2008) Discovery of convoys in trajectory databases. In: Proceedings of the VLDB endowment, pp 1068–1080 Jeung H, Yiu ML, Zhou X, Jensen CS, Shen HT (2008) Discovery of convoys in trajectory databases. In: Proceedings of the VLDB endowment, pp 1068–1080
Zurück zum Zitat Kalnis P, Mamoulis N, Bakiras S (2005) On discovering moving clusters in spatio-temporal data. In: Proceedings of the 9th international conference on advances in spatial and temporal databases (SSTD), pp 364–381 Kalnis P, Mamoulis N, Bakiras S (2005) On discovering moving clusters in spatio-temporal data. In: Proceedings of the 9th international conference on advances in spatial and temporal databases (SSTD), pp 364–381
Zurück zum Zitat Laube P, Imfeld S, Weibel R (2005) Discovering relative motion patterns in groups of moving point objects. Int J Geogr Inf Sci 19(6):639–668CrossRef Laube P, Imfeld S, Weibel R (2005) Discovering relative motion patterns in groups of moving point objects. Int J Geogr Inf Sci 19(6):639–668CrossRef
Zurück zum Zitat Lee JG, Han J, Whang KY (2007) Trajectory clustering: a partition-and-group framework. In: Proceedings of ACM SIGMOD international conference on management of data, pp 593–604 Lee JG, Han J, Whang KY (2007) Trajectory clustering: a partition-and-group framework. In: Proceedings of ACM SIGMOD international conference on management of data, pp 593–604
Zurück zum Zitat Li Z, Ding B, Han J, Kays R (2010) Swarm: mining relaxed temporal moving object clusters. Proc VLDB Endow 3(1–2):723–734CrossRef Li Z, Ding B, Han J, Kays R (2010) Swarm: mining relaxed temporal moving object clusters. Proc VLDB Endow 3(1–2):723–734CrossRef
Zurück zum Zitat Li Z, Lee JG, Li X, Han J (2010b) Incremental clustering for trajectories. In: Proceedings of the 15th international conference on database systems for advanced applications (DASFAA), pp 32–46 Li Z, Lee JG, Li X, Han J (2010b) Incremental clustering for trajectories. In: Proceedings of the 15th international conference on database systems for advanced applications (DASFAA), pp 32–46
Zurück zum Zitat Li Y, Bailey J, Kulik L (2015) Efficient mining of platoon patterns in trajectory databases. Data Knowl Eng 100(PA):167–187CrossRef Li Y, Bailey J, Kulik L (2015) Efficient mining of platoon patterns in trajectory databases. Data Knowl Eng 100(PA):167–187CrossRef
Zurück zum Zitat MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the \(5{{th}}\) Berkeley symposium on mathematical statistics and probability, pp 281–297 MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the \(5{{th}}\) Berkeley symposium on mathematical statistics and probability, pp 281–297
Zurück zum Zitat Nanni M, Pedreschi D (2006) Time-focused clustering of trajectories of moving objects. J Intell Inf Syst 27(3):267–289CrossRef Nanni M, Pedreschi D (2006) Time-focused clustering of trajectories of moving objects. J Intell Inf Syst 27(3):267–289CrossRef
Zurück zum Zitat Ni J, Ravishankar CV (2007) Indexing spatio-temporal trajectories with efficient polynomial approximations. IEEE Trans Knowl Data Eng 19(5):663–678CrossRef Ni J, Ravishankar CV (2007) Indexing spatio-temporal trajectories with efficient polynomial approximations. IEEE Trans Knowl Data Eng 19(5):663–678CrossRef
Zurück zum Zitat Panagiotakis C, Pelekis N, Kopanakis I, Ramasso E, Theodoridis Y (2012) Segmentation and sampling of moving object trajectories based on representativeness. IEEE Trans Knowl Data Eng 24(7):1328–1343CrossRef Panagiotakis C, Pelekis N, Kopanakis I, Ramasso E, Theodoridis Y (2012) Segmentation and sampling of moving object trajectories based on representativeness. IEEE Trans Knowl Data Eng 24(7):1328–1343CrossRef
Zurück zum Zitat Pelekis N, Theodoridis Y (2014) Mobility data management and exploration. Springer, BerlinCrossRef Pelekis N, Theodoridis Y (2014) Mobility data management and exploration. Springer, BerlinCrossRef
Zurück zum Zitat Pelekis N, Kopanakis I, Kotsifakos E, Frentzos E, Theodoridis Y (2011) Clustering uncertain trajectories. Knowl Inf Syst 28(1):117–147CrossRef Pelekis N, Kopanakis I, Kotsifakos E, Frentzos E, Theodoridis Y (2011) Clustering uncertain trajectories. Knowl Inf Syst 28(1):117–147CrossRef
Zurück zum Zitat Pfoser D, Jensen CS, Theodoridis Y (2000) Novel approaches to the indexing of moving object trajectories. In: Proceedings of the VLDB endowment, pp 395–406 Pfoser D, Jensen CS, Theodoridis Y (2000) Novel approaches to the indexing of moving object trajectories. In: Proceedings of the VLDB endowment, pp 395–406
Zurück zum Zitat Tang LA, Zheng Y, Yuan J, Han J, Leung A, Peng W, Porta TL (2013) A framework of traveling companion discovery on trajectory data streams. ACM Trans Intell Syst Technol 5(1):3CrossRef Tang LA, Zheng Y, Yuan J, Han J, Leung A, Peng W, Porta TL (2013) A framework of traveling companion discovery on trajectory data streams. ACM Trans Intell Syst Technol 5(1):3CrossRef
Zurück zum Zitat Tao Y, Papadias D (2001) MV3R-tree: a spatio-temporal access method for timestamp and interval queries. In: Proceedings of the VLDB endowment, pp 431–440 Tao Y, Papadias D (2001) MV3R-tree: a spatio-temporal access method for timestamp and interval queries. In: Proceedings of the VLDB endowment, pp 431–440
Zurück zum Zitat Theodoridis Y, Vazirgiannis M, Sellis T (1996) Spatio-temporal indexing for large multimedia applications. In: Proceedings of the 3rd IEEE international conference on multimedia computing and systems (ICMCS) Theodoridis Y, Vazirgiannis M, Sellis T (1996) Spatio-temporal indexing for large multimedia applications. In: Proceedings of the 3rd IEEE international conference on multimedia computing and systems (ICMCS)
Zurück zum Zitat Wang S, Wu L, Zhou F, Zheng C, Wang H (2015) Group pattern mining algorithm of moving objects’ uncertain trajectories. Int J Comput Commun Control 10(3):428–440CrossRef Wang S, Wu L, Zhou F, Zheng C, Wang H (2015) Group pattern mining algorithm of moving objects’ uncertain trajectories. Int J Comput Commun Control 10(3):428–440CrossRef
Zurück zum Zitat Xu H, Zhou Y, Lin W, Zha H (2015) Unsupervised trajectory clustering via adaptive multi-kernel-based shrinkage. In: Proceedings of IEEE international conference on computer vision (ICCV) Xu H, Zhou Y, Lin W, Zha H (2015) Unsupervised trajectory clustering via adaptive multi-kernel-based shrinkage. In: Proceedings of IEEE international conference on computer vision (ICCV)
Zurück zum Zitat Yuan G, Sun P, Zhao J, Li D, Wang C (2017) A review of moving object trajectory clustering algorithms. Artif Intell Rev 47(1):123–144CrossRef Yuan G, Sun P, Zhao J, Li D, Wang C (2017) A review of moving object trajectory clustering algorithms. Artif Intell Rev 47(1):123–144CrossRef
Zurück zum Zitat Zhang T, Ramakrishnan R, Livny M (1996) BIRCH: an efficient data clustering method for very large databases. In: Proceedings of ACM SIGMOD international conference on management of data, pp 103–114 Zhang T, Ramakrishnan R, Livny M (1996) BIRCH: an efficient data clustering method for very large databases. In: Proceedings of ACM SIGMOD international conference on management of data, pp 103–114
Zurück zum Zitat Zheng Y (2015) Trajectory data mining: an overview. ACM Trans Intell Syst Technol 6(3):29CrossRef Zheng Y (2015) Trajectory data mining: an overview. ACM Trans Intell Syst Technol 6(3):29CrossRef
Zurück zum Zitat Zheng Y, Xie X, Ma WY (2010) GeoLife: a collaborative social networking service among user, location and trajectory. IEEE Data Eng Bull 33(2):32–40 Zheng Y, Xie X, Ma WY (2010) GeoLife: a collaborative social networking service among user, location and trajectory. IEEE Data Eng Bull 33(2):32–40
Zurück zum Zitat Zheng K, Zheng Y, Yuan NJ, Shang S (2013) On discovery of gathering patterns from trajectories. In: Proceedings of IEEE 23rd international conference on data engineering (ICDE) Zheng K, Zheng Y, Yuan NJ, Shang S (2013) On discovery of gathering patterns from trajectories. In: Proceedings of IEEE 23rd international conference on data engineering (ICDE)
Zurück zum Zitat Zheng K, Zheng Y, Yuan NJ, Shang S, Zhou X (2014) Online discovery of gathering patterns over trajectories. IEEE Trans Knowl Data Eng 26(8):1974–1988CrossRef Zheng K, Zheng Y, Yuan NJ, Shang S, Zhou X (2014) Online discovery of gathering patterns over trajectories. IEEE Trans Knowl Data Eng 26(8):1974–1988CrossRef
Metadaten
Titel
On temporal-constrained sub-trajectory cluster analysis
verfasst von
Nikos Pelekis
Panagiotis Tampakis
Marios Vodas
Christos Doulkeridis
Yannis Theodoridis
Publikationsdatum
04.04.2017
Verlag
Springer US
Erschienen in
Data Mining and Knowledge Discovery / Ausgabe 5/2017
Print ISSN: 1384-5810
Elektronische ISSN: 1573-756X
DOI
https://doi.org/10.1007/s10618-017-0503-4

Weitere Artikel der Ausgabe 5/2017

Data Mining and Knowledge Discovery 5/2017 Zur Ausgabe