Skip to main content
Top
Published in: Journal of Intelligent Information Systems 1/2014

01-02-2014

Dealing with trajectory streams by clustering and mathematical transforms

Authors: Gianni Costa, Giuseppe Manco, Elio Masciari

Published in: Journal of Intelligent Information Systems | Issue 1/2014

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Nowadays, almost all kind of electronic devices leave traces of their movements (e.g. smartphone, GPS devices and so on). Thus, the huge number of this “tiny” data sources leads to the generation of massive data streams of geo-referenced data. As a matter of fact, the effective analysis of such amounts of data is challenging, since the possibility to extract useful information from this peculiar kind of data is crucial in many application scenarios such as vehicle traffic management, hand-off in cellular networks, supply chain management. Moreover, spatial data streams management poses new challenges both for their proper definition and acquisition, thus making the overall process harder than for classical point data. In particular, we are interested in solving the problem of effective trajectory data streams clustering, that revealed really intriguing as we deal with sequential data that have to be properly managed due to their ordering. We propose a framework that allow data pre-elaboration in order to make the mining step more effective. As for every data mining tool, the experimental evaluation is crucial, thus we performed several tests on real world datasets that confirmed the efficiency and effectiveness of the proposed approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
For the sake of completeness we recall that \(L^2(\mathcal{R})\) is the metric space of square-integrable functions, i.e. the measurable functions for which the integral of the square of the absolute value is finite.
 
2
It is always possible to find a basis that allows this representation for the search space.
 
3
Note that here we need to compute modulo two polynomials to ensure that the dimension of A is finite.
 
Literature
go back to reference Aggarwal, C.C., Han, J., Wang, J., Yu, P.S. (2003). A framework for clustering evolving data streams. In VLDB (pp. 81–92). Aggarwal, C.C., Han, J., Wang, J., Yu, P.S. (2003). A framework for clustering evolving data streams. In VLDB (pp. 81–92).
go back to reference Arthur, D., & Vassilvitskii, S. (2007). k-means++ the advantages of careful seeding. In SODA (pp. 1027–1035). Arthur, D., & Vassilvitskii, S. (2007). k-means++ the advantages of careful seeding. In SODA (pp. 1027–1035).
go back to reference Cadez, I.V., Gaffney, S., Smyth, P. (2000). A general probabilistic framework for clustering individuals and objects. In KDD (pp. 140–149). Cadez, I.V., Gaffney, S., Smyth, P. (2000). A general probabilistic framework for clustering individuals and objects. In KDD (pp. 140–149).
go back to reference Cao, H., & Wolfson, O. (2005). Nonmaterialized motion information in transport networks. In ICDT (pp. 173–188). Cao, H., & Wolfson, O. (2005). Nonmaterialized motion information in transport networks. In ICDT (pp. 173–188).
go back to reference Chen, L., Özsu, M.T., Oria, V. (2005). Robust and fast similarity search for moving object trajectories. In SIGMOD (pp. 491–502). New York: ACM. Chen, L., Özsu, M.T., Oria, V. (2005). Robust and fast similarity search for moving object trajectories. In SIGMOD (pp. 491–502). New York: ACM.
go back to reference Chihara, T.S. (1978). An introduction to orthogonal polynomials. New York: Gordon and Breach.MATH Chihara, T.S. (1978). An introduction to orthogonal polynomials. New York: Gordon and Breach.MATH
go back to reference Chong, Z., Ni, W., Xu, L., Xu, Z., Shu, H., Zheng, J. (2010). Approximate k-median of location streams with redundancy and inconsistency. International Journal of Software and Informatics, 4(2), 165–182. Chong, Z., Ni, W., Xu, L., Xu, Z., Shu, H., Zheng, J. (2010). Approximate k-median of location streams with redundancy and inconsistency. International Journal of Software and Informatics, 4(2), 165–182.
go back to reference Ester, M., Kriegel, H.P., Sander, J., Xu, X. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD. Ester, M., Kriegel, H.P., Sander, J., Xu, X. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD.
go back to reference Flesca, S., Manco, G., Masciari, E., Pontieri, L., Pugliese, A. (2005). Fast detection of xml structural similarity. IEEE TKDE, 17(2), 160–175. Flesca, S., Manco, G., Masciari, E., Pontieri, L., Pugliese, A. (2005). Fast detection of xml structural similarity. IEEE TKDE, 17(2), 160–175.
go back to reference Gaffney, S., & Smyth, P. (1999). Trajectory clustering with mixtures of regression models. In KDD (pp. 63–72). Gaffney, S., & Smyth, P. (1999). Trajectory clustering with mixtures of regression models. In KDD (pp. 63–72).
go back to reference Giannotti, F., Nanni, M., Pinelli, F., Pedreschi, D. (2007). Trajectory pattern mining. In KDD (pp. 330–339). Giannotti, F., Nanni, M., Pinelli, F., Pedreschi, D. (2007). Trajectory pattern mining. In KDD (pp. 330–339).
go back to reference Gonzalez, H., Han, J., Li, X., Klabjan, D. (2006). Warehousing and analyzing massive RFID data sets. In ICDE. Gonzalez, H., Han, J., Li, X., Klabjan, D. (2006). Warehousing and analyzing massive RFID data sets. In ICDE.
go back to reference Gudmundsson, J., Katajainen, J., Merrick, D., Ong, C., Wolle, T. (2007). Compressing spatio-temporal trajectories. In Int. conf. algorithms and computation (pp. 763–775). Gudmundsson, J., Katajainen, J., Merrick, D., Ong, C., Wolle, T. (2007). Compressing spatio-temporal trajectories. In Int. conf. algorithms and computation (pp. 763–775).
go back to reference Han, J., & Kamber, M. (2000). Data mining: Concepts and techniques. San Mateo: Morgan Kaufmann. Han, J., & Kamber, M. (2000). Data mining: Concepts and techniques. San Mateo: Morgan Kaufmann.
go back to reference Hönle, N., Grossmann, M., Reimann, S., Mitschang, B. (2010). Usability analysis of compression algorithms for position data streams. In GIS (pp. 240–249). Hönle, N., Grossmann, M., Reimann, S., Mitschang, B. (2010). Usability analysis of compression algorithms for position data streams. In GIS (pp. 240–249).
go back to reference Jeung, H., Yiu, M.L., Zhou, X., Jensen, C.S., Shen, H.T. (2008). Discovery of convoys in trajectory databases. In Proceedings of the VLDB Endowement, vol. 1(1) (pp. 1068–1080). Jeung, H., Yiu, M.L., Zhou, X., Jensen, C.S., Shen, H.T. (2008). Discovery of convoys in trajectory databases. In Proceedings of the VLDB Endowement, vol. 1(1) (pp. 1068–1080).
go back to reference Keogh, E. (2002). Exact indexing of dynamic time warping. In VLDB (pp. 406–417). VLDB Endowment. Keogh, E. (2002). Exact indexing of dynamic time warping. In VLDB (pp. 406–417). VLDB Endowment.
go back to reference Lee, J.G., Han, J., Li, X. (2008a). Trajectory outlier detection: A partition-and-detect framework. In ICDE (pp. 140–149). Lee, J.G., Han, J., Li, X. (2008a). Trajectory outlier detection: A partition-and-detect framework. In ICDE (pp. 140–149).
go back to reference Lee, J.G., Han, J., Li, X., Gonzalez, H. (2008b). TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering. PVLDB, 1(1), 1081–1094. Lee, J.G., Han, J., Li, X., Gonzalez, H. (2008b). TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering. PVLDB, 1(1), 1081–1094.
go back to reference Lee, J.G., Han, J., Whang, K.Y. (2007). Trajectory clustering: A partition-and-group framework. In SIGMOD. Lee, J.G., Han, J., Whang, K.Y. (2007). Trajectory clustering: A partition-and-group framework. In SIGMOD.
go back to reference Li, Y., Han, J., Yang, J. (2004). Clustering moving objects. In KDD (pp. 617–622). Li, Y., Han, J., Yang, J. (2004). Clustering moving objects. In KDD (pp. 617–622).
go back to reference Li, Z., Lee, J.G., Li, X., Han, J. (2010). Incremental clustering for trajectories. In DASFAA (2) (pp. 32–46). Li, Z., Lee, J.G., Li, X., Han, J. (2010). Incremental clustering for trajectories. In DASFAA (2) (pp. 32–46).
go back to reference Liu, Y., Chen, L., Pei, J., Chen, Q., Zhao, Y. (2007). Mining frequent trajectory patterns for activity monitoring using radio frequency tag arrays. In PerCom (pp. 37–46). Liu, Y., Chen, L., Pei, J., Chen, Q., Zhao, Y. (2007). Mining frequent trajectory patterns for activity monitoring using radio frequency tag arrays. In PerCom (pp. 37–46).
go back to reference Lloyd, S. (1982). Least squares quantization in pcm. IEEE TOIT, 28. Lloyd, S. (1982). Least squares quantization in pcm. IEEE TOIT, 28.
go back to reference Masciari, E. (2009a). A complete framework for clustering trajectories. In ICTAI (pp. 9–16). Masciari, E. (2009a). A complete framework for clustering trajectories. In ICTAI (pp. 9–16).
go back to reference Masciari, E. (2009b). Trajectory clustering via effective partitioning. In FQAS (pp. 358–370). Masciari, E. (2009b). Trajectory clustering via effective partitioning. In FQAS (pp. 358–370).
go back to reference Nehme, R.V., & Rundensteiner, E.A. (2006). Scuba: scalable cluster-based algorithm for evaluating continuous spatio-temporal queries on moving objects. In EDBT (pp. 1001–1019). Nehme, R.V., & Rundensteiner, E.A. (2006). Scuba: scalable cluster-based algorithm for evaluating continuous spatio-temporal queries on moving objects. In EDBT (pp. 1001–1019).
go back to reference Oppenheim, A.V., & Shafer, R.W. (1999). Discrete-time signal processing. Englewood Cliffs: Prentice Hall. Oppenheim, A.V., & Shafer, R.W. (1999). Discrete-time signal processing. Englewood Cliffs: Prentice Hall.
go back to reference Press, W.H., et al. (2001). Numerical recipes in C++. Cambridge: Cambridge University Press. Press, W.H., et al. (2001). Numerical recipes in C++. Cambridge: Cambridge University Press.
go back to reference Puschel, M., & Rotteler, M. (2005). Fourier transform for the directed quincunx lattice. In ICASSP. Puschel, M., & Rotteler, M. (2005). Fourier transform for the directed quincunx lattice. In ICASSP.
go back to reference Secker, A., & Taubman, D. (2003). Lifting-based invertible motion adaptive transform (limat) framework for highly scalable video compression. IEEE Transactions on Image Processing, 12(12), 1530–1542.CrossRef Secker, A., & Taubman, D. (2003). Lifting-based invertible motion adaptive transform (limat) framework for highly scalable video compression. IEEE Transactions on Image Processing, 12(12), 1530–1542.CrossRef
go back to reference Veenman, C.J., & Reinders, M.J.T. (2005). The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(9), 1417–1429.CrossRef Veenman, C.J., & Reinders, M.J.T. (2005). The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(9), 1417–1429.CrossRef
go back to reference Vlachos, M., Gunopoulos, D., Kollios, G. (2002). Discovering similar multidimensional trajectories. In ICDE (p. 673). Vlachos, M., Gunopoulos, D., Kollios, G. (2002). Discovering similar multidimensional trajectories. In ICDE (p. 673).
go back to reference Wang, W., Yang, J., Muntz, R.R. (1997). Sting: a statistical information grid approach to spatial data mining. In VLDB (pp. 186–195). Wang, W., Yang, J., Muntz, R.R. (1997). Sting: a statistical information grid approach to spatial data mining. In VLDB (pp. 186–195).
go back to reference Yang, J., & Trajpattern, M.Hu. (2006). Mining sequential patterns from imprecise trajectories of mobile objects. In EDBT (pp. 664–681). Yang, J., & Trajpattern, M.Hu. (2006). Mining sequential patterns from imprecise trajectories of mobile objects. In EDBT (pp. 664–681).
go back to reference Yi, B., Jagadish, H.V., Faloutsos, C. (1998). Efficient retrieval of similar time sequences under time warping. In ICDE (pp. 201–208). Yi, B., Jagadish, H.V., Faloutsos, C. (1998). Efficient retrieval of similar time sequences under time warping. In ICDE (pp. 201–208).
go back to reference Zhang, T., Ramakrishnan, R., Livny, M. (1996). Birch: an efficient data clustering method for very large databases. In SIGMOD (pp. 103–114). Zhang, T., Ramakrishnan, R., Livny, M. (1996). Birch: an efficient data clustering method for very large databases. In SIGMOD (pp. 103–114).
go back to reference Zhang, X., Wu, X., Wu, F. (2007). Image coding on quincunx lattice with adaptive lifting and interpolation. In Data compression conf. (pp. 193–202). Zhang, X., Wu, X., Wu, F. (2007). Image coding on quincunx lattice with adaptive lifting and interpolation. In Data compression conf. (pp. 193–202).
go back to reference Zheng, Y., Zhang, L., Xie, X., Ma, W.Y. (2009). Mining interesting locations and travel sequences from gps trajectories. In WWW (pp. 791–800). Zheng, Y., Zhang, L., Xie, X., Ma, W.Y. (2009). Mining interesting locations and travel sequences from gps trajectories. In WWW (pp. 791–800).
Metadata
Title
Dealing with trajectory streams by clustering and mathematical transforms
Authors
Gianni Costa
Giuseppe Manco
Elio Masciari
Publication date
01-02-2014
Publisher
Springer US
Published in
Journal of Intelligent Information Systems / Issue 1/2014
Print ISSN: 0925-9902
Electronic ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-013-0267-2

Other articles of this Issue 1/2014

Journal of Intelligent Information Systems 1/2014 Go to the issue

Premium Partner