Skip to main content
Erschienen in: The VLDB Journal 3/2024

25.01.2024 | Regular Paper

Sub-trajectory clustering with deep reinforcement learning

verfasst von: Anqi Liang, Bin Yao, Bo Wang, Yinpei Liu, Zhida Chen, Jiong Xie, Feifei Li

Erschienen in: The VLDB Journal | Ausgabe 3/2024

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sub-trajectory clustering is a fundamental problem in many trajectory applications. Existing approaches usually divide the clustering procedure into two phases: segmenting trajectories into sub-trajectories and then clustering these sub-trajectories. However, researchers need to develop complex human-crafted segmentation rules for specific applications, making the clustering results sensitive to the segmentation rules and lacking in generality. To solve this problem, we propose a novel algorithm using the clustering results to guide the segmentation, which is based on reinforcement learning (RL). The novelty is that the segmentation and clustering components cooperate closely and improve each other continuously to yield better clustering results. To devise our RL-based algorithm, we model the procedure of trajectory segmentation as a Markov decision process (MDP). We apply Deep-Q-Network (DQN) learning to train an RL model for the segmentation and achieve excellent clustering results. Experimental results on real datasets demonstrate the superior performance of the proposed RL-based approach over state-of-the-art methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agarwal, P.K., Fox, K., Munagala, K., Nath, A., Pan, J., Taylor, E.: Subtrajectory clustering: Models and algorithms. In: SIGMOD, pp. 75–87 (2018) Agarwal, P.K., Fox, K., Munagala, K., Nath, A., Pan, J., Taylor, E.: Subtrajectory clustering: Models and algorithms. In: SIGMOD, pp. 75–87 (2018)
2.
Zurück zum Zitat Alt, H., Godau, M.: Computing the fréchet distance between two polygonal curves. Int. J. Comput. Geom. Appl 5(01–02), 75–91 (1995)CrossRef Alt, H., Godau, M.: Computing the fréchet distance between two polygonal curves. Int. J. Comput. Geom. Appl 5(01–02), 75–91 (1995)CrossRef
3.
Zurück zum Zitat Anagnostopoulos, A., Vlachos, M., Hadjieleftheriou, M., Keogh, E., Yu, P.S.: Global distance-based segmentation of trajectories. In: SIGKDD, pp. 34–43 (2006) Anagnostopoulos, A., Vlachos, M., Hadjieleftheriou, M., Keogh, E., Yu, P.S.: Global distance-based segmentation of trajectories. In: SIGKDD, pp. 34–43 (2006)
4.
Zurück zum Zitat Ankerst, M., Breunig, M.M., Kriegel, H.P., Sander, J.: Optics: Ordering points to identify the clustering structure. ACM SIGMOD Rec. 28(2), 49–60 (1999)CrossRef Ankerst, M., Breunig, M.M., Kriegel, H.P., Sander, J.: Optics: Ordering points to identify the clustering structure. ACM SIGMOD Rec. 28(2), 49–60 (1999)CrossRef
5.
Zurück zum Zitat Buchin, K., Buchin, M., Gudmundsson, J., Löffler, M., Luo, J.: Detecting commuting patterns by clustering subtrajectories. Int. J. Comput. Geom. Appl. 21(03), 253–282 (2011)MathSciNetCrossRef Buchin, K., Buchin, M., Gudmundsson, J., Löffler, M., Luo, J.: Detecting commuting patterns by clustering subtrajectories. Int. J. Comput. Geom. Appl. 21(03), 253–282 (2011)MathSciNetCrossRef
6.
Zurück zum Zitat Buchin, M., Driemel, A., Van Kreveld, M., Sacristán, V.: Segmenting trajectories: a framework and algorithms using spatiotemporal criteria. J. Spatial Inf. Sci. 3, 33–63 (2011) Buchin, M., Driemel, A., Van Kreveld, M., Sacristán, V.: Segmenting trajectories: a framework and algorithms using spatiotemporal criteria. J. Spatial Inf. Sci. 3, 33–63 (2011)
7.
Zurück zum Zitat Chen, L., Gao, Y., Fang, Z., Miao, X., Jensen, C.S., Guo, C.: Real-time distributed co-movement pattern detection on streaming trajectories. In: Proceedings of the VLDB Endowment (2019) Chen, L., Gao, Y., Fang, Z., Miao, X., Jensen, C.S., Guo, C.: Real-time distributed co-movement pattern detection on streaming trajectories. In: Proceedings of the VLDB Endowment (2019)
8.
Zurück zum Zitat Dutta, S., Das, A., Patra, B.K.: Clustmosa: Clustering for gps trajectory data based on multi-objective simulated annealing to develop mobility application. Appl. Soft Comput. 130, 109655 (2022)CrossRef Dutta, S., Das, A., Patra, B.K.: Clustmosa: Clustering for gps trajectory data based on multi-objective simulated annealing to develop mobility application. Appl. Soft Comput. 130, 109655 (2022)CrossRef
9.
Zurück zum Zitat Ester, M., Kriegel, H.P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: SIGKDD, pp. 226–231 (1996) Ester, M., Kriegel, H.P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: SIGKDD, pp. 226–231 (1996)
10.
Zurück zum Zitat Etemad, M., Júnior, A.S., Hoseyni, A., Rose, J., Matwin, S.: A trajectory segmentation algorithm based on interpolation-based change detection strategies. In: EDBT/ICDT Workshops (2019) Etemad, M., Júnior, A.S., Hoseyni, A., Rose, J., Matwin, S.: A trajectory segmentation algorithm based on interpolation-based change detection strategies. In: EDBT/ICDT Workshops (2019)
11.
Zurück zum Zitat Fang, Z., Du, Y., Chen, L., Hu, Y., Gao, Y., Chen, G.: E 2 dtc: An end to end deep trajectory clustering framework via self-training. In: ICDE, pp. 696–707 (2021) Fang, Z., Du, Y., Chen, L., Hu, Y., Gao, Y., Chen, G.: E 2 dtc: An end to end deep trajectory clustering framework via self-training. In: ICDE, pp. 696–707 (2021)
12.
Zurück zum Zitat Ferreira, N., Klosowski, J.T., Scheidegger, C.E., Silva, C.T.: Vector field k-means: Clustering trajectories by fitting multiple vector fields. In: Computer Graphics Forum, vol. 32, pp. 201–210. Wiley Online Library (2013) Ferreira, N., Klosowski, J.T., Scheidegger, C.E., Silva, C.T.: Vector field k-means: Clustering trajectories by fitting multiple vector fields. In: Computer Graphics Forum, vol. 32, pp. 201–210. Wiley Online Library (2013)
13.
Zurück zum Zitat Frentzos, E., Gratsias, K., Theodoridis, Y.: Index-based most similar trajectory search. In: ICDE (2007) Frentzos, E., Gratsias, K., Theodoridis, Y.: Index-based most similar trajectory search. In: ICDE (2007)
14.
Zurück zum Zitat Gaffney, S., Smyth, P.: Trajectory clustering with mixtures of regression models. In: SIGKDD, pp. 63–72 (1999) Gaffney, S., Smyth, P.: Trajectory clustering with mixtures of regression models. In: SIGKDD, pp. 63–72 (1999)
15.
Zurück zum Zitat Grünwald, P.D., Myung, I.J., Pitt, M.A.: Advances in minimum description length: theory and applications (2005) Grünwald, P.D., Myung, I.J., Pitt, M.A.: Advances in minimum description length: theory and applications (2005)
16.
Zurück zum Zitat Gu, T., Feng, K., Cong, G., Long, C., Wang, Z., Wang, S.: A reinforcement learning based r-tree for spatial data indexing in dynamic environments. arXiv preprint arXiv:2103.04541 (2021) Gu, T., Feng, K., Cong, G., Long, C., Wang, Z., Wang, S.: A reinforcement learning based r-tree for spatial data indexing in dynamic environments. arXiv preprint arXiv:​2103.​04541 (2021)
17.
Zurück zum Zitat Lee, J.G., Han, J., Li, X., Gonzalez, H.: Traclass: trajectory classification using hierarchical region-based and trajectory-based clustering. VLDB 1(1), 1081–1094 (2008) Lee, J.G., Han, J., Li, X., Gonzalez, H.: Traclass: trajectory classification using hierarchical region-based and trajectory-based clustering. VLDB 1(1), 1081–1094 (2008)
18.
Zurück zum Zitat Lee, J.G., Han, J., Whang, K.Y.: Trajectory clustering: a partition-and-group framework. In: SIGMOD, pp. 593–604 (2007) Lee, J.G., Han, J., Whang, K.Y.: Trajectory clustering: a partition-and-group framework. In: SIGMOD, pp. 593–604 (2007)
19.
Zurück zum Zitat Li, Y., Luo, J., Chow, C.Y., Chan, K.L., Ding, Y., Zhang, F.: Growing the charging station network for electric vehicles with trajectory data analytics. In: ICDE, pp. 1376–1387 (2015) Li, Y., Luo, J., Chow, C.Y., Chan, K.L., Ding, Y., Zhang, F.: Growing the charging station network for electric vehicles with trajectory data analytics. In: ICDE, pp. 1376–1387 (2015)
20.
Zurück zum Zitat Li, Z., Lee, J.G., Li, X., Han, J.: Incremental clustering for trajectories. In: International conference on database systems for advanced applications, pp. 32–46 (2010) Li, Z., Lee, J.G., Li, X., Han, J.: Incremental clustering for trajectories. In: International conference on database systems for advanced applications, pp. 32–46 (2010)
22.
Zurück zum Zitat Marcus, R., Negi, P., Mao, H., Tatbul, N., Alizadeh, M., Kraska, T.: Bao: Making learned query optimization practical. In: SIGMOD, pp. 1275–1288 (2021) Marcus, R., Negi, P., Mao, H., Tatbul, N., Alizadeh, M., Kraska, T.: Bao: Making learned query optimization practical. In: SIGMOD, pp. 1275–1288 (2021)
23.
Zurück zum Zitat Meratnia, N., et al.: Spatiotemporal compression techniques for moving point objects. In: EDBT (2004) Meratnia, N., et al.: Spatiotemporal compression techniques for moving point objects. In: EDBT (2004)
24.
Zurück zum Zitat Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., Riedmiller, M.: Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602 (2013) Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., Riedmiller, M.: Playing atari with deep reinforcement learning. arXiv preprint arXiv:​1312.​5602 (2013)
25.
Zurück zum Zitat Pelekis, N., Tampakis, P., Vodas, M., Panagiotakis, C., Theodoridis, Y.: In-dbms sampling-based sub-trajectory clustering. In: EDBT, pp. 632–643 (2017) Pelekis, N., Tampakis, P., Vodas, M., Panagiotakis, C., Theodoridis, Y.: In-dbms sampling-based sub-trajectory clustering. In: EDBT, pp. 632–643 (2017)
26.
Zurück zum Zitat Puterman, M.L.: Markov decision processes: discrete stochastic dynamic programming. Wiley, New York (2014) Puterman, M.L.: Markov decision processes: discrete stochastic dynamic programming. Wiley, New York (2014)
27.
Zurück zum Zitat Qian, W.N., Zhou, A.Y.: Analyzing popular clustering algorithms from different viewpoints. J. Softw. 13(8), 1382–1394 (2002) Qian, W.N., Zhou, A.Y.: Analyzing popular clustering algorithms from different viewpoints. J. Softw. 13(8), 1382–1394 (2002)
28.
Zurück zum Zitat Qiao, D., Yang, X., Liang, Y., Hao, X.: Rapid trajectory clustering based on neighbor spatial analysis. Pattern Recogn. Lett. 156, 167–173 (2022)CrossRef Qiao, D., Yang, X., Liang, Y., Hao, X.: Rapid trajectory clustering based on neighbor spatial analysis. Pattern Recogn. Lett. 156, 167–173 (2022)CrossRef
29.
Zurück zum Zitat Schiller, P.L., Kenworthy, J.R.: An introduction to sustainable transportation: Policy, planning and implementation. Routledge (2017) Schiller, P.L., Kenworthy, J.R.: An introduction to sustainable transportation: Policy, planning and implementation. Routledge (2017)
30.
Zurück zum Zitat Soares Júnior, A., Moreno, B.N., Times, V.C., Matwin, S., Cabral, L.D.A.F.: Grasp-uts: an algorithm for unsupervised trajectory segmentation. Int. J. Geogr. Inf. Sci. 29(1), 46–68 (2015) Soares Júnior, A., Moreno, B.N., Times, V.C., Matwin, S., Cabral, L.D.A.F.: Grasp-uts: an algorithm for unsupervised trajectory segmentation. Int. J. Geogr. Inf. Sci. 29(1), 46–68 (2015)
31.
Zurück zum Zitat Sutton, R.S., Barto, A.G.: Reinforcement learning: An introduction. MIT Press, Cambridge (2018) Sutton, R.S., Barto, A.G.: Reinforcement learning: An introduction. MIT Press, Cambridge (2018)
32.
Zurück zum Zitat Tampakis, P., Doulkeridis, C., Pelekis, N., Theodoridis, Y.: Distributed subtrajectory join on massive datasets. ACM Trans. Spatial Algorith. Syst. (TSAS) 6(2), 1–29 (2020)CrossRef Tampakis, P., Doulkeridis, C., Pelekis, N., Theodoridis, Y.: Distributed subtrajectory join on massive datasets. ACM Trans. Spatial Algorith. Syst. (TSAS) 6(2), 1–29 (2020)CrossRef
33.
Zurück zum Zitat Tampakis, P., Pelekis, N., Doulkeridis, C., Theodoridis, Y.: Scalable distributed subtrajectory clustering. In: 2019 IEEE international conference on big data (Big Data), pp. 950–959. IEEE (2019) Tampakis, P., Pelekis, N., Doulkeridis, C., Theodoridis, Y.: Scalable distributed subtrajectory clustering. In: 2019 IEEE international conference on big data (Big Data), pp. 950–959. IEEE (2019)
34.
Zurück zum Zitat Wang, S., Bao, Z., Culpepper, J.S., Sellis, T., Qin, X.: Fast large-scale trajectory clustering. VLDB 13(1), 29–42 (2019) Wang, S., Bao, Z., Culpepper, J.S., Sellis, T., Qin, X.: Fast large-scale trajectory clustering. VLDB 13(1), 29–42 (2019)
35.
Zurück zum Zitat Wang, S., Bao, Z., Culpepper, J.S., Sellis, T., Sanderson, M., Qin, X.: Answering top-k exemplar trajectory queries. In: ICDE, pp. 597–608 (2017) Wang, S., Bao, Z., Culpepper, J.S., Sellis, T., Sanderson, M., Qin, X.: Answering top-k exemplar trajectory queries. In: ICDE, pp. 597–608 (2017)
36.
Zurück zum Zitat Wang, S., Shen, Y., Bao, Z., Qin, X.: Intelligent traffic analytics: from monitoring to controlling. In: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, pp. 778–781 (2019) Wang, S., Shen, Y., Bao, Z., Qin, X.: Intelligent traffic analytics: from monitoring to controlling. In: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, pp. 778–781 (2019)
37.
Zurück zum Zitat Wang, W., Xia, F., Nie, H., Chen, Z., Gong, Z., Kong, X., Wei, W.: Vehicle trajectory clustering based on dynamic representation learning of internet of vehicles. IEEE Trans. Intell. Transp. Syst. 22(6), 3567–3576 (2020)CrossRef Wang, W., Xia, F., Nie, H., Chen, Z., Gong, Z., Kong, X., Wei, W.: Vehicle trajectory clustering based on dynamic representation learning of internet of vehicles. IEEE Trans. Intell. Transp. Syst. 22(6), 3567–3576 (2020)CrossRef
38.
Zurück zum Zitat Wang, Z., Long, C., Cong, G.: Trajectory simplification with reinforcement learning. In: ICDE, pp. 684–695 (2021) Wang, Z., Long, C., Cong, G.: Trajectory simplification with reinforcement learning. In: ICDE, pp. 684–695 (2021)
39.
Zurück zum Zitat Wang, Z., Long, C., Cong, G., Liu, Y.: Efficient and effective similar subtrajectory search with deep reinforcement learning. VLDB 13(12), 2312–2325 (2020) Wang, Z., Long, C., Cong, G., Liu, Y.: Efficient and effective similar subtrajectory search with deep reinforcement learning. VLDB 13(12), 2312–2325 (2020)
40.
Zurück zum Zitat Watkins, C.J., Dayan, P.: Q-learning. Mach. Learn. 8(3), 279–292 (1992)CrossRef Watkins, C.J., Dayan, P.: Q-learning. Mach. Learn. 8(3), 279–292 (1992)CrossRef
41.
Zurück zum Zitat Wei, H., Zheng, G., Gayah, V., Li, Z.: Recent advances in reinforcement learning for traffic signal control: A survey of models and evaluation. ACM SIGKDD Explorat. Newsl. 22(2), 12–18 (2021)CrossRef Wei, H., Zheng, G., Gayah, V., Li, Z.: Recent advances in reinforcement learning for traffic signal control: A survey of models and evaluation. ACM SIGKDD Explorat. Newsl. 22(2), 12–18 (2021)CrossRef
42.
Zurück zum Zitat Xia, Y., Zhou, L.: Improved clustering algorithm based on hypercube. In: 2022 International Conference on Machine Learning, Control, and Robotics (MLCR), pp. 32–37 (2022) Xia, Y., Zhou, L.: Improved clustering algorithm based on hypercube. In: 2022 International Conference on Machine Learning, Control, and Robotics (MLCR), pp. 32–37 (2022)
43.
Zurück zum Zitat Yang, Z., Chandramouli, B., Wang, C., Gehrke, J., Li, Y., Minhas, U.F., Larson, P.Å., Kossmann, D., Acharya, R.: Qd-tree: Learning data layouts for big data analytics. In: SIGMOD, pp. 193–208 (2020) Yang, Z., Chandramouli, B., Wang, C., Gehrke, J., Li, Y., Minhas, U.F., Larson, P.Å., Kossmann, D., Acharya, R.: Qd-tree: Learning data layouts for big data analytics. In: SIGMOD, pp. 193–208 (2020)
44.
Zurück zum Zitat Yao, D., Zhang, C., Zhu, Z., Huang, J., Bi, J.: Trajectory clustering via deep representation learning. In: IJCNN, pp. 3880–3887 (2017) Yao, D., Zhang, C., Zhu, Z., Huang, J., Bi, J.: Trajectory clustering via deep representation learning. In: IJCNN, pp. 3880–3887 (2017)
45.
Zurück zum Zitat Yi, B.K., Jagadish, H.V., Faloutsos, C.: Efficient retrieval of similar time sequences under time warping. In: ICDE, pp. 201–208 (1998) Yi, B.K., Jagadish, H.V., Faloutsos, C.: Efficient retrieval of similar time sequences under time warping. In: ICDE, pp. 201–208 (1998)
46.
Zurück zum Zitat Yu, X., Li, G., Chai, C., Tang, N.: Reinforcement learning with tree-lstm for join order selection. In: ICDE, pp. 1297–1308 (2020) Yu, X., Li, G., Chai, C., Tang, N.: Reinforcement learning with tree-lstm for join order selection. In: ICDE, pp. 1297–1308 (2020)
47.
Zurück zum Zitat Yuan, J., Zheng, Y., Zhang, C., Xie, W., Xie, X., Sun, G., Huang, Y.: T-drive: driving directions based on taxi trajectories. In: SIGSPATIAL, pp. 99–108 (2010) Yuan, J., Zheng, Y., Zhang, C., Xie, W., Xie, X., Sun, G., Huang, Y.: T-drive: driving directions based on taxi trajectories. In: SIGSPATIAL, pp. 99–108 (2010)
48.
Zurück zum Zitat Zhang, D., Chang, Z., Yang, D., Li, D., Tan, K.L., Chen, K., Chen, G.: Squid: subtrajectory query in trillion-scale gps database. VLDB J. pp. 1–18 (2023) Zhang, D., Chang, Z., Yang, D., Li, D., Tan, K.L., Chen, K., Chen, G.: Squid: subtrajectory query in trillion-scale gps database. VLDB J. pp. 1–18 (2023)
49.
Zurück zum Zitat Zhang, T., Ramakrishnan, R., Livny, M.: Birch: an efficient data clustering method for very large databases. ACM SIGMOD Rec. 25(2), 103–114 (1996)CrossRef Zhang, T., Ramakrishnan, R., Livny, M.: Birch: an efficient data clustering method for very large databases. ACM SIGMOD Rec. 25(2), 103–114 (1996)CrossRef
50.
Zurück zum Zitat Zhang, X., Meng, F., Xu, J.: Perfinsight: A robust clustering-based abnormal behavior detection system for large-scale cloud. In: IEEE CLOUD, pp. 896–899 (2018) Zhang, X., Meng, F., Xu, J.: Perfinsight: A robust clustering-based abnormal behavior detection system for large-scale cloud. In: IEEE CLOUD, pp. 896–899 (2018)
51.
Zurück zum Zitat Zheng, Y., Xie, X., Ma, W.Y., et al.: Geolife: A collaborative social networking service among user, location and trajectory. IEEE Data Eng. Bull. 33(2), 32–39 (2010) Zheng, Y., Xie, X., Ma, W.Y., et al.: Geolife: A collaborative social networking service among user, location and trajectory. IEEE Data Eng. Bull. 33(2), 32–39 (2010)
52.
Zurück zum Zitat Zygouras, N., Gunopulos, D.: Corridor learning using individual trajectories. In: MDM, pp. 155–160 (2018) Zygouras, N., Gunopulos, D.: Corridor learning using individual trajectories. In: MDM, pp. 155–160 (2018)
Metadaten
Titel
Sub-trajectory clustering with deep reinforcement learning
verfasst von
Anqi Liang
Bin Yao
Bo Wang
Yinpei Liu
Zhida Chen
Jiong Xie
Feifei Li
Publikationsdatum
25.01.2024
Verlag
Springer Berlin Heidelberg
Erschienen in
The VLDB Journal / Ausgabe 3/2024
Print ISSN: 1066-8888
Elektronische ISSN: 0949-877X
DOI
https://doi.org/10.1007/s00778-023-00833-w

Weitere Artikel der Ausgabe 3/2024

The VLDB Journal 3/2024 Zur Ausgabe

Regular Paper

MM-DIRECT