Skip to main content
Top
Published in: International Journal of Machine Learning and Cybernetics 9/2022

04-04-2022 | Original Article

CIRAN: extracting crowd interaction with residual attention network for pedestrian trajectory prediction

Authors: Shang Liu, Xiaoyu Chen, Hao Chen

Published in: International Journal of Machine Learning and Cybernetics | Issue 9/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper proposes a new deep learning network based on the spatial attention mechanism—crowd interaction with residual attention network (CIRAN), which combines the position and velocity information of neighbor pedestrians for trajectory prediction. It adaptively selects the most effective areas of the scene by using the residual attention module to obtain more accurate and reasonable pedestrian trajectories. Therefore, the accuracy of prediction can be improved. In addition, the velocity encoding module is introduced to transform the coordinate based pedestrian social interaction process into the spatial grid based pedestrian social interaction process. Based on two public data, ETH and UCY, this paper obtains the most advanced experimental results up to now, and these results show the validity of the proposed CIRAN.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Show more products
Literature
1.
go back to reference Song Z, Wu K, Shao J (2020) Destination prediction using deep echo state network. Neurocomputing 406:343–353CrossRef Song Z, Wu K, Shao J (2020) Destination prediction using deep echo state network. Neurocomputing 406:343–353CrossRef
2.
go back to reference Cheng Y, Sun L, Liu C et al (2020) Towards efficient human robot collaboration with robust plan recognition and trajectory prediction. IEEE Robot Autom Lett 5(2):2602–2609CrossRef Cheng Y, Sun L, Liu C et al (2020) Towards efficient human robot collaboration with robust plan recognition and trajectory prediction. IEEE Robot Autom Lett 5(2):2602–2609CrossRef
3.
go back to reference Eiffert S, Li K, Shan M et al (2020) Probabilistic crowd GAN: multimodal pedestrian trajectory prediction using a graph vehicle-pedestrian attention network. IEEE Robot Autom Lett 5(4):5026–5033CrossRef Eiffert S, Li K, Shan M et al (2020) Probabilistic crowd GAN: multimodal pedestrian trajectory prediction using a graph vehicle-pedestrian attention network. IEEE Robot Autom Lett 5(4):5026–5033CrossRef
4.
go back to reference Li X, Lu L, Yu J et al (2013) A method of abnormal pedestrian behavior detection based on the trajectory model. In: International conference on transportation engineering Li X, Lu L, Yu J et al (2013) A method of abnormal pedestrian behavior detection based on the trajectory model. In: International conference on transportation engineering
5.
go back to reference Alahi A, Goel K, Ramanathan V et al (2016) Social LSTM: human trajectory prediction in crowded spaces. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR). IEEE Alahi A, Goel K, Ramanathan V et al (2016) Social LSTM: human trajectory prediction in crowded spaces. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR). IEEE
7.
go back to reference Li Y (2019) Which way are you going? imitative decision learning for path forecasting in dynamic scenes. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE Li Y (2019) Which way are you going? imitative decision learning for path forecasting in dynamic scenes. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE
8.
go back to reference Pellegrini S, Ess A, Schindler K et al (2009) You'll never walk alone: modeling social behavior for multi-target tracking. In: IEEE international conference on computer vision. IEEE Pellegrini S, Ess A, Schindler K et al (2009) You'll never walk alone: modeling social behavior for multi-target tracking. In: IEEE international conference on computer vision. IEEE
9.
go back to reference Gupta A, Johnson J, Li FF et al (2018) Social GAN: socially acceptable trajectories with generative adversarial networks. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE Gupta A, Johnson J, Li FF et al (2018) Social GAN: socially acceptable trajectories with generative adversarial networks. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE
10.
go back to reference Pellegrini S, Ess A, Gool LV (2010) Improving data association by joint modeling of pedestrian trajectories and groupings. In: Proceedings of the 11th European conference on Computer vision: part I. Springer, Berlin Pellegrini S, Ess A, Gool LV (2010) Improving data association by joint modeling of pedestrian trajectories and groupings. In: Proceedings of the 11th European conference on Computer vision: part I. Springer, Berlin
11.
go back to reference Lealtaixe L, Fenzi M, Kuznetsova A et al (2014) Learning an image-based motion context for multiple people tracking. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE Lealtaixe L, Fenzi M, Kuznetsova A et al (2014) Learning an image-based motion context for multiple people tracking. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE
13.
go back to reference J Dong, Zhu P, Ferrari S (2020) Oriented pedestrian social interaction modeling and inference. In: 2020 American control conference (ACC) J Dong, Zhu P, Ferrari S (2020) Oriented pedestrian social interaction modeling and inference. In: 2020 American control conference (ACC)
14.
go back to reference Liang J, Lu J, Niebles J C et al. Peeking into the future: predicting future person activities and locations in videos. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE Liang J, Lu J, Niebles J C et al. Peeking into the future: predicting future person activities and locations in videos. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE
15.
go back to reference Zhang P, Ouyang W, Zhang P et al (2020) SR-LSTM: state refinement for LSTM towards pedestrian trajectory prediction. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE Zhang P, Ouyang W, Zhang P et al (2020) SR-LSTM: state refinement for LSTM towards pedestrian trajectory prediction. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE
16.
go back to reference Xu Y, Piao Z, Gao S (2018) Encoding crowd interaction with deep neural network for pedestrian trajectory prediction. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE Xu Y, Piao Z, Gao S (2018) Encoding crowd interaction with deep neural network for pedestrian trajectory prediction. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE
17.
go back to reference Li F, Gui Z, Zhang Z et al (2020) A hierarchical temporal attention-based LSTM encoder-decoder model for individual mobility prediction. Neurocomputing 403(25):153–166CrossRef Li F, Gui Z, Zhang Z et al (2020) A hierarchical temporal attention-based LSTM encoder-decoder model for individual mobility prediction. Neurocomputing 403(25):153–166CrossRef
20.
go back to reference X Chen, Liu S, Xu Z et al (2021) SCSG attention: a self-centered star graph with attention for pedestrian trajectory prediction X Chen, Liu S, Xu Z et al (2021) SCSG attention: a self-centered star graph with attention for pedestrian trajectory prediction
21.
go back to reference Zhou Y, Wu H, Cheng H et al (2021) Social graph convolutional LSTM for pedestrian trajectory prediction. IET Intell Transport Syst 15(3):396–405CrossRef Zhou Y, Wu H, Cheng H et al (2021) Social graph convolutional LSTM for pedestrian trajectory prediction. IET Intell Transport Syst 15(3):396–405CrossRef
22.
go back to reference Chen K, Song X, Ren X (2021) Modeling social interaction and intention for pedestrian trajectory prediction. Phys A Stat Mech Appl 570(9):125790CrossRef Chen K, Song X, Ren X (2021) Modeling social interaction and intention for pedestrian trajectory prediction. Phys A Stat Mech Appl 570(9):125790CrossRef
23.
go back to reference Li J, Wei Y, Liang X et al (2016) Attentive contexts for object detection. IEEE Trans Multimed 19(5):944–954CrossRef Li J, Wei Y, Liang X et al (2016) Attentive contexts for object detection. IEEE Trans Multimed 19(5):944–954CrossRef
26.
go back to reference Xu K, Ba J, Kiros R et al (2015) Show, attend and tell: neural image caption generation with visual attention. Comput Sci 2048–2057 Xu K, Ba J, Kiros R et al (2015) Show, attend and tell: neural image caption generation with visual attention. Comput Sci 2048–2057
27.
go back to reference Fernando T, Denman S, Sridharan S et al (2017) Soft + hardwired attention: an LSTM framework for human trajectory prediction and abnormal event detection. Neural Netw 108:466–478CrossRef Fernando T, Denman S, Sridharan S et al (2017) Soft + hardwired attention: an LSTM framework for human trajectory prediction and abnormal event detection. Neural Netw 108:466–478CrossRef
28.
go back to reference Hang S, Dong Y, Zhu J et al (2016) Crowd scene understanding with coherent recurrent neural networks. In: IJCAI'16: proceedings of the twenty-fifth international joint conference on artificial intelligence Hang S, Dong Y, Zhu J et al (2016) Crowd scene understanding with coherent recurrent neural networks. In: IJCAI'16: proceedings of the twenty-fifth international joint conference on artificial intelligence
29.
go back to reference Sadeghian A, Kosaraju V, Sadeghian A et al (2020) SoPhie: an attentive GAN for predicting paths compliant to social and physical constraints. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE Sadeghian A, Kosaraju V, Sadeghian A et al (2020) SoPhie: an attentive GAN for predicting paths compliant to social and physical constraints. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE
30.
go back to reference Su H, Zhu J, Dong Y et al (2017) Forecast the plausible paths in crowd scenes. In: Twenty-sixth international joint conference on artificial intelligence Su H, Zhu J, Dong Y et al (2017) Forecast the plausible paths in crowd scenes. In: Twenty-sixth international joint conference on artificial intelligence
31.
go back to reference Vemula A, Muelling K, Oh J (2018) Social attention: modeling attention in human crowds. In: 2018 IEEE international conference on robotics and automation (ICRA) Vemula A, Muelling K, Oh J (2018) Social attention: modeling attention in human crowds. In: 2018 IEEE international conference on robotics and automation (ICRA)
32.
go back to reference Hao X, Du QH, Reynolds M (2020) A location-velocity-temporal attention LSTM model for pedestrian trajectory prediction. IEEE Access 8:44576–44589CrossRef Hao X, Du QH, Reynolds M (2020) A location-velocity-temporal attention LSTM model for pedestrian trajectory prediction. IEEE Access 8:44576–44589CrossRef
33.
go back to reference Li X, Liu Y, Wang K et al (2020) A recurrent attention and interaction model for pedestrian trajectory prediction. IEEE/CAA J Autom Sin 7(5):1361–1370 Li X, Liu Y, Wang K et al (2020) A recurrent attention and interaction model for pedestrian trajectory prediction. IEEE/CAA J Autom Sin 7(5):1361–1370
34.
go back to reference He K, Zhang X, Ren S et al (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell 37(9):1904–1916CrossRef He K, Zhang X, Ren S et al (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell 37(9):1904–1916CrossRef
35.
go back to reference Wang F, Jiang M, Qian C et al (2017) Residual attention network for image classification. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR). IEEE Wang F, Jiang M, Qian C et al (2017) Residual attention network for image classification. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR). IEEE
36.
go back to reference Mohamed A, Qian K, Elhoseiny M et al (2020) Social-STGCNN: a social spatio-temporal graph convolutional neural network for human trajectory prediction. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE Mohamed A, Qian K, Elhoseiny M et al (2020) Social-STGCNN: a social spatio-temporal graph convolutional neural network for human trajectory prediction. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE
39.
go back to reference Amirian J, Hayet JB, Pettre J (2019) Social ways: learning multi-modal distributions of pedestrian trajectories with GANs. 1 Amirian J, Hayet JB, Pettre J (2019) Social ways: learning multi-modal distributions of pedestrian trajectories with GANs. 1
40.
go back to reference Huang Y, Bi H, Li Z et al (2019) STGAT: modeling spatial-temporal interactions for human trajectory prediction. In: 2019 international conference in computer vision Huang Y, Bi H, Li Z et al (2019) STGAT: modeling spatial-temporal interactions for human trajectory prediction. In: 2019 international conference in computer vision
41.
go back to reference Fang L, Jiang Q, Shi J et al (2020) TPNet: trajectory proposal network for motion prediction. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE Fang L, Jiang Q, Shi J et al (2020) TPNet: trajectory proposal network for motion prediction. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR). IEEE
42.
go back to reference Shuai Y, Li H, Wang X (2015) Understanding pedestrian behaviors from stationary crowd groups. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR). IEEE Shuai Y, Li H, Wang X (2015) Understanding pedestrian behaviors from stationary crowd groups. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR). IEEE
Metadata
Title
CIRAN: extracting crowd interaction with residual attention network for pedestrian trajectory prediction
Authors
Shang Liu
Xiaoyu Chen
Hao Chen
Publication date
04-04-2022
Publisher
Springer Berlin Heidelberg
Published in
International Journal of Machine Learning and Cybernetics / Issue 9/2022
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-022-01548-0

Other articles of this Issue 9/2022

International Journal of Machine Learning and Cybernetics 9/2022 Go to the issue