Skip to main content
Erschienen in: International Journal of Multimedia Information Retrieval 1/2023

01.06.2023 | Regular Paper

Video anomaly detection with memory-guided multilevel embedding

verfasst von: Liuping Zhou, Jing Yang

Erschienen in: International Journal of Multimedia Information Retrieval | Ausgabe 1/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Playing a vitally important role in the operation of intelligent video surveillance system and smart city, video anomaly detection (VAD) has been widely practiced and studied in both industrial circles and academia. In the present study, a new anomaly detection method is proposed for multi-level memory embedding. According to the novel method, the feature prototype of the sample is stored in the memory pool, which enhances the diversity of the sample feature prototype paradigm. Besides, the memory is embedded in the decoder in a hierarchical integrating manner, which makes the feature information of the object more complete and improves the quality of features. At the end of the model, modeling is performed for the channel relationship between the features of the object in the channel dimension, thus making the model capable of more efficient anomaly detection. This method is verified by conducting evaluation on three publicly available datasets: UCSD Ped2, CUHK Avenue, ShanghaiTech.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Tao X, Gong X, Zhang X et al (2022) Deep learning for unsupervised anomaly localization in industrial images: a survey. IEEE Trans Instrum Meas 71:1–21 Tao X, Gong X, Zhang X et al (2022) Deep learning for unsupervised anomaly localization in industrial images: a survey. IEEE Trans Instrum Meas 71:1–21
2.
Zurück zum Zitat Suarez JJP, Naval Jr P C (2020) A survey on deep learning techniques for video anomaly detection. arXiv preprint arXiv:2009.14146 Suarez JJP, Naval Jr P C (2020) A survey on deep learning techniques for video anomaly detection. arXiv preprint arXiv:​2009.​14146
3.
Zurück zum Zitat Saligrama V, Konrad J, Jodoin PM (2010) Video anomaly identification. IEEE Signal Process Mag 27(5):18–33CrossRef Saligrama V, Konrad J, Jodoin PM (2010) Video anomaly identification. IEEE Signal Process Mag 27(5):18–33CrossRef
4.
Zurück zum Zitat Gong MG, Zeng HM, Xie Y et al (2020) Local distinguishability aggrandizing network for human anomaly detection. Neural Netw 122:364–373CrossRef Gong MG, Zeng HM, Xie Y et al (2020) Local distinguishability aggrandizing network for human anomaly detection. Neural Netw 122:364–373CrossRef
5.
Zurück zum Zitat Redmon J, Divvala S, Girshick R, et al (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 779–788 Redmon J, Divvala S, Girshick R, et al (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 779–788
6.
Zurück zum Zitat Srivastava N, Mansimov E, Salakhudinov R (2015) Unsupervised learning of video representations using lstms, In: International conference on machine learning, pp 843–852 Srivastava N, Mansimov E, Salakhudinov R (2015) Unsupervised learning of video representations using lstms, In: International conference on machine learning, pp 843–852
7.
Zurück zum Zitat Zou Y F (2109) Recognition and research about abnormal behavior of human based on video, Yunnan University, Kunming Zou Y F (2109) Recognition and research about abnormal behavior of human based on video, Yunnan University, Kunming
8.
Zurück zum Zitat Ristea NC, Madan N, Ionescu RT, et al (2022) Self-supervised predictive convolutional attentive block for anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 13576–13586 Ristea NC, Madan N, Ionescu RT, et al (2022) Self-supervised predictive convolutional attentive block for anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 13576–13586
9.
Zurück zum Zitat Peng J, Zhao Y, Wang L (2021) Research on video abnormal behavior detection based on deep learning. Prog Laser Optoelectron 58(6):0600004CrossRef Peng J, Zhao Y, Wang L (2021) Research on video abnormal behavior detection based on deep learning. Prog Laser Optoelectron 58(6):0600004CrossRef
10.
Zurück zum Zitat Liu W, Luo W, Lian D, et al (2018) Future frame prediction for anomaly detection–a new baseline. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6536–6545 Liu W, Luo W, Lian D, et al (2018) Future frame prediction for anomaly detection–a new baseline. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6536–6545
11.
Zurück zum Zitat Wang G, Wang Y, Qin J, et al (2022) Video anomaly detection by solving decoupled spatio-temporal jigsaw puzzles. In: Proceedings of the ieee conference on european conference on computer vision, pp 494–511 Wang G, Wang Y, Qin J, et al (2022) Video anomaly detection by solving decoupled spatio-temporal jigsaw puzzles. In: Proceedings of the ieee conference on european conference on computer vision, pp 494–511
12.
Zurück zum Zitat Zaheer MZ, Mahmood A, Khan MH, et al (2022) Generative cooperative learning for unsupervised video anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 14744–14754 Zaheer MZ, Mahmood A, Khan MH, et al (2022) Generative cooperative learning for unsupervised video anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 14744–14754
13.
Zurück zum Zitat Yang J, Shi Y, Qi Z (2022) Learning deep feature correspondence for unsupervised anomaly detection and segmentation. Patt Recogn 132:108874CrossRef Yang J, Shi Y, Qi Z (2022) Learning deep feature correspondence for unsupervised anomaly detection and segmentation. Patt Recogn 132:108874CrossRef
14.
Zurück zum Zitat Zong B, Song Q, Min MR, et al (2018). Deep autoencoding gaussian mixture model for unsupervised anomaly detection, In: International conference on learning representations Zong B, Song Q, Min MR, et al (2018). Deep autoencoding gaussian mixture model for unsupervised anomaly detection, In: International conference on learning representations
15.
Zurück zum Zitat Lu Y, Kumar KM, Shahabeddin Nabavi S, et al (2019). Future frame prediction using convolutional vrnn for anomaly detection. In: 2019 16th IEEE international conference on advanced video and signal based surveillance, pp 1–8 Lu Y, Kumar KM, Shahabeddin Nabavi S, et al (2019). Future frame prediction using convolutional vrnn for anomaly detection. In: 2019 16th IEEE international conference on advanced video and signal based surveillance, pp 1–8
16.
Zurück zum Zitat Yu G, Wang S, Cai Z, et al (2020) Cloze test helps: effective video anomaly detection via learning to complete video events. In: Proceedings of the 28th ACM international conference on multimedia, pp 583–591 Yu G, Wang S, Cai Z, et al (2020) Cloze test helps: effective video anomaly detection via learning to complete video events. In: Proceedings of the 28th ACM international conference on multimedia, pp 583–591
17.
Zurück zum Zitat Paffenroth R, Du Toit P, Nong R et al (2013) Space-time signal processing for distributed pattern detection in sensor networks. IEEE J Select Topics Sign Process 7(1):38–49CrossRef Paffenroth R, Du Toit P, Nong R et al (2013) Space-time signal processing for distributed pattern detection in sensor networks. IEEE J Select Topics Sign Process 7(1):38–49CrossRef
18.
19.
Zurück zum Zitat Laxhammar R, Falkman G, Sviestins E (2009) Anomaly detection in sea traffic-a comparison of the gaussian mixture model and the kernel density estimator In: 2009 12th international conference on information fusion. pp 756–763 Laxhammar R, Falkman G, Sviestins E (2009) Anomaly detection in sea traffic-a comparison of the gaussian mixture model and the kernel density estimator In: 2009 12th international conference on information fusion. pp 756–763
20.
Zurück zum Zitat Latecki LJ, Lazarevic A, Pokrajac D (2007) Outlier detection with kernel density functions. MLDM 7:61–75 Latecki LJ, Lazarevic A, Pokrajac D (2007) Outlier detection with kernel density functions. MLDM 7:61–75
21.
Zurück zum Zitat Ma J, Perkins S (2003) Time-series novelty detection using one-class support vector machines. In: Proceedings of the international joint conference on neural networks, pp 1741–1745 Ma J, Perkins S (2003) Time-series novelty detection using one-class support vector machines. In: Proceedings of the international joint conference on neural networks, pp 1741–1745
22.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90CrossRef Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90CrossRef
23.
Zurück zum Zitat Hasan M, Choi J, Neumann J, et al (2016) Learning temporal regularity in video sequences In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 733–742 Hasan M, Choi J, Neumann J, et al (2016) Learning temporal regularity in video sequences In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 733–742
24.
Zurück zum Zitat Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention. pp 234–241 Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention. pp 234–241
25.
Zurück zum Zitat Goodfellow IJ, Pouget-Abadie J, Mirza M et al (2014) Generative adversarial networks. Adv Neural Inf Process Syst 3:2672–2680 Goodfellow IJ, Pouget-Abadie J, Mirza M et al (2014) Generative adversarial networks. Adv Neural Inf Process Syst 3:2672–2680
26.
Zurück zum Zitat Hasan M, Choi J, Neumann J, et al (2016) Learning temporal regularity in video sequences. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 733–742 Hasan M, Choi J, Neumann J, et al (2016) Learning temporal regularity in video sequences. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 733–742
27.
Zurück zum Zitat Zhao Y, Deng B, Shen C, et al (2017) Spatio temporal autoencoder for video anomaly detection In: Proceedings of the 25th ACM international conference on multimedia. pp 1933–1941 Zhao Y, Deng B, Shen C, et al (2017) Spatio temporal autoencoder for video anomaly detection In: Proceedings of the 25th ACM international conference on multimedia. pp 1933–1941
28.
Zurück zum Zitat Zaheer MZ, Lee JH, Astrid M, et al (2020) Old is gold: redefining the adversarially learned one-class classifier training paradigm. In: 2020 IEEE/CVF Conference on computer vision and pattern recognition, pp 14183–14193 Zaheer MZ, Lee JH, Astrid M, et al (2020) Old is gold: redefining the adversarially learned one-class classifier training paradigm. In: 2020 IEEE/CVF Conference on computer vision and pattern recognition, pp 14183–14193
29.
Zurück zum Zitat Zhang C, Song D, Chen Y et al (2019) A deep neural network for unsupervised anomaly detection and diagnosis in multivariate time series data. Proceed AAAI Conf Artif Intell 33(1):1409–1416 Zhang C, Song D, Chen Y et al (2019) A deep neural network for unsupervised anomaly detection and diagnosis in multivariate time series data. Proceed AAAI Conf Artif Intell 33(1):1409–1416
30.
Zurück zum Zitat Park H, Noh J, Ham B (2020) Learning memory-guided normality for anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 14372–14381 Park H, Noh J, Ham B (2020) Learning memory-guided normality for anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 14372–14381
31.
Zurück zum Zitat Medel J R, Savakis A (2016) Anomaly detection in video using predictive convolutional long short-term memory networks. arXiv preprint arXiv:1612.00390 Medel J R, Savakis A (2016) Anomaly detection in video using predictive convolutional long short-term memory networks. arXiv preprint arXiv:​1612.​00390
32.
Zurück zum Zitat Zhao Y, Deng B, Shen C, et al (2017) Spatio-temporal autoencoder for video anomaly detection. In: Proceedings of the 25th ACM international conference on Multimedia. pp 1933–1941 Zhao Y, Deng B, Shen C, et al (2017) Spatio-temporal autoencoder for video anomaly detection. In: Proceedings of the 25th ACM international conference on Multimedia. pp 1933–1941
33.
Zurück zum Zitat Le VT, Kim YG (2022). Attention-based residual autoencoder for video anomaly detection. Applied Intelligence, pp 1–15 Le VT, Kim YG (2022). Attention-based residual autoencoder for video anomaly detection. Applied Intelligence, pp 1–15
34.
Zurück zum Zitat Liu Z, Nie Y, Long C, et al (2021) A hybrid video anomaly detection framework via memory-augmented flow reconstruction and flow-guided frame prediction. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 13588–13597 Liu Z, Nie Y, Long C, et al (2021) A hybrid video anomaly detection framework via memory-augmented flow reconstruction and flow-guided frame prediction. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 13588–13597
35.
Zurück zum Zitat Chen D, Wang P, Yue L et al (2020) Anomaly detection in surveillance video based on bidirectional prediction. Image Vis Comput 98:103915CrossRef Chen D, Wang P, Yue L et al (2020) Anomaly detection in surveillance video based on bidirectional prediction. Image Vis Comput 98:103915CrossRef
36.
Zurück zum Zitat Zhang XX, Zhu Z, Zhao Y, Chang DX (2017) Learning a general assignment model for video analytics. IEEE Trans Circuits Syst Video Technol 28(10):3066–3076CrossRef Zhang XX, Zhu Z, Zhao Y, Chang DX (2017) Learning a general assignment model for video analytics. IEEE Trans Circuits Syst Video Technol 28(10):3066–3076CrossRef
37.
Zurück zum Zitat Xia GY, Chen BJ, Sun HJ, Liu QS (2020) Nonconvex low-rank kernel sparse subspace learning for keyframe extraction and motion segmentation. IEEE Trans Neural Netw Learn Sys 32(4):1612–1626MathSciNetCrossRef Xia GY, Chen BJ, Sun HJ, Liu QS (2020) Nonconvex low-rank kernel sparse subspace learning for keyframe extraction and motion segmentation. IEEE Trans Neural Netw Learn Sys 32(4):1612–1626MathSciNetCrossRef
38.
Zurück zum Zitat Meng J, Wang H, Yuan J, et al (2016) From keyframes to key objects: video summarization by representative object proposal selection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1039–1048 Meng J, Wang H, Yuan J, et al (2016) From keyframes to key objects: video summarization by representative object proposal selection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1039–1048
39.
Zurück zum Zitat Wang H, Kawahara Y, Weng C et al (2017) Representative selection with structured sparsity. Pattern Recogn 63:268–278CrossRef Wang H, Kawahara Y, Weng C et al (2017) Representative selection with structured sparsity. Pattern Recogn 63:268–278CrossRef
40.
Zurück zum Zitat Gong D, Liu L, Le V, et al (2019) Memorizing normality to detect anomaly: memory- augmented deep autoencoder for unsupervised anomaly detection. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 1705–1714 Gong D, Liu L, Le V, et al (2019) Memorizing normality to detect anomaly: memory- augmented deep autoencoder for unsupervised anomaly detection. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 1705–1714
41.
Zurück zum Zitat Graves A, Wayne G, Reynolds M et al (2016) Hybrid computing using a neural network with dynamic external memory. Nature 538(7626):471–476CrossRef Graves A, Wayne G, Reynolds M et al (2016) Hybrid computing using a neural network with dynamic external memory. Nature 538(7626):471–476CrossRef
42.
Zurück zum Zitat Lv H, Chen C, Cui Z, et al (2021) Learning normal dynamics in videos with meta prototype network. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 15425–15434 Lv H, Chen C, Cui Z, et al (2021) Learning normal dynamics in videos with meta prototype network. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 15425–15434
45.
Zurück zum Zitat Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 7132–7141 Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 7132–7141
46.
Zurück zum Zitat Mahadevan V, Li W, Bhalodia V, et al (2010) Anomaly detection in crowded scenes. In: 2010 IEEE computer society conference on computer vision and pattern recognition. pp 1975–1981 Mahadevan V, Li W, Bhalodia V, et al (2010) Anomaly detection in crowded scenes. In: 2010 IEEE computer society conference on computer vision and pattern recognition. pp 1975–1981
47.
Zurück zum Zitat Lu C, Shi J, Jia J (2013) Abnormal event detection at 150 fps in matlab. In: Proceedings of the IEEE international conference on computer vision. pp 2720–2727 Lu C, Shi J, Jia J (2013) Abnormal event detection at 150 fps in matlab. In: Proceedings of the IEEE international conference on computer vision. pp 2720–2727
48.
Zurück zum Zitat Luo W, Liu W, Gao S (2017) A revisit of sparse coding based anomaly detection in stacked rnn framework. In: Proceedings of the IEEE international conference on computer vision. pp 341–349 Luo W, Liu W, Gao S (2017) A revisit of sparse coding based anomaly detection in stacked rnn framework. In: Proceedings of the IEEE international conference on computer vision. pp 341–349
49.
Zurück zum Zitat Paszke A, Gross S, Chintala S, et al (2017) Automatic differentiation in pytorch. In: Advances in neural information processing systems 30 Paszke A, Gross S, Chintala S, et al (2017) Automatic differentiation in pytorch. In: Advances in neural information processing systems 30
50.
Zurück zum Zitat Kim J, Grauman K (2009) Observe locally, infer globally: a space-time MRF for detecting abnormal activities with incremental updates. In: 2009 IEEE conference on computer vision and pattern recognition, pp 2921–2928 Kim J, Grauman K (2009) Observe locally, infer globally: a space-time MRF for detecting abnormal activities with incremental updates. In: 2009 IEEE conference on computer vision and pattern recognition, pp 2921–2928
51.
Zurück zum Zitat Xu D, Ricci E, Yan Y, et al (2015) Learning deep representations of appearance and motion for anomalous event detection. arXiv preprint arXiv:1510.01553 Xu D, Ricci E, Yan Y, et al (2015) Learning deep representations of appearance and motion for anomalous event detection. arXiv preprint arXiv:​1510.​01553
52.
Zurück zum Zitat Tudor Ionescu R, Smeureanu S, Alexe B, et al (2017) Unmasking the abnormal events in video. In: Proceedings of the IEEE international conference on computer vision. pp 2895–2903 Tudor Ionescu R, Smeureanu S, Alexe B, et al (2017) Unmasking the abnormal events in video. In: Proceedings of the IEEE international conference on computer vision. pp 2895–2903
53.
Zurück zum Zitat Hinami R, Mei T, Satoh S (2017) Joint detection and recounting of abnormal events by learning deep generic knowledge. In: Proceedings of the IEEE international conference on computer vision. pp 3619–3627 Hinami R, Mei T, Satoh S (2017) Joint detection and recounting of abnormal events by learning deep generic knowledge. In: Proceedings of the IEEE international conference on computer vision. pp 3619–3627
54.
Zurück zum Zitat Nguyen T N, Meunier J (2019) Anomaly detection in video sequence with appearance-motion correspondence. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 1273–1283. Nguyen T N, Meunier J (2019) Anomaly detection in video sequence with appearance-motion correspondence. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 1273–1283.
55.
Zurück zum Zitat Chang Y, Tu Z, Xie W, et al (2020) Clustering driven deep autoencoder for video anomaly detection. In: European conference on computer vision. pp 329–345 Chang Y, Tu Z, Xie W, et al (2020) Clustering driven deep autoencoder for video anomaly detection. In: European conference on computer vision. pp 329–345
56.
Zurück zum Zitat Tang Y, Zhao L, Zhang S et al (2020) Integrating prediction and reconstruction for anomaly detection. Pattern Recogn Lett 129:123–130CrossRef Tang Y, Zhao L, Zhang S et al (2020) Integrating prediction and reconstruction for anomaly detection. Pattern Recogn Lett 129:123–130CrossRef
57.
Zurück zum Zitat Kanu-Asiegbu A M, Vasudevan R, Du X (2021) Leveraging trajectory prediction for pedestrian video anomaly detection. In: 2021 IEEE symposium series on computational intelligence, pp 01–08 Kanu-Asiegbu A M, Vasudevan R, Du X (2021) Leveraging trajectory prediction for pedestrian video anomaly detection. In: 2021 IEEE symposium series on computational intelligence, pp 01–08
58.
Zurück zum Zitat Li B, Leroux S, Simoens P (2021) Decoupled appearance and motion learning for efficient anomaly detection in surveillance video. Comput Vis Image Underst 210:103249CrossRef Li B, Leroux S, Simoens P (2021) Decoupled appearance and motion learning for efficient anomaly detection in surveillance video. Comput Vis Image Underst 210:103249CrossRef
Metadaten
Titel
Video anomaly detection with memory-guided multilevel embedding
verfasst von
Liuping Zhou
Jing Yang
Publikationsdatum
01.06.2023
Verlag
Springer London
Erschienen in
International Journal of Multimedia Information Retrieval / Ausgabe 1/2023
Print ISSN: 2192-6611
Elektronische ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-023-00272-x

Weitere Artikel der Ausgabe 1/2023

International Journal of Multimedia Information Retrieval 1/2023 Zur Ausgabe

Premium Partner