Skip to main content
Top

2024 | OriginalPaper | Chapter

SATJiP: Spatial and Augmented Temporal Jigsaw Puzzles for Video Anomaly Detection

Authors : Liheng Shen, Tetsu Matsukawa, Einoshin Suzuki

Published in: Advances in Knowledge Discovery and Data Mining

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Video Anomaly Detection (VAD) is a significant task, which refers to taking a video clip as input and outputting class labels, e.g., normal or abnormal, at the frame level. Wang et al. proposed a method called DSTJiP, which trains the model by solving Decoupled Spatial and Temporal Jigsaw Puzzles and achieves impressive VAD performance. However, the model sometimes fails to detect abnormal human actions where abnormal motions are accompanied by normal motions. The reason is that the model learns representations of little- and non-motion parts of training examples, resulting in being insensitive to abnormal motions. To circumvent this problem, we propose to solve Spatial and Augmented Temporal Jigsaw Puzzles (SATJiP) as an extension of DSTJiP. SATJiP encourages the model to focus on motions by a novel pretext task, enabling it to detect abnormal motions accompanied by normal motions. Experiments conducted on three standard VAD benchmarks demonstrate that SATJiP outperforms the state-of-the-art methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Astrid, M., Zaheer, M.Z., Lee, J.Y., Lee, S.I.: Learning not to reconstruct anomalies. In: Proceedings of BMVC (2021) Astrid, M., Zaheer, M.Z., Lee, J.Y., Lee, S.I.: Learning not to reconstruct anomalies. In: Proceedings of BMVC (2021)
2.
go back to reference Astrid, M., Zaheer, M.Z., Lee, S.I.: Synthetic temporal anomaly guided end-to-end video anomaly detection. In: Proceedings of ICCVW (2021) Astrid, M., Zaheer, M.Z., Lee, S.I.: Synthetic temporal anomaly guided end-to-end video anomaly detection. In: Proceedings of ICCVW (2021)
3.
go back to reference Barbalau, A., et al.: SSMTL++: revisiting self-supervised multi-task learning for video anomaly detection. Comput. Vis. Image Underst. 229, 103656 (2023)CrossRef Barbalau, A., et al.: SSMTL++: revisiting self-supervised multi-task learning for video anomaly detection. Comput. Vis. Image Underst. 229, 103656 (2023)CrossRef
4.
go back to reference Cai, R., Zhang, H., Liu, W., Gao, S., Hao, Z.: Appearance-motion memory consistency network for video anomaly detection. In: Proceedings of AAAI (2021) Cai, R., Zhang, H., Liu, W., Gao, S., Hao, Z.: Appearance-motion memory consistency network for video anomaly detection. In: Proceedings of AAAI (2021)
5.
go back to reference Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Computi. Surv. (CSUR) 41(3), 1–58 (2009)CrossRef Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Computi. Surv. (CSUR) 41(3), 1–58 (2009)CrossRef
7.
go back to reference Chen, C., et al.: Comprehensive regularization in a bi-directional predictive network for video anomaly detection. In: Proceedings of AAAI, vol. 36 (2022) Chen, C., et al.: Comprehensive regularization in a bi-directional predictive network for video anomaly detection. In: Proceedings of AAAI, vol. 36 (2022)
8.
go back to reference Deng, H., Zhang, Z., Zou, S., Li, X.: Bi-directional frame interpolation for unsupervised video anomaly detection. In: Proceedings of WACV (2023) Deng, H., Zhang, Z., Zou, S., Li, X.: Bi-directional frame interpolation for unsupervised video anomaly detection. In: Proceedings of WACV (2023)
9.
go back to reference Feichtenhofer, C., Li, Y., He, K., et al.: Masked autoencoders as spatiotemporal learners. In: Proceedings of NeurIPS, vol. 35 (2022) Feichtenhofer, C., Li, Y., He, K., et al.: Masked autoencoders as spatiotemporal learners. In: Proceedings of NeurIPS, vol. 35 (2022)
10.
go back to reference Feng, X., Song, D., Chen, Y., Chen, Z., Ni, J., Chen, H.: Convolutional transformer based dual discriminator generative adversarial networks for video anomaly detection. In: Proceedings of MM (2021) Feng, X., Song, D., Chen, Y., Chen, Z., Ni, J., Chen, H.: Convolutional transformer based dual discriminator generative adversarial networks for video anomaly detection. In: Proceedings of MM (2021)
11.
go back to reference Georgescu, M., Barbalau, A., Ionescu, R.T., Khan, F.S., Popescu, M., Shah, M.: Anomaly detection in video via self-supervised and multi-task learning. In: Proceedings of CVPR (2021) Georgescu, M., Barbalau, A., Ionescu, R.T., Khan, F.S., Popescu, M., Shah, M.: Anomaly detection in video via self-supervised and multi-task learning. In: Proceedings of CVPR (2021)
12.
go back to reference Gong, D., et al.: Memorizing normality to detect anomaly: memory-augmented deep autoencoder for unsupervised anomaly detection. In: Proceedings of ICCV (2019) Gong, D., et al.: Memorizing normality to detect anomaly: memory-augmented deep autoencoder for unsupervised anomaly detection. In: Proceedings of ICCV (2019)
13.
go back to reference Huang, X., Zhao, C., Wu, Z.: A video anomaly detection framework based on appearance-motion semantics representation consistency. In: Proceedings of ICASSP (2023) Huang, X., Zhao, C., Wu, Z.: A video anomaly detection framework based on appearance-motion semantics representation consistency. In: Proceedings of ICASSP (2023)
14.
go back to reference Ilg, E., Mayer, N., Saikia, T., Keuper, M., Dosovitskiy, A., Brox, T.: Flownet 2.0: evolution of optical flow estimation with deep networks. In: Proceedings of CVPR (2017) Ilg, E., Mayer, N., Saikia, T., Keuper, M., Dosovitskiy, A., Brox, T.: Flownet 2.0: evolution of optical flow estimation with deep networks. In: Proceedings of CVPR (2017)
15.
go back to reference Ionescu, R.T., Khan, F.S., Georgescu, M.I., Shao, L.: Object-centric auto-encoders and dummy anomalies for abnormal event detection in video. In: Proceedings of CVPR (2019) Ionescu, R.T., Khan, F.S., Georgescu, M.I., Shao, L.: Object-centric auto-encoders and dummy anomalies for abnormal event detection in video. In: Proceedings of CVPR (2019)
16.
go back to reference Lai, Y., Han, Y., Wang, Y.: Anomaly detection with prototype-guided discriminative latent embeddings. In: Proceedings of ICDM (2021) Lai, Y., Han, Y., Wang, Y.: Anomaly detection with prototype-guided discriminative latent embeddings. In: Proceedings of ICDM (2021)
17.
go back to reference Lee, S., Kim, H.G., Ro, Y.M.: BMAN: bidirectional multi-scale aggregation networks for abnormal event detection. IEEE Trans. Image Process. 29, 2395–2408 (2020)CrossRef Lee, S., Kim, H.G., Ro, Y.M.: BMAN: bidirectional multi-scale aggregation networks for abnormal event detection. IEEE Trans. Image Process. 29, 2395–2408 (2020)CrossRef
18.
go back to reference Liu, W., Luo, W., Lian, D., Gao, S.: Future frame prediction for anomaly detection–a new baseline. In: Proceedings of CVPR (2018) Liu, W., Luo, W., Lian, D., Gao, S.: Future frame prediction for anomaly detection–a new baseline. In: Proceedings of CVPR (2018)
19.
go back to reference Liu, Y., Liu, J., Zhao, M., Yang, D., Zhu, X., Song, L.: Learning appearance-motion normality for video anomaly detection. In: Proceedings of ICME (2022) Liu, Y., Liu, J., Zhao, M., Yang, D., Zhu, X., Song, L.: Learning appearance-motion normality for video anomaly detection. In: Proceedings of ICME (2022)
20.
go back to reference Liu, Z., Nie, Y., Long, C., Zhang, Q., Li, G.: A hybrid video anomaly detection framework via memory-augmented flow reconstruction and flow-guided frame prediction. In: Proceedings of ICCV (2021) Liu, Z., Nie, Y., Long, C., Zhang, Q., Li, G.: A hybrid video anomaly detection framework via memory-augmented flow reconstruction and flow-guided frame prediction. In: Proceedings of ICCV (2021)
21.
go back to reference Lu, C., Shi, J., Jia, J.: Abnormal event detection at 150 FPS in MATLAB. In: Proceedings of ICCV (2013) Lu, C., Shi, J., Jia, J.: Abnormal event detection at 150 FPS in MATLAB. In: Proceedings of ICCV (2013)
22.
go back to reference Luo, W., Liu, W., Gao, S.: Remembering history with convolutional LSTM for anomaly detection. In: Proceedings of ICME (2017) Luo, W., Liu, W., Gao, S.: Remembering history with convolutional LSTM for anomaly detection. In: Proceedings of ICME (2017)
23.
go back to reference Luo, W., Liu, W., Gao, S.: A revisit of sparse coding based anomaly detection in stacked RNN framework. In: Proceedings of ICCV (2017) Luo, W., Liu, W., Gao, S.: A revisit of sparse coding based anomaly detection in stacked RNN framework. In: Proceedings of ICCV (2017)
24.
go back to reference Mahadevan, V., Li, W., Bhalodia, V., Vasconcelos, N.: Anomaly detection in crowded scenes. In: Proceedings of CVPR (2010) Mahadevan, V., Li, W., Bhalodia, V., Vasconcelos, N.: Anomaly detection in crowded scenes. In: Proceedings of CVPR (2010)
25.
go back to reference Park, H., Noh, J., Ham, B.: Learning memory-guided normality for anomaly detection. In: Proceedings of CVPR (2020) Park, H., Noh, J., Ham, B.: Learning memory-guided normality for anomaly detection. In: Proceedings of CVPR (2020)
26.
go back to reference Pathak, D., Krahenbuhl, P., Donahue, J., Darrell, T., Efros, A.A.: Context encoders: feature learning by inpainting. In: Proceedings of CVPR (2016) Pathak, D., Krahenbuhl, P., Donahue, J., Darrell, T., Efros, A.A.: Context encoders: feature learning by inpainting. In: Proceedings of CVPR (2016)
28.
go back to reference Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: Visual explanations from deep networks via gradient-based localization. In: Proceedings of CVPR (2017) Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: Visual explanations from deep networks via gradient-based localization. In: Proceedings of CVPR (2017)
29.
go back to reference Shen, L., Matsukawa, T., Suzuki, E.: Detecting video anomalous events with an enhanced abnormality score. In: Proceedings of PRICAI, vol. 13629 (2022) Shen, L., Matsukawa, T., Suzuki, E.: Detecting video anomalous events with an enhanced abnormality score. In: Proceedings of PRICAI, vol. 13629 (2022)
30.
go back to reference Sun, C., Shi, C., Jia, Y., Wu, Y.: Learning event-relevant factors for video anomaly detection. In: Proceedings of AAAI, vol. 37 (2023) Sun, C., Shi, C., Jia, Y., Wu, Y.: Learning event-relevant factors for video anomaly detection. In: Proceedings of AAAI, vol. 37 (2023)
31.
go back to reference Tang, Y., Zhao, L., Zhang, S., Gong, C., Li, G., Yang, J.: Integrating prediction and reconstruction for anomaly detection. Pattern Recogn. Lett. 129, 123–130 (2020)CrossRef Tang, Y., Zhao, L., Zhang, S., Gong, C., Li, G., Yang, J.: Integrating prediction and reconstruction for anomaly detection. Pattern Recogn. Lett. 129, 123–130 (2020)CrossRef
32.
go back to reference Tishby, N., Zaslavsky, N.: Deep learning and the information bottleneck principle. In: Proceedings of ITW, pp. 1–5 (2015) Tishby, N., Zaslavsky, N.: Deep learning and the information bottleneck principle. In: Proceedings of ITW, pp. 1–5 (2015)
33.
go back to reference Tong, Z., Song, Y., Wang, J., Wang, L.: VideoMAE: masked autoencoders are data-efficient learners for self-supervised video pre-training. In: Procedings of NeurIPS, vol. 35 (2022) Tong, Z., Song, Y., Wang, J., Wang, L.: VideoMAE: masked autoencoders are data-efficient learners for self-supervised video pre-training. In: Procedings of NeurIPS, vol. 35 (2022)
34.
go back to reference Vu, H., Nguyen, T.D., Travers, A., Venkatesh, S., Phung, D.: Energy-based localized anomaly detection in video surveillance. In: Proceedings of PAKDD (2017) Vu, H., Nguyen, T.D., Travers, A., Venkatesh, S., Phung, D.: Energy-based localized anomaly detection in video surveillance. In: Proceedings of PAKDD (2017)
36.
go back to reference Wang, X., Wang, X., et al.: Robust unsupervised video anomaly detection by multipath frame prediction. IEEE Trans. Neural Netw, Learn. Syst. 33(6), 2301–2312 (2022)MathSciNetCrossRef Wang, X., Wang, X., et al.: Robust unsupervised video anomaly detection by multipath frame prediction. IEEE Trans. Neural Netw, Learn. Syst. 33(6), 2301–2312 (2022)MathSciNetCrossRef
37.
go back to reference Wang, Y., Qin, C., Bai, Y., Xu, Y., Ma, X., Fu, Y.: Making reconstruction-based method great again for video anomaly detection. In: Proceedings of ICDM (2022) Wang, Y., Qin, C., Bai, Y., Xu, Y., Ma, X., Fu, Y.: Making reconstruction-based method great again for video anomaly detection. In: Proceedings of ICDM (2022)
38.
go back to reference Wang, Z., Zou, Y., Zhang, Z.: Cluster attention contrast for video anomaly detection. In: Proceedings of MM (2020) Wang, Z., Zou, Y., Zhang, Z.: Cluster attention contrast for video anomaly detection. In: Proceedings of MM (2020)
39.
go back to reference Yang, Z., Liu, J., Wu, Z., Wu, P., Liu, X.: Video event restoration based on keyframes for video anomaly detection. In: Proceedings of CVPR (2023) Yang, Z., Liu, J., Wu, Z., Wu, P., Liu, X.: Video event restoration based on keyframes for video anomaly detection. In: Proceedings of CVPR (2023)
40.
go back to reference Ye, M., Peng, X., Gan, W., Wu, W., Qiao, Y.: AnoPCN: video anomaly detection via deep predictive coding network. In: Proceedings of MM (2019) Ye, M., Peng, X., Gan, W., Wu, W., Qiao, Y.: AnoPCN: video anomaly detection via deep predictive coding network. In: Proceedings of MM (2019)
41.
go back to reference Yu, G., et al.: Cloze test helps: effective video anomaly detection via learning to complete video events. In: Proceedings of MM (2020) Yu, G., et al.: Cloze test helps: effective video anomaly detection via learning to complete video events. In: Proceedings of MM (2020)
42.
go back to reference Zhou, W., Li, Y., Zhao, C.: Object-guided and motion-refined attention network for video anomaly detection. In: Proceedings of ICME (2022) Zhou, W., Li, Y., Zhao, C.: Object-guided and motion-refined attention network for video anomaly detection. In: Proceedings of ICME (2022)
Metadata
Title
SATJiP: Spatial and Augmented Temporal Jigsaw Puzzles for Video Anomaly Detection
Authors
Liheng Shen
Tetsu Matsukawa
Einoshin Suzuki
Copyright Year
2024
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-2242-6_3

Premium Partner