Abnormal event detection for video surveillance using deep one-class learning

Multimedia Tools and Applications

Abstract

Abnormal event detection and localization is a challenging research problem in intelligent video surveillance. Its goal is to automatically identify abnormal events in monitoring videos. The main difficulty is that the training video sequences contain only one class, the “normal event”. In recent years, many advanced algorithms have been proposed on the basis of hand-crafted features. Only a few algorithms are based on high-level features, and almost all of these use two-stage learning. In this paper, we propose a novel end-to-end model, named the Deep One-Class (DOC) model, which integrates the one-class Support Vector Machine (SVM) into a Convolutional Neural Network (CNN). Specifically, a robust loss function derived from the one-class SVM is used to optimize the parameters of the model. Compared with hierarchical models, our model not only simplifies the process but also obtains a globally optimal solution for the whole process. In the experiments, we validate the DOC model on a publicly available dataset and compare it with several state-of-the-art methods. The comparison results demonstrate that our model performs well and is effective for abnormal event detection in surveillance videos.
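
To make the idea of a loss "derived from the one-class SVM" concrete, the following is a minimal sketch of the standard linear one-class SVM objective evaluated on feature vectors. It is an illustration under assumptions, not the loss defined in the paper: the names `one_class_svm_loss` and `is_abnormal`, the parameter `nu`, and the use of a plain feature matrix in place of CNN outputs are all hypothetical, since the abstract does not give the exact formulation.

```python
import numpy as np


def one_class_svm_loss(features, w, rho, nu=0.1):
    """Standard linear one-class SVM objective (an assumed stand-in,
    not necessarily the paper's loss).

    features: (n, d) array; hypothetically the output of a CNN's last layer.
    w:        (d,) weight vector of the separating hyperplane.
    rho:      scalar offset; samples scoring below rho are penalized.
    nu:       trade-off in (0, 1] bounding the fraction of outliers.
    """
    scores = features @ w                    # w . x for every sample
    slack = np.maximum(0.0, rho - scores)    # hinge penalty for low scores
    return 0.5 * np.dot(w, w) + slack.mean() / nu - rho


def is_abnormal(features, w, rho):
    """Flag samples whose score falls below the learned offset rho."""
    return features @ w < rho
```

In an end-to-end setting, a loss of this shape could in principle be minimized jointly over the network weights, `w`, and `rho` with gradient descent, which is what distinguishes it from a two-stage pipeline that first extracts features and then fits an SVM separately.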



Acknowledgements

This work is supported by the National Natural Science Foundation of China (grants No. 61672133 and No. 61632007), and the Fundamental Research Funds for the Central Universities (grants No. ZYGX2015J058 and No. ZYGX2014Z007).

Author information

Corresponding author

Correspondence to Jie Shao.

Cite this article

Sun, J., Shao, J. & He, C. Abnormal event detection for video surveillance using deep one-class learning. Multimed Tools Appl 78, 3633–3647 (2019). https://doi.org/10.1007/s11042-017-5244-2

  • DOI: https://doi.org/10.1007/s11042-017-5244-2
