Skip to main content
Top
Published in: Progress in Artificial Intelligence 3/2018

17-02-2018 | Regular Paper

Following event detection method based on human skeleton motion analysis by Kinect sensor

Authors: Hong-Bo Zhang, Miaohui Zhang, Jinyang Guo, Qing Lei, Tsung-Chih Hsiao

Published in: Progress in Artificial Intelligence | Issue 3/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Detecting following events is important for intelligent video surveillance systems, especially at secure entrances. However, there is no effective approach for recognizing following actions. In this paper, we propose a novel following action recognition method based on motion analysis of the human skeleton. We use a Kinect to capture the human skeleton and represent the following action as a joint-trajectory difference feature. The most difficult aspect of recognizing a following action lies in classifying both the following action and the companion action. To avoid false alarms from companion actions, this type of human interaction should be detected by the relative displacement of joint points. In our approach, an SVM classifier is trained to recognize following actions, companion actions and others. To verify the proposed method, we built a new following action dataset during this study. The experimental results show that the proposed method is effective at recognizing following actions.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Guo, H., Wu, X., Cai, S., Li, N., Cheng, J., Chen, Y.L.: Quaternion discrete cosine transformation signature analysis in crowd scenes for abnormal event detection. Neurocomputing 204, 106–15 (2016)CrossRef Guo, H., Wu, X., Cai, S., Li, N., Cheng, J., Chen, Y.L.: Quaternion discrete cosine transformation signature analysis in crowd scenes for abnormal event detection. Neurocomputing 204, 106–15 (2016)CrossRef
2.
go back to reference Sandhan, T., Sethi, A., Srivastava, T., Choi, JY. (eds.): Unsupervised learning approach for abnormal event detection in surveillance video by revealing infrequent patterns. In: 2013 28th International Conference on Image and Vision Computing New Zealand (IVCNZ 2013), pp. 27–29 (2013) Sandhan, T., Sethi, A., Srivastava, T., Choi, JY. (eds.): Unsupervised learning approach for abnormal event detection in surveillance video by revealing infrequent patterns. In: 2013 28th International Conference on Image and Vision Computing New Zealand (IVCNZ 2013), pp. 27–29 (2013)
3.
go back to reference Jiang, F., Yuan, J., Tsaftaris, S.A., Katsaggelos, A.K.: Anomalous video event detection using spatiotemporal context. Comput. Vis. Image Understand. 115(3), 323–33 (2011)CrossRef Jiang, F., Yuan, J., Tsaftaris, S.A., Katsaggelos, A.K.: Anomalous video event detection using spatiotemporal context. Comput. Vis. Image Understand. 115(3), 323–33 (2011)CrossRef
4.
go back to reference Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Comput. Surv. 41(3), 1–58 (2009)CrossRef Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Comput. Surv. 41(3), 1–58 (2009)CrossRef
5.
go back to reference Du, T., Yuan, J. (eds.): Optimal spatio-temporal path discovery for video event detection. In: Computer Vision and Pattern Recognition (2011) Du, T., Yuan, J. (eds.): Optimal spatio-temporal path discovery for video event detection. In: Computer Vision and Pattern Recognition (2011)
6.
go back to reference Yang, C., Yuan, J., Liu, J.: Abnormal event detection in crowded scenes using sparse representation. Pattern Recogn. 46(7), 1851–64 (2013)CrossRef Yang, C., Yuan, J., Liu, J.: Abnormal event detection in crowded scenes using sparse representation. Pattern Recogn. 46(7), 1851–64 (2013)CrossRef
7.
go back to reference Su, S., Wu, S.S., Chen, S.Y., Duh, D.J., Li, S.: Multi-view fall detection based on spatio-temporal interest points. Multimed. Tools Appl. 75(14), 8469–92 (2016)CrossRef Su, S., Wu, S.S., Chen, S.Y., Duh, D.J., Li, S.: Multi-view fall detection based on spatio-temporal interest points. Multimed. Tools Appl. 75(14), 8469–92 (2016)CrossRef
8.
go back to reference Akoglu, L., Tong, H., Koutra, D.: Graph based anomaly detection and description: a survey. Data Min. Knowl. Disc. 29(3), 626–88 (2015)MathSciNetCrossRef Akoglu, L., Tong, H., Koutra, D.: Graph based anomaly detection and description: a survey. Data Min. Knowl. Disc. 29(3), 626–88 (2015)MathSciNetCrossRef
9.
go back to reference Zhang, H.B., Li, S.Z., Su, S.Z., Chen, S.Y.: Selecting effective and discriminative spatio-temporal interest points for recognizing human action. IEICE Trans. Inf. Syst. E96d(8), 1783–92 (2013)CrossRef Zhang, H.B., Li, S.Z., Su, S.Z., Chen, S.Y.: Selecting effective and discriminative spatio-temporal interest points for recognizing human action. IEICE Trans. Inf. Syst. E96d(8), 1783–92 (2013)CrossRef
10.
go back to reference Chakraborty, B., Holte, M.B., Moeslund, T.B., Gonzalez, J.: Selective spatio-temporal interest points. Comput. Vis. Image Understand. 116(3), 396–410 (2012)CrossRef Chakraborty, B., Holte, M.B., Moeslund, T.B., Gonzalez, J.: Selective spatio-temporal interest points. Comput. Vis. Image Understand. 116(3), 396–410 (2012)CrossRef
11.
go back to reference Zhang, H.B., Lei, Q., Chen, D.S., Zhong, B.N., Peng, J., Du, J.X., et al.: Probability-based method for boosting human action recognition using scene context. IET Comput. Vis. 10(6), 528–536 (2016)CrossRef Zhang, H.B., Lei, Q., Chen, D.S., Zhong, B.N., Peng, J., Du, J.X., et al.: Probability-based method for boosting human action recognition using scene context. IET Comput. Vis. 10(6), 528–536 (2016)CrossRef
12.
go back to reference Li, M., Leung, H., Shum, HPH.: Human action recognition via skeletal and depth based feature fusion. In: Motion in Games, pp. 123–32. ACM, San Francisco (2016) Li, M., Leung, H., Shum, HPH.: Human action recognition via skeletal and depth based feature fusion. In: Motion in Games, pp. 123–32. ACM, San Francisco (2016)
13.
go back to reference Yang, X., Tian, YL. (eds.): Super normal vector for activity recognition using depth sequences. In: IEEE Conference on Computer Vision and Pattern Recognition (2014) Yang, X., Tian, YL. (eds.): Super normal vector for activity recognition using depth sequences. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)
14.
go back to reference Rahmani, H., Mahmood, A., Huynh, D.Q., Mian, A. (eds.): Real time action recognition using histograms of depth gradients and random decision forests. In: WACV 2014: IEEE Winter Conference on Applications of Computer Vision (2014) Rahmani, H., Mahmood, A., Huynh, D.Q., Mian, A. (eds.): Real time action recognition using histograms of depth gradients and random decision forests. In: WACV 2014: IEEE Winter Conference on Applications of Computer Vision (2014)
15.
go back to reference Oreifej, O., Liu, Z. (eds.): HON4D: Histogram of oriented 4D normals for activity recognition from depth sequences. In: Computer Vision and Pattern Recognition (2013) Oreifej, O., Liu, Z. (eds.): HON4D: Histogram of oriented 4D normals for activity recognition from depth sequences. In: Computer Vision and Pattern Recognition (2013)
16.
go back to reference Chen, C., Liu, K., Kehtarnavaz, N.: Real-time human action recognition based on depth motion maps. J. Real-Time Image Proc. 12(1), 155–63 (2016)CrossRef Chen, C., Liu, K., Kehtarnavaz, N.: Real-time human action recognition based on depth motion maps. J. Real-Time Image Proc. 12(1), 155–63 (2016)CrossRef
17.
go back to reference Yang, X., Zhang, C., Tian, Y.L. (eds.): Recognizing actions using depth motion maps-based histograms of oriented gradients. In: ACM International Conference on Multimedia (2012) Yang, X., Zhang, C., Tian, Y.L. (eds.): Recognizing actions using depth motion maps-based histograms of oriented gradients. In: ACM International Conference on Multimedia (2012)
18.
go back to reference Yang, X., Tian, Y.L.: Effective 3D action recognition using eigenjoints. J. Vis. Commun. Image Represent. 25(1), 2–11 (2014)CrossRef Yang, X., Tian, Y.L.: Effective 3D action recognition using eigenjoints. J. Vis. Commun. Image Represent. 25(1), 2–11 (2014)CrossRef
19.
go back to reference Xia, L., Chen, CC., Aggarwal, JK. (eds.): View invariant human action recognition using histograms of 3D joints. In: Computer Vision and Pattern Recognition Workshops (2012) Xia, L., Chen, CC., Aggarwal, JK. (eds.): View invariant human action recognition using histograms of 3D joints. In: Computer Vision and Pattern Recognition Workshops (2012)
20.
go back to reference Wu, Y.: Mining actionlet ensemble for action recognition with depth cameras. In: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2354966, p. 1290-7. IEEE Computer Society (2012) Wu, Y.: Mining actionlet ensemble for action recognition with depth cameras. In: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2354966, p. 1290-7. IEEE Computer Society (2012)
21.
go back to reference Wang, J., Liu, Z., Wu, Y., Yuan, J.: Learning actionlet ensemble for 3D human action recognition. IEEE Trans. Softw. Eng. 36(5), 914–27 (2013) Wang, J., Liu, Z., Wu, Y., Yuan, J.: Learning actionlet ensemble for 3D human action recognition. IEEE Trans. Softw. Eng. 36(5), 914–27 (2013)
22.
go back to reference Yu, G., Yuan, J., Liu, Z.: Predicting human activities using spatio-temporal structure of interest points. In: Proceedings of the 20th ACM International Conference on Multimedia, Nara, Japan. 2396380, p. 1049-52. ACM (2012) Yu, G., Yuan, J., Liu, Z.: Predicting human activities using spatio-temporal structure of interest points. In: Proceedings of the 20th ACM International Conference on Multimedia, Nara, Japan. 2396380, p. 1049-52. ACM (2012)
23.
go back to reference Yao, B., Fei-Fei, L.: Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses. IEEE Trans. Pattern Anal. 34(9), 1691–703 (2012)CrossRef Yao, B., Fei-Fei, L.: Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses. IEEE Trans. Pattern Anal. 34(9), 1691–703 (2012)CrossRef
24.
go back to reference Ramanan, D. (ed.): Face detection, pose estimation, and landmark localization in the wild. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2012, June 16, 2012–June 21, 2012. IEEE Computer Society, Providence Ramanan, D. (ed.): Face detection, pose estimation, and landmark localization in the wild. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2012, June 16, 2012–June 21, 2012. IEEE Computer Society, Providence
25.
go back to reference Desai, C., Ramanan, D. (eds.): Detecting actions, poses, and objects with relational phraselets. In: 12th European Conference on Computer Vision, ECCV 2012, October 7, 2012–October 13, 2012. Springer, Florence Desai, C., Ramanan, D. (eds.): Detecting actions, poses, and objects with relational phraselets. In: 12th European Conference on Computer Vision, ECCV 2012, October 7, 2012–October 13, 2012. Springer, Florence
26.
go back to reference Shotton, J., Sharp, T., Fitzgibbon, A., Blake, A., Cook, M., Kipman, A., et al.: Real-time human pose recognition in parts from single depth images. Commun. ACM 56(1), 116–24 (2013)CrossRef Shotton, J., Sharp, T., Fitzgibbon, A., Blake, A., Cook, M., Kipman, A., et al.: Real-time human pose recognition in parts from single depth images. Commun. ACM 56(1), 116–24 (2013)CrossRef
27.
go back to reference Liu, L., Shao, L. (eds.): Learning discriminative representations from RGB-D video data. In: International Joint Conference on Artificial Intelligence (2013) Liu, L., Shao, L. (eds.): Learning discriminative representations from RGB-D video data. In: International Joint Conference on Artificial Intelligence (2013)
28.
go back to reference Zhang, Z., Liu, S., Liu, S., Han, L., Shao, Y., Zhou, W.: Human action recognition using salient region detection in complex scenes. In: Mu, J., Liang, Q., Wang, W., Zhang, B., Pi, Y. (eds.) The Proceedings of the Third International Conference on Communications, Signal Processing, and Systems. Lecture Notes in Electrical Engineering, vol. 322. Springer, Cham (2015) Zhang, Z., Liu, S., Liu, S., Han, L., Shao, Y., Zhou, W.: Human action recognition using salient region detection in complex scenes. In: Mu, J., Liang, Q., Wang, W., Zhang, B., Pi, Y. (eds.) The Proceedings of the Third International Conference on Communications, Signal Processing, and Systems. Lecture Notes in Electrical Engineering, vol. 322. Springer, Cham (2015)
29.
go back to reference Shao, L., Zhen, X.T., Tao, D.C., Li, X.L.: Spatio-temporal laplacian pyramid coding for action recognition. IEEE Trans. Cybern. 44(6), 817–27 (2014)CrossRef Shao, L., Zhen, X.T., Tao, D.C., Li, X.L.: Spatio-temporal laplacian pyramid coding for action recognition. IEEE Trans. Cybern. 44(6), 817–27 (2014)CrossRef
Metadata
Title
Following event detection method based on human skeleton motion analysis by Kinect sensor
Authors
Hong-Bo Zhang
Miaohui Zhang
Jinyang Guo
Qing Lei
Tsung-Chih Hsiao
Publication date
17-02-2018
Publisher
Springer Berlin Heidelberg
Published in
Progress in Artificial Intelligence / Issue 3/2018
Print ISSN: 2192-6352
Electronic ISSN: 2192-6360
DOI
https://doi.org/10.1007/s13748-018-0143-y

Other articles of this Issue 3/2018

Progress in Artificial Intelligence 3/2018 Go to the issue

Premium Partner