Skip to main content
Top

2017 | OriginalPaper | Chapter

11. Study of Human Action Recognition Based on Improved Spatio-Temporal Features

Authors : Honghai Liu, Zhaojie Ju, Xiaofei Ji, Chee Seng Chan, Mehdi Khoury

Published in: Human Motion Sensing and Recognition

Publisher: Springer Berlin Heidelberg

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Most of the existed action recognition methods mainly utilise spatio-temporal descriptors of single interest point ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information (PDI) of interest points, a novel motion descriptor is proposed in this chapter. The proposed method detects interest points by using an improved interest points detection method. Then 3-dimensional scale-invariant feature transform (3D SIFT) descriptors are extracted for every interest point. In order to obtain compact description and efficient computation, Principal Component Analysis (PCA) method is utilised twice on the 3D SIFT descriptors of single-frame and multi-frame. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using Support Vector Machine (SVM) and AdaBoost-SVM recognition algorithm on the public KTH dataset. The testing results showed that the recognition rate has been significantly improved. Meantime, the test results verified the proposed features can more accurately describe human motion with high adaptability to scenarios.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference H. J. Seo and P. Milanfar. Action recognition from one example. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(5):867–882, 2011.CrossRef H. J. Seo and P. Milanfar. Action recognition from one example. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(5):867–882, 2011.CrossRef
2.
go back to reference D. Weinland, R. Ronfard, and E. Boyer. A survey of vision-based methods for action representation, segmentation and recognition. Computer Vision and Image Understanding, 115(2):24–241, 2011.CrossRef D. Weinland, R. Ronfard, and E. Boyer. A survey of vision-based methods for action representation, segmentation and recognition. Computer Vision and Image Understanding, 115(2):24–241, 2011.CrossRef
3.
go back to reference X. Ji and H. Liu. Advances in View-Invariant Human Motion Analysis: A Review. IEEE Transactions on Systems, Man and Cybernetics Part C, 40(1):13–24, 2010. X. Ji and H. Liu. Advances in View-Invariant Human Motion Analysis: A Review. IEEE Transactions on Systems, Man and Cybernetics Part C, 40(1):13–24, 2010.
4.
go back to reference X. Li. Hmm based action recognition using oriented histograms of optical flow field. Electronics Letters, 43(10):560–561, 2007.CrossRef X. Li. Hmm based action recognition using oriented histograms of optical flow field. Electronics Letters, 43(10):560–561, 2007.CrossRef
5.
go back to reference S. Ali and M. Shah. Human action recognition in videos using kinematic features and multiple instance learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(2):288–303, 2007.CrossRef S. Ali and M. Shah. Human action recognition in videos using kinematic features and multiple instance learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(2):288–303, 2007.CrossRef
6.
go back to reference M. Hahn, L. Krüger, and C. Wöhler. 3d action recognition and long-term prediction of human motion. Computer Vision Systems, pages 23–32, 2008. M. Hahn, L. Krüger, and C. Wöhler. 3d action recognition and long-term prediction of human motion. Computer Vision Systems, pages 23–32, 2008.
7.
go back to reference F. Jiang, Y. Wu, and A. K. Katsaggelos. A dynamic hierarchical clustering method for trajectory-based unusual video event detection. IEEE Transactions on Image Processing, 18(4):907–913, 2009.MathSciNetCrossRef F. Jiang, Y. Wu, and A. K. Katsaggelos. A dynamic hierarchical clustering method for trajectory-based unusual video event detection. IEEE Transactions on Image Processing, 18(4):907–913, 2009.MathSciNetCrossRef
8.
go back to reference H. Zhou, H. Hu, H. Liu, and J. Tang. Classification of upper limb motion trajectories using shape features. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 42(6):970–982, 2012.CrossRef H. Zhou, H. Hu, H. Liu, and J. Tang. Classification of upper limb motion trajectories using shape features. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 42(6):970–982, 2012.CrossRef
9.
go back to reference X. Cao, B. Ning, P. Yan, and X. Li. Selecting key poses on manifold for pairwise action recognition. IEEE Transactions on Industrial Informatics, 8(1):168–177, 2012.CrossRef X. Cao, B. Ning, P. Yan, and X. Li. Selecting key poses on manifold for pairwise action recognition. IEEE Transactions on Industrial Informatics, 8(1):168–177, 2012.CrossRef
10.
go back to reference A. A. Chaaraoui, P. Climent-Pérez, and F. Flórez-Revuelta. Silhouette-based human action recognition using sequences of key poses. Pattern Recognition Letters, 34(15):1799–1807, 2013.CrossRef A. A. Chaaraoui, P. Climent-Pérez, and F. Flórez-Revuelta. Silhouette-based human action recognition using sequences of key poses. Pattern Recognition Letters, 34(15):1799–1807, 2013.CrossRef
11.
go back to reference R. Poppe. A survey on vision-based human action recognition. Image and Vision computing, 28(6):976–990, 2010.CrossRef R. Poppe. A survey on vision-based human action recognition. Image and Vision computing, 28(6):976–990, 2010.CrossRef
12.
go back to reference P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In Proceeding of Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pages 65–72. Beijing, 2005. P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In Proceeding of Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pages 65–72. Beijing, 2005.
13.
go back to reference I. Laptev and T. Lindeberg. Local descriptors for spatio-temporal recognition. In Proceeding of First Workshop on Spatial Coherence for Visual Motion Analysis, Springer, pages 91–103, 2006. I. Laptev and T. Lindeberg. Local descriptors for spatio-temporal recognition. In Proceeding of First Workshop on Spatial Coherence for Visual Motion Analysis, Springer, pages 91–103, 2006.
14.
go back to reference J. C. Niebles, H. Wang, and L. Fei-Fei. Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision, 79(3):299–318, 2008.CrossRef J. C. Niebles, H. Wang, and L. Fei-Fei. Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision, 79(3):299–318, 2008.CrossRef
15.
go back to reference J. Zhu, J. Qi, and X. Kong. An improved method of action recognition based on sparse spatio-temporal features. Artificial Intelligence: Methodology, Systems, and Applications, pages 240–245, 2012. J. Zhu, J. Qi, and X. Kong. An improved method of action recognition based on sparse spatio-temporal features. Artificial Intelligence: Methodology, Systems, and Applications, pages 240–245, 2012.
16.
go back to reference P. Liu, J. Wang, M. She, and H. Liu. Human action recognition based on 3d sift and lda model. In Proceeding of IEEE Workshop on Robotic Intelligence In Informationally Structured Space (RiiSS), pages 12–17, 2011. P. Liu, J. Wang, M. She, and H. Liu. Human action recognition based on 3d sift and lda model. In Proceeding of IEEE Workshop on Robotic Intelligence In Informationally Structured Space (RiiSS), pages 12–17, 2011.
17.
go back to reference X. Jiang, T. Sun, B. Feng, and C. Jiang. A space-time surf descriptor and its application to action recognition with video words. In Proceeding of Eighth International Conference on Fuzzy Systems and Knowledge Discovery, pages 1911–1915. Vol.3Vol.3, 2011. X. Jiang, T. Sun, B. Feng, and C. Jiang. A space-time surf descriptor and its application to action recognition with video words. In Proceeding of Eighth International Conference on Fuzzy Systems and Knowledge Discovery, pages 1911–1915. Vol.3Vol.3, 2011.
18.
go back to reference P. Scovanner, S. Ali, and M. Shah. A 3-dimensional sift descriptor and its application to action recognition. In Proceeding of the 15th international conference on Multimedia, pages 357–360. ACM, 2007. P. Scovanner, S. Ali, and M. Shah. A 3-dimensional sift descriptor and its application to action recognition. In Proceeding of the 15th international conference on Multimedia, pages 357–360. ACM, 2007.
19.
go back to reference A. Kläser, M. Marszałek, C. Schmid, and L. Lear. A spatio-temporal descriptor based on 3d-gradients. In Proceeding of British Machine Vision Conference, UK, pages 1–10, 2008. A. Kläser, M. Marszałek, C. Schmid, and L. Lear. A spatio-temporal descriptor based on 3d-gradients. In Proceeding of British Machine Vision Conference, UK, pages 1–10, 2008.
20.
go back to reference G. Willems, T. Tuytelaars, and L. Van Gool. An efficient dense and scale-invariant spatio-temporal interest point detector. In Proceeding of European Conference on Computer Vision, Springer,France, pages 650–663, 2008. G. Willems, T. Tuytelaars, and L. Van Gool. An efficient dense and scale-invariant spatio-temporal interest point detector. In Proceeding of European Conference on Computer Vision, Springer,France, pages 650–663, 2008.
21.
go back to reference F. Li, C. Xiamen, and J. Du. Local spatio-temporal interest point detection for human action recognition. In IEEE 5th International Conference on Advanced Computational Intelligence, pages 1–10, 2012. F. Li, C. Xiamen, and J. Du. Local spatio-temporal interest point detection for human action recognition. In IEEE 5th International Conference on Advanced Computational Intelligence, pages 1–10, 2012.
22.
go back to reference M. Bregonzio, S. Gong, and T. Xiang. Recognising action as clouds of space-time interest points. In Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pages 1948–1955, Florida, USA, 2009. IEEE. M. Bregonzio, S. Gong, and T. Xiang. Recognising action as clouds of space-time interest points. In Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pages 1948–1955, Florida, USA, 2009. IEEE.
23.
go back to reference C.C. Chang and C.J. Lin. Libsvm: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1–27:27, May 2011. C.C. Chang and C.J. Lin. Libsvm: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1–27:27, May 2011.
24.
go back to reference S. Umakanthan, S. Denman, S. Sridharan, C. Fookes, and T. Wark. Spatio temporal feature evaluation for action recognition. In Proceeding of International Conference on Digital Image Computing Techniques and Applications, pages 1–8, 2012. S. Umakanthan, S. Denman, S. Sridharan, C. Fookes, and T. Wark. Spatio temporal feature evaluation for action recognition. In Proceeding of International Conference on Digital Image Computing Techniques and Applications, pages 1–8, 2012.
25.
go back to reference Jose M Chaquet, Enrique J Carmona, and Antonio Fernández-Caballero. A survey of video datasets for human action and activity recognition. Computer Vision and Image Understanding, 117(6):633–659, 2013. Jose M Chaquet, Enrique J Carmona, and Antonio Fernández-Caballero. A survey of video datasets for human action and activity recognition. Computer Vision and Image Understanding, 117(6):633–659, 2013.
Metadata
Title
Study of Human Action Recognition Based on Improved Spatio-Temporal Features
Authors
Honghai Liu
Zhaojie Ju
Xiaofei Ji
Chee Seng Chan
Mehdi Khoury
Copyright Year
2017
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-53692-6_11

Premium Partner