Skip to main content

2017 | OriginalPaper | Buchkapitel

11. Study of Human Action Recognition Based on Improved Spatio-Temporal Features

verfasst von : Honghai Liu, Zhaojie Ju, Xiaofei Ji, Chee Seng Chan, Mehdi Khoury

Erschienen in: Human Motion Sensing and Recognition

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Most of the existed action recognition methods mainly utilise spatio-temporal descriptors of single interest point ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information (PDI) of interest points, a novel motion descriptor is proposed in this chapter. The proposed method detects interest points by using an improved interest points detection method. Then 3-dimensional scale-invariant feature transform (3D SIFT) descriptors are extracted for every interest point. In order to obtain compact description and efficient computation, Principal Component Analysis (PCA) method is utilised twice on the 3D SIFT descriptors of single-frame and multi-frame. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using Support Vector Machine (SVM) and AdaBoost-SVM recognition algorithm on the public KTH dataset. The testing results showed that the recognition rate has been significantly improved. Meantime, the test results verified the proposed features can more accurately describe human motion with high adaptability to scenarios.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat H. J. Seo and P. Milanfar. Action recognition from one example. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(5):867–882, 2011.CrossRef H. J. Seo and P. Milanfar. Action recognition from one example. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(5):867–882, 2011.CrossRef
2.
Zurück zum Zitat D. Weinland, R. Ronfard, and E. Boyer. A survey of vision-based methods for action representation, segmentation and recognition. Computer Vision and Image Understanding, 115(2):24–241, 2011.CrossRef D. Weinland, R. Ronfard, and E. Boyer. A survey of vision-based methods for action representation, segmentation and recognition. Computer Vision and Image Understanding, 115(2):24–241, 2011.CrossRef
3.
Zurück zum Zitat X. Ji and H. Liu. Advances in View-Invariant Human Motion Analysis: A Review. IEEE Transactions on Systems, Man and Cybernetics Part C, 40(1):13–24, 2010. X. Ji and H. Liu. Advances in View-Invariant Human Motion Analysis: A Review. IEEE Transactions on Systems, Man and Cybernetics Part C, 40(1):13–24, 2010.
4.
Zurück zum Zitat X. Li. Hmm based action recognition using oriented histograms of optical flow field. Electronics Letters, 43(10):560–561, 2007.CrossRef X. Li. Hmm based action recognition using oriented histograms of optical flow field. Electronics Letters, 43(10):560–561, 2007.CrossRef
5.
Zurück zum Zitat S. Ali and M. Shah. Human action recognition in videos using kinematic features and multiple instance learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(2):288–303, 2007.CrossRef S. Ali and M. Shah. Human action recognition in videos using kinematic features and multiple instance learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(2):288–303, 2007.CrossRef
6.
Zurück zum Zitat M. Hahn, L. Krüger, and C. Wöhler. 3d action recognition and long-term prediction of human motion. Computer Vision Systems, pages 23–32, 2008. M. Hahn, L. Krüger, and C. Wöhler. 3d action recognition and long-term prediction of human motion. Computer Vision Systems, pages 23–32, 2008.
7.
Zurück zum Zitat F. Jiang, Y. Wu, and A. K. Katsaggelos. A dynamic hierarchical clustering method for trajectory-based unusual video event detection. IEEE Transactions on Image Processing, 18(4):907–913, 2009.MathSciNetCrossRef F. Jiang, Y. Wu, and A. K. Katsaggelos. A dynamic hierarchical clustering method for trajectory-based unusual video event detection. IEEE Transactions on Image Processing, 18(4):907–913, 2009.MathSciNetCrossRef
8.
Zurück zum Zitat H. Zhou, H. Hu, H. Liu, and J. Tang. Classification of upper limb motion trajectories using shape features. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 42(6):970–982, 2012.CrossRef H. Zhou, H. Hu, H. Liu, and J. Tang. Classification of upper limb motion trajectories using shape features. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 42(6):970–982, 2012.CrossRef
9.
Zurück zum Zitat X. Cao, B. Ning, P. Yan, and X. Li. Selecting key poses on manifold for pairwise action recognition. IEEE Transactions on Industrial Informatics, 8(1):168–177, 2012.CrossRef X. Cao, B. Ning, P. Yan, and X. Li. Selecting key poses on manifold for pairwise action recognition. IEEE Transactions on Industrial Informatics, 8(1):168–177, 2012.CrossRef
10.
Zurück zum Zitat A. A. Chaaraoui, P. Climent-Pérez, and F. Flórez-Revuelta. Silhouette-based human action recognition using sequences of key poses. Pattern Recognition Letters, 34(15):1799–1807, 2013.CrossRef A. A. Chaaraoui, P. Climent-Pérez, and F. Flórez-Revuelta. Silhouette-based human action recognition using sequences of key poses. Pattern Recognition Letters, 34(15):1799–1807, 2013.CrossRef
11.
Zurück zum Zitat R. Poppe. A survey on vision-based human action recognition. Image and Vision computing, 28(6):976–990, 2010.CrossRef R. Poppe. A survey on vision-based human action recognition. Image and Vision computing, 28(6):976–990, 2010.CrossRef
12.
Zurück zum Zitat P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In Proceeding of Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pages 65–72. Beijing, 2005. P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In Proceeding of Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pages 65–72. Beijing, 2005.
13.
Zurück zum Zitat I. Laptev and T. Lindeberg. Local descriptors for spatio-temporal recognition. In Proceeding of First Workshop on Spatial Coherence for Visual Motion Analysis, Springer, pages 91–103, 2006. I. Laptev and T. Lindeberg. Local descriptors for spatio-temporal recognition. In Proceeding of First Workshop on Spatial Coherence for Visual Motion Analysis, Springer, pages 91–103, 2006.
14.
Zurück zum Zitat J. C. Niebles, H. Wang, and L. Fei-Fei. Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision, 79(3):299–318, 2008.CrossRef J. C. Niebles, H. Wang, and L. Fei-Fei. Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision, 79(3):299–318, 2008.CrossRef
15.
Zurück zum Zitat J. Zhu, J. Qi, and X. Kong. An improved method of action recognition based on sparse spatio-temporal features. Artificial Intelligence: Methodology, Systems, and Applications, pages 240–245, 2012. J. Zhu, J. Qi, and X. Kong. An improved method of action recognition based on sparse spatio-temporal features. Artificial Intelligence: Methodology, Systems, and Applications, pages 240–245, 2012.
16.
Zurück zum Zitat P. Liu, J. Wang, M. She, and H. Liu. Human action recognition based on 3d sift and lda model. In Proceeding of IEEE Workshop on Robotic Intelligence In Informationally Structured Space (RiiSS), pages 12–17, 2011. P. Liu, J. Wang, M. She, and H. Liu. Human action recognition based on 3d sift and lda model. In Proceeding of IEEE Workshop on Robotic Intelligence In Informationally Structured Space (RiiSS), pages 12–17, 2011.
17.
Zurück zum Zitat X. Jiang, T. Sun, B. Feng, and C. Jiang. A space-time surf descriptor and its application to action recognition with video words. In Proceeding of Eighth International Conference on Fuzzy Systems and Knowledge Discovery, pages 1911–1915. Vol.3Vol.3, 2011. X. Jiang, T. Sun, B. Feng, and C. Jiang. A space-time surf descriptor and its application to action recognition with video words. In Proceeding of Eighth International Conference on Fuzzy Systems and Knowledge Discovery, pages 1911–1915. Vol.3Vol.3, 2011.
18.
Zurück zum Zitat P. Scovanner, S. Ali, and M. Shah. A 3-dimensional sift descriptor and its application to action recognition. In Proceeding of the 15th international conference on Multimedia, pages 357–360. ACM, 2007. P. Scovanner, S. Ali, and M. Shah. A 3-dimensional sift descriptor and its application to action recognition. In Proceeding of the 15th international conference on Multimedia, pages 357–360. ACM, 2007.
19.
Zurück zum Zitat A. Kläser, M. Marszałek, C. Schmid, and L. Lear. A spatio-temporal descriptor based on 3d-gradients. In Proceeding of British Machine Vision Conference, UK, pages 1–10, 2008. A. Kläser, M. Marszałek, C. Schmid, and L. Lear. A spatio-temporal descriptor based on 3d-gradients. In Proceeding of British Machine Vision Conference, UK, pages 1–10, 2008.
20.
Zurück zum Zitat G. Willems, T. Tuytelaars, and L. Van Gool. An efficient dense and scale-invariant spatio-temporal interest point detector. In Proceeding of European Conference on Computer Vision, Springer,France, pages 650–663, 2008. G. Willems, T. Tuytelaars, and L. Van Gool. An efficient dense and scale-invariant spatio-temporal interest point detector. In Proceeding of European Conference on Computer Vision, Springer,France, pages 650–663, 2008.
21.
Zurück zum Zitat F. Li, C. Xiamen, and J. Du. Local spatio-temporal interest point detection for human action recognition. In IEEE 5th International Conference on Advanced Computational Intelligence, pages 1–10, 2012. F. Li, C. Xiamen, and J. Du. Local spatio-temporal interest point detection for human action recognition. In IEEE 5th International Conference on Advanced Computational Intelligence, pages 1–10, 2012.
22.
Zurück zum Zitat M. Bregonzio, S. Gong, and T. Xiang. Recognising action as clouds of space-time interest points. In Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pages 1948–1955, Florida, USA, 2009. IEEE. M. Bregonzio, S. Gong, and T. Xiang. Recognising action as clouds of space-time interest points. In Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pages 1948–1955, Florida, USA, 2009. IEEE.
23.
Zurück zum Zitat C.C. Chang and C.J. Lin. Libsvm: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1–27:27, May 2011. C.C. Chang and C.J. Lin. Libsvm: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1–27:27, May 2011.
24.
Zurück zum Zitat S. Umakanthan, S. Denman, S. Sridharan, C. Fookes, and T. Wark. Spatio temporal feature evaluation for action recognition. In Proceeding of International Conference on Digital Image Computing Techniques and Applications, pages 1–8, 2012. S. Umakanthan, S. Denman, S. Sridharan, C. Fookes, and T. Wark. Spatio temporal feature evaluation for action recognition. In Proceeding of International Conference on Digital Image Computing Techniques and Applications, pages 1–8, 2012.
25.
Zurück zum Zitat Jose M Chaquet, Enrique J Carmona, and Antonio Fernández-Caballero. A survey of video datasets for human action and activity recognition. Computer Vision and Image Understanding, 117(6):633–659, 2013. Jose M Chaquet, Enrique J Carmona, and Antonio Fernández-Caballero. A survey of video datasets for human action and activity recognition. Computer Vision and Image Understanding, 117(6):633–659, 2013.
Metadaten
Titel
Study of Human Action Recognition Based on Improved Spatio-Temporal Features
verfasst von
Honghai Liu
Zhaojie Ju
Xiaofei Ji
Chee Seng Chan
Mehdi Khoury
Copyright-Jahr
2017
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-53692-6_11