Skip to main content
Erschienen in: Pattern Analysis and Applications 3/2012

01.08.2012 | Short Paper

Three-dimensional action recognition using volume integrals

verfasst von: Luis Díaz-Más, Rafael Muñoz-Salinas, F. J. Madrid-Cuevas, R. Medina-Carnicer

Erschienen in: Pattern Analysis and Applications | Ausgabe 3/2012

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This work proposes the volume integral (VI) as a new descriptor for three-dimensional action recognition. The descriptor transforms the actor’s volumetric information into a two-dimensional representation by projecting the voxel data to a set of planes that maximize the discrimination of actions. Our descriptor significantly reduces the amount of data of the three-dimensional representations yet preserves the most important information. As a consequence, the action recognition process is greatly speeded up while achieving very high success rates. The method proposed is therefore especially appropriate for applications in which limitations of computing power and space are significant aspects to consider, such as real-time applications or mobile devices. Additionally, the descriptor is sensitive to reflected actions, i.e., same actions performed with different limbs can be differentiated. This paper tests the VI using several Dimensionality Reduction techniques (namely PCA, 2D-PCA, LDA) and different Machine Learning approaches (namely Clustering, SVM and HMM) so as to determine the best combination of these for the action recognition task. Experiments conducted on the public IXMAS dataset show that the VI compares favorably with state-of-the-art descriptors both in terms of classification rates and computing times.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Poppe R (2010) A survey on vision-based human action recognition. Image Vis Comput 28:976–990 Poppe R (2010) A survey on vision-based human action recognition. Image Vis Comput 28:976–990
2.
Zurück zum Zitat Turaga P, Chellappa R, Subrahmanian VS, Udrea O (2008) Machine recognition of human activities: a survey. IEEE Trans Circuits Syst Video Technol 18(11):1473–1488CrossRef Turaga P, Chellappa R, Subrahmanian VS, Udrea O (2008) Machine recognition of human activities: a survey. IEEE Trans Circuits Syst Video Technol 18(11):1473–1488CrossRef
3.
Zurück zum Zitat Bobick AF, Davis JW (2001) The recognition of human movement using temporal templates. IEEE Trans Pattern Anal Mach Intell 23:257–267CrossRef Bobick AF, Davis JW (2001) The recognition of human movement using temporal templates. IEEE Trans Pattern Anal Mach Intell 23:257–267CrossRef
4.
Zurück zum Zitat Ikizler N, Duygulu P (2009) Histogram of oriented rectangles: a new pose descriptor for human action recognition. Image Vis Comput 27(10):1515–1526CrossRef Ikizler N, Duygulu P (2009) Histogram of oriented rectangles: a new pose descriptor for human action recognition. Image Vis Comput 27(10):1515–1526CrossRef
5.
Zurück zum Zitat Chakraborty B, Rudovic O, Gonzalez J (2008) View-invariant human-body detection with extension to human action recognition using component-wise HMM of body parts. In: 2008 8th IEEE international conference on automatic face & gesture recognition. IEEE, pp 1–6 Chakraborty B, Rudovic O, Gonzalez J (2008) View-invariant human-body detection with extension to human action recognition using component-wise HMM of body parts. In: 2008 8th IEEE international conference on automatic face & gesture recognition. IEEE, pp 1–6
6.
Zurück zum Zitat Shin H-K, Lee S-W, Lee S-W (2005) Real-time gesture recognition using 3D motion history model. In: Proceedings of ICIC (1), pp 888–898 Shin H-K, Lee S-W, Lee S-W (2005) Real-time gesture recognition using 3D motion history model. In: Proceedings of ICIC (1), pp 888–898
7.
Zurück zum Zitat Roh M-C, Shin H-K, Lee S-W (2010) View-independent human action recognition with volume motion template on single stereo camera. Pattern Recognit Lett 31(7) Roh M-C, Shin H-K, Lee S-W (2010) View-independent human action recognition with volume motion template on single stereo camera. Pattern Recognit Lett 31(7)
8.
Zurück zum Zitat Muñoz-Salinas R, Medina-Carnicer R, Madrid-Cuevas FJ, Carmona-Poyato A (2008) Depth silhouettes for gesture recognition. Pattern Recognit Lett 29:319–329CrossRef Muñoz-Salinas R, Medina-Carnicer R, Madrid-Cuevas FJ, Carmona-Poyato A (2008) Depth silhouettes for gesture recognition. Pattern Recognit Lett 29:319–329CrossRef
9.
Zurück zum Zitat Weinland D, Ronfard R, Boyer E (2006) Free viewpoint action recognition using motion history volumes. Comput Vis Image Underst 104(2):249–257CrossRef Weinland D, Ronfard R, Boyer E (2006) Free viewpoint action recognition using motion history volumes. Comput Vis Image Underst 104(2):249–257CrossRef
10.
Zurück zum Zitat Yang Y, Hao A, Zhao Q (2008) View-invariant action recognition using interest points. In: International multimedia conference Yang Y, Hao A, Zhao Q (2008) View-invariant action recognition using interest points. In: International multimedia conference
11.
Zurück zum Zitat Cherla S, Kulkarni K, Kale A, Ramasubramanian V (2008) Towards fast, view-invariant human action recognition. In: 2008 IEEE Computer Society conference on computer vision and pattern recognition workshops. IEEE, pp 1–8 Cherla S, Kulkarni K, Kale A, Ramasubramanian V (2008) Towards fast, view-invariant human action recognition. In: 2008 IEEE Computer Society conference on computer vision and pattern recognition workshops. IEEE, pp 1–8
12.
Zurück zum Zitat Pingkun Y, Khan SM, Shah M (2008) Learning 4D action feature models for arbitrary view action recognition. In: 2008 IEEE conference on computer vision and pattern recognition. IEEE, pp 1–7 Pingkun Y, Khan SM, Shah M (2008) Learning 4D action feature models for arbitrary view action recognition. In: 2008 IEEE conference on computer vision and pattern recognition. IEEE, pp 1–7
13.
Zurück zum Zitat Ji X, Liu H (2010) Advances in view-invariant human motion analysis: a review. IEEE Trans Syst Man Cybernet C (Appl Rev) 40(1):13–24CrossRef Ji X, Liu H (2010) Advances in view-invariant human motion analysis: a review. IEEE Trans Syst Man Cybernet C (Appl Rev) 40(1):13–24CrossRef
14.
Zurück zum Zitat Peng B, Qian G, Rajko S (2009) View-invariant full-body gesture recognition via multilinear analysis of voxel data. ICDSC Peng B, Qian G, Rajko S (2009) View-invariant full-body gesture recognition via multilinear analysis of voxel data. ICDSC
15.
Zurück zum Zitat Brubaker MA, Fleet DJ, Hertzmann A (2009) Physics-based person tracking using the anthropomorphic walker. Int J Comput Vis 87(1–2):140–155 Brubaker MA, Fleet DJ, Hertzmann A (2009) Physics-based person tracking using the anthropomorphic walker. Int J Comput Vis 87(1–2):140–155
16.
Zurück zum Zitat Corazza S, Mündermann L, Gambaretto E, Ferrigno G, Andriacchi TP (2009) Markerless motion capture through visual hull, articulated ICP and subject specific model generation. Int J Comput Vis 87(1–2):156–169 Corazza S, Mündermann L, Gambaretto E, Ferrigno G, Andriacchi TP (2009) Markerless motion capture through visual hull, articulated ICP and subject specific model generation. Int J Comput Vis 87(1–2):156–169
17.
Zurück zum Zitat Li R, Tian T-P, Sclaroff S, Yang M-H (2009) 3D human motion tracking with a coordinated mixture of factor analyzers. Int J Comput Vis 87(1–2):170–190 Li R, Tian T-P, Sclaroff S, Yang M-H (2009) 3D human motion tracking with a coordinated mixture of factor analyzers. Int J Comput Vis 87(1–2):170–190
18.
Zurück zum Zitat Haritaoglu I, Harwood D, Davis LS (2000) W4: real-time surveillance of people and their activities. IEEE Trans Pattern Anal Mach Intell 22:809–830CrossRef Haritaoglu I, Harwood D, Davis LS (2000) W4: real-time surveillance of people and their activities. IEEE Trans Pattern Anal Mach Intell 22:809–830CrossRef
19.
Zurück zum Zitat Haritaoglu I, Cutler R, Harwood D, Davis LS (1999) Backpack: detection of people carrying objects using silhouettes. Comput Vis Image Underst 81:102–107 Haritaoglu I, Cutler R, Harwood D, Davis LS (1999) Backpack: detection of people carrying objects using silhouettes. Comput Vis Image Underst 81:102–107
20.
Zurück zum Zitat Cucchiara R, Grana C, Prati A, Vezzani R (2005) Probabilistic posture classification for human-behavior analysis. IEEE Trans Syst Man Cybernet A: Syst Humans 35(1):42–54CrossRef Cucchiara R, Grana C, Prati A, Vezzani R (2005) Probabilistic posture classification for human-behavior analysis. IEEE Trans Syst Man Cybernet A: Syst Humans 35(1):42–54CrossRef
21.
Zurück zum Zitat Juang C-F, Chang C-M (2007) Human body posture classification by a neural fuzzy network and home care system application. IEEE Trans Syst Man Cybernet A: Syst Humans 37(6):984–994MathSciNetCrossRef Juang C-F, Chang C-M (2007) Human body posture classification by a neural fuzzy network and home care system application. IEEE Trans Syst Man Cybernet A: Syst Humans 37(6):984–994MathSciNetCrossRef
22.
Zurück zum Zitat Souvenir R, Parrigan K (2009) Viewpoint manifolds for action recognition. EURASIP J Image Video Process 2009:1–13 Souvenir R, Parrigan K (2009) Viewpoint manifolds for action recognition. EURASIP J Image Video Process 2009:1–13
23.
Zurück zum Zitat Lv F, Nevatia R (2007) Single view human action recognition using key pose matching and viterbi path searching. In: IEEE conference on computer vision and pattern recognition, pp 1–8 Lv F, Nevatia R (2007) Single view human action recognition using key pose matching and viterbi path searching. In: IEEE conference on computer vision and pattern recognition, pp 1–8
24.
Zurück zum Zitat Ji X, Liu H (2009) View-invariant human action recognition using exemplar-based hidden Markov models. Lect Notes Comput Sci 5928:78–89CrossRef Ji X, Liu H (2009) View-invariant human action recognition using exemplar-based hidden Markov models. Lect Notes Comput Sci 5928:78–89CrossRef
25.
Zurück zum Zitat Weinland D, Boyer E, Ronfard R (2007) Action recognition from arbitrary views using 3D exemplars. In: 2007 IEEE 11th international conference on computer vision. IEEE, pp 1–7 Weinland D, Boyer E, Ronfard R (2007) Action recognition from arbitrary views using 3D exemplars. In: 2007 IEEE 11th international conference on computer vision. IEEE, pp 1–7
26.
Zurück zum Zitat Laurentini A (1991) The visual hull: a new tool for contour-based image understanding. In: Proceedings of seventh Scandinavian conference on image processing, pp 993–1002 Laurentini A (1991) The visual hull: a new tool for contour-based image understanding. In: Proceedings of seventh Scandinavian conference on image processing, pp 993–1002
27.
Zurück zum Zitat Díaz-Más L, Muñoz-Salinas R, Madrid-Cuevas FJ, Medina-Carnicer R (2010) Shape from silhouette using dempster-shafer theory. Pattern Recognit 43(6):2119–2131 Díaz-Más L, Muñoz-Salinas R, Madrid-Cuevas FJ, Medina-Carnicer R (2010) Shape from silhouette using dempster-shafer theory. Pattern Recognit 43(6):2119–2131
28.
Zurück zum Zitat Landabaso JL, Pardàs M, Ramon Casas J (2008) Shape from inconsistent silhouette. Comput Vis Image Underst 112:210–224CrossRef Landabaso JL, Pardàs M, Ramon Casas J (2008) Shape from inconsistent silhouette. Comput Vis Image Underst 112:210–224CrossRef
29.
Zurück zum Zitat Bishop CM (2007) Pattern recognition and machine learning (information science and statistics), 1st edn, 2006. Springer. corr. 2nd printing edition, October 2007 Bishop CM (2007) Pattern recognition and machine learning (information science and statistics), 1st edn, 2006. Springer. corr. 2nd printing edition, October 2007
30.
Zurück zum Zitat Sheskin DJ (2007) Handbook of parametric and nonparametric statistical procedures, 4th edn. Chapman & Hall/CRC Sheskin DJ (2007) Handbook of parametric and nonparametric statistical procedures, 4th edn. Chapman & Hall/CRC
31.
Zurück zum Zitat Devore JL, (2008) Probability and statistics for engineering and the sciences, 7th edn. Thomson Brooks/Cole Devore JL, (2008) Probability and statistics for engineering and the sciences, 7th edn. Thomson Brooks/Cole
Metadaten
Titel
Three-dimensional action recognition using volume integrals
verfasst von
Luis Díaz-Más
Rafael Muñoz-Salinas
F. J. Madrid-Cuevas
R. Medina-Carnicer
Publikationsdatum
01.08.2012
Verlag
Springer-Verlag
Erschienen in
Pattern Analysis and Applications / Ausgabe 3/2012
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-011-0239-5

Weitere Artikel der Ausgabe 3/2012

Pattern Analysis and Applications 3/2012 Zur Ausgabe

Premium Partner