Neural Computing and Applications

1 October 2016 | Original Article

Human action recognition on depth dataset

Authors: Zan Gao, Hua Zhang, Anan A. Liu, Guangping Xu, Yanbing Xue

Published in: Neural Computing and Applications | Issue 7/2016

Abstract

Human action recognition is an active research topic; however, changes in shape, high variability in appearance, dynamic backgrounds, potential occlusions across different actions, and the imaging limitations of 2D sensors make it difficult. To address these problems, we focus on the depth channel and on the fusion of different features. We first extract different features from depth image sequences and then propose a multi-feature mapping and dictionary learning model (MMDLM) to discover the relationship between these features, in which two dictionaries and a feature mapping function are learned simultaneously. The dictionaries characterize the structural information of the different features, while the feature mapping function acts as a regularization term that reveals the intrinsic relationship between the two features. Large-scale experiments on two public depth datasets, MSRAction3D and DHA, show that the performance of the different depth features varies widely but that the features are complementary. Furthermore, feature fusion by MMDLM is efficient and effective on both datasets, with performance comparable to state-of-the-art methods.
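The abstract does not give the MMDLM objective, so the following is only a minimal sketch of the general idea it describes: two dictionaries and a mapping between their codes learned jointly. Everything specific here is an assumption made for illustration: the function names `objective` and `fit_coupled_dictionaries`, the weights `gamma`, `lam` and `mu`, the linear form of the mapping `W`, and the use of ridge (squared L2) penalties in place of a sparsity penalty so that every update has a closed form. It should not be read as the authors' formulation.

```python
import numpy as np


def objective(X1, X2, D1, D2, W, A1, A2, gamma, lam, mu):
    """Assumed joint cost: reconstruction + code-mapping term + ridge penalties."""
    recon = np.sum((X1 - D1 @ A1) ** 2) + np.sum((X2 - D2 @ A2) ** 2)
    mapping = gamma * np.sum((A2 - W @ A1) ** 2)
    ridge = lam * (np.sum(A1 ** 2) + np.sum(A2 ** 2))
    ridge += mu * (np.sum(D1 ** 2) + np.sum(D2 ** 2) + np.sum(W ** 2))
    return recon + mapping + ridge


def fit_coupled_dictionaries(X1, X2, n_atoms=8, gamma=1.0, lam=0.1, mu=0.1,
                             n_iter=30, seed=0):
    """Alternating minimization over codes, dictionaries and mapping.

    X1: (d1, n) matrix of one feature type, one column per sample.
    X2: (d2, n) matrix of a second feature type for the same samples.
    Hypothetical illustration only, not the method from the paper.
    """
    rng = np.random.default_rng(seed)
    n = X1.shape[1]
    D1 = rng.standard_normal((X1.shape[0], n_atoms))
    D2 = rng.standard_normal((X2.shape[0], n_atoms))
    A1 = rng.standard_normal((n_atoms, n))
    A2 = rng.standard_normal((n_atoms, n))
    W = np.eye(n_atoms)
    I = np.eye(n_atoms)
    history = []
    for _ in range(n_iter):
        # Codes: each update is the exact minimizer with the other blocks fixed.
        A1 = np.linalg.solve(D1.T @ D1 + gamma * W.T @ W + lam * I,
                             D1.T @ X1 + gamma * W.T @ A2)
        A2 = np.linalg.solve(D2.T @ D2 + (gamma + lam) * I,
                             D2.T @ X2 + gamma * W @ A1)
        # Dictionaries: ridge regression of the data on the current codes.
        D1 = np.linalg.solve(A1 @ A1.T + mu * I, A1 @ X1.T).T
        D2 = np.linalg.solve(A2 @ A2.T + mu * I, A2 @ X2.T).T
        # Mapping: ridge regression of the second code set on the first.
        W = np.linalg.solve(gamma * A1 @ A1.T + mu * I, gamma * A1 @ A2.T).T
        history.append(objective(X1, X2, D1, D2, W, A1, A2, gamma, lam, mu))
    return D1, D2, W, A1, A2, history


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X1 = rng.standard_normal((20, 50))   # synthetic stand-in for one depth feature
    X2 = rng.standard_normal((15, 50))   # synthetic stand-in for another
    D1, D2, W, A1, A2, history = fit_coupled_dictionaries(X1, X2)
    fused = np.vstack([A1, A2])          # one possible fused descriptor per sample
    print(fused.shape, history[0] >= history[-1])
```

Because each block update exactly minimizes the assumed cost with the other blocks held fixed, the recorded cost should not increase from one sweep to the next. Stacking the two code matrices is just one plausible way to form a fused descriptor; a sparsity-inducing penalty would need an iterative solver for the code updates instead of the closed forms used here.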

MSRAction3D dataset: http://research.microsoft.com/enus/um/people/zliu/actionrecorsrc/default.htm

DHA dataset: http://mclab.citi.sinica.edu.tw/dataset/dha/dha.html

Title: Human action recognition on depth dataset
Authors: Zan Gao, Hua Zhang, Anan A. Liu, Guangping Xu, Yanbing Xue
Publication date: 1 October 2016
Publisher: Springer London
Published in: Neural Computing and Applications / Issue 7/2016
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI: https://doi.org/10.1007/s00521-015-2002-0
