Skip to main content
Erschienen in: Neural Computing and Applications 9/2020

21.08.2018 | Original Article

NMF with local constraint and Deep NMF with temporal dependencies constraint for action recognition

verfasst von: Ming Tong, Yiran Chen, Lei Ma, He Bai, Xing Yue

Erschienen in: Neural Computing and Applications | Ausgabe 9/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In order to improve action recognition accuracy, a new nonnegative matrix factorization with local constraint (LC-NMF) is firstly presented. By applying it for effective trajectory clustering, complex backgrounds are removed and then the motion salient regions are obtained. Secondly, a nonnegative matrix factorization with temporal dependencies constraint (TD-NMF) is proposed, which fully mines the spatiotemporal relationship in a video not only between adjacent frames, but also between multi-interval frames. Meanwhile, the introduction of \( l_{2,1} \)-norm makes the spatiotemporal features possess better sparseness and robustness. In addition, these features are directly learned from data and thus have an inherent generalization ability. Finally, a Deep NMF method is established, which takes the proposed TD-NMF as the unit algorithm of each layer. By introducing the hierarchical feature extraction strategy, the base matrix of the first layer is gradually decomposed; then, it is supplemented and completed layer by layer. Consequently, the more complete and accurate local feature estimations are obtained, and then the discriminative and expressive abilities of features are effectively enhanced and recognition performance is further improved. Adequate and extensive experiments verify the effectiveness of the proposed methods. Moreover, the update rules and convergence proofs for LC-NMF and TD-NMF are also given.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Gao Z, Zhang H, Liu AA, Xu G, Xue Y (2016) Human action recognition on depth dataset. Neural Comput Appl 27(7):2047–2054 Gao Z, Zhang H, Liu AA, Xu G, Xue Y (2016) Human action recognition on depth dataset. Neural Comput Appl 27(7):2047–2054
2.
Zurück zum Zitat Laptev I, Marszalek M, Schmid C, Rozenfeld B (2008) Learning realistic human actions from movies. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1–8 Laptev I, Marszalek M, Schmid C, Rozenfeld B (2008) Learning realistic human actions from movies. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1–8
3.
Zurück zum Zitat Wang H, Oneata D, Verbeek J, Schmid C (2016) A robust and efficient video representation for action recognition. Int J Comput Vis 119(3):219–238MathSciNet Wang H, Oneata D, Verbeek J, Schmid C (2016) A robust and efficient video representation for action recognition. Int J Comput Vis 119(3):219–238MathSciNet
4.
Zurück zum Zitat Caetano C, dos Santos JA, Schwartz WR (2016) Optical flow co-occurrence matrices: a novel spatiotemporal feature descriptor. In: Proceedings of the international conference on pattern recognition (ICPR), pp 1947–1952 Caetano C, dos Santos JA, Schwartz WR (2016) Optical flow co-occurrence matrices: a novel spatiotemporal feature descriptor. In: Proceedings of the international conference on pattern recognition (ICPR), pp 1947–1952
5.
Zurück zum Zitat Colque RVHM, Caetano C, de Andrade MTL, Schwartz WR (2017) Histograms of optical flow orientation and magnitude and entropy to detect anomalous events in videos. IEEE Trans Circuits Syst Video Technol 27(3):673–682 Colque RVHM, Caetano C, de Andrade MTL, Schwartz WR (2017) Histograms of optical flow orientation and magnitude and entropy to detect anomalous events in videos. IEEE Trans Circuits Syst Video Technol 27(3):673–682
6.
Zurück zum Zitat Dollar P, Rabaud V, Cottrell G, Belongie S (2005) Behavior recognition via sparse spatio-temporal features. In: IEEE international workshop on visual surveillance and performance evaluation of tracking and surveillance, pp 65–72 Dollar P, Rabaud V, Cottrell G, Belongie S (2005) Behavior recognition via sparse spatio-temporal features. In: IEEE international workshop on visual surveillance and performance evaluation of tracking and surveillance, pp 65–72
7.
Zurück zum Zitat Wang H, Kläser A, Schmid C, Liu CL (2013) Dense trajectories and motion boundary descriptors for action recognition. Int J Comput Vis 103(1):60–79MathSciNet Wang H, Kläser A, Schmid C, Liu CL (2013) Dense trajectories and motion boundary descriptors for action recognition. Int J Comput Vis 103(1):60–79MathSciNet
8.
Zurück zum Zitat Sun L, Jia K, Chan TH, Fang Y, Wang G, Yan S (2014) DL-SFA: deeply-learned slow feature analysis for action recognition In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 2625–2632 Sun L, Jia K, Chan TH, Fang Y, Wang G, Yan S (2014) DL-SFA: deeply-learned slow feature analysis for action recognition In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 2625–2632
9.
Zurück zum Zitat Le QV, Zou WY, Yeung SY, Ng AY (2011) Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 3361–3368 Le QV, Zou WY, Yeung SY, Ng AY (2011) Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 3361–3368
10.
Zurück zum Zitat Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788–791MATH Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788–791MATH
11.
Zurück zum Zitat Hinton GE, Osindero S, Teh YW (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554MathSciNetMATH Hinton GE, Osindero S, Teh YW (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554MathSciNetMATH
12.
Zurück zum Zitat Taheri S, Qiu Q, Chellappa R (2014) Structure-preserving sparse decomposition for facial expression analysis. IEEE Trans Image Process 23(8):3590–3603MathSciNetMATH Taheri S, Qiu Q, Chellappa R (2014) Structure-preserving sparse decomposition for facial expression analysis. IEEE Trans Image Process 23(8):3590–3603MathSciNetMATH
13.
Zurück zum Zitat Sun Y, Quan Y, Fu J (2018) Sparse coding and dictionary learning with class-specific group sparsity. Neural Comput Appl 30(4):1265–1275 Sun Y, Quan Y, Fu J (2018) Sparse coding and dictionary learning with class-specific group sparsity. Neural Comput Appl 30(4):1265–1275
14.
Zurück zum Zitat Wang X, Gao L, Song J, Zhen X, Sebe N, Shen HT (2018) Deep appearance and motion learning for egocentric activity recognition. Neurocomputing 275:438–447 Wang X, Gao L, Song J, Zhen X, Sebe N, Shen HT (2018) Deep appearance and motion learning for egocentric activity recognition. Neurocomputing 275:438–447
15.
Zurück zum Zitat Wold S, Esbensen K, Geladi P (1987) Principal component analysis. Chemom Intell Lab Syst 2(1–3):37–52 Wold S, Esbensen K, Geladi P (1987) Principal component analysis. Chemom Intell Lab Syst 2(1–3):37–52
16.
Zurück zum Zitat Delorme A, Makeig S (2004) EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods 134(1):9–21 Delorme A, Makeig S (2004) EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods 134(1):9–21
17.
Zurück zum Zitat Lu Y, Lai Z, Xu Y, Zhang D, Yuan C (2017) Nonnegative discriminant matrix factorization. IEEE Trans Circuits Syst Video Technol 27(7):1392–1405 Lu Y, Lai Z, Xu Y, Zhang D, Yuan C (2017) Nonnegative discriminant matrix factorization. IEEE Trans Circuits Syst Video Technol 27(7):1392–1405
18.
Zurück zum Zitat Gong C, Tao D, Fu K, Yang J (2015) Fick’s law assisted propagation for semisupervised learning. IEEE Trans Neural Netw Learn Syst 26(9):2148–2162MathSciNet Gong C, Tao D, Fu K, Yang J (2015) Fick’s law assisted propagation for semisupervised learning. IEEE Trans Neural Netw Learn Syst 26(9):2148–2162MathSciNet
19.
Zurück zum Zitat Li Z, Tang J, He X (2018) Robust structured nonnegative matrix factorization for image representation. IEEE Trans Neural Netw Learn Syst 29(5):1947–1960MathSciNet Li Z, Tang J, He X (2018) Robust structured nonnegative matrix factorization for image representation. IEEE Trans Neural Netw Learn Syst 29(5):1947–1960MathSciNet
20.
Zurück zum Zitat Tepper M, Sapiro G (2016) Compressed nonnegative matrix factorization is fast and accurate. IEEE Trans Signal Process 64(9):2269–2283MathSciNetMATH Tepper M, Sapiro G (2016) Compressed nonnegative matrix factorization is fast and accurate. IEEE Trans Signal Process 64(9):2269–2283MathSciNetMATH
21.
Zurück zum Zitat Trigeorgis G, Bousmalis K, Zafeiriou S, Schuller BW (2017) A deep matrix factorization method for learning attribute representations. IEEE Trans Pattern Anal Mach Intell 39(3):417–429 Trigeorgis G, Bousmalis K, Zafeiriou S, Schuller BW (2017) A deep matrix factorization method for learning attribute representations. IEEE Trans Pattern Anal Mach Intell 39(3):417–429
22.
Zurück zum Zitat Thurau C, Hlavác V (2008) Pose primitive based human action recognition in videos or still images. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1–8 Thurau C, Hlavác V (2008) Pose primitive based human action recognition in videos or still images. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1–8
23.
Zurück zum Zitat Yang Y, Tu D, Li G (2014) Gait recognition using flow histogram energy image. In: Proceedings of the international conference on pattern recognition (ICPR), pp 444–449 Yang Y, Tu D, Li G (2014) Gait recognition using flow histogram energy image. In: Proceedings of the international conference on pattern recognition (ICPR), pp 444–449
24.
Zurück zum Zitat Zhang H, Cao X, Ho JKL, Chow TWS (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inf 13(2):520–531 Zhang H, Cao X, Ho JKL, Chow TWS (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inf 13(2):520–531
25.
Zurück zum Zitat Yi Y, Lin M (2016) Human action recognition with graph-based multiple-instance learning. Pattern Recognit 53:148–162 Yi Y, Lin M (2016) Human action recognition with graph-based multiple-instance learning. Pattern Recognit 53:148–162
26.
Zurück zum Zitat Cho J, Lee M, Chang HJ, Oh S (2014) Robust action recognition using local motion and group sparsity. Pattern Recognit 47(5):1813–1825 Cho J, Lee M, Chang HJ, Oh S (2014) Robust action recognition using local motion and group sparsity. Pattern Recognit 47(5):1813–1825
27.
Zurück zum Zitat Hu W, Choi KS, Wang P, Jiang Y, Wang S (2015) Convex nonnegative matrix factorization with manifold regularization. Neural Netw 63:94–103MATH Hu W, Choi KS, Wang P, Jiang Y, Wang S (2015) Convex nonnegative matrix factorization with manifold regularization. Neural Netw 63:94–103MATH
28.
Zurück zum Zitat Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recognit Lett 31(8):651–666 Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recognit Lett 31(8):651–666
29.
Zurück zum Zitat Arias-Castro E, Lerman G, Zhang T (2017) Spectral clustering based on local PCA. J Mach Learn Res 18(1):253–309MathSciNetMATH Arias-Castro E, Lerman G, Zhang T (2017) Spectral clustering based on local PCA. J Mach Learn Res 18(1):253–309MathSciNetMATH
30.
Zurück zum Zitat Tian Y, Ruan Q, An G, Liu R (2014) Local non-negative component representation for human action recognition. In: Proceedings of the IEEE international conference on signal processing (ICSP), pp 1317–1320 Tian Y, Ruan Q, An G, Liu R (2014) Local non-negative component representation for human action recognition. In: Proceedings of the IEEE international conference on signal processing (ICSP), pp 1317–1320
31.
Zurück zum Zitat Vollmer C, Hellbach S, Eggert J, Gross HM (2014) Sparse coding of human motion trajectories with non-negative matrix factorization. Neurocomputing 124:22–32 Vollmer C, Hellbach S, Eggert J, Gross HM (2014) Sparse coding of human motion trajectories with non-negative matrix factorization. Neurocomputing 124:22–32
32.
Zurück zum Zitat Zafeiriou L, Nikitidis S, Zafeiriou S, Pantic M (2014) Slow features nonnegative matrix factorization for temporal data decomposition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 1430–1434 Zafeiriou L, Nikitidis S, Zafeiriou S, Pantic M (2014) Slow features nonnegative matrix factorization for temporal data decomposition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 1430–1434
33.
Zurück zum Zitat Zafeiriou L, Panagakis Y, Pantic M, Zafeiriou S (2017) Nonnegative decompositions for dynamic visual data Analysis. IEEE Trans Image Process 26(12):5603–5617MathSciNetMATH Zafeiriou L, Panagakis Y, Pantic M, Zafeiriou S (2017) Nonnegative decompositions for dynamic visual data Analysis. IEEE Trans Image Process 26(12):5603–5617MathSciNetMATH
34.
Zurück zum Zitat Xiao Q, Cheng J, Jiang J, Feng W (2014) Position-based action recognition using high dimension index tree. In: Proceedings of the international conference on pattern recognition (ICPR), pp 4400–4405 Xiao Q, Cheng J, Jiang J, Feng W (2014) Position-based action recognition using high dimension index tree. In: Proceedings of the international conference on pattern recognition (ICPR), pp 4400–4405
35.
Zurück zum Zitat Ji X, Cheng J, Tao D (2015) Local mean spatio-temporal feature for depth image-based speed-up action recognition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 2389–2393 Ji X, Cheng J, Tao D (2015) Local mean spatio-temporal feature for depth image-based speed-up action recognition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 2389–2393
36.
Zurück zum Zitat Roth PM, Mauthner T, Khan I, Bischof H (2009) Efficient human action recognition by cascaded linear classification. In: Proceedings of the IEEE international conference on computer vision workshops (ICCV Workshops), pp 546–553 Roth PM, Mauthner T, Khan I, Bischof H (2009) Efficient human action recognition by cascaded linear classification. In: Proceedings of the IEEE international conference on computer vision workshops (ICCV Workshops), pp 546–553
37.
Zurück zum Zitat Wang H, Yuan C, Hu W, Ling H, Yang W, Sun C (2014) Action recognition using nonnegative action component representation and sparse basis selection. IEEE Trans Image Process 23(2):570–581MathSciNetMATH Wang H, Yuan C, Hu W, Ling H, Yang W, Sun C (2014) Action recognition using nonnegative action component representation and sparse basis selection. IEEE Trans Image Process 23(2):570–581MathSciNetMATH
38.
Zurück zum Zitat Wang J, Zhang P, Luo L (2016) Nonnegative component representation with hierarchical dictionary learning strategy for action recognition. IEICE Trans Inf Syst 99(4):1259–1263 Wang J, Zhang P, Luo L (2016) Nonnegative component representation with hierarchical dictionary learning strategy for action recognition. IEICE Trans Inf Syst 99(4):1259–1263
39.
Zurück zum Zitat Sekma M, Mejdoub M, Amar CB (2015) Human action recognition based on multi-layer fisher vector encoding method. Pattern Recognit Lett 65:37–43 Sekma M, Mejdoub M, Amar CB (2015) Human action recognition based on multi-layer fisher vector encoding method. Pattern Recognit Lett 65:37–43
40.
Zurück zum Zitat Yu YF, Dai DQ, Ren CX, Hang KK (2017) Discriminative multi-layer illumination-robust feature extraction for face recognition. Pattern Recognit 67:201–212 Yu YF, Dai DQ, Ren CX, Hang KK (2017) Discriminative multi-layer illumination-robust feature extraction for face recognition. Pattern Recognit 67:201–212
42.
Zurück zum Zitat Liao Q, Zhang Q (2016) Local coordinate based graph-regularized NMF for image representation. Signal Process 124:103–114 Liao Q, Zhang Q (2016) Local coordinate based graph-regularized NMF for image representation. Signal Process 124:103–114
43.
Zurück zum Zitat Wang H, Schmid C (2013) Action recognition with improved trajectories. In: Proceedings of IEEE international conference on computer vision (ICCV), pp 3551–3558 Wang H, Schmid C (2013) Action recognition with improved trajectories. In: Proceedings of IEEE international conference on computer vision (ICCV), pp 3551–3558
44.
Zurück zum Zitat Nicholson WB, Matteson DS, Bien J (2014) Structured regularization for large vector autoregression. Cornell University, Ithaca Nicholson WB, Matteson DS, Bien J (2014) Structured regularization for large vector autoregression. Cornell University, Ithaca
45.
Zurück zum Zitat Gorelick L, Blank M, Shechtman E, Irani M, Basri R (2007) Actions as space-time shapes. IEEE Trans Pattern Anal Mach Intell 29(12):2247–2253 Gorelick L, Blank M, Shechtman E, Irani M, Basri R (2007) Actions as space-time shapes. IEEE Trans Pattern Anal Mach Intell 29(12):2247–2253
46.
Zurück zum Zitat Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: a local SVM approach. In: Proceedings of the international conference on pattern recognition (ICPR), pp 32–36 Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: a local SVM approach. In: Proceedings of the international conference on pattern recognition (ICPR), pp 32–36
47.
Zurück zum Zitat Rodriguez MD, Ahmed J, Shah M (2008) Action mach a spatio-temporal maximum average correlation height filter for action recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1–8 Rodriguez MD, Ahmed J, Shah M (2008) Action mach a spatio-temporal maximum average correlation height filter for action recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1–8
48.
Zurück zum Zitat Liu J, Luo J, Shah M (2009) Recognizing realistic actions from videos “in the wild”. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1996–2003 Liu J, Luo J, Shah M (2009) Recognizing realistic actions from videos “in the wild”. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1996–2003
49.
Zurück zum Zitat Kimura K, Kudo M, Tanaka Y (2016) A column-wise update algorithm for nonnegative matrix factorization in Bregman divergence with an orthogonal constraint. Mach Learn 103(2):285–306MathSciNetMATH Kimura K, Kudo M, Tanaka Y (2016) A column-wise update algorithm for nonnegative matrix factorization in Bregman divergence with an orthogonal constraint. Mach Learn 103(2):285–306MathSciNetMATH
50.
Zurück zum Zitat Allab K, Labiod L, Nadif M (2017) A semi-NMF-PCA unified framework for data clustering. IEEE Trans Knowl Data Eng 29(1):2–16MATH Allab K, Labiod L, Nadif M (2017) A semi-NMF-PCA unified framework for data clustering. IEEE Trans Knowl Data Eng 29(1):2–16MATH
51.
Zurück zum Zitat Zhang X, Yang Y, Jia H, Zhou H, Jiao L (2014) Low-rank representation based action recognition. In: Proceedings of the IEEE international joint conference on neural networks (IJCNN), pp 1812–1818 Zhang X, Yang Y, Jia H, Zhou H, Jiao L (2014) Low-rank representation based action recognition. In: Proceedings of the IEEE international joint conference on neural networks (IJCNN), pp 1812–1818
52.
Zurück zum Zitat Sheng B, Yang W, Sun C (2015) Action recognition using direction-dependent feature pairs and non-negative low rank sparse model. Neurocomputing 158:73–80 Sheng B, Yang W, Sun C (2015) Action recognition using direction-dependent feature pairs and non-negative low rank sparse model. Neurocomputing 158:73–80
53.
Zurück zum Zitat Kulkarni K, Turaga P (2016) Reconstruction-free action inference from compressive imagers. IEEE Trans Pattern Anal Mach Intell 38(4):772–784 Kulkarni K, Turaga P (2016) Reconstruction-free action inference from compressive imagers. IEEE Trans Pattern Anal Mach Intell 38(4):772–784
54.
Zurück zum Zitat Barrett DP, Siskind JM (2016) Action recognition by time series of retinotopic appearance and motion features. IEEE Trans Circuits Syst Video Technol 26(12):2250–2263 Barrett DP, Siskind JM (2016) Action recognition by time series of retinotopic appearance and motion features. IEEE Trans Circuits Syst Video Technol 26(12):2250–2263
55.
Zurück zum Zitat Azhar F, Li CT (2017) Hierarchical relaxed partitioning system for activity recognition. IEEE Trans Cybern 47(3):784–795 Azhar F, Li CT (2017) Hierarchical relaxed partitioning system for activity recognition. IEEE Trans Cybern 47(3):784–795
56.
Zurück zum Zitat Ji S, Xu W, Yang M, Yu K (2013) 3D convolutional neural networks for human action recognition. IEEE Trans Pattern Anal Mach Intell 35(1):221–231 Ji S, Xu W, Yang M, Yu K (2013) 3D convolutional neural networks for human action recognition. IEEE Trans Pattern Anal Mach Intell 35(1):221–231
57.
Zurück zum Zitat Umakanthan S, Denman S, Fookes C, Sridharan S (2014) Multiple instance dictionary learning for activity representation. In: Proceedings of the international conference on pattern recognition (ICPR), pp 1377–1382 Umakanthan S, Denman S, Fookes C, Sridharan S (2014) Multiple instance dictionary learning for activity representation. In: Proceedings of the international conference on pattern recognition (ICPR), pp 1377–1382
58.
Zurück zum Zitat Liu AA, Xu N, Su YT, Hao T, Yang ZX (2015) Single/multi-view human action recognition via regularized multi-task learning. Neurocomputing 151:544–553 Liu AA, Xu N, Su YT, Hao T, Yang ZX (2015) Single/multi-view human action recognition via regularized multi-task learning. Neurocomputing 151:544–553
59.
Zurück zum Zitat Leyva R, Sanchez V, Li CT (2016) A fast binary pair-based video descriptor for action recognition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 4185–4189 Leyva R, Sanchez V, Li CT (2016) A fast binary pair-based video descriptor for action recognition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 4185–4189
60.
Zurück zum Zitat Baumann F, Ehlers A, Rosenhahn B, Liao J (2016) Recognizing human actions using novel space-time volume binary patterns. Neurocomputing 173:54–63 Baumann F, Ehlers A, Rosenhahn B, Liao J (2016) Recognizing human actions using novel space-time volume binary patterns. Neurocomputing 173:54–63
61.
Zurück zum Zitat Lan T, Zhu Y, Roshan Zamir A, Savarse Silvio (2015) Action recognition by hierarchical mid-level action elements. In: Proceedings of the IEEE international conference on computer vision (ICCV), pp 4552–4560 Lan T, Zhu Y, Roshan Zamir A, Savarse Silvio (2015) Action recognition by hierarchical mid-level action elements. In: Proceedings of the IEEE international conference on computer vision (ICCV), pp 4552–4560
62.
Zurück zum Zitat Yuan C, Wu B, Li X, Hu W, Maybank S, Wang F (2016) Fusing R features and local features with context-aware kernels for action recognition. Int J Comput Vis 118(2):151–171MathSciNet Yuan C, Wu B, Li X, Hu W, Maybank S, Wang F (2016) Fusing R features and local features with context-aware kernels for action recognition. Int J Comput Vis 118(2):151–171MathSciNet
63.
Zurück zum Zitat Yao T, Wang Z, Xie Z, Gao J, Feng DD (2017) Learning universal multiview dictionary for human action recognition. Pattern Recognit 64:236–244 Yao T, Wang Z, Xie Z, Gao J, Feng DD (2017) Learning universal multiview dictionary for human action recognition. Pattern Recognit 64:236–244
64.
Zurück zum Zitat Tian Y, Kong Y, Ruan Q, An G, Fu Y (2018) Hierarchical and spatio-temporal sparse representation for human action recognition. IEEE Trans Image Process 27(4):1748–1762MathSciNet Tian Y, Kong Y, Ruan Q, An G, Fu Y (2018) Hierarchical and spatio-temporal sparse representation for human action recognition. IEEE Trans Image Process 27(4):1748–1762MathSciNet
65.
Zurück zum Zitat Yang Y, Saleemi I, Shah M (2013) Discovering motion primitives for unsupervised grouping and one-shot learning of human actions, gestures, and expressions. IEEE Trans Pattern Anal Mach Intell 35(7):1635–1648 Yang Y, Saleemi I, Shah M (2013) Discovering motion primitives for unsupervised grouping and one-shot learning of human actions, gestures, and expressions. IEEE Trans Pattern Anal Mach Intell 35(7):1635–1648
66.
Zurück zum Zitat Samanta S, Chanda B (2014) Space-time facet model for human activity classification. IEEE Trans Multimedia 16(6):1525–1535 Samanta S, Chanda B (2014) Space-time facet model for human activity classification. IEEE Trans Multimedia 16(6):1525–1535
67.
Zurück zum Zitat Tian Y, Ruan Q, An G, Xu W (2015) Context and locality constrained linear coding for human action recognition. Neurocomputing 167:359–370 Tian Y, Ruan Q, An G, Xu W (2015) Context and locality constrained linear coding for human action recognition. Neurocomputing 167:359–370
68.
Zurück zum Zitat Chatzis SP, Kosmopoulos D (2015) A nonparametric bayesian approach toward stacked convolutional independent component analysis. In: Proceedings of the IEEE international conference on computer vision (ICCV), pp 2803–2811 Chatzis SP, Kosmopoulos D (2015) A nonparametric bayesian approach toward stacked convolutional independent component analysis. In: Proceedings of the IEEE international conference on computer vision (ICCV), pp 2803–2811
69.
Zurück zum Zitat Zhou T, Li N, Cheng X, Xu Q, Zhou L, Wu Z (2016) Learning semantic context feature-tree for action recognition via nearest neighbor fusion. Neurocomputing 201:1–11 Zhou T, Li N, Cheng X, Xu Q, Zhou L, Wu Z (2016) Learning semantic context feature-tree for action recognition via nearest neighbor fusion. Neurocomputing 201:1–11
70.
Zurück zum Zitat Tian Y, Sukthankar R, Shah M (2013) Spatiotemporal deformable part models for action detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 2642–2649 Tian Y, Sukthankar R, Shah M (2013) Spatiotemporal deformable part models for action detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 2642–2649
71.
Zurück zum Zitat Hsu YP, Liu C, Chen TY, Fu LC (2016) Online view-invariant human action recognition using rgb-d spatio-temporal matrix. Pattern Recognit 60:215–226 Hsu YP, Liu C, Chen TY, Fu LC (2016) Online view-invariant human action recognition using rgb-d spatio-temporal matrix. Pattern Recognit 60:215–226
72.
Zurück zum Zitat Parisi GI, Wermter S (2017) Lifelong learning of action representations with deep neural self-organization. In: The AAAI 2017 spring symposium on science of intelligence: computational principles of natural and artificial intelligence, pp 608–612 Parisi GI, Wermter S (2017) Lifelong learning of action representations with deep neural self-organization. In: The AAAI 2017 spring symposium on science of intelligence: computational principles of natural and artificial intelligence, pp 608–612
73.
Zurück zum Zitat Rodriguez M, Orrite C, Medrano C, Makris D (2017) One-shot learning of human activity with an map adapted GMM and simplex-HMM. IEEE Trans Cybern 47(7):1769–1780 Rodriguez M, Orrite C, Medrano C, Makris D (2017) One-shot learning of human activity with an map adapted GMM and simplex-HMM. IEEE Trans Cybern 47(7):1769–1780
74.
Zurück zum Zitat Liu L, Shao L, Li X, Lu K (2016) Learning spatio-temporal representations for action recognition: a genetic programming approach. IEEE Trans Cybern 46(1):158–170 Liu L, Shao L, Li X, Lu K (2016) Learning spatio-temporal representations for action recognition: a genetic programming approach. IEEE Trans Cybern 46(1):158–170
75.
Zurück zum Zitat Cheng H, Liu Z, Hou L, Yang J (2016) Sparsity-induced similarity measure and its applications. IEEE Trans Circuits Syst Video Technol 26(4):613–626 Cheng H, Liu Z, Hou L, Yang J (2016) Sparsity-induced similarity measure and its applications. IEEE Trans Circuits Syst Video Technol 26(4):613–626
76.
Zurück zum Zitat Shao L, Liu L, Yu M (2016) Kernelized multiview projection for robust action recognition. Int J Comput Vis 118(2):115–129MathSciNet Shao L, Liu L, Yu M (2016) Kernelized multiview projection for robust action recognition. Int J Comput Vis 118(2):115–129MathSciNet
77.
Zurück zum Zitat Liu AA, Su YT, Nie WZ, Kankanhalli M (2017) Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans Pattern Anal Mach Intell 39(1):102–114 Liu AA, Su YT, Nie WZ, Kankanhalli M (2017) Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans Pattern Anal Mach Intell 39(1):102–114
78.
Zurück zum Zitat Shi Y, Tian Y, Wang Y, Huang T (2017) Sequential deep trajectory descriptor for action recognition with three-stream CNN. IEEE Trans Multimedia 19(7):1510–1520 Shi Y, Tian Y, Wang Y, Huang T (2017) Sequential deep trajectory descriptor for action recognition with three-stream CNN. IEEE Trans Multimedia 19(7):1510–1520
79.
Zurück zum Zitat Zhang S, Gao C, Zhang J, Chen F, Sang N (2018) Discriminative part selection for human action recognition. IEEE Trans Multimedia 20(4):769–780 Zhang S, Gao C, Zhang J, Chen F, Sang N (2018) Discriminative part selection for human action recognition. IEEE Trans Multimedia 20(4):769–780
80.
Zurück zum Zitat Byrne J (2015) Nested motion descriptors. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 502–510 Byrne J (2015) Nested motion descriptors. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 502–510
81.
Zurück zum Zitat Sun C, Junejo IN, Tappen M, Foroosh H (2015) Exploring sparseness and self-similarity for action recognition. IEEE Trans Image Process 24(8):2488–2501MathSciNetMATH Sun C, Junejo IN, Tappen M, Foroosh H (2015) Exploring sparseness and self-similarity for action recognition. IEEE Trans Image Process 24(8):2488–2501MathSciNetMATH
82.
Zurück zum Zitat Nguyen TV, Song Z, Yan S (2015) STAP: spatial-temporal attention-aware pooling for action recognition. IEEE Trans Circuits Syst Video Technol 25(1):77–86 Nguyen TV, Song Z, Yan S (2015) STAP: spatial-temporal attention-aware pooling for action recognition. IEEE Trans Circuits Syst Video Technol 25(1):77–86
83.
Zurück zum Zitat Tian Y, Ruan Q, An G, Fu Y (2016) Action recognition using local consistent group sparse coding with spatio-temporal structure. In: Proceedings of the 2016 ACM on multimedia conference, pp 317–321 Tian Y, Ruan Q, An G, Fu Y (2016) Action recognition using local consistent group sparse coding with spatio-temporal structure. In: Proceedings of the 2016 ACM on multimedia conference, pp 317–321
84.
Zurück zum Zitat Xu W, Miao Z, Zhang XP, Tian Y (2017) A hierarchical spatio-temporal model for human activity recognition. IEEE Trans Multimedia 19(7):1494–1509 Xu W, Miao Z, Zhang XP, Tian Y (2017) A hierarchical spatio-temporal model for human activity recognition. IEEE Trans Multimedia 19(7):1494–1509
85.
Zurück zum Zitat Yi Y, Zheng Z, Lin M (2017) Realistic action recognition with salient foreground trajectories. Expert Syst Appl 75:44–55 Yi Y, Zheng Z, Lin M (2017) Realistic action recognition with salient foreground trajectories. Expert Syst Appl 75:44–55
86.
Zurück zum Zitat Yi Y, Cheng Y, Xu C (2017) Mining human movement evolution for complex action recognition. Expert Syst Appl 78:259–272 Yi Y, Cheng Y, Xu C (2017) Mining human movement evolution for complex action recognition. Expert Syst Appl 78:259–272
87.
Zurück zum Zitat Rahmani H, Mian A, Shah M (2018) Learning a deep model for human action recognition from novel viewpoints. IEEE Trans Pattern Anal Mach Intell 40(3):667–681 Rahmani H, Mian A, Shah M (2018) Learning a deep model for human action recognition from novel viewpoints. IEEE Trans Pattern Anal Mach Intell 40(3):667–681
88.
Zurück zum Zitat Liu L, Shao L, Zheng F, Li X (2014) Realistic action recognition via sparsely-constructed Gaussian processes. Pattern Recognit 47(12):3819–3827 Liu L, Shao L, Zheng F, Li X (2014) Realistic action recognition via sparsely-constructed Gaussian processes. Pattern Recognit 47(12):3819–3827
89.
Zurück zum Zitat Kihl O, Picard D, Gosselin PH (2015) A unified framework for local visual descriptors evaluation. Pattern Recognit 48(4):1174–1184 Kihl O, Picard D, Gosselin PH (2015) A unified framework for local visual descriptors evaluation. Pattern Recognit 48(4):1174–1184
90.
Zurück zum Zitat Sun Q, Liu H, Ma L, Zhang T (2016) A novel hierarchical Bag-of-Words model for compact action representation. Neurocomputing 174:722–732 Sun Q, Liu H, Ma L, Zhang T (2016) A novel hierarchical Bag-of-Words model for compact action representation. Neurocomputing 174:722–732
Metadaten
Titel
NMF with local constraint and Deep NMF with temporal dependencies constraint for action recognition
verfasst von
Ming Tong
Yiran Chen
Lei Ma
He Bai
Xing Yue
Publikationsdatum
21.08.2018
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 9/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-018-3685-9

Weitere Artikel der Ausgabe 9/2020

Neural Computing and Applications 9/2020 Zur Ausgabe

Emerging Trends of Applied Neural Computation - E_TRAINCO

Deep Bayesian Self-Training

Premium Partner