Skip to main content
Erschienen in: Neural Computing and Applications 17/2020

20.02.2020 | Original Article

PTL-LTM model for complex action recognition using local-weighted NMF and deep dual-manifold regularized NMF with sparsity constraint

verfasst von: Ming Tong, He Bai, Xing Yue, Haili Bu

Erschienen in: Neural Computing and Applications | Ausgabe 17/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Complex action recognition possesses significant academic research value, potential commercial value and broad market application prospect. For improving its performance, a local-weighted nonnegative matrix factorization with rank regularization constraint (LWNMF_RC) is firstly presented, which removes complex background and then obtains motion salient regions. Secondly, a dual-manifold regularized nonnegative matrix factorization with sparsity constraint (DMNMF_SC) is proposed, which not only considers the short-term and middle-term temporal dependencies implied in data manifold, but also mines the geometric structure hidden in feature manifold. In addition, the introduction of sparsity constraint makes features possess better discriminativeness. Thirdly, a deep DMNMF_SC method is constructed, which acquires more hierarchical and discriminative features. Finally, a long-term temporal memory model with probability transfer learning (PTL-LTM) is proposed, which accurately memorizes the long-term temporal dependency among multiple simple action segments and, meanwhile, makes full use of the probability features of rich labeled simple actions and then applies the knowledge learned from simple actions for complex action recognition. Consequently, the performance is effectively improved.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Chen Y, Yi Z (2019) Locality-constrained least squares regression for subspace clustering. Knowl-Based Syst 163:51–56CrossRef Chen Y, Yi Z (2019) Locality-constrained least squares regression for subspace clustering. Knowl-Based Syst 163:51–56CrossRef
2.
Zurück zum Zitat Lu Y, Lai Z, Xu Y, Li X, Zhang D, Yuan C (2017) Nonnegative discriminant matrix factorization. IEEE Trans Circuits Syst Video Technol 27(7):1392–1405CrossRef Lu Y, Lai Z, Xu Y, Li X, Zhang D, Yuan C (2017) Nonnegative discriminant matrix factorization. IEEE Trans Circuits Syst Video Technol 27(7):1392–1405CrossRef
3.
Zurück zum Zitat Lu C, Feng J, Lin Z, Mei T, Yan S (2019) Subspace clustering by block diagonal representation. IEEE Trans Pattern Anal Mach Intell 41(2):487–501CrossRef Lu C, Feng J, Lin Z, Mei T, Yan S (2019) Subspace clustering by block diagonal representation. IEEE Trans Pattern Anal Mach Intell 41(2):487–501CrossRef
6.
Zurück zum Zitat Xu KK, Li HX, Liu Z (2018) ISOMAP-based spatiotemporal modeling for lithium-ion battery thermal process. IEEE Trans Ind Inf 14(2):569–577CrossRef Xu KK, Li HX, Liu Z (2018) ISOMAP-based spatiotemporal modeling for lithium-ion battery thermal process. IEEE Trans Ind Inf 14(2):569–577CrossRef
7.
Zurück zum Zitat Lee DD, Seung HS (1999) Learning the parts of objects by nonnegative matrix factorization. Nature 401(6755):788–791MATHCrossRef Lee DD, Seung HS (1999) Learning the parts of objects by nonnegative matrix factorization. Nature 401(6755):788–791MATHCrossRef
8.
Zurück zum Zitat Yuan X, Han L, Qian S, Xu G, Yan H (2019) Singular value decomposition based recommendation using imputed data. Knowl-Based Syst 163:485–494CrossRef Yuan X, Han L, Qian S, Xu G, Yan H (2019) Singular value decomposition based recommendation using imputed data. Knowl-Based Syst 163:485–494CrossRef
12.
Zurück zum Zitat Zhang H, Wang S, Xu X, Chow TW, Wu QJ (2018) Tree2Vector: learning a vectorial representation for tree-structured data. IEEE Trans Neural Netw Learn Syst 99:1–15MathSciNet Zhang H, Wang S, Xu X, Chow TW, Wu QJ (2018) Tree2Vector: learning a vectorial representation for tree-structured data. IEEE Trans Neural Netw Learn Syst 99:1–15MathSciNet
13.
Zurück zum Zitat Gao H, Nie F, Huang H (2017) Local centroids structured non-negative matrix factorization. In: Proceedings of the thirty-first AAAI conference on artificial intelligence, pp 1905–1911 Gao H, Nie F, Huang H (2017) Local centroids structured non-negative matrix factorization. In: Proceedings of the thirty-first AAAI conference on artificial intelligence, pp 1905–1911
14.
Zurück zum Zitat Huang S, Zhao P, Ren Y, Li T, Xu Z (2019) Self-paced and soft-weighted nonnegative matrix factorization for data representation. Knowl-Based Syst 164:29–37CrossRef Huang S, Zhao P, Ren Y, Li T, Xu Z (2019) Self-paced and soft-weighted nonnegative matrix factorization for data representation. Knowl-Based Syst 164:29–37CrossRef
15.
Zurück zum Zitat Liu F, Xu X, Qiu S, Tao D (2016) Simple to complex transfer learning for action recognition. IEEE Trans Image Process 25(2):949–960MathSciNetMATHCrossRef Liu F, Xu X, Qiu S, Tao D (2016) Simple to complex transfer learning for action recognition. IEEE Trans Image Process 25(2):949–960MathSciNetMATHCrossRef
16.
Zurück zum Zitat Zhang J, Hu H (2019) Domain learning joint with semantic adaptation for human action recognition. Pattern Recognit 90:196–209CrossRef Zhang J, Hu H (2019) Domain learning joint with semantic adaptation for human action recognition. Pattern Recognit 90:196–209CrossRef
17.
Zurück zum Zitat Li J, Wong Y, Zhao Q, Kankanhalli MS (2017) Attention transfer from web images for video recognition. In: Proceedings of the 25th ACM international conference on multimedia, pp 1–9 Li J, Wong Y, Zhao Q, Kankanhalli MS (2017) Attention transfer from web images for video recognition. In: Proceedings of the 25th ACM international conference on multimedia, pp 1–9
18.
Zurück zum Zitat Luo Z, Zou Y, Hoffman J, Fei-Fei L (2017) Label efficient learning of transferable representations acrosss domains and tasks. In: Proceedings of advances in neural information processing systems (NIPS), pp 165–177 Luo Z, Zou Y, Hoffman J, Fei-Fei L (2017) Label efficient learning of transferable representations acrosss domains and tasks. In: Proceedings of advances in neural information processing systems (NIPS), pp 165–177
19.
Zurück zum Zitat Duan L, Xu D, Tsang IWH, Luo J (2012) Visual event recognition in videos by learning from web data. IEEE Trans Pattern Anal Mach Intell 34(9):1667–1680CrossRef Duan L, Xu D, Tsang IWH, Luo J (2012) Visual event recognition in videos by learning from web data. IEEE Trans Pattern Anal Mach Intell 34(9):1667–1680CrossRef
20.
Zurück zum Zitat Rahmani H, Mian A, Shah M (2018) Learning a deep model for human action recognition from novel viewpoints. IEEE Trans Pattern Anal Mach Intell 40(3):667–681CrossRef Rahmani H, Mian A, Shah M (2018) Learning a deep model for human action recognition from novel viewpoints. IEEE Trans Pattern Anal Mach Intell 40(3):667–681CrossRef
21.
Zurück zum Zitat Wu F, Hu Y, Gao J, Sun Y, Yin B (2016) Ordered subspace clustering with block-diagonal priors. IEEE Trans Cybern 46(12):3209–3219CrossRef Wu F, Hu Y, Gao J, Sun Y, Yin B (2016) Ordered subspace clustering with block-diagonal priors. IEEE Trans Cybern 46(12):3209–3219CrossRef
22.
Zurück zum Zitat Wang J, Tian F, Liu CH, Yu H, Wang X, Tang X (2017) Robust nonnegative matrix factorization with ordered structure constraints. In: Proceedings of the international joint conference on neural networks (IJCNN), pp 478–485 Wang J, Tian F, Liu CH, Yu H, Wang X, Tang X (2017) Robust nonnegative matrix factorization with ordered structure constraints. In: Proceedings of the international joint conference on neural networks (IJCNN), pp 478–485
23.
Zurück zum Zitat Xiang Y, Zhang G, Gu S, Cai J (2018) Online multi-layer dictionary pair learning for visual classification. Expert Syst Appl 105:174–182CrossRef Xiang Y, Zhang G, Gu S, Cai J (2018) Online multi-layer dictionary pair learning for visual classification. Expert Syst Appl 105:174–182CrossRef
24.
Zurück zum Zitat Su B, Zhou J, Ding X, Wang H, Wu Y (2016) Hierarchical dynamic parsing and encoding for action recognition. In: Proceedings of European conference on computer vision (ECCV), pp 202–217 Su B, Zhou J, Ding X, Wang H, Wu Y (2016) Hierarchical dynamic parsing and encoding for action recognition. In: Proceedings of European conference on computer vision (ECCV), pp 202–217
25.
Zurück zum Zitat Trigeorgis G, Zafeiriou S, Schuller BW (2017) A deep matrix factorization method for learning attribute representations. IEEE Trans Pattern Anal Mach Intell 39(3):417–429CrossRef Trigeorgis G, Zafeiriou S, Schuller BW (2017) A deep matrix factorization method for learning attribute representations. IEEE Trans Pattern Anal Mach Intell 39(3):417–429CrossRef
27.
Zurück zum Zitat Wang H, Kläser A, Schmid C, Liu CL (2013) Dense trajectories and motion boundary descriptors for action recognition. Int J Comput Vis 103(1):60–79MathSciNetCrossRef Wang H, Kläser A, Schmid C, Liu CL (2013) Dense trajectories and motion boundary descriptors for action recognition. Int J Comput Vis 103(1):60–79MathSciNetCrossRef
28.
Zurück zum Zitat Cai D, He X, Han J, Huang TS (2011) Graph regularized nonnegative matrix factorization for data representation. IEEE Trans Pattern Anal Mach Intell 33(8):1548–1560CrossRef Cai D, He X, Han J, Huang TS (2011) Graph regularized nonnegative matrix factorization for data representation. IEEE Trans Pattern Anal Mach Intell 33(8):1548–1560CrossRef
29.
Zurück zum Zitat Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22:888–905CrossRef Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22:888–905CrossRef
30.
Zurück zum Zitat Reddy KK, Shah M (2013) Recognizing 50 human action categories of web videos. Mach Vis Appl 24(5):971–981CrossRef Reddy KK, Shah M (2013) Recognizing 50 human action categories of web videos. Mach Vis Appl 24(5):971–981CrossRef
31.
Zurück zum Zitat Niebles JC, Chen CW, Fei-Fei L (2010) Modeling temporal structure of decomposable motion segments for activity classification. In: Proceedings of European conference on computer vision (ECCV), pp 392–405 Niebles JC, Chen CW, Fei-Fei L (2010) Modeling temporal structure of decomposable motion segments for activity classification. In: Proceedings of European conference on computer vision (ECCV), pp 392–405
32.
Zurück zum Zitat Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: A local SVM approach. In: Proceedings of the international conference on pattern recognition (ICPR), pp 32–36 Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: A local SVM approach. In: Proceedings of the international conference on pattern recognition (ICPR), pp 32–36
33.
Zurück zum Zitat Gorelick L, Blank M, Shechtman E, Irani M, Basri R (2007) Actions as space-time shapes. IEEE Trans Pattern Anal Mach Intell 29(12):2247–2253CrossRef Gorelick L, Blank M, Shechtman E, Irani M, Basri R (2007) Actions as space-time shapes. IEEE Trans Pattern Anal Mach Intell 29(12):2247–2253CrossRef
34.
Zurück zum Zitat Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recognit Lett 31(8):651–666CrossRef Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recognit Lett 31(8):651–666CrossRef
35.
Zurück zum Zitat Allab K, Labiod L, Nadif M (2017) A semi-NMF-PCA unified framework for data clustering. IEEE Trans Knowl Data Eng 29(1):2–16MATHCrossRef Allab K, Labiod L, Nadif M (2017) A semi-NMF-PCA unified framework for data clustering. IEEE Trans Knowl Data Eng 29(1):2–16MATHCrossRef
36.
Zurück zum Zitat Arias-Castro E, Lerman G, Zhang T (2017) Spectral clustering based on local PCA. J Mach Learn Res 18(9):1–57MathSciNetMATH Arias-Castro E, Lerman G, Zhang T (2017) Spectral clustering based on local PCA. J Mach Learn Res 18(9):1–57MathSciNetMATH
37.
Zurück zum Zitat Liu G, Lin Z, Yu Y (2010) Robust subspace segmentation by low-rank representation. In: Proceedings of the international conference on machine learning (ICML), pp 663–670 Liu G, Lin Z, Yu Y (2010) Robust subspace segmentation by low-rank representation. In: Proceedings of the international conference on machine learning (ICML), pp 663–670
38.
Zurück zum Zitat Hu W, Choi KS, Wang P, Jiang Y, Wang S (2015) Convex nonnegative matrix factorization with manifold regularization. Neural Netw 63:94–103MATHCrossRef Hu W, Choi KS, Wang P, Jiang Y, Wang S (2015) Convex nonnegative matrix factorization with manifold regularization. Neural Netw 63:94–103MATHCrossRef
39.
Zurück zum Zitat Xia G, Sun H, Feng L, Zhang G, Liu Y (2018) Human motion segmentation via robust kernel sparse subspace clustering. IEEE Trans Image Process 27(1):135–150MathSciNetMATHCrossRef Xia G, Sun H, Feng L, Zhang G, Liu Y (2018) Human motion segmentation via robust kernel sparse subspace clustering. IEEE Trans Image Process 27(1):135–150MathSciNetMATHCrossRef
40.
Zurück zum Zitat Everts I, Van Gemert JC, Gevers T (2014) Evaluation of color spatio-temporal interest points for human action recognition. IEEE Trans Image Process 23(4):1569–1580MathSciNetMATHCrossRef Everts I, Van Gemert JC, Gevers T (2014) Evaluation of color spatio-temporal interest points for human action recognition. IEEE Trans Image Process 23(4):1569–1580MathSciNetMATHCrossRef
41.
Zurück zum Zitat Ciptadi A, Goodwin MS, Rehg JM (2014) Movement pattern histogram for action recognition and retrieval. In: Proceedings of European conference on computer vision (ECCV), pp 695–710 Ciptadi A, Goodwin MS, Rehg JM (2014) Movement pattern histogram for action recognition and retrieval. In: Proceedings of European conference on computer vision (ECCV), pp 695–710
42.
Zurück zum Zitat Narayan S, Ramakrishnan KR (2014) A cause and effect analysis of motion trajectories for modeling actions. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 2633–2640 Narayan S, Ramakrishnan KR (2014) A cause and effect analysis of motion trajectories for modeling actions. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 2633–2640
43.
Zurück zum Zitat Liu J, Huang Y, Peng X, Wang L (2015) Multi-view descriptor mining via codeword net for action recognition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 793–797 Liu J, Huang Y, Peng X, Wang L (2015) Multi-view descriptor mining via codeword net for action recognition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 793–797
44.
Zurück zum Zitat Chen QQ, Zhang YJ (2016) Cluster trees of improved trajectories for action recognition. Neurocomputing 173:364–372CrossRef Chen QQ, Zhang YJ (2016) Cluster trees of improved trajectories for action recognition. Neurocomputing 173:364–372CrossRef
45.
Zurück zum Zitat Wang H, Oneata D, Verbeek J, Schmid C (2016) A robust and efficient video representation for action recognition. Int J Comput Vis 119(3):219–238MathSciNetCrossRef Wang H, Oneata D, Verbeek J, Schmid C (2016) A robust and efficient video representation for action recognition. Int J Comput Vis 119(3):219–238MathSciNetCrossRef
46.
Zurück zum Zitat Peng X, Wang L, Wang X, Qiao Y (2016) Bag of visual words and fusion methods for action recognition: comprehensive study and good practice. Comput Vis Image Underst 150:109–125CrossRef Peng X, Wang L, Wang X, Qiao Y (2016) Bag of visual words and fusion methods for action recognition: comprehensive study and good practice. Comput Vis Image Underst 150:109–125CrossRef
47.
Zurück zum Zitat Wang L, Qiao Y, Tang X (2016) MoFAP: a multi-level representation for action recognition. Int J Comput Vis 119(3):254–271MathSciNetCrossRef Wang L, Qiao Y, Tang X (2016) MoFAP: a multi-level representation for action recognition. Int J Comput Vis 119(3):254–271MathSciNetCrossRef
48.
Zurück zum Zitat Liu AA, Su YT, Nie WZ, Kankanhalli M (2017) Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans Pattern Anal Mach Intell 39(1):102–114CrossRef Liu AA, Su YT, Nie WZ, Kankanhalli M (2017) Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans Pattern Anal Mach Intell 39(1):102–114CrossRef
49.
Zurück zum Zitat Wang H, Chang X, Shi L, Yang Y, Shen YD (2018) Uncertainty sampling for action recognition via maximizing expected average precision. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence (IJCAI), pp 964–970 Wang H, Chang X, Shi L, Yang Y, Shen YD (2018) Uncertainty sampling for action recognition via maximizing expected average precision. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence (IJCAI), pp 964–970
50.
Zurück zum Zitat Ni B, Moulin P, Yang X, Yan S (2015) Motion part regularization: Improving action recognition via trajectory selection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 3698–3706 Ni B, Moulin P, Yang X, Yan S (2015) Motion part regularization: Improving action recognition via trajectory selection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 3698–3706
51.
Zurück zum Zitat Liu C, Wu X, Jia Y (2016) A hierarchical video description for complex activity understanding. Int J Comput Vis 118(2):240–255MathSciNetCrossRef Liu C, Wu X, Jia Y (2016) A hierarchical video description for complex activity understanding. Int J Comput Vis 118(2):240–255MathSciNetCrossRef
52.
Zurück zum Zitat Yi Y, Zheng Z, Lin M (2017) Realistic action recognition with salient foreground trajectories. Expert Syst Appl 75:44–55CrossRef Yi Y, Zheng Z, Lin M (2017) Realistic action recognition with salient foreground trajectories. Expert Syst Appl 75:44–55CrossRef
53.
Zurück zum Zitat Xu K, Jiang X, Sun T (2017) Two-stream dictionary learning architecture for action recognition. IEEE Trans Circuits Syst Video Technol 27(3):567–576CrossRef Xu K, Jiang X, Sun T (2017) Two-stream dictionary learning architecture for action recognition. IEEE Trans Circuits Syst Video Technol 27(3):567–576CrossRef
54.
Zurück zum Zitat Li WX, Vasconcelos N (2017) Complex activity recognition via attribute dynamics. Int J Comput Vis 122(2):334–370MathSciNetCrossRef Li WX, Vasconcelos N (2017) Complex activity recognition via attribute dynamics. Int J Comput Vis 122(2):334–370MathSciNetCrossRef
55.
Zurück zum Zitat Tian Y, Kong Y, Ruan Q, An G, Fu Y (2018) Hierarchical and spatio-temporal sparse representation for human action recognition. IEEE Trans Image Process 27(4):1748–1762MathSciNetCrossRef Tian Y, Kong Y, Ruan Q, An G, Fu Y (2018) Hierarchical and spatio-temporal sparse representation for human action recognition. IEEE Trans Image Process 27(4):1748–1762MathSciNetCrossRef
56.
Zurück zum Zitat Tang K, Fei-Fei L, Koller D (2012) Learning latent temporal structure for complex event detection. In: Proceedings of the IEEE international conference on computer vision (CVPR), pp 1250–1257 Tang K, Fei-Fei L, Koller D (2012) Learning latent temporal structure for complex event detection. In: Proceedings of the IEEE international conference on computer vision (CVPR), pp 1250–1257
57.
Zurück zum Zitat Li W, Yu Q, Divakaran A, Vasconcelos N (2013) Dynamic pooling for complex event recognition. In: Proceedings of the IEEE international conference on computer vision (CVPR), pp 2728–2735 Li W, Yu Q, Divakaran A, Vasconcelos N (2013) Dynamic pooling for complex event recognition. In: Proceedings of the IEEE international conference on computer vision (CVPR), pp 2728–2735
58.
Zurück zum Zitat Zheng J, Jiang Z, Chellappa R (2016) Submodular attribute selection for visual recognition. IEEE Trans Pattern Anal Mach Intell 39(11):2242–2255CrossRef Zheng J, Jiang Z, Chellappa R (2016) Submodular attribute selection for visual recognition. IEEE Trans Pattern Anal Mach Intell 39(11):2242–2255CrossRef
Metadaten
Titel
PTL-LTM model for complex action recognition using local-weighted NMF and deep dual-manifold regularized NMF with sparsity constraint
verfasst von
Ming Tong
He Bai
Xing Yue
Haili Bu
Publikationsdatum
20.02.2020
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 17/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-020-04783-0

Weitere Artikel der Ausgabe 17/2020

Neural Computing and Applications 17/2020 Zur Ausgabe

Premium Partner