nach oben

Neural Computing and Applications

Erschienen in:

20.02.2020 | Original Article

PTL-LTM model for complex action recognition using local-weighted NMF and deep dual-manifold regularized NMF with sparsity constraint

verfasst von: Ming Tong, He Bai, Xing Yue, Haili Bu

Erschienen in: Neural Computing and Applications | Ausgabe 17/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Complex action recognition possesses significant academic research value, potential commercial value and broad market application prospect. For improving its performance, a local-weighted nonnegative matrix factorization with rank regularization constraint (LWNMF_RC) is firstly presented, which removes complex background and then obtains motion salient regions. Secondly, a dual-manifold regularized nonnegative matrix factorization with sparsity constraint (DMNMF_SC) is proposed, which not only considers the short-term and middle-term temporal dependencies implied in data manifold, but also mines the geometric structure hidden in feature manifold. In addition, the introduction of sparsity constraint makes features possess better discriminativeness. Thirdly, a deep DMNMF_SC method is constructed, which acquires more hierarchical and discriminative features. Finally, a long-term temporal memory model with probability transfer learning (PTL-LTM) is proposed, which accurately memorizes the long-term temporal dependency among multiple simple action segments and, meanwhile, makes full use of the probability features of rich labeled simple actions and then applies the knowledge learned from simple actions for complex action recognition. Consequently, the performance is effectively improved.

Vorheriger Artikel Using eye-tracking into decision makers evaluation in evolutionary interactive UA-FLP algorithms

Nächster Artikel Feature construction as a bi-level optimization problem

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Nur mit Berechtigung zugänglich

Chen Y, Yi Z (2019) Locality-constrained least squares regression for subspace clustering. Knowl-Based Syst 163:51–56CrossRef

Lu Y, Lai Z, Xu Y, Li X, Zhang D, Yuan C (2017) Nonnegative discriminant matrix factorization. IEEE Trans Circuits Syst Video Technol 27(7):1392–1405CrossRef

Lu C, Feng J, Lin Z, Mei T, Yan S (2019) Subspace clustering by block diagonal representation. IEEE Trans Pattern Anal Mach Intell 41(2):487–501CrossRef

Lu C, Feng J, Chen Y, Liu W, Lin Z, Yan S (2019) Tensor robust principal component analysis with a new tensor nuclear norm. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/tpami.2019.2891760CrossRef

Alawadi S, Fernández-Delgado M, Mera D, Barro S (2018) Polynomial kernel discriminant analysis for 2D visualization of classification problems. Neural Comput Appl. https://doi.org/10.1007/s00521-017-3290-3CrossRef

Xu KK, Li HX, Liu Z (2018) ISOMAP-based spatiotemporal modeling for lithium-ion battery thermal process. IEEE Trans Ind Inf 14(2):569–577CrossRef

Lee DD, Seung HS (1999) Learning the parts of objects by nonnegative matrix factorization. Nature 401(6755):788–791MATHCrossRef

Yuan X, Han L, Qian S, Xu G, Yan H (2019) Singular value decomposition based recommendation using imputed data. Knowl-Based Syst 163:485–494CrossRef

Yi Y, Wang J, Zhou W, Zheng C, Kong J, Qiao S (2019) Non-negative matrix factorization with locality constrained adaptive graph. IEEE Trans Circuits Syst Video Technol. https://doi.org/10.1109/tcsvt.2019.2892971CrossRef

10.

Zhu W, Yan Y, Peng Y (2018) Topological structure regularized nonnegative matrix factorization for image clustering. Neural Comput Appl. https://doi.org/10.1007/s00521-018-3572-4CrossRef

11.

Yang S, Zhang L, He X, Yi Z (2019) Learning manifold structures with subspace segmentations. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2019.2895497CrossRef

12.

Zhang H, Wang S, Xu X, Chow TW, Wu QJ (2018) Tree2Vector: learning a vectorial representation for tree-structured data. IEEE Trans Neural Netw Learn Syst 99:1–15MathSciNet

13.

Gao H, Nie F, Huang H (2017) Local centroids structured non-negative matrix factorization. In: Proceedings of the thirty-first AAAI conference on artificial intelligence, pp 1905–1911

14.

Huang S, Zhao P, Ren Y, Li T, Xu Z (2019) Self-paced and soft-weighted nonnegative matrix factorization for data representation. Knowl-Based Syst 164:29–37CrossRef

15.

Liu F, Xu X, Qiu S, Tao D (2016) Simple to complex transfer learning for action recognition. IEEE Trans Image Process 25(2):949–960MathSciNetMATHCrossRef

16.

Zhang J, Hu H (2019) Domain learning joint with semantic adaptation for human action recognition. Pattern Recognit 90:196–209CrossRef

17.

Li J, Wong Y, Zhao Q, Kankanhalli MS (2017) Attention transfer from web images for video recognition. In: Proceedings of the 25th ACM international conference on multimedia, pp 1–9

18.

Luo Z, Zou Y, Hoffman J, Fei-Fei L (2017) Label efficient learning of transferable representations acrosss domains and tasks. In: Proceedings of advances in neural information processing systems (NIPS), pp 165–177

19.

Duan L, Xu D, Tsang IWH, Luo J (2012) Visual event recognition in videos by learning from web data. IEEE Trans Pattern Anal Mach Intell 34(9):1667–1680CrossRef

20.

Rahmani H, Mian A, Shah M (2018) Learning a deep model for human action recognition from novel viewpoints. IEEE Trans Pattern Anal Mach Intell 40(3):667–681CrossRef

21.

Wu F, Hu Y, Gao J, Sun Y, Yin B (2016) Ordered subspace clustering with block-diagonal priors. IEEE Trans Cybern 46(12):3209–3219CrossRef

22.

Wang J, Tian F, Liu CH, Yu H, Wang X, Tang X (2017) Robust nonnegative matrix factorization with ordered structure constraints. In: Proceedings of the international joint conference on neural networks (IJCNN), pp 478–485

23.

Xiang Y, Zhang G, Gu S, Cai J (2018) Online multi-layer dictionary pair learning for visual classification. Expert Syst Appl 105:174–182CrossRef

24.

Su B, Zhou J, Ding X, Wang H, Wu Y (2016) Hierarchical dynamic parsing and encoding for action recognition. In: Proceedings of European conference on computer vision (ECCV), pp 202–217

25.

Trigeorgis G, Zafeiriou S, Schuller BW (2017) A deep matrix factorization method for learning attribute representations. IEEE Trans Pattern Anal Mach Intell 39(3):417–429CrossRef

26.

Kulis B (2012) Metric learning: a survey. Found Trends Mach Learn 5(4):287–364MathSciNetMATHCrossRef

27.

Wang H, Kläser A, Schmid C, Liu CL (2013) Dense trajectories and motion boundary descriptors for action recognition. Int J Comput Vis 103(1):60–79MathSciNetCrossRef

28.

Cai D, He X, Han J, Huang TS (2011) Graph regularized nonnegative matrix factorization for data representation. IEEE Trans Pattern Anal Mach Intell 33(8):1548–1560CrossRef

29.

Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22:888–905CrossRef

30.

Reddy KK, Shah M (2013) Recognizing 50 human action categories of web videos. Mach Vis Appl 24(5):971–981CrossRef

31.

Niebles JC, Chen CW, Fei-Fei L (2010) Modeling temporal structure of decomposable motion segments for activity classification. In: Proceedings of European conference on computer vision (ECCV), pp 392–405

32.

Schuldt C, Laptev I, Caputo B (2004) Recognizing human actions: A local SVM approach. In: Proceedings of the international conference on pattern recognition (ICPR), pp 32–36

33.

Gorelick L, Blank M, Shechtman E, Irani M, Basri R (2007) Actions as space-time shapes. IEEE Trans Pattern Anal Mach Intell 29(12):2247–2253CrossRef

34.

Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recognit Lett 31(8):651–666CrossRef

35.

Allab K, Labiod L, Nadif M (2017) A semi-NMF-PCA unified framework for data clustering. IEEE Trans Knowl Data Eng 29(1):2–16MATHCrossRef

36.

Arias-Castro E, Lerman G, Zhang T (2017) Spectral clustering based on local PCA. J Mach Learn Res 18(9):1–57MathSciNetMATH

37.

Liu G, Lin Z, Yu Y (2010) Robust subspace segmentation by low-rank representation. In: Proceedings of the international conference on machine learning (ICML), pp 663–670

38.

Hu W, Choi KS, Wang P, Jiang Y, Wang S (2015) Convex nonnegative matrix factorization with manifold regularization. Neural Netw 63:94–103MATHCrossRef

39.

Xia G, Sun H, Feng L, Zhang G, Liu Y (2018) Human motion segmentation via robust kernel sparse subspace clustering. IEEE Trans Image Process 27(1):135–150MathSciNetMATHCrossRef

40.

Everts I, Van Gemert JC, Gevers T (2014) Evaluation of color spatio-temporal interest points for human action recognition. IEEE Trans Image Process 23(4):1569–1580MathSciNetMATHCrossRef

41.

Ciptadi A, Goodwin MS, Rehg JM (2014) Movement pattern histogram for action recognition and retrieval. In: Proceedings of European conference on computer vision (ECCV), pp 695–710

42.

Narayan S, Ramakrishnan KR (2014) A cause and effect analysis of motion trajectories for modeling actions. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 2633–2640

43.

Liu J, Huang Y, Peng X, Wang L (2015) Multi-view descriptor mining via codeword net for action recognition. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 793–797

44.

Chen QQ, Zhang YJ (2016) Cluster trees of improved trajectories for action recognition. Neurocomputing 173:364–372CrossRef

45.

Wang H, Oneata D, Verbeek J, Schmid C (2016) A robust and efficient video representation for action recognition. Int J Comput Vis 119(3):219–238MathSciNetCrossRef

46.

Peng X, Wang L, Wang X, Qiao Y (2016) Bag of visual words and fusion methods for action recognition: comprehensive study and good practice. Comput Vis Image Underst 150:109–125CrossRef

47.

Wang L, Qiao Y, Tang X (2016) MoFAP: a multi-level representation for action recognition. Int J Comput Vis 119(3):254–271MathSciNetCrossRef

48.

Liu AA, Su YT, Nie WZ, Kankanhalli M (2017) Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans Pattern Anal Mach Intell 39(1):102–114CrossRef

49.

Wang H, Chang X, Shi L, Yang Y, Shen YD (2018) Uncertainty sampling for action recognition via maximizing expected average precision. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence (IJCAI), pp 964–970

50.

Ni B, Moulin P, Yang X, Yan S (2015) Motion part regularization: Improving action recognition via trajectory selection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 3698–3706

51.

Liu C, Wu X, Jia Y (2016) A hierarchical video description for complex activity understanding. Int J Comput Vis 118(2):240–255MathSciNetCrossRef

52.

Yi Y, Zheng Z, Lin M (2017) Realistic action recognition with salient foreground trajectories. Expert Syst Appl 75:44–55CrossRef

53.

Xu K, Jiang X, Sun T (2017) Two-stream dictionary learning architecture for action recognition. IEEE Trans Circuits Syst Video Technol 27(3):567–576CrossRef

54.

Li WX, Vasconcelos N (2017) Complex activity recognition via attribute dynamics. Int J Comput Vis 122(2):334–370MathSciNetCrossRef

55.

Tian Y, Kong Y, Ruan Q, An G, Fu Y (2018) Hierarchical and spatio-temporal sparse representation for human action recognition. IEEE Trans Image Process 27(4):1748–1762MathSciNetCrossRef

56.

Tang K, Fei-Fei L, Koller D (2012) Learning latent temporal structure for complex event detection. In: Proceedings of the IEEE international conference on computer vision (CVPR), pp 1250–1257

57.

Li W, Yu Q, Divakaran A, Vasconcelos N (2013) Dynamic pooling for complex event recognition. In: Proceedings of the IEEE international conference on computer vision (CVPR), pp 2728–2735

58.

Zheng J, Jiang Z, Chellappa R (2016) Submodular attribute selection for visual recognition. IEEE Trans Pattern Anal Mach Intell 39(11):2242–2255CrossRef

Titel: PTL-LTM model for complex action recognition using local-weighted NMF and deep dual-manifold regularized NMF with sparsity constraint
verfasst von: Ming Tong
He Bai
Xing Yue
Haili Bu
Publikationsdatum: 20.02.2020
Verlag: Springer London
Erschienen in: Neural Computing and Applications / Ausgabe 17/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI: https://doi.org/10.1007/s00521-020-04783-0

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Weitere Artikel der Ausgabe 17/2020

Camera model identification using a deep network and a reduced edge dataset

A chaotic optimization method based on logistic-sine map for numerical function optimization

A multi-objective open set orienteering problem

An enhanced sitting–sizing scheme for shunt capacitors in radial distribution systems using improved atom search optimization

A neural network approach to remove rain using reconstruction and feature losses

Performance optimization of QoS-supported dense WLANs using machine-learning-enabled enhanced distributed channel access (MEDCA) mechanism

Premium Partner