Skip to main content
Erschienen in: Soft Computing 8/2020

16.09.2019 | Focus

Research on key issues of gesture recognition for artificial intelligence

verfasst von: Taiping Mo, Peng Sun

Erschienen in: Soft Computing | Ausgabe 8/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Gesture recognition has become a hot spot in the direction of artificial intelligence and has great research significance. At present, some classical algorithms, such as the neural network method and the hidden Markov method, have the disadvantages of large computational complexity and long training time. This paper proposes the support vector machine (SVM) algorithm to realize gesture recognition. In order to make the recognition more accurate, SVM is combined with the principal component analysis (PCA) algorithm, performs the dimensionality reduction on the gesture image to form the PCA + SVM algorithm for gesture recognition. At the same time, a new dynamic gesture recognition processing method is proposed, and its effectiveness is proved by various methods. Using open-source computer vision library (OPENCV), the algorithm is simulated on visual studio 2015 environment. The results show that the algorithm has an excellent recognition effect.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Alistair S (2018) Handshape recognition using principal component analysis and convolutional neural networks applied to sign language. Doctoral Thesis, Dublin City University Alistair S (2018) Handshape recognition using principal component analysis and convolutional neural networks applied to sign language. Doctoral Thesis, Dublin City University
Zurück zum Zitat Arif M, Muhammad U, Somaya AM (2018a) Smart home based on WiFi sensing: a survey. IEEE Access 6(3):13317–13325 Arif M, Muhammad U, Somaya AM (2018a) Smart home based on WiFi sensing: a survey. IEEE Access 6(3):13317–13325
Zurück zum Zitat Arif M, Muhammad U, Somaya AM (2018b) Multi-order statistical descriptors for real-time face recognition and object classification. IEEE Access 6(1):12993–13004 Arif M, Muhammad U, Somaya AM (2018b) Multi-order statistical descriptors for real-time face recognition and object classification. IEEE Access 6(1):12993–13004
Zurück zum Zitat Chakraborty BK, Sarma D, Bhuyan MK, Macdorman KF (2018) Review of constraints on vision-based gesture recognition for human–computer interaction. IET Comput Vis 12(1):3–15CrossRef Chakraborty BK, Sarma D, Bhuyan MK, Macdorman KF (2018) Review of constraints on vision-based gesture recognition for human–computer interaction. IET Comput Vis 12(1):3–15CrossRef
Zurück zum Zitat Chang X, Yu YL, Yang Y, Xing EP (2016) Semantic pooling for complex event analysis in untrimmed videos. IEEE Trans Softw Eng 39(8):1617–1632 Chang X, Yu YL, Yang Y, Xing EP (2016) Semantic pooling for complex event analysis in untrimmed videos. IEEE Trans Softw Eng 39(8):1617–1632
Zurück zum Zitat Chen X, Ma H, Zhu C, Wang X, Zhao Z (2016) Boundary-aware box refinement for object proposal generation. Neurocomputing 219(1):323–332 Chen X, Ma H, Zhu C, Wang X, Zhao Z (2016) Boundary-aware box refinement for object proposal generation. Neurocomputing 219(1):323–332
Zurück zum Zitat Cruz L, Lucio D, Velho L (2012) Kinect and RGBD images: challenges and applications. In: Conference on graphics, patterns and images. IEEE, vol 8(12), pp 36–49 Cruz L, Lucio D, Velho L (2012) Kinect and RGBD images: challenges and applications. In: Conference on graphics, patterns and images. IEEE, vol 8(12), pp 36–49
Zurück zum Zitat Eg SG, Mohd SS, Ajune WI (2019) 3D object manipulation techniques in handheld mobile augmented reality interface: a review. IEEE Access 2(3):914–920 Eg SG, Mohd SS, Ajune WI (2019) 3D object manipulation techniques in handheld mobile augmented reality interface: a review. IEEE Access 2(3):914–920
Zurück zum Zitat Ghinea G, Kannan R, Kannaiyan S (2014) Gradient-orientation-based PCA subspace for novel face recognition. IEEE Access 2:914–920CrossRef Ghinea G, Kannan R, Kannaiyan S (2014) Gradient-orientation-based PCA subspace for novel face recognition. IEEE Access 2:914–920CrossRef
Zurück zum Zitat Gongfa L, Hao W, Guozhang J, Shuang X, Honghai L (2019) Dynamic gesture recognition in the Internet of Things. IEEE Access 7(12):23713–23724 Gongfa L, Hao W, Guozhang J, Shuang X, Honghai L (2019) Dynamic gesture recognition in the Internet of Things. IEEE Access 7(12):23713–23724
Zurück zum Zitat Gumus E, Kilic N, Sertbas A, Ucan ON (2010) Evaluation of face recognition techniques using PCA, wavelets and SVM. Expert Syst Appl 37(9):6404–6408CrossRef Gumus E, Kilic N, Sertbas A, Ucan ON (2010) Evaluation of face recognition techniques using PCA, wavelets and SVM. Expert Syst Appl 37(9):6404–6408CrossRef
Zurück zum Zitat Hassen H, Dörnemann K, Khemakhem M (2017) Advanced distributed architecture for a complex and large scale Arabic handwriting recognition framework. Int J High Perform Comput Netw 10(6):505–514CrossRef Hassen H, Dörnemann K, Khemakhem M (2017) Advanced distributed architecture for a complex and large scale Arabic handwriting recognition framework. Int J High Perform Comput Netw 10(6):505–514CrossRef
Zurück zum Zitat Hongxin X, Yanping B, Hongping H, Ting X (2019) A novel hybrid model based on TVIW-PSO-GSA algorithm and support vector machine for classification problems. IEEE Access 2(3):914–920 Hongxin X, Yanping B, Hongping H, Ting X (2019) A novel hybrid model based on TVIW-PSO-GSA algorithm and support vector machine for classification problems. IEEE Access 2(3):914–920
Zurück zum Zitat Jun Z, Jizhao H, Zhenglan T, Feng W (2017) Face detection based on LBP. In: 2017 13th IEEE international conference on electronic measurement & instruments (ICEMI). IEEE Jun Z, Jizhao H, Zhenglan T, Feng W (2017) Face detection based on LBP. In: 2017 13th IEEE international conference on electronic measurement & instruments (ICEMI). IEEE
Zurück zum Zitat Jun K, Min C, Min J, Jinhua S, Jian H (2018) Face recognition based on CSGF(2D)2PCANet. IEEE Access 6(8):45153–45165 Jun K, Min C, Min J, Jinhua S, Jian H (2018) Face recognition based on CSGF(2D)2PCANet. IEEE Access 6(8):45153–45165
Zurück zum Zitat Kim TK, Arandjelovi O, Cipolla R (2007) Boosted manifold principal angles for image set-based recognition. Pattern Recognit 40(9):2475–2484CrossRefMATH Kim TK, Arandjelovi O, Cipolla R (2007) Boosted manifold principal angles for image set-based recognition. Pattern Recognit 40(9):2475–2484CrossRefMATH
Zurück zum Zitat Li Z, Nie F, Chang X, Yang Y (2017) Beyond trace ratio: weighted harmonic mean of trace ratios for multiclass discriminant analysis. IEEE Trans Knowl Data Eng 29(10):2100–2110CrossRef Li Z, Nie F, Chang X, Yang Y (2017) Beyond trace ratio: weighted harmonic mean of trace ratios for multiclass discriminant analysis. IEEE Trans Knowl Data Eng 29(10):2100–2110CrossRef
Zurück zum Zitat Mahmood A, Mian A, Owens R (2014) Semi-supervised spectral clustering for image set classification. In: Computer vision pattern recognition (CVPR) 2014. IEEE Mahmood A, Mian A, Owens R (2014) Semi-supervised spectral clustering for image set classification. In: Computer vision pattern recognition (CVPR) 2014. IEEE
Zurück zum Zitat Nugroho AS, Witarto AB, Handoko D (2016) Support vector machine. Support Vector Mach Chem Nugroho AS, Witarto AB, Handoko D (2016) Support vector machine. Support Vector Mach Chem
Zurück zum Zitat Ronald B, Bonin F, Campbell N, Poppe R (2016) International workshop on multimodal analyses enabling artificial agents in human–machine interaction (workshop summary). In: ACM international conference on multimodal interaction. ACM Ronald B, Bonin F, Campbell N, Poppe R (2016) International workshop on multimodal analyses enabling artificial agents in human–machine interaction (workshop summary). In: ACM international conference on multimodal interaction. ACM
Zurück zum Zitat Saleh A, Ahmed M (2019) Unknown-length handwritten numeral string recognition using cascade of PCA-SVMNet classifiers. IEEE Access 7(4):52024–52034MathSciNet Saleh A, Ahmed M (2019) Unknown-length handwritten numeral string recognition using cascade of PCA-SVMNet classifiers. IEEE Access 7(4):52024–52034MathSciNet
Zurück zum Zitat Syed MSS, Husnain AN, Javed IK, Muhammad RZ, Hikmat UK (2018) Shape based Pakistan sign language categorization using statistical features and support vector machines. IEEE Access 6(10):59242–59252 Syed MSS, Husnain AN, Javed IK, Muhammad RZ, Hikmat UK (2018) Shape based Pakistan sign language categorization using statistical features and support vector machines. IEEE Access 6(10):59242–59252
Zurück zum Zitat Uzair M, Mahmood A, Mian A, Mcdonald C (2015) Periocular region-based person identification in the visible, infrared and hyperspectral imagery. Neurocomputing 149(2):854–867CrossRef Uzair M, Mahmood A, Mian A, Mcdonald C (2015) Periocular region-based person identification in the visible, infrared and hyperspectral imagery. Neurocomputing 149(2):854–867CrossRef
Zurück zum Zitat Wei F, Yewen D, Feihong Z, Jack S (2019) Gesture recognition based on CNN and DCGAN for calculation and text output. IEEE Access 7(2):28230–28237 Wei F, Yewen D, Feihong Z, Jack S (2019) Gesture recognition based on CNN and DCGAN for calculation and text output. IEEE Access 7(2):28230–28237
Zurück zum Zitat Yanqiu L, Pengwen X, Weidong M, Weiqiong M, Jiahao L (2019) Dynamic sign language recognition based on video sequence with BLSTM-3D residual networks. IEEE Access 7(3):38044–38054 Yanqiu L, Pengwen X, Weidong M, Weiqiong M, Jiahao L (2019) Dynamic sign language recognition based on video sequence with BLSTM-3D residual networks. IEEE Access 7(3):38044–38054
Zurück zum Zitat Yewale SK, Bharne PK (2011) Hand gesture recognition using different algorithms based on artificial neural network. Int J Eng Sci Technol 3(4):2603–2608 Yewale SK, Bharne PK (2011) Hand gesture recognition using different algorithms based on artificial neural network. Int J Eng Sci Technol 3(4):2603–2608
Zurück zum Zitat Yuliang Z, Chao L, Xueliang Z, Xiaopeng S, Guangyi S, Wen JL (2019) Wireless IoT motion-recognition rings and a paper keyboard. IEEE Access 7(4):44514–44524 Yuliang Z, Chao L, Xueliang Z, Xiaopeng S, Guangyi S, Wen JL (2019) Wireless IoT motion-recognition rings and a paper keyboard. IEEE Access 7(4):44514–44524
Zurück zum Zitat Zeng QS, Lai JH, Wang CD (2014) Multi-local model image set matching based on domain description. Pattern Recognit 47(2):694–704CrossRef Zeng QS, Lai JH, Wang CD (2014) Multi-local model image set matching based on domain description. Pattern Recognit 47(2):694–704CrossRef
Metadaten
Titel
Research on key issues of gesture recognition for artificial intelligence
verfasst von
Taiping Mo
Peng Sun
Publikationsdatum
16.09.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 8/2020
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-04342-3

Weitere Artikel der Ausgabe 8/2020

Soft Computing 8/2020 Zur Ausgabe