Skip to main content
Erschienen in: Neural Processing Letters 9/2023

25.10.2023

Deep Learning-Based Hand Gesture Recognition System and Design of a Human–Machine Interface

verfasst von: Abir Sen, Tapas Kumar Mishra, Ratnakar Dash

Erschienen in: Neural Processing Letters | Ausgabe 9/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Hand gesture recognition plays an important role in developing effective human–machine interfaces (HMIs) that enable direct communication between humans and machines. But in real-time scenarios, it is difficult to identify the correct hand gesture to control an application while moving the hands. To address this issue, in this work, a low-cost hand gesture recognition system based human-computer interface (HCI) is presented in real-time scenarios. The system consists of six stages: (1) hand detection, (2) gesture segmentation, (3) feature extraction and gesture classification using five pre-trained convolutional neural network models (CNN) and vision transformer (ViT), (4) building an interactive human–machine interface (HMI), (5) development of a gesture-controlled virtual mouse, (6) smoothing of virtual mouse pointer using of Kalman filter. In our work, five pre-trained CNN models (VGG16, VGG19, ResNet50, ResNet101, and Inception-V1) and ViT have been employed to classify hand gesture images. Two multi-class datasets (one public and one custom) have been used to validate the models. Considering the model’s performances, it is observed that Inception-V1 has significantly shown a better classification performance compared to the other four CNN models and ViT in terms of accuracy, precision, recall, and F-score values. We have also expanded this system to control some multimedia applications (such as VLC player, audio player, playing 2D Super-Mario-Bros game, etc.) with different customized gesture commands in real-time scenarios. The average speed of this system has reached 25 fps (frames per second), which meets the requirements for the real-time scenario. Performance of the proposed gesture control system obtained the average response time in milisecond for each control which makes it suitable for real-time. This model (prototype) will benefit physically disabled people interacting with desktops.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Berezhnoy V, Popov D, Afanasyev I, Mavridis N (2018) The hand-gesture-based control interface with wearable glove system. In: ICINCO (2), pp 458–465 Berezhnoy V, Popov D, Afanasyev I, Mavridis N (2018) The hand-gesture-based control interface with wearable glove system. In: ICINCO (2), pp 458–465
2.
Zurück zum Zitat Abhishek KS, Qubeley LCF, Ho D (2016) Glove-based hand gesture recognition sign language translator using capacitive touch sensor. In: 2016 IEEE international conference on electron devices and solid-state circuits (EDSSC), IEEE, pp 334–337 Abhishek KS, Qubeley LCF, Ho D (2016) Glove-based hand gesture recognition sign language translator using capacitive touch sensor. In: 2016 IEEE international conference on electron devices and solid-state circuits (EDSSC), IEEE, pp 334–337
4.
Zurück zum Zitat Al Farid F, Hashim N, Abdullah J, Bhuiyan MR, Shahida Mohd Isa WN, Uddin J, Haque MA, Husen MN (2022) A structured and methodological review on vision-based hand gesture recognition system. J Imaging 8(6):153CrossRef Al Farid F, Hashim N, Abdullah J, Bhuiyan MR, Shahida Mohd Isa WN, Uddin J, Haque MA, Husen MN (2022) A structured and methodological review on vision-based hand gesture recognition system. J Imaging 8(6):153CrossRef
5.
Zurück zum Zitat Mantecón T, del Blanco CR, Jaureguizar F, García N (2016) Hand gesture recognition using infrared imagery provided by leap motion controller. In: International conference on advanced concepts for intelligent vision systems, Springer, pp 47–57 Mantecón T, del Blanco CR, Jaureguizar F, García N (2016) Hand gesture recognition using infrared imagery provided by leap motion controller. In: International conference on advanced concepts for intelligent vision systems, Springer, pp 47–57
6.
Zurück zum Zitat Huang D-Y, Hu W-C, Chang S-H (2011) Gabor filter-based hand-pose angle estimation for hand gesture recognition under varying illumination. Expert Syst Appl 38(5):6031–6042CrossRef Huang D-Y, Hu W-C, Chang S-H (2011) Gabor filter-based hand-pose angle estimation for hand gesture recognition under varying illumination. Expert Syst Appl 38(5):6031–6042CrossRef
7.
Zurück zum Zitat Singha J, Roy A, Laskar RH (2018) Dynamic hand gesture recognition using vision-based approach for human-computer interaction. Neural Comput Appl 29(4):1129–1141CrossRef Singha J, Roy A, Laskar RH (2018) Dynamic hand gesture recognition using vision-based approach for human-computer interaction. Neural Comput Appl 29(4):1129–1141CrossRef
8.
Zurück zum Zitat Yang Z, Li Y, Chen W, Zheng Y (2012) Dynamic hand gesture recognition using hidden markov models. In: 2012 7th international conference on computer science & education (ICCSE), IEEE, pp 360–365 Yang Z, Li Y, Chen W, Zheng Y (2012) Dynamic hand gesture recognition using hidden markov models. In: 2012 7th international conference on computer science & education (ICCSE), IEEE, pp 360–365
9.
Zurück zum Zitat Yingxin X, Jinghua L, Lichun W, Dehui K (2016) A robust hand gesture recognition method via convolutional neural network. In: 6th international conference on digital home (ICDH). IEEE 2016:64–67 Yingxin X, Jinghua L, Lichun W, Dehui K (2016) A robust hand gesture recognition method via convolutional neural network. In: 6th international conference on digital home (ICDH). IEEE 2016:64–67
10.
Zurück zum Zitat Oyedotun OK, Khashman A (2017) Deep learning in vision-based static hand gesture recognition. Neural Comput Appl 28(12):3941–3951CrossRef Oyedotun OK, Khashman A (2017) Deep learning in vision-based static hand gesture recognition. Neural Comput Appl 28(12):3941–3951CrossRef
11.
Zurück zum Zitat Fang W, Ding Y, Zhang F, Sheng J (2019) Gesture recognition based on CNN and DCGAN for calculation and text output. IEEE Access 7:28230–28237CrossRef Fang W, Ding Y, Zhang F, Sheng J (2019) Gesture recognition based on CNN and DCGAN for calculation and text output. IEEE Access 7:28230–28237CrossRef
12.
Zurück zum Zitat Adithya V, Rajesh R (2020) A deep convolutional neural network approach for static hand gesture recognition. Proc Comput Sci 171:2353–2361CrossRef Adithya V, Rajesh R (2020) A deep convolutional neural network approach for static hand gesture recognition. Proc Comput Sci 171:2353–2361CrossRef
13.
Zurück zum Zitat Neethu P, Suguna R, Sathish D (2020) An efficient method for human hand gesture detection and recognition using deep learning convolutional neural networks. Soft Comput 24:15239–15248CrossRef Neethu P, Suguna R, Sathish D (2020) An efficient method for human hand gesture detection and recognition using deep learning convolutional neural networks. Soft Comput 24:15239–15248CrossRef
14.
Zurück zum Zitat Sen A, Mishra TK, Dash R (2022) A novel hand gesture detection and recognition system based on ensemble-based convolutional neural network. Multimed Tools Appl 81(28):40043–40066CrossRef Sen A, Mishra TK, Dash R (2022) A novel hand gesture detection and recognition system based on ensemble-based convolutional neural network. Multimed Tools Appl 81(28):40043–40066CrossRef
15.
Zurück zum Zitat Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S, et al. (2020) An image is worth 16x16 words: Transformers for image recognition at scale, arXiv preprint arXiv:2010.11929 Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S, et al. (2020) An image is worth 16x16 words: Transformers for image recognition at scale, arXiv preprint arXiv:​2010.​11929
16.
Zurück zum Zitat Godoy RV, Lahr GJ, Dwivedi A, Reis TJ, Polegato PH, Becker M, Caurin GA, Liarokapis M (2022) Electromyography-based, robust hand motion classification employing temporal multi-channel vision transformers. IEEE Robot Autom Lett 7(4):10200–10207CrossRef Godoy RV, Lahr GJ, Dwivedi A, Reis TJ, Polegato PH, Becker M, Caurin GA, Liarokapis M (2022) Electromyography-based, robust hand motion classification employing temporal multi-channel vision transformers. IEEE Robot Autom Lett 7(4):10200–10207CrossRef
17.
Zurück zum Zitat Montazerin M, Zabihi S, Rahimian E, Mohammadi A, Naderkhani F (2022) Vit-hgr: Vision transformer-based hand gesture recognition from high density surface EMG signals, arXiv preprint arXiv:2201.10060 Montazerin M, Zabihi S, Rahimian E, Mohammadi A, Naderkhani F (2022) Vit-hgr: Vision transformer-based hand gesture recognition from high density surface EMG signals, arXiv preprint arXiv:​2201.​10060
18.
Zurück zum Zitat Rautaray SS, Agrawal A (2010) A novel human computer interface based on hand gesture recognition using computer vision techniques. In: Proceedings of the first international conference on intelligent interactive technologies and multimedia, pp 292–296 Rautaray SS, Agrawal A (2010) A novel human computer interface based on hand gesture recognition using computer vision techniques. In: Proceedings of the first international conference on intelligent interactive technologies and multimedia, pp 292–296
19.
Zurück zum Zitat Kim K-S, Jang D-S, Choi H-I (2007) Real time face tracking with pyramidal lucas-kanade feature tracker. In: Computational science and its applications–ICCSA 2007: international conference, Kuala Lumpur, Malaysia, August 26-29, 2007. Proceedings, Part I 7, Springer, pp 1074–1082 Kim K-S, Jang D-S, Choi H-I (2007) Real time face tracking with pyramidal lucas-kanade feature tracker. In: Computational science and its applications–ICCSA 2007: international conference, Kuala Lumpur, Malaysia, August 26-29, 2007. Proceedings, Part I 7, Springer, pp 1074–1082
20.
Zurück zum Zitat Paliwal M, Sharma G, Nath D, Rathore A, Mishra H, Mondal S (2013) A dynamic hand gesture recognition system for controlling vlc media player. In: 2013 international conference on advances in technology and engineering (ICATE), IEEE, pp 1–4 Paliwal M, Sharma G, Nath D, Rathore A, Mishra H, Mondal S (2013) A dynamic hand gesture recognition system for controlling vlc media player. In: 2013 international conference on advances in technology and engineering (ICATE), IEEE, pp 1–4
21.
Zurück zum Zitat Shibly KH, Dey SK, Islam MA, Showrav SI (2019) Design and development of hand gesture based virtual mouse. In: 2019 1st international conference on advances in science, engineering and robotics technology (ICASERT), IEEE, pp 1–5 Shibly KH, Dey SK, Islam MA, Showrav SI (2019) Design and development of hand gesture based virtual mouse. In: 2019 1st international conference on advances in science, engineering and robotics technology (ICASERT), IEEE, pp 1–5
22.
Zurück zum Zitat Tsai T-H, Huang C-C, Zhang K-L (2020) Design of hand gesture recognition system for human-computer interaction. Multimed Tools Appl 79(9):5989–6007CrossRef Tsai T-H, Huang C-C, Zhang K-L (2020) Design of hand gesture recognition system for human-computer interaction. Multimed Tools Appl 79(9):5989–6007CrossRef
23.
26.
Zurück zum Zitat Jamil N, Sembok TMT, Bakar ZA (2008) Noise removal and enhancement of binary images using morphological operations. In: International symposium on information technology, Vol. 4. IEEE 2008:1–6 Jamil N, Sembok TMT, Bakar ZA (2008) Noise removal and enhancement of binary images using morphological operations. In: International symposium on information technology, Vol. 4. IEEE 2008:1–6
27.
Zurück zum Zitat Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition, arXiv preprint arXiv:1409.1556 Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition, arXiv preprint arXiv:​1409.​1556
28.
Zurück zum Zitat He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
29.
Zurück zum Zitat Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9 Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
32.
Zurück zum Zitat Kauten C (2018) Super Mario Bros for OpenAI Gym, GitHub Kauten C (2018) Super Mario Bros for OpenAI Gym, GitHub
33.
Zurück zum Zitat Asaari MSM, Suandi SA (2010) Hand gesture tracking system using adaptive Kalman filter. In: 2010 10th international conference on intelligent systems design and applications, IEEE, pp 166–171 Asaari MSM, Suandi SA (2010) Hand gesture tracking system using adaptive Kalman filter. In: 2010 10th international conference on intelligent systems design and applications, IEEE, pp 166–171
34.
Zurück zum Zitat Ren S, He K, Girshick R, Sun J. Faster (2015) r-cnn: Towards real-time object detection with region proposal networks. Adv Neural Inf Proc Syst 28 Ren S, He K, Girshick R, Sun J. Faster (2015) r-cnn: Towards real-time object detection with region proposal networks. Adv Neural Inf Proc Syst 28
35.
Zurück zum Zitat Tan M, Pang R, Le QV (2020) Efficientdet: scalable and efficient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10781–10790 Tan M, Pang R, Le QV (2020) Efficientdet: scalable and efficient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10781–10790
36.
Zurück zum Zitat Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L, Imagenet: A large-scale hierarchical image database, in, (2009) IEEE conference on computer vision and pattern recognition. Ieee 2009:248–255 Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L, Imagenet: A large-scale hierarchical image database, in, (2009) IEEE conference on computer vision and pattern recognition. Ieee 2009:248–255
37.
Zurück zum Zitat Bazi Y, Bashmal L, Rahhal MMA, Dayil RA, Ajlan NA (2021) Vision transformers for remote sensing image classification. Remote Sens 13(3):516CrossRef Bazi Y, Bashmal L, Rahhal MMA, Dayil RA, Ajlan NA (2021) Vision transformers for remote sensing image classification. Remote Sens 13(3):516CrossRef
Metadaten
Titel
Deep Learning-Based Hand Gesture Recognition System and Design of a Human–Machine Interface
verfasst von
Abir Sen
Tapas Kumar Mishra
Ratnakar Dash
Publikationsdatum
25.10.2023
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 9/2023
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-023-11433-8

Weitere Artikel der Ausgabe 9/2023

Neural Processing Letters 9/2023 Zur Ausgabe

Neuer Inhalt