Published in: Neural Computing and Applications 12/2017

11.04.2016 | Original Article

Deep learning in vision-based static hand gesture recognition

Authors: Oyebade K. Oyedotun, Adnan Khashman

Abstract

Hand gestures have proven effective for communication between humans, and active research is ongoing into replicating the same success in computer vision systems. Human–computer interaction can be significantly improved by advances in systems capable of recognizing different hand gestures. Many earlier works consider only the recognition of significantly differentiable hand gestures and therefore often select just a few gestures from the American Sign Language (ASL) for recognition. In contrast, we propose applying deep learning to the problem of hand gesture recognition for all 24 hand gestures obtained from Thomas Moeslund’s gesture recognition database. We show that more biologically inspired, deep neural networks such as the convolutional neural network and the stacked denoising autoencoder are capable of learning the complex hand gesture classification task with lower error rates. The considered networks are trained and tested on data obtained from the above-mentioned public database; the results are then compared against earlier works in which only small subsets of the ASL hand gestures are considered for recognition.

Metadata
Title
Deep learning in vision-based static hand gesture recognition
Authors
Oyebade K. Oyedotun
Adnan Khashman
Publication date
11.04.2016
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 12/2017
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-016-2294-8
