2015 | Original Paper | Book Chapter

Dynamic Hand Gesture Recognition Using Generalized Time Warping and Deep Belief Networks

Written by: Cristian A. Torres-Valencia, Hernán F. García, Germán A. Holguín, Mauricio A. Álvarez, Álvaro Orozco

Published in: Advances in Visual Computing

Publisher: Springer International Publishing

Abstract

Body gestures play an important role in human communication; hand gestures in particular are the most distinctive features of sign languages. Several works have proposed static and dynamic approaches to hand gesture recognition. Nevertheless, given the wide variety of signs and the dynamic changes exhibited by different hand motions, a strategy for modeling these dynamic changes is still needed. In this work we propose a framework for dynamic hand gesture recognition that uses Generalized Time Warping (GTW), a well-known method for time series alignment. Several features based on texture descriptors are extracted from the aligned sequences of hand gestures. Hand motion recognition is then carried out with Convolutional Neural Networks. The results show that the proposed methodology accurately recognizes several hand gestures from the RVL-SLLL American Sign Language Database.
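To make the alignment step concrete, the sketch below aligns two gesture sequences of different lengths with classic dynamic time warping (DTW). This is not the GTW algorithm used in the chapter: DTW is the simpler pairwise technique that GTW extends, and it is shown here only to illustrate what aligning time series means. The function names `dtw_align` and `_as_frames`, and the toy sequences, are assumptions made for this illustration.

```python
import numpy as np


def _as_frames(seq):
    """Return seq as a 2-D array of shape (frames, features)."""
    a = np.asarray(seq, dtype=float)
    return a[:, None] if a.ndim == 1 else a


def dtw_align(x, y):
    """Align two feature sequences with classic DTW (illustrative, not GTW).

    x, y: sequences of per-frame feature vectors (or scalars).
    Returns (total_cost, path), where path lists the matched (i, j) frame pairs.
    """
    x, y = _as_frames(x), _as_frames(y)
    n, m = len(x), len(y)

    # Euclidean distance between every frame of x and every frame of y.
    dist = np.linalg.norm(x[:, None, :] - y[None, :, :], axis=2)

    # acc[i, j] = minimal cost of aligning the first i frames of x
    # with the first j frames of y.
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            acc[i, j] = dist[i - 1, j - 1] + min(
                acc[i - 1, j - 1], acc[i - 1, j], acc[i, j - 1]
            )

    # Backtrack from the last cell to the first to recover the warping path.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        candidates = [(i - 1, j - 1), (i - 1, j), (i, j - 1)]
        i, j = min(candidates, key=lambda s: acc[s])
        path.append((i - 1, j - 1))
    path.reverse()
    return float(acc[n, m]), path


# Toy example: the second sequence holds one value for an extra frame.
cost, path = dtw_align([0, 1, 2, 3], [0, 1, 1, 2, 3])
```

In a pipeline like the one described in the abstract, each frame would carry a descriptor vector rather than a single number, and the matched frame pairs in `path` would be used to resample the sequences to a common length before feature extraction.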

Footnotes
1
We use the GTW implementation available at http://www.f-zhou.com/ta_code.html.
 
3
We use the Deep Learning toolbox available at http://deeplearning.cs.toronto.edu/codes.
 
Metadata
Title
Dynamic Hand Gesture Recognition Using Generalized Time Warping and Deep Belief Networks
Written by
Cristian A. Torres-Valencia
Hernán F. García
Germán A. Holguín
Mauricio A. Álvarez
Álvaro Orozco
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-27863-6_64
