Skip to main content
Top

2019 | OriginalPaper | Chapter

Towards a Professional Gesture Recognition with RGB-D from Smartphone

Authors : Pablo Vicente Moñivar, Sotiris Manitsaris, Alina Glushkova

Published in: Computer Vision Systems

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The goal of this work is to build the basis for a smartphone application that provides functionalities for recording human motion data, train machine learning algorithms and recognize professional gestures. First, we take advantage of the new mobile phone cameras, either infrared or stereoscopic, to record RGB-D data. Then, a bottom-up pose estimation algorithm based on Deep Learning extracts the 2D human skeleton and exports the 3rd dimension using the depth. Finally, we use a gesture recognition engine, which is based on K-means and Hidden Markov Models (HMMs). The performance of the machine learning algorithm has been tested with professional gestures using a silk-weaving and a TV-assembly datasets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Fang, H., Xie, S., Lu, C.: RMPE: regional multi-person pose estimation (2016) Fang, H., Xie, S., Lu, C.: RMPE: regional multi-person pose estimation (2016)
2.
go back to reference Güler, R.A., Neverova, N., Kokkinos, I.: DensePose: dense human pose estimation in the wild (2018) Güler, R.A., Neverova, N., Kokkinos, I.: DensePose: dense human pose estimation in the wild (2018)
3.
go back to reference Abdulla, W.: Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow. GitHub repository (2017) Abdulla, W.: Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow. GitHub repository (2017)
4.
go back to reference Pishchulin, L., et al.: DeepCut: joint subset partition and labeling for multi person pose estimation (2015) Pishchulin, L., et al.: DeepCut: joint subset partition and labeling for multi person pose estimation (2015)
5.
go back to reference Cao, Z., Simon, T., Wei, S.E., Sheikh, Y.: Openpose: realtime multi-person 2D pose estimation using part affinity fields (2018) Cao, Z., Simon, T., Wei, S.E., Sheikh, Y.: Openpose: realtime multi-person 2D pose estimation using part affinity fields (2018)
6.
go back to reference Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556 (2014)
7.
go back to reference Ignatov, A., et al.: AI benchmark: running deep neural networks on android smartphones (2018) Ignatov, A., et al.: AI benchmark: running deep neural networks on android smartphones (2018)
8.
go back to reference Krafka, K.: Eye tracking for everyone (2016) Krafka, K.: Eye tracking for everyone (2016)
9.
go back to reference Zhang, L., Zhu, G., Shen, P., Song, J.: Learning spatiotemporal features using 3DCNN and convolutional LSTM for gesture recognition. In: ICCV Workshop (2017) Zhang, L., Zhu, G., Shen, P., Song, J.: Learning spatiotemporal features using 3DCNN and convolutional LSTM for gesture recognition. In: ICCV Workshop (2017)
10.
go back to reference Wang, H., Wang, P., Song, Z., Li, W.: Large-scale multimodal gesture recognition using heterogeneous networks. In: ICCV 2017 Workshop, pp. 3129–3137 (2017) Wang, H., Wang, P., Song, Z., Li, W.: Large-scale multimodal gesture recognition using heterogeneous networks. In: ICCV 2017 Workshop, pp. 3129–3137 (2017)
11.
go back to reference Wang, P., Li, W., Liu, S., Gao, Z., Tang, C., Ogunbona, P.: Large-scale isolated gesture recognition using convolutional neural networks (2017) Wang, P., Li, W., Liu, S., Gao, Z., Tang, C., Ogunbona, P.: Large-scale isolated gesture recognition using convolutional neural networks (2017)
12.
go back to reference Corradini, A.: Dynamic time warping for off-line recognition of a small gesture vocabulary, pp. 82-89 (2001) Corradini, A.: Dynamic time warping for off-line recognition of a small gesture vocabulary, pp. 82-89 (2001)
13.
go back to reference Coupeté, E., Moutarde, F., Manitsaris, S.: Multi-users online recognition of technical gestures for natural human-robot collaboration in manufacturing. Auton. Robot. 43, 1309–1325 (2018)CrossRef Coupeté, E., Moutarde, F., Manitsaris, S.: Multi-users online recognition of technical gestures for natural human-robot collaboration in manufacturing. Auton. Robot. 43, 1309–1325 (2018)CrossRef
14.
go back to reference Gillian, N., Paradiso, J.A.: The gesture recognition toolkit. J. Mach. Learn. Res. 15, 3483–3487 (2014)MathSciNet Gillian, N., Paradiso, J.A.: The gesture recognition toolkit. J. Mach. Learn. Res. 15, 3483–3487 (2014)MathSciNet
Metadata
Title
Towards a Professional Gesture Recognition with RGB-D from Smartphone
Authors
Pablo Vicente Moñivar
Sotiris Manitsaris
Alina Glushkova
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-34995-0_22

Premium Partner