Skip to main content
Erschienen in: International Journal of Computer Vision 3/2013

01.02.2013

Using Segmented 3D Point Clouds for Accurate Likelihood Approximation in Human Pose Tracking

verfasst von: Nicolas Lehment, Moritz Kaiser, Gerhard Rigoll

Erschienen in: International Journal of Computer Vision | Ausgabe 3/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The observation likelihood approximation is a central problem in stochastic human pose tracking. In this article we present a new approach to quantify the correspondence between hypothetical and observed human poses in depth images. Our approach is based on segmented point clouds, enabling accurate approximations even under conditions of self-occlusion and in the absence of color or texture cues. The segmentation step extracts small regions of high saliency such as hands or arms and ensures that the information contained in these regions is not marginalized by larger, less salient regions such as the chest. To enable the rapid, parallel evaluation of many poses, a fast ellipsoid body model is used which handles occlusion and intersection detection in an integrated manner. The proposed approximation function is evaluated on both synthetic and real camera data. In addition, we compare our approximation function against the corresponding function used by a state-of-the-art pose tracker. The approach is suitable for parallelization on GPUs or multicore CPUs.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
The data sets used are available from the author upon request.
 
Literatur
Zurück zum Zitat Azad, P., Asfour, T., & Dillmann, R. (2008). Robust real-time stereo-based markerless human motion capture. In 8th IEEE-RAS international conference on humanoid robots (pp. 700–707). Azad, P., Asfour, T., & Dillmann, R. (2008). Robust real-time stereo-based markerless human motion capture. In 8th IEEE-RAS international conference on humanoid robots (pp. 700–707).
Zurück zum Zitat Baak, A., Müller, M., Bharaj, G., Seidel, H. P., & Theobalt, C. (2011). A data-driven approach for real-time full body pose reconstruction from a depth camera. In IEEE 13th international conference on computer vision (pp. 1092–1099). Baak, A., Müller, M., Bharaj, G., Seidel, H. P., & Theobalt, C. (2011). A data-driven approach for real-time full body pose reconstruction from a depth camera. In IEEE 13th international conference on computer vision (pp. 1092–1099).
Zurück zum Zitat Bernier, O., Cheungmonchan, P., & Bouguet, A. (2008). Fast nonparametric belief propagation for real-time stereo articulated body tracking. Computer Vision and Image Understanding, 113(1), 29–47. CrossRef Bernier, O., Cheungmonchan, P., & Bouguet, A. (2008). Fast nonparametric belief propagation for real-time stereo articulated body tracking. Computer Vision and Image Understanding, 113(1), 29–47. CrossRef
Zurück zum Zitat Cayton, L. (2011). A nearest neighbor data structure for graphics hardware. In Proceedings of the first international workshop on accelerating data management systems using modern processor and storage architectures (ADMS 2010) (pp. 243–251). Cayton, L. (2011). A nearest neighbor data structure for graphics hardware. In Proceedings of the first international workshop on accelerating data management systems using modern processor and storage architectures (ADMS 2010) (pp. 243–251).
Zurück zum Zitat Deutscher, J., & Reid, I. (2005). Articulated body motion capture by stochastic search. International Journal of Computer Vision, 61(2), 185–205. CrossRef Deutscher, J., & Reid, I. (2005). Articulated body motion capture by stochastic search. International Journal of Computer Vision, 61(2), 185–205. CrossRef
Zurück zum Zitat Fontmarty, M., Lerasle, F., & Danes, P. (2007). Data fusion within a modified annealed particle filter dedicated to human motion capture. In IEEE/RSJ international conference on intelligent robots and systems (pp. 3391–3396). Fontmarty, M., Lerasle, F., & Danes, P. (2007). Data fusion within a modified annealed particle filter dedicated to human motion capture. In IEEE/RSJ international conference on intelligent robots and systems (pp. 3391–3396).
Zurück zum Zitat Fontmarty, M., Lerasle, F., & Danes, P. (2009). Likelihood tuning for particle filter in visual tracking. In 16th IEEE international conference on image processing (pp. 4101–4104). Fontmarty, M., Lerasle, F., & Danes, P. (2009). Likelihood tuning for particle filter in visual tracking. In 16th IEEE international conference on image processing (pp. 4101–4104).
Zurück zum Zitat Gall, J., Stoll, C, de Aguiar, E., Theobalt, C., Rosenhahn, B., & Seidel, H. (2009). Motion capture using joint skeleton tracking and surface estimation. In: IEEE computer society conference on computer vision and pattern recognition (workshops), pp. 1746–1753. Gall, J., Stoll, C, de Aguiar, E., Theobalt, C., Rosenhahn, B., & Seidel, H. (2009). Motion capture using joint skeleton tracking and surface estimation. In: IEEE computer society conference on computer vision and pattern recognition (workshops), pp. 1746–1753.
Zurück zum Zitat Ganapathi, V., Plagemann, C., Koller, D., & Thrun, S. (2010). Real time motion capture using a single time-of-flight camera. In 23rd IEEE conference on computer vision and pattern recognition (pp. 755–762). Ganapathi, V., Plagemann, C., Koller, D., & Thrun, S. (2010). Real time motion capture using a single time-of-flight camera. In 23rd IEEE conference on computer vision and pattern recognition (pp. 755–762).
Zurück zum Zitat Girshick, R., Shotton, J., Kohli, P., Criminisi, A., & Fitzgibbon, A. (2011). Efficient regression of general-activity human poses from depth images. In IEEE 13th international conference on computer vision. IEEE Press, New York (pp. 415–422). Girshick, R., Shotton, J., Kohli, P., Criminisi, A., & Fitzgibbon, A. (2011). Efficient regression of general-activity human poses from depth images. In IEEE 13th international conference on computer vision. IEEE Press, New York (pp. 415–422).
Zurück zum Zitat Isard, M., & Blake, A. (1998). Condensation—conditional density propagation for visual tracking. International Journal of Computer Vision, 29(1), 5–28. CrossRef Isard, M., & Blake, A. (1998). Condensation—conditional density propagation for visual tracking. International Journal of Computer Vision, 29(1), 5–28. CrossRef
Zurück zum Zitat Lehment, N. H., Arsić, D., & Rigoll, G. (2010). Cue-independent extending inverse kinematics for robust pose estimation in 3d point clouds. In 17th IEEE international conference on image processing (pp. 2465–2468). Lehment, N. H., Arsić, D., & Rigoll, G. (2010). Cue-independent extending inverse kinematics for robust pose estimation in 3d point clouds. In 17th IEEE international conference on image processing (pp. 2465–2468).
Zurück zum Zitat Lichtenauer, J., Reinders, M., & Hendriks, E. (2004). Influence of the observation likelihood function on particle filtering performance in tracking applications. In 6th IEEE international conference on automatic face and gesture recognition (pp. 767–772). Lichtenauer, J., Reinders, M., & Hendriks, E. (2004). Influence of the observation likelihood function on particle filtering performance in tracking applications. In 6th IEEE international conference on automatic face and gesture recognition (pp. 767–772).
Zurück zum Zitat Lorentz, H. (1915). The width of spectral lines. Koninklijke Nederlandse Akademie van Weteschappen Proceedings Series B Physical Sciences, 18, 134–150. Lorentz, H. (1915). The width of spectral lines. Koninklijke Nederlandse Akademie van Weteschappen Proceedings Series B Physical Sciences, 18, 134–150.
Zurück zum Zitat Markley, F., Cheng, Y., Crassidis, J., & Oshman, Y. (2007). Averaging quaternions. Journal of Guidance, Control, and Dynamics, 30(4), 1193. CrossRef Markley, F., Cheng, Y., Crassidis, J., & Oshman, Y. (2007). Averaging quaternions. Journal of Guidance, Control, and Dynamics, 30(4), 1193. CrossRef
Zurück zum Zitat Mikić, I., Trivedi, M., Hunter, E., & Cosman, P. (2003). Human body model acquisition and tracking using voxel data. International Journal of Computer Vision, 53, 199–223. CrossRef Mikić, I., Trivedi, M., Hunter, E., & Cosman, P. (2003). Human body model acquisition and tracking using voxel data. International Journal of Computer Vision, 53, 199–223. CrossRef
Zurück zum Zitat O’Leary, D. P. (1990). Robust regression computation using iteratively reweighted least squares. SIAM Journal on Matrix Analysis and Applications, 11, 466–480. MathSciNetMATHCrossRef O’Leary, D. P. (1990). Robust regression computation using iteratively reweighted least squares. SIAM Journal on Matrix Analysis and Applications, 11, 466–480. MathSciNetMATHCrossRef
Zurück zum Zitat Poppe, R. (2007). Vision-based human motion analysis: an overview. Computer Vision and Image Understanding, 108(1–2), 4–18. CrossRef Poppe, R. (2007). Vision-based human motion analysis: an overview. Computer Vision and Image Understanding, 108(1–2), 4–18. CrossRef
Zurück zum Zitat Rusu, R. B., & Cousins, S. (2011). 3D is here: point cloud library (PCL). In IEEE international conference on robotics and automation (ICRA) (pp. 1–4). Rusu, R. B., & Cousins, S. (2011). 3D is here: point cloud library (PCL). In IEEE international conference on robotics and automation (ICRA) (pp. 1–4).
Zurück zum Zitat Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., & Blake, A. (2011). Real-time human pose recognition in parts from a single depth image. In IEEE computer society conference on computer vision and pattern recognition (pp. 1297–1304). Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., & Blake, A. (2011). Real-time human pose recognition in parts from a single depth image. In IEEE computer society conference on computer vision and pattern recognition (pp. 1297–1304).
Zurück zum Zitat Sudderth, E. B., Ihler, A. T., Ihler, E. T., Freeman, W. T., & Willsky, A. S. (2002). Nonparametric belief propagation. In IEEE computer society conference on computer vision and pattern recognition (Vol. 1, pp. 605–612). Sudderth, E. B., Ihler, A. T., Ihler, E. T., Freeman, W. T., & Willsky, A. S. (2002). Nonparametric belief propagation. In IEEE computer society conference on computer vision and pattern recognition (Vol. 1, pp. 605–612).
Zurück zum Zitat Zhu, Y., & Fujimura, K. (2009). Bayesian 3D human body pose tracking from depth image sequences. In 9th Asian conference on computer vision (pp. 267–278). Zhu, Y., & Fujimura, K. (2009). Bayesian 3D human body pose tracking from depth image sequences. In 9th Asian conference on computer vision (pp. 267–278).
Metadaten
Titel
Using Segmented 3D Point Clouds for Accurate Likelihood Approximation in Human Pose Tracking
verfasst von
Nicolas Lehment
Moritz Kaiser
Gerhard Rigoll
Publikationsdatum
01.02.2013
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 3/2013
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-012-0557-0

Weitere Artikel der Ausgabe 3/2013

International Journal of Computer Vision 3/2013 Zur Ausgabe

Premium Partner