Using Segmented 3D Point Clouds for Accurate Likelihood Approximation in Human Pose Tracking

Authors: Nicolas Lehment, Moritz Kaiser, Gerhard Rigoll

Published in: International Journal of Computer Vision | Issue 3/2013

Published: 1 February 2013

Abstract

The observation likelihood approximation is a central problem in stochastic human pose tracking. In this article we present a new approach to quantify the correspondence between hypothetical and observed human poses in depth images. Our approach is based on segmented point clouds, enabling accurate approximations even under conditions of self-occlusion and in the absence of color or texture cues. The segmentation step extracts small regions of high saliency such as hands or arms and ensures that the information contained in these regions is not marginalized by larger, less salient regions such as the chest. To enable the rapid, parallel evaluation of many poses, a fast ellipsoid body model is used which handles occlusion and intersection detection in an integrated manner. The proposed approximation function is evaluated on both synthetic and real camera data. In addition, we compare our approximation function against the corresponding function used by a state-of-the-art pose tracker. The approach is suitable for parallelization on GPUs or multicore CPUs.
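
The effect of segmentation described in the abstract can be illustrated with a minimal sketch. It is not the authors' approximation function: the heavy-tailed kernel, the `scale` parameter, the function names, and the toy distances are all assumptions chosen for illustration. The sketch only shows why averaging within each segment first keeps a small, salient region such as a hand from being outweighed by a large region such as the chest.

```python
def kernel(distance, scale):
    """Hypothetical heavy-tailed similarity in (0, 1]; equals 1 for a perfect match."""
    return 1.0 / (1.0 + (distance / scale) ** 2)


def pooled_score(distances_by_segment, scale):
    """Average the kernel over all points, ignoring segment membership."""
    values = [
        kernel(d, scale)
        for segment_distances in distances_by_segment.values()
        for d in segment_distances
    ]
    return sum(values) / len(values)


def segment_balanced_score(distances_by_segment, scale):
    """Average the kernel within each (non-empty) segment, then across segments."""
    per_segment = [
        sum(kernel(d, scale) for d in segment_distances) / len(segment_distances)
        for segment_distances in distances_by_segment.values()
    ]
    return sum(per_segment) / len(per_segment)


# Toy data (assumed): point-to-model distances in metres.
# The chest fits perfectly, the hand is misplaced by 0.3 m.
distances = {"chest": [0.0] * 1000, "hand": [0.3] * 20}

pooled = pooled_score(distances, scale=0.1)
balanced = segment_balanced_score(distances, scale=0.1)
print(f"pooled:   {pooled:.3f}")    # 0.982, the hand error is almost invisible
print(f"balanced: {balanced:.3f}")  # 0.550, the hand error clearly lowers the score
```

In this toy case the pooled score barely reacts to the misplaced hand because the chest contributes fifty times as many points, whereas the segment-balanced score drops markedly.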

The data sets used are available from the author upon request.

Title: Using Segmented 3D Point Clouds for Accurate Likelihood Approximation in Human Pose Tracking
Authors: Nicolas Lehment, Moritz Kaiser, Gerhard Rigoll
Publication date: 1 February 2013
Publisher: Springer US
Published in: International Journal of Computer Vision, Issue 3/2013
Print ISSN: 0920-5691
Electronic ISSN: 1573-1405
DOI: https://doi.org/10.1007/s11263-012-0557-0
