Skip to main content
Top
Published in: Machine Vision and Applications 6/2014

01-08-2014 | Original Paper

3D human pose estimation from image using couple sparse coding

Authors: Mohammadreza Zolfaghari, Amin Jourabloo, Samira Ghareh Gozlou, Bahman Pedrood, Mohammad T. Manzuri-Shalmani

Published in: Machine Vision and Applications | Issue 6/2014

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Recent studies have demonstrated that high-level semantics in data can be captured using sparse representation. In this paper, we propose an approach to human body pose estimation in static images based on sparse representation. Given a visual input, the objective is to estimate 3D human body pose using feature space information and geometrical information of the pose space. On the assumption that each data point and its neighbors are likely to reside on a locally linear patch of the underlying manifold, our method learns the sparse representation of the new input using both feature and pose space information and then estimates the corresponding 3D pose by a linear combination of the bases of the pose dictionary. Two strategies for dictionary construction are presented: (i) constructing the dictionary by randomly selecting the frames of a sequence and (ii) selecting specific frames of a sequence as dictionary atoms. We analyzed the effect of each strategy on the accuracy of pose estimation. Extensive experiments on datasets of various human activities show that our proposed method outperforms state-of-the-art methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Footnotes
1
BVH format created by Biovision Company to describing 3D pose in animation production. http://​www.​cs.​wisc.​edu/​graphics/​Courses/​cs-838-1999/​Jeff/​BVH.​html
 
Literature
2.
go back to reference Agarwal, A., Triggs, B.: Monocular human motion capture with a mixture of regressors. In: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition—Workshops, vol. 03, CVPR ’05, pp. 72. IEEE Computer Society, Washington, DC (2005) doi:10.1109/CVPR.2005.496 Agarwal, A., Triggs, B.: Monocular human motion capture with a mixture of regressors. In: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition—Workshops, vol. 03, CVPR ’05, pp. 72. IEEE Computer Society, Washington, DC (2005) doi:10.​1109/​CVPR.​2005.​496
5.
go back to reference Andriluka, M., Roth, S., Schiele, B.: Monocular 3d pose estimation and tracking by detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 623–630 (2010). doi:10.1109/CVPR.2010.5540156 Andriluka, M., Roth, S., Schiele, B.: Monocular 3d pose estimation and tracking by detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 623–630 (2010). doi:10.​1109/​CVPR.​2010.​5540156
11.
go back to reference Christoudias, C.M., Darrell, T.: On modelling nonlinear shape-and-texture appearance manifolds. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, vol. 02, CVPR ’05, pp. 1067–1074. IEEE Computer Society, Washington, DC (2005). doi:10.1109/CVPR.2005.255 Christoudias, C.M., Darrell, T.: On modelling nonlinear shape-and-texture appearance manifolds. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, vol. 02, CVPR ’05, pp. 1067–1074. IEEE Computer Society, Washington, DC (2005). doi:10.​1109/​CVPR.​2005.​255
13.
go back to reference Donoho, D.L.: For most large underdetermined systems of linear equations the minimal l1-norm solution is also the sparsest solution. Commun. Pure Appl. Math. 59(6), 797–829 (2006)CrossRefMATHMathSciNet Donoho, D.L.: For most large underdetermined systems of linear equations the minimal l1-norm solution is also the sparsest solution. Commun. Pure Appl. Math. 59(6), 797–829 (2006)CrossRefMATHMathSciNet
15.
go back to reference Elgammal, A., Lee, C.S.: Inferring 3d body pose from silhouettes using activity manifold learning. In: Proceedings of the IEEE Computer Society Conference on Computer vision and Pattern Recognition. CVPR’04, pp. 681–688. IEEE Computer Society, Washington, DC (2004) Elgammal, A., Lee, C.S.: Inferring 3d body pose from silhouettes using activity manifold learning. In: Proceedings of the IEEE Computer Society Conference on Computer vision and Pattern Recognition. CVPR’04, pp. 681–688. IEEE Computer Society, Washington, DC (2004)
16.
go back to reference Hara, K., Kurokawa, T.: Human pose estimation using patch-based candidate generation and model-based verification. In: IEEE International Conference on Automatic Face Gesture Recognition and Workshops (FG), pp. 687–693 (2011). doi:10.1109/FG.2011.5771331 Hara, K., Kurokawa, T.: Human pose estimation using patch-based candidate generation and model-based verification. In: IEEE International Conference on Automatic Face Gesture Recognition and Workshops (FG), pp. 687–693 (2011). doi:10.​1109/​FG.​2011.​5771331
17.
go back to reference Huang, J.B., Yang, M.H.: Estimating human pose from occluded images. In: ACCV (1), Lecture Notes in Computer Science, vol. 5994, pp. 48–60. Springer, Berlin (2009) Huang, J.B., Yang, M.H.: Estimating human pose from occluded images. In: ACCV (1), Lecture Notes in Computer Science, vol. 5994, pp. 48–60. Springer, Berlin (2009)
18.
go back to reference Huang, J.B., Yang, M.H.: Fast sparse representation with prototypes. In: CVPR, pp. 3618–3625. IEEE, New York (2010) Huang, J.B., Yang, M.H.: Fast sparse representation with prototypes. In: CVPR, pp. 3618–3625. IEEE, New York (2010)
19.
go back to reference Jiang, H.: 20th International Conference on 3d human pose reconstruction using millions of exemplars. In: Pattern Recognition (ICPR), pp. 1674–1677 (2010). doi:10.1109/ICPR.2010.414 Jiang, H.: 20th International Conference on 3d human pose reconstruction using millions of exemplars. In: Pattern Recognition (ICPR), pp. 1674–1677 (2010). doi:10.​1109/​ICPR.​2010.​414
20.
go back to reference Lee, C.S., Elgammal, A.M.: Modeling view and posture manifolds for tracking. In: ICCV, pp. 1–8. IEEE, New York (2007) Lee, C.S., Elgammal, A.M.: Modeling view and posture manifolds for tracking. In: ICCV, pp. 1–8. IEEE, New York (2007)
21.
go back to reference Lee, H., Battle, A., Raina, R., Ng, A.Y.: Efficient sparse coding algorithms. In: NIPS, pp. 801–808. NIPS, Kolkata (2007) Lee, H., Battle, A., Raina, R., Ng, A.Y.: Efficient sparse coding algorithms. In: NIPS, pp. 801–808. NIPS, Kolkata (2007)
22.
go back to reference Lee, M.W., Nevatia, R.: Human pose tracking in monocular sequence using multilevel structured models. IEEE Trans. Pattern Anal. Mach. Intell. 31(1), 27–38 (2009). doi:10.1109/TPAMI.2008.35. Lee, M.W., Nevatia, R.: Human pose tracking in monocular sequence using multilevel structured models. IEEE Trans. Pattern Anal. Mach. Intell. 31(1), 27–38 (2009). doi:10.​1109/​TPAMI.​2008.​35.
24.
go back to reference Mori, G., Malik, J.: Recovering 3d human body configurations using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 28(7), 1052–1062 (2006)CrossRef Mori, G., Malik, J.: Recovering 3d human body configurations using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 28(7), 1052–1062 (2006)CrossRef
25.
go back to reference Olshausen, B.A., Field, D.J.: Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381, 607–609 (1996)CrossRef Olshausen, B.A., Field, D.J.: Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381, 607–609 (1996)CrossRef
26.
go back to reference Olshausen, B.A., Field, D.J.: Sparse coding with an overcomplete basis set: a strategy employed by v1? Vision Res. 37, 3311–3325 (1997)CrossRef Olshausen, B.A., Field, D.J.: Sparse coding with an overcomplete basis set: a strategy employed by v1? Vision Res. 37, 3311–3325 (1997)CrossRef
28.
go back to reference Rao, R.P.N., Olshausen, B.A., Lewicki, M.S.: Probabilistic models of the brain: perception and neural function. MIT Press, Cambridge (2002) Rao, R.P.N., Olshausen, B.A., Lewicki, M.S.: Probabilistic models of the brain: perception and neural function. MIT Press, Cambridge (2002)
30.
go back to reference Serre, T.: Learning a dictionary of shape-components in visual cortex: comparison with neurons, humans and machines. Mass. Inst. Technol. (2006) Serre, T.: Learning a dictionary of shape-components in visual cortex: comparison with neurons, humans and machines. Mass. Inst. Technol. (2006)
31.
go back to reference Shakhnarovich, G., Viola, P., Darrell, T.: Fast pose estimation with parameter-sensitive hashing. In: Proceedings of the Ninth IEEE International Conference on Computer Vision, vol. 2, ICCV ’03, pp. 750. IEEE Computer Society, Washington, DC (2003) Shakhnarovich, G., Viola, P., Darrell, T.: Fast pose estimation with parameter-sensitive hashing. In: Proceedings of the Ninth IEEE International Conference on Computer Vision, vol. 2, ICCV ’03, pp. 750. IEEE Computer Society, Washington, DC (2003)
32.
go back to reference Shang, L., Zhou, Y., Tao, L., Sun, Z.l.: Super-resolution restoration of mmw image using sparse representation based on couple dictionaries. In: Emerging Intelligent Computing Technology and Applications, pp. 286–291. Springer, Berlin (2012) Shang, L., Zhou, Y., Tao, L., Sun, Z.l.: Super-resolution restoration of mmw image using sparse representation based on couple dictionaries. In: Emerging Intelligent Computing Technology and Applications, pp. 286–291. Springer, Berlin (2012)
33.
go back to reference Tzimiropoulos, G., Zafeiriou, S., Pantic, M.: Sparse representations of image gradient orientations for visual recognition and tracking. In: Proceedings of IEEE International Conference Computer Vision and Pattern Recognition (CVPR-W11), Workshop on CVPR for Human Behaviour Analysis, pp. 26–33. Colorado Springs, USA (2011) Tzimiropoulos, G., Zafeiriou, S., Pantic, M.: Sparse representations of image gradient orientations for visual recognition and tracking. In: Proceedings of IEEE International Conference Computer Vision and Pattern Recognition (CVPR-W11), Workshop on CVPR for Human Behaviour Analysis, pp. 26–33. Colorado Springs, USA (2011)
34.
go back to reference Urtasun, R., Fleet, D.J., Hertzmann, A., Fua, P.: Priors for people tracking from small training sets. In: Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV) vol. 1, vol. 01, ICCV ’05, pp. 403–410. IEEE Computer Society, Washington, DC (2005) doi:10.1109/ICCV.2005.193 Urtasun, R., Fleet, D.J., Hertzmann, A., Fua, P.: Priors for people tracking from small training sets. In: Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV) vol. 1, vol. 01, ICCV ’05, pp. 403–410. IEEE Computer Society, Washington, DC (2005) doi:10.​1109/​ICCV.​2005.​193
35.
go back to reference Wright, J., Ma, Y., Mairal, J., Sapiro, G., Huang, T., Yan, S.: Sparse representation for computer vision and pattern recognition (2009) Wright, J., Ma, Y., Mairal, J., Sapiro, G., Huang, T., Yan, S.: Sparse representation for computer vision and pattern recognition (2009)
36.
go back to reference Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210–227 (2009). doi:10.1109/TPAMI.2008.79 Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210–227 (2009). doi:10.​1109/​TPAMI.​2008.​79
37.
go back to reference Yang, J., Wang, Z., Lin, Z., Cohen, S., Huang, T.: Coupled dictionary training for image super-resolution. IEEE Trans. Image Process. 21(8), 3467–3478 (2012)CrossRefMathSciNet Yang, J., Wang, Z., Lin, Z., Cohen, S., Huang, T.: Coupled dictionary training for image super-resolution. IEEE Trans. Image Process. 21(8), 3467–3478 (2012)CrossRefMathSciNet
Metadata
Title
3D human pose estimation from image using couple sparse coding
Authors
Mohammadreza Zolfaghari
Amin Jourabloo
Samira Ghareh Gozlou
Bahman Pedrood
Mohammad T. Manzuri-Shalmani
Publication date
01-08-2014
Publisher
Springer Berlin Heidelberg
Published in
Machine Vision and Applications / Issue 6/2014
Print ISSN: 0932-8092
Electronic ISSN: 1432-1769
DOI
https://doi.org/10.1007/s00138-014-0613-6

Other articles of this Issue 6/2014

Machine Vision and Applications 6/2014 Go to the issue

Premium Partner