Skip to main content

2016 | OriginalPaper | Buchkapitel

Shape from Selfies: Human Body Shape Estimation Using CCA Regression Forests

verfasst von : Endri Dibra, Cengiz Öztireli, Remo Ziegler, Markus Gross

Erschienen in: Computer Vision – ECCV 2016

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this work, we revise the problem of human body shape estimation from monocular imagery. Starting from a statistical human shape model that describes a body shape with shape parameters, we describe a novel approach to automatically estimate these parameters from a single input shape silhouette using semi-supervised learning. By utilizing silhouette features that encode local and global properties robust to noise, pose and view changes, and projecting them to lower dimensional spaces obtained through multi-view learning with canonical correlation analysis, we show how regression forests can be used to compute an accurate mapping from the silhouette to the shape parameter space. This results in a very fast, robust and automatic system under mild self-occlusion assumptions. We extensively evaluate our method on thousands of synthetic and real data and compare it to the state-of-art approaches that operate under more restrictive assumptions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
http://store.sae.org/caesar/.
 
Literatur
2.
Zurück zum Zitat de Aguiar, E., Stoll, C., Theobalt, C., Ahmed, N., Seidel, H.P., Thrun, S.: Performance capture from sparse multi-view video. In: SIGGRAPH (2008) de Aguiar, E., Stoll, C., Theobalt, C., Ahmed, N., Seidel, H.P., Thrun, S.: Performance capture from sparse multi-view video. In: SIGGRAPH (2008)
3.
Zurück zum Zitat Anguelov, D., Srinivasan, P., Koller, D., Thrun, S., Rodgers, J., Davis, J.: Scape: Shape completion and animation of people. In: SIGGRAPH (2005) Anguelov, D., Srinivasan, P., Koller, D., Thrun, S., Rodgers, J., Davis, J.: Scape: Shape completion and animation of people. In: SIGGRAPH (2005)
4.
Zurück zum Zitat Balan, A.O., Sigal, L., Black, M.J., Davis, J.E., Haussecker, H.W.: Detailed human shape and pose from images. In: CVPR (2007) Balan, A.O., Sigal, L., Black, M.J., Davis, J.E., Haussecker, H.W.: Detailed human shape and pose from images. In: CVPR (2007)
5.
Zurück zum Zitat Baran, I., Popovic, J.: Automatic rigging and animation of 3d characters. ACM Trans. Graph. 26, 1–8 (2007)CrossRef Baran, I., Popovic, J.: Automatic rigging and animation of 3d characters. ACM Trans. Graph. 26, 1–8 (2007)CrossRef
6.
Zurück zum Zitat Boisvert, J., Shu, C., Wuhrer, S., Xi, P.: Three-dimensional human shape inference from silhouettes: reconstruction and validation. Mach. Vis. Appl. 24, 145–157 (2013)CrossRef Boisvert, J., Shu, C., Wuhrer, S., Xi, P.: Three-dimensional human shape inference from silhouettes: reconstruction and validation. Mach. Vis. Appl. 24, 145–157 (2013)CrossRef
7.
Zurück zum Zitat Boykov, Y., Jolly, M.: Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images. In: ICCV (2001) Boykov, Y., Jolly, M.: Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images. In: ICCV (2001)
8.
Zurück zum Zitat Breiman, L.: Random forests. Mach. Learn. 26, 123–140 (2001)MATH Breiman, L.: Random forests. Mach. Learn. 26, 123–140 (2001)MATH
9.
Zurück zum Zitat Bălan, A.O., Black, M.J.: The naked truth: estimating body shape under clothing. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008. LNCS, vol. 5303, pp. 15–29. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88688-4_2 CrossRef Bălan, A.O., Black, M.J.: The naked truth: estimating body shape under clothing. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008. LNCS, vol. 5303, pp. 15–29. Springer, Heidelberg (2008). doi:10.​1007/​978-3-540-88688-4_​2 CrossRef
10.
Zurück zum Zitat Casas, D., Volino, M., Collomosse, J., Hilton, A.: 4d video textures for interactive character appearance. Comp. Graph. Forum(Proc. Eurographics) 33, 371–380 (2014)CrossRef Casas, D., Volino, M., Collomosse, J., Hilton, A.: 4d video textures for interactive character appearance. Comp. Graph. Forum(Proc. Eurographics) 33, 371–380 (2014)CrossRef
11.
Zurück zum Zitat Chen, X., Guo, Y., Zhou, B., Zhao, Q.: Deformable model for estimating clothed and naked human shapes from a single image. Vis. Comput. 29, 1187–1196 (2013)CrossRef Chen, X., Guo, Y., Zhou, B., Zhao, Q.: Deformable model for estimating clothed and naked human shapes from a single image. Vis. Comput. 29, 1187–1196 (2013)CrossRef
12.
Zurück zum Zitat Chen, Y., Cipolla, R.: Learning shape priors for single view reconstruction. In: ICCV Workshops (2009) Chen, Y., Cipolla, R.: Learning shape priors for single view reconstruction. In: ICCV Workshops (2009)
13.
Zurück zum Zitat Chen, Y., Kim, T.-K., Cipolla, R.: Inferring 3D shapes and deformations from single views. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6313, pp. 300–313. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15558-1_22 CrossRef Chen, Y., Kim, T.-K., Cipolla, R.: Inferring 3D shapes and deformations from single views. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6313, pp. 300–313. Springer, Heidelberg (2010). doi:10.​1007/​978-3-642-15558-1_​22 CrossRef
14.
Zurück zum Zitat Chen, Y., Kim, T., Cipolla, R.: Silhouette-based object phenotype recognition using 3d shape priors. In: ICCV (2011) Chen, Y., Kim, T., Cipolla, R.: Silhouette-based object phenotype recognition using 3d shape priors. In: ICCV (2011)
15.
Zurück zum Zitat Delamarre, Q., Faugeras, O.: 3d articulated models and multi-view tracking with silhouettes. In: ICCV (1999) Delamarre, Q., Faugeras, O.: 3d articulated models and multi-view tracking with silhouettes. In: ICCV (1999)
16.
Zurück zum Zitat Guan, L., Franco, J., Pollefeys, M.: Multi-object shape estimation and tracking from silhouette cues. In: CVPR (2008) Guan, L., Franco, J., Pollefeys, M.: Multi-object shape estimation and tracking from silhouette cues. In: CVPR (2008)
17.
Zurück zum Zitat Guan, P., Reiss, L., Hirshberg, D.A., Weiss, A., Black, M.J.: Drape: dressing any person. ACM Trans. Graph 31, 1–10 (2012)CrossRef Guan, P., Reiss, L., Hirshberg, D.A., Weiss, A., Black, M.J.: Drape: dressing any person. ACM Trans. Graph 31, 1–10 (2012)CrossRef
18.
Zurück zum Zitat Guan, P., Weiss, A., Balan, A.O., Black, M.J.: Estimating human shape and pose from a single image. In: ICCV (2009) Guan, P., Weiss, A., Balan, A.O., Black, M.J.: Estimating human shape and pose from a single image. In: ICCV (2009)
19.
Zurück zum Zitat Hardoon, D.R., Mourão Miranda, J., Brammer, M., Shawe-Taylor, J.: Unsupervised analysis of fmri data using kernel canonical correlation. NeuroImage 37, 1250–1259 (2007)CrossRef Hardoon, D.R., Mourão Miranda, J., Brammer, M., Shawe-Taylor, J.: Unsupervised analysis of fmri data using kernel canonical correlation. NeuroImage 37, 1250–1259 (2007)CrossRef
20.
Zurück zum Zitat Hardoon, D.R., Szedmak, S.R., Shawe-taylor, J.R.: Canonical correlation analysis: an overview with application to learning methods. Neural Comput. 16, 2639–2664 (2004)CrossRefMATH Hardoon, D.R., Szedmak, S.R., Shawe-taylor, J.R.: Canonical correlation analysis: an overview with application to learning methods. Neural Comput. 16, 2639–2664 (2004)CrossRefMATH
21.
Zurück zum Zitat Hasler, N., Ackermann, H., Rosenhahn, B., Thormählen, T., Seidel, H.: Multilinear pose and body shape estimation of dressed subjects from image sets. In: CVPR (2010) Hasler, N., Ackermann, H., Rosenhahn, B., Thormählen, T., Seidel, H.: Multilinear pose and body shape estimation of dressed subjects from image sets. In: CVPR (2010)
22.
Zurück zum Zitat Hasler, N., Stoll, C., Sunkel, M., Rosenhahn, B., Seidel, H.: A statistical model of human pose and body shape. Comput. Graph. Forum 28, 337–246 (2009)CrossRef Hasler, N., Stoll, C., Sunkel, M., Rosenhahn, B., Seidel, H.: A statistical model of human pose and body shape. Comput. Graph. Forum 28, 337–246 (2009)CrossRef
23.
Zurück zum Zitat Helten, T., Baak, A., Bharaj, G., Müller, M., Seidel, H., Theobalt, C.: Personalization and evaluation of a real-time depth-based full body tracker. In: 3DV (2013) Helten, T., Baak, A., Bharaj, G., Müller, M., Seidel, H., Theobalt, C.: Personalization and evaluation of a real-time depth-based full body tracker. In: 3DV (2013)
24.
Zurück zum Zitat Hotelling, H.: Relations between two sets of variates. Biometrika 28, 321–377 (1936)CrossRefMATH Hotelling, H.: Relations between two sets of variates. Biometrika 28, 321–377 (1936)CrossRefMATH
25.
26.
Zurück zum Zitat Kakade, S.M., Foster, D.P.: Multi-view regression via canonical correlation analysis. In: Bshouty, N.H., Gentile, C. (eds.) COLT 2007. Lecture Notes in Artificial Intelligence (LNAI), vol. 4539, pp. 82–96. Springer, Heidelberg (2007). doi:10.1007/978-3-540-72927-3_8 CrossRef Kakade, S.M., Foster, D.P.: Multi-view regression via canonical correlation analysis. In: Bshouty, N.H., Gentile, C. (eds.) COLT 2007. Lecture Notes in Artificial Intelligence (LNAI), vol. 4539, pp. 82–96. Springer, Heidelberg (2007). doi:10.​1007/​978-3-540-72927-3_​8 CrossRef
27.
Zurück zum Zitat Kakadiaris, I.A., Metaxas, D.: Three-dimensional human body model acquisition from multiple views. IJCV 30, 191–218 (1998)CrossRef Kakadiaris, I.A., Metaxas, D.: Three-dimensional human body model acquisition from multiple views. IJCV 30, 191–218 (1998)CrossRef
28.
Zurück zum Zitat Kim, T.K., Wong, S.F., Cipolla, R.: Tensor canonical correlation analysis for action classification. In: CVPR (2007) Kim, T.K., Wong, S.F., Cipolla, R.: Tensor canonical correlation analysis for action classification. In: CVPR (2007)
29.
Zurück zum Zitat Lahner, Z., Rodola, E., Schmidt, F.R., Bronstein, M.M., Cremers, D.: Efficient globally optimal 2d-to-3d deformable shape matching. In: CVPR (2016) Lahner, Z., Rodola, E., Schmidt, F.R., Bronstein, M.M., Cremers, D.: Efficient globally optimal 2d-to-3d deformable shape matching. In: CVPR (2016)
30.
Zurück zum Zitat Laurentini, A.: The visual hull concept for silhouette-based image understanding. PAMI 16, 150–162 (1994)CrossRef Laurentini, A.: The visual hull concept for silhouette-based image understanding. PAMI 16, 150–162 (1994)CrossRef
31.
Zurück zum Zitat Lewis, J.P., Cordner, M., Fong, N.: Pose space deformation: a unified approach to shape interpolation and skeleton-driven deformation. In: SIGGRAPH (2000) Lewis, J.P., Cordner, M., Fong, N.: Pose space deformation: a unified approach to shape interpolation and skeleton-driven deformation. In: SIGGRAPH (2000)
32.
Zurück zum Zitat Ling, H., Jacobs, D.W.: Shape classification using the inner-distance. PAMI 29, 286–299 (2007)CrossRef Ling, H., Jacobs, D.W.: Shape classification using the inner-distance. PAMI 29, 286–299 (2007)CrossRef
33.
Zurück zum Zitat McWilliams, B., Balduzzi, D., Buhmann, J.M.: Correlated random features for fast semi-supervised learning. In: NIPS (2013) McWilliams, B., Balduzzi, D., Buhmann, J.M.: Correlated random features for fast semi-supervised learning. In: NIPS (2013)
34.
Zurück zum Zitat Mikic, I., Trivedi, M., Hunter, E., Cosman, P.: Human body model acquisition and tracking using voxel data. IJCV 53, 199–223 (2003)CrossRefMATH Mikic, I., Trivedi, M., Hunter, E., Cosman, P.: Human body model acquisition and tracking using voxel data. IJCV 53, 199–223 (2003)CrossRefMATH
35.
Zurück zum Zitat Neophytou, A., Hilton, A.: Shape and pose space deformation for subject specific animation. In: 3DV (2013) Neophytou, A., Hilton, A.: Shape and pose space deformation for subject specific animation. In: 3DV (2013)
36.
Zurück zum Zitat Neophytou, A., Hilton, A.: A layered model of human body and garment deformation. In: 3DV (2014) Neophytou, A., Hilton, A.: A layered model of human body and garment deformation. In: 3DV (2014)
37.
Zurück zum Zitat Perbet, F., Johnson, S., Pham, M.T., Stenger, B.: Human body shape estimation using a multi-resolution manifold forest. In: CVPR (2014) Perbet, F., Johnson, S., Pham, M.T., Stenger, B.: Human body shape estimation using a multi-resolution manifold forest. In: CVPR (2014)
38.
Zurück zum Zitat Pishchulin, L., Wuhrer, S., Helten, T., Theobalt, C., Schiele, B.: Building statistical shape spaces for 3d human modeling. CoRR (2015) Pishchulin, L., Wuhrer, S., Helten, T., Theobalt, C., Schiele, B.: Building statistical shape spaces for 3d human modeling. CoRR (2015)
39.
Zurück zum Zitat Robinette, K.M., Daanen, H.A.M.: The caesar project: a 3-d surface anthropometry survey. In: 3DIM (1999) Robinette, K.M., Daanen, H.A.M.: The caesar project: a 3-d surface anthropometry survey. In: 3DIM (1999)
40.
Zurück zum Zitat Rogge, L., Klose, F., Stengel, M., Eisemann, M., Magnor, M.: Garment replacement in monocular video sequences. ACM Trans. Graph. 34, 1–10 (2014)CrossRef Rogge, L., Klose, F., Stengel, M., Eisemann, M., Magnor, M.: Garment replacement in monocular video sequences. ACM Trans. Graph. 34, 1–10 (2014)CrossRef
41.
Zurück zum Zitat Sargin, M.E., Yemez, Y., Erzin, E., Tekalp, A.M.: Audiovisual synchronization and fusion using canonical correlation analysis. Trans. Multimedia 9, 1396–1403 (2007)CrossRef Sargin, M.E., Yemez, Y., Erzin, E., Tekalp, A.M.: Audiovisual synchronization and fusion using canonical correlation analysis. Trans. Multimedia 9, 1396–1403 (2007)CrossRef
42.
Zurück zum Zitat Schmidt, F.R., Farin, D., Cremers, D.: Fast matching of planar shapes in sub-cubic runtime. In: ICCV (2007) Schmidt, F.R., Farin, D., Cremers, D.: Fast matching of planar shapes in sub-cubic runtime. In: ICCV (2007)
43.
Zurück zum Zitat Schmidt, F.R., Töppe, E., Cremers, D.: Efficient planar graph cuts with applications in computer vision. In: CVPR (2009) Schmidt, F.R., Töppe, E., Cremers, D.: Efficient planar graph cuts with applications in computer vision. In: CVPR (2009)
44.
Zurück zum Zitat Shapira, L., Shamir, A., Cohen-Or, D.: Consistent mesh partitioning and skeletonisation using the shape diameter function. Visual Comput. 24, 249–259 (2008)CrossRef Shapira, L., Shamir, A., Cohen-Or, D.: Consistent mesh partitioning and skeletonisation using the shape diameter function. Visual Comput. 24, 249–259 (2008)CrossRef
45.
Zurück zum Zitat Sharma, A., Kumar, A., Daume III, H., Jacobs, D.W.: Generalized multiview analysis: a discriminative latent space. In: CVPR (2012) Sharma, A., Kumar, A., Daume III, H., Jacobs, D.W.: Generalized multiview analysis: a discriminative latent space. In: CVPR (2012)
46.
Zurück zum Zitat Sigal, L., Balan, A.O., Black, M.J.: Combined discriminative and generative articulated pose and non-rigid shape estimation. In: NIPS (2007) Sigal, L., Balan, A.O., Black, M.J.: Combined discriminative and generative articulated pose and non-rigid shape estimation. In: NIPS (2007)
47.
Zurück zum Zitat Slama, R., Wannous, H., Daoudi, M.: Extremal human curves: a new human body shape and pose descriptor. In: FG (2013) Slama, R., Wannous, H., Daoudi, M.: Extremal human curves: a new human body shape and pose descriptor. In: FG (2013)
48.
Zurück zum Zitat Starck, J., Miller, G., Hilton, A.: Video-based character animation. In: ACM SIGGRAPH Eurographics SCA (2005) Starck, J., Miller, G., Hilton, A.: Video-based character animation. In: ACM SIGGRAPH Eurographics SCA (2005)
49.
Zurück zum Zitat Stoll, C., Gall, J., de Aguiar, E., Thrun, S., Theobalt, C.: Video-based reconstruction of animatable human characters. In: SIGGRAPH Asia (2010) Stoll, C., Gall, J., de Aguiar, E., Thrun, S., Theobalt, C.: Video-based reconstruction of animatable human characters. In: SIGGRAPH Asia (2010)
50.
Zurück zum Zitat Weiss, A., Hirshberg, D.A., Black, M.J.: Home 3d body scans from noisy image and range data. In: ICCV (2011) Weiss, A., Hirshberg, D.A., Black, M.J.: Home 3d body scans from noisy image and range data. In: ICCV (2011)
51.
Zurück zum Zitat Wuhrer, S., Pishchulin, L., Brunton, A., Shu, C., Lang, J.: Estimation of human body shape and posture under clothing. CVIU 127, 31–42 (2014) Wuhrer, S., Pishchulin, L., Brunton, A., Shu, C., Lang, J.: Estimation of human body shape and posture under clothing. CVIU 127, 31–42 (2014)
52.
Zurück zum Zitat Xi, P., Lee, W., Shu, C.: A data-driven approach to human-body cloning using a segmented body database. In: Pacific Graphics (2007) Xi, P., Lee, W., Shu, C.: A data-driven approach to human-body cloning using a segmented body database. In: Pacific Graphics (2007)
53.
Zurück zum Zitat Xu, F., Liu, Y., Stoll, C., Tompkin, J., Bharaj, G., Dai, Q., Seidel, H.P., Kautz, J., Theobalt, C.: Video-based characters: Creating new human performances from a multi-view video database. In: SIGGRAPH (2011) Xu, F., Liu, Y., Stoll, C., Tompkin, J., Bharaj, G., Dai, Q., Seidel, H.P., Kautz, J., Theobalt, C.: Video-based characters: Creating new human performances from a multi-view video database. In: SIGGRAPH (2011)
54.
Zurück zum Zitat Yang, Y., Yu, Y., Zhou, Y., Du, S., Davis, J., Yang, R.: Semantic parametric reshaping of human body models. In: 3DV (2014) Yang, Y., Yu, Y., Zhou, Y., Du, S., Davis, J., Yang, R.: Semantic parametric reshaping of human body models. In: 3DV (2014)
55.
Zurück zum Zitat Ye, M., Yang, R.: Real-time simultaneous pose and shape estimation for articulated objects using a single depth camera. In: CVPR (2014) Ye, M., Yang, R.: Real-time simultaneous pose and shape estimation for articulated objects using a single depth camera. In: CVPR (2014)
Metadaten
Titel
Shape from Selfies: Human Body Shape Estimation Using CCA Regression Forests
verfasst von
Endri Dibra
Cengiz Öztireli
Remo Ziegler
Markus Gross
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46493-0_6