Skip to main content

2016 | OriginalPaper | Buchkapitel

Symmetric Non-rigid Structure from Motion for Category-Specific Object Structure Estimation

verfasst von : Yuan Gao, Alan L. Yuille

Erschienen in: Computer Vision – ECCV 2016

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Many objects, especially these made by humans, are symmetric, e.g. cars and aeroplanes. This paper addresses the estimation of 3D structures of symmetric objects from multiple images of the same object category, e.g. different cars, seen from various viewpoints. We assume that the deformation between different instances from the same object category is non-rigid and symmetric. In this paper, we extend two leading non-rigid structure from motion (SfM) algorithms to exploit symmetry constraints. We model the both methods as energy minimization, in which we also recover the missing observations caused by occlusions. In particularly, we show that by rotating the coordinate system, the energy can be decoupled into two independent terms, which still exploit symmetry, to apply matrix factorization separately on each of them for initialization. The results on the Pascal3D+ dataset show that our methods significantly improve performance over baseline methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
However, the general framework in [33] cannot be used to SfM directly, because they did not constrain that all the keypoints have the same translation.
 
2
We assume symmetric deformations because our problem involves deformations from one symmetric object to another. But it also can be extended to non-symmetric deformations straightforwardly.
 
3
We set hard constraints on \(\mathbb {\bar{S}}\) and \(\mathbb {\bar{S}}^{\dag }\), i.e. replace \(\mathbb {\bar{S}}^{\dag }\) by \(\mathcal {A}_P \mathbb {\bar{S}}\) in Eq. (5), because it can be guaranteed by the Sym-RSfM initialization in our companion paper [26]. While the initialization on \(\mathbf {V}\) and \(\mathbf {V}^{\dag }\) by PCA cannot guarantee such a desirable property, thus a Language multiplier term is used for the constraint on \(\mathbf {V}\) and \(\mathbf {V}^{\dag }\) in Eq. (9).
 
Literatur
1.
Zurück zum Zitat Tomasi, C., Kanade, T.: Shape and motion from image streams under orthography: a factorization method. Int. J. Comput. Vis. 9(2), 137–154 (1992)CrossRef Tomasi, C., Kanade, T.: Shape and motion from image streams under orthography: a factorization method. Int. J. Comput. Vis. 9(2), 137–154 (1992)CrossRef
2.
Zurück zum Zitat Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, Cambridge (2004)CrossRefMATH Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, Cambridge (2004)CrossRefMATH
3.
Zurück zum Zitat Torresani, L., Hertzmann, A., Bregler, C.: Learning non-rigid 3D shape from 2D motion. In: NIPS (2003) Torresani, L., Hertzmann, A., Bregler, C.: Learning non-rigid 3D shape from 2D motion. In: NIPS (2003)
4.
Zurück zum Zitat Xiao, J., Chai, J., Kanade, T.: A closed-form solution to non-rigid shape and motion recovery. In: Pajdla, T., Matas, J.G. (eds.) ECCV 2004. LNCS, vol. 3024, pp. 573–587. Springer, Heidelberg (2004)CrossRef Xiao, J., Chai, J., Kanade, T.: A closed-form solution to non-rigid shape and motion recovery. In: Pajdla, T., Matas, J.G. (eds.) ECCV 2004. LNCS, vol. 3024, pp. 573–587. Springer, Heidelberg (2004)CrossRef
5.
Zurück zum Zitat Torresani, L., Hertzmann, A., Bregler, C.: Nonrigid structure-from-motion: estimating shape and motion with hierarchical priors. IEEE Trans. Pattern Anal. Mach. Intell. 30, 878–892 (2008)CrossRef Torresani, L., Hertzmann, A., Bregler, C.: Nonrigid structure-from-motion: estimating shape and motion with hierarchical priors. IEEE Trans. Pattern Anal. Mach. Intell. 30, 878–892 (2008)CrossRef
6.
Zurück zum Zitat Akhter, I., Sheikh, Y., Khan, S., Kanade, T.: Trajectory space: a dual representation for nonrigid structure from motion. IEEE Trans. Pattern Anal. Mach. Intell. 33(7), 1442–1456 (2011)CrossRef Akhter, I., Sheikh, Y., Khan, S., Kanade, T.: Trajectory space: a dual representation for nonrigid structure from motion. IEEE Trans. Pattern Anal. Mach. Intell. 33(7), 1442–1456 (2011)CrossRef
7.
Zurück zum Zitat Gotardo, P., Martinez, A.: Computing smooth timetrajectories for camera and deformable shape in structure from motion with occlusion. IEEE Trans. Pattern Anal. Mach. Intell. 33, 2051–2065 (2011)CrossRef Gotardo, P., Martinez, A.: Computing smooth timetrajectories for camera and deformable shape in structure from motion with occlusion. IEEE Trans. Pattern Anal. Mach. Intell. 33, 2051–2065 (2011)CrossRef
8.
Zurück zum Zitat Hamsici, O.C., Gotardo, P.F.U., Martinez, A.M.: Learning spatially-smooth mappings in non-rigid structure from motion. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part IV. LNCS, vol. 7575, pp. 260–273. Springer, Heidelberg (2012) Hamsici, O.C., Gotardo, P.F.U., Martinez, A.M.: Learning spatially-smooth mappings in non-rigid structure from motion. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part IV. LNCS, vol. 7575, pp. 260–273. Springer, Heidelberg (2012)
9.
Zurück zum Zitat Dai, Y., Li, H., He, M.: A simple prior-free method for non-rigid structure-from-motion factorization. In: CVPR (2012) Dai, Y., Li, H., He, M.: A simple prior-free method for non-rigid structure-from-motion factorization. In: CVPR (2012)
10.
Zurück zum Zitat Dai, Y., Li, H., He, M.: A simple prior-free method for non-rigid structure-from-motion factorization. Int. J. Comput. Vis. 107, 101–122 (2014)MathSciNetCrossRefMATH Dai, Y., Li, H., He, M.: A simple prior-free method for non-rigid structure-from-motion factorization. Int. J. Comput. Vis. 107, 101–122 (2014)MathSciNetCrossRefMATH
11.
Zurück zum Zitat Ma, J., Zhao, J., Ma, Y., Tian, J.: Non-rigid visible and infrared face registration via regularized gaussian fields criterion. Pattern Recogn. 48(3), 772–784 (2015)CrossRef Ma, J., Zhao, J., Ma, Y., Tian, J.: Non-rigid visible and infrared face registration via regularized gaussian fields criterion. Pattern Recogn. 48(3), 772–784 (2015)CrossRef
12.
Zurück zum Zitat Ma, J., Zhao, J., Tian, J., Tu, Z., Yuille, A.L.: Robust estimation of nonrigid transformation for point set registration. In: CVPR (2013) Ma, J., Zhao, J., Tian, J., Tu, Z., Yuille, A.L.: Robust estimation of nonrigid transformation for point set registration. In: CVPR (2013)
13.
Zurück zum Zitat Ma, J., Zhao, J., Tian, J., Bai, X., Tu, Z.: Regularized vector field learning with sparse approximation for mismatch removal. Pattern Recogn. 46(12), 3519–3532 (2013)CrossRefMATH Ma, J., Zhao, J., Tian, J., Bai, X., Tu, Z.: Regularized vector field learning with sparse approximation for mismatch removal. Pattern Recogn. 46(12), 3519–3532 (2013)CrossRefMATH
14.
Zurück zum Zitat Agudo, A., Agapito, L., Calvo, B., Montiel, J.: Good vibrations: a modal analysis approach for sequential non-rigid structure from motion. In: CVPR, pp. 1558–1565 (2014) Agudo, A., Agapito, L., Calvo, B., Montiel, J.: Good vibrations: a modal analysis approach for sequential non-rigid structure from motion. In: CVPR, pp. 1558–1565 (2014)
15.
Zurück zum Zitat Rosen, J.: Symmetry Discovered: Concepts and Applications in Nature and Science. Dover Publications, Mineola (2011)MATH Rosen, J.: Symmetry Discovered: Concepts and Applications in Nature and Science. Dover Publications, Mineola (2011)MATH
16.
Zurück zum Zitat Hong, W., Yang, A.Y., Huang, K., Ma, Y.: On symmetry and multiple-view geometry: structure, pose, and calibration from a single image. Int. J. Comput. Vis. 60, 241–265 (2004)CrossRef Hong, W., Yang, A.Y., Huang, K., Ma, Y.: On symmetry and multiple-view geometry: structure, pose, and calibration from a single image. Int. J. Comput. Vis. 60, 241–265 (2004)CrossRef
17.
Zurück zum Zitat Gordon, G.G.: Shape from symmetry. In: Proceedings of SPIE (1990) Gordon, G.G.: Shape from symmetry. In: Proceedings of SPIE (1990)
18.
Zurück zum Zitat Kontsevich, L.L.: Pairwise comparison technique: a simple solution for depth reconstruction. JOSA A 10(6), 1129–1135 (1993)CrossRef Kontsevich, L.L.: Pairwise comparison technique: a simple solution for depth reconstruction. JOSA A 10(6), 1129–1135 (1993)CrossRef
19.
Zurück zum Zitat Vetter, T., Poggio, T.: Symmetric 3D objects are an easy case for 2D object recognition. Spat. Vis. 8, 443–453 (1994)CrossRef Vetter, T., Poggio, T.: Symmetric 3D objects are an easy case for 2D object recognition. Spat. Vis. 8, 443–453 (1994)CrossRef
20.
Zurück zum Zitat Mukherjee, D.P., Zisserman, A., Brady, M.: Shape from symmetry: detecting and exploiting symmetry in affine images. Philos. Trans. Phys. Sci. Eng. 351, 77–106 (1995)CrossRefMATH Mukherjee, D.P., Zisserman, A., Brady, M.: Shape from symmetry: detecting and exploiting symmetry in affine images. Philos. Trans. Phys. Sci. Eng. 351, 77–106 (1995)CrossRefMATH
21.
Zurück zum Zitat Thrun, S., Wegbreit, B.: Shape from symmetry. In: ICCV (2005) Thrun, S., Wegbreit, B.: Shape from symmetry. In: ICCV (2005)
22.
Zurück zum Zitat Li, Y., Pizlo, Z.: Reconstruction of shapes of 3D symmetric objects by using planarity and compactness constraints. In: Proceedings of SPIE-IS&T Electronic Imaging (2007) Li, Y., Pizlo, Z.: Reconstruction of shapes of 3D symmetric objects by using planarity and compactness constraints. In: Proceedings of SPIE-IS&T Electronic Imaging (2007)
23.
Zurück zum Zitat Vicente, S., Carreira, J., Agapito, L., Batista, J.: Reconstructing PASCAL VOC. In: CVPR (2014) Vicente, S., Carreira, J., Agapito, L., Batista, J.: Reconstructing PASCAL VOC. In: CVPR (2014)
24.
Zurück zum Zitat Kar, A., Tulsiani, S., Carreira, J., Malik, J.: Category-specific object reconstruction from a single image. In: CVPR (2015) Kar, A., Tulsiani, S., Carreira, J., Malik, J.: Category-specific object reconstruction from a single image. In: CVPR (2015)
25.
Zurück zum Zitat Akhter, I., Sheikh, Y., Khan, S.: In defense of orthonormality constraints for nonrigid structure from motion. In: CVPR (2009) Akhter, I., Sheikh, Y., Khan, S.: In defense of orthonormality constraints for nonrigid structure from motion. In: CVPR (2009)
26.
Zurück zum Zitat Gao, Y., Yuille, A.L.: Exploiting symmetry and/or Manhattan properties for 3D object structure estimation from single and multiple images (2016). arXiv preprint arXiv:1607.07129 Gao, Y., Yuille, A.L.: Exploiting symmetry and/or Manhattan properties for 3D object structure estimation from single and multiple images (2016). arXiv preprint arXiv:​1607.​07129
27.
Zurück zum Zitat Xiang, Y., Mottaghi, R., Savarese, S.: Beyond PASCAL: a benchmark for 3D object detection in the wild. In: WACV (2014) Xiang, Y., Mottaghi, R., Savarese, S.: Beyond PASCAL: a benchmark for 3D object detection in the wild. In: WACV (2014)
28.
Zurück zum Zitat Grossmann, E., Santos-Victor, J.: Maximum likehood 3D reconstruction from one or more images under geometric constraints. In: BMVC (2002) Grossmann, E., Santos-Victor, J.: Maximum likehood 3D reconstruction from one or more images under geometric constraints. In: BMVC (2002)
29.
Zurück zum Zitat Grossmann, E., Santos-Victor, J.: Least-squares 3D reconstruction from one or more views and geometric clues. Comput. Vis. Image Underst. 99(2), 151–174 (2005)CrossRef Grossmann, E., Santos-Victor, J.: Least-squares 3D reconstruction from one or more views and geometric clues. Comput. Vis. Image Underst. 99(2), 151–174 (2005)CrossRef
30.
Zurück zum Zitat Grossmann, E., Ortin, D., Santos-Victor, J.: Single and multi-view reconstruction of structured scenes. In: ACCV (2002) Grossmann, E., Ortin, D., Santos-Victor, J.: Single and multi-view reconstruction of structured scenes. In: ACCV (2002)
31.
Zurück zum Zitat Kontsevich, L.L., Kontsevich, M.L., Shen, A.K.: Two algorithms for reconstructing shapes. Optoelectron. Instrum. Data Process. 5, 76–81 (1987) Kontsevich, L.L., Kontsevich, M.L., Shen, A.K.: Two algorithms for reconstructing shapes. Optoelectron. Instrum. Data Process. 5, 76–81 (1987)
32.
Zurück zum Zitat Bregler, C., Hertzmann, A., Biermann, H.: Recovering non-rigid 3D shape from image streams. In: CVPR (2000) Bregler, C., Hertzmann, A., Biermann, H.: Recovering non-rigid 3D shape from image streams. In: CVPR (2000)
33.
Zurück zum Zitat Hong, J.H., Fitzgibbon, A.: Secrets of matrix factorization: approximations, numerics, manifold optimization and random restarts. In: ICCV (2015) Hong, J.H., Fitzgibbon, A.: Secrets of matrix factorization: approximations, numerics, manifold optimization and random restarts. In: ICCV (2015)
34.
Zurück zum Zitat Olsen, S.I., Bartoli, A.: Implicit non-rigid structure-from-motion with priors. J. Math. Imaging Vis. 31(2–3), 233–244 (2008)MathSciNetCrossRef Olsen, S.I., Bartoli, A.: Implicit non-rigid structure-from-motion with priors. J. Math. Imaging Vis. 31(2–3), 233–244 (2008)MathSciNetCrossRef
35.
Zurück zum Zitat Akhter, I., Sheikh, Y., Khan, S., Kanade, T.: Nonrigid structure from motion in trajectory space. In: NIPS (2008) Akhter, I., Sheikh, Y., Khan, S., Kanade, T.: Nonrigid structure from motion in trajectory space. In: NIPS (2008)
36.
Zurück zum Zitat Ceylan, D., Mitra, N.J., Zheng, Y., Pauly, M.: Coupled structure-from-motion and 3D symmetry detection for urban facades. ACM Trans. Graph. 33, 2 (2014)CrossRefMATH Ceylan, D., Mitra, N.J., Zheng, Y., Pauly, M.: Coupled structure-from-motion and 3D symmetry detection for urban facades. ACM Trans. Graph. 33, 2 (2014)CrossRefMATH
37.
Zurück zum Zitat Morris, D.D., Kanatani, K., Kanade, T.: Gauge fixing for accurate 3D estimation. In: CVPR (2001) Morris, D.D., Kanatani, K., Kanade, T.: Gauge fixing for accurate 3D estimation. In: CVPR (2001)
38.
Zurück zum Zitat Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)MATH Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)MATH
39.
Zurück zum Zitat Bourdev, L., Maji, S., Brox, T., Malik, J.: Detecting people using mutually consistent poselet activations. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part VI. LNCS, vol. 6316, pp. 168–181. Springer, Heidelberg (2010)CrossRef Bourdev, L., Maji, S., Brox, T., Malik, J.: Detecting people using mutually consistent poselet activations. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part VI. LNCS, vol. 6316, pp. 168–181. Springer, Heidelberg (2010)CrossRef
41.
Zurück zum Zitat Chen, X., Yuille, A.L.: Articulated pose estimation by a graphical model with image dependent pairwise relations. In: NIPS, pp. 1736–1744 (2014) Chen, X., Yuille, A.L.: Articulated pose estimation by a graphical model with image dependent pairwise relations. In: NIPS, pp. 1736–1744 (2014)
Metadaten
Titel
Symmetric Non-rigid Structure from Motion for Category-Specific Object Structure Estimation
verfasst von
Yuan Gao
Alan L. Yuille
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46475-6_26