Skip to main content
Top
Published in: International Journal of Computer Vision 2/2017

05-12-2016

Combining Local-Physical and Global-Statistical Models for Sequential Deformable Shape from Motion

Authors: Antonio Agudo, Francesc Moreno-Noguer

Published in: International Journal of Computer Vision | Issue 2/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, we simultaneously estimate camera pose and non-rigid 3D shape from a monocular video, using a sequential solution that combines local and global representations. We model the object as an ensemble of particles, each ruled by the linear equation of the Newton’s second law of motion. This dynamic model is incorporated into a bundle adjustment framework, in combination with simple regularization components that ensure temporal and spatial consistency. The resulting approach allows to sequentially estimate shape and camera poses, while progressively learning a global low-rank model of the shape that is fed back into the optimization scheme, introducing thus, global constraints. The overall combination of local (physical) and global (statistical) constraints yields a solution that is both efficient and robust to several artifacts such as noisy and missing data or sudden camera motions, without requiring any training data at all. Validation is done in a variety of real application domains, including articulated and non-rigid motion, both for continuous and discontinuous shapes. Our on-line methodology yields significantly more accurate reconstructions than competing sequential approaches, being even comparable to the more computationally demanding batch methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Footnotes
1
A third-order backward model to code the displacement vector can be expressed by considering 4-time instances as \( \mathbf {f}_{i}^t \approx m_{i} \left[ \frac{ -\mathbf {y}^{t-3}_{i}\,+\,4\mathbf {y}^{t-2}_{i} -\,5\mathbf {y}^{t-1}_{i}\,+\, 2\mathbf {y}^{t}_{i}}{(\Delta t)^{2}} \right] \).
 
2
\(\frac{[\text {force}]}{[\text {mass}][\text {time}]^{-2}}=\frac{[\text {mass}][\text {length}][\text {time}]^{-2}}{[\text {mass}][\text {time}]^{-2}}={{{[\text {length}]}}}\)
 
3
Note that although \(\mathbf {R}^{j}\) and \(\mathbf {t}^j\) for \(j=\{t-2,t-1\}\) are allowed to change while optimizing the pose and shape at frame t, their value is not propagated back in time. That is, our approach remains purely sequential.
 
4
The computational complexity of the product \(\mathbf {A}^\top \mathbf {A}\), where \(\mathbf {A}\) is a sparse \(m\times n\) matrix with \(n_{nz}\) non-zero elements is \(\mathcal {O}(n_{nz}+m+n)\), that is, it depends linearly on \(n_{nz}\), the row size m and column size n of the matrix, but is independent of the product mn. See: http://​es.​mathworks.​com/​help/​matlab/​math/​sparse-matrix-operations.​html#f6-13058.
 
Literature
go back to reference Agudo, A., & Moreno-Noguer, F. (2015). Simultaneous pose and non-rigid shape with particle dynamics. In Conference on computer vision and pattern recognition, pp. 2179–2187 Agudo, A., & Moreno-Noguer, F. (2015). Simultaneous pose and non-rigid shape with particle dynamics. In Conference on computer vision and pattern recognition, pp. 2179–2187
go back to reference Agudo, A., Calvo, B., Montiel, & J. M. M. (2012). Finite element based sequential Bayesian non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 1418–1425 Agudo, A., Calvo, B., Montiel, & J. M. M. (2012). Finite element based sequential Bayesian non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 1418–1425
go back to reference Agudo, A., Agapito, L., Calvo, B., & Montiel, J. M. M. (2014a). Good vibrations: A modal analysis approach for sequential non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 1558–1565 Agudo, A., Agapito, L., Calvo, B., & Montiel, J. M. M. (2014a). Good vibrations: A modal analysis approach for sequential non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 1558–1565
go back to reference Agudo, A., Montiel, J. M. M., Agapito, L., & Calvo, B. (2014b). Online dense non-rigid 3D shape and camera motion recovery. In British machine vision conference Agudo, A., Montiel, J. M. M., Agapito, L., & Calvo, B. (2014b). Online dense non-rigid 3D shape and camera motion recovery. In British machine vision conference
go back to reference Agudo, A., Moreno-Noguer, F., Calvo, B., & Montiel, J. M. M. (2016). Sequential non-rigid structure from motion using physical priors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(5), 979–994.CrossRef Agudo, A., Moreno-Noguer, F., Calvo, B., & Montiel, J. M. M. (2016). Sequential non-rigid structure from motion using physical priors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(5), 979–994.CrossRef
go back to reference Akhter, I., Sheikh, Y., Khan, S., & Kanade, T. (2008). Non-rigid structure from motion in trajectory space. In Neural information processing systems, pp. 41–48 Akhter, I., Sheikh, Y., Khan, S., & Kanade, T. (2008). Non-rigid structure from motion in trajectory space. In Neural information processing systems, pp. 41–48
go back to reference Baraff, D. (1989). Analytical methods for dynamic simulation of non-penetrating rigid bodies. In Conference on computer graphics and interactive techniques, pp. 223–232 Baraff, D. (1989). Analytical methods for dynamic simulation of non-penetrating rigid bodies. In Conference on computer graphics and interactive techniques, pp. 223–232
go back to reference Bartoli, A., Gay-Bellile, V., Castellani, U., Peyras, J., Olsen, S., & Sayd, P. (2008). Coarse-to-fine low-rank structure-from-motion. In Conference on computer vision and pattern recognition, pp. 1–8 Bartoli, A., Gay-Bellile, V., Castellani, U., Peyras, J., Olsen, S., & Sayd, P. (2008). Coarse-to-fine low-rank structure-from-motion. In Conference on computer vision and pattern recognition, pp. 1–8
go back to reference Brand, M. (2001). Morphable 3D models from video. In Conference on computer vision and pattern recognition, pp. 456–463 Brand, M. (2001). Morphable 3D models from video. In Conference on computer vision and pattern recognition, pp. 456–463
go back to reference Bregler, C., Hertzmann, A., & Biermann, H. (2000). Recovering non-rigid 3D shape from image streams. In Conference on computer vision and pattern recognition, pp. 690–696 Bregler, C., Hertzmann, A., & Biermann, H. (2000). Recovering non-rigid 3D shape from image streams. In Conference on computer vision and pattern recognition, pp. 690–696
go back to reference Brubaker, M., Sigal, L., & Fleet, D. (2009). Estimating contact dynamics. In International conference on computer vision, pp. 2389–2396 Brubaker, M., Sigal, L., & Fleet, D. (2009). Estimating contact dynamics. In International conference on computer vision, pp. 2389–2396
go back to reference Chhatkuli, A., Pizarro, D., & Bartoli, A. (2014). Non-rigid shape-from-motion for isometric surfaces using infinitesimal planarity. In British machine vision conference Chhatkuli, A., Pizarro, D., & Bartoli, A. (2014). Non-rigid shape-from-motion for isometric surfaces using infinitesimal planarity. In British machine vision conference
go back to reference Dai, Y., Li, H., & He, M. (2012). A simple prior-free method for non-rigid structure from motion factorization. In Conference on computer vision and pattern recognition, pp. 2018–2025 Dai, Y., Li, H., & He, M. (2012). A simple prior-free method for non-rigid structure from motion factorization. In Conference on computer vision and pattern recognition, pp. 2018–2025
go back to reference Del Bue, A., Llado, X., & Agapito, L. (2006). Non-rigid metric shape and motion recovery from uncalibrated images using priors. In Conference on computer vision and pattern recognition, pp. 1191–1198 Del Bue, A., Llado, X., & Agapito, L. (2006). Non-rigid metric shape and motion recovery from uncalibrated images using priors. In Conference on computer vision and pattern recognition, pp. 1191–1198
go back to reference Fayad, J., Agapito, L., & Del Bue, A. (2010). Piecewise quadratic reconstruction of non-rigid surfaces from monocular sequences. In European conference on computer vision, pp. 297–310 Fayad, J., Agapito, L., & Del Bue, A. (2010). Piecewise quadratic reconstruction of non-rigid surfaces from monocular sequences. In European conference on computer vision, pp. 297–310
go back to reference Garg, R., Roussos, A., & Agapito, L. (2013). Dense variational reconstruction of non-rigid surfaces from monocular video. In Conference on computer vision and pattern recognition, pp. 1272–1279 Garg, R., Roussos, A., & Agapito, L. (2013). Dense variational reconstruction of non-rigid surfaces from monocular video. In Conference on computer vision and pattern recognition, pp. 1272–1279
go back to reference Gotardo, P. F. U., & Martínez, A. M. (2011a). Kernel non-rigid structure from motion. In International conference on computer vision, pp. 802–809 Gotardo, P. F. U., & Martínez, A. M. (2011a). Kernel non-rigid structure from motion. In International conference on computer vision, pp. 802–809
go back to reference Gotardo, P. F. U., & Martínez, A. M. (2011b). Non-rigid structure from motion with complementary rank-3 spaces. In Conference on computer vision and pattern recognition, pp. 3065–3072 Gotardo, P. F. U., & Martínez, A. M. (2011b). Non-rigid structure from motion with complementary rank-3 spaces. In Conference on computer vision and pattern recognition, pp. 3065–3072
go back to reference Koh, W., Narain, R., & O’Brien, J. F. (2014). View-dependent adaptive cloth simulation. In ACM SIGGRAPH/Eurographics symposium on computer animation, pp. 159–166 Koh, W., Narain, R., & O’Brien, J. F. (2014). View-dependent adaptive cloth simulation. In ACM SIGGRAPH/Eurographics symposium on computer animation, pp. 159–166
go back to reference Lee, M., Cho, J., Choi, C. H., & Oh, S. (2013). Procrustean normal distribution for non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 1280–1287 Lee, M., Cho, J., Choi, C. H., & Oh, S. (2013). Procrustean normal distribution for non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 1280–1287
go back to reference Lim, J., Frahm, J., & Pollefeys, M. (2011). Online environment mapping. In Conference on computer vision and pattern recognition, pp. 3489–3496 Lim, J., Frahm, J., & Pollefeys, M. (2011). Online environment mapping. In Conference on computer vision and pattern recognition, pp. 3489–3496
go back to reference Ma, Y., Kosecka, J., & Sastry, S. (1999). Optimization criteria and geometric algorithms for motion and structure estimation. International Journal on Computer Vision, 44(3), 219–249.CrossRefMATH Ma, Y., Kosecka, J., & Sastry, S. (1999). Optimization criteria and geometric algorithms for motion and structure estimation. International Journal on Computer Vision, 44(3), 219–249.CrossRefMATH
go back to reference Maier-Hein, L., Groch, A., Bartoli, A., Bodenstedt, S., Boissonnat, G., Chang, P. L., et al. (2014). Comparative validation of single-shot optical techniques for laparoscopic 3D surface reconstruction. IEEE Transactions on Medical Imaging, 33(10), 1913–1930.CrossRef Maier-Hein, L., Groch, A., Bartoli, A., Bodenstedt, S., Boissonnat, G., Chang, P. L., et al. (2014). Comparative validation of single-shot optical techniques for laparoscopic 3D surface reconstruction. IEEE Transactions on Medical Imaging, 33(10), 1913–1930.CrossRef
go back to reference Marques, M., & Costeira, J. (2008). Optimal shape from estimation with missing and degenerate data. In Workshop on motion and video computing, pp. 1–6 Marques, M., & Costeira, J. (2008). Optimal shape from estimation with missing and degenerate data. In Workshop on motion and video computing, pp. 1–6
go back to reference Metaxas, D., & Terzopoulos, D. (1993). Shape and nonrigid motion estimation through physics-based synthesis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(6), 580–591.CrossRef Metaxas, D., & Terzopoulos, D. (1993). Shape and nonrigid motion estimation through physics-based synthesis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(6), 580–591.CrossRef
go back to reference Moreno-Noguer, F., & Porta, J. M. (2011). Probabilistic simultaneous pose and non-rigid shape recovery. In Conference on computer vision and pattern recognition, pp. 1289–1296 Moreno-Noguer, F., & Porta, J. M. (2011). Probabilistic simultaneous pose and non-rigid shape recovery. In Conference on computer vision and pattern recognition, pp. 1289–1296
go back to reference Newcome, R., & Davison, A. J. (2010). Live dense reconstruction with a single moving camera. In Conference on computer vision and pattern recognition, pp. 1498–1505 Newcome, R., & Davison, A. J. (2010). Live dense reconstruction with a single moving camera. In Conference on computer vision and pattern recognition, pp. 1498–1505
go back to reference Paladini, M., Del Bue, A., Stosic, M., Dodig, M., Xavier, J., & Agapito, L. (2009). Factorization for non-rigid and articulated structure using metric projections. In Conference on computer vision and pattern recognition, pp. 2898–2905 Paladini, M., Del Bue, A., Stosic, M., Dodig, M., Xavier, J., & Agapito, L. (2009). Factorization for non-rigid and articulated structure using metric projections. In Conference on computer vision and pattern recognition, pp. 2898–2905
go back to reference Paladini, M., Bartoli, A., & Agapito, L. (2010). Sequential non rigid structure from motion with the 3D implicit low rank shape model. In European conference on computer vision, pp. 15–28 Paladini, M., Bartoli, A., & Agapito, L. (2010). Sequential non rigid structure from motion with the 3D implicit low rank shape model. In European conference on computer vision, pp. 15–28
go back to reference Park, H. S., Shiratori, T., Matthews, I., & Sheikh, Y. (2010). 3D reconstruction of a moving point from a series of 2D projections. In European conference on computer vision, pp. 158–171 Park, H. S., Shiratori, T., Matthews, I., & Sheikh, Y. (2010). 3D reconstruction of a moving point from a series of 2D projections. In European conference on computer vision, pp. 158–171
go back to reference Popovic, Z., & Witkin, A. (1999). Physically based motion transformation. In Conference on computer graphics and interactive techniques, pp. 11–20 Popovic, Z., & Witkin, A. (1999). Physically based motion transformation. In Conference on computer graphics and interactive techniques, pp. 11–20
go back to reference Russell, C., Fayad, J., & Agapito, L. (2011). Energy based multiple model fitting for non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 3009–3016 Russell, C., Fayad, J., & Agapito, L. (2011). Energy based multiple model fitting for non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 3009–3016
go back to reference Russell, C., Yu, R., & Agapito, L. (2014). Video pop-up: Monocular 3D reconstruction of dynamic scenes. In European conference on computer vision, pp. 583–598 Russell, C., Yu, R., & Agapito, L. (2014). Video pop-up: Monocular 3D reconstruction of dynamic scenes. In European conference on computer vision, pp. 583–598
go back to reference Salzmann, M., & Urtasun, R. (2011). Physically-based motion models for 3D tracking: A convex formulation. In International conference on computer vision, pp. 2064–2071 Salzmann, M., & Urtasun, R. (2011). Physically-based motion models for 3D tracking: A convex formulation. In International conference on computer vision, pp. 2064–2071
go back to reference Shaji, A., & Chandran, S. (2008). Riemannian manifold optimisation for non-rigid structure from motion. In Workshop on non-rigid shape analysis and deformable image alignment, pp. 1–6 Shaji, A., & Chandran, S. (2008). Riemannian manifold optimisation for non-rigid structure from motion. In Workshop on non-rigid shape analysis and deformable image alignment, pp. 1–6
go back to reference Tao, L., Mein, S. J., Quan, W., & Matuszewski, B. J. (2013). Recursive non-rigid structure from motion with online learned shape prior. Computer Vision and Image Understanding, 117(10), 1287–1298.CrossRef Tao, L., Mein, S. J., Quan, W., & Matuszewski, B. J. (2013). Recursive non-rigid structure from motion with online learned shape prior. Computer Vision and Image Understanding, 117(10), 1287–1298.CrossRef
go back to reference Taylor, J., Jepson, A.D., & Kutulakos, K.N. (2010). Non-rigid structure from locally-rigid motion. In Conference on computer vision and pattern recognition, pp. 2761–2768 Taylor, J., Jepson, A.D., & Kutulakos, K.N. (2010). Non-rigid structure from locally-rigid motion. In Conference on computer vision and pattern recognition, pp. 2761–2768
go back to reference Tomasi, C., & Kanade, T. (1992). Shape and motion from image streams under orthography: A factorization approach. International Journal on Computer Vision, 9(2), 137–154.CrossRef Tomasi, C., & Kanade, T. (1992). Shape and motion from image streams under orthography: A factorization approach. International Journal on Computer Vision, 9(2), 137–154.CrossRef
go back to reference Torresani, L., Hertzmann, A., & Bregler, C. (2008). Nonrigid structure-from-motion: Estimating shape and motion with hierarchical priors. Transactions on Pattern Analysis and Machine Intelligence, 30(5), 878–892.CrossRef Torresani, L., Hertzmann, A., & Bregler, C. (2008). Nonrigid structure-from-motion: Estimating shape and motion with hierarchical priors. Transactions on Pattern Analysis and Machine Intelligence, 30(5), 878–892.CrossRef
go back to reference Valmadre, J., & Lucey, S. (2012). General trajectory prior for non-rigid reconstruction. In Conference on computer vision and pattern recognition, pp. 1394–1401 Valmadre, J., & Lucey, S. (2012). General trajectory prior for non-rigid reconstruction. In Conference on computer vision and pattern recognition, pp. 1394–1401
go back to reference Varol, A., Salzmann, M., Tola, E., & Fua, P. (2009). Template-free monocular reconstruction of deformable surfaces. In International conference on computer vision, pp. 1811–1818 Varol, A., Salzmann, M., Tola, E., & Fua, P. (2009). Template-free monocular reconstruction of deformable surfaces. In International conference on computer vision, pp. 1811–1818
go back to reference Vondrak, M., Sigal, L., & Jenkins, O.C. (2008). Physical simulation for probabilistic motion tracking. In Conference on computer vision and pattern recognition, pp. 1–8 Vondrak, M., Sigal, L., & Jenkins, O.C. (2008). Physical simulation for probabilistic motion tracking. In Conference on computer vision and pattern recognition, pp. 1–8
go back to reference Xiao, J., Chai, J., & Kanade, T. (2006). A closed-form solution to non-rigid shape and motion. International Journal on Computer Vision, 67(2), 233–246.CrossRef Xiao, J., Chai, J., & Kanade, T. (2006). A closed-form solution to non-rigid shape and motion. International Journal on Computer Vision, 67(2), 233–246.CrossRef
Metadata
Title
Combining Local-Physical and Global-Statistical Models for Sequential Deformable Shape from Motion
Authors
Antonio Agudo
Francesc Moreno-Noguer
Publication date
05-12-2016
Publisher
Springer US
Published in
International Journal of Computer Vision / Issue 2/2017
Print ISSN: 0920-5691
Electronic ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-016-0972-8

Other articles of this Issue 2/2017

International Journal of Computer Vision 2/2017 Go to the issue

Premium Partner