Top

International Journal of Computer Vision

Published in:

05-12-2016

Combining Local-Physical and Global-Statistical Models for Sequential Deformable Shape from Motion

Authors: Antonio Agudo, Francesc Moreno-Noguer

Published in: International Journal of Computer Vision | Issue 2/2017

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

In this paper, we simultaneously estimate camera pose and non-rigid 3D shape from a monocular video, using a sequential solution that combines local and global representations. We model the object as an ensemble of particles, each ruled by the linear equation of the Newton’s second law of motion. This dynamic model is incorporated into a bundle adjustment framework, in combination with simple regularization components that ensure temporal and spatial consistency. The resulting approach allows to sequentially estimate shape and camera poses, while progressively learning a global low-rank model of the shape that is fed back into the optimization scheme, introducing thus, global constraints. The overall combination of local (physical) and global (statistical) constraints yields a solution that is both efficient and robust to several artifacts such as noisy and missing data or sudden camera motions, without requiring any training data at all. Validation is done in a variety of real application domains, including articulated and non-rigid motion, both for continuous and discontinuous shapes. Our on-line methodology yields significantly more accurate reconstructions than competing sequential approaches, being even comparable to the more computationally demanding batch methods.

previous article Complex Activity Recognition Via Attribute Dynamics

next article 3D Human Pose Tracking Priors using Geodesic Mixture Models

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

A third-order backward model to code the displacement vector can be expressed by considering 4-time instances as \( \mathbf {f}_{i}^t \approx m_{i} \left[ \frac{ -\mathbf {y}^{t-3}_{i}\,+\,4\mathbf {y}^{t-2}_{i} -\,5\mathbf {y}^{t-1}_{i}\,+\, 2\mathbf {y}^{t}_{i}}{(\Delta t)^{2}} \right] \).

\(\frac{[\text {force}]}{[\text {mass}][\text {time}]^{-2}}=\frac{[\text {mass}][\text {length}][\text {time}]^{-2}}{[\text {mass}][\text {time}]^{-2}}={{{[\text {length}]}}}\)

Note that although \(\mathbf {R}^{j}\) and \(\mathbf {t}^j\) for \(j=\{t-2,t-1\}\) are allowed to change while optimizing the pose and shape at frame t, their value is not propagated back in time. That is, our approach remains purely sequential.

The computational complexity of the product \(\mathbf {A}^\top \mathbf {A}\), where \(\mathbf {A}\) is a sparse \(m\times n\) matrix with \(n_{nz}\) non-zero elements is \(\mathcal {O}(n_{nz}+m+n)\), that is, it depends linearly on \(n_{nz}\), the row size m and column size n of the matrix, but is independent of the product mn. See: http://es.mathworks.com/help/matlab/math/sparse-matrix-operations.html#f6-13058.

http://www.iri.upc.edu/people/aagudo.

Agudo, A., & Moreno-Noguer, F. (2015). Simultaneous pose and non-rigid shape with particle dynamics. In Conference on computer vision and pattern recognition, pp. 2179–2187

Agudo, A., Calvo, B., Montiel, & J. M. M. (2012). Finite element based sequential Bayesian non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 1418–1425

Agudo, A., Agapito, L., Calvo, B., & Montiel, J. M. M. (2014a). Good vibrations: A modal analysis approach for sequential non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 1558–1565

Agudo, A., Montiel, J. M. M., Agapito, L., & Calvo, B. (2014b). Online dense non-rigid 3D shape and camera motion recovery. In British machine vision conference

Agudo, A., Moreno-Noguer, F., Calvo, B., & Montiel, J. M. M. (2016). Sequential non-rigid structure from motion using physical priors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(5), 979–994.CrossRef

Akhter, I., Sheikh, Y., Khan, S., & Kanade, T. (2008). Non-rigid structure from motion in trajectory space. In Neural information processing systems, pp. 41–48

Baraff, D. (1989). Analytical methods for dynamic simulation of non-penetrating rigid bodies. In Conference on computer graphics and interactive techniques, pp. 223–232

Bartoli, A., Gay-Bellile, V., Castellani, U., Peyras, J., Olsen, S., & Sayd, P. (2008). Coarse-to-fine low-rank structure-from-motion. In Conference on computer vision and pattern recognition, pp. 1–8

Brand, M. (2001). Morphable 3D models from video. In Conference on computer vision and pattern recognition, pp. 456–463

Bregler, C., Hertzmann, A., & Biermann, H. (2000). Recovering non-rigid 3D shape from image streams. In Conference on computer vision and pattern recognition, pp. 690–696

Brubaker, M., Sigal, L., & Fleet, D. (2009). Estimating contact dynamics. In International conference on computer vision, pp. 2389–2396

Chhatkuli, A., Pizarro, D., & Bartoli, A. (2014). Non-rigid shape-from-motion for isometric surfaces using infinitesimal planarity. In British machine vision conference

Dai, Y., Li, H., & He, M. (2012). A simple prior-free method for non-rigid structure from motion factorization. In Conference on computer vision and pattern recognition, pp. 2018–2025

Del Bue, A., Llado, X., & Agapito, L. (2006). Non-rigid metric shape and motion recovery from uncalibrated images using priors. In Conference on computer vision and pattern recognition, pp. 1191–1198

Fayad, J., Agapito, L., & Del Bue, A. (2010). Piecewise quadratic reconstruction of non-rigid surfaces from monocular sequences. In European conference on computer vision, pp. 297–310

Garg, R., Roussos, A., & Agapito, L. (2013). Dense variational reconstruction of non-rigid surfaces from monocular video. In Conference on computer vision and pattern recognition, pp. 1272–1279

Gotardo, P. F. U., & Martínez, A. M. (2011a). Kernel non-rigid structure from motion. In International conference on computer vision, pp. 802–809

Gotardo, P. F. U., & Martínez, A. M. (2011b). Non-rigid structure from motion with complementary rank-3 spaces. In Conference on computer vision and pattern recognition, pp. 3065–3072

Koh, W., Narain, R., & O’Brien, J. F. (2014). View-dependent adaptive cloth simulation. In ACM SIGGRAPH/Eurographics symposium on computer animation, pp. 159–166

Lee, M., Cho, J., Choi, C. H., & Oh, S. (2013). Procrustean normal distribution for non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 1280–1287

Lim, J., Frahm, J., & Pollefeys, M. (2011). Online environment mapping. In Conference on computer vision and pattern recognition, pp. 3489–3496

Ma, Y., Kosecka, J., & Sastry, S. (1999). Optimization criteria and geometric algorithms for motion and structure estimation. International Journal on Computer Vision, 44(3), 219–249.CrossRefMATH

Maier-Hein, L., Groch, A., Bartoli, A., Bodenstedt, S., Boissonnat, G., Chang, P. L., et al. (2014). Comparative validation of single-shot optical techniques for laparoscopic 3D surface reconstruction. IEEE Transactions on Medical Imaging, 33(10), 1913–1930.CrossRef

Marques, M., & Costeira, J. (2008). Optimal shape from estimation with missing and degenerate data. In Workshop on motion and video computing, pp. 1–6

Metaxas, D., & Terzopoulos, D. (1993). Shape and nonrigid motion estimation through physics-based synthesis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(6), 580–591.CrossRef

Moreno-Noguer, F., & Porta, J. M. (2011). Probabilistic simultaneous pose and non-rigid shape recovery. In Conference on computer vision and pattern recognition, pp. 1289–1296

Newcome, R., & Davison, A. J. (2010). Live dense reconstruction with a single moving camera. In Conference on computer vision and pattern recognition, pp. 1498–1505

Paladini, M., Del Bue, A., Stosic, M., Dodig, M., Xavier, J., & Agapito, L. (2009). Factorization for non-rigid and articulated structure using metric projections. In Conference on computer vision and pattern recognition, pp. 2898–2905

Paladini, M., Bartoli, A., & Agapito, L. (2010). Sequential non rigid structure from motion with the 3D implicit low rank shape model. In European conference on computer vision, pp. 15–28

Park, H. S., Shiratori, T., Matthews, I., & Sheikh, Y. (2010). 3D reconstruction of a moving point from a series of 2D projections. In European conference on computer vision, pp. 158–171

Popovic, Z., & Witkin, A. (1999). Physically based motion transformation. In Conference on computer graphics and interactive techniques, pp. 11–20

Russell, C., Fayad, J., & Agapito, L. (2011). Energy based multiple model fitting for non-rigid structure from motion. In Conference on computer vision and pattern recognition, pp. 3009–3016

Russell, C., Yu, R., & Agapito, L. (2014). Video pop-up: Monocular 3D reconstruction of dynamic scenes. In European conference on computer vision, pp. 583–598

Salzmann, M., & Urtasun, R. (2011). Physically-based motion models for 3D tracking: A convex formulation. In International conference on computer vision, pp. 2064–2071

Shaji, A., & Chandran, S. (2008). Riemannian manifold optimisation for non-rigid structure from motion. In Workshop on non-rigid shape analysis and deformable image alignment, pp. 1–6

Tao, L., Mein, S. J., Quan, W., & Matuszewski, B. J. (2013). Recursive non-rigid structure from motion with online learned shape prior. Computer Vision and Image Understanding, 117(10), 1287–1298.CrossRef

Taylor, J., Jepson, A.D., & Kutulakos, K.N. (2010). Non-rigid structure from locally-rigid motion. In Conference on computer vision and pattern recognition, pp. 2761–2768

Tomasi, C., & Kanade, T. (1992). Shape and motion from image streams under orthography: A factorization approach. International Journal on Computer Vision, 9(2), 137–154.CrossRef

Torresani, L., Hertzmann, A., & Bregler, C. (2008). Nonrigid structure-from-motion: Estimating shape and motion with hierarchical priors. Transactions on Pattern Analysis and Machine Intelligence, 30(5), 878–892.CrossRef

Valmadre, J., & Lucey, S. (2012). General trajectory prior for non-rigid reconstruction. In Conference on computer vision and pattern recognition, pp. 1394–1401

Varol, A., Salzmann, M., Tola, E., & Fua, P. (2009). Template-free monocular reconstruction of deformable surfaces. In International conference on computer vision, pp. 1811–1818

Vondrak, M., Sigal, L., & Jenkins, O.C. (2008). Physical simulation for probabilistic motion tracking. In Conference on computer vision and pattern recognition, pp. 1–8

Xiao, J., Chai, J., & Kanade, T. (2006). A closed-form solution to non-rigid shape and motion. International Journal on Computer Vision, 67(2), 233–246.CrossRef

Title: Combining Local-Physical and Global-Statistical Models for Sequential Deformable Shape from Motion
Authors: Antonio Agudo
Francesc Moreno-Noguer
Publication date: 05-12-2016
Publisher: Springer US
Published in: International Journal of Computer Vision / Issue 2/2017
Print ISSN: 0920-5691
Electronic ISSN: 1573-1405
DOI: https://doi.org/10.1007/s11263-016-0972-8

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Other articles of this Issue 2/2017

Automatic Sleep System Recommendation by Multi-modal RBG-Depth-Pressure Anthropometric Analysis

Adaptive Spatial-Spectral Dictionary Learning for Hyperspectral Image Restoration

Guest Editorial: Machine Vision Applications

Robust Statistical Frontalization of Human and Animal Faces

Multi-Camera Multi-Target Tracking with Space-Time-View Hyper-graph

Domain Adaptation for Automatic OLED Panel Defect Detection Using Adaptive Support Vector Data Description

Premium Partner