Skip to main content

2018 | OriginalPaper | Buchkapitel

9. Optimization Problems Associated with Manifold-Valued Curves with Applications in Computer Vision

verfasst von : Rushil Anirudh, Pavan Turaga, Anuj Srivastava

Erschienen in: Handbook of Convex Optimization Methods in Imaging Science

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

A commonly occurring requirement in many computer vision applications is the need to represent, compare, and manipulate manifold-valued curves, while allowing for enough flexibility to operate in resource constrained environments. We address these concerns in this chapter, by proposing a dictionary learning scheme that takes geometry and time into account, while performing better than the original data in applications such as activity recognition. We are able to do this with the use of the transport square-root velocity function, which provides an elastic representation for trajectories on Riemannian manifolds. Since these operations can be computationally very expensive, we also present a geometry-based symbolic approximation framework, as a result of which low-bandwidth transmission and accurate real-time analysis for recognition or searching through sequential data become fairly straightforward. We discuss the different optimization problems encountered in this context—learning a sparse representation for actions using extrinsic and intrinsic features, solving the registration problem between two Riemannian trajectories, and learning an optimal clustering scheme for symbolic approximation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
We are following the notation to denote the vector space (\(\xi \in \mathbb{R}^{6}\)) and the equivalent Lie algebra representation (\(\widehat{\xi }\in \mathfrak{s}\mathfrak{e}(3)\)) as described in p. 411 of [31].
 
Literatur
1.
Zurück zum Zitat Absil P-A, Mahony R, Sepulchre R (2008) Optimization algorithms on matrix manifolds. Princeton University Press, Princeton, NJCrossRefMATH Absil P-A, Mahony R, Sepulchre R (2008) Optimization algorithms on matrix manifolds. Princeton University Press, Princeton, NJCrossRefMATH
2.
Zurück zum Zitat Allauzen C, Raffinot M (2000) Simple optimal string matching algorithm. In: Combinatorial pattern matching, vol 1848 of Lecture notes in computer science, pp 364–374. Springer, Heidelberg Allauzen C, Raffinot M (2000) Simple optimal string matching algorithm. In: Combinatorial pattern matching, vol 1848 of Lecture notes in computer science, pp 364–374. Springer, Heidelberg
3.
Zurück zum Zitat Anirudh R, Ramamurthy K, Thiagarajan JJ, Turaga P, Spanias A (2013) A heterogeneous dictionary model for representation and recognition of human actions. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 3472–3476 Anirudh R, Ramamurthy K, Thiagarajan JJ, Turaga P, Spanias A (2013) A heterogeneous dictionary model for representation and recognition of human actions. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 3472–3476
4.
Zurück zum Zitat Anirudh R, Turaga P, Su J, Srivastava A (2015) Elasting functional coding of human actions: from vector fields to latent variables. In: CVPR, pp 3147–3155 Anirudh R, Turaga P, Su J, Srivastava A (2015) Elasting functional coding of human actions: from vector fields to latent variables. In: CVPR, pp 3147–3155
5.
Zurück zum Zitat Anirudh R, Turaga P, Su J, Srivastava A (2016) Elastic functional coding of Riemannian trajectories. IEEE Trans Pattern Anal Mach Intell 39(5):922–936CrossRef Anirudh R, Turaga P, Su J, Srivastava A (2016) Elastic functional coding of Riemannian trajectories. IEEE Trans Pattern Anal Mach Intell 39(5):922–936CrossRef
6.
Zurück zum Zitat Begelfor E, Werman M (2006) Affine invariance revisited. In: IEEE conference on computer vision and pattern recognition. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol 2. IEEE, pp 2087–2094 Begelfor E, Werman M (2006) Affine invariance revisited. In: IEEE conference on computer vision and pattern recognition. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol 2. IEEE, pp 2087–2094
7.
Zurück zum Zitat Berndt D, Clifford J (1994) Using dynamic time warping to find patterns in time series. In: KDD Workshop, vol 10, pp 359–370. Seattle, WA Berndt D, Clifford J (1994) Using dynamic time warping to find patterns in time series. In: KDD Workshop, vol 10, pp 359–370. Seattle, WA
8.
Zurück zum Zitat Boothby WM (2003) An introduction to differentiable manifolds and Riemannian geometry. Revised 2nd edn. Academic, New York Boothby WM (2003) An introduction to differentiable manifolds and Riemannian geometry. Revised 2nd edn. Academic, New York
9.
Zurück zum Zitat Chakrabarti K, Keogh EJ, Mehrotra S, Pazzani MJ (2002) Locally adaptive dimensionality reduction for indexing large time series databases. ACM Trans Database Syst 27(2):188–228CrossRef Chakrabarti K, Keogh EJ, Mehrotra S, Pazzani MJ (2002) Locally adaptive dimensionality reduction for indexing large time series databases. ACM Trans Database Syst 27(2):188–228CrossRef
10.
Zurück zum Zitat Chaudhry R, Ravichandran A, Hager G, Vidal R (2009) Histograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of human actions. In: IEEE Conference on Computer Vision and Pattern Recognition, 2009 (CVPR 2009). IEEE, pp 1932–1939 Chaudhry R, Ravichandran A, Hager G, Vidal R (2009) Histograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of human actions. In: IEEE Conference on Computer Vision and Pattern Recognition, 2009 (CVPR 2009). IEEE, pp 1932–1939
11.
Zurück zum Zitat Cherian A, Sra S (2015) Riemannian dictionary learning and sparse coding for positive definite matrices. arXiv preprint arXiv:1507.02772 Cherian A, Sra S (2015) Riemannian dictionary learning and sparse coding for positive definite matrices. arXiv preprint arXiv:1507.02772
12.
Zurück zum Zitat Desieno D (1988) Adding a conscience to competitive learning. IEEE International Conference on Neural Networks 1:117–124CrossRef Desieno D (1988) Adding a conscience to competitive learning. IEEE International Conference on Neural Networks 1:117–124CrossRef
13.
Zurück zum Zitat Devanne M, Wannous H, Berretti S, Pala P, Daoudi M, Del Bimbo A (2014) 3-d human action recognition by shape analysis of motion trajectories on riemannian manifold. IEEE Transactions on Cybernetics PP(99):1–1 Devanne M, Wannous H, Berretti S, Pala P, Daoudi M, Del Bimbo A (2014) 3-d human action recognition by shape analysis of motion trajectories on riemannian manifold. IEEE Transactions on Cybernetics PP(99):1–1
15.
Zurück zum Zitat Faraki M, Harandi M, Porikli F (2015) More about VLAD: A leap from Euclidean to Riemannian manifolds. In: Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on, pp 4951–4960 Faraki M, Harandi M, Porikli F (2015) More about VLAD: A leap from Euclidean to Riemannian manifolds. In: Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on, pp 4951–4960
16.
Zurück zum Zitat Fletcher PT, Lu C, Pizer SM, Joshi SC (2004) Principal geodesic analysis for the study of nonlinear statistics of shape. IEEE Transactions on Medical Imaging 23(8):995–1005CrossRef Fletcher PT, Lu C, Pizer SM, Joshi SC (2004) Principal geodesic analysis for the study of nonlinear statistics of shape. IEEE Transactions on Medical Imaging 23(8):995–1005CrossRef
17.
Zurück zum Zitat Gaur U, Zhu Y, Song B, Chowdhury AKR (2011) A “string of feature graphs” model for recognition of complex activities in natural videos. In: ICCV, pp 2595–2602 Gaur U, Zhu Y, Song B, Chowdhury AKR (2011) A “string of feature graphs” model for recognition of complex activities in natural videos. In: ICCV, pp 2595–2602
18.
Zurück zum Zitat Goodall CR, Mardia KV (1999) Projective shape analysis. J Comput Graph Stat 8(2):143–168 (1999) Goodall CR, Mardia KV (1999) Projective shape analysis. J Comput Graph Stat 8(2):143–168 (1999)
19.
Zurück zum Zitat Gorelick L, Blank M, Shechtman E, Irani M, Basri R (2007) Actions as space-time shapes. IEEE Trans Pattern Anal Mach Intell 29(12):2247–2253CrossRef Gorelick L, Blank M, Shechtman E, Irani M, Basri R (2007) Actions as space-time shapes. IEEE Trans Pattern Anal Mach Intell 29(12):2247–2253CrossRef
21.
Zurück zum Zitat Guha T, Ward R (2012) Learning sparse representations for human action recognition. IEEE Trans PAMI 34(8):1576–1588CrossRef Guha T, Ward R (2012) Learning sparse representations for human action recognition. IEEE Trans PAMI 34(8):1576–1588CrossRef
22.
Zurück zum Zitat Harandi MT, Salzmann M, Hartley R (2014) From manifold to manifold: Geometry-aware dimensionality reduction for SPD matrices. In: ECCV 2014, pp 17–32MATH Harandi MT, Salzmann M, Hartley R (2014) From manifold to manifold: Geometry-aware dimensionality reduction for SPD matrices. In: ECCV 2014, pp 17–32MATH
23.
Zurück zum Zitat Harandi MT, Sanderson C, Shen C, Lovell BC (2013) Dictionary learning and sparse coding on grassmann manifolds: An extrinsic solution. In: ICCV, pp 3120–3127 Harandi MT, Sanderson C, Shen C, Lovell BC (2013) Dictionary learning and sparse coding on grassmann manifolds: An extrinsic solution. In: ICCV, pp 3120–3127
24.
Zurück zum Zitat He Z, Cichocki A, Li Y, Xie S, Sanei S (2009) K-hyperline clustering learning for sparse component analysis. Signal Process 89(6):1011–1022CrossRefMATH He Z, Cichocki A, Li Y, Xie S, Sanei S (2009) K-hyperline clustering learning for sparse component analysis. Signal Process 89(6):1011–1022CrossRefMATH
25.
Zurück zum Zitat Ho J, Xie Y, Vemuri BC (2013) On a nonlinear generalization of sparse coding and dictionary learning. In: ICML (3), pp 1480–1488 Ho J, Xie Y, Vemuri BC (2013) On a nonlinear generalization of sparse coding and dictionary learning. In: ICML (3), pp 1480–1488
26.
Zurück zum Zitat Jordan MI (1998) Learning in Graphical Models. Cambridge, MA: MIT PressCrossRef Jordan MI (1998) Learning in Graphical Models. Cambridge, MA: MIT PressCrossRef
28.
Zurück zum Zitat Li W, Zhang Z, Liu Z (2010) Action recognition based on a bag of 3d points. In: Computer Vision and Pattern Recognition Workshops (CVPRW), 2010, IEEE, pp 9–14 Li W, Zhang Z, Liu Z (2010) Action recognition based on a bag of 3d points. In: Computer Vision and Pattern Recognition Workshops (CVPRW), 2010, IEEE, pp 9–14
29.
Zurück zum Zitat Lin J, Keogh EJ, Lonardi S, Chi Chiu BY (2003) A symbolic representation of time series, with implications for streaming algorithms. In: DMKD, pp 2–11 Lin J, Keogh EJ, Lonardi S, Chi Chiu BY (2003) A symbolic representation of time series, with implications for streaming algorithms. In: DMKD, pp 2–11
30.
Zurück zum Zitat Lin J, Li Y (2010) Finding approximate frequent patterns in streaming medical data. In: CBMS, pp 13–18 Lin J, Li Y (2010) Finding approximate frequent patterns in streaming medical data. In: CBMS, pp 13–18
31.
Zurück zum Zitat Murray RM, Li Z, Sastry SS, Sastry SS (1994) A mathematical introduction to robotic manipulation. CRC Press Murray RM, Li Z, Sastry SS, Sastry SS (1994) A mathematical introduction to robotic manipulation. CRC Press
32.
Zurück zum Zitat Pennec X (2006) Intrinsic statistics on Riemannian manifolds: Basic tools for geometric measurements. J Math Imag Vis 25(1):127–154MathSciNetCrossRef Pennec X (2006) Intrinsic statistics on Riemannian manifolds: Basic tools for geometric measurements. J Math Imag Vis 25(1):127–154MathSciNetCrossRef
33.
Zurück zum Zitat Ripley BD (1996) Pattern Recognition and Neural Networks. Cambridge University Press, CambridgeCrossRefMATH Ripley BD (1996) Pattern Recognition and Neural Networks. Cambridge University Press, CambridgeCrossRefMATH
34.
Zurück zum Zitat Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326CrossRef Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326CrossRef
35.
Zurück zum Zitat Seidenari L, Varano V, Berretti S, Bimbo AD, Pala P (2013) Recognizing actions from depth cameras as weakly aligned multi-part bag-of-poses. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR Workshops 2013, pp 479–485 Seidenari L, Varano V, Berretti S, Bimbo AD, Pala P (2013) Recognizing actions from depth cameras as weakly aligned multi-part bag-of-poses. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR Workshops 2013, pp 479–485
37.
Zurück zum Zitat Srivastava A, Jermyn I, Joshi S (2007) Riemannian analysis of probability density functions with applications in vision. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 1–8 Srivastava A, Jermyn I, Joshi S (2007) Riemannian analysis of probability density functions with applications in vision. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 1–8
38.
Zurück zum Zitat Srivastava A, Klassen E, Joshi SH, Jermyn IH (2011) Shape analysis of elastic curves in Euclidean spaces. IEEE Trans Pattern Anal Mach Intell 33:1415–1428CrossRef Srivastava A, Klassen E, Joshi SH, Jermyn IH (2011) Shape analysis of elastic curves in Euclidean spaces. IEEE Trans Pattern Anal Mach Intell 33:1415–1428CrossRef
39.
Zurück zum Zitat Su J, Kurtek S, Klassen E, Srivastava A (2014) Statistical analysis of trajectories on Riemannian manifolds: Bird migration, hurricane tracking, and video surveillance. Ann Appl Stat 8(1):530–552MathSciNetCrossRefMATH Su J, Kurtek S, Klassen E, Srivastava A (2014) Statistical analysis of trajectories on Riemannian manifolds: Bird migration, hurricane tracking, and video surveillance. Ann Appl Stat 8(1):530–552MathSciNetCrossRefMATH
40.
Zurück zum Zitat Su J, Srivastava A, de Souza FDM, Sarkar S (2014) Rate-invariant analysis of trajectories on riemannian manifolds with application in visual speech recognition. In: CVPR 2014, Columbus, OH, USA, June 23–28, 2014, pp 620–627 Su J, Srivastava A, de Souza FDM, Sarkar S (2014) Rate-invariant analysis of trajectories on riemannian manifolds with application in visual speech recognition. In: CVPR 2014, Columbus, OH, USA, June 23–28, 2014, pp 620–627
41.
Zurück zum Zitat Tenenbaum JB, Silva VD, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(5500): 2319–2323CrossRef Tenenbaum JB, Silva VD, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(5500): 2319–2323CrossRef
42.
Zurück zum Zitat Thiagarajan J, Ramamurthy K, Spanias A (2011) Optimality and stability of the K-hyperline clustering algorithm. Pattern Recog Lett 32(9):1299–1304CrossRef Thiagarajan J, Ramamurthy K, Spanias A (2011) Optimality and stability of the K-hyperline clustering algorithm. Pattern Recog Lett 32(9):1299–1304CrossRef
43.
Zurück zum Zitat Turaga PK, Chellappa R (2009) Locally time-invariant models of human activities using trajectories on the Grassmannian. In: CVPR, pp 2435–2441 Turaga PK, Chellappa R (2009) Locally time-invariant models of human activities using trajectories on the Grassmannian. In: CVPR, pp 2435–2441
44.
Zurück zum Zitat Turaga PK, Veeraraghavan A, Srivastava A, Chellappa R (2011) Statistical computations on Grassmann and Stiefel manifolds for image and video-based recognition. IEEE Trans Pattern Anal Mach Intell 33(11):2273–2286CrossRef Turaga PK, Veeraraghavan A, Srivastava A, Chellappa R (2011) Statistical computations on Grassmann and Stiefel manifolds for image and video-based recognition. IEEE Trans Pattern Anal Mach Intell 33(11):2273–2286CrossRef
45.
Zurück zum Zitat Tuzel O, Porikli FM, Meer P (2006) Region covariance: A fast descriptor for detection and classification. European Conference on Computer Vision II:589–600 Tuzel O, Porikli FM, Meer P (2006) Region covariance: A fast descriptor for detection and classification. European Conference on Computer Vision II:589–600
46.
Zurück zum Zitat Veeraraghavan A, Chellappa R Roy-Chowdhury AK (2006) The function space of an activity. IEEE CVPR, pp 959–968 Veeraraghavan A, Chellappa R Roy-Chowdhury AK (2006) The function space of an activity. IEEE CVPR, pp 959–968
47.
Zurück zum Zitat Veeraraghavan A, Chowdhury AKR (2006) The function space of an activity. In: CVPR (1), pp 959–968 Veeraraghavan A, Chowdhury AKR (2006) The function space of an activity. In: CVPR (1), pp 959–968
48.
Zurück zum Zitat Veeraraghavan A, Chowdhury AKR, Chellappa R (2005) Matching shape sequences in video with applications in human movement analysis. IEEE Trans Pattern Anal Mach Intell 27(12):1896–1909CrossRef Veeraraghavan A, Chowdhury AKR, Chellappa R (2005) Matching shape sequences in video with applications in human movement analysis. IEEE Trans Pattern Anal Mach Intell 27(12):1896–1909CrossRef
49.
Zurück zum Zitat Vemulapalli R, Arrate F, Chellappa R (2014) Human action recognition by representing 3d skeletons as points in a lie group. In: (CVPR), 2014, pp 588–595 Vemulapalli R, Arrate F, Chellappa R (2014) Human action recognition by representing 3d skeletons as points in a lie group. In: (CVPR), 2014, pp 588–595
50.
Zurück zum Zitat Xia L, Chen C, Aggarwal J (2012) View invariant human action recognition using histograms of 3d joints. In: Computer Vision and Pattern Recognition Workshops (CVPRW) 2012, IEEE, pp 20–27 Xia L, Chen C, Aggarwal J (2012) View invariant human action recognition using histograms of 3d joints. In: Computer Vision and Pattern Recognition Workshops (CVPRW) 2012, IEEE, pp 20–27
51.
Zurück zum Zitat Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J R Stat Soc Ser B (Statistical Methodology) 68(1):49–67MathSciNetCrossRefMATH Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J R Stat Soc Ser B (Statistical Methodology) 68(1):49–67MathSciNetCrossRefMATH
52.
Zurück zum Zitat Zador P (1982) Asymptotic quantization error of continuous signals and the quantization dimension. IEEE Transactions on Information Theory 28(2):139–149MathSciNetCrossRefMATH Zador P (1982) Asymptotic quantization error of continuous signals and the quantization dimension. IEEE Transactions on Information Theory 28(2):139–149MathSciNetCrossRefMATH
53.
Zurück zum Zitat Zhang S, Kasiviswanathan S, Yuen PC, Harandi M (2015) Online dictionary learning on symmetric positive definite manifolds with vision applications. In: AAAI, pp 3165–3173 Zhang S, Kasiviswanathan S, Yuen PC, Harandi M (2015) Online dictionary learning on symmetric positive definite manifolds with vision applications. In: AAAI, pp 3165–3173
54.
Zurück zum Zitat Zhang Z, Su J, Klassen E, Le H, Srivastava A (2015) Video-based action recognition using rate-invariant analysis of covariance trajectories. CoRR, abs/1503.06699 Zhang Z, Su J, Klassen E, Le H, Srivastava A (2015) Video-based action recognition using rate-invariant analysis of covariance trajectories. CoRR, abs/1503.06699
55.
Zurück zum Zitat Zhao G, Barnard M, Pietikäinen M (2009) Lipreading with local spatiotemporal descriptors. IEEE Trans Multimedia 11(7):1254–1265CrossRef Zhao G, Barnard M, Pietikäinen M (2009) Lipreading with local spatiotemporal descriptors. IEEE Trans Multimedia 11(7):1254–1265CrossRef
56.
Zurück zum Zitat Zhou F, De la Torre F (2012) Generalized time warping for multi-modal alignment of human motion. In: (CVPR), 2012, IEEE, pp 1282–1289 Zhou F, De la Torre F (2012) Generalized time warping for multi-modal alignment of human motion. In: (CVPR), 2012, IEEE, pp 1282–1289
Metadaten
Titel
Optimization Problems Associated with Manifold-Valued Curves with Applications in Computer Vision
verfasst von
Rushil Anirudh
Pavan Turaga
Anuj Srivastava
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-61609-4_9