Skip to main content
Top

2017 | OriginalPaper | Chapter

The Importance of Structure

Authors : Carl Henrik Ek, Danica Kragic

Published in: Robotics Research

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Many tasks in robotics and computer vision are concerned with inferring a continuous or discrete state variable from observations and measurements from the environment. Due to the high-dimensional nature of the input data the inference is often cast as a two stage process: first a low-dimensional feature representation is extracted on which secondly a learning algorithm is applied. Due to the significant progress that have been achieved within the field of machine learning over the last decade focus have placed at the second stage of the inference process, improving the process by exploiting more advanced learning techniques applied to the same (or more of the same) data. We believe that for many scenarios significant strides in performance could be achieved by focusing on representation rather than aiming to alleviate inconclusive and/or redundant information by exploiting more advanced inference methods. This stems from the notion that; given the “correct” representation the inference problem becomes easier to solve. In this paper we argue that one important mode of information for many application scenarios is not the actual variation in the data but the rather the higher order statistics as the structure of variations. We will exemplify this through a set of applications and show different ways of representing the structure of data.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference R. Rensink, J. ORegan, J. Clark, On the failure to detect changes in scenes across brief interruptions. Vis. Cogn. 7(1), 127–145 (2000)CrossRef R. Rensink, J. ORegan, J. Clark, On the failure to detect changes in scenes across brief interruptions. Vis. Cogn. 7(1), 127–145 (2000)CrossRef
2.
go back to reference D.J. Simons, C.F. Chabris, Gorillas in our midst: sustained inattentional blindness for dynamic events. Perception 28, 1059–1074 (1999)CrossRef D.J. Simons, C.F. Chabris, Gorillas in our midst: sustained inattentional blindness for dynamic events. Perception 28, 1059–1074 (1999)CrossRef
3.
go back to reference D.G. Lowe, Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef D.G. Lowe, Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef
4.
go back to reference B.D. Argalla, S. Chernova, M. Veloso, B. Browning, A survey of robot learning from demonstration. Robot. Auton. Syst. 57(5), 469–548 (2009)CrossRef B.D. Argalla, S. Chernova, M. Veloso, B. Browning, A survey of robot learning from demonstration. Robot. Auton. Syst. 57(5), 469–548 (2009)CrossRef
5.
go back to reference V. Kruger, D. Kragic, A. Ude, C. Geib, The meaning of action: a review on action recognition and mapping. Adv. Robot. 21(13), 1473–1501 (2007) V. Kruger, D. Kragic, A. Ude, C. Geib, The meaning of action: a review on action recognition and mapping. Adv. Robot. 21(13), 1473–1501 (2007)
6.
go back to reference E. Aksoy, A. Abramov, F. Wörgötter, B. Dellen, Categorizing object-action relations from semantic scene graphs, in IEEE International Conference on Robotics and Automation, 2010, pp. 398–405 E. Aksoy, A. Abramov, F. Wörgötter, B. Dellen, Categorizing object-action relations from semantic scene graphs, in IEEE International Conference on Robotics and Automation, 2010, pp. 398–405
7.
go back to reference I. Laptev, P. Perez, Retrieving actions in movies, in IEEE International Conference on Computer Vision, 2007, pp. 1–8 I. Laptev, P. Perez, Retrieving actions in movies, in IEEE International Conference on Computer Vision, 2007, pp. 1–8
8.
go back to reference I. Laptev, M. Marszalek, C. Schmid, B. Rozenfeld, Learning realistic human actions from movies, in IEEE Conference on Computer Vision and Pattern Recognition, 2008, pp. 1–8 I. Laptev, M. Marszalek, C. Schmid, B. Rozenfeld, Learning realistic human actions from movies, in IEEE Conference on Computer Vision and Pattern Recognition, 2008, pp. 1–8
9.
go back to reference H. Kjellstro¨m, J. Romero, D. Kragic, Visual object-action recognition: Inferring object affordances from human demonstration. Comput. Vis. Image Underst. 115, 81–90 (2011)CrossRef H. Kjellstro¨m, J. Romero, D. Kragic, Visual object-action recognition: Inferring object affordances from human demonstration. Comput. Vis. Image Underst. 115, 81–90 (2011)CrossRef
10.
go back to reference G. Luo, N. Bergström, C.H. Ek, D. Kragic, Representing actions with kernels, in International Conference of Intelligent Robots and Systems, 2011 G. Luo, N. Bergström, C.H. Ek, D. Kragic, Representing actions with kernels, in International Conference of Intelligent Robots and Systems, 2011
11.
go back to reference H. Lodhi, C. Saunders, J. Shawe-Taylor, N. Cristianini, C. Watkins, Text classification using string kernels. J. Mach. Learn. Res. 2, 419–444 (2002)MATH H. Lodhi, C. Saunders, J. Shawe-Taylor, N. Cristianini, C. Watkins, Text classification using string kernels. J. Mach. Learn. Res. 2, 419–444 (2002)MATH
12.
go back to reference N. Cristianini, J. Shawe-Taylor, An introduction to Support Vector Machines and Other Kernel Based Learning Methods (Cambridge University Press, 2006) N. Cristianini, J. Shawe-Taylor, An introduction to Support Vector Machines and Other Kernel Based Learning Methods (Cambridge University Press, 2006)
13.
go back to reference V. Kruger, D.L. Herzog, Sanmohan A. Ude, D. Kragic, Learning actions from observations. Robot. Autom. Mag. 17(2), 30–43 (2010)CrossRef V. Kruger, D.L. Herzog, Sanmohan A. Ude, D. Kragic, Learning actions from observations. Robot. Autom. Mag. 17(2), 30–43 (2010)CrossRef
14.
go back to reference D. Comaniciu, P. Meer, Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)CrossRef D. Comaniciu, P. Meer, Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)CrossRef
15.
go back to reference Y. Boykov, M.-P. Jolly, Interactive graph cuts for optimal boundary & region segmentation of objects in N-D Images, in IEEE International Conference on Computer Vision, 2005, pp. 105–112 Y. Boykov, M.-P. Jolly, Interactive graph cuts for optimal boundary & region segmentation of objects in N-D Images, in IEEE International Conference on Computer Vision, 2005, pp. 105–112
16.
go back to reference N. Bergström, C.H. Ek, M. Björkman, D. Kragic, Scene understanding through interactive perception, in International Conference on Vision Systems, 2011 N. Bergström, C.H. Ek, M. Björkman, D. Kragic, Scene understanding through interactive perception, in International Conference on Vision Systems, 2011
17.
go back to reference M. Everingham, L. Van Gool, C. K. Williams, J. Winn, A. Zisserman, The PASCAL Visual Object Classes Challenge 2010 (VOC2010) (2010) M. Everingham, L. Van Gool, C. K. Williams, J. Winn, A. Zisserman, The PASCAL Visual Object Classes Challenge 2010 (VOC2010) (2010)
18.
go back to reference R. Rusu, N. Blodow, M. Beetz, Fast Point Feature Histograms (FPFH) for 3D registration, in International Conference on Robotics and Automation, 2009, pp. 3212–3217 R. Rusu, N. Blodow, M. Beetz, Fast Point Feature Histograms (FPFH) for 3D registration, in International Conference on Robotics and Automation, 2009, pp. 3212–3217
19.
go back to reference R. Rusu, A. Holzbach, N. Blodow, M. Beetz, Fast geometric point labeling using conditional random fields, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009, pp. 7–12 R. Rusu, A. Holzbach, N. Blodow, M. Beetz, Fast geometric point labeling using conditional random fields, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009, pp. 7–12
20.
go back to reference R.B. Rusu, Semantic 3D Object Maps for Everyday Manipulation in Human Living Environments, Ph.D. thesis, Technische Universität München (2009) R.B. Rusu, Semantic 3D Object Maps for Everyday Manipulation in Human Living Environments, Ph.D. thesis, Technische Universität München (2009)
21.
go back to reference J.B. Tenenbaum, V. de Silva, J.C. Langford, A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)CrossRef J.B. Tenenbaum, V. de Silva, J.C. Langford, A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)CrossRef
22.
23.
go back to reference J. Pitman, Combinatorial Stochastic Processes (Springer, St. Flour Summer School, Berlin, 2006)MATH J. Pitman, Combinatorial Stochastic Processes (Springer, St. Flour Summer School, Berlin, 2006)MATH
24.
go back to reference H. Wallach, S. Jensen, L. Dicker, K. Heller, An alternative prior process for nonparametric bayesian clustering, in International Conference on Artificial Intelligence and Statistics H. Wallach, S. Jensen, L. Dicker, K. Heller, An alternative prior process for nonparametric bayesian clustering, in International Conference on Artificial Intelligence and Statistics
25.
go back to reference R. Adams, H. Wallach, Z. Ghahramani, Learning the Structure of Deep Sparse Graphical Models, in International Conference on Artificial Intelligence and Statistics, 2010 R. Adams, H. Wallach, Z. Ghahramani, Learning the Structure of Deep Sparse Graphical Models, in International Conference on Artificial Intelligence and Statistics, 2010
26.
go back to reference T.L. Griffiths, Z. Ghahrmani, Infinite latent feature models and the Indian buffet process, in: Advances in Neural Information Processing, 2006, pp. 475–482 T.L. Griffiths, Z. Ghahrmani, Infinite latent feature models and the Indian buffet process, in: Advances in Neural Information Processing, 2006, pp. 475–482
27.
go back to reference D. Song, K. Huebner, V. Kyrki, D. Kragic, Learning task constraints for robot grasping using graphical models, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010, pp. 1579–1585 D. Song, K. Huebner, V. Kyrki, D. Kragic, Learning task constraints for robot grasping using graphical models, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010, pp. 1579–1585
28.
go back to reference C.H. Ek, D. Song, D. Kragic, Learning conditional structures in graphical models from a large set of observation streams through efficient discretisation, in International Conference on Robotics and Automation, Workshop on Manipulation under Uncertainty, 2011 C.H. Ek, D. Song, D. Kragic, Learning conditional structures in graphical models from a large set of observation streams through efficient discretisation, in International Conference on Robotics and Automation, Workshop on Manipulation under Uncertainty, 2011
29.
go back to reference D. Song, C.H. Ek, K. Huebner, D. Kragic, Embodiment-specific representation of robot grasping using graphical models and latent-space discretization, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011, pp. 1–8 D. Song, C.H. Ek, K. Huebner, D. Kragic, Embodiment-specific representation of robot grasping using graphical models and latent-space discretization, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011, pp. 1–8
30.
go back to reference D. Song, C.H. Ek, K. Huebner, D. Kragic, Multivariate discretization for bayesian network structure learning in robot grasping, in International Conference on Robotics and Automation, 2011 D. Song, C.H. Ek, K. Huebner, D. Kragic, Multivariate discretization for bayesian network structure learning in robot grasping, in International Conference on Robotics and Automation, 2011
31.
go back to reference M. Titsias, N. Lawrence, Bayesian gaussian process latent variable model, in International Conference on Artificial Intelligence and Statistics, 2010 M. Titsias, N. Lawrence, Bayesian gaussian process latent variable model, in International Conference on Artificial Intelligence and Statistics, 2010
33.
go back to reference G. Shakhnarovich, T. Darrell, P. Indyk, Nearest-Neighbor Methods in Learning and Vision (MIT Press, 2005) G. Shakhnarovich, T. Darrell, P. Indyk, Nearest-Neighbor Methods in Learning and Vision (MIT Press, 2005)
34.
go back to reference G. Shakhnarovich, P. Viola, T. Darrell, Fast pose estimation with parameter-sensitive hashing, in IEEE International Conference on Computer Vision, 2003, pp. 750–757 G. Shakhnarovich, P. Viola, T. Darrell, Fast pose estimation with parameter-sensitive hashing, in IEEE International Conference on Computer Vision, 2003, pp. 750–757
35.
go back to reference O. Boiman, E. Shechtman, M. Irani, In defense of nearest-neighbor based image classification, in Computer Vision and Pattern Recognition, 2008, pp. 1–8 O. Boiman, E. Shechtman, M. Irani, In defense of nearest-neighbor based image classification, in Computer Vision and Pattern Recognition, 2008, pp. 1–8
36.
go back to reference K.Q. Weinberger, F. Sha, L. K. Saul, Learning a kernel matrix for nonlinear dimensionality reduction, in International Conference on Machine Learning, 2004 K.Q. Weinberger, F. Sha, L. K. Saul, Learning a kernel matrix for nonlinear dimensionality reduction, in International Conference on Machine Learning, 2004
37.
go back to reference S.T. Roweis, L.K. Saul, Nonlinear dimensionality reduction by locally linear embedding. Science 290 S.T. Roweis, L.K. Saul, Nonlinear dimensionality reduction by locally linear embedding. Science 290
Metadata
Title
The Importance of Structure
Authors
Carl Henrik Ek
Danica Kragic
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-29363-9_7