Skip to main content

2017 | OriginalPaper | Buchkapitel

The Importance of Structure

verfasst von : Carl Henrik Ek, Danica Kragic

Erschienen in: Robotics Research

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Many tasks in robotics and computer vision are concerned with inferring a continuous or discrete state variable from observations and measurements from the environment. Due to the high-dimensional nature of the input data the inference is often cast as a two stage process: first a low-dimensional feature representation is extracted on which secondly a learning algorithm is applied. Due to the significant progress that have been achieved within the field of machine learning over the last decade focus have placed at the second stage of the inference process, improving the process by exploiting more advanced learning techniques applied to the same (or more of the same) data. We believe that for many scenarios significant strides in performance could be achieved by focusing on representation rather than aiming to alleviate inconclusive and/or redundant information by exploiting more advanced inference methods. This stems from the notion that; given the “correct” representation the inference problem becomes easier to solve. In this paper we argue that one important mode of information for many application scenarios is not the actual variation in the data but the rather the higher order statistics as the structure of variations. We will exemplify this through a set of applications and show different ways of representing the structure of data.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat R. Rensink, J. ORegan, J. Clark, On the failure to detect changes in scenes across brief interruptions. Vis. Cogn. 7(1), 127–145 (2000)CrossRef R. Rensink, J. ORegan, J. Clark, On the failure to detect changes in scenes across brief interruptions. Vis. Cogn. 7(1), 127–145 (2000)CrossRef
2.
Zurück zum Zitat D.J. Simons, C.F. Chabris, Gorillas in our midst: sustained inattentional blindness for dynamic events. Perception 28, 1059–1074 (1999)CrossRef D.J. Simons, C.F. Chabris, Gorillas in our midst: sustained inattentional blindness for dynamic events. Perception 28, 1059–1074 (1999)CrossRef
3.
Zurück zum Zitat D.G. Lowe, Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef D.G. Lowe, Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef
4.
Zurück zum Zitat B.D. Argalla, S. Chernova, M. Veloso, B. Browning, A survey of robot learning from demonstration. Robot. Auton. Syst. 57(5), 469–548 (2009)CrossRef B.D. Argalla, S. Chernova, M. Veloso, B. Browning, A survey of robot learning from demonstration. Robot. Auton. Syst. 57(5), 469–548 (2009)CrossRef
5.
Zurück zum Zitat V. Kruger, D. Kragic, A. Ude, C. Geib, The meaning of action: a review on action recognition and mapping. Adv. Robot. 21(13), 1473–1501 (2007) V. Kruger, D. Kragic, A. Ude, C. Geib, The meaning of action: a review on action recognition and mapping. Adv. Robot. 21(13), 1473–1501 (2007)
6.
Zurück zum Zitat E. Aksoy, A. Abramov, F. Wörgötter, B. Dellen, Categorizing object-action relations from semantic scene graphs, in IEEE International Conference on Robotics and Automation, 2010, pp. 398–405 E. Aksoy, A. Abramov, F. Wörgötter, B. Dellen, Categorizing object-action relations from semantic scene graphs, in IEEE International Conference on Robotics and Automation, 2010, pp. 398–405
7.
Zurück zum Zitat I. Laptev, P. Perez, Retrieving actions in movies, in IEEE International Conference on Computer Vision, 2007, pp. 1–8 I. Laptev, P. Perez, Retrieving actions in movies, in IEEE International Conference on Computer Vision, 2007, pp. 1–8
8.
Zurück zum Zitat I. Laptev, M. Marszalek, C. Schmid, B. Rozenfeld, Learning realistic human actions from movies, in IEEE Conference on Computer Vision and Pattern Recognition, 2008, pp. 1–8 I. Laptev, M. Marszalek, C. Schmid, B. Rozenfeld, Learning realistic human actions from movies, in IEEE Conference on Computer Vision and Pattern Recognition, 2008, pp. 1–8
9.
Zurück zum Zitat H. Kjellstro¨m, J. Romero, D. Kragic, Visual object-action recognition: Inferring object affordances from human demonstration. Comput. Vis. Image Underst. 115, 81–90 (2011)CrossRef H. Kjellstro¨m, J. Romero, D. Kragic, Visual object-action recognition: Inferring object affordances from human demonstration. Comput. Vis. Image Underst. 115, 81–90 (2011)CrossRef
10.
Zurück zum Zitat G. Luo, N. Bergström, C.H. Ek, D. Kragic, Representing actions with kernels, in International Conference of Intelligent Robots and Systems, 2011 G. Luo, N. Bergström, C.H. Ek, D. Kragic, Representing actions with kernels, in International Conference of Intelligent Robots and Systems, 2011
11.
Zurück zum Zitat H. Lodhi, C. Saunders, J. Shawe-Taylor, N. Cristianini, C. Watkins, Text classification using string kernels. J. Mach. Learn. Res. 2, 419–444 (2002)MATH H. Lodhi, C. Saunders, J. Shawe-Taylor, N. Cristianini, C. Watkins, Text classification using string kernels. J. Mach. Learn. Res. 2, 419–444 (2002)MATH
12.
Zurück zum Zitat N. Cristianini, J. Shawe-Taylor, An introduction to Support Vector Machines and Other Kernel Based Learning Methods (Cambridge University Press, 2006) N. Cristianini, J. Shawe-Taylor, An introduction to Support Vector Machines and Other Kernel Based Learning Methods (Cambridge University Press, 2006)
13.
Zurück zum Zitat V. Kruger, D.L. Herzog, Sanmohan A. Ude, D. Kragic, Learning actions from observations. Robot. Autom. Mag. 17(2), 30–43 (2010)CrossRef V. Kruger, D.L. Herzog, Sanmohan A. Ude, D. Kragic, Learning actions from observations. Robot. Autom. Mag. 17(2), 30–43 (2010)CrossRef
14.
Zurück zum Zitat D. Comaniciu, P. Meer, Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)CrossRef D. Comaniciu, P. Meer, Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)CrossRef
15.
Zurück zum Zitat Y. Boykov, M.-P. Jolly, Interactive graph cuts for optimal boundary & region segmentation of objects in N-D Images, in IEEE International Conference on Computer Vision, 2005, pp. 105–112 Y. Boykov, M.-P. Jolly, Interactive graph cuts for optimal boundary & region segmentation of objects in N-D Images, in IEEE International Conference on Computer Vision, 2005, pp. 105–112
16.
Zurück zum Zitat N. Bergström, C.H. Ek, M. Björkman, D. Kragic, Scene understanding through interactive perception, in International Conference on Vision Systems, 2011 N. Bergström, C.H. Ek, M. Björkman, D. Kragic, Scene understanding through interactive perception, in International Conference on Vision Systems, 2011
17.
Zurück zum Zitat M. Everingham, L. Van Gool, C. K. Williams, J. Winn, A. Zisserman, The PASCAL Visual Object Classes Challenge 2010 (VOC2010) (2010) M. Everingham, L. Van Gool, C. K. Williams, J. Winn, A. Zisserman, The PASCAL Visual Object Classes Challenge 2010 (VOC2010) (2010)
18.
Zurück zum Zitat R. Rusu, N. Blodow, M. Beetz, Fast Point Feature Histograms (FPFH) for 3D registration, in International Conference on Robotics and Automation, 2009, pp. 3212–3217 R. Rusu, N. Blodow, M. Beetz, Fast Point Feature Histograms (FPFH) for 3D registration, in International Conference on Robotics and Automation, 2009, pp. 3212–3217
19.
Zurück zum Zitat R. Rusu, A. Holzbach, N. Blodow, M. Beetz, Fast geometric point labeling using conditional random fields, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009, pp. 7–12 R. Rusu, A. Holzbach, N. Blodow, M. Beetz, Fast geometric point labeling using conditional random fields, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009, pp. 7–12
20.
Zurück zum Zitat R.B. Rusu, Semantic 3D Object Maps for Everyday Manipulation in Human Living Environments, Ph.D. thesis, Technische Universität München (2009) R.B. Rusu, Semantic 3D Object Maps for Everyday Manipulation in Human Living Environments, Ph.D. thesis, Technische Universität München (2009)
21.
Zurück zum Zitat J.B. Tenenbaum, V. de Silva, J.C. Langford, A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)CrossRef J.B. Tenenbaum, V. de Silva, J.C. Langford, A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)CrossRef
22.
23.
Zurück zum Zitat J. Pitman, Combinatorial Stochastic Processes (Springer, St. Flour Summer School, Berlin, 2006)MATH J. Pitman, Combinatorial Stochastic Processes (Springer, St. Flour Summer School, Berlin, 2006)MATH
24.
Zurück zum Zitat H. Wallach, S. Jensen, L. Dicker, K. Heller, An alternative prior process for nonparametric bayesian clustering, in International Conference on Artificial Intelligence and Statistics H. Wallach, S. Jensen, L. Dicker, K. Heller, An alternative prior process for nonparametric bayesian clustering, in International Conference on Artificial Intelligence and Statistics
25.
Zurück zum Zitat R. Adams, H. Wallach, Z. Ghahramani, Learning the Structure of Deep Sparse Graphical Models, in International Conference on Artificial Intelligence and Statistics, 2010 R. Adams, H. Wallach, Z. Ghahramani, Learning the Structure of Deep Sparse Graphical Models, in International Conference on Artificial Intelligence and Statistics, 2010
26.
Zurück zum Zitat T.L. Griffiths, Z. Ghahrmani, Infinite latent feature models and the Indian buffet process, in: Advances in Neural Information Processing, 2006, pp. 475–482 T.L. Griffiths, Z. Ghahrmani, Infinite latent feature models and the Indian buffet process, in: Advances in Neural Information Processing, 2006, pp. 475–482
27.
Zurück zum Zitat D. Song, K. Huebner, V. Kyrki, D. Kragic, Learning task constraints for robot grasping using graphical models, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010, pp. 1579–1585 D. Song, K. Huebner, V. Kyrki, D. Kragic, Learning task constraints for robot grasping using graphical models, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010, pp. 1579–1585
28.
Zurück zum Zitat C.H. Ek, D. Song, D. Kragic, Learning conditional structures in graphical models from a large set of observation streams through efficient discretisation, in International Conference on Robotics and Automation, Workshop on Manipulation under Uncertainty, 2011 C.H. Ek, D. Song, D. Kragic, Learning conditional structures in graphical models from a large set of observation streams through efficient discretisation, in International Conference on Robotics and Automation, Workshop on Manipulation under Uncertainty, 2011
29.
Zurück zum Zitat D. Song, C.H. Ek, K. Huebner, D. Kragic, Embodiment-specific representation of robot grasping using graphical models and latent-space discretization, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011, pp. 1–8 D. Song, C.H. Ek, K. Huebner, D. Kragic, Embodiment-specific representation of robot grasping using graphical models and latent-space discretization, in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011, pp. 1–8
30.
Zurück zum Zitat D. Song, C.H. Ek, K. Huebner, D. Kragic, Multivariate discretization for bayesian network structure learning in robot grasping, in International Conference on Robotics and Automation, 2011 D. Song, C.H. Ek, K. Huebner, D. Kragic, Multivariate discretization for bayesian network structure learning in robot grasping, in International Conference on Robotics and Automation, 2011
31.
Zurück zum Zitat M. Titsias, N. Lawrence, Bayesian gaussian process latent variable model, in International Conference on Artificial Intelligence and Statistics, 2010 M. Titsias, N. Lawrence, Bayesian gaussian process latent variable model, in International Conference on Artificial Intelligence and Statistics, 2010
33.
Zurück zum Zitat G. Shakhnarovich, T. Darrell, P. Indyk, Nearest-Neighbor Methods in Learning and Vision (MIT Press, 2005) G. Shakhnarovich, T. Darrell, P. Indyk, Nearest-Neighbor Methods in Learning and Vision (MIT Press, 2005)
34.
Zurück zum Zitat G. Shakhnarovich, P. Viola, T. Darrell, Fast pose estimation with parameter-sensitive hashing, in IEEE International Conference on Computer Vision, 2003, pp. 750–757 G. Shakhnarovich, P. Viola, T. Darrell, Fast pose estimation with parameter-sensitive hashing, in IEEE International Conference on Computer Vision, 2003, pp. 750–757
35.
Zurück zum Zitat O. Boiman, E. Shechtman, M. Irani, In defense of nearest-neighbor based image classification, in Computer Vision and Pattern Recognition, 2008, pp. 1–8 O. Boiman, E. Shechtman, M. Irani, In defense of nearest-neighbor based image classification, in Computer Vision and Pattern Recognition, 2008, pp. 1–8
36.
Zurück zum Zitat K.Q. Weinberger, F. Sha, L. K. Saul, Learning a kernel matrix for nonlinear dimensionality reduction, in International Conference on Machine Learning, 2004 K.Q. Weinberger, F. Sha, L. K. Saul, Learning a kernel matrix for nonlinear dimensionality reduction, in International Conference on Machine Learning, 2004
37.
Zurück zum Zitat S.T. Roweis, L.K. Saul, Nonlinear dimensionality reduction by locally linear embedding. Science 290 S.T. Roweis, L.K. Saul, Nonlinear dimensionality reduction by locally linear embedding. Science 290
Metadaten
Titel
The Importance of Structure
verfasst von
Carl Henrik Ek
Danica Kragic
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-29363-9_7

Neuer Inhalt