Skip to main content

2014 | OriginalPaper | Buchkapitel

Bayesian Gait Optimization for Bipedal Locomotion

verfasst von : Roberto Calandra, Nakul Gopalan, André Seyfarth, Jan Peters, Marc Peter Deisenroth

Erschienen in: Learning and Intelligent Optimization

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

One of the key challenges in robotic bipedal locomotion is finding gait parameters that optimize a desired performance criterion, such as speed, robustness or energy efficiency. Typically, gait optimization requires extensive robot experiments and specific expert knowledge. We propose to apply data-driven machine learning to automate and speed up the process of gait optimization. In particular, we use Bayesian optimization to efficiently find gait parameters that optimize the desired performance metric. As a proof of concept we demonstrate that Bayesian optimization is near-optimal in a classical stochastic optimal control framework. Moreover, we validate our approach to Bayesian gait optimization on a low-cost and fragile real bipedal walker and show that good walking gaits can be efficiently found by Bayesian optimization.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
The correct notation would be \(\alpha (\hat{f}(\varvec{\theta }))\), but we use \(\alpha (\varvec{\theta })\) for notational convenience.
 
Literatur
1.
Zurück zum Zitat Auer, P.: Using confidence bounds for exploitation-exploration trade-offs. J. Mach. Learn. Res. (JMLR) 3, 397–422 (2003)MATHMathSciNet Auer, P.: Using confidence bounds for exploitation-exploration trade-offs. J. Mach. Learn. Res. (JMLR) 3, 397–422 (2003)MATHMathSciNet
2.
Zurück zum Zitat Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. (JMLR) 13, 281–305 (2012)MATHMathSciNet Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. (JMLR) 13, 281–305 (2012)MATHMathSciNet
3.
Zurück zum Zitat Bertsekas, D.P.: Dynamic Programming and Optimal Control, 3rd edn. Athena Scientific, Belmont (2007) Bertsekas, D.P.: Dynamic Programming and Optimal Control, 3rd edn. Athena Scientific, Belmont (2007)
4.
Zurück zum Zitat Brochu, E., Cora, V.M., De Freitas, N.:. A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. arXiv preprint arXiv:1012.2599 (2010) Brochu, E., Cora, V.M., De Freitas, N.:. A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. arXiv preprint arXiv:​1012.​2599 (2010)
5.
Zurück zum Zitat Byrd, R.H., Lu, P., Nocedal, J., Zhu, C.: A limited memory algorithm for bound constrained optimization. SIAM J. Sci. Comput. 16(5), 1190–1208 (1995)CrossRefMATHMathSciNet Byrd, R.H., Lu, P., Nocedal, J., Zhu, C.: A limited memory algorithm for bound constrained optimization. SIAM J. Sci. Comput. 16(5), 1190–1208 (1995)CrossRefMATHMathSciNet
6.
Zurück zum Zitat Calandra, R., Seyfarth, A., Peters, J., Deisenroth, M.P.: An experimental comparison of Bayesian optimization for bipedal locomotion. In: Proceedings of 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 1951–1958 (2014) Calandra, R., Seyfarth, A., Peters, J., Deisenroth, M.P.: An experimental comparison of Bayesian optimization for bipedal locomotion. In: Proceedings of 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 1951–1958 (2014)
7.
Zurück zum Zitat Chernova, S., Veloso, M.: An evolutionary approach to gait learning for four-legged robots. In: Proceedings of Intelligent Robots and Systems (IROS), vol. 3, pp. 2562–2567. IEEE (2004) Chernova, S., Veloso, M.: An evolutionary approach to gait learning for four-legged robots. In: Proceedings of Intelligent Robots and Systems (IROS), vol. 3, pp. 2562–2567. IEEE (2004)
8.
Zurück zum Zitat Cox, D.D., John, S.: SDO: a statistical method for global optimization. In: Alexandrov, N., Hussaini, M.Y. (eds.) Multidisciplinary Design Optimization: State of the Art, pp. 315–329. SIAM, Philadelpha (1997) Cox, D.D., John, S.: SDO: a statistical method for global optimization. In: Alexandrov, N., Hussaini, M.Y. (eds.) Multidisciplinary Design Optimization: State of the Art, pp. 315–329. SIAM, Philadelpha (1997)
9.
Zurück zum Zitat Garnett, R., Osborne, M.A., Roberts, S.J.: Bayesian optimization for sensor set selection. In: Information Processing in Sensor Networks, pp. 209–219. ACM (2010) Garnett, R., Osborne, M.A., Roberts, S.J.: Bayesian optimization for sensor set selection. In: Information Processing in Sensor Networks, pp. 209–219. ACM (2010)
10.
Zurück zum Zitat Gopalan, N., Deisenroth, M.P., Peters, J.: Feedback error learning for rhythmic motor primitives. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) (2013) Gopalan, N., Deisenroth, M.P., Peters, J.: Feedback error learning for rhythmic motor primitives. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) (2013)
11.
Zurück zum Zitat Grizzle, J.W., Abba, G., Plestan, F.: Asymptotically stable walking for biped robots: analysis via systems with impulse effects. IEEE Trans. Autom. Control 46(1), 51–64 (2001)CrossRefMATHMathSciNet Grizzle, J.W., Abba, G., Plestan, F.: Asymptotically stable walking for biped robots: analysis via systems with impulse effects. IEEE Trans. Autom. Control 46(1), 51–64 (2001)CrossRefMATHMathSciNet
12.
Zurück zum Zitat Hennig, P., Schuler, C.J.: Entropy search for information-efficient global optimization. J. Mach. Learn. Res. 13, 1809–1837 (2012)MATHMathSciNet Hennig, P., Schuler, C.J.: Entropy search for information-efficient global optimization. J. Mach. Learn. Res. 13, 1809–1837 (2012)MATHMathSciNet
13.
Zurück zum Zitat Ijspeert, A.J., Nakanishi, J., Schaal, S.: Learning attractor landscapes for learning motor primitives. In: Becker, S., Thrun, S., Obermayer, K. (eds.) Advances in Neural Information Processing Systems (NIPS), vol. 15. MIT Press, Cambridge (2003) Ijspeert, A.J., Nakanishi, J., Schaal, S.: Learning attractor landscapes for learning motor primitives. In: Becker, S., Thrun, S., Obermayer, K. (eds.) Advances in Neural Information Processing Systems (NIPS), vol. 15. MIT Press, Cambridge (2003)
14.
15.
Zurück zum Zitat Jones, D.R., Perttune, C.D., Stuckman, B.E.: Lipschitzian optimization without the Lipschitz constant. J. Optim. Theor. Appl. 79(1), 157–181 (1993)CrossRefMATH Jones, D.R., Perttune, C.D., Stuckman, B.E.: Lipschitzian optimization without the Lipschitz constant. J. Optim. Theor. Appl. 79(1), 157–181 (1993)CrossRefMATH
16.
Zurück zum Zitat Kober, J., Peters, J.: Learning motor primitives for robotics. In: International Conference on Robotics and Automation (ICRA) (2009) Kober, J., Peters, J.: Learning motor primitives for robotics. In: International Conference on Robotics and Automation (ICRA) (2009)
17.
Zurück zum Zitat Kushner, H.J.: A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise. J. Basic Eng. 86, 97 (1964)CrossRef Kushner, H.J.: A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise. J. Basic Eng. 86, 97 (1964)CrossRef
18.
Zurück zum Zitat Lizotte, D., Wang, T., Bowling, M., Schuurmans, D.: Automatic gait optimization with Gaussian process regression. In: Proceedings of International Joint Conferences on Artificial Intelligence (IJCAI), pp. 944–949 (2007) Lizotte, D., Wang, T., Bowling, M., Schuurmans, D.: Automatic gait optimization with Gaussian process regression. In: Proceedings of International Joint Conferences on Artificial Intelligence (IJCAI), pp. 944–949 (2007)
19.
Zurück zum Zitat Mockus, J., Tiesis, V., Zilinskas, A.: The application of Bayesian methods for seeking the extremum. Towards Global Optim. 2, 117–129 (1978) Mockus, J., Tiesis, V., Zilinskas, A.: The application of Bayesian methods for seeking the extremum. Towards Global Optim. 2, 117–129 (1978)
20.
Zurück zum Zitat Nakanishi, J., Morimoto, J., Endo, G., Cheng, G., Schaal, S., Kawato, M.: Learning from demonstration and adaptation of biped locomotion. Robot. Autonom. Syst. 47(2), 79–91 (2004)CrossRef Nakanishi, J., Morimoto, J., Endo, G., Cheng, G., Schaal, S., Kawato, M.: Learning from demonstration and adaptation of biped locomotion. Robot. Autonom. Syst. 47(2), 79–91 (2004)CrossRef
21.
Zurück zum Zitat Osborne, M.A., Garnett, R., Roberts, S.J.: Gaussian processes for global optimization. In: 3rd International Conference on Learning and Intelligent Optimization (LION3), pp. 1–15 (2009) Osborne, M.A., Garnett, R., Roberts, S.J.: Gaussian processes for global optimization. In: 3rd International Conference on Learning and Intelligent Optimization (LION3), pp. 1–15 (2009)
22.
Zurück zum Zitat Pongas, D., Billard, A., Schaal, S.:. Rapid synchronization and accurate phase-locking of rhythmic motor primitives. In: Proceedings of Intelligent Robots and Systems (IROS), pp. 2911–2916. IEEE (2005) Pongas, D., Billard, A., Schaal, S.:. Rapid synchronization and accurate phase-locking of rhythmic motor primitives. In: Proceedings of Intelligent Robots and Systems (IROS), pp. 2911–2916. IEEE (2005)
23.
Zurück zum Zitat Rasmussen, C.E., Williams, C.K.I.: Gaussian Processes for Machine Learning. The MIT Press, Cambridge (2006)MATH Rasmussen, C.E., Williams, C.K.I.: Gaussian Processes for Machine Learning. The MIT Press, Cambridge (2006)MATH
24.
Zurück zum Zitat Renjewski, D., Seyfarth, A.: Robots in human biomechanics - a study on ankle push-off in walking. Bioinspir. Biomimetics 7(3), 036005 (2012)CrossRef Renjewski, D., Seyfarth, A.: Robots in human biomechanics - a study on ankle push-off in walking. Bioinspir. Biomimetics 7(3), 036005 (2012)CrossRef
25.
Zurück zum Zitat Renjewski, D.: An engineering contribution to human gait biomechanics. Ph.D. Thesis, TU Ilmenau (2012) Renjewski, D.: An engineering contribution to human gait biomechanics. Ph.D. Thesis, TU Ilmenau (2012)
26.
Zurück zum Zitat Snoek, J., Larochelle, H., Adams, R.P.: Practical Bayesian optimization of machine learning algorithms. In: Advances in Neural Information Processing Systems (NIPS) (2012) Snoek, J., Larochelle, H., Adams, R.P.: Practical Bayesian optimization of machine learning algorithms. In: Advances in Neural Information Processing Systems (NIPS) (2012)
27.
Zurück zum Zitat Srinivas, N., Krause, A., Kakade, S., Seeger, M.: Gaussian process optimization in the bandit setting: no regret and experimental design. In: Proceedings of International Conference on Machine Learning (ICML), pp. 1015–1022 (2010) Srinivas, N., Krause, A., Kakade, S., Seeger, M.: Gaussian process optimization in the bandit setting: no regret and experimental design. In: Proceedings of International Conference on Machine Learning (ICML), pp. 1015–1022 (2010)
28.
Zurück zum Zitat Tedrake, R., Weirui Zhang, T., Sebastian Seung, H.: Learning to walk in 20 minutes. In: Proceedings of the 14th Yale Workshop on Adaptive and Learning Systems (2005) Tedrake, R., Weirui Zhang, T., Sebastian Seung, H.: Learning to walk in 20 minutes. In: Proceedings of the 14th Yale Workshop on Adaptive and Learning Systems (2005)
29.
Zurück zum Zitat Tesch, M., Schneider, J., Choset, H.: Using response surfaces and expected improvement to optimize snake robot gait parameters. In: Proceedings of Intelligent Robots and Systems (IROS), pp. 1069–1074. IEEE (2011) Tesch, M., Schneider, J., Choset, H.: Using response surfaces and expected improvement to optimize snake robot gait parameters. In: Proceedings of Intelligent Robots and Systems (IROS), pp. 1069–1074. IEEE (2011)
Metadaten
Titel
Bayesian Gait Optimization for Bipedal Locomotion
verfasst von
Roberto Calandra
Nakul Gopalan
André Seyfarth
Jan Peters
Marc Peter Deisenroth
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-09584-4_25