Skip to main content
Top

2021 | OriginalPaper | Chapter

A Neurodynamic Approach to Stabilization of a 10 DOF Biped Mechanism Using Reinforcement Learning

Authors : Aditya Kameswara Rao Nandula, Sudhir Raj, A. K. Deb, C. S. Kumar

Published in: Mechanism and Machine Science

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, we propose a Reinforcement Learning (RL) based approach to stabilizing and control of a biped robot. Bipeds have complex requirements of stabilization and gait planning in multiple degrees of freedom. A reinforcement learning strategy is presented in this work for the stabilization of a single leg as well as a biped to create a learned behavior of a system. In this paper, each leg of the Humanoid biped robot is approximated as a double inverted pendulum, and its static stabilization is studied in the sagittal plane. The equations of motion are derived using Lagrange’s formulation method. An equivalent Humanoid robot single leg and biped model developed in Gazebo. Through Robotic Operating System (ROS), a reinforcement learning based control algorithm was developed for static stabilization, and the simulation was carried out on the Gazebo model. A total of 1458 states are used for training the reinforcement learning algorithm.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Werbos P (1992b) Neurocontrol and supervised learning: an overview and evaluation. In: White, Sofge (eds) Handbook of intelligent control. V.N. Reinhold, New York, pp. 65–89 Werbos P (1992b) Neurocontrol and supervised learning: an overview and evaluation. In: White, Sofge (eds) Handbook of intelligent control. V.N. Reinhold, New York, pp. 65–89
2.
go back to reference Werbos P (2004) 1 ADP: goals, opportunities, and principles. In: SiBarto J, Powell W, Wunsch D (eds) Handbook of learning and approximate dynamic programming. Wiley-IEEE Press, Piscataway, NJ, pp 3–44 Werbos P (2004) 1 ADP: goals, opportunities, and principles. In: SiBarto J, Powell W, Wunsch D (eds) Handbook of learning and approximate dynamic programming. Wiley-IEEE Press, Piscataway, NJ, pp 3–44
3.
go back to reference Werbos P (2009) Intelligence in the brain: a theory of how it works and how to build it. Neural Netw 22(3):200–212. Goal-Directed neural system Werbos P (2009) Intelligence in the brain: a theory of how it works and how to build it. Neural Netw 22(3):200–212. Goal-Directed neural system
4.
go back to reference Bertsekas D (2009) Neuro-dynamic programming. In: Encyclopedia of optimization, pp 2555–2560 Bertsekas D (2009) Neuro-dynamic programming. In: Encyclopedia of optimization, pp 2555–2560
6.
go back to reference Bertsekas D, Tsitsiklis J (1996) Neuro-dynamic programming. Athena Scientific Bertsekas D, Tsitsiklis J (1996) Neuro-dynamic programming. Athena Scientific
7.
go back to reference Khan SG, Herrmann G, Lewis FL, Pipe T, Melhuish C Reinforcement learning and optimal adaptive control: An overview and implementation examples Khan SG, Herrmann G, Lewis FL, Pipe T, Melhuish C Reinforcement learning and optimal adaptive control: An overview and implementation examples
8.
go back to reference Raj S (2016) Reinforcement learning based controller for stabilization of double inverted pendulum. In: 1st IEEE international conference on power electronics, intelligent control and energy systems (ICPEICES-2016) Raj S (2016) Reinforcement learning based controller for stabilization of double inverted pendulum. In: 1st IEEE international conference on power electronics, intelligent control and energy systems (ICPEICES-2016)
9.
go back to reference Yonghwan Oh, Kyung-ho A, Doikkimand chang wan K (2006) An analytical method to generate walking pattern of humanoid robot. In: IEEE industrial electronics, IECON 2006, pp 4159–4164 Yonghwan Oh, Kyung-ho A, Doikkimand chang wan K (2006) An analytical method to generate walking pattern of humanoid robot. In: IEEE industrial electronics, IECON 2006, pp 4159–4164
10.
go back to reference Tang Z, Joo Er M (2007) Humanoid 3D gait generation based on inverted pendulum model. In: 22nd IEEE international symposium on intelligent control, Singapore, 1–3 October 2007 Tang Z, Joo Er M (2007) Humanoid 3D gait generation based on inverted pendulum model. In: 22nd IEEE international symposium on intelligent control, Singapore, 1–3 October 2007
11.
go back to reference Thant AA, Aye KK (2009) Application of cubic spline interpolation to walking patterns of biped robot. World Academy of Science, Engineering and Technology Thant AA, Aye KK (2009) Application of cubic spline interpolation to walking patterns of biped robot. World Academy of Science, Engineering and Technology
12.
go back to reference Benbrahim H, Franklin JA (1997) Biped dynamic walking using Reinforcement learning. Robot Auton Syst 22:283–302CrossRef Benbrahim H, Franklin JA (1997) Biped dynamic walking using Reinforcement learning. Robot Auton Syst 22:283–302CrossRef
Metadata
Title
A Neurodynamic Approach to Stabilization of a 10 DOF Biped Mechanism Using Reinforcement Learning
Authors
Aditya Kameswara Rao Nandula
Sudhir Raj
A. K. Deb
C. S. Kumar
Copyright Year
2021
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-4477-4_34

Premium Partners