2021 | OriginalPaper | Chapter

A Neurodynamic Approach to Stabilization of a 10 DOF Biped Mechanism Using Reinforcement Learning

Authors: Aditya Kameswara Rao Nandula, Sudhir Raj, A. K. Deb, C. S. Kumar

Published in: Mechanism and Machine Science

Publisher: Springer Singapore

Abstract

In this paper, we propose a reinforcement learning (RL) based approach to the stabilization and control of a biped robot. Bipeds have complex stabilization and gait-planning requirements across multiple degrees of freedom. A reinforcement learning strategy is presented for stabilizing a single leg as well as the full biped, so that the system acquires a learned behavior. Each leg of the humanoid biped robot is approximated as a double inverted pendulum, and its static stabilization is studied in the sagittal plane. The equations of motion are derived using Lagrange's formulation. Equivalent single-leg and biped models of the humanoid robot were developed in Gazebo. Through the Robot Operating System (ROS), a reinforcement learning based control algorithm was developed for static stabilization, and the simulation was carried out on the Gazebo model. A total of 1458 states are used for training the reinforcement learning algorithm.
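To make the idea of training over a finite set of states concrete, the following is a minimal sketch of a tabular update over 1458 discrete states. The abstract does not say how the states are constructed or which RL algorithm is used, so everything here is an assumption for illustration: the split into six variables with three bins each and one variable with two bins (2 × 3^6 = 1458), the three-action set, the Q-learning update rule, and the names `BINS`, `encode_state`, and `q_update`.

```python
from math import prod

# Hypothetical discretization: six variables with 3 bins each and one with 2 bins.
# 2 * 3**6 = 1458 matches the state count in the abstract, but this particular
# split is an assumption, not the authors' scheme.
BINS = (3, 3, 3, 3, 3, 3, 2)
N_STATES = prod(BINS)
N_ACTIONS = 3  # hypothetical: e.g. negative, zero, positive joint torque


def encode_state(bin_indices):
    """Map per-variable bin indices to a single state id (mixed-radix encoding)."""
    state = 0
    for index, size in zip(bin_indices, BINS):
        if not 0 <= index < size:
            raise ValueError("bin index out of range")
        state = state * size + index
    return state


def q_update(q, s, a, reward, s_next, alpha=0.1, gamma=0.95):
    """Apply one tabular Q-learning update in place (assumed algorithm)."""
    td_target = reward + gamma * max(q[s_next])
    q[s][a] += alpha * (td_target - q[s][a])


# One row of action values per discrete state, initialized to zero.
q_table = [[0.0] * N_ACTIONS for _ in range(N_STATES)]

# Example transition between two neighboring discrete states.
s = encode_state((1, 1, 1, 1, 1, 1, 0))
s_next = encode_state((1, 1, 1, 1, 1, 2, 0))
q_update(q_table, s, 1, reward=1.0, s_next=s_next)
```

In a ROS and Gazebo setup, the bin indices would be computed from measured joint angles and velocities, and the selected action would be sent to the simulated joints; those interfaces are omitted here because the abstract does not describe them.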

Print ISBN: 978-981-15-4476-7

Electronic ISBN: 978-981-15-4477-4

Copyright Year: 2021
DOI: https://doi.org/10.1007/978-981-15-4477-4_34
