Skip to main content

2016 | OriginalPaper | Buchkapitel

Humanoid Robot Gait on Sloping Floors Using Reinforcement Learning

verfasst von : Isaac J. Silva, Danilo H. Perico, Thiago P. D. Homem, Claudio O. Vilão Jr., Flavio Tonidandel, Reinaldo A. C. Bianchi

Erschienen in: Robotics

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Climbing ramps is an important ability for humanoid robots: ramps exist everywhere in the world, such as in accessibility ramps and building entrances. This works proposes the use of Reinforcement Learning to learn the action policy that will make a robot walk in an upright position, in a lightly sloped terrain. The proposed architecture of our system is a two-layer combination of the traditional gait generation control loop with a reinforcement learning component. This allows the use of an accelerometer to generate a correction for the gait, when the slope of the floor where the robot is walking changes. Experiments performed on a real robot showed that the proposed architecture is a good solution for the stability problem.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Westervelt, E.R., Grizzle, J.W., Chevallereau, C., Choi, J.H., Morris, B.: Feedback Control of Dynamic Bipedal Robot Locomotion, vol. 28. CRC Press, Boca Raton (2007)CrossRef Westervelt, E.R., Grizzle, J.W., Chevallereau, C., Choi, J.H., Morris, B.: Feedback Control of Dynamic Bipedal Robot Locomotion, vol. 28. CRC Press, Boca Raton (2007)CrossRef
2.
Zurück zum Zitat Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, vol. 1. MIT Press, Cambridge (1998) Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, vol. 1. MIT Press, Cambridge (1998)
3.
Zurück zum Zitat Ha, I., Tamura, Y., Asama, H.: Gait pattern generation and stabilization for humanoid robot based on coupled oscillators. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3207–3212 (2011) Ha, I., Tamura, Y., Asama, H.: Gait pattern generation and stabilization for humanoid robot based on coupled oscillators. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3207–3212 (2011)
4.
Zurück zum Zitat Ha, I., Tamura, Y., Asama, H., Han, J., Hong, D.W.: Development of open humanoid platform DARwIn-OP. In: SICE Annual Conference (SICE), pp. 2178–2181 (2011) Ha, I., Tamura, Y., Asama, H., Han, J., Hong, D.W.: Development of open humanoid platform DARwIn-OP. In: SICE Annual Conference (SICE), pp. 2178–2181 (2011)
5.
Zurück zum Zitat Perico, D.H., Silva, I.J., Vilão Jr., C.O., Homem, T.P., Destro, R.C., Tonidandel, F., Bianchi, R.A.: Newton: a high level control humanoid robot for the RoboCup Soccer KidSize League. In: Osório, F.S., Wolf, D.F., Branco, K.C., Grassi, V., Becker, M., Romero, R.A.F. (eds.) Robotics. CCIS, vol. 507, pp. 53–73. Springer, Heidelberg (2015). doi:10.1007/978-3-662-48134-9_4 CrossRef Perico, D.H., Silva, I.J., Vilão Jr., C.O., Homem, T.P., Destro, R.C., Tonidandel, F., Bianchi, R.A.: Newton: a high level control humanoid robot for the RoboCup Soccer KidSize League. In: Osório, F.S., Wolf, D.F., Branco, K.C., Grassi, V., Becker, M., Romero, R.A.F. (eds.) Robotics. CCIS, vol. 507, pp. 53–73. Springer, Heidelberg (2015). doi:10.​1007/​978-3-662-48134-9_​4 CrossRef
8.
Zurück zum Zitat Watkins, C.: Learning from delayed rewards. Doctoral dissertation, University of Cambridge (1989) Watkins, C.: Learning from delayed rewards. Doctoral dissertation, University of Cambridge (1989)
9.
Zurück zum Zitat Marder, E., Bucher, D.: Central pattern generators and the control of rhythmic movements. Curr. Biol. 11, R986–R996 (2001)CrossRef Marder, E., Bucher, D.: Central pattern generators and the control of rhythmic movements. Curr. Biol. 11, R986–R996 (2001)CrossRef
10.
Zurück zum Zitat Mcgeer, T.: Passive dynamic walking. Int. J. Robot. Res. 9, 62–82 (1990)CrossRef Mcgeer, T.: Passive dynamic walking. Int. J. Robot. Res. 9, 62–82 (1990)CrossRef
11.
Zurück zum Zitat Vukobratović, M., Borovac, B.: Zero-moment point thirty five years of its life. Int. J. Humanoid Rob. 1, 157–173 (2004). World ScientificCrossRef Vukobratović, M., Borovac, B.: Zero-moment point thirty five years of its life. Int. J. Humanoid Rob. 1, 157–173 (2004). World ScientificCrossRef
12.
Zurück zum Zitat Kajita, S., Kanehiro, F., Kaneko, K., Yokoi, K., Hirukawa, H.: The 3D linear inverted pendulum mode: a simple modeling for a biped walking pattern generation. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, vol. 1, pp. 239–246 (2001) Kajita, S., Kanehiro, F., Kaneko, K., Yokoi, K., Hirukawa, H.: The 3D linear inverted pendulum mode: a simple modeling for a biped walking pattern generation. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, vol. 1, pp. 239–246 (2001)
13.
Zurück zum Zitat Zhao, M., Dong, H., Zhang, N.: The instantaneous leg extension model of virtual slope walking. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS, pp. 3220–3225 (2009) Zhao, M., Dong, H., Zhang, N.: The instantaneous leg extension model of virtual slope walking. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS, pp. 3220–3225 (2009)
14.
Zurück zum Zitat Vukobratović, M., Juricic, D.: Contribution to the synthesis of biped gait. IEEE Trans. Biomed. Eng. 16(2), 1–6 (1969)CrossRef Vukobratović, M., Juricic, D.: Contribution to the synthesis of biped gait. IEEE Trans. Biomed. Eng. 16(2), 1–6 (1969)CrossRef
15.
Zurück zum Zitat Zheng, Y.F., Shen, J.: Gait synthesis for the SD-2 biped robot to climb sloping surface. IEEE Trans. Robot. Autom. 6, 86–96 (1990)CrossRef Zheng, Y.F., Shen, J.: Gait synthesis for the SD-2 biped robot to climb sloping surface. IEEE Trans. Robot. Autom. 6, 86–96 (1990)CrossRef
16.
Zurück zum Zitat Chew, C.M., Pratt, J., Pratt, G.: Blind walking of a planar bipedal robot on sloped terrain. In: IEEE International Conference on Robotics and Automation, vol. 1, pp. 381–386 (1999) Chew, C.M., Pratt, J., Pratt, G.: Blind walking of a planar bipedal robot on sloped terrain. In: IEEE International Conference on Robotics and Automation, vol. 1, pp. 381–386 (1999)
17.
Zurück zum Zitat Zhou, C., Yue, P.K., Ni, J., Chan, S.B.: Dynamically stable gait planning for a humanoid robot to climb sloping surface. In: IEEE Conference on Robotics, Automation and Mechatronics, vol. 1, pp. 341–346 (2004) Zhou, C., Yue, P.K., Ni, J., Chan, S.B.: Dynamically stable gait planning for a humanoid robot to climb sloping surface. In: IEEE Conference on Robotics, Automation and Mechatronics, vol. 1, pp. 341–346 (2004)
18.
Zurück zum Zitat Hong, Y.D., Lee, B.J., Kim, J.H.: Command state-based modifiable walking pattern generation on an inclined plane in pitch and roll directions for humanoid robots. IEEE/ASME Trans. Mechatron. 16, 783–789 (2011)CrossRef Hong, Y.D., Lee, B.J., Kim, J.H.: Command state-based modifiable walking pattern generation on an inclined plane in pitch and roll directions for humanoid robots. IEEE/ASME Trans. Mechatron. 16, 783–789 (2011)CrossRef
19.
Zurück zum Zitat Huang, W., Chew, C.M., Zheng, Y., Hong, G.S.: Pattern generation for bipedal walking on slopes and stairs. In: 8th IEEE-RAS International Conference on Humanoid Robots, pp. 205–210 (2008) Huang, W., Chew, C.M., Zheng, Y., Hong, G.S.: Pattern generation for bipedal walking on slopes and stairs. In: 8th IEEE-RAS International Conference on Humanoid Robots, pp. 205–210 (2008)
20.
Zurück zum Zitat Iverach-Brereton, C., Baltes, J., Postnikoff, B., Carrier, D., Anderson, J.: Fuzzy logic control of a humanoid robot on unstable terrain. In: Almeida, L., Ji, J., Steinbauer, G., Luke, S. (eds.) RoboCup 2015. LNCS, vol. 9513, pp. 202–213. Springer, Heidelberg (2015). doi:10.1007/978-3-319-29339-4_17 CrossRef Iverach-Brereton, C., Baltes, J., Postnikoff, B., Carrier, D., Anderson, J.: Fuzzy logic control of a humanoid robot on unstable terrain. In: Almeida, L., Ji, J., Steinbauer, G., Luke, S. (eds.) RoboCup 2015. LNCS, vol. 9513, pp. 202–213. Springer, Heidelberg (2015). doi:10.​1007/​978-3-319-29339-4_​17 CrossRef
21.
Zurück zum Zitat Baltes, J., Iverach-Brereton, C., Anderson, J.: Human inspired control of a small humanoid robot in highly dynamic environments or Jimmy Darwin rocks the Bongo Board. In: Bianchi, R.A.C., Akin, H.L., Ramamoorthy, S., Sugiura, K. (eds.) RoboCup 2014. LNCS, vol. 8992, pp. 466–477. Springer, Heidelberg (2015). doi:10.1007/978-3-319-18615-3_38 CrossRef Baltes, J., Iverach-Brereton, C., Anderson, J.: Human inspired control of a small humanoid robot in highly dynamic environments or Jimmy Darwin rocks the Bongo Board. In: Bianchi, R.A.C., Akin, H.L., Ramamoorthy, S., Sugiura, K. (eds.) RoboCup 2014. LNCS, vol. 8992, pp. 466–477. Springer, Heidelberg (2015). doi:10.​1007/​978-3-319-18615-3_​38 CrossRef
22.
Zurück zum Zitat Perico, D.H., Silva, I.J., Vilao, C.O., Homem, T.P., Destro, R.C., Tonidandel, F., Bianchi, R.: Hardware and software aspects of the design and assembly of a new humanoid robot for RoboCup Soccer. In: Robotics: SBR-LARS Robotics Symposium and Robocontrol (SBR LARS Robocontrol), pp. 73–78 (2014) Perico, D.H., Silva, I.J., Vilao, C.O., Homem, T.P., Destro, R.C., Tonidandel, F., Bianchi, R.: Hardware and software aspects of the design and assembly of a new humanoid robot for RoboCup Soccer. In: Robotics: SBR-LARS Robotics Symposium and Robocontrol (SBR LARS Robocontrol), pp. 73–78 (2014)
23.
Zurück zum Zitat Farchy, A., Barrett, S., MacAlpine, P., Stone, P.: Humanoid robots learning to walk faster: from the real world to simulation and back. In: Proceedings of the 2013 International Conference on Autonomous Agents and Multi-agent Systems, pp. 39–46 (2013) Farchy, A., Barrett, S., MacAlpine, P., Stone, P.: Humanoid robots learning to walk faster: from the real world to simulation and back. In: Proceedings of the 2013 International Conference on Autonomous Agents and Multi-agent Systems, pp. 39–46 (2013)
24.
Zurück zum Zitat Hester, T., Quinlan, M., Stone, P.: Generalized model learning for reinforcement learning on a humanoid robot. In: IEEE International Conference on Robotics and Automation (ICRA), pp. 2369–2374 (2010) Hester, T., Quinlan, M., Stone, P.: Generalized model learning for reinforcement learning on a humanoid robot. In: IEEE International Conference on Robotics and Automation (ICRA), pp. 2369–2374 (2010)
Metadaten
Titel
Humanoid Robot Gait on Sloping Floors Using Reinforcement Learning
verfasst von
Isaac J. Silva
Danilo H. Perico
Thiago P. D. Homem
Claudio O. Vilão Jr.
Flavio Tonidandel
Reinaldo A. C. Bianchi
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-47247-8_14

Neuer Inhalt