nach oben

Erschienen in:

2019 | OriginalPaper | Buchkapitel

Learning to Run Faster in a Humanoid Robot Soccer Environment Through Reinforcement Learning

verfasst von : Miguel Abreu, Luis Paulo Reis, Nuno Lau

Erschienen in: RoboCup 2019: Robot World Cup XXIII

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Reinforcement learning techniques bring a new perspective to enduring problems. Developing skills from scratch is not only appealing due to the artificial creation of knowledge. It can also replace years of work and refinement in a matter of hours. From all the developed skills in the RoboCup 3D Soccer Simulation League, running is still considerably relevant to determine the winner of any match. However, current approaches do not make full use of the robotic soccer agents’ potential. To narrow this gap, we propose a way of leveraging the Proximal Policy Optimization using the information provided by the simulator for official RoboCup matches. To do this, our algorithm uses a mix of raw, computed and internally generated data. The final result is a sprinting and a stopping behavior that work in tandem to bring the agent from point a to point b in a very short time. The sprinting speed stabilizes at around 2.5 m/s, which is a great improvement over current solutions. Both the sprinting and stopping behaviors are remarkably stable.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Nächstes Kapitel High-Frequency Multi Bus Servo and Sensor Communication Using the Dynamixel Protocol

Abreu, M., Lau, N., Sousa, A., Reis, L.P.: Learning low level skills from scratch for humanoid robot soccer using deep reinforcement learning. In: 19th IEEE International Conference on Autonomous Robot Systems and Competitions (IEEE ICARSC 2019), Gondomar, Porto, Portugal, 24–26 April 2019

Noda, I., Suzuki, S.J., Matsubara, H., Asada, M., Kitano, H.: RoboCup-97: the first robot world cup soccer games and conferences. AI Mag. 19(3), 49 (1998)

Glaser, S.: RoboCup Soccer - 3D Simulation League. https://archive.robocup.info/Soccer/Simulation/2D/binaries/RoboCup/2018/. Accessed 19 Apr 2019

MacAlpine, P., Torabi, F., Pavse, B., Sigmon, J., Stone, P.: UT Austin Villa: RoboCup 2018 3D simulation league champions. In: Holz, D., Genter, K., Saad, M., von Stryk, O. (eds.) RoboCup 2018. LNCS (LNAI), vol. 11374, pp. 462–475. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27544-0_38 CrossRef

Gazebo support for the RoboCup 3D simulator league. https://bitbucket.org/osrf/robocup3ds. Accessed 19 Apr 2019

MacAlpine, P., Stone, P.: UT Austin Villa: RoboCup 2017 3D simulation league competition and technical challenges champions. In: Akiyama, H., Obst, O., Sammut, C., Tonidandel, F. (eds.) RoboCup 2017. LNCS (LNAI), vol. 11175, pp. 473–485. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00308-1_39CrossRef

MacAlpine, P., Depinet, M., Liang, J., Stone, P.: UT Austin Villa: RoboCup 2014 3D simulation league competition and technical challenge champions. In: Bianchi, R.A.C., Akin, H.L., Ramamoorthy, S., Sugiura, K. (eds.) RoboCup 2014. LNCS (LNAI), vol. 8992, pp. 33–46. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18615-3_3CrossRef

Snafii, N., Abdolmaleki, A., Lau, N., Reis, L.P.: Development of an omnidirectional walk engine for soccer humanoid robots. Int. J. Adv. Rob. Syst. 12(12), 193 (2015)

Moradi, K., Fathian, M., Ghidary, S.S.: Omnidirectional walking using central pattern generator. Int. J. Mach. Learn. Cybernet. 7(6), 1023–1033 (2016)CrossRef

10.

Abdolmaleki, A., Lau, N., Reis, L.P., Peters, J., Neumann, G.: Contextual policy search for linear and nonlinear generalization of a humanoid walking controller. J. Intell. Rob. Syst. 83(3), 393–408 (2016)CrossRef

11.

Abdolmaleki, A., Lau, N., Reis, L.P., Peters, J., Neumann, G.: Contextual policy search for generalizing a parameterized biped walking controller. In: 2015 IEEE International Conference on Autonomous Robot Systems and Competitions, pp. 17–22. IEEE (2015)

12.

Shafii, N., Lau, N., Reis, L.P.: Learning to walk fast: optimized hip height movement for simulated and real humanoid robots. J. Intell. Rob. Syst. 80(3), 555–571 (2015)CrossRef

13.

Xu, Y., Vatankhah, H.: SimSpark: an open source robot simulator developed by the RoboCup community. In: Behnke, S., Veloso, M., Visser, A., Xiong, R. (eds.) RoboCup 2013. LNCS (LNAI), vol. 8371, pp. 632–639. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-44468-9_59CrossRef

14.

SoftBank Robotics: Nao the humanoid robot. https://www.softbankrobotics.com/emea/en/nao. Accessed 19 Apr 2019

15.

Akaike, H.: A new look at the statistical model identification. IEEE Trans. Autom. Control 19(6), 716–723 (1974)MathSciNetCrossRef

16.

Sugiura, N.: Further analysis of the data by Akaike’s information criterion and the finite corrections. Commun. Stat. Theory Methods 7(1), 13–26 (1978)CrossRef

17.

Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms. CoRR, vol. abs/1707.06347 (2017)

18.

Dhariwal, P., et al.: Openai baselines. https://github.com/openai/baselines. Accessed 20 Apr 2019

19.

The MagmaOffenburg RoboCup 3D Simulation Team: magmaChallenge: Benchmark tool for RoboCup 3D soccer simulation. https://github.com/magmaOffenburg/magmaChallenge. Accessed 19 Apr 2019

20.

Adelaar, R.S.: The practical biomechanics of running. Am. J. Sports Med. 14(6), 497–500 (1986)CrossRef

21.

Novacheck, T.F.: The biomechanics of running. Gait Posture 7(1), 77–95 (1998)CrossRef

22.

SoftBank Robotics: Aldebaran documentation: Nao - actuator & sensor list. http://doc.aldebaran.com/2-1/family/nao dcm/actuator sensor names.html

Titel: Learning to Run Faster in a Humanoid Robot Soccer Environment Through Reinforcement Learning
verfasst von: Miguel Abreu
Luis Paulo Reis
Nuno Lau
Verlag: Springer International Publishing
Buch: RoboCup 2019: Robot World Cup XXIII
Print ISBN: 978-3-030-35698-9

Electronic ISBN: 978-3-030-35699-6

Copyright-Jahr: 2019
DOI: https://doi.org/10.1007/978-3-030-35699-6_1

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner