Top

Artificial Life and Robotics

Published in:

04-05-2022 | Original Article

Crafting a robotic swarm pursuit–evasion capture strategy using deep reinforcement learning

Authors: Charles H. Wu, Donald A. Sofge, Daniel M. Lofaro

Published in: Artificial Life and Robotics | Issue 2/2022

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

In this paper we study the multi-agent pursuit–evasion problem, and present an extension of the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) deep reinforcement learning algorithm. Previous pursuit–evasion advancements with MADDPG have focused on training capture strategies dependent on the restriction of evader movement with environmental features. We demonstrate a method to train pursuer agents to collaboratively surround and encircle an evader for reliable capture without a strategy rooted in environment entrapment (i.e. cornering). Our method utilizes a novel two-stage, variable-aggression, continuous reward function based on geometrical inscribed circles (incircles), along with a corresponding observation space, with agents operating in an entrapment-disadvantaged environment. Our results show reliable capture of an intelligent, superior evader by three trained pursuers in open space with our encircling strategy. A key novelty of our work is demonstrating the ability to transition behaviors learned using deep reinforcement learning from a simulated robotic system with imperfect world assumptions to a real-world robotic agents.

previous article Joint failure recovery for snake robot locomotion using a shape-based approach

next article Collective transport of arbitrarily shaped objects using robot swarms

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Andreen D, Jenning P, Napp N, Petersen K (2016) Emergent structures assembled by large swarms of simple robots. In: Posthuman frontiers

Awheda MD, Schwartz HM (2016) A decentralized fuzzy learning algorithm for pursuit–evasion differential games with superior evaders. J Intell Robot Syst 83:35–53CrossRef

DeMarco K, Squires E, Day M, Pippin C (2018) Simulating collaborative robots in a massive multi-agent game environment (SCRIMMAGE). In: International symposium on distributed autonomous robotic systems

Guadarrama S, Korattikara A, Ramirez O, Castro P, Holly E, Fishman S, Wang K, Gonina E, Wu N, Kokiopoulou E, Sbaiz L, Smith J, Bartók G, Berent J, Harris C, Vanhoucke V, Brevdo E (2018) TF-Agents: a library for reinforcement learning in tensorflow . https://github.com/tensorflow/agents

Hüttenrauch M, Sosic A, Neumann G (2018) Deep reinforcement learning for swarm systems. CoRR

Lillicrap T.P, Hunt J.J, Pritzel A, Heess N, Erez T, Tassa Y, Silver D, Wierstra D (2016) Continuous control with deep reinforcement learning. CoRR

Lowe R, Wu Y, Tamar A, Harb J, Abbeel P, Mordatch I (2017) Multi-agent actor-critic for mixed cooperative-competitive environments. In: Proceedings of the 31st international conference on neural information processing systems, NIPS’17, p 6382–6393. Curran Associates Inc., Red Hook, NY, USA

Mao H, Zhang Z, Xiao Z, Gong Z (2018) Modelling the dynamic joint policy of teammates with attention multi-agent DDPG. CoRR

Mnih V, Kavukcuoglu K, Silver D, Rusu A, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland A, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, Kumaran D, Wierstra D, Legg S, Hassabis D (2015) Human-level control through deep reinforcement learning. Nature 518:529–533CrossRef

10.

Rycroft C (2009) Voro++: a three-dimensional voronoi cell library in c++. Chaos Interdiscip J Nonlinear Sci

11.

Sheikh H.U, Bölöni L (2019) Designing a multi-objective reward function for creating teams of robotic bodyguards using deep reinforcement learning. ArXiv

12.

Silver D, Lever G, Heess N, Degris T, Wierstra D, Riedmiller M (2014) Deterministic policy gradient algorithms. In: ICML

13.

Singh G, Lofaro D, Sofge D (2020) Pursuit-evasion with decentralized robotic swarm in continuous state space and action space via deep reinforcement learning. In: Proceedings of the 12th international conference on agents and artificial intelligence, vol 1, ICAART, p 226–233. INSTICC, SciTePress

14.

Wang J, Olson E (2016) Apriltag 2: efficient and robust fiducial detection. In: 2016 IEEE/RSJ International conference on intelligent robots and systems (IROS)

15.

Wang X, Cruz J, Chen G, Pham K, Blasch E (2007) Formation control in multi-player pursuit evasion game with superior evaders. In: Proceedings of SPIE—The International Society for Optical Engineering

16.

Weintraub I.E, Pachter M, Garcia E (2020) An introduction to pursuit-evasion differential games. In: 2020 American Control Conference (ACC), pp 1049–1066

17.

Wu C, Lofaro D, Sofge D (2021) A Learned Encircling Strategy for Robot Swarm Pursuit-Evasion Against a Superior Evader. In: The 4th International Symposium on Swarm Behavior and Bio-Inspired Robotics

18.

Wu C, Lofaro D, Sofge D (2021) A learned encircling strategy for robot swarm pursuit-evasion against a superior evader. In: The 15th International Symposium on Distributed Autonomous Robotic Systems (DARS)

Title: Crafting a robotic swarm pursuit–evasion capture strategy using deep reinforcement learning
Authors: Charles H. Wu
Donald A. Sofge
Daniel M. Lofaro
Publication date: 04-05-2022
Publisher: Springer Japan
Published in: Artificial Life and Robotics / Issue 2/2022
Print ISSN: 1433-5298
Electronic ISSN: 1614-7456
DOI: https://doi.org/10.1007/s10015-022-00761-y

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Other articles of this Issue 2/2022

Gait design and experimental validation of a snake robot on a pipe with branches using spiral stairs function

Multimodal optimization by particle swarm optimization with graph-based speciation using -relaxed relative neighborhood graph and seed-centered mutation

Joint failure recovery for snake robot locomotion using a shape-based approach

Understanding of superorganisms: collective behavior, differentiation and social organization

Topology and weight evolving artificial neural networks in cooperative transport by a robotic swarm

A customizable, low-cost alternative for distributed 2D flow sensing in swarms