Sequential pursuit of multiple targets under external disturbances via Zermelo–Voronoi diagrams

doi:10.1016/j.automatica.2017.03.015

Automatica

Volume 81, July 2017, Pages 253-260

https://doi.org/10.1016/j.automatica.2017.03.015 Get rights and content

Abstract

We address the problem of pursuit between a collection of targets and a team of pursuers distributed in the plane subject to an environmental disturbance (e.g., wind, sea current). The objective of the pursuers is to intercept the moving targets which, however, are not affected by the presence of the flow field. We first solve the multiple-pursuers/single-target problem by assigning only one pursuer to chase the target at every instant of time, based on a Voronoi-like partition of the plane. During the pursuit, the pursuer assignment changes dynamically based on this partition. We then present an algorithm to efficiently update this Voronoi-like partition on-line. The results are then extended to the multiple-pursuers/multiple-targets case. Simulations are included to illustrate the theoretical results.

Introduction

Consider a scenario where a group of helicopters or small UAVs fly in the presence of a wind field and try to capture a group of vehicles moving on the ground, or a team of small marine or underwater vehicles attempting to intercept a ship which is large enough so that the sea currents do not significantly affect its motion. Given such a group of pursuers, we want to find a pursuit strategy to intercept the target(s) in minimum time. Problems of this nature fall under the general class of group pursuit. These are difficult problems to solve, in general (Blagodatskikh, 2008, Blagodatskikh, 2009, Pittsyk and Chikrii, 1982). Their solution is also based on the information the pursuers and the targets/evaders have about each other, resulting in either cooperative or non-cooperative strategies (Bakolas and Tsiotras, 2012, Devillers and Golin, 1993, Krasovskiǐ and Krasovskiǐ, 1994, Pashkov and Terekhov, 1987, Vidal et al., 2002, Yeung and Petrosjan, 2006). In order to solve such problems, in this work we propose to use a sequential pursuit strategy (Bakolas & Tsiotras, 2012). By sequential (or relay) pursuit we mean that for each target, only one pursuer is assigned to chase this target at every instant of time. In addition to simplifying significantly the group pursuit problem, a relay pursuit strategy may be desirable in cases where the power or energy/fuel consumption of the agents is an important consideration, when the agents also play a dual role, both as pursuers as well as guardians protecting a certain area, or in order to account for possible deceptive strategies of an intelligent opponent.

For the multiple-pursuers/single-target problem, in contrast to most existing similar problem formulations in the literature (Bakolas and Tsiotras, 2012, Ibragimov et al., 2012, Jang and Tomlin, 2005, Khaidarov, 1984, Pshenichnyi, 1976), where the effect of the environment is not taken into consideration, in our problem setup (only) the pursuers are affected by known exogenous environmental conditions (e.g., the winds or currents). As with all pursuit games, the solution of this problem depends on the knowledge each pursuer has about the current and future position of the target. In this paper it will be assumed that each pursuer has a stroboscopic view of each target position. That is, each pursuer knows the current position of the target but it knows neither its velocity nor its future position. Our objective is to find the optimal assignment to determine which pursuer to go after which target at each instant of time so as to reduce, or minimize, the overall capture time. No assumptions about the target strategy are explicitly imposed a priori, other than the target moves with maximum speed, the value of which is known to all pursuers.

Our solution strategy is based on a dynamic assignment of the pursuers, by which the “best” pursuer to go after the target is selected according to a Voronoi-like partition of the plane, called the Zermelo–Voronoi partition, or the Zermelo–Voronoi Diagram (ZVD) (Bakolas & Tsiotras, 2010). The major difficulty in this setting arises from the fact that, owing to the presence of the wind field, a point in the plane can be close to a pursuer in terms of Euclidean distance, but it may not be close in terms of minimum-time to intercept. As a result, standard Voronoi partitions for this problem may lead to erroneous conclusions.

The ZVD is based on time-to-intercept as the relevant distance metric, instead of the Euclidean metric used in the standard Voronoi diagram. Such Voronoi-like diagrams have been previously introduced in Bakolas and Tsiotras, 2010, Bakolas and Tsiotras, 2012, Bakolas and Tsiotras, 2013. For instance, in Bakolas (2014) the author constructs a Voronoi-like partition in a spatiotemporal flow field by taking the proximity metric as the time required for each vehicle to reach an arbitrary point using a line-of-sight (LOS) control law. Furthermore, as it is shown later on, the use of the so-called Zermelo Navigation Law (Eqs. (6)–(7) below) leads to a better solution (in terms of capture time) than one that uses a classical LOS control law.

Voronoi-like diagrams have been used in the past to solve group pursuit problems in the plane (Bakolas and Tsiotras, 2010, Bakolas and Tsiotras, 2012). In Bakolas and Tsiotras (2012), the authors proposed a relay pursuit problem where a group of pursuers aims at capturing a target that follows a specific evading strategy. The assignment problem is solved by dynamically assigning the pursuer whose Voronoi cell contains the target. Although our work follows closely the original work in Bakolas and Tsiotras (2012), where ZVDs were first employed to generate the pursuer assignments in an external wind field, our work differs from–and extends the results of–this work as follows: First, and contrary to Bakolas and Tsiotras (2012), we take into account the known disturbance of the environment. Furthermore, this disturbance affects only the pursuers, thus leading to asymmetric dynamics between the target and the pursuers. The asymmetry of the agents’ dynamics does not allow us to use a common reference frame to solve the problem, as was done, for example, in Bakolas and Tsiotras (2010). Second, we use a somewhat different Voronoi-like partition than the one in Bakolas and Tsiotras (2012). Owing to the fact that the target and the pursuers obey different dynamics, in our case we cannot check the condition in Bakolas and Tsiotras (2012) to update the ZVD; instead, we need to update the ZVD continuously. In the process, we propose a numerically efficient algorithm to update the ZVD on-the-fly that may be of independent interest (see Section 5). Third, we assume minimal knowledge of the target state, namely, only its instantaneous position is known to the pursuers. In Bakolas and Tsiotras (2012), on the other hand, it was assumed that the target implements a specific evading strategy, which is known to all of the pursuers.

Contributions. The major contributions of the paper are summarized below:

(a)
We provide a new formulation for the solution of an asymmetric group pursuit problem using the recently introduced concept of Zermelo–Voronoi diagram (ZVD). The formulation leads to a decentralized solution of the original group pursuit problem by decomposing it to a sequence of simpler pursuit problems that are much easier to solve.
(b)
We provide conditions so that the well-known Zermelo Navigation Law (ZNL) achieves capture against a maneuvering target. This key result allows us to implement the ZNL in a sequential manner against arbitrarily maneuvering targets and in the presence of unknown exogenous drift fields.
(c)
We propose a decentralized algorithm for updating efficiently the ZVD as the pursuit evolves based on the dual of the Voronoi diagram, that is, the Delaunay Triangulation (DT) and we provide a complexity analysis for updating the DT (and hence also the ZVD). The updated ZVD is then used to provide the best assignments of the active pursuer(s).
(d)
One of the major benefits of the proposed approach is that it scales well with the number of pursuers and the targets involved. We propose two algorithms that address multiple-pursuer/multi-target problems that make use of this nice property of the proposed ZVD decomposition.

Section snippets

Preliminaries on plane tessellations

Given a finite number of distinct points in the Euclidean plane, called the generators, we associate their locations with a set of points in the plane, such that each point in this set is closer (with respect to a given distance metric) to its own generator than to any other generator. The result is a tessellation of the plane into a set of regions associated with the given generators (Okabe, Boots, Sugihara, & Chiu, 2009).

Problem setup

In this section, we formulate the dynamic pursuit problem with multiple pursuers and a single target. Section 6 provides an extension of the methodology to problems with multiple targets. To this end, consider a group of $n$ pursuers in the plane indexed by the set $I$ , and assume that at time $t = 0$ the pursuers are located at $n$ distinct positions in the plane, designated by $P_{0} = {X_{P_{0}}^{i} \in R^{2}, i \in I}$ . The kinematics of the pursuers are described by ${\dot{X}}_{P}^{i} = u_{P}^{i} + w (t), X_{P}^{i} (0) = X_{P_{0}}^{i}, i \in I,$ where $X_{P}^{i} ≔ {[x_{P}^{i}, y_{P}^{i}]}^{⊤} \in R^{2}$

Analysis and implementation of the pursuer–target assignment problem

Before proceeding with the solution of the optimal pursuer assignment problem, we first need to determine the conditions on the target’s maneuverability such that there exists an assignment function leading to finite capture time. Below we provide a sufficient condition for the existence of capture time.

The robust line-of-sight navigation law (RLOS) steers a pursuer towards a target at every instant of time, while maximizing the speed along the ensuing path. This is the optimal strategy, among

Update algorithm for ZVD

In order to implement the previous algorithm we need, at every instant of time, to know the ZVD in order to determine which Zermelo–Voronoi cell the target resides in. We can either build a ZVD from scratch at each time step, or update the ZVD from the previous time step. Since at every time interval, only one generator moves relatively to the rest, it is expected that the latter option will be more efficient. Hereby, we present an algorithm that updates the ZVD from one time step to the next

Sequential pursuit of multiple targets

In this section, we extend the previous results to the case of a pursuit problem with multiple pursuers and targets. To this end, consider a group of $n$ pursuers in the plane, denoted by ${P_{1}, P_{2}, \dots, P_{n}}$ , and $m$ targets, denoted by ${T_{1}, T_{2}, \dots, T_{m}}$ . Let $J = {1, \dots, m}$ denote the index set of the targets. The objective of the pursuers is to intercept the targets. It is assumed that after a pursuer intercepts a target, it is still capable of going after other targets. The kinematics of the $i$ th pursuer, $i \in I$ , are

Simulation results

We consider a scenario with 12 pursuers and 3 targets. The initial positions of the targets are given by $X_{T_{0}}^{1} = [3, 5], X_{T_{0}}^{2} = [6, 7], X_{T_{0}}^{3} = [4.5, 6]$ . Without loss of generality, we assume that each target moves in a straight line with velocity ${[0.4, 0.6]}^{⊤}$ . The pursuers are initially located at distinct positions determined by $P_{0}$ and are shown in Fig. 3. The wind field that affects the pursuers is given by $w (t) = {[- 0.2 - 0.2 cos (t), 0.3]}^{⊤}$ .

Fig. 3 depicts the trajectories of the pursuers and targets when the

Conclusions

Under the assumption that at most one pursuer is actively chasing a moving target at every instant of time, we have proposed a target–pursuer assignment strategy to capture several moving targets by a set of pursuers in a wind field, when the only information about the targets known to the pursuers are their current locations at every instant of time. The targets are not affected by the wind field, resulting in asymmetric pursuer/target dynamics. We take advantage of the fact that the problem

Wei Sun received his B.S. degree in Mathematics from the Peking University, China, in 2010 and his M.S. degrees in both Aerospace Engineering and Mathematics from the Georgia Institute of Technology, Atlanta, in 2015. He is currently a Ph.D. candidate at the Georgia Institute of Technology. His current research interests include optimal control, differential games, reinforcement learning and trajectory optimization with applications in motion planning, and pursuit evasion games.

References (26)

E. Bakolas
Decentralized spatial partitioning for multi-vehicle systems in spatiotemporal flow-field
Automatica
(2014)
E. Bakolas et al.
The Zermelo-Voronoi diagram: A dynamic partition problem
Automatica
(2010)
E. Bakolas et al.
Relay pursuit of a maneuvering target using dynamic Voronoi diagrams
Automatica
(2012)
E. Bakolas et al.
Optimal partitioning for spatiotemporal coverage in a drift field
Automatica
(2013)
A.I. Blagodatskikh
Simultaneous multiple capture in a simple pursuit problem
Journal of Applied Mathematics and Mechanics
(2009)
G.I. Ibragimov et al.
Evasion from many pursuers in simple motion differential game with integral constraints
European Journal of Operational Research
(2012)
B.K. Khaidarov
Positional i-capture in the game of a single evader and several pursuers
Journal of Applied Mathematics and Mechanics
(1984)
M. Pittsyk et al.
On a group pursuit problem
Journal of Applied Mathematics and Mechanics
(1982)
T. Roos
Voronoi diagrams over dynamic scenes
Discrete Applied Mathematics
(1993)
Bakolas, E., & Tsiotras, P. (2010). Minimum-time paths for a light aircraft in the presence of regionally-varying...

E. Bakolas et al.

Feedback navigation in an uncertain flow-field and connections with pursuit strategies

AIAA Journal of Guidance, Control, and Dynamics

(2012)

A.I. Blagodatskikh

Group pursuit in Pontryagin’s nonstationary example

Differential Equations

(2008)

A.E. Bryson et al.

Applied optimal control: Optimization, estimation, and control

(1975)

Cited by (12)

Bounded rational Dubins vehicle coordination for target tracking using reinforcement learning
2023, Automatica
In this paper, we address the problem of cooperative tracking of multiple heterogeneous targets by deploying multiple and heterogeneous pursuers exhibiting different decision-making capabilities. Initially, under infinite resources, we formulate a game between the evader and the pursuing team, with an evader being the maximizing player and the pursuing team being the minimizing one. Subsequently, we relax the perfect rationality assumption via the use of a level- $k$ thinking framework that allows the evaders to not exhibit the same levels of rationality. Such rationality policies are computed by using a reinforcement learning-based architecture and are proven to form Nash policies as the thinking levels increase. Finally, in the case of multiple pursuers against multiple targets, we develop a switched learning scheme with multiple convergence sets by assigning the most intelligent pursuers to the most intelligent evaders.
Distributed coordination of multi-agent systems for neutralizing unknown threats based on a mixed coverage-tracking metric
2020, Journal of the Franklin Institute
Citation Excerpt :
For a bounded area of interest, these subtasks need to be solved simultaneously in defending against threats. Some works focus on optimal deployment of sensor networks to detect the emergence of threats [15,16], some works propose optimal threat tracking strategies considering costs of energy or resource [17,18], and others develop task assignment mechanisms to switch between sensor deployment and target tracking [19,20]. However, related studies often assume that the number and dynamics of threats are known in advance or the information of each threat is available with dense deployment of sensors.
This paper presents a coordination control method for neutralizing multiple threats by a team of mobile agents in a bounded area. Threat here refers to an unexpected target intruding into the area. Without knowing the number of threats, how they maneuver and when to appear a priori, agents equipped with active sensing and actuating devices are driven to detect, track and intercept those threats as many as possible. In order to increase the probability of detecting new threats, a metric (called the mixed coverage-tracking metric) with dynamic task assignment mechanism is introduced. More specifically, the metric is a weighted sum of the travel cost for area coverage and threat tracking respectively. The control objective is to find optimal trajectories and task assignment values of agents that can minimize the expected mixed metric. Based on Voronoi partition of the mission area, a gradient based control law is designed to drive each agent towards its locally optimum configuration. Meanwhile, a task assignment control law is employed to smoothly switch between area coverage and threat tracking depending on the density of detected threats within each agent’s Voronoi cell. Resorting to optimal control and Lyapunov stability theory, the proposed control method can guarantee that each agent asymptotically converges to its centroid of Voronoi cell. Simulation examples are provided to illustrate the effectiveness of the theoretical results.
Omni-Directional Capture for Multi-Drone Based on 3D-Voronoi Tessellation
2023, Drones
A Region-Based Relay Pursuit Scheme for a Pursuit-Evasion Game With a Single Evader and Multiple Pursuers
2023, IEEE Transactions on Systems, Man, and Cybernetics: Systems
Relay pursuit of an evader by a heterogeneous group of pursuers using potential games
2020, arXiv
Pursuit Differential Game of Many Pursuers with Integral Constraints on Compact Convex Set
2020, Bulletin of the Malaysian Mathematical Sciences Society

View all citing articles on Scopus

Panagiotis Tsiotras is the Dean’s Professor in the School of Aerospace Engineering at the Georgia Institute of Technology. He received his Engineering Diploma in Mechanical Engineering from the National Technical University of Athens, in Greece (1986) and his Ph.D. degree in Aeronautics and Astronautics from Purdue (1993). His main research interests are in optimal and nonlinear control, and vehicle autonomy. In 1996 he received the NSF CAREER award. He has served in the Editorial Boards of the Transactions on Automatic Control, the IEEE Control Systems Magazine, the Journal of Guidance, Control and Dynamics and the journal Dynamics and Control. He is a Fellow of AIAA and a Senior Member of IEEE.

^☆: This work has been supported by NSF award 1160780 and AFOSR award FA9550-13-1-0029. The material in this paper was partially presented at the 52nd IEEE Conference on Decision and Control, December 10–13, 2013, Florence, Italy. This paper was recommended for publication in revised form by Associate Editor Andrey V. Savkin under the direction of Editor Ian R. Petersen.

¹: Fax: +1 404 894 2760.

View full text

Brief paperSequential pursuit of multiple targets under external disturbances via Zermelo–Voronoi diagrams☆

Abstract

Introduction

Section snippets

Preliminaries on plane tessellations

Problem setup

Analysis and implementation of the pursuer–target assignment problem

Update algorithm for ZVD

Sequential pursuit of multiple targets

Simulation results

Conclusions

Automatica

Automatica

Automatica

Automatica

Journal of Applied Mathematics and Mechanics

European Journal of Operational Research

Journal of Applied Mathematics and Mechanics

Journal of Applied Mathematics and Mechanics

Discrete Applied Mathematics

Feedback navigation in an uncertain flow-field and connections with pursuit strategies

AIAA Journal of Guidance, Control, and Dynamics

Group pursuit in Pontryagin’s nonstationary example

Differential Equations

Applied optimal control: Optimization, estimation, and control

Brief paper
Sequential pursuit of multiple targets under external disturbances via Zermelo–Voronoi diagrams☆