Article

A Novel Decoupled Synchronous Control Method for Multiple Autonomous Unmanned Linear Systems: Bounded L2-Gain for Coupling Attenuation

College of Energy and Electrical Engineering, Hohai University, Nanjing 211100, China
* Author to whom correspondence should be addressed.
Appl. Sci. 2022, 12(15), 7551; https://doi.org/10.3390/app12157551
Submission received: 31 May 2022 / Revised: 24 July 2022 / Accepted: 25 July 2022 / Published: 27 July 2022


Featured Application

This proposed method is suitable for the cooperative consensus control of various homogeneous Multiple Autonomous Unmanned linear systems, such as underwater robot swarms and aerial UAV swarms.

Abstract

This paper addresses the distributed optimal decoupled synchronous control of multiple autonomous unmanned linear systems (MAUS) subject to complex network dynamic coupling. A leader–follower mechanism based on neighborhood error dynamics is established, and the network coupling term is regarded as an external disturbance so as to realize decoupled cooperative control of each agent. The bounded L2-gain problem for the network coupling term is formulated as a multi-player zero-sum differential game. It is shown that the solution to the multi-player zero-sum differential game requires the solution of coupled Hamilton–Jacobi (HJ) equations. The coupled HJ equations are transformed into an algebraic Riccati equation (ARE), which can be solved to obtain the Nash equilibrium of the multi-player zero-sum game. It is shown that bounded L2-gain coupling attenuation can be realized by applying the zero-sum game solution as the control protocol, and the uniform ultimate boundedness (UUB) of the local neighborhood error vector is proved under conservative conditions. A simulation example is provided to show the effectiveness of the proposed method.

1. Introduction

In the field of multi-agent distributed control, synchronous cooperative control is one of the most popular research topics because of its wide application prospects in many engineering systems, such as the cooperative control of autonomous underwater vehicles, wind farms and unmanned aerial vehicles. A great deal of research has been devoted to distributed control methods for multi-agent synchronization [1,2,3,4]. The purpose of distributed synchronous control is to design a control protocol for each agent, depending only on the states of neighboring agents, that ensures the states of all agents in the communication digraph finally achieve synchronization. The leader–follower mechanism is the most popular approach to distributed synchronous control because of its simplicity and scalability. The basic idea is that a leader agent is set as the reference node, and the other agents are set as follower nodes whose goal is to track the reference node, achieving the ultimate synchronization of the entire communication network.
On the basis of this mechanism, ref. [5] defines the local neighborhood error of each agent and shows that it is a dynamical system with multiple control inputs, from the agent itself and from all of its neighbors. This means that the local neighborhood error of each agent is the result of coupling from adjacent node agents, which brings considerable complexity to the design of the control protocol. Ref. [6] formulated this intricate relationship as a differential game, namely, a multi-agent differential graphical game, by defining a local performance index for each agent. Optimal control and game theory [7,8] have been successfully utilized to formulate strategic behavior, where the dynamics of each agent rely on the actions of itself and its neighbors. In an optimal control and differential graphical game, each agent minimizes its performance objective (cost value function) by adjusting its control strategy toward the optimum. In [9], the finite-time optimal coordination problem of multi-agent systems (MASs) is investigated. The authors in [10] addressed the centralized optimal coordination problem under tree formation constraints. In [11], the robust optimal formation control of heterogeneous MASs is studied. These published graphical-game and optimal control methods for consensus and synchronization rely on the solution of coupled Hamilton–Jacobi–Isaacs (HJI) equations and Hamilton–Jacobi–Bellman (HJB) equations, respectively. In practice, coupled HJI and coupled HJB equations are difficult to solve by analytical methods owing to their inherent nonlinearity and uncertainty.
The reinforcement learning (RL) method is often regarded as an effective way to solve the coupled HJI and coupled HJB equations. RL is the branch of machine learning concerned with how to methodically adjust the control strategy of agents based on rewards from the environment [12,13,14,15]. In [16], an online distributed optimal adaptive algorithm is proposed for a differential graphical game, in which an intelligent identifier is designed to find the unknown dynamics and a neural actor–critic network structure is introduced to find the Nash equilibrium solutions. In [17], the bounded L2-gain consensus problem for MASs with external disturbance is formulated as a zero-sum differential game by introducing a specific performance index, and a policy iteration (PI) algorithm-based RL method is provided to find the solution of the coupled HJI equations. In [18], the optimal synchronization control problem is studied for homogeneous MASs with input saturation by using RL methods. These works utilize neural networks as approximators and design specific update laws so that the neural networks approximate the optimal value function and optimal control strategy with a certain precision. However, a strict asymptotic convergence proof for the neural networks is not given in these works, and only the boundedness of the approximation errors is guaranteed. In addition, neural network approximator-based RL needs to satisfy the persistence of excitation (PE) condition [19,20,21,22,23], which also limits the practical engineering application of these methods.
The quadratic optimal control problem of a single linear system can be solved via an algebraic Riccati equation (ARE) [24], but the optimal control problem of MASs is far more complicated than that of a single system owing to the state coupling in the control design. At present, some optimal control methods for MASs are accompanied by a heavy computational burden and strong assumptions.
Motivated by the above discussion, this paper focuses on the optimal cooperative control of Multiple Autonomous Unmanned linear systems (MAUS) from a new perspective, i.e., the input coupling from adjacent nodes is regarded as an external disturbance. Thus, the complex distributed multi-agent error dynamics are decoupled into centralized multi-input dynamics. Inspired by the idea of the zero-sum game in [17], this paper formulates these centralized multi-input dynamics as multiple independent multi-player zero-sum differential games. The motivation is to realize the decoupled optimal synchronous control of MASs, and the main contributions of this paper are listed as follows:
(1)
The coupling among the distributed agents is treated as disturbances entering through different channels, and the local neighborhood error dynamics of each agent are modeled as an independent centralized multi-player game.
(2)
The bounded L2-gain problem for coupling attenuation is introduced and formulated as a multi-player zero-sum game by defining a modified performance index. Unlike the L2-gain problem of [17], which concerns disturbance rejection, the motive of the bounded L2-gain for coupling attenuation studied here is to suppress the effect of the coupling on the performance.
(3)
It is proved that the solution of the zero-sum game requires the solution of the coupled Hamilton–Jacobi (HJ) equations. The coupled HJ equation of each agent is transformed into an independent equivalent algebraic Riccati equation, which effectively simplifies the solution process.
This paper is organized as follows. Section 2 provides the mathematical background and derives the local error dynamics of each node, which are coupled through the node's own control protocol and those of its neighbors. Section 3 presents the problem formulation of the bounded L2-gain for coupling attenuation and its equivalent multi-player zero-sum differential game. Section 4 transforms this zero-sum differential game into the solution of an algebraic Riccati equation and proves the uniform ultimate boundedness of the local neighborhood error under conservative conditions. The simulation results and conclusions are presented in Section 5 and Section 6, respectively.

2. Preliminaries and Problem Formulation

2.1. Graph Theory

In this paper, a directed multi-agent communication network is considered. A directed connected graph is defined as $G(V, E, A)$, where $V = \{v_1, v_2, \dots, v_N\}$ represents a finite non-empty set of nodes, $E \subseteq V \times V$ is the set of ordered node pairs and $A = [a_{ij}]$ is the adjacency matrix. If node $v_i$ can receive information from node $v_j$, then the node pair $\bar{v}_{ij} = (v_i, v_j) \in E$, and node $v_j$ is called a neighbor of node $v_i$. The neighbor set of node $v_i$ is represented by $N_i = \{v_j \mid (v_i, v_j) \in E\}$. Correspondingly, the adjacency matrix element $a_{ij} = 1$ when $v_j \in N_i$, and $a_{ij} = 0$ otherwise. The graph Laplacian matrix is defined as $L = D - A$, whose row sums are equal to zero [25]. The diagonal matrix $D = \mathrm{diag}(d_i)$ is the in-degree matrix, where $d_i = \sum_{j \in N_i} a_{ij}$ is the in-degree of node $v_i$.
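As a small illustration (not taken from the paper), the following Python sketch builds the adjacency, in-degree, and Laplacian matrices for a hypothetical 3-node ring digraph and checks the zero row-sum property of $L$:

```python
import numpy as np

# Illustrative sketch: node 1 listens to node 2, node 2 to node 3,
# and node 3 to node 1 (a hypothetical 3-node ring digraph).
A = np.array([[0, 1, 0],
              [0, 0, 1],
              [1, 0, 0]])              # a_ij = 1 iff v_j is a neighbor of v_i
D = np.diag(A.sum(axis=1))             # in-degree matrix D = diag(d_i)
L = D - A                              # graph Laplacian L = D - A
assert np.allclose(L.sum(axis=1), 0)   # row sums of L are zero
```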
Definition 1.
A directed graph is called strongly connected if there is a directed path between any pair of distinct nodes $(v_i, v_j)$, where a directed path is an edge sequence $(v_{i_1}, v_{i_2}), (v_{i_2}, v_{i_3}), \dots, (v_{i_k}, v_j)$.
Definition 2 [17].
A directed tree is a connected graph in which every node, except the root node, has an in-degree equal to one. A graph is said to have a spanning tree if a subset of its edges constructs a directed tree.
In this paper, $\bar{\lambda}(B)$ and $\underline{\lambda}(B)$ represent the maximum and minimum singular values of the matrix $B$, respectively.

2.2. Problem Formulation

Consider the Multiple Autonomous Unmanned linear systems (MAUS) constructed over the directed communication graph $G(V, E, A)$ with $N$ agents, where the dynamics of each agent are described as follows:

$$\dot{x}_i = A x_i + B_i u_i \tag{1}$$

where $x_i \in \mathbb{R}^n$ and $u_i \in \mathbb{R}^{m_i}$ are the states and control inputs of node $i$, respectively. The cooperative control of homogeneous systems is investigated in this paper, and the leader node $x_0 \in \mathbb{R}^n$ is set to satisfy the following dynamics:

$$\dot{x}_0 = A x_0 \tag{2}$$

The MAUS synchronization problem is to design a control protocol $u_i$ for each agent so that the state of each node tracks the leader node, i.e., $x_i - x_0 \to 0, \ \forall i$.
The neighborhood error for each node is defined as [26]
$$\delta_i = \sum_{j \in N_i} a_{ij}(x_i - x_j) + g_i(x_i - x_0) \tag{3}$$

where $g_i \geq 0$ denotes the pinning gain, and there is at least one node that has a link to the leader node.
For the neighborhood error (3), the overall neighborhood error vector of the graph $G(V, E, A)$ is given by

$$\delta = \big((L + \bar{G}) \otimes I_n\big)(x - \underline{x}_0) \tag{4}$$

where $x = [x_1^T \ x_2^T \ \cdots \ x_N^T]^T$ and $\delta = [\delta_1^T \ \delta_2^T \ \cdots \ \delta_N^T]^T$ denote the global state vector and global error vector, respectively. Moreover, $\underline{x}_0 = \underline{I} x_0 \in \mathbb{R}^{nN}$ with $\underline{I} = 1_N \otimes I_n \in \mathbb{R}^{nN \times n}$, where $I_n$ denotes the $n$-dimensional identity matrix and $1_N$ denotes the $N$-vector of ones. The symbol $\otimes$ is the Kronecker product [27]. The diagonal matrix $\bar{G} = \mathrm{diag}(g_1, g_2, \dots, g_N)$ represents the connections between the agents and the leader node.
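To make the Kronecker construction in (4) concrete, here is a brief numpy sketch; it is an illustration only, and the 4-node ring topology and unit pinning gains are our assumptions, not details taken from Figure 1:

```python
import numpy as np

# Sketch of the Kronecker construction in (4) for an assumed topology.
n, N = 2, 4
Aadj = np.array([[0, 0, 0, 1],
                 [1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 1, 0]])                 # a_ij = 1 iff j is a neighbor of i
Lap = np.diag(Aadj.sum(axis=1)) - Aadj          # L = D - A
Gbar = np.eye(N)                                # pinning-gain matrix (assumed)
rng = np.random.default_rng(0)
x = rng.standard_normal(N * n)                  # stacked global state vector
x0 = np.array([1.0, 0.0])                       # leader state
x0_bar = np.kron(np.ones(N), x0)                # underline{x}_0 = (1_N ⊗ I_n) x_0
delta = np.kron(Lap + Gbar, np.eye(n)) @ (x - x0_bar)   # equation (4)
```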
The overall synchronization error is
$$\varepsilon = (x - \underline{x}_0) \in \mathbb{R}^{nN} \tag{5}$$
Assumption 1.
The communication graph is strongly connected, i.e., there is a directed path between any pair of distinct nodes.
On the basis of Assumption 1, if $\bar{G} \neq 0$, then $g_i > 0$ for at least one node. In this case, the matrix $L + \bar{G}$ is non-singular and the real parts of all its eigenvalues are positive [26]. The following lemma can be obtained, which shows that the overall neighborhood error vector $\delta$ is positively correlated with the overall synchronization error $\varepsilon$.
Lemma 1.
If the communication graph is strongly connected and $\bar{G} \neq 0$, the synchronization error is bounded as follows:

$$\|\delta\| / \bar{\lambda}(L + \bar{G}) \leq \|\varepsilon\| \leq \|\delta\| / \underline{\lambda}(L + \bar{G}) \tag{6}$$

Furthermore, $\delta = 0$ if and only if all nodes are synchronized, i.e.,

$$\varepsilon = (x - \underline{x}_0) = 0 \tag{7}$$
The dynamics of the local neighborhood tracking errors are given as
$$\dot{\delta}_i = \sum_{j \in N_i} a_{ij}(\dot{x}_i - \dot{x}_j) + g_i(\dot{x}_i - \dot{x}_0) \tag{8}$$

Substituting (1) and (2) into the above equation yields

$$\dot{\delta}_i = A \delta_i + (d_i + g_i) B_i u_i - \sum_{j \in N_i} a_{ij} B_j u_j \tag{9}$$
It can be seen that the local neighborhood error dynamics of each agent $i$ are affected by multiple control inputs, from node $i$ itself and from its adjacent nodes. The whole MAUS over the communication graph $G(V, E, A)$ therefore exhibits a complex coupling relationship, and solving the optimal control problem for the dynamics (9) under this multi-coupling is quite intricate.

3. Multi-Player Zero-Sum Differential Game for Decoupled Multi-Agent System

3.1. The Bounded L2-Gain Problem for Coupling Attenuation of Multi-Agent System

For decoupling, the inputs from adjacent nodes in the dynamics (9) are replaced by the virtual coupling actions $w_i(t) = [u_j^T]_{j \in N_i}^T$, which are regarded as external disturbances. The performance output is defined as $z_i(t) = [\delta_i^T \ u_i^T]^T$. It is desired to design the control protocol $u_i$ to achieve synchronization while satisfying the following bounded L2-gain condition for the coupling actions with a given $\gamma_i > 0$:

$$\int_0^T \|z_i(t)\|^2 \, dt = \int_0^T \big(\delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i\big) \, dt \leq \gamma_i^2 \int_0^T \sum_{j \in N_i} u_j^T R_{ij} u_j \, dt + \beta(\delta_i(0)) \tag{10}$$

where $\beta(\cdot)$ is a bounded function such that $\beta(0) = 0$, and $Q_i > 0$, $R_{ii} > 0$, $R_{ij} > 0$. $\gamma_i^*$ is defined as the minimum value of $\gamma_i$ for which the bounded L2-gain condition (10) can be satisfied.

3.2. Multi-Player Zero-Sum Differential Game

The following performance index function is defined for each agent:

$$J_i(\delta_i(0), u_i, u_{-i}) = \frac{1}{2} \int_0^\infty \Big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i - \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j \Big) dt \tag{11}$$

where $u_{-i}$ denotes the virtual coupling control inputs from the neighboring nodes, i.e., $u_{-i} = \{u_j \mid j \in N_i\}$. It should be noted that the main difference from [17] is that the coupling control inputs from neighboring nodes are directly regarded as virtual external disturbances, which greatly simplifies the design of the control protocol $u_i$.
The solution to the bounded L2-gain problem for coupling attenuation described in Section 3.1 is equivalent to the Nash equilibrium solution of the multi-player zero-sum game based on the performance index function (11). That is,

$$V_i(\delta_i(0)) = \min_{u_i} \max_{u_{-i}} J_i(\delta_i(0), u_i, u_{-i}) \tag{12}$$
In this multi-player zero-sum game, the goal of $u_i$ is to minimize the value $V_i(\delta_i(0))$; on the contrary, the virtual coupling inputs $u_{-i}$ are assumed to maximize it. This game has a unique solution if a game-theoretic saddle point $(u_i^*, u_{-i}^*)$ exists, i.e.,

$$V_i^*(\delta_i(0)) = \min_{u_i} \max_{u_{-i}} J_i(\delta_i(0), u_i, u_{-i}) = \max_{u_{-i}} \min_{u_i} J_i(\delta_i(0), u_i, u_{-i}) \tag{13}$$
Accordingly, the value $V_i^*(\delta_i(0))$ in the above equation is the value of the zero-sum game and satisfies the following Nash equilibrium condition for all policies $u_i, u_{-i}$:

$$J_i(\delta_i(0), u_i^*, u_{-i}) \leq J_i(\delta_i(0), u_i^*, u_{-i}^*) \leq J_i(\delta_i(0), u_i, u_{-i}^*) \tag{14}$$
For fixed policies $u_i, u_{-i}$, the value function of node $i$ is

$$V_i(\delta_i(t), u_i, u_{-i}) = \frac{1}{2} \int_t^\infty \Big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i - \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j \Big) dt \tag{15}$$
Differential equivalents to the value functions are given as

$$0 = \frac{1}{2} \Big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i - \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j \Big) + \nabla V_i^T \Big( A \delta_i + (d_i + g_i) B_i u_i - \sum_{j \in N_i} a_{ij} B_j u_j \Big), \quad V_i(0) = 0, \ \forall i \in N \tag{16}$$

where $\nabla V_i = \partial V_i / \partial \delta_i \in \mathbb{R}^n$ denotes the gradient vector. The Hamiltonian functions are defined as follows:

$$H_i(\delta_i, \nabla V_i, u_i, u_{-i}) \triangleq \frac{1}{2} \Big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i - \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j \Big) + \nabla V_i^T \Big( A \delta_i + (d_i + g_i) B_i u_i - \sum_{j \in N_i} a_{ij} B_j u_j \Big) \tag{17}$$
For given policies $u_i, u_{-i}$, the partial differential equation $H_i(\delta_i, \nabla V_i, u_i, u_{-i}) = 0$ has a unique solution $V_i(\delta_i)$. The principle of optimality gives

$$\frac{\partial H_i}{\partial u_i} = 0 \ \Rightarrow \ u_i = -(d_i + g_i) R_{ii}^{-1} B_i^T \nabla V_i, \qquad \frac{\partial H_i}{\partial u_j} = 0 \ \Rightarrow \ u_j = -\frac{1}{\gamma_i^2} a_{ij} R_{ij}^{-1} B_j^T \nabla V_i, \quad j \in N_i \tag{18}$$
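For completeness, the first condition in (18) follows directly from the quadratic dependence of (17) on $u_i$ (a reconstruction of the standard stationarity step, not quoted from the paper):

$$\frac{\partial H_i}{\partial u_i} = R_{ii} u_i + (d_i + g_i) B_i^T \nabla V_i = 0 \ \Rightarrow \ u_i = -(d_i + g_i) R_{ii}^{-1} B_i^T \nabla V_i$$

and, analogously, $\partial H_i / \partial u_j = -\gamma_i^2 R_{ij} u_j - a_{ij} B_j^T \nabla V_i = 0$ yields the second condition.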
If $V_i^*$ is the Nash equilibrium solution of the multi-player zero-sum game, that is,

$$V_i^*(\delta_i) = \min_{u_i} \max_{u_{-i}} J_i(\delta_i, u_i, u_{-i}) \tag{19}$$

then we can obtain

$$\min_{u_i} \max_{u_{-i}} H_i(\delta_i, \nabla V_i^*, u_i, u_{-i}) = 0 \tag{20}$$
Substituting the optimal strategies determined by (18) into (20), the coupled Hamilton–Jacobi (HJ) equations yield

$$0 = \frac{1}{2} \delta_i^T Q_i \delta_i + \frac{1}{2} (d_i + g_i)^2 \nabla V_i^T B_i R_{ii}^{-1} B_i^T \nabla V_i - \frac{1}{2 \gamma_i^2} \sum_{j \in N_i} a_{ij}^2 \nabla V_i^T B_j R_{ij}^{-1} B_j^T \nabla V_i + \nabla V_i^T \Big( A \delta_i - (d_i + g_i)^2 B_i R_{ii}^{-1} B_i^T \nabla V_i + \frac{1}{\gamma_i^2} \sum_{j \in N_i} a_{ij}^2 B_j R_{ij}^{-1} B_j^T \nabla V_i \Big), \quad V_i(0) = 0, \ \forall i \in N \tag{21}$$
For a given solution $V_i^*$, defining $u_i^* = u_i(\nabla V_i^*)$ and $u_j^* = u_j(\nabla V_i^*)$ in the same way as (18), (21) can be written compactly as

$$H_i(\delta_i, \nabla V_i^*, u_i^*, u_{-i}^*) = 0, \quad V_i^*(0) = 0 \tag{22}$$
Lemma 2.
For any policies $u_i, u_{-i}$, the following equation holds:

$$H_i(\delta_i, \nabla V_i^*, u_i, u_{-i}) = \frac{1}{2} (u_i - u_i^*)^T R_{ii} (u_i - u_i^*) - \sum_{j \in N_i} \frac{\gamma_i^2}{2} (u_j - u_j^*)^T R_{ij} (u_j - u_j^*) \tag{23}$$
Proof of Lemma 2.
Substituting $u_i, u_{-i}$ for $u_i^*, u_{-i}^*$ in (22), and adding and subtracting the optimal-policy terms, gives

$$\begin{aligned} H_i(\delta_i, \nabla V_i^*, u_i, u_{-i}) ={}& \nabla V_i^{*T} \Big( A \delta_i + (d_i + g_i) B_i u_i - \sum_{j \in N_i} a_{ij} B_j u_j + (d_i + g_i) B_i u_i^* - (d_i + g_i) B_i u_i^* - \sum_{j \in N_i} a_{ij} B_j u_j^* + \sum_{j \in N_i} a_{ij} B_j u_j^* \Big) \\ &+ \frac{1}{2} \Big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i - \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j \Big) + \frac{1}{2} \Big( u_i^{*T} R_{ii} u_i^* - u_i^{*T} R_{ii} u_i^* - \gamma_i^2 \sum_{j \in N_i} u_j^{*T} R_{ij} u_j^* + \gamma_i^2 \sum_{j \in N_i} u_j^{*T} R_{ij} u_j^* \Big) \end{aligned} \tag{24}$$
Substituting $H_i(\delta_i, \nabla V_i^*, u_i^*, u_{-i}^*) = 0$ into (24), we can obtain

$$H_i(\delta_i, \nabla V_i^*, u_i, u_{-i}) = \nabla V_i^{*T} \Big( (d_i + g_i) B_i (u_i - u_i^*) - \sum_{j \in N_i} a_{ij} B_j (u_j - u_j^*) \Big) + \frac{1}{2} \big( u_i^T R_{ii} u_i - u_i^{*T} R_{ii} u_i^* \big) - \frac{1}{2} \Big( \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j - \gamma_i^2 \sum_{j \in N_i} u_j^{*T} R_{ij} u_j^* \Big) \tag{25}$$
Completing the squares in (25), using the relationships between $u_i^*$, $u_{-i}^*$ and $\nabla V_i^*$ given by (18), yields (23). □
Remark 1.
The policies $u_{-i}$ in Section 3 are not the real policies of the neighboring nodes. They are defined only as the virtual coupling inputs from neighboring nodes, which are regarded as external disturbances entering through the same channels as the control inputs of the neighboring nodes. In this way, the bounded L2-gain attenuation of the real coupling inputs from neighboring nodes can be realized. In addition, the complex relationships among agents are decoupled virtually during the control protocol design process, and the solving process of the zero-sum game is effectively simplified: the coupled HJ equation of each agent is independent of the others.

4. Solution to Bounded L2-Gain Problem for the Coupling Attenuation and the Equivalent Algebraic Riccati Equation

4.1. Solution to Bounded L2-Gain Problem for Coupling Attenuation

In this subsection, the control policy $u_i$ is designed to guarantee that condition (10) holds for a prescribed $\gamma_i > 0$ and all $u_{-i} \in L_2[0, \infty)$. The following Theorem 1 shows that the solution of the coupled HJ equation (22) is in fact the solution to the bounded L2-gain problem for coupling attenuation.
Theorem 1.
Let $\gamma_i \geq \gamma_i^*$. Suppose the coupled HJ equation (22) has a smooth positive definite solution $V_i^* > 0, \ \forall i \in N$, and select the control policy $u_i^* = u_i(\nabla V_i^*)$ given by (18) in terms of $V_i^*$. Then the bounded L2-gain condition (10) holds for all $u_{-i} \in L_2[0, \infty)$.
Proof of Theorem 1.
According to Lemma 2,

$$H_i(\delta_i, \nabla V_i^*, u_i, u_{-i}) = \frac{1}{2} \Big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i - \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j \Big) + \frac{d V_i^*}{dt} = \frac{1}{2} (u_i - u_i^*)^T R_{ii} (u_i - u_i^*) - \sum_{j \in N_i} \frac{\gamma_i^2}{2} (u_j - u_j^*)^T R_{ij} (u_j - u_j^*) \tag{26}$$
Selecting $u_i = u_i^*$, we can obtain

$$\frac{1}{2} \Big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i - \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j \Big) + \frac{d V_i^*}{dt} = -\sum_{j \in N_i} \frac{\gamma_i^2}{2} (u_j - u_j^*)^T R_{ij} (u_j - u_j^*) \leq 0 \tag{27}$$
Integrating (27) yields

$$\frac{1}{2} \int_0^T \Big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i - \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j \Big) dt + V_i^*(\delta_i(T)) - V_i^*(\delta_i(0)) \leq 0 \tag{28}$$
Since $V_i^*$ is a smooth positive definite function, i.e., $V_i^*(\delta_i(T)) \geq 0$, one has

$$\int_0^T \big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i \big) dt \leq \gamma_i^2 \int_0^T \sum_{j \in N_i} u_j^T R_{ij} u_j \, dt + V_i^*(\delta_i(0)) \tag{29}$$
Hence, the bounded L2-gain condition (10) for coupling attenuation is satisfied. □

4.2. The Equivalent Algebraic Riccati Equation

It can be seen from the above results that the Nash equilibrium solution can be obtained by solving the coupled HJ equation (22). In this subsection, it is shown that the coupled HJ equation (22) can be transformed into an equivalent algebraic Riccati equation (ARE).
Defining the optimal value function as the quadratic form $V_i^* = \delta_i^T P_i \delta_i$, so that $\nabla V_i^* = 2 P_i \delta_i$, the corresponding optimal $u_i^*$ and $u_{-i}^*$ in (18) become

$$u_i^* = -2 (d_i + g_i) R_{ii}^{-1} B_i^T P_i \delta_i, \qquad u_j^* = -\frac{2}{\gamma_i^2} a_{ij} R_{ij}^{-1} B_j^T P_i \delta_i, \quad j \in N_i \tag{30}$$
Substituting (30) and $V_i^* = \delta_i^T P_i \delta_i$ into (21) yields

$$\delta_i^T (P_i A + A^T P_i) \delta_i + \frac{1}{2} \delta_i^T Q_i \delta_i - 2 (d_i + g_i)^2 \delta_i^T P_i B_i R_{ii}^{-1} B_i^T P_i \delta_i + \frac{2}{\gamma_i^2} \delta_i^T \Big( \sum_{j \in N_i} a_{ij}^2 P_i B_j R_{ij}^{-1} B_j^T P_i \Big) \delta_i = 0 \tag{31}$$
Since the above equation holds for all $\delta_i$, it is equivalent to

$$P_i A + A^T P_i + \frac{1}{2} Q_i - 2 (d_i + g_i)^2 P_i B_i R_{ii}^{-1} B_i^T P_i + \frac{2}{\gamma_i^2} \sum_{j \in N_i} a_{ij}^2 P_i B_j R_{ij}^{-1} B_j^T P_i = 0 \tag{32}$$
Defining the integrated matrices as

$$R = \mathrm{diag}\Big( \frac{R_{ii}}{2 (d_i + g_i)^2}, \ -\frac{\gamma_i^2 R_{ij_1}}{2 a_{ij_1}^2}, \ -\frac{\gamma_i^2 R_{ij_2}}{2 a_{ij_2}^2}, \ \dots, \ -\frac{\gamma_i^2 R_{ij_{d_i}}}{2 a_{ij_{d_i}}^2} \Big), \qquad B = [B_i \ B_{j_1} \ B_{j_2} \ \cdots \ B_{j_{d_i}}], \quad j_1, j_2, \dots, j_{d_i} \in N_i \tag{33}$$
Then, (32) can be rewritten as the ARE

$$P_i A + A^T P_i + \frac{1}{2} Q_i - P_i B R^{-1} B^T P_i = 0 \tag{34}$$
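Since the block matrix $R$ in (33) is indefinite, off-the-shelf LQR routines that assume $R > 0$ may not apply directly, but the classical stable-invariant-subspace (Hamiltonian/Schur) method still does. The following Python sketch is an illustration of that approach, not the authors' implementation; the function name `solve_game_are` is ours:

```python
import numpy as np
from scipy.linalg import schur, solve

def solve_game_are(A, B, Q, R):
    """Solve P A + A^T P + Q - P B R^{-1} B^T P = 0 for the stabilizing P.

    R may be indefinite, as in the game-type ARE (34), but must be
    nonsingular. The solution is read off the n-dimensional stable
    invariant subspace of the associated Hamiltonian matrix.
    """
    n = A.shape[0]
    S = B @ solve(R, B.T)                       # S = B R^{-1} B^T
    H = np.block([[A, -S], [-Q, -A.T]])         # Hamiltonian matrix
    # Real Schur form with left-half-plane eigenvalues ordered first
    T, Z, sdim = schur(H, output="real", sort="lhp")
    if sdim != n:
        raise np.linalg.LinAlgError("no n-dimensional stable subspace")
    U1, U2 = Z[:n, :n], Z[n:, :n]
    P = solve(U1.T, U2.T).T                     # P = U2 U1^{-1}
    return (P + P.T) / 2                        # symmetrize against round-off
```

For the ARE (34), such a routine would be called with $Q = \frac{1}{2} Q_i$ and the block matrices $B$ and $R$ from (33); a positive definite result $P_i$ certifies, via Theorem 1, that the chosen $\gamma_i$ is feasible.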
Theorem 2.
Assume that all the real control policies $u_{-i} = \{u_j \mid j \in N_i\}$ of the neighboring agents satisfy the following boundedness condition:

$$\|u_j\|^2 = u_j^T u_j \leq \xi, \quad \forall j \in N_i \tag{35}$$

Select the first equation of (30), based on the solution of the ARE (34), as the control policy $u_i$. Then the local neighborhood error vector ultimately and uniformly enters the following bounded invariant set:

$$\Omega_i = \Big\{ \delta_i \ \Big| \ \|\delta_i\|_2^2 \leq \frac{\gamma_i^2 \xi \sum_{j \in N_i} \bar{\lambda}(R_{ij})}{2 \underline{\lambda}(\bar{Q}_i)} \Big\} \tag{36}$$
where $\|\delta_i\|_2$ denotes the Euclidean norm of $\delta_i$ and $\bar{Q}_i$ is the positive definite matrix $\bar{Q}_i = \frac{1}{2} Q_i + 2 (d_i + g_i)^2 P_i B_i R_{ii}^{-1} B_i^T P_i$.
Proof of Theorem 2.
Select the optimal positive value function $V_i^* = \delta_i^T P_i \delta_i \geq 0$, in terms of the solution of the coupled HJ equation (21), as the Lyapunov function. According to Lemma 2, the derivative of $V_i^*$ is

$$\frac{d V_i^*}{dt} = -\frac{1}{2} \Big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i - \gamma_i^2 \sum_{j \in N_i} u_j^T R_{ij} u_j \Big) + \frac{1}{2} (u_i - u_i^*)^T R_{ii} (u_i - u_i^*) - \sum_{j \in N_i} \frac{\gamma_i^2}{2} (u_j - u_j^*)^T R_{ij} (u_j - u_j^*) \tag{37}$$
Selecting $u_i = u_i^*$ as the first equation of (30) yields

$$\begin{aligned} \frac{d V_i^*}{dt} ={}& -\frac{1}{2} \delta_i^T Q_i \delta_i - 2 (d_i + g_i)^2 \delta_i^T P_i B_i R_{ii}^{-1} B_i^T P_i \delta_i + \frac{\gamma_i^2}{2} \sum_{j \in N_i} u_j^T R_{ij} u_j - \sum_{j \in N_i} \frac{\gamma_i^2}{2} (u_j - u_j^*)^T R_{ij} (u_j - u_j^*) \\ \leq{}& -\delta_i^T \Big( \frac{1}{2} Q_i + 2 (d_i + g_i)^2 P_i B_i R_{ii}^{-1} B_i^T P_i \Big) \delta_i + \frac{\gamma_i^2 \xi \sum_{j \in N_i} \bar{\lambda}(R_{ij})}{2} \end{aligned} \tag{38}$$
From (38), $\frac{d V_i^*}{dt} \leq 0$ holds whenever

$$\delta_i^T \delta_i = \|\delta_i\|_2^2 \geq \frac{\gamma_i^2 \xi \sum_{j \in N_i} \bar{\lambda}(R_{ij})}{2 \underline{\lambda}\big( \frac{1}{2} Q_i + 2 (d_i + g_i)^2 P_i B_i R_{ii}^{-1} B_i^T P_i \big)} \tag{39}$$

so the error trajectory ultimately enters and remains in the region where the reverse inequality holds. Defining $\bar{Q}_i = \frac{1}{2} Q_i + 2 (d_i + g_i)^2 P_i B_i R_{ii}^{-1} B_i^T P_i$ yields the bounded invariant set (36). □
Remark 2.
Theorem 2 shows the uniform ultimate boundedness (UUB) of the local neighborhood error vector $\delta_i$. According to the bounded invariant set $\Omega_i$, the bound on $\delta_i$ can be made arbitrarily small by presetting the matrices $Q_i$, $R_{ii}$ and $R_{ij}$ in the performance index function (11). In fact, this result is conservative because the term $-\sum_{j \in N_i} \frac{\gamma_i^2}{2} (u_j - u_j^*)^T R_{ij} (u_j - u_j^*)$ in $\frac{d V_i^*}{dt}$ is omitted. The real control inputs $u_j$ of the adjacent agents in fact differ considerably from $u_j^*$, which strengthens the negative definiteness of $\frac{d V_i^*}{dt}$. Accordingly, the simulation results in the next section show that the local neighborhood error vector can converge asymptotically and uniformly to the origin.
Remark 3.
In practical applications, the matrices $Q_i$, $R_{ii}$ and $R_{ij}$ and the parameter $\gamma_i$ can be selected according to the engineering performance requirements. If a high convergence speed and synchronization accuracy are required, $Q_i$ can be selected to make its eigenvalues large; if low control energy consumption is required, $R_{ii}$ can be selected to make its eigenvalues large; and the coupling attenuation level can be adjusted through the matrices $R_{ij}$ and $\gamma_i$. It should be noted that $Q_i$, $R_{ii}$, $R_{ij}$ and $\gamma_i$ must satisfy the condition in Theorem 1, so that the coupled HJ equation (22) has a smooth positive definite solution $V_i^* > 0$; that is, (34) must have a positive definite solution $P_i$.

5. Simulation Results

This section shows the effectiveness of the equivalent ARE approach described in Section 4 and of Theorem 2. The simulation is realized in MATLAB/Simulink. Consider a class of Multiple Autonomous Unmanned homogeneous linear systems, referring to [5], given by

$$\dot{x}_i = A x_i + B_i u_i$$

where $A = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}$, $B_1 = \begin{bmatrix} 2 \\ 1 \end{bmatrix}$, $B_2 = \begin{bmatrix} 2 \\ 3 \end{bmatrix}$, $B_3 = \begin{bmatrix} 2 \\ 2 \end{bmatrix}$ and $B_4 = \begin{bmatrix} 1 \\ 1 \end{bmatrix}$, with the leader dynamics $\dot{x}_0 = A x_0$. The communication digraph structure is shown in Figure 1. The edge weights and the pinning gains are taken equal to 1.
The selected weight matrices in (11) are $R_{11} = 9$, $R_{14} = 1$, $R_{22} = 9$, $R_{21} = 1$, $R_{33} = 9$, $R_{32} = 1$, $R_{44} = 9$, $R_{43} = 1$ and $Q_1 = Q_2 = Q_3 = Q_4 = \begin{bmatrix} 8 & 0 \\ 0 & 8 \end{bmatrix}$. The bounded L2-gain coefficients in (11) for the agents are preset as $\gamma_1 = 1.75$, $\gamma_2 = 3.75$, $\gamma_3 = 4.5$ and $\gamma_4 = 6.25$. The cooperative control protocol of each agent is implemented as in Section 4.2, where the solutions of the ARE (34) are

$$P_1 = \begin{bmatrix} 4.8954 & 5.6084 \\ 5.6084 & 9.4934 \end{bmatrix}, \quad P_2 = \begin{bmatrix} 41.9228 & 7.3129 \\ 7.3129 & 2.6742 \end{bmatrix}, \quad P_3 = \begin{bmatrix} 14.4257 & 9.363 \\ 9.363 & 10.4565 \end{bmatrix}, \quad P_4 = \begin{bmatrix} 17.4696 & 0.5066 \\ 0.5066 & 13.6 \end{bmatrix}$$
Remark 4.
In the simulation design process, $\gamma_i$ should be gradually reduced, under the premise that (34) retains a positive definite solution $P_i$, to search for a feasible and high coupling attenuation level. Using the ARE solver in MATLAB, it is very convenient to solve (34) and obtain $P_i$; the design of the coupling attenuation controller can then be completed according to (30).
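The γ-search of Remark 4 can be sketched in Python as follows. This is an illustration only: it assumes the node 1 agent has the single neighbor $j = 4$ with $a_{14} = 1$ (as the nonzero weight $R_{14}$ suggests), a pinning gain $g_1 = 1$, and the sign pattern of $A$ reconstructed above; a compact copy of the Schur-based solver from Section 4.2 is inlined to keep the snippet self-contained:

```python
import numpy as np
from scipy.linalg import schur, solve

def solve_game_are(A, B, Q, R):
    # Stable-subspace solution of P A + A^T P + Q - P B R^{-1} B^T P = 0
    n = A.shape[0]
    H = np.block([[A, -B @ solve(R, B.T)], [-Q, -A.T]])
    T, Z, sdim = schur(H, output="real", sort="lhp")
    if sdim != n:
        raise np.linalg.LinAlgError("no stabilizing solution")
    return solve(Z[:n, :n].T, Z[n:, :n].T).T

A = np.array([[0.0, 1.0], [-1.0, 0.0]])     # drift matrix (sign pattern assumed)
B1, B4 = np.array([[2.0], [1.0]]), np.array([[1.0], [1.0]])
d1, g1, a14 = 1.0, 1.0, 1.0                 # in-degree, pinning gain (assumed), edge weight
Q1, R11, R14 = 8.0 * np.eye(2), 9.0, 1.0
Bblk = np.hstack([B1, B4])                  # B = [B_i  B_j] as in (33)

gamma = 6.0
while gamma > 0.5:
    # R = diag(R_ii / (2 (d_i+g_i)^2), -gamma^2 R_ij / (2 a_ij^2)) as in (33)
    Rblk = np.diag([R11 / (2.0 * (d1 + g1) ** 2),
                    -gamma ** 2 * R14 / (2.0 * a14 ** 2)])
    try:
        P1 = solve_game_are(A, Bblk, 0.5 * Q1, Rblk)
    except np.linalg.LinAlgError:
        break
    if np.any(np.linalg.eigvalsh((P1 + P1.T) / 2.0) <= 0):
        break                               # P_1 no longer positive definite
    K1 = 2.0 * (d1 + g1) / R11 * (B1.T @ P1)  # u_1* = -K1 @ delta_1, from (30)
    print(f"gamma = {gamma:.2f} feasible")
    gamma -= 0.25
```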
To evaluate the bounded L2-gain condition for coupling attenuation, the following variable $C_\gamma$ is introduced based on (29):

$$C_\gamma = \int_0^T \big( \delta_i^T Q_i \delta_i + u_i^T R_{ii} u_i \big) dt - \gamma_i^2 \int_0^T \sum_{j \in N_i} u_j^T R_{ij} u_j \, dt - V_i^*(\delta_i(0))$$

That is, $C_\gamma \leq 0$ means that the bounded L2-gain condition (10) is satisfied.
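Given trajectories logged from such a simulation, $C_\gamma$ can be evaluated numerically; a minimal sketch for the node 1 agent, whose only neighbor is assumed to be node 4 (the array names are hypothetical):

```python
import numpy as np

def c_gamma(t, delta1, u1, u4, Q1, R11, R14, gamma1, P1):
    """Numerically evaluate C_gamma for node 1 from sampled trajectories.

    t: (T,) sample times; delta1: (T, 2) local error; u1, u4: (T, 1) inputs.
    """
    cost = np.einsum("ti,ij,tj->t", delta1, Q1, delta1) + R11 * u1[:, 0] ** 2
    coup = gamma1 ** 2 * R14 * u4[:, 0] ** 2
    V1_0 = delta1[0] @ P1 @ delta1[0]       # V_1*(delta_1(0)) = delta^T P_1 delta
    return np.trapz(cost, t) - np.trapz(coup, t) - V1_0
```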
The local neighborhood error vector of each agent is shown in Figure 2, and Figure 3 is the 3-D phase plane plot of the system's evolution for agents 1, 2, 3, 4 and leader 0. The $C_\gamma$ of the node 1 agent is shown in Figure 4. As can be seen from Figure 2 and Figure 3, the neighborhood error vectors converge asymptotically and uniformly to the origin and all agents in the communication digraph are eventually synchronized, which is consistent with Remark 2. Figure 4 shows that $C_\gamma$ is always negative, which means that the node 1 agent satisfies the bounded L2-gain condition for coupling attenuation with $\gamma_1 = 1.75$. The effectiveness of the proposed method is thus verified. A reproduction sketch of the closed loop is given below.
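For readers who wish to reproduce the qualitative behavior of Figures 2 and 3, the following end-to-end sketch integrates the closed loop under stated assumptions: the digraph is taken to be the ring 4 → 1 → 2 → 3 → 4 suggested by the nonzero weights $R_{14}, R_{21}, R_{32}, R_{43}$, every node is assumed to be pinned with $g_i = 1$, the sign pattern of $A$ is as reconstructed above, and the published $P_i$ values are hardcoded; none of these graph details are confirmed by Figure 1:

```python
import numpy as np
from scipy.integrate import solve_ivp

A = np.array([[0.0, 1.0], [-1.0, 0.0]])                 # sign pattern assumed
B = [np.array([[2.0], [1.0]]), np.array([[2.0], [3.0]]),
     np.array([[2.0], [2.0]]), np.array([[1.0], [1.0]])]
P = [np.array([[4.8954, 5.6084], [5.6084, 9.4934]]),
     np.array([[41.9228, 7.3129], [7.3129, 2.6742]]),
     np.array([[14.4257, 9.363], [9.363, 10.4565]]),
     np.array([[17.4696, 0.5066], [0.5066, 13.6]])]
nbr = [3, 0, 1, 2]          # 0-indexed neighbor of each agent (assumed ring)
Rii, d, g = 9.0, 1.0, 1.0   # control weight, in-degree, pinning gain (assumed)

def rhs(t, y):
    x0, x = y[:2], y[2:].reshape(4, 2)
    dx = [A @ x0]                                        # leader: x0_dot = A x0
    for i in range(4):
        delta = (x[i] - x[nbr[i]]) + g * (x[i] - x0)     # local error, Eq. (3)
        u = -2.0 * (d + g) / Rii * (B[i].T @ P[i] @ delta)   # first eq. of (30)
        dx.append(A @ x[i] + (B[i] @ u).ravel())
    return np.concatenate(dx)

rng = np.random.default_rng(1)
y0 = np.concatenate([[1.0, 0.0], rng.uniform(-2.0, 2.0, size=8)])
sol = solve_ivp(rhs, (0.0, 20.0), y0, max_step=0.01)
err = sol.y[2:, -1].reshape(4, 2) - sol.y[:2, -1]        # x_i(T) - x_0(T)
print("final synchronization errors:", np.linalg.norm(err, axis=1))
```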

6. Conclusions

This paper provides a novel idea for the synchronization control of Multiple Autonomous Unmanned linear systems, in which the coupling part of the local neighborhood error dynamics is considered as a virtual external disturbance, so as to decouple the multi-agent cooperative control problem into relatively independent bounded L2-gain problems for coupling attenuation. Optimal control theory and differential game theory are utilized to formulate the bounded L2-gain problem as a centralized multi-player zero-sum game. It is shown that the solution to the multi-player zero-sum game is equivalent to the solution of a coupled HJ equation, that the coupled HJ equation can be transformed into an algebraic Riccati equation (ARE), and that the solution guarantees the uniform ultimate boundedness (UUB) of the local neighborhood error vector under conservative conditions. A rule for parameter selection is also summarized. The simulation results show that the proposed method ensures that the local neighborhood error vectors converge asymptotically to the origin, that is, the multiple autonomous unmanned linear systems achieve final synchronization, which indicates that the UUB bound on the errors is conservative. Meanwhile, the bounded L2-gain condition for coupling attenuation is guaranteed.
The proposed method is suitable for the cooperative consensus control of various homogeneous multiple autonomous unmanned linear systems, such as underwater robot swarms and aerial UAV swarms. Future work will focus on extending this method to nonlinear multi-agent systems and to more realistic models with uncertainties.

Author Contributions

Conceptualization, Y.L. and B.W.; methodology, Y.L.; software, Y.L.; validation, Y.L. and Y.C.; formal analysis, Y.C.; investigation, Y.L.; resources, B.W.; data curation, Y.L.; writing—original draft preparation, Y.L.; writing—review and editing, Y.L.; visualization, Y.L. and B.W.; supervision, B.W.; project administration, Y.C.; funding acquisition, B.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (NSFC), grant number 51777058, and in part by the Six Talent Peaks Project in the Jiangsu province, grant number XNY-010.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhou, Y.; Li, D.; Gao, F. Optimal synchronization control for heterogeneous multi-agent systems: Online adaptive learning solutions. Asian J. Control 2021. [Google Scholar] [CrossRef]
  2. Jing, G.; Zheng, Y.; Wang, L. Consensus of Multiagent Systems With Distance-Dependent Communication Networks. IEEE Trans. Neural Netw. Learn. Syst. 2016, 28, 2712–2726. [Google Scholar] [CrossRef] [PubMed]
  3. Liu, J.; Dai, M.-Z.; Zhang, C.; Wu, J. Edge-Event-Triggered Synchronization for Multi-Agent Systems with Nonlinear Controller Outputs. Appl. Sci. 2020, 10, 5250. [Google Scholar] [CrossRef]
  4. Shi, H.; Hou, M.; Wu, Y. Distributed Control for Leader-Following Consensus Problem of Second-Order Multi-Agent Systems and Its Application to Motion Synchronization. Appl. Sci. 2019, 9, 4208. [Google Scholar] [CrossRef] [Green Version]
  5. Vamvoudakis, K.G.; Lewis, F.L.; Hudas, G.R. Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality. Automatica 2012, 48, 1598–1611. [Google Scholar] [CrossRef]
  6. Vamvoudakis, K.G.; Lewis, F.L. Multi-agent differential graphical games. In Proceedings of the 30th Chinese Control Conference, Yantai, China, 22–24 July 2011. [Google Scholar]
  7. Liu, J.; Xu, F.; Lin, S.; Cai, H.; Yan, S. A Multi-Agent-Based Optimization Model for Microgrid Operation Using Dynamic Guiding Chaotic Search Particle Swarm Optimization. Energies 2018, 11, 3286. [Google Scholar] [CrossRef] [Green Version]
  8. Lowe, R.; Wu, Y.; Tamar, A.; Harb, J.; Abbeel, P.; Mordatch, I. Multi-agent actor–critic for mixed cooperative-competitive environments. arXiv 2017, arXiv:1706.02275. [Google Scholar] [CrossRef]
  9. Liu, Y.; Geng, Z. Finite-time optimal formation control of multi-agent systems on the Lie group SE(3). Int. J. Control 2013, 86, 1675–1686. [Google Scholar] [CrossRef]
  10. Zhang, W.; Hu, J. Optimal multi-agent coordination under tree formation constraints. IEEE Trans. Autom. Control. 2008, 53, 692–705. [Google Scholar] [CrossRef] [Green Version]
  11. Lin, W.; Zhao, W.; Liu, H. Robust Optimal Formation Control of Heterogeneous Multi-Agent System via Reinforcement Learning. IEEE Access 2020, 8, 218424–218432. [Google Scholar] [CrossRef]
  12. Arulkumaran, K.; Deisenroth, M.P.; Brundage, M.; Bharath, A.A. Deep Reinforcement Learning: A Brief Survey. IEEE Signal Process. Mag. 2017, 34, 26–38. [Google Scholar] [CrossRef] [Green Version]
  13. Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G.; et al. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef] [PubMed]
  14. Vinyals, O.; Babuschkin, I.; Czarnecki, W.M.; Mathieu, M.; Dudzik, A.; Chung, J.; Choi, D.H.; Powell, R.; Ewalds, T.; Georgiev, P.; et al. Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature 2019, 575, 350–354. [Google Scholar] [CrossRef]
  15. Levine, S.; Pastor, P.; Krizhevsky, A.; Ibarz, J.; Quillen, D. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection. Int. J. Robot. Res. 2017, 37, 421–436. [Google Scholar] [CrossRef]
  16. Tatari, F.; Vamvoudakis, K.G.; Mazouchi, M. Optimal distributed learning for disturbance rejection in networked non-linear games under unknown dynamics. IET Control Theory Appl. 2019, 13, 2838–2848. [Google Scholar] [CrossRef]
  17. Jiao, Q.; Modares, H.; Xu, S.; Lewis, F.L.; Vamvoudakis, K.G. Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control. Automatica 2016, 69, 24–34. [Google Scholar] [CrossRef] [Green Version]
  18. Qin, J.; Li, M.; Shi, Y.; Ma, Q.; Zheng, W.X. Optimal Synchronization Control of Multiagent Systems With Input Saturation via Off-Policy Reinforcement Learning. IEEE Trans. Neural Netw. Learn. Syst. 2019, 30, 85–96. [Google Scholar] [CrossRef]
  19. Kamalapurkar, R.; Walters, P.; Dixon, W. Model-Based Reinforcement Learning for Approximate Optimal Regulation. Control. Complex Syst. 2016, 247–273. [Google Scholar] [CrossRef]
  20. Farrell, J.A. Persistence of excitation conditions in passive learning control. Automatica 1997, 33, 699–703. [Google Scholar] [CrossRef]
  21. Kamalapurkar, R.; Andrews, L.; Walters, P.; Dixon, W.E. Model-Based Reinforcement Learning for Infinite-Horizon Approximate Optimal Tracking. IEEE Trans. Neural Netw. Learn. Syst. 2016, 28, 753–758. [Google Scholar] [CrossRef]
  22. Yasini, S.; Karimpour, A.; Sistani, M.-B.N.; Modares, H. Online concurrent reinforcement learning algorithm to solve two-player zero-sum games for partially unknown nonlinear continuous-time systems. Int. J. Adapt. Control Signal Process. 2014, 29, 473–493. [Google Scholar] [CrossRef]
  23. Vamvoudakis, K.G.; Lewis, F.L. Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton–Jacobi equations. Automatica 2011, 47, 1556–1569. [Google Scholar] [CrossRef]
  24. Bucolo, M.; Buscarino, A.; Fortuna, L.; Frasca, M. LQG control of linear lossless positive-real systems: The continuous-time and discrete-time cases. Int. J. Dyn. Control 2022, 10, 1075–1083. [Google Scholar] [CrossRef]
  25. Buscarino, A.; Fortuna, L.; Frasca, M.; Rizzo, A. Dynamical network interactions in distributed control of robots. Chaos: Interdiscip. J. Nonlinear Sci. 2006, 16, 015116. [Google Scholar] [CrossRef]
  26. Khoo, S.; Xie, L.; Man, Z. Robust Finite-Time Consensus Tracking Algorithm for Multirobot Systems. IEEE/ASME Trans. Mechatron. 2009, 14, 219–228. [Google Scholar] [CrossRef]
  27. Brewer, J. Kronecker products and matrix calculus in system theory. IEEE Trans. Circuits Syst. 1978, 25, 772–781. [Google Scholar] [CrossRef]
Figure 1. The communication digraph structure.
Figure 2. The local neighborhood error vector of each agent: (a) The node 1 agent; (b) The node 2 agent; (c) The node 3 agent; (d) The node 4 agent.
Figure 3. The 3-D phase plane plot of the system’s evolution for agents 1, 2, 3, 4 and leader 0.
Figure 4. The $C_\gamma$ of the node 1 agent.
