RBFNN-Based Distributed Coverage Control on an Unknown Region

Zhang, Ankang; Wang, Xiaoling

doi:10.3390/math12010111


by Ankang Zhang ^1,2 and Xiaoling Wang ^1,2,*

¹ College of Automation & Artificial Intelligence, Nanjing University of Posts and Telecommunications, Nanjing 210023, China

² Jiangsu Engineering Center for IOT Intelligent Robots, Nanjing 210023, China

* Author to whom correspondence should be addressed.

Mathematics 2024, 12(1), 111; https://doi.org/10.3390/math12010111

Submission received: 22 November 2023 / Revised: 15 December 2023 / Accepted: 25 December 2023 / Published: 28 December 2023

(This article belongs to the Special Issue Mathematic Control and Artificial Intelligence)


Abstract:

In this paper, we investigate the problem of achieving distributed coverage control of a mobile sensor network on an unknown region using local measurements. To accomplish this objective, each sensor is equipped with two-layer dynamics. The upper layer dynamic employs a completely distributed observer algorithm on the target region for state estimation of the density function. The lower layer dynamic utilizes a radial basis function neural network-based motion algorithm, which involves only the estimated state obtained by the upper layer dynamics, to guide the sensors towards an optimal coverage configuration. We demonstrate that with only the joint detectability of the partial outputs measurement, it is possible to achieve distributed coverage control in the unknown region without requiring additional information about the density function, communication topology associated with the sensors, or coupling gains. Finally, two examples are used to validate the theoretical findings.

Keywords:

coverage control; distributed observer; density function; radial basis function neural network

MSC:

93A16

1. Introduction

As a branch of cooperative control of multi-agent systems [1,2,3,4], coverage control has attracted increasing attention [5,6,7,8,9]. In [10], a decentralized adaptive control law is proposed to drive a network of mobile robots to an optimal sensing configuration. In [11], the blanket coverage problem is addressed for a long region by imposing the dynamics of the boundaries on each agent's control law, which ensures locally optimal partitioning of the moving coverage area. These works achieve optimal coverage in static [10] and dynamic [11] regions, respectively, with known environmental information. In practice, however, the region information is often unknown, and much attention has therefore been paid to the coverage control of unknown regions. With unknown environment information, a robust coverage control algorithm [6] is proposed to guide a unicycle robot network to the ideal configuration on the basis of an approximated density function. For a sensor network, the main objective of coverage control is to drive the sensors to distribute over a region while aggregating in locations of high interest [12]. There is a considerable amount of literature on the coverage control of a region with multiple robots, such as [6,7,8,9]. A dynamic coverage planning technique is given in [7], which removes the convexity requirement on the targeted area by using the K-means approach. In [8,9], a deep reinforcement learning algorithm is introduced to address the problem of region coverage and exploration. Voronoi partition is one of the common tools for studying the coverage control of multi-agent systems [9,13]. Further, the communication between agents should be considered when mobile sensor networks cover the targeted area. A hybrid scheme is proposed in [14] that decouples the optimization of the coverage objective from the control of the communication variables, thereby optimizing both coverage and the routing of information. With the Voronoi partition method, a convex region can be divided into several Voronoi cells, where the number of cells equals the number of sensors, with one sensor per cell. The underlying idea of Voronoi-based coverage control is to design a control algorithm for each sensor that optimizes a cost function so as to drive the sensor to the optimal coverage configuration. Namely, in coverage control, the sensors are not dropped into the targeted region randomly but are deployed at the optimal locations of the corresponding Voronoi cells. This feature makes Voronoi-based coverage control widely used in practical applications, such as monitoring and surveillance of targeted ocean regions.

As stated above, in the coverage control of a sensor network, the optimal location of each sensor is affected by its interest elements. Generally speaking, the interest elements are unknown, which motivates observer-based coverage control [6]. In [6], a state estimation algorithm [15] is first equipped on each sensor to estimate the information of the targeted region; then, a coverage control algorithm is given to guide the motion of each sensor to the optimal coverage configuration. However, on the one hand, coverage control is often associated with targeted regions that are expansive, which makes it difficult to guarantee the detectability or observability of the interest elements through a single sensor measurement. On the other hand, consider the example of monitoring an oceanic region. The environmental variables to be monitored include temperature, pH, salinity, chemical plumes, and more, and measuring all of these elements using a single sensor is arduous, if not impossible. Under these circumstances, a distributed observer [16,17,18,19,20] becomes necessary. In a distributed observer, a network of sensors is deployed at different spatial locations to measure the output information of the targeted system, and a distributed state estimation algorithm is integrated into each sensor. Each sensor only needs to access partial outputs, that is, its local measurement; communication of the estimated states among the sensors then makes the full state estimation possible. Under the condition that the information of the targeted region is unknown and only partial output information of the objects is available, this paper approximates the unknown density function of the targeted region by using the radial basis function neural network (RBFNN) and considers the distributed observer-based coverage control problem of a sensor network.

1.1. Contribution

In this paper, an RBFNN-based distributed coverage algorithm is designed for a mobile sensor network to form an optimal coverage configuration on a complex targeted region with multiple objects. The difficulties, significant manipulations, and innovations discussed in this work are summarized as follows:

For a convex targeted region with multiple objects, considering the fact that the objects are unknown and that they may be spread over a vast region, two-layer dynamics is endowed to each sensor, where the upper dynamics is a distributed observer algorithm implementing the state estimation on the targets via the partial measurement, while the lower dynamics aims at guiding the motion of the i-th sensor to cover the multiple objects in the targeted region. The fact that each sensor in the distributed observer has only to measure partial outputs of the objects makes the distributed observer well suited for state estimation of objects spread over a vast region.
Notice that the lower layer dynamics is a negative feedback law driving each sensor to reach its own optimal location. With the estimated state of the objects given by the distributed observer algorithm in the upper layer dynamics, the RBFNN is adopted to estimate the unknown density function of the targeted region, which in turn determines the optimal location of each sensor through an estimated state-based cost function. Hence, the coverage motion control algorithm constructed in this paper relies only on the partial output information of the objects.
The Voronoi partitions and coverage motion of sensors result in a sensor network with dynamic communication topology. To the best of our knowledge, the existing Voronoi-based coverage control results of sensor networks are dependent on the communication topology of the sensors. In this paper, in order to eliminate the dependence of the distributed observer algorithm in the upper layer dynamics of each sensor on this dynamic communication topology, Lyapunov functions irrelevant to the communication topology are constructed to solve the completely distributed observer-based coverage control problem, eliminating the influence of the changes in the communication topology.

With these significant manipulations, coverage control of a complex region with multiple objects is accomplished by a sensor network, where each sensor can only access a part of the outputs and the partial outputs are jointly detectable with the system matrix.

1.2. Organization

The rest of this paper is organized as follows. Section 2 presents some preliminaries and the problem statement and designs the motion control algorithm. By adopting RBFNN to estimate the unknown density function of the targeted region, Section 3 proposes a distributed observer-based coverage control. Further, a density function observer without any global topology information is designed by using the adaptive strategy. In Section 4, numerical simulations are provided to verify the theoretical results and Section 5 concludes the whole paper.

2. Preliminary and Problem Statement

2.1. Notation

$R^{n \times m}$ denotes the set of $n \times m$ real matrices, and $I_{n}$ is the $n \times n$ identity matrix. For a matrix $A \in R^{n \times m}$, $\ker (A)$ and $im (A)$ denote its kernel and image, respectively; that is, $\ker (A) = \{x \in R^{m \times 1} | A x = 0\}$ and $im (A) = \{y \in R^{n \times 1} | y = A x, x \in R^{m \times 1}\}$. $A^{⊥}$ denotes the orthogonal complement of A. $λ_{min} (A)$ and $λ_{max} (A)$ stand for the minimum and maximum eigenvalues of A, respectively. $∥ \cdot ∥$ is the 2-norm of a matrix or a vector, and $A ≻ 0$ means that matrix A is positive definite. Throughout the paper, $γ_{0} > 0$ is an arbitrary constant.

2.2. Graph Theory

The communication topology of the N sensors is determined by the Voronoi partition $\{V_{1}, \dots, V_{N}\}$ of $Ω$, where the i-th sensor is located in the Voronoi cell $V_{i}$. A triple $G (t) = (\bar{N}, E (t), W (t))$ is used to describe the communication relation, where $\bar{N} = \{1, 2, \dots, N\}$ is the node set with the element i denoting the i-th sensor, and $E (t) \subseteq \{(i, j) | i, j \in \bar{N}\}$ is the time-varying edge set, which changes with the Voronoi partition of $Ω$. In $E (t)$, the unordered pairs of vertices represent the undirected neighboring relations among the sensors. As stated in [9], $(i, j) \in E (t)$ if the Voronoi cells $V_{i}$ and $V_{j}$ share a common edge. Associated with $E (t)$, the third element $W (t) = (w_{i j} (t)) \in R^{N \times N}$ of $G (t)$ is the adjacency matrix, whose time-varying elements $w_{i j} (t)$ with $i, j \in \bar{N}$ are defined as

w_{i j} (t) = \{\begin{matrix} 1, & if (i, j) \in E (t), \\ 0, & otherwise . \end{matrix}

Then, the Laplacian matrix of $G (t)$ can be defined as $L (t) = (l_{i j} (t)) \in R^{N \times N}$ with

l_{i j} (t) = \{\begin{matrix} - w_{i j} (t), & if i \neq j, \\ \sum_{k = 1}^{N} w_{i k} (t), & if i = j . \end{matrix}

Moreover, based on the adjacency matrix, the time-varying neighbor set of sensor i can be defined by $N_{i} (t) = \{j \in \bar{N} | w_{i j} (t) = 1, j \neq i\}$ for $i = 1, \dots, N$.
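As a minimal sketch of the definitions above, the following Python snippet builds the Laplacian matrix from a 0/1 adjacency matrix and reads off a neighbor set. The function names and the three-sensor path graph are illustrative assumptions and are not taken from the paper.

```python
def laplacian(W):
    """Return L = (l_ij) with l_ij = -w_ij for i != j and l_ii = sum_k w_ik."""
    n = len(W)
    return [[sum(W[i]) if i == j else -W[i][j] for j in range(n)]
            for i in range(n)]


def neighbors(W, i):
    """Neighbor set N_i = {j : w_ij = 1, j != i} (0-based indices)."""
    return {j for j, w in enumerate(W[i]) if w == 1 and j != i}


# Hypothetical snapshot: cells V_1-V_2 and V_2-V_3 share an edge.
W = [[0, 1, 0],
     [1, 0, 1],
     [0, 1, 0]]
L = laplacian(W)
```

Because the Voronoi partition changes as the sensors move, W and L would have to be recomputed whenever the set of adjacent cells changes.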

Next, the problem is formulated and the algorithms that play a key role are given.

2.3. Problem Statement

In this paper, we aim at covering a complex targeted region

Ω \subset R^{2}

, which is convex and contains multiple objects. Suppose that there are

M \in Z

objects in

Ω

, which are governed by the following dynamics:

\dot{x} = A x,

(1)

where

x \in R^{n}

describes the state of the M objects; A is the system matrix with compatible dimension. Notice that the state x not only contains the positions of the M targets but also some other variables. Without loss of generality, we assume that the first

2 M

elements in x, denoted by

x_{p} \in R^{2 M \times 1}

, describe the positions of the M targets, where

2 M \leq n

and

x_{p}^{T} = [\begin{matrix} x_{1 x} & x_{1 y} & x_{2 x} & x_{2 y} & \dots & x_{M x} & x_{M y} \end{matrix}],

with

{[\begin{matrix} x_{i x} & x_{i y} \end{matrix}]}^{T} \in R^{2} (i = 1, \dots, M)

describing the position of the i-th target. For brevity, we denote

x = {[\begin{matrix} x_{p}^{T} & x_{*}^{T} \end{matrix}]}^{T}

with

x_{*} \in R^{n - 2 M}

being some other variables. For example, in the monitoring and surveillance of a targeted ocean region,

x_{*}

may represent the temperature, pH, salinity, or chemical plumes.

If the targeted region

Ω

is known, the information of the objects is certainly known. In this case, as was done in [9], a density function with Gaussian distribution is given to describe the influence of the M objects on

Ω

, which is given as

\begin{matrix} ϕ (q, x_{s}) = \sum_{s = 1}^{M} μ_{s} ϕ_{s} (q, x_{s}) = \sum_{s = 1}^{M} μ_{s} e^{- \frac{∥ q - x_{s} ∥^{2}}{2 σ_{s}^{2}}} = \sum_{s = 1}^{M} μ_{s} e^{- \frac{{(q_{x} - x_{s x})}^{2}}{2 σ_{s}^{2}} - \frac{{(q_{y} - x_{s y})}^{2}}{2 σ_{s}^{2}}}, \end{matrix}

(2)

where

q = {[\begin{matrix} q_{x} & q_{y} \end{matrix}]}^{T}

is the position of the point of

q \in Ω

,

x_{s} = {[\begin{matrix} x_{s x} & x_{s y} \end{matrix}]}^{T}

is the position of the s-th object with

s = 1, \dots, M

,

σ_{s} > 0

is a spatial sensitivity coefficient which determines the width of the density function around the s-th object,

μ_{s} > 0

is the weight which describes the influence of the s-th object, and

ϕ_{s} (q, x_{s}) = e^{- \frac{∥ q - x_{s} ∥^{2}}{2 σ_{s}^{2}}}

is the influence model of the s-th object.
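To make the density function in (2) concrete, the sketch below evaluates it at a query point. The list-of-tuples layout for the objects and all numerical values are assumptions chosen for illustration only.

```python
import math


def density(q, objects):
    """Evaluate phi(q) = sum_s mu_s * exp(-||q - x_s||^2 / (2 sigma_s^2)).

    `objects` is a list of (x_s, mu_s, sigma_s) tuples; this container
    layout is an assumption made for the example.
    """
    total = 0.0
    for (xs, ys), mu, sigma in objects:
        d2 = (q[0] - xs) ** 2 + (q[1] - ys) ** 2
        total += mu * math.exp(-d2 / (2.0 * sigma ** 2))
    return total


# Two hypothetical objects with different weights and spreads.
objects = [((0.3, 0.3), 1.0, 0.1), ((0.7, 0.6), 2.0, 0.2)]
peak = density((0.7, 0.6), objects)   # evaluated at the second object
far = density((0.0, 1.0), objects)    # evaluated far from both objects
```

At the position of an object, its own term contributes exactly the weight $μ_{s}$, so the value there is at least $μ_{s}$, and the density decays with distance from every object.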

In this paper, a sensor network consisting of N agents is used to cover

Ω

, where each sensor regulates its location according to the following dynamics:

{\dot{p}}_{i} = u_{i}, i = 1, \dots, N,

(3)

where

p_{i} = {[\begin{matrix} p_{i x} & p_{i y} \end{matrix}]}^{T} \in R^{2}

is the position of the i-th sensor, and the control input

u_{i} \in R^{2}

to be designed is the velocity of the sensor, which drives the sensor to an optimal location for the coverage of

Ω

in a distributed manner. What is more, the communication among the sensors is determined by the Voronoi partition of

Ω

.

Associated with the N sensors,

Ω

is partitioned into N Voronoi cells, denoted by

{V_{1}, \dots, V_{N}}

. In particular, by [21],

V_{i}

is defined as

V_{i} = {q \in Ω | ∥ q - p_{i} ∥ \leq ∥ q - p_{j} ∥, \forall j \neq i},

which indicates that the Voronoi partitions are determined by the positions of the sensors. Then, to describe the coverage effect of each agent, the cost function

Q (p, V, t) = \sum_{i = 1}^{N} \int_{V_{i}} f (∥ q - p_{i} ∥) ϕ (q, t) d q,

(4)

is used, where the sensing performance of the i-th sensor at a point

q \in Ω

is a function of the Euclidean distance

∥ q - p_{i} ∥

. In (4), the distance function

f (∥ q - p_{i} ∥)

is defined as

f (∥ q - p_{i} ∥) = α e^{- β ∥ q - p_{i} ∥^{2}},

with

α

,

β > 0

. Qualitatively, the larger the value of

Q

, the better the configuration for sensory coverage of the region

Ω

. Note that the density function

ϕ (q, t)

is affected by the position of the target.
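The Voronoi assignment and the cost in (4) can be approximated numerically. The following sketch uses a midpoint rule on the unit square; the choice of region, the grid size, the parameter values, and the single-bump density are all assumptions made for illustration.

```python
import math


def f(dist2, alpha=1.0, beta=1.0):
    """Sensing performance f = alpha * exp(-beta * ||q - p_i||^2)."""
    return alpha * math.exp(-beta * dist2)


def voronoi_owner(q, sensors):
    """Index of the sensor closest to q (ties go to the lowest index)."""
    d2 = [(q[0] - px) ** 2 + (q[1] - py) ** 2 for px, py in sensors]
    return d2.index(min(d2))


def coverage_cost(sensors, phi, n=40):
    """Midpoint-rule approximation of Q in (4) on the unit square."""
    h = 1.0 / n
    total = 0.0
    for a in range(n):
        for b in range(n):
            q = ((a + 0.5) * h, (b + 0.5) * h)
            i = voronoi_owner(q, sensors)
            d2 = (q[0] - sensors[i][0]) ** 2 + (q[1] - sensors[i][1]) ** 2
            total += f(d2) * phi(q) * h * h
    return total


# Hypothetical density: a single Gaussian bump centred at (0.7, 0.7).
phi = lambda q: math.exp(-((q[0] - 0.7) ** 2 + (q[1] - 0.7) ** 2) / 0.02)
near = coverage_cost([(0.7, 0.7), (0.2, 0.2)], phi)
away = coverage_cost([(0.1, 0.9), (0.2, 0.2)], phi)
```

In this example, placing a sensor on the bump gives a larger value of Q than keeping both sensors away from it, which matches the qualitative statement that a larger Q indicates better coverage.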

However, on one hand, it is hard or even impossible to obtain the real position of each object directly in some practical applications, so that

x_{s}

cannot be used directly in (2). The positions therefore have to be reconstructed from the measured output y. Moreover, because the region is large, the output cannot be measured by a single agent but requires a network of agents, each of which measures only a part of the outputs. Specifically, the output measured by the i-th agent is

y_{i} = C_{i} x,

(5)

where

C_{i} \in R^{p_{i} \times n}

contains a subset of the rows of the full output matrix C, with y = C x. The total measurement obtained by all sensors is the stack of all local measurements, namely

{col}_{i = 1, \dots, N} {y_{i}} = ({col}_{i = 1, \dots, N} {C_{i}}) x

. On the other hand, the targeted region is unknown which implies that in (2),

μ_{s}

is unknown, so novel approaches are required to estimate it. Next, the RBFNN is introduced to calculate the density function of the targeted region.

2.4. RBFNN-Based Estimation on $ϕ_{s} (q, x_{s})$

In this subsection, our objective is to enhance the estimation of the density function for the targeted region using the RBFNN technique. Denote the estimated state of the i-th sensor on x by

{\hat{x}}_{i}

. Likewise, denote the estimated state of the i-th sensor on

x_{s}

by

{\hat{x}}_{i, s}

. In this case, the RBFNN-based estimation on the density function

ϕ_{s} (q, x_{s})

is carried out on the

ϕ_{s} (q, {\hat{x}}_{i, s})

.

The analysis is performed assuming that the pairs

(C_{i}, A), i = 1, \dots, N,

are jointly detectable; however, the detectability of each individual pair

(C_{i}, A)

cannot be guaranteed. Therefore, a detectability decomposition is necessary. Following [17], let

f_{A} (s) = \det (s I_{n} - A)

denote the characteristic polynomial of matrix A. Factor it as

f_{A} (s) = f_{A}^{+} (s) f_{A}^{-} (s)

, where

f_{A}^{+} (s)

and

f_{A}^{-} (s)

are the polynomials with roots in the closed right and open left half-planes of the complex plane, respectively. Then, the undetectable subspace of $(C_{i}, A)$ is given by $\mathcal{U}_{i} = [\cap_{ℓ = 1}^{n} \ker (C_{i} A^{ℓ - 1})] \cap [\ker (f_{A}^{+} (A))]$. Let $ϱ_{i}$ be the dimension of $\mathcal{U}_{i}$ and $\mathcal{U}_{i}^{⊥}$ be the orthogonal complement of $\mathcal{U}_{i}$, where

0 \leq ϱ_{i} \leq n .

(6)

Let $U_{i} \in R^{n \times ϱ_{i}}$ and $D_{i} \in R^{n \times (n - ϱ_{i})}$ be the matrices whose columns are orthonormal bases of $\mathcal{U}_{i}$ and $\mathcal{U}_{i}^{⊥}$, respectively. Then, one can obtain $\mathcal{U}_{i} = im (U_{i}) = \ker (D_{i}^{T})$

. Let

T_{i} = [D_{i} U_{i}] \in R^{n \times n}

; then,

T_{i}

is an orthogonal matrix, which yields

T_{i}^{T} A T_{i} = [\begin{matrix} A_{i d} & 0 \\ A_{i r} & A_{i u} \end{matrix}], C_{i} T_{i} = [\begin{matrix} C_{i d} & 0 \end{matrix}],

where

A_{i d} \in R^{(n - ϱ_{i}) \times (n - ϱ_{i})}

,

A_{i r} \in R^{ϱ_{i} \times (n - ϱ_{i})}

,

A_{i u} \in R^{ϱ_{i} \times ϱ_{i}}

,

C_{i d} \in R^{p_{i} \times (n - ϱ_{i})}

and the pair

(C_{i d}, A_{i d})

is detectable, so that one can choose

K_{i d} \in R^{(n - ϱ_{i}) \times p_{i}}

to make

A_{i d} + K_{i d} C_{i d}

Hurwitz. Hence, the observer of the i-th agent is designed as follows

{\dot{\hat{x}}}_{i} = A {\hat{x}}_{i} - K_{i} (y_{i} - C_{i} {\hat{x}}_{i}) + γ U_{i} U_{i}^{T} \sum_{j \in N_{i} (t)} w_{i j} (t) ({\hat{x}}_{j} - {\hat{x}}_{i}),

(7)

where

K_{i} = T_{i} [\begin{matrix} K_{i d} \\ 0 \end{matrix}]

,

γ > 0

is the coupling gain;

{\hat{x}}_{i}

is the estimated state of the i-th sensor on x.
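The observer in (7) can be illustrated on a deliberately small example. The sketch below assumes a static target (A = 0) with two states and two sensors, each measuring one state; under this assumption the undetectable subspace of each sensor is spanned by the unmeasured coordinate, so $U_{1} = e_{2}$, $U_{2} = e_{1}$, and $K_{i d} = - 1$ makes $A_{i d} + K_{i d} C_{i d}$ Hurwitz. The system, the gains, and the Euler step size are illustrative choices, not values from the paper.

```python
def simulate_observer(gamma=2.0, dt=0.01, steps=2000):
    """Euler simulation of observer (7) for two sensors and a static target.

    Assumed toy setup (not from the paper): A = 0, x = [1, -2]^T,
    C_1 = [1 0], C_2 = [0 1], K_1 = [-1, 0]^T, K_2 = [0, -1]^T.
    """
    x = [1.0, -2.0]
    C = [[1.0, 0.0], [0.0, 1.0]]    # C[i] is the output row of sensor i
    K = [[-1.0, 0.0], [0.0, -1.0]]  # K[i] is the gain column of sensor i
    U = [[0.0, 1.0], [1.0, 0.0]]    # U[i] spans sensor i's undetectable subspace
    xh = [[0.0, 0.0], [0.0, 0.0]]   # initial estimates of both sensors
    for _ in range(steps):
        new = []
        for i in range(2):
            j = 1 - i  # the only neighbour (w_ij = 1)
            innov = sum(C[i][k] * (x[k] - xh[i][k]) for k in range(2))
            diff = [xh[j][k] - xh[i][k] for k in range(2)]
            proj = sum(U[i][k] * diff[k] for k in range(2))  # U_i^T (xh_j - xh_i)
            # A = 0, so only the innovation and consensus terms remain.
            dx = [-K[i][k] * innov + gamma * U[i][k] * proj for k in range(2)]
            new.append([xh[i][k] + dt * dx[k] for k in range(2)])
        xh = new
    return x, xh


x, xh = simulate_observer()
max_err = max(abs(xh[i][k] - x[k]) for i in range(2) for k in range(2))
```

Each sensor corrects the coordinate it measures through the innovation term and obtains the other coordinate only through the consensus term, which is the mechanism that joint detectability relies on.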

Based on

{\hat{x}}_{i}

in (7), the estimated state of the s-th object which is gained by the i-th sensor is

{\hat{x}}_{i, s}

; the influence model of the M objects in (2) turns to

\begin{matrix} ϕ (q, {\hat{x}}_{i, s}) & = μ_{s} ϕ_{s} (q, {\hat{x}}_{i, s}, t) \\ = μ_{s} e^{- \frac{{∥q - {\hat{x}}_{i, s}∥}^{2}}{2 σ_{s}^{2}}} \\ = μ_{s} e^{- \frac{{(q_{x} - {\hat{x}}_{i, s x})}^{2}}{2 σ_{s}^{2}} - \frac{{(q_{y} - {\hat{x}}_{i, s y})}^{2}}{2 σ_{s}^{2}}} . \end{matrix}

(8)

Even though the state of the objects can be estimated from partial output information alone, the weight

μ_{s}

is unknown, and hence

ϕ (q, {\hat{x}}_{i, s})

remains unknown. Moreover, for a completely unknown region, the influence model

ϕ_{s} (\cdot)

is itself unknown. Thus, it is impossible to evaluate

ϕ (q, {\hat{x}}_{i, s})

in (8) even though

{\hat{x}}_{i, s}

is known. In the following, the RBFNN used in [22,23,24] is introduced to approximate the continuous function

ϕ (q, {\hat{x}}_{i, s}) : R^{2} \to R

. Motivated by [22,23,24],

ϕ (q, {\hat{x}}_{i, s})

can be approximated by

\begin{matrix} \hat{ϕ} (q, {\hat{x}}_{i, s}) = W^{* T} Φ ({\hat{x}}_{i, s}) + ϵ, \end{matrix}

(9)

where

Φ ({\hat{x}}_{i, s}) = {[Φ_{1} ({\hat{x}}_{i, s}), Φ_{2} ({\hat{x}}_{i, s}), \dots, Φ_{ℓ} ({\hat{x}}_{i, s})]}^{T} : R^{2} \to R^{ℓ}

with

ℓ \geq 1

being the NN node number;

W^{*} \in R^{ℓ}

is the optimal weight vector, which is defined by

\begin{matrix} W^{*} = arg min_{\hat{W}} \{sup_{{\hat{x}}_{i, s}} |\hat{ϕ} (q, {\hat{x}}_{i, s}) - {\hat{W}}^{T} Φ ({\hat{x}}_{i, s})|\}, \end{matrix}

(10)

where

\hat{W}

is the estimation of

W^{*}

. Notice that, by [25], the more NN nodes, the more accurate the approximation will be. Note also that for

k = 1, 2, \dots, ℓ

,

Φ_{k} ({\hat{x}}_{i, s})

is selected as the generic Gaussian function as follows

\begin{matrix} Φ_{k} ({\hat{x}}_{i, s}) = e^{- \frac{{∥{\hat{x}}_{i, s} - γ_{k}∥}^{2}}{η_{k}^{2}}}, k = 1, 2, \dots, ℓ, \end{matrix}

(11)

where

γ_{k}

and

η_{k}

express, respectively, the center and spread.

ϵ

is an approximation error bounded on

Π

, namely,

|ϵ| \leq ϵ_{*}

with

ϵ_{*} > 0

being an unknown constant.
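The approximation in (9) with the Gaussian basis in (11) can be sketched as follows. The paper characterises only the optimal weight vector $W^{*}$ in (10); the full-batch gradient descent used here to obtain an estimate $\hat{W}$, as well as the centres, the spread, and the target function, are assumptions made purely for illustration.

```python
import math


def rbf_features(x, centers, eta):
    """Gaussian basis (11): Phi_k(x) = exp(-||x - gamma_k||^2 / eta^2)."""
    return [math.exp(-((x[0] - c[0]) ** 2 + (x[1] - c[1]) ** 2) / eta ** 2)
            for c in centers]


def predict(W, phi):
    """Network output W^T Phi."""
    return sum(w * p for w, p in zip(W, phi))


def mse(W, feats, targets):
    """Mean squared error of the network over the sample set."""
    return sum((predict(W, p) - y) ** 2 for p, y in zip(feats, targets)) / len(targets)


def fit(feats, targets, lr=0.05, epochs=200):
    """Full-batch gradient descent on the mean squared error (an assumed
    training rule, not the paper's)."""
    W = [0.0] * len(feats[0])
    n = len(targets)
    for _ in range(epochs):
        grad = [0.0] * len(W)
        for p, y in zip(feats, targets):
            err = predict(W, p) - y
            for k in range(len(W)):
                grad[k] += 2.0 * err * p[k] / n
        W = [w - lr * g for w, g in zip(W, grad)]
    return W


# Hypothetical setup: 3 x 3 grid of centres, common spread eta = 0.4.
centers = [(0.25 * a, 0.25 * b) for a in (1, 2, 3) for b in (1, 2, 3)]
samples = [(0.1 * a, 0.1 * b) for a in range(11) for b in range(11)]
target = lambda x: 1.5 * math.exp(-((x[0] - 0.6) ** 2 + (x[1] - 0.4) ** 2) / 0.08)
feats = [rbf_features(x, centers, 0.4) for x in samples]
targets = [target(x) for x in samples]
W = fit(feats, targets)
```

The residual that remains after fitting plays the role of the approximation error $ϵ$; adding more nodes generally reduces it, in line with the remark attributed to [25].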

2.5. Motion Control Algorithm Design

With

\hat{ϕ} (q, {\hat{x}}_{i, s})

in (9), the cost function in (4) becomes

\hat{Q} (p, V) = \sum_{i = 1}^{N} \int_{V_{i}} f (∥ q - p_{i} ∥) \hat{ϕ} (q, {\hat{x}}_{i, s}) d q,

(12)

which induces

\begin{matrix} \frac{\partial \hat{Q}}{\partial p_{i}} = & \int_{V_{i}} \frac{\partial}{\partial p_{i}} f (∥ q - p_{i} ∥) \hat{ϕ} (q, {\hat{x}}_{i, s}) d q \\ = & \int_{V_{i}} \frac{\partial}{\partial p_{i}} (α e^{- β ∥ q - p_{i} ∥^{2}}) \hat{ϕ} (q, {\hat{x}}_{i, s}) d q \\ = & \int_{V_{i}} 2 (q - p_{i}) α β e^{- β ∥ q - p_{i} ∥^{2}} \hat{ϕ} (q, {\hat{x}}_{i, s}) d q \\ = & {\hat{M}}_{V_{i}} ({\hat{C}}_{V_{i}} - p_{i}) . \end{matrix}

(13)

For brevity, we denote

\begin{matrix} {\hat{M}}_{V_{i}} & = \int_{V_{i}} 2 β f (∥ q - p_{i} ∥) \hat{ϕ} (q) d q, \\ {\hat{L}}_{V_{i}} & = \int_{V_{i}} 2 β q f (∥ q - p_{i} ∥) \hat{ϕ} (q) d q, \\ {\hat{C}}_{V_{i}} & = {\hat{L}}_{V_{i}} / {\hat{M}}_{V_{i}}, \end{matrix}

(14)

where

{\hat{M}}_{V_{i}}

and

{\hat{C}}_{V_{i}}

are generalized mass and generalized centroid of Voronoi cell

V_{i}

, respectively. Through the locational optimization function

\hat{Q}

, the agents’ local optimum point is the centroid of Voronoi cell, i.e., the critical points of

\hat{Q}

correspond to the configurations such that

p_{i} = {\hat{C}}_{V_{i}}, i = 1, \dots, N

, that is to say, the agents’ location converges asymptotically to the set of centroidal Voronoi configurations on

Ω

. Then, the controller

u_{i}

can be designed as

u_{i} = k {\hat{M}}_{V_{i}} ({\hat{C}}_{V_{i}} - p_{i}),

(15)

where

k > 0

is an arbitrary constant.
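A minimal numerical sketch of (14) and (15) is given below. It assumes the unit square as the region, a midpoint-rule quadrature, and a hypothetical estimated density; the function name and all parameter values are illustrative and not taken from the paper.

```python
import math


def control_inputs(sensors, phi_hat, k=1.0, alpha=1.0, beta=1.0, n=40):
    """Approximate (14) by the midpoint rule on the unit square and return
    u_i = k * M_i * (C_i - p_i) from (15) for every sensor."""
    h = 1.0 / n
    N = len(sensors)
    M = [0.0] * N    # generalised masses
    Lx = [0.0] * N   # first moments, x component
    Ly = [0.0] * N   # first moments, y component
    for a in range(n):
        for b in range(n):
            q = ((a + 0.5) * h, (b + 0.5) * h)
            d2 = [(q[0] - p[0]) ** 2 + (q[1] - p[1]) ** 2 for p in sensors]
            i = d2.index(min(d2))  # Voronoi cell containing q
            w = 2.0 * beta * alpha * math.exp(-beta * d2[i]) * phi_hat(q) * h * h
            M[i] += w
            Lx[i] += w * q[0]
            Ly[i] += w * q[1]
    u = []
    for i, p in enumerate(sensors):
        # u_i = k (L_i - M_i p_i), which equals k M_i (C_i - p_i) when M_i > 0.
        u.append((k * (Lx[i] - M[i] * p[0]), k * (Ly[i] - M[i] * p[1])))
    return u, M


# Hypothetical estimated density: one bump centred at (0.8, 0.8).
phi_hat = lambda q: math.exp(-((q[0] - 0.8) ** 2 + (q[1] - 0.8) ** 2) / 0.08)
u, M = control_inputs([(0.2, 0.2)], phi_hat)
```

Writing the input as $k ({\hat{L}}_{V_{i}} - {\hat{M}}_{V_{i}} p_{i})$ avoids dividing by a generalised mass that may be close to zero in cells where the estimated density is negligible.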

3. RBFNN-Based Distributed Coverage Control

As stated in Section 2, the controller is designed to drive sensors to cover multiple objects in the targeted region. This section aims at illustrating the performance of the distributed observer for state estimation on the targets and the conditions to be met, which is stated in the following theorems. RBFNN-based distributed coverage control can be designed for the system in (1) by implementing Algorithm 1.

Algorithm 1 Technical procedure for RBFNN-based distributed coverage control algorithm

➀: Do detectability decomposition for the system in (1); there exists an orthogonal matrix $T_{i}$ , such that

$T_{i}^{T} A T_{i} = [\begin{matrix} A_{i d} & 0 \\ A_{i r} & A_{i u} \end{matrix}], C_{i} T_{i} = [\begin{matrix} C_{i d} & 0 \end{matrix}],$

and the pair $(C_{i d}, A_{i d})$ is detectable.
➁: Choose $K_{i d}$ to make $A_{i d} + K_{i d} C_{i d}$ Hurwitz.
➂: For the system in (1), design a distributed observer as shown in (7).
➃: Approximate the density function in (9) by using RBFNN.
➄: Based on step ➃, use the cost function in (12) to design the controller $u_{i}$ in (15).

Let

e_{i} = {\hat{x}}_{i} - x

; then, it follows from (1) and (7) that

\begin{matrix} {\dot{e}}_{i} = (A + K_{i} C_{i}) e_{i} + γ U_{i} U_{i}^{T} \sum_{j \in N_{i} (t)} w_{i j} (t) (e_{j} - e_{i}) . \end{matrix}

(16)

First, a model transformation on

e_{i}

is given. Let

{\tilde{e}}_{i} = T_{i}^{- 1} e_{i} = [\begin{matrix} D_{i}^{T} \\ U_{i}^{T} \end{matrix}] e_{i} = [\begin{matrix} {\tilde{e}}_{i d} \\ {\tilde{e}}_{i u} \end{matrix}]

, then one has

\{\begin{matrix} {\dot{\tilde{e}}}_{i d} = & (A_{i d} + K_{i d} C_{i d}) {\tilde{e}}_{i d}, \\ {\dot{\tilde{e}}}_{i u} = & A_{i r} {\tilde{e}}_{i d} + A_{i u} {\tilde{e}}_{i u} - γ U_{i}^{T} \sum_{j = 1}^{N} l_{i j} (t) (D_{j} {\tilde{e}}_{j d} + U_{j} {\tilde{e}}_{j u}), \end{matrix}

(17)

Define the following block-diagonal matrices:

\begin{matrix} U & = diag {U_{1}, \dots, U_{N}}, D = diag {D_{1}, \dots, D_{N}}, \\ A_{d} & = diag {A_{1 d}, \dots, A_{N d}}, A_{r} = diag {A_{1 r}, \dots, A_{N r}}, \\ A_{u} & = diag {A_{1 u}, \dots, A_{N u}}, K_{d} = diag {K_{1 d}, \dots, K_{N d}}, \\ C_{d} & = diag {C_{1 d}, \dots, C_{N d}} . \end{matrix}

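Then, with ${\tilde{e}}_{d} = {col}_{i = 1, \dots, N} \{{\tilde{e}}_{i d}\}$ and ${\tilde{e}}_{u} = {col}_{i = 1, \dots, N} \{{\tilde{e}}_{i u}\}$, (17) becomes

\{\begin{matrix} {\dot{\tilde{e}}}_{d} & = (A_{d} + K_{d} C_{d}) {\tilde{e}}_{d}, \\ {\dot{\tilde{e}}}_{u} & = A_{r} {\tilde{e}}_{d} + A_{u} {\tilde{e}}_{u} - γ U^{T} (L (t) \otimes I_{n}) (D {\tilde{e}}_{d} + U {\tilde{e}}_{u}) . \end{matrix}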

(18)

Before moving on, Lemma 1 is introduced.

Lemma 1

([17]). Suppose that

G (t) = (\bar{N}, E (t), W (t))

is strongly connected for

t \geq 0

; then, the following statements are equivalent:

(i): $(C_{i}, A)$ is jointly detectable;
(ii): $U^{T} (L \otimes I_{n}) U$ is positive definite;
(iii): $U^{T} (L \otimes I_{n}) U$ is nonsingular.

Theorem 1.

Consider a multi-agent system (1) with N mobile sensors communicating through the graph

G (t)

, where each sensor is governed by the control law in (15) with

{\hat{M}}_{V_{i}}

and

{\hat{C}}_{V_{i}}

determined by

{\hat{x}}_{i}

in (7). Suppose that

(C_{i}, A)

is jointly detectable, and γ in (7) satisfies

γ > \frac{1 + ∥ A_{u} ∥ + γ_{0}}{λ_{min} (U^{T} (L \otimes I_{n}) U)}, γ_{0} > 0,

(19)

then, each agent can converge to an optimal coverage configuration, i.e.,

{lim}_{t \to \infty} ∥ p_{i} - C_{V_{i}} ∥ = 0

.

Proof.

Define a Lyapunov function as

V_{1} (p, {\hat{C}}_{v}, x, \hat{x}) = V_{1 a} + V_{1 b}

, where

\begin{matrix} V_{1 a} & = \frac{1}{2} \sum_{i = 1}^{N} \int_{V_{i}} f (∥ p_{i} - q ∥^{2}) ψ (q) d q, \\ V_{1 b} & = {\tilde{e}}_{d}^{T} P_{1} {\tilde{e}}_{d} + {\tilde{e}}_{u}^{T} {\tilde{e}}_{u}, \end{matrix}

with

P_{1} ≻ 0

being the solution of the Lyapunov equation

\begin{matrix} 0 & = {(A_{d} + K_{d} C_{d})}^{T} P_{1} + P_{1} (A_{d} + K_{d} C_{d}) + M_{1} I_{(n N - \sum_{i = 1}^{N} ϱ_{i})}, \\ M_{1} & = ∥ A_{r} ∥^{2} + 2 γ {∥ U^{T} (L \otimes I_{n}) D ∥}^{2} + γ_{0} . \end{matrix}

By (18), one has

\begin{matrix} \frac{d}{d t} ({\tilde{e}}_{d}^{T} P_{1} {\tilde{e}}_{d}) = & {\tilde{e}}_{d}^{T} {(A_{d} + K_{d} C_{d})}^{T} P_{1} {\tilde{e}}_{d} + {\tilde{e}}_{d}^{T} P_{1} (A_{d} + K_{d} C_{d}) {\tilde{e}}_{d} \\ = & - M_{1} {\tilde{e}}_{d}^{T} {\tilde{e}}_{d}, \\ \frac{d}{d t} ({\tilde{e}}_{u}^{T} {\tilde{e}}_{u}) = & 2 {\tilde{e}}_{u}^{T} A_{r} {\tilde{e}}_{d} - 2 γ {\tilde{e}}_{u}^{T} U^{T} (L \otimes I_{n}) D {\tilde{e}}_{d} + 2 {\tilde{e}}_{u}^{T} A_{u} {\tilde{e}}_{u} - 2 γ {\tilde{e}}_{u}^{T} U^{T} (L \otimes I_{n}) U {\tilde{e}}_{u} \\ \leq & 2 {\tilde{e}}_{u}^{T} {\tilde{e}}_{u} + ∥ A_{r} ∥^{2} {\tilde{e}}_{d}^{T} {\tilde{e}}_{d} + 2 γ {∥ U^{T} (L \otimes I_{n}) D ∥}^{2} {\tilde{e}}_{d}^{T} {\tilde{e}}_{d} \\ & + 2 ∥ A_{u} ∥ {\tilde{e}}_{u}^{T} {\tilde{e}}_{u} - 2 γ λ_{min} (U^{T} (L \otimes I_{n}) U) {\tilde{e}}_{u}^{T} {\tilde{e}}_{u} \\ = & (∥ A_{r} ∥^{2} + 2 γ {∥ U^{T} (L \otimes I_{n}) D ∥}^{2}) {\tilde{e}}_{d}^{T} {\tilde{e}}_{d} \\ & + 2 (1 + ∥ A_{u} ∥ - γ λ_{min} (U^{T} (L \otimes I_{n}) U)) {\tilde{e}}_{u}^{T} {\tilde{e}}_{u}, \end{matrix}

so that, for γ satisfying (19),

\begin{matrix} {\dot{V}}_{1 b} = & \frac{d}{d t} ({\tilde{e}}_{d}^{T} P_{1} {\tilde{e}}_{d}) + \frac{d}{d t} ({\tilde{e}}_{u}^{T} {\tilde{e}}_{u}) \\ \leq & - M_{1} {\tilde{e}}_{d}^{T} {\tilde{e}}_{d} + (∥ A_{r} ∥^{2} + 2 γ {∥ U^{T} (L \otimes I_{n}) D ∥}^{2}) {\tilde{e}}_{d}^{T} {\tilde{e}}_{d} \\ & + 2 (1 + ∥ A_{u} ∥ - γ λ_{min} (U^{T} (L \otimes I_{n}) U)) {\tilde{e}}_{u}^{T} {\tilde{e}}_{u} \\ \leq & - γ_{0} {\tilde{e}}_{d}^{T} {\tilde{e}}_{d} - 2 γ_{0} {\tilde{e}}_{u}^{T} {\tilde{e}}_{u} . \end{matrix}

On the other hand, there holds

\begin{matrix} {\dot{V}}_{1 a} = & - \sum_{i = 1}^{N} {\hat{M}}_{V_{i}} {({\hat{C}}_{V_{i}} - p_{i})}^{T} \hat{ψ} (q) {\dot{p}}_{i} \\ = & - k \sum_{i = 1}^{N} {\hat{M}}_{V_{i}} {∥ ({\hat{C}}_{V_{i}} - p_{i}) ∥}^{2} . \end{matrix}

By Lemma 1, one can obtain that

U^{T} (L \otimes I_{n}) U ≻ 0

. Therefore, under

γ

in (19), there holds

\begin{matrix} {\dot{V}}_{1} \leq & - k \sum_{i = 1}^{N} {\hat{M}}_{V_{i}} {∥ ({\hat{C}}_{V_{i}} - p_{i}) ∥}^{2} - γ_{0} {\tilde{e}}_{d}^{T} {\tilde{e}}_{d} - 2 γ_{0} {\tilde{e}}_{u}^{T} {\tilde{e}}_{u}, \end{matrix}

which implies

{lim}_{t \to \infty} p_{i} = {\hat{C}}_{V_{i}}

for

i = 1, \dots, N

,

{lim}_{t \to \infty} {\tilde{e}}_{d} = 0

and

{lim}_{t \to \infty} {\tilde{e}}_{u} = 0

. Notice that

{lim}_{t \to \infty} {\tilde{e}}_{d} = 0

and

{lim}_{t \to \infty} {\tilde{e}}_{u} = 0

together indicate that

{lim}_{t \to \infty} ∥ x - {\hat{x}}_{i} ∥ = 0

and

{lim}_{t \to \infty} \hat{ϕ} (q, {\hat{x}}_{i, s}) = \hat{ϕ} (q, x_{i, s})

for

i = 1, \dots, N

. On the other hand, with the help of the RBFNN-based estimation mentioned above, one has

{lim}_{t \to \infty} \hat{ϕ} (q, x_{i, s}) = ϕ (q, x_{i, s})

. Thus, one can obtain that

{lim}_{t \to \infty} \hat{ϕ} (q, {\hat{x}}_{i, s}) = ϕ (q, x_{i, s})

for

i = 1, \dots, N

, so that

{lim}_{t \to \infty} {\hat{C}}_{V_{i}} = C_{V_{i}}

for

i = 1, \dots, N

, which further results in

{lim}_{t \to \infty} p_{i} = C_{V_{i}}

for

i = 1, \dots, N

. □

A limitation of this distributed observer design is that

γ

needs to satisfy the condition in (19), which depends on the system matrix and the communication topology. However, the details of the system matrix and communication topology may be unknown. To relax the condition in Theorem 1, inspired by [17], dynamic coupling gains are introduced, so that (7) is modified as

{\dot{\hat{x}}}_{i} = A {\hat{x}}_{i} - K_{i} (y_{i} - C_{i} {\hat{x}}_{i}) + γ_{i} U_{i} U_{i}^{T} \sum_{j \in N_{i}} w_{i j} (t) ({\hat{x}}_{j} - {\hat{x}}_{i}),

(20)

where

γ_{i} > 0

is the adaptive coupling gain, which updates itself according to the following dynamics:

\begin{matrix} {\dot{γ}}_{i} & = {∥U_{i}^{T} \sum_{j \in N_{i}} w_{i j} (t) ({\hat{x}}_{j} - {\hat{x}}_{i})∥}^{2} \\ & = {∥U_{i}^{T} \sum_{j = 1}^{N} l_{i j} (t) (D_{j} {\tilde{e}}_{j d} + U_{j} {\tilde{e}}_{j u})∥}^{2} . \end{matrix}

(21)

Define

\begin{matrix} ξ_{i d} & = {\tilde{e}}_{i d}, \\ ξ_{i u} & = U_{i}^{T} \sum_{j = 1}^{N} l_{i j} (t) (D_{j} {\tilde{e}}_{j d} + U_{j} {\tilde{e}}_{j u}), \end{matrix}

(22)

then one has

{\dot{γ}}_{i} = ξ_{i u}^{T} ξ_{i u} .

(23)
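The adaptive gain in (20) and (21) can be illustrated on a toy problem with a static target (A = 0), two states, and two sensors that each measure one state. This setup, the initial gain, and the Euler step are assumptions made for the sketch; they are not taken from the paper.

```python
def simulate_adaptive(dt=0.01, steps=2000, gamma0=1.0):
    """Euler simulation of (20)-(21) for the assumed two-sensor toy problem:
    A = 0, x = [1, -2]^T, sensor 1 measures x[0], sensor 2 measures x[1]."""
    x = [1.0, -2.0]
    meas = [0, 1]  # component measured by each sensor
    unde = [1, 0]  # component each sensor cannot detect (U_i = e_unde)
    xh = [[0.0, 0.0], [0.0, 0.0]]
    gam = [gamma0, gamma0]
    for _ in range(steps):
        new, new_gam = [], []
        for i in range(2):
            j = 1 - i
            m, u = meas[i], unde[i]
            proj = xh[j][u] - xh[i][u]        # U_i^T (xh_j - xh_i), a scalar here
            est = list(xh[i])
            est[m] += dt * (x[m] - xh[i][m])  # innovation term with K_id = -1
            est[u] += dt * gam[i] * proj      # adaptive consensus term
            new.append(est)
            new_gam.append(gam[i] + dt * proj ** 2)  # gain update (21)
        xh, gam = new, new_gam
    return x, xh, gam


x, xh, gam = simulate_adaptive()
max_err = max(abs(xh[i][k] - x[k]) for i in range(2) for k in range(2))
```

No bound such as (19) is evaluated here: each gain only grows while the local disagreement is nonzero and settles once the estimates agree, so no knowledge of the Laplacian is required.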

Furthermore, denote

Γ = diag \{γ_{1} I_{ϱ_{1}}, \dots, γ_{N} I_{ϱ_{N}}\}

,

{\tilde{e}}_{d} = {col}_{i = 1, \dots, N} {{\tilde{e}}_{i d}}

and define

ξ_{d}

,

ξ_{u}

as well as

{\tilde{e}}_{u}

in a similar manner. Then, (18) can be written in the following form

\{\begin{matrix} {\dot{\tilde{e}}}_{d} & = (A_{d} + K_{d} C_{d}) {\tilde{e}}_{d}, \\ {\dot{\tilde{e}}}_{u} & = A_{r} {\tilde{e}}_{d} + A_{u} {\tilde{e}}_{u} - Γ U^{T} (L (t) \otimes I_{n}) (D {\tilde{e}}_{d} + U {\tilde{e}}_{u}) . \end{matrix}

(24)

By (22), one has

\begin{matrix} ξ & = [\begin{matrix} ξ_{d} \\ ξ_{u} \end{matrix}] = [\begin{matrix} I_{(n N - \sum_{i = 1}^{N} ϱ_{i})} & 0 \\ U^{T} (L (t) \otimes I_{n}) D & U^{T} (L \otimes I_{n}) U \end{matrix}] [\begin{matrix} {\tilde{e}}_{d} \\ {\tilde{e}}_{u} \end{matrix}] . \end{matrix}

(25)

Hereafter, denote

Δ = U^{T} (L \otimes I_{n}) U

for convenience. By Lemma 1,

Δ

is positive definite and hence invertible, so that

{[\begin{matrix} I_{(n N - \sum_{i = 1}^{N} ϱ_{i})} & 0 \\ U^{T} (L \otimes I_{n}) D & Δ \end{matrix}]}^{- 1} = [\begin{matrix} I_{(n N - \sum_{i = 1}^{N} ϱ_{i})} & 0 \\ \begin{matrix} - Δ^{- 1} U^{T} (L \otimes I_{n}) D \end{matrix} & Δ^{- 1} \end{matrix}],

which further implies that

\{\begin{matrix} {\tilde{e}}_{d} & = ξ_{d}, \\ {\tilde{e}}_{u} & = - Δ^{- 1} [U^{T} (L \otimes I_{n}) D ξ_{d} - ξ_{u}], \end{matrix}

(26)

and therefore

\begin{matrix} \dot{ξ} = [\begin{matrix} A_{d} + K_{d} C_{d} & 0 \\ Θ & \begin{matrix} Δ A_{u} Δ^{- 1} - Δ Γ \end{matrix} \end{matrix}] ξ, \end{matrix}

(27)

with

Θ = U^{T} (L \otimes I_{n}) D (A_{d} + K_{d} C_{d}) + Δ A_{r} - Δ A_{u} Δ^{- 1} U^{T} (L \otimes I_{n}) D

. Note that in (25),

Δ

is nonsingular. Hence, the stability of

ξ

is equivalent to that of the system in (24). Define a Lyapunov equation with the unique solution

P_{2}

as follows:

\begin{matrix} {(A_{d} + K_{d} C_{d})}^{T} P_{2} + P_{2} (A_{d} + K_{d} C_{d}) = - M_{2} I_{(n N - \sum_{i = 1}^{N} ϱ_{i})} \end{matrix}

where

M_{2} = 1 + {∥Θ∥}^{2}

.
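As a check on the second block row of (27), the derivation can be written out from (24) to (26). The sketch below uses B as shorthand for U^{T} (L \otimes I_{n}) D (a symbol introduced here, not in the paper) and treats L as constant between topology switches, which is an assumption of this check:

```latex
\begin{aligned}
\xi_u &= B \tilde{e}_d + \Delta \tilde{e}_u, \qquad B := U^{T} (L \otimes I_n) D, \\
\dot{\xi}_u &= B \dot{\tilde{e}}_d + \Delta \dot{\tilde{e}}_u \\
&= B (A_d + K_d C_d) \tilde{e}_d + \Delta A_r \tilde{e}_d + \Delta A_u \tilde{e}_u
   - \Delta \Gamma \, (B \tilde{e}_d + \Delta \tilde{e}_u) \\
&= \big[ B (A_d + K_d C_d) + \Delta A_r - \Delta A_u \Delta^{-1} B \big] \xi_d
   + \big( \Delta A_u \Delta^{-1} - \Delta \Gamma \big) \xi_u .
\end{aligned}
```

The last line substitutes \tilde{e}_d = \xi_d and \tilde{e}_u = \Delta^{-1} (\xi_u - B \xi_d) from (26); the bracketed coefficient of \xi_d is the matrix Θ defined after (27).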

Theorem 2.

Consider a multi-agent system (1) with N mobile sensors communicating through the graph

G (t)

, where each sensor is governed by the control law in (15) with

{\hat{M}}_{V_{i}}

and

{\hat{C}}_{V_{i}}

determined by

{\hat{x}}_{i}

in (20). Suppose that

(C_{i}, A)

is jointly detectable; then, each agent can converge to an optimal coverage configuration, i.e.,

{lim}_{t \to \infty} ∥ p_{i} - C_{V_{i}} ∥ = 0

.

Proof.

Define a Lyapunov function as

V_{2} (p, {\hat{C}}_{v}, x, \hat{x}) = V_{2 a} + V_{2 b}

, where

\begin{matrix} V_{2 a} & = \frac{1}{2} \sum_{i = 1}^{N} \int_{V_{i}} f (∥ p_{i} - q ∥^{2}) ψ (q) d q + ξ_{d}^{T} P_{2} ξ_{d} + ξ_{u}^{T} ξ_{u}, \\ V_{2 b} & = \sum_{i = 1}^{N} {[λ_{m i n} (Δ) γ_{i} - γ_{*}]}^{2}, \end{matrix}

(28)

with

γ_{*} > 0

being a sufficiently large constant to be determined later. Then, the derivatives of

V_{2 a}

and

V_{2 b}

along (27) yield

\begin{matrix} {\dot{V}}_{2 a} \leq & - \sum_{i = 1}^{N} {\hat{M}}_{V_{i}} {({\hat{C}}_{V_{i}} - p_{i})}^{T} \hat{ψ} (q) {\dot{p}}_{i} - M_{2} ξ_{d}^{T} ξ_{d} + ξ_{d}^{T} Θ^{T} Θ ξ_{d} \\ + ξ_{u}^{T} \{1 + {[Δ A_{u} Δ^{- 1}]}^{T} + Δ A_{u} Δ^{- 1} - Γ Δ - Δ Γ\} ξ_{u} \\ \leq & - k \sum_{i = 1}^{N} {\hat{M}}_{V_{i}} {∥ ({\hat{C}}_{V_{i}} - p_{i}) ∥}^{2} - M_{2} ξ_{d}^{T} ξ_{d} + {∥Θ∥}^{2} ξ_{d}^{T} ξ_{d} + (1 + 2 ∥A_{u}∥) ξ_{u}^{T} ξ_{u} \\ - ξ_{u}^{T} (Γ Δ + Δ Γ) ξ_{u}, \\ {\dot{V}}_{2 b} = & 2 \sum_{i = 1}^{N} [λ_{m i n} (Δ) γ_{i} - γ_{*}] ξ_{i u}^{T} ξ_{i u} \\ = & 2 λ_{m i n} (Δ) ξ_{u}^{T} Γ ξ_{u} - 2 γ_{*} ξ_{u}^{T} ξ_{u} \\ \leq & ξ_{u}^{T} (Γ Δ + Δ Γ) ξ_{u} - 2 γ_{*} ξ_{u}^{T} ξ_{u} . \end{matrix}

The constant

γ_{*} > 0

is a free design parameter, so it can be chosen to satisfy

γ_{*} > \frac{1}{2} (1 + 2 ∥A_{u}∥ + γ_{0})

, where $γ_{0} > 0$ is a constant. Hence, one has

{\dot{V}}_{2} \leq - k \sum_{i = 1}^{N} {\hat{M}}_{V_{i}} {∥ ({\hat{C}}_{V_{i}} - p_{i}) ∥}^{2} - ξ_{d}^{T} ξ_{d} - γ_{0} ξ_{u}^{T} ξ_{u} \leq 0 .
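Adding the two bounds makes the cancellation explicit. The lines below only combine the inequalities for \dot{V}_{2a} and \dot{V}_{2b} with the definition of M_{2} and the lower bound on γ_{*}; no new assumption is introduced:

```latex
\begin{aligned}
\dot{V}_2 &= \dot{V}_{2a} + \dot{V}_{2b} \\
&\le -k \sum_{i=1}^{N} \hat{M}_{V_i} \| \hat{C}_{V_i} - p_i \|^2
     - \big( M_2 - \|\Theta\|^2 \big) \xi_d^{T} \xi_d
     + \big( 1 + 2 \|A_u\| - 2 \gamma_* \big) \xi_u^{T} \xi_u \\
&\le -k \sum_{i=1}^{N} \hat{M}_{V_i} \| \hat{C}_{V_i} - p_i \|^2
     - \xi_d^{T} \xi_d - \gamma_0 \, \xi_u^{T} \xi_u .
\end{aligned}
```

Here the terms \xi_u^{T} (\Gamma \Delta + \Delta \Gamma) \xi_u cancel, M_2 - \|\Theta\|^2 = 1 by the definition of M_2, and 1 + 2 \|A_u\| - 2 \gamma_* < -\gamma_0 by the choice of \gamma_*.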

Similar to the analysis in the proof of Theorem 1, one can obtain that each sensor converges to an optimal coverage configuration. □

Remark 1.

The coverage motion, the Voronoi partitions, and the edge set defined in Section 2.2 together result in a sensor network with dynamic communication topology. Yet, it is worth noting that the communication topology only switches from a connected graph to another connected one, different from the joint connectivity case in [26]. In this paper, Lyapunov functions (i.e.,

V_{1}

and

V_{2}

) that are independent of the topology changes are constructed to eliminate the influence of these changes on the distributed observer-based coverage control.

4. Numerical Simulation and Discussion

In this section, numerical simulations are first given to verify the obtained theoretical results, and then further discussions are provided to analyze the simulation results in more depth.

4.1. Numerical Example

In this subsection, two simulation examples are given. In detail, the targeted region Q is chosen to be a

100 \times 100

square.

μ_{s}

in (8) is chosen as

μ_{s} = 3

for

s = 1, \dots, M

. For the performance function

\hat{Q} (p, V)

defined in (12),

α

and

β

are chosen as

α = 0.005

and

β = 0.001

, respectively. A sensor network consisting of

N = 8

sensors is used to implement coverage control of the targeted region.

Example 1.

In this example, there are

M = 2

objects considered in Q, whose trajectories are described by (1) with

\begin{matrix} A = diag \{{\tilde{A}}_{1}, {\tilde{A}}_{2}\}, {\tilde{A}}_{s} = [\begin{matrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 2 \\ 0 & 0 & 0 & - 0.8 \\ 0 & 0 & 0 & 0 \end{matrix}], s = 1, 2 . \end{matrix}

(29)

The initial states of the two objects are chosen as

\begin{matrix} x_{1} (0) = {[\begin{matrix} 30 & 70 & 1 & 1 \end{matrix}]}^{T}, x_{2} (0) = {[\begin{matrix} 82 & 30 & 1.5 & 1.5 \end{matrix}]}^{T} . \end{matrix}

The measurement of each agent in (5) is chosen as

\begin{matrix} C_{i} = S_{i} \otimes [\begin{matrix} I_{2} & 0_{2 \times 2} \end{matrix}], i = 1, 2, \dots, 8 \end{matrix}

(30)

where

\begin{matrix} S_{1} & = [\begin{matrix} 1 & 0 \end{matrix}], S_{2} = [\begin{matrix} 1 & - 1 \end{matrix}], S_{3} = [\begin{matrix} 1 & 0 \end{matrix}], S_{4} = [\begin{matrix} 0 & 1 \end{matrix}], \\ S_{5} & = [\begin{matrix} - 1 & 1 \end{matrix}], S_{6} = [\begin{matrix} 1 & 0 \end{matrix}], S_{7} = [\begin{matrix} 0 & 1 \end{matrix}], S_{8} = [\begin{matrix} 1 & 1 \end{matrix}] . \end{matrix}

It is verified that

({col}_{i \in V} (C_{i}), A)

is observable. With the detectability decomposition on

(C_{i}, A)

, one can obtain the detectable pair

(C_{i d}, A_{i d})

which is shown as follows:

A_{1 d} = A_{2 d} = [\begin{matrix} 0 & - 0.8 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & - 1 & 0 & 0 \\ - 1 & 0 & 0 & 0 \end{matrix}], C_{1 d} = [\begin{matrix} 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{matrix}], C_{2 d} = [\begin{matrix} 0 & 0 & 0 & - 1.414 \\ 0 & 0 & - 1.414 & 0 \end{matrix}] .
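The joint observability claim for Example 1 can be checked numerically. The sketch below assumes NumPy is available and builds A and C_i from (29) and (30) exactly as printed; the helper name `observability_rank` is introduced here for illustration only.

```python
import numpy as np


def observability_rank(C, A):
    """Rank of the observability matrix [C; CA; ...; CA^(n-1)]."""
    n = A.shape[0]
    blocks = [C]
    for _ in range(n - 1):
        blocks.append(blocks[-1] @ A)
    return np.linalg.matrix_rank(np.vstack(blocks))


# Object dynamics from (29): A = diag{A_tilde, A_tilde}.
A_tilde = np.array([[0, 0, 1, 0],
                    [0, 0, 0, 2],
                    [0, 0, 0, -0.8],
                    [0, 0, 0, 0]], dtype=float)
A = np.kron(np.eye(2), A_tilde)

# Output matrices from (30): C_i = S_i kron [I_2  0_{2x2}].
S = [[1, 0], [1, -1], [1, 0], [0, 1], [-1, 1], [1, 0], [0, 1], [1, 1]]
position_output = np.hstack([np.eye(2), np.zeros((2, 2))])
C_list = [np.kron(np.array([s], dtype=float), position_output) for s in S]

# Rank for each sensor alone, and for all sensors stacked together.
individual_ranks = [observability_rank(C, A) for C in C_list]
joint_rank = observability_rank(np.vstack(C_list), A)
print(individual_ranks, joint_rank)
```

With these matrices, each sensor alone gives an observability matrix of rank 4, which is less than the state dimension 8, whereas the stacked outputs give full rank 8. This is the situation the distributed observer is designed for: no single sensor can reconstruct the full state, but the network can.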

Choose

K_{i d}

as

\begin{matrix} K_{1 d} & = {[\begin{matrix} - 7.8774 & 2.1377 & - 0.8593 & 5.6633 \\ 5.0502 & - 3.8080 & 4.3367 & - 1.4494 \end{matrix}]}^{T}, \\ K_{2 d} & = {[\begin{matrix} 5.5702 & - 1.5116 & 0.6076 & - 4.0045 \\ - 3.5710 & 2.6927 & - 3.0665 & 1.0249 \end{matrix}]}^{T}, \end{matrix}

then,

A_{i d} + K_{i d} C_{i d}

is Hurwitz for

i = 1, 2

. With the Voronoi partition of the given convex region Ω, the communication topology associated with the eight sensors is obtained, and it is connected. The initial locations of these sensors are

\begin{matrix} p_{1} (0) & = [\begin{matrix} 82 \\ 95 \end{matrix}], p_{2} (0) = [\begin{matrix} 10 \\ 54 \end{matrix}], p_{3} (0) = [\begin{matrix} 25 \\ 89 \end{matrix}], p_{4} (0) = [\begin{matrix} 68 \\ 46 \end{matrix}], \\ p_{5} (0) & = [\begin{matrix} 62 \\ 81 \end{matrix}], p_{6} (0) = [\begin{matrix} 50 \\ 50 \end{matrix}], p_{7} (0) = [\begin{matrix} 76 \\ 24 \end{matrix}], p_{8} (0) = [\begin{matrix} 17 \\ 20 \end{matrix}] . \end{matrix}

(31)
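To make the role of the initial locations in (31) concrete, the following sketch approximates the Voronoi partition of the 100 × 100 square on a uniform grid and computes the mass and centroid of each cell. Several things here are assumptions for illustration only: the uniform density (the paper uses an RBFNN-estimated density instead), the grid resolution, the helper name `voronoi_centroids`, and the Lloyd-type update p_i ← p_i + dt · k (C_i − p_i), which stands in for the control law (15).

```python
def voronoi_centroids(sensors, density, size=100.0, cells=50):
    """Approximate the mass and centroid of each Voronoi cell on a grid."""
    step = size / cells
    mass = [0.0] * len(sensors)
    moment = [[0.0, 0.0] for _ in sensors]
    for a in range(cells):
        for b in range(cells):
            q = ((a + 0.5) * step, (b + 0.5) * step)  # grid-cell midpoint
            # The nearest sensor owns q (definition of a Voronoi partition).
            owner = min(
                range(len(sensors)),
                key=lambda i: (sensors[i][0] - q[0]) ** 2
                + (sensors[i][1] - q[1]) ** 2,
            )
            w = density(q) * step * step
            mass[owner] += w
            moment[owner][0] += w * q[0]
            moment[owner][1] += w * q[1]
    centroids = [(m[0] / M, m[1] / M) for m, M in zip(moment, mass)]
    return mass, centroids


# Initial sensor locations from (31).
sensors = [(82, 95), (10, 54), (25, 89), (68, 46),
           (62, 81), (50, 50), (76, 24), (17, 20)]

# Uniform density is a placeholder assumption, not the paper's density.
mass, centroids = voronoi_centroids(sensors, density=lambda q: 1.0)

# One assumed Euler step of a Lloyd-type law with gain k.
k, dt = 1.0, 0.1
next_positions = [
    (p[0] + dt * k * (c[0] - p[0]), p[1] + dt * k * (c[1] - p[1]))
    for p, c in zip(sensors, centroids)
]
```

Replacing the placeholder `density` with an estimated density would move the centroids toward the regions of high interest, which is the mechanism the lower layer dynamics relies on.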

By the definitions in Section 2.2, the Laplacian matrix of this sensor network at

t = 0

is

L (0) = [\begin{matrix} 4 & - 1 & - 1 & 0 & 0 & 0 & - 1 & - 1 \\ - 1 & 4 & - 1 & - 1 & 0 & 0 & 0 & - 1 \\ - 1 & - 1 & 4 & - 1 & - 1 & 0 & 0 & 0 \\ 0 & - 1 & - 1 & 4 & - 1 & - 1 & 0 & 0 \\ 0 & 0 & - 1 & - 1 & 4 & - 1 & - 1 & 0 \\ 0 & 0 & 0 & - 1 & - 1 & 4 & - 1 & - 1 \\ - 1 & 0 & 0 & 0 & - 1 & - 1 & 4 & - 1 \\ - 1 & - 1 & 0 & 0 & 0 & - 1 & - 1 & 4 \end{matrix}] .

The second-smallest eigenvalue of

L (0)

is

2.5858

, which is positive and thus indicates that the sensor network is connected.
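The reported eigenvalue can be reproduced with a few lines of Python. The sketch below relies on the observation that L(0) as printed is a symmetric circulant matrix (each row is the previous row shifted by one position), so its eigenvalues follow from the first row through a cosine sum; this shortcut does not apply to a general Laplacian.

```python
import math

# First row of L(0); every other row is a cyclic shift of it.
first_row = [4, -1, -1, 0, 0, 0, -1, -1]
n = len(first_row)

# Eigenvalues of a symmetric circulant matrix:
# lambda_k = sum_j c_j * cos(2*pi*j*k/n), k = 0, ..., n-1.
eigenvalues = sorted(
    sum(c * math.cos(2 * math.pi * j * k / n) for j, c in enumerate(first_row))
    for k in range(n)
)

lambda_2 = eigenvalues[1]  # second-smallest eigenvalue (algebraic connectivity)
print(round(lambda_2, 4))
```

The smallest eigenvalue is 0, as for every graph Laplacian, and the second-smallest equals 4 − √2 ≈ 2.5858, in agreement with the value stated above.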

Figure 1a depicts the trajectory of

{\hat{x}}_{i} - x

which shows that the distributed observer can accurately estimate the states of two objects, and the estimation error

{\hat{x}}_{i} - x

approaches 0 as t goes to ∞. As shown in Figure 1b, the adaptive parameter

γ_{i}

increases and converges to a constant. Figure 2a,b, respectively, illustrate that

{\hat{C}}_{V_{i}} - p_{i}

and

{\hat{C}}_{V_{i}} - C_{V_{i}}

converge to 0 as t goes to ∞. This indicates that

p_{i}

approaches

C_{V_{i}}

as t goes to ∞ and the density function of the unknown targeted region can be well learned by the RBFNN. The sensors are denoted by eight solid blue-green dots in Figure 3, which illustrates that, as time goes on, the sensors driven by the underlying dynamic model can cover the two objects successfully.

Example 2.

A multi-agent system consisting of

M = 6

agents is considered as the multiple objects in Q. The trajectories of these six objects are steered by (1) with A as follows:

\begin{matrix} A = diag \{{\tilde{A}}_{1}, \dots, {\tilde{A}}_{6}\}, {\tilde{A}}_{s} = [\begin{matrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & - 0.47 \\ 0 & 0 & 0.44 & 0 \end{matrix}], s = 1, \dots, 6 . \end{matrix}

(32)

The initial states of the six objects are chosen as

\begin{matrix} x_{1} (0) & = {[\begin{matrix} 30 & 70 & 4 & 2 \end{matrix}]}^{T}, x_{2} (0) = {[\begin{matrix} 30 & 65 & 4 & 2 \end{matrix}]}^{T}, x_{3} (0) = {[\begin{matrix} 25 & 70 & 4 & 2 \end{matrix}]}^{T}, \\ x_{4} (0) & = {[\begin{matrix} 35 & 75 & 3 & 1 \end{matrix}]}^{T}, x_{5} (0) = {[\begin{matrix} 67 & 47 & 3 & 1 \end{matrix}]}^{T}, x_{6} (0) = {[\begin{matrix} 70 & 50 & 3 & 1 \end{matrix}]}^{T} . \end{matrix}
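For intuition about the object motion in this example, the sketch below integrates one object with a classical fourth-order Runge-Kutta scheme. It assumes that (1) has the autonomous form ẋ = Ã x with Ã from (32); the step size, the horizon, and the helper names are choices made here for illustration.

```python
def object_rhs(x):
    """x' = A_tilde x for A_tilde in (32), with x = [x1, x2, x3, x4]."""
    return [x[2], x[3], -0.47 * x[3], 0.44 * x[2]]


def rk4_step(x, dt):
    """One classical Runge-Kutta step for x' = object_rhs(x)."""
    def shifted(base, direction, scale):
        return [b + scale * d for b, d in zip(base, direction)]

    k1 = object_rhs(x)
    k2 = object_rhs(shifted(x, k1, dt / 2))
    k3 = object_rhs(shifted(x, k2, dt / 2))
    k4 = object_rhs(shifted(x, k3, dt))
    return [xi + dt / 6 * (a + 2 * b + 2 * c + d)
            for xi, a, b, c, d in zip(x, k1, k2, k3, k4)]


def invariant(x):
    """0.44*x3^2 + 0.47*x4^2 is constant along exact solutions."""
    return 0.44 * x[2] ** 2 + 0.47 * x[3] ** 2


x = [30.0, 70.0, 4.0, 2.0]  # x_1(0) from Example 2
e0 = invariant(x)
trajectory = [x]
for _ in range(1000):  # 10 time units with dt = 0.01 (assumed values)
    x = rk4_step(x, 0.01)
    trajectory.append(x)

drift = abs(invariant(x) - e0)  # numerical drift of the invariant
```

The last two state components rotate with bounded amplitude, which the conserved quadratic form makes visible; monitoring `drift` is a simple sanity check on the integrator.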

The measurement of each agent in (5) is chosen as

\begin{matrix} C_{i} = S_{i} \otimes [\begin{matrix} I_{2} & 0_{2 \times 2} \end{matrix}], i = 1, 2, \dots, 8 \end{matrix}

(33)

where

\begin{matrix} S_{1} & = [\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 \end{matrix}], S_{2} = [\begin{matrix} 1 & 1 & 0 & 0 & 0 & 0 \end{matrix}], \\ S_{3} & = [\begin{matrix} 1 & 0 & 1 & 0 & 0 & 0 \end{matrix}], S_{4} = [\begin{matrix} 1 & 0 & 0 & 1 & 0 & 0 \end{matrix}], \\ S_{5} & = [\begin{matrix} 1 & 0 & 0 & 0 & 1 & 0 \end{matrix}], S_{6} = [\begin{matrix} 1 & 0 & 0 & 0 & 0 & 1 \end{matrix}], \\ S_{7} & = [\begin{matrix} 0 & 1 & 1 & 0 & 0 & 0 \end{matrix}], S_{8} = [\begin{matrix} 0 & 0 & 1 & 1 & 0 & 0 \end{matrix}] . \end{matrix}

It is verified that

({col}_{i \in V} (C_{i}), A)

is observable. With the help of the detectability decomposition on

(C_{i}, A)

, one can obtain the corresponding detectable pair

(C_{i d}, A_{i d})

as follows:

A_{i d} = [\begin{matrix} 0 & - 0.47 & 0 & 0 \\ 0.44 & 0 & 0 & 0 \\ 0 & - 1 & 0 & 0 \\ - 1 & 0 & 0 & 0 \end{matrix}], A_{8 d} = [\begin{matrix} 0 & - 0.47 & 0 & 0 \\ 0.44 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \end{matrix}],

with

i = 1, \dots, 7

and

\begin{matrix} C_{1 d} & = C_{3 d} = - C_{4 d} = - C_{5 d} = C_{6 d} = [\begin{matrix} 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{matrix}], \\ C_{2 d} & = - C_{7 d} = - C_{8 d} = [\begin{matrix} 0 & 0 & 0 & - 1.414 \\ 0 & 0 & - 1.414 & 0 \end{matrix}] . \end{matrix}

Choose

K_{i d}

as

\begin{matrix} K_{1 d} = {[\begin{matrix} - 7.7503 & - 1.5061 & 0.0013 & 5.8995 \\ 3.0130 & - 2.8508 & 4.1105 & - 0.9052 \end{matrix}]}^{T}, \\ K_{6 d} = {[\begin{matrix} - 5.4802 & - 1.0649 & - 0.0009 & - 4.1715 \\ 2.1305 & - 2.0158 & - 2.8995 & 0.6401 \end{matrix}]}^{T}, \\ K_{i d} = {[\begin{matrix} 5.4803 & 1.0649 & - 0.0009 & - 4.1716 \\ - 2.1305 & 2.0158 & - 2.8995 & 0.6401 \end{matrix}]}^{T}, i = 2, 3, 4, 5, 7, 8, \end{matrix}

then the matrix

A_{i d} + K_{i d} C_{i d} (i = 1, 2, \dots, 8)

is Hurwitz. The initial states of

{\hat{x}}_{i}

and

γ_{i} (0)

are both chosen as in Example 1.

As shown in Figure 4a, the completely distributed observer in (20) can recover the state of (1). Also, as

{\hat{x}}_{i} - x

converges to 0 as time goes to ∞,

{\dot{γ}}_{i}

goes to 0, so that

γ_{i}

converges to a constant, which is verified by Figure 4b. Figure 5a,b depict the trajectory of

{\hat{C}}_{V_{i}} - p_{i}

and that of

{\hat{C}}_{V_{i}} - C_{V_{i}}

, respectively, for

i = 1, \dots, 8

. As shown in these two figures,

{\hat{C}}_{V_{i}} - p_{i}

and

{\hat{C}}_{V_{i}} - C_{V_{i}}

both converge to 0 as time goes to ∞ for

i = 1, \dots, 8

, which further indicates that

p_{i}

approaches

C_{V_{i}}

as

t \to \infty

.

Figure 6 describes the snapshots of coverage control with dynamic topology, where the topology changes as the Voronoi partitions vary. In Figure 6,

M = 6

objects are represented by yellow five-pointed stars and

N = 8

sensors are denoted by the solid blue-green dots. Under the guideline of (15) with

k = 1

, the sensors move to their optimal locations and reach an optimal coverage configuration, as shown in Figure 6d. All these figures verify Theorem 2.

4.2. Discussions

In the previous subsection, two examples verify the theorems and illustrate the feasibility of the algorithm. Several relevant points are noted below.

(i): The initial states of the objects can be chosen arbitrarily within the targeted region; this choice does not affect the performance of the state estimation or the optimal location of each sensor.
(ii): In principle, $K_{i d}$ only needs to make $A_{i d} + K_{i d} C_{i d}$ Hurwitz. In practice, however, different $K_{i d}$ can be chosen to tune the convergence rate, which is determined by the eigenvalues of $A_{i d} + K_{i d} C_{i d}$.
(iii): In Theorem 1, the coupling gain $γ$ must satisfy condition (19), which amounts to $γ > 16.808$ at $t = 0$. However, because the topology of the sensors changes over time, the adaptive coupling gain in (21) is designed to avoid real-time acquisition of topology information.
(iv): Existing results on distributed state estimation-based coverage control, such as References [6,15], require that the measurement of each single sensor ensure observability of the pair formed by its output matrix and the system matrix. Yet it is uneconomical, or even impossible, to measure the full output with one sensor. In this paper, each sensor only needs to measure partial outputs, without any individual observability guarantee, and the consensus-based communication among the sensors compensates for the missing information in the state estimation of the targets. This makes the distributed observer well suited to targeted objects of high dimension (such as a power network) or those occupying a wide and decentralized region (such as a multi-agent system consisting of six UAVs flying in a given formation).
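Point (ii) can be illustrated with a small numerical sketch. The matrices below form a hypothetical double-integrator pair and are not the matrices of Examples 1 and 2; the helper name `spectral_abscissa` is likewise an assumption, and NumPy is assumed to be available.

```python
import numpy as np


def spectral_abscissa(M):
    """Largest real part among the eigenvalues of M."""
    return float(np.max(np.linalg.eigvals(M).real))


# Hypothetical pair (C, A), chosen only to show the effect of the gain.
A = np.array([[0.0, 1.0], [0.0, 0.0]])
C = np.array([[1.0, 0.0]])
K_slow = np.array([[-3.0], [-2.0]])   # eigenvalues of A + K C: -1, -2
K_fast = np.array([[-7.0], [-12.0]])  # eigenvalues of A + K C: -3, -4

for K in (K_slow, K_fast):
    alpha = spectral_abscissa(A + K @ C)
    # A + K C is Hurwitz iff alpha < 0; -alpha bounds the slowest decay rate.
    print("Hurwitz:", alpha < 0, "slowest decay rate:", -alpha)
```

Both gains make A + K C Hurwitz, but the second places the eigenvalues further into the left half-plane, so the estimation error of the detectable part decays faster.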

5. Conclusions

The coverage control of a sensor network aims at driving sensors to spread over a targeted region while minimizing a cost function. For a convex targeted region with multiple objects, and considering that the objects are unknown and may be spread over a vast region, each sensor is endowed with two-layer dynamics: an upper layer and a lower layer. The upper layer is a distributed observer that estimates the states of the objects. In the distributed observer, each sensor only measures partial outputs of the objects, which makes it well suited for objects spread over a vast region. The lower layer is a negative feedback law guiding each sensor to its own optimal location, which is determined by an estimated state-based cost function. As the estimated states converge to the real states, the estimated state-based cost function approaches the real cost function, so that the optimal locations of the sensors minimize the real cost function. This paper provides a new, more rational, and more realistic perspective for Voronoi-based coverage control.

Author Contributions

Writing—original draft, A.Z.; Writing—review and editing, A.Z. and X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Natural Science Fund for Excellent Young Scholars of Jiangsu Province under Grant No. BK20220104.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

NN	Neural Network
RBFNN	Radial Basis Function Neural Network

References

1. Wang, Z.; Jusup, M.; Shi, L.; Lee, J.H.; Iwasa, Y.; Boccaletti, S. Exploiting a cognitive bias promotes cooperation in social dilemma experiments. Nat. Commun. 2018, 9, 2954.
2. Wang, J.; Liu, D.; Feng, J.; Zhao, Y. Distributed Optimization Control for Heterogeneous Multiagent Systems under Directed Topologies. Mathematics 2023, 11, 1479.
3. Liu, B.; Wang, X.; Su, H.; Gao, Y.; Wang, L. Adaptive second-order consensus of multi-agent systems with heterogeneous nonlinear dynamics and time-varying delays. Neurocomputing 2013, 118, 289–300.
4. Xu, C.; Zeng, W.; Liu, C.; Yan, H. Event-triggered semi-global output consensus of discrete-time multi-agent systems with input saturation and external disturbances. IEEE Trans. Circuits Syst. II Express Briefs 2023, 70, 4469–4473.
5. Tnunay, H.; Moussa, K.; Hably, A.; Marchand, N. Distributed Finite-Time Coverage Control of Multi-Quadrotor Systems with Switching Topology. Mathematics 2023, 11, 2621.
6. Sun, Q.; Chi, M.; Liu, Z.W.; He, D. Observer-Based coverage control of unicycle mobile robot network in dynamic environment. J. Frankl. Inst. 2023, 360, 9015–9027.
7. Yu, D.; Xu, H.; Chen, C.P.; Bai, W.; Wang, Z. Dynamic coverage control based on k-means. IEEE Trans. Ind. Electron. 2021, 69, 5333–5341.
8. Sun, Z.; Wang, N.; Lin, H.; Zhou, X. Persistent coverage of UAVs based on deep reinforcement learning with wonderful life utility. Neurocomputing 2023, 521, 137–145.
9. Hu, J.; Niu, H.; Carrasco, J.; Lennox, B.; Arvin, F. Voronoi-based multi-robot autonomous exploration in unknown environments via deep reinforcement learning. IEEE Trans. Veh. Technol. 2020, 69, 14413–14423.
10. Schwager, M.; Rus, D.; Slotine, J.J. Decentralized, adaptive coverage control for networked robots. Int. J. Robot. Res. 2009, 28, 357–375.
11. Abbasi, F.; Mesbahi, A.; Velni, J.M. A new voronoi-based blanket coverage control method for moving sensor networks. IEEE Trans. Control Syst. Technol. 2017, 27, 409–417.
12. Wang, B. Coverage Control in Sensor Networks; Springer Science & Business Media: New York, NY, USA, 2010.
13. Luo, K.; Chi, M.; Chen, J.; Guan, Z.H.; Cai, C.X.; Zhang, D.X. Distributed coordination of multiple mobile actuators for pollution neutralization. Neurocomputing 2018, 316, 10–19.
14. Kantaros, Y.; Zavlanos, M.M. Distributed communication-aware coverage control by mobile sensor networks. Automatica 2016, 63, 209–220.
15. Zuo, L.; Yan, W.; Cui, R.; Chen, W.; Bai, X. Coverage control of multiple ocean vehicles for environment monitoring with energy constraints. In Proceedings of the OCEANS 2014-TAIPEI, Taipei, Taiwan, 7–10 April 2014; pp. 1–6.
16. Wang, L.; Morse, A.S. A distributed observer for a time-invariant linear system. IEEE Trans. Autom. Control 2017, 63, 2123–2130.
17. Kim, T.; Lee, C.; Shim, H. Completely decentralized design of distributed observer for linear systems. IEEE Trans. Autom. Control 2019, 65, 4664–4678.
18. Wang, X.; Jiang, G.P.; Su, H.; Zeng, Z. Consensus-based distributed reduced-order observer design for LTI systems. IEEE Trans. Cybern. 2020, 52, 6331–6341.
19. Wang, X.; Su, H.; Zhang, F.; Chen, G. A Robust Distributed Interval Observer for LTI Systems. IEEE Trans. Autom. Control 2023, 68, 1337–1352.
20. Wang, X.; Fan, Z.; Wang, L.; Su, H.; Lam, J. Fully distributed observer design for mobile targets. IEEE Trans. Netw. Sci. Eng. 2023, 10, 1696–1708.
21. Du, Q.; Faber, V.; Gunzburger, M. Centroidal Voronoi tessellations: Applications and algorithms. SIAM Rev. 1999, 41, 637–676.
22. Haykin, S. Neural Networks: A Comprehensive Foundation; Prentice Hall PTR: Hoboken, NJ, USA, 1998.
23. Yang, X.; Wan, X.; Zunshui, C.; Cao, J.; Liu, Y.; Rutkowski, L. Synchronization of switched discrete-time neural networks via quantized output control with actuator fault. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 4191–4201.
24. Chen, Z.; Huang, F.; Chen, W.; Zhang, J.; Sun, W.; Chen, J.; Zhu, S. RBFNN-based adaptive sliding mode control design for delayed nonlinear multilateral telerobotic system with cooperative manipulation. IEEE Trans. Ind. Inform. 2019, 16, 1236–1247.
25. Huang, J.T. Global tracking control of strict-feedback systems using neural networks. IEEE Trans. Neural Netw. Learn. Syst. 2012, 23, 1714–1725.
26. Chen, Z.; Zhang, H.T. A remark on collective circular motion of heterogeneous multi-agents. Automatica 2013, 49, 1236–1241.

Figure 1. Simulations of system (29) with the distributed observer designed in (20).

Figure 2. Simulations of system (29) with coverage control algorithm in (15).

Figure 3. Snapshots of coverage control with dynamic topology. The

M = 2

objects and

N = 8

sensors are denoted by two yellow spots and 8 solid blue-green dots, respectively.

Figure 4. Simulations of system (32) with the distributed observer designed in (20).

Figure 5. Simulations of system (32) with coverage control algorithm in (15).

Figure 6. Snapshots of coverage control with dynamic topology. The 6 objects and 8 sensors are denoted by 6 yellow five-pointed stars and 8 solid blue-green dots, respectively.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
