Next Article in Journal
Optimal Weak Order and Approximation of the Invariant Measure with a Fully-Discrete Euler Scheme for Semilinear Stochastic Parabolic Equations with Additive Noise
Previous Article in Journal
A Coupled Machine Learning and Lattice Boltzmann Method Approach for Immiscible Two-Phase Flows
Previous Article in Special Issue
Fixed-Time Synchronization of Complex-Valued Coupled Networks with Hybrid Perturbations via Quantized Control
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

RBFNN-Based Distributed Coverage Control on an Unknown Region

1
College of Automation & Artificial Intelligence, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
2
Jiangsu Engineering Center for IOT Intelligent Robots, Nanjing 210023, China
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(1), 111; https://doi.org/10.3390/math12010111
Submission received: 22 November 2023 / Revised: 15 December 2023 / Accepted: 25 December 2023 / Published: 28 December 2023
(This article belongs to the Special Issue Mathematic Control and Artificial Intelligence)

Abstract

:
In this paper, we investigate the problem of achieving distributed coverage control of a mobile sensor network on an unknown region using local measurements. To accomplish this objective, each sensor is equipped with two-layer dynamics. The upper layer dynamic employs a completely distributed observer algorithm on the target region for state estimation of the density function. The lower layer dynamic utilizes a radial basis function neural network-based motion algorithm, which involves only the estimated state obtained by the upper layer dynamics, to guide the sensors towards an optimal coverage configuration. We demonstrate that with only the joint detectability of the partial outputs measurement, it is possible to achieve distributed coverage control in the unknown region without requiring additional information about the density function, communication topology associated with the sensors, or coupling gains. Finally, two examples are used to validate the theoretical findings.

1. Introduction

As a branch of cooperative control of multi-agent systems [1,2,3,4], coverage control of multi-agent systems has gained more and more attention [5,6,7,8,9]. A decentralized control law [10] is proposed to drive a network of mobile robots to an optimal sensing configuration by using an adaptive control architecture. In [11], the blanket coverage problem is addressed to cover a long region by imposing the dynamics of the boundaries on an agent’s respective control law, which ensures the locally optimal partitioning for the moving coverage area. They achieve optimal coverage in static [10] and dynamic [11] regions, respectively, with known environmental information. However, the actual application region information is probably unknown. Much attention has been focused on the coverage control of the unidentified region. With unknown environment information, a robust coverage control algorithm [6] is proposed to guide the unicycle robot network to the ideal configuration on the basis of an approximated density function. For a sensor network, the main object of coverage control is to drive the sensors to distribute over a region while aggregating in locations of their high interest [12]. As we know, there is a considerable amount of literature on the coverage control of a region with multiple robots, such as [6,7,8,9]. A dynamic coverage planning technique is given in [7] which can eliminate the convexity requirement restrictions of the targeted area by using the K-means approach. In [8,9], a deep reinforcement learning algorithm is introduced to address the problem of region coverage and exploration. Voronoi partition is one of the common ways to study the coverage control of multi-agent systems [9,13]. Further, the communication between agents should be considered when mobile sensor networks cover the targeted area. A hybrid scheme is proposed that decouples the optimization of the coverage objective from the control of the communication variables, which optimizes coverage and the routing of information [14]. With the Voronoi partition method, a convex region can be divided into several Voronoi cells, where the number of the Voronoi cells is equal to the number of sensors with one sensor per cell. The underlying idea of Voronoi-based coverage control is to design a control algorithm for each sensor to minimize a cost function so as to drive the sensor to the optimal coverage configuration. Namely, in coverage control, all the sensors are not dropped or thrown into the targeted region randomly, but deployed on the optimal locations of the corresponding Voronoi cells. This remarkable feature makes the Voronoi-based coverage control widely used in practical applications, such as monitoring and surveillance of targeted ocean regions.
As was stated, for the coverage control of a sensor network, the optimal location of each sensor is affected by its interest element. Generally speaking, the interest elements are unknown, so that the observer-based coverage control turns out [6]. In [6], a state estimation algorithm [15] is firstly equipped on each sensor to estimate the information of the targeted region; then, the coverage control algorithm is given to guide the motion of each sensor to reach the optimal coverage configuration. However, on one hand, it is important to consider that covering control is often associated with targeted regions that are typically expansive. This presents challenges in guaranteeing the detectability or observability of the interest elements through a single sensor measurement. On the other hand, let us consider the example of monitoring an oceanic region. The environmental variables that need to be monitored encompass temperature, pH, salinity, chemical plumes, and more. Achieving measurements for these numerous elements using a single sensor is arduous, if not impossible. Given the aforementioned circumstances, the implementation of a distributed observer [16,17,18,19,20] becomes necessary. In a distributed observer, a network of sensors are strategically deployed in spatial locations to effectively measure the output information of the targeted system and simultaneously, a distributed state estimation algorithm is integrated into each sensor. Each sensor in the distributed observer has only to access partial outputs, that is, the local measurement. Then, the distributed communication among the sensors on estimated states can prompt the implementation of the state estimation. Under the condition that the information of the targeted region is unknown and only partial output information of the objects is available, this paper approximates the unknown density function of the targeted region by using the radial basis function neural network (RBFNN) and considers the distributed observer-based coverage control problem of a sensor network.

1.1. Contribution

In this paper, an RBFNN-based distributed coverage algorithm is designed for a mobile sensor network to form an optimal coverage configuration on a complex targeted region with multiple objects. The difficulties, significant manipulations, and innovations discussed in this work are summarized as follows:
  • For a convex targeted region with multiple objects, considering the fact that the objects are unknown and that they may be spread over a vast region, two-layer dynamics is endowed to each sensor, where the upper dynamics is a distributed observer algorithm implementing the state estimation on the targets via the partial measurement, while the lower dynamics aims at guiding the motion of the i-th sensor to cover the multiple objects in the targeted region. The fact that each sensor in the distributed observer has only to measure partial outputs of the objects makes the distributed observer well suited for state estimation of objects spread over a vast region.
  • Notice that the lower layer dynamics is a negative feedback law driving each sensor to reach its own optimal location. With the estimated state of the objects given by the distributed observer algorithm in the upper layer dynamics, the RBFNN is adopted to estimate the unknown density function of the targeted region; it further determines the optimal location of each sensor which is determined by a estimated state-based cost function. Hence, the coverage motion control algorithm constructed in this paper relates only to the partial output information of the objects.
  • The Voronoi partitions and coverage motion of sensors result in a sensor network with dynamic communication topology. To the best of our knowledge, the existing Voronoi-based coverage control results of sensor networks are dependent on the communication topology of the sensors. In this paper, in order to eliminate the dependence of the distributed observer algorithm in the upper layer dynamics of each sensor on this dynamic communication topology, Lyapunov functions irrelevant to the communication topology are constructed to solve the completely distributed observer-based coverage control problem, eliminating the influence of the changes in the communication topology.
With these significant manipulations, coverage control of a complex region with multiple objects is accomplished by a sensor network, where each sensor can only access a part of the outputs and the partial outputs are jointly detectable with the system matrix.

1.2. Organization

The rest of this paper is organized as follows. Section 2 presents some preliminaries and the problem statement and designs the motion control algorithm. By adopting RBFNN to estimate the unknown density function of the targeted region, Section 3 proposes a distributed observer-based coverage control. Further, a density function observer without any global topology information is designed by using the adaptive strategy. In Section 4, numerical simulations are provided to verify the theoretical results and Section 5 concludes the whole paper.

2. Preliminary and Problem Statement

2.1. Notation

R n × m denotes the set of n × m real matrices while I n is an identity matrix with n × n -dimensions. For a matrix A R n × m , ker ( A ) and im ( A ) denote its kernal and image, respectively. Moreover, ker ( A ) = { x R m × 1 | A x = 0 } and im ( A ) = { y R n × 1 | y = A x , x R m × 1 } ; A denotes the orthogonal complement of A. λ min ( A ) and λ max ( A ) stand for the minimum and maximal eigenvalues of A, respectively. · is the 2-norm of a matrix or a vector, and  A 0 means that matrix A is positive definite. Throughout the paper, γ 0 > 0 is an arbitrary constant.

2.2. Graph Theory

The communication topology of the N sensors is determined by the Voronoi partition { V 1 , , V N } of Ω . As declared in the previous statement, the i-th sensor is located in the Voronoi cell V i . Here, a triple G ( t ) = N ¯ , E ( t ) , W ( t ) is used to describe the communication relation, where N ¯ = { 1 , 2 , , N } is the node set with the element i denoting the i-th sensor. E ( t ) = ( i , j ) | i , j V is the time-varying edge set which varies as the Voronoi partition of Ω . In  E ( t ) , the unordered pairs of vertices present the undirected neighboring relations among the sensors. As stated in [9], ( i , j ) E ( t ) if the Voronoi cell V i and the Voronoi cell V j share a common edge. Associated with E ( t ) , the third element W ( t ) R N × N in G ( t ) , which is a matrix with time-varying element w i j ( t ) with i , j N ¯ , is defined as
w i j ( t ) = 1 , if ( i j ) E ( t ) , 0 , otherwise .
Then, the Laplacian matrix of G ( t ) can be defined as L ( t ) = ( l i j ( t ) ) R N × N with
l i j ( t ) = w i j ( t ) , if i j , j = 1 n w i j ( t ) , if i = j .
Moreover, based on the adjacency matrix, the time-varying neighbor set of i can be defined by N i ( t ) = j | w i j ( t ) = 1 , j i , i = 1 , , N ¯ .
Next, in correspondence with the previous statement, some algorithms which play a key role are given.

2.3. Problem Statement

In this paper, we aim at covering a complex targeted region Ω R 2 , which is convex and occupies multiple objects. Suppose that there are M Z objects in Ω , which are governed by the following dynamics:
x ˙ = A x ,
where x R n describes the state of the M objects; A is the system matrix with compatible dimension. Notice that the state x not only contains the positions of the M targets but also some other variables. Without loss of generality, we assume that the first 2 M elements in x, denoted by x p R 2 M × 1 , describe the positions of the M targets, where 2 M n and
x p T = x 1 x x 1 y x 2 x x 2 y x M x x M y ,
with x i x x i y T R 2 ( i = 1 , , M ) describing the position of the i-th target. For brevity, we denote x = x p x * T with x * R n 2 M being some other variables. For example, in the monitoring and surveillance of a targeted ocean region, x * may represent the temperature, pH, salinity, or chemical plumes.
If the targeted region Ω is known, the information of the objects is certainly known. In this case, as was done in [9], a density function with Gaussian distribution is given to describe the influence of the M objects on Ω , which is given as
ϕ ( q , x s ) = s = 1 M μ s ϕ s ( q , x s ) = s = 1 M μ s e q x s 2 2 σ s 2 = s = 1 M μ s e ( q x x s x ) 2 2 σ s 2 ( q y x s y ) 2 2 σ s 2 ,
where q = q x q y T is the position of the point of q Ω , x s = x s x x s y T is the position of the s-th object with s = 1 , , M , σ s > 0 is a spatial sensitivity coefficient which determines the height and width of the density function, μ s > 0 is the weight which describes the influence of the s-th object, and ϕ s ( q , x s ) = e q x s 2 2 σ s 2 is the influence model of the s-th object.
In this paper, a sensor network consisting of N agents is used to converge Ω , where each sensor regulates its location according to the following dynamics:
p ˙ i = u i , i = 1 , , N ,
where p i = p i x p i y T R 2 is the position of the i-th sensor, denoting its location; the u i R 2 to be designed is the velocity of the sensor which is to drive the sensor to an optimal location for the coverage of Ω in a distributed manner. What is more, the communication among the sensors is determined by the Voronoi partition of Ω .
Associated with the N sensors, Ω is partitioned into N Voronoi cells, denoted by { V 1 , , V N } . In particular, by [21], V i is defined as
V i = { q Ω | q p i     q p j , j i } ,
which indicates that the Voronoi partitions are determined by the positions of the sensors. Then, to describe the coverage effect of each agent, the cost function
Q ( p , V , t ) = i = 1 N V i f ( q p i ) ϕ ( q , t ) d q ,
is used to calculate the sensing performance at point q Ω around every position p i , which is the function of the Euclidean distance q p i . In (4), the distance function f ( q p i ) is defined as
f ( q p i ) = α e β q p i 2 ,
with α , β > 0 . Qualitatively, the larger the value of Q , the better the configuration for sensory coverage of the region Ω . Note that the density function ϕ ( q , t ) is affected by the position of the target.
However, on one hand, it is hard or even impossible to obtain the real position of each object directly in some practical applications, so that x s cannot be used directly in (2). How to obtain the position of each object via the output y is another problem to be considered. Beyond these factors, regarding the wide occupancy of the region, the output measurement cannot be carried out by a single agent but requires a network of agents. In this sensor network, each agent measures only a part of the outputs. In mathematical expression, the output measured by the i-th agent is
y i = C i x ,
where C i R p i × n contains a subset of the rows of C. The total measurement obtained by all sensors is the sum of all local measurements, shown as col i = 1 , , N { y i } = col i = 1 , , N { C i } x . On the other hand, the targeted region is unknown which implies that in (2), μ s is unknown, so novel approaches are required to estimate it. Next, the RBFNN is introduced to calculate the density function of the targeted region.

2.4. RBFNN-Based Estimation on ϕ s ( q , x s )

In this subsection, our objective is to enhance the estimation of the density function for the targeted region using the RBFNN technique. Denote the estimated state of the i-th sensor on x by x ^ i . Then, the estimated state of the i-th sensor on x s by x ^ i , s . In this case, the RBFNN-based estimation on the density function ϕ s ( q , x s ) is carried out on the ϕ s ( q , x ^ i , s ) .
The analysis is performed assuming that the pair ( C i , A ) is jointly detectable, but it is important to note that the detectability of ( C i , A ) cannot be guaranteed. Therefore, a detectability decomposition is necessary. By [17], let f A ( s ) = det ( s I n A ) = 0 denote the characteristic polynomial of matrix A. Factor it as f A ( s ) = f A + ( s ) f A ( s ) , where f A + ( s ) and f A ( s ) are the polynomials with roots in the closed right and open left half-planes of the complex plane, respectively. Then, the undetectable subspace of ( C i , A ) is given by U i = = 1 n ker C i A ker f A + ( A ) . Let ϱ i be the dimension of U i and U i be the orthogonal complement of U i , where
0 ϱ i n .
Denote U i R n × ϱ i and let D i R n × ( n ϱ i ) be the matrices whose columns are the orthogonal bases of U i and U i , respectively. Then, one can obtain U i = im ( U i ) = ker D i T . Let T i = D i U i R n × n ; then, T i is an orthogonal matrix which can induce that
T i T A T i = A i d 0 A i r A i u , C i T i = C i d 0 ,
where A i d R ( n ϱ i ) × ( n ϱ i ) , A i r R ϱ i × ( n ϱ i ) , A i u R ϱ i × ϱ i , C i d R p i × ( n ϱ i ) and the pair ( C i d , A i d ) is detectable, so that one can choose K i d R ( n ϱ i ) × p i to make A i d + K i d C i d Hurwitz. Hence, the observer of i-th agent is designed as follows
x ^ ˙ i = A x ^ i K i y i C i x ^ i + γ U i U i T j N i ( t ) w i j ( t ) x ^ j x ^ i ,
where K i = T i K i d 0 , γ > 0 is the coupling gain of the i-th agent; x ^ i is the estimated state of the i-th sensor on x.
Based on x ^ i in (7), the estimated state of the s-th object which is gained by the i-th sensor is x ^ i , s ; the influence model of the M objects in (2) turns to
ϕ q , x ^ i , s = μ s ϕ s q , x ^ i , s , t = μ s e q x ^ i , s 2 2 σ s 2 = μ s e q x x ^ i , s x 2 2 σ s 2 q y x ^ i , s y 2 2 σ s 2 .
Even though one can obtain the state of the objects by using only the partial output information, the weight μ s is unknown and then ϕ q , x ^ s is still unknown. Yet, for a completely unknown region, the influence model ϕ s ( · ) is unknown. Thus, it is impossible to obtain ϕ q , x ^ i , s in (8) even though x ^ i , s is known. In the following, RBFNN used in [22,23,24] is introduced to approximate the continuous function ϕ q , x ^ i , s : R 2 R . Motivated by [22,23,24], ϕ q , x ^ i , s can be approximated by
ϕ ^ q , x ^ i , s = W * T Φ ( x ^ i , s ) + ϵ ,
where Φ ( x ^ i , s ) = Φ 1 ( x ^ i , s ) , Φ 2 ( x ^ i , s ) , , Φ l ( x ^ i , s ) T : R 2 R l with l 1 being the NN node number; W * R is the optimal weight vector, which is defined by
W * = arg min W ^ sup x ^ i , s ϕ ^ q , x ^ i , s W ^ T Φ ( x ^ i , s ) ,
where W ^ is the estimation of W * . Notice that, by [25], the more NN nodes, the more accurate the approximation will be. Note also that for  k = 1 , 2 , , Φ k ( x ^ i , s ) is selected as the generic Gaussian function as follows
Φ k ( x ^ i , s ) = e x ^ i , s γ k 2 η k 2 , k = 1 , 2 , , ,
where γ k and η k express, respectively, the center and spread. ϵ is an approximation error bounded on Π , namely, ϵ ϵ * with ϵ * > 0 being an unknown constant.

2.5. Motion Control Algorithm Design

With ϕ ^ q , x ^ i , s in (9), the cost function in (4) becomes
Q ^ ( p , V ) = i = 1 N V i f ( q p i ) ϕ ^ q , x ^ i , s d q ,
which induces
Q ^ p i = V i p i f ( q p i ) ϕ ^ q , x ^ i , s d q = V i p i α e β q p i 2 ϕ ^ q , x ^ i , s d q = V i 2 ( q p i ) α β e β q p i 2 ϕ ^ q , x ^ i , s d q = M ^ V i C ^ V i p i .
For brevity, we denote
M ^ V i = V i 2 β f ( q p i ) ϕ ^ ( q ) d q , L ^ V i = V i 2 β q f ( q p i ) ϕ ^ ( q ) d q , C ^ V i = L ^ V i / M ^ V i ,
where M ^ V i and C ^ V i are generalized mass and generalized centroid of Voronoi cell V i , respectively. Through the locational optimization function Q ^ , the agents’ local optimum point is the centroid of Voronoi cell, i.e., the critical points of Q ^ correspond to the configurations such that p i = C ^ V i , i = 1 , , N , that is to say, the agents’ location converges asymptotically to the set of centroidal Voronoi configurations on Ω . Then, the controller u i can be designed as
u i = k M ^ V i C ^ V i p i ,
where k > 0 is an arbitrary constant.

3. RBFNN-Based Distributed Coverage Control

As stated in Section 2, the controller is designed to drive sensors to cover multiple objects in the targeted region. This section aims at illustrating the performance of the distributed observer for state estimation on the targets and the conditions to be met, which is stated in the following theorems. RBFNN-based distributed coverage control can be designed for the system in (1) by implementing Algorithm 1.
Algorithm 1 Technical procedure for RBFNN-based distributed coverage control algorithm
Do detectability decomposition for the system in (1); there exists an orthogonal matrix T i , such that
T i T A T i = A i d 0 A i r A i u , C i T i = C i d 0 ,
and the pair C i d , A i d is detectable.
Choose K i d to make A i d + K i d C i d Hurwitz.
For the system in (1), design a distributed observer as shown in (7).
Approximate the density function in (9) by using RBFNN.
Based on step ➃, use the cost function in (12) to design the controller u i in (15).
Let e i = x ^ i x ; then, it follows from (1) and (7) that
e ˙ i = ( A + K i C i ) e i + γ U i U i T j N i ( t ) w i j ( t ) e j e i .
First, a model transformation on e i is given. Let e ˜ i = T i 1 e i = D i T U i T e i = e ˜ i d e ˜ i u , then one has
e ˜ ˙ i d = A i d + K i d C i d e ˜ i d , e ˜ ˙ i u = A i r e ˜ i d + A i u e ˜ i u γ U i T j = 1 N l i j ( t ) D j e ˜ j d + U j e ˜ j u ,
Define some diagonal matrices as follows:
U = diag { U 1 , , U N } , D = diag { D 1 , , D N } , A d = diag { A 1 d , , A N d } , A r = diag { A 1 r , , A N r } , A u = diag { A 1 u , , A N u } , K d = diag { K 1 d , , K N d } , C d = diag { C 1 d , , C N d } .
Then, (17) becomes
e ˜ ˙ d = A d + K d C d e ˜ d , e ˜ ˙ u = A r e ˜ d + A u e ˜ u γ U T ( L ( t ) I m ) D e ˜ d + U e ˜ u .
Before moving on, Lemma 1 is introduced.
Lemma 1
([17]). Suppose that G ( t ) = V , E ( t ) is strongly connected for t 0 ; then, the following statements are equivalent:
(i) 
( C i , A ) is jointly detectable;
(ii) 
U T ( L I n ) U is positive definite;
(iii) 
U T ( L I n ) U is nonsingular.
Theorem 1.
Consider a multi-agent system (1) with N mobile sensors communicating through the graph G ( t ) , where each sensor is governed by the control law in (15) with M ^ V i and C ^ V i determined by x ^ i in (7). Suppose that ( C i , A ) is jointly detectable, and γ in (7) satisfies
γ > 1 + A u + γ 0 λ min U T L I m U , γ 0 > 0
then, each agent can converge to an optimal coverage configuration, i.e, lim t p i C V i = 0 .
Proof. 
Define a Lyapunov function as V 1 p , C ^ v , x , x ^ = V 1 a + V 1 b , where
V 1 a = 1 2 i = 1 N V i f ( p i q 2 ) ψ ( q ) d q , V 1 b = e ˜ d T P 1 e ˜ d + e ˜ u T e ˜ u ,
with P 1 0 being the solution of the Lyapunov equation
0 = A d + K d C d T P 1 + P 1 A d + K d C d + M 1 I n N i = 1 N ϱ i , M 1 = A r 2 + 2 γ U T ( L I m ) D 2 + γ 0 .
By (18), one has
d d t e ˜ d T P 1 e ˜ d = e ˜ d T A d + K d C d T P 1 e ˜ d + e ˜ d T P 1 A d + K d C d e ˜ d = M 1 e ˜ d T e ˜ d , d d t e ˜ u T e ˜ u = 2 e ˜ u T A r T e ˜ d 2 e ˜ u T γ U T ( L I m ) D e ˜ d + 2 e ˜ u T A u e ˜ u 2 e ˜ u T γ U T ( L I m ) U e ˜ u 2 e ˜ u T e ˜ u + A r 2 e ˜ d T e ˜ d + 2 γ U T ( L I m ) D 2 e ˜ d T e ˜ d + 2 A u e ˜ u T e ˜ u 2 γ e ˜ u T λ min U T ( L I m ) U e ˜ u = A r 2 + 2 γ U T ( L I m ) D 2 e ˜ d T e ˜ d + 2 1 + A u γ λ min U T ( L I m ) U e ˜ u T e ˜ u ,
so that
V ˙ 1 b = d d t e ˜ d T P 1 e ˜ d + d d t e ˜ u T e ˜ u M 1 e ˜ d T e ˜ d + A r 2 + 2 γ U T ( L I m ) D 2 e ˜ d T e ˜ d + 2 1 + A u γ λ min U T ( L I m ) U e ˜ u T e ˜ u = γ 0 e ˜ d T e ˜ d 2 γ 0 e ˜ u T e ˜ u .
On the other hand, there holds
V ˙ 1 a = i = 1 N M ^ V i ( C ^ V i p i ) T ψ ^ ( q ) p ˙ i = k i = 1 N M ^ V i ( C ^ V i p i ) 2 .
By Lemma 1, one can obtain that U T ( L I m ) U 0 . Therefore, under γ in (19), there holds
V ˙ 1 k i = 1 N M ^ V i ( C ^ V i p i ) 2 γ 0 e ˜ d T e ˜ d 2 γ 0 e ˜ u T e ˜ u ,
which implies lim t p i = C ^ V i for i = 1 , , N , lim t e ˜ d = 0 and lim t e ˜ u = 0 . Notice that lim t e ˜ d = 0 and lim t e ˜ u = 0 together indicate that lim t x x ^ i = 0 and lim t ϕ ^ q , x ^ i , s = ϕ ^ q , x i , s for i = 1 , , N . On the other hand, with the help of the RBFNN-based estimation mentioned above, one has lim t ϕ ^ q , x i , s = ϕ q , x i , s . Thus, one can obtain that lim t ϕ ^ q , x ^ i , s = ϕ q , x i , s for i = 1 , , N , so that lim t C ^ V i = C V i for i = 1 , , N , which further results in lim t p i = C V i for i = 1 , , N . □
The limitation of the distributed observer design is that γ needs to satisfy the condition in (19), which is related to the information of system matrix and communication topology. However, the details of the system matrix and communication topology may be unknown. To further optimize the results in Theorem 1, inspired by [17], the dynamic coupling gains are introduced, so that (7) can be improved as
x ^ ˙ i = A x ^ i K i y i C i x ^ i + γ i U i U i T j N i w i j ( t ) x ^ j x ^ i ,
where γ i > 0 is the adaptive coupling gain, which updates itself according to the following dynamics:
γ ˙ i = U i T j N i w i j ( t ) x ^ j x ^ i 2 = U i T j N i l i j ( t ) D j e ˜ j d + U j e ˜ j u 2 .
Define
ξ i d = e ˜ i d , ξ i u = U i T j N i l i j D j e ˜ j d + U j e ˜ j u ,
then one has
γ ˙ i = ξ i u T ξ i u .
Furthermore, denote Γ = diag γ 1 I ρ i , , γ N I ρ N , e ˜ d = col i = 1 , , N { e ˜ i d } and likewise define ξ d , ξ u as well as e ˜ d in a similar manner. Then, (18) can be written as the following form
e ˜ ˙ d = A d + K d C d e ˜ d , e ˜ ˙ u = A r e ˜ d + A u e ˜ u Γ U T ( L ( t ) I n ) D e ˜ d + U e ˜ u .
By (22), one has
ξ = ξ d ξ u = I n N i = 1 N ϱ i 0 U T ( L ( t ) I n ) D U T ( L I n ) U e ˜ d e ˜ u .
Hereafter, denote Δ = U T ( L I n ) U for convenience. Based on Lemma 1 in the foregoing, Δ is positive definite; there holds
I n N i = 1 N ϱ i 0 U T ( L I n ) D Δ 1 = I n N i = 1 N ϱ i 0 Δ 1 U T ( L I n ) D Δ 1 ,
which further induces that
e ˜ d = ξ d , e ˜ u = Δ 1 U T ( L I n ) D ξ d ξ u ,
and so there holds that
ξ ˙ = A d + K d C d 0 Θ Δ A u Δ 1 Δ Γ ξ ,
with Θ = U T ( L I n ) D ( A d + K d C d ) + Δ A r Δ A u Δ 1 U T L I n D . Note that in (25), Δ is nonsingular. Hence, the stability of ξ is equivalent to that of the system in (24). Define a Lyapunov equation with the unique solution P 2 as follows:
A d + K d C d T P 2 + P 2 A d + K d C d = M 2 I n N i = 1 N ϱ i
where M 2 = 1 + Θ 2 .
Theorem 2.
Consider a multi-agent system (1) with N mobile sensors communicating through the graph G ( t ) , where each sensor is governed by the control law in (15) with M ^ V i and C ^ V i determined by x ^ i in (20). Suppose that ( C i , A ) is jointly detectable; then, each agent can converge to an optimal coverage configuration, i.e, lim t p i C V i = 0 .
Proof. 
Define a Lyapunov function as V 2 p , C ^ v , x , x ^ = V 2 a + V 2 b , where
V 2 a = 1 2 i = 1 N V i f ( p i q 2 ) ψ ( q ) d q + ξ d T P 2 ξ d + ξ u T ξ u , V 2 b = i = 1 N λ m i n Δ γ i γ * 2 ,
with γ * > 0 being a sufficiently large constant to be determined later. Then, the derivatives of V 2 a and V 2 b along (27) yield
V ˙ 2 a i = 1 N M ^ V i ( C ^ V i p i ) T ψ ^ ( q ) p ˙ i M 2 ξ d T ξ d + ξ d T Θ T Θ ξ d + ξ u T 1 + Δ A u Δ 1 T + Δ A u Δ 1 Γ Δ Δ Γ ξ u k i = 1 N M ^ V i ( C ^ V i p i ) 2 M 2 ξ d T ξ d + Θ 2 ξ d T ξ d + 1 + 2 A u ξ u T ξ u ξ u T Γ Δ + Δ Γ ξ u , V ˙ 2 b = 2 i = 1 N λ m i n Δ γ i γ * ξ i u T ξ i u = 2 λ m i n Δ ξ u T Γ ξ u 2 γ * ξ u T ξ u ξ u T Γ Δ + Δ Γ ξ u 2 γ * ξ u T ξ u .
For the arbitrarily positive constant γ * > 0 , we choose it as γ * > 1 2 1 + 2 A u + γ 0 without loss of generality. Hence, one has
V ˙ 2 k i = 1 N M ^ V i ( C ^ V i p i ) 2 ξ d T ξ d γ 0 ξ u T ξ u 0 .
Similar to the analysis in the proof of Theorem 1, one can obtain that each sensor converges to an optimal coverage configuration. □
Remark 1.
The coverage motion, the Voronoi partitions, and the edge set defined in Section 2.2 together result in a sensor network with dynamic communication topology. Yet, it is worth noting that the communication topology only switches from a connected graph to another connected one, different from the joint connectivity case in [26]. In this paper, Lyapunov functions (i.e., V 1 and V 2 ) irrelevant to the changes of the topology are constructed to eliminate the influence of the topology changes on the distributed observer-based coverage control.

4. Numerical Simulation and Discussion

In this section, numerical simulations are firstly given to verify the obtained theoretical results and then further discussions are provided to analyze the simulation results in more depth.

4.1. Numerical Example

In this subsection, two simulation examples are given. In detail, the targeted region Q is chosen to be a 100 × 100 square. μ s in (8) is chosen as μ s = 3 for s = 1 , , M . For the performance function Q ^ ( p , V ) defined in (12), α and β are chosen as α = 0.005 and β = 0.001 , respectively. A sensor network consisting of N = 8 sensors is used to implement coverage control of the targeted region.
Example 1.
In this example, there are M = 2 objects considered in Q, whose trajectory is described by (1) with
A = diag A ˜ 1 , A ˜ 2 , A ˜ i = 0 0 1 0 0 0 0 2 0 0 0 0.8 0 0 0 0 , s = 1 , 2 .
The initial states of the two objects are chosen as
x 1 ( 0 ) = 30 70 1 1 T ,   x 2 ( 0 ) = 82 30 1.5 1.5 T .
The measurement of each agent in (5) is chosen as
C i = S i I 2 0 2 × 2 , i = 1 , 2 , , 8
where
S 1 = 1 0 , S 2 = 1 1 , S 3 = 1 0 , S 4 = 0 1 , S 5 = 1 1 , S 6 = 1 0 , S 7 = 0 1 , S 8 = 1 1 .
It is verified that col i V ( C i ) , A is observable. With the detectability decomposition on ( C i , A i ) , one can obtain the detectable pair ( C i d , A i d ) which is shown as follows:
A 1 d = A 2 d = 0 0.8 0 0 0 0 0 0 0 1 0 0 1 0 0 0 , C 1 d = 0 0 0 1 0 0 1 0 , C 2 d = 0 0 0 1.414 0 0 1.414 0 .
Choose K i d as
K 1 d = 7.8774 2.1377 0.8593 5.6633 5.0502 3.8080 4.3367 1.4494 , K 2 d = 5.5702 1.5116 0.6076 4.0045 3.5710 2.6927 3.0665 1.0249 ,
then, A i d + K i d C i d is Hurwitz for i = 1 , 2 . With the Voronoi partitions on the given convex region Ω, the communication topology associated with the eight sensors can be obtained and it is connected. As mentioned above, the initial locations of these sensors are
p 1 ( 0 ) = 82 95 , p 2 ( 0 ) = 10 54 , p 3 ( 0 ) = 25 89 , p 4 ( 0 ) = 68 46 , p 5 ( 0 ) = 62 81 , p 6 ( 0 ) = 50 50 , p 7 ( 0 ) = 76 24 , p 8 ( 0 ) = 17 20 .
By the definitions in Section 2.2, the Laplacian matrix of this sensor network at t = 0 is
L ( 0 ) = 4 1 1 0 0 0 1 1 1 4 1 1 0 0 0 1 1 1 4 1 1 0 0 0 0 1 1 4 1 1 0 0 0 0 1 1 4 1 1 0 0 0 0 1 1 4 1 1 1 0 0 0 1 1 4 1 1 1 0 0 0 1 1 4 ,
The smallest nonzero eigenvalue of L ( 0 ) is 2.5858 , which indicates that the sensor network is connected.
Figure 1a depicts the trajectory of x ^ i x which shows that the distributed observer can accurately estimate the states of two objects, and the estimation error x ^ i x approaches 0 as t goes to . As shown in Figure 1b, the adaptive parameter γ i increases and converges to a constant. Figure 2a,b, respectively, illustrate that C ^ V i p i and C ^ V i C V i converge to 0 as t goes to . This indicates that p i approaches C V i as t goes to and the density function of the unknown targeted region can be well learned by the RBFNN. The sensors are denoted by eight solid blue-green dots in Figure 3 which illustrates that as time goes on, sensors driven by the underlying dynamic model can coverage the two objects successfully.
Example 2.
A multi-agent system consisting of M = 6 agents is considered as the multiple objects in Q. The trajectories of these six objects are steered by (1) with A as follows:
A = diag A ˜ 1 , , A ˜ 6 , A ˜ i = 0 0 1 0 0 0 0 1 0 0 0 0.47 0 0 0.44 0 , s = 1 , , 6 .
The initial states of the six objects are chosen as
x 1 ( 0 ) = 30 70 4 2 T , x 2 ( 0 ) = 30 65 4 2 T , x 3 ( 0 ) = 25 70 4 2 T , x 4 ( 0 ) = 35 75 3 1 T , x 5 ( 0 ) = 67 47 3 1 T , x 6 ( 0 ) = 70 50 3 1 T .
The measurement of each agent in (5) is chosen as
C i = S i I 2 0 2 × 2 , i = 1 , 2 , , 8
where
S 1 = 1 0 0 0 0 0 , S 2 = 1 1 0 0 0 0 , S 3 = 1 0 1 0 0 0 , S 4 = 1 0 0 1 0 0 , S 5 = 1 0 0 0 1 0 , S 6 = 1 0 0 0 0 1 , S 7 = 0 1 1 0 0 0 , S 8 = 0 0 1 1 0 0 .
It is verified that col i V ( C i ) , A is observable. With the help of the detectability decomposition on ( C i , A ) , one can obtain the corresponding detectable pair ( C i d , A i d ) as follows:
A i d = 0 0.47 0 0 0.44 0 0 0 0 1 0 0 1 0 0 0 , A 8 d = 0 0.47 0 0 0.44 0 0 0 0 1 0 0 1 0 0 0 ,
with i = 1 , , 7 and
C 1 d = C 3 d = C 4 d = C 5 d = C 6 d = 0 0 0 1 0 0 1 0 , C 2 d = C 7 d = C 8 d = 0 0 0 1.414 0 0 1.414 0 .
Choose K i d as
K 1 d = 7.7503 1.5061 0.0013 5.8995 3.0130 2.8508 4.1105 0.9052 T , K 6 d = 5.4802 1.0649 0.0009 4.1715 2.1305 2.0158 2.8995 0.6401 T , K i d = 5.4803 1.0649 0.0009 4.1716 2.1305 2.0158 2.8995 0.6401 T , i = 2 , 3 , 4 , 5 , 7 , 8 ,
then the matrix A i d + K i d C i d ( i = 1 , 2 , , 8 ) is Hurwitz. The initial states of x ^ i and γ i ( 0 ) are both chosen as in Example 1.
As shown in Figure 4a, the completely distributed observer in (20) can recover the state of (1). Also, as x ^ i x converges to 0 as time goes to , γ ˙ i goes to 0, so that γ i converges to a constant, which is verified by Figure 4b. Figure 5a,b depict the trajectory of C ^ V i p i and that of C ^ V i C V i , respectively, for i = 1 , , 8 . As shown in these two figures, C ^ V i p i and C ^ V i C V i both converge to 0 as time goes to for i = 1 , , 8 , which further indicates that p i approaches C V i as t .
Figure 6 describes the snapshots of coverage control with dynamic topology, where the topology changes as the Voronoi partitions vary. In Figure 6, M = 6 objects are represented by yellow five-pointed stars and N = 8 sensors are denoted by the solid blue-green dots. Under the guideline of (15) with k = 1 , the sensors move to their optimal locations, so as to reach an optimal coverage configuration, which is shown as Figure 6d. All these figures verify Theorem 2.

4.2. Discussions

In the previous section, two examples verify the theorems and illustrate the feasibility of the algorithm. Some important relevant points are stated in the following.
(i)
The initial state of the objects can be arbitrarily chosen in the targeted region which does not interfere with the performance of state estimation and the optimal location of each sensor.
(ii)
In principle, the selection of K i d only needs to make A i d + K i d C i d Hurwitz, but in order to meet the practical application, different K i d can be chosen to control the convergence rate which is determined by the eigenvalues of A i d + K i d C i d .
(iii)
In Theorem 1, the coupling gain γ should satisfy the condition (19), which is more than 16.808 at t = 0 . However, due to the fact that the topology of sensors is dynamic and changing, the adaptive coupling gain in (21) is designed to avoid real-time acquisition of topology information.
(iv)
The existing results are compared to distributed state estimation-based coverage control, such as in References [6,15], where the measurement of a sensor can ensure the observability of the output matrix with the system matrix. Yet, it is not economical or even possible to carry out the output measurement by using one sensor. In this paper, each sensor only needs to measure partial outputs with no need to guarantee the observability, but the consensus-based communication among the sensors can compensate for the state estimation on the targets. Such manipulations make the distributed observer well suited for targeted objects with high dimension (such as a power network) or those occupying a wide and decentralized region (such as a multi-agent system consisting of six UAVs flying in a given formation).

5. Conclusions

The coverage control of a sensor network aims at driving sensors to spread over a targeted region and simultaneously minimizing a cost function. For a convex targeted region with multiple objects, considering the fact that the objects are unknown and that they may be spread over a vast region, two-layer dynamics are endowed to each sensor, the upper layer dynamics and the lower layer one. In detail, the upper layer dynamics is a distributed observer, which is used to accomplish the state estimation on the objects. In a distributed observer, each sensor can only measure partial outputs of the objects, which makes it well suited for objects spread over a vast region. The lower layer dynamics is a negative feedback law guiding each sensor to reach its own optimal location, where the optimal location of each sensor is determined by a estimated state-based cost function. As the estimated states converge to the real state, the estimated state-based cost function approaches the real cost function, so that the optimal locations of the sensors can minimize the real cost function. This paper provides a new, more rational, and more realistic perspective for Voronoi based-coverage control.

Author Contributions

Writing—original draft, A.Z.; Writing—review and editing, A.Z. and X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Natural Science Fund for Excellent Young Scholars of Jiangsu Province under Grant No. BK20220104.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
NNNeural Network
RBFNNRadial Basis Function Neural Network

References

  1. Wang, Z.; Jusup, M.; Shi, L.; Lee, J.H.; Iwasa, Y.; Boccaletti, S. Exploiting a cognitive bias promotes cooperation in social dilemma experiments. Nat. Commun. 2018, 9, 2954. [Google Scholar] [CrossRef] [PubMed]
  2. Wang, J.; Liu, D.; Feng, J.; Zhao, Y. Distributed Optimization Control for Heterogeneous Multiagent Systems under Directed Topologies. Mathematics 2023, 11, 1479. [Google Scholar] [CrossRef]
  3. Liu, B.; Wang, X.; Su, H.; Gao, Y.; Wang, L. Adaptive second-order consensus of multi-agent systems with heterogeneous nonlinear dynamics and time-varying delays. Neurocomputing 2013, 118, 289–300. [Google Scholar] [CrossRef]
  4. Xu, C.; Zeng, W.; Liu, C.; Yan, H. Event-triggered semi-global output consensus of discrete-time multi-agent systems with input saturation and external disturbances. IEEE Trans. Circuits Syst. II Express Briefs 2023, 70, 4469–4473. [Google Scholar] [CrossRef]
  5. Tnunay, H.; Moussa, K.; Hably, A.; Marchand, N. Distributed Finite-Time Coverage Control of Multi-Quadrotor Systems with Switching Topology. Mathematics 2023, 11, 2621. [Google Scholar] [CrossRef]
  6. Sun, Q.; Chi, M.; Liu, Z.W.; He, D. Observer-Based coverage control of unicycle mobile robot network in dynamic environment. J. Frankl. Inst. 2023, 360, 9015–9027. [Google Scholar] [CrossRef]
  7. Yu, D.; Xu, H.; Chen, C.P.; Bai, W.; Wang, Z. Dynamic coverage control based on k-means. IEEE Trans. Ind. Electron. 2021, 69, 5333–5341. [Google Scholar] [CrossRef]
  8. Sun, Z.; Wang, N.; Lin, H.; Zhou, X. Persistent coverage of UAVs based on deep reinforcement learning with wonderful life utility. Neurocomputing 2023, 521, 137–145. [Google Scholar] [CrossRef]
  9. Hu, J.; Niu, H.; Carrasco, J.; Lennox, B.; Arvin, F. Voronoi-based multi-robot autonomous exploration in unknown environments via deep reinforcement learning. IEEE Trans. Veh. Technol. 2020, 69, 14413–14423. [Google Scholar] [CrossRef]
  10. Schwager, M.; Rus, D.; Slotine, J.J. Decentralized, adaptive coverage control for networked robots. Int. J. Robot. Res. 2009, 28, 357–375. [Google Scholar] [CrossRef]
  11. Abbasi, F.; Mesbahi, A.; Velni, J.M. A new voronoi-based blanket coverage control method for moving sensor networks. IEEE Trans. Control Syst. Technol. 2017, 27, 409–417. [Google Scholar] [CrossRef]
  12. Wang, B. Coverage Control in Sensor Networks; Springer Science & Business Media: New York, NY, USA, 2010. [Google Scholar]
  13. Luo, K.; Chi, M.; Chen, J.; Guan, Z.H.; Cai, C.X.; Zhang, D.X. Distributed coordination of multiple mobile actuators for pollution neutralization. Neurocomputing 2018, 316, 10–19. [Google Scholar] [CrossRef]
  14. Kantaros, Y.; Zavlanos, M.M. Distributed communication-aware coverage control by mobile sensor networks. Automatica 2016, 63, 209–220. [Google Scholar] [CrossRef]
  15. Zuo, L.; Yan, W.; Cui, R.; Chen, W.; Bai, X. Coverage control of multiple ocean vehicles for environment monitoring with energy constraints. In Proceedings of the OCEANS 2014-TAIPEI, Taipei, Taiwan, 7–10 April 2014; pp. 1–6. [Google Scholar]
  16. Wang, L.; Morse, A.S. A distributed observer for a time-invariant linear system. IEEE Trans. Autom. Control 2017, 63, 2123–2130. [Google Scholar] [CrossRef]
  17. Kim, T.; Lee, C.; Shim, H. Completely decentralized design of distributed observer for linear systems. IEEE Trans. Autom. Control 2019, 65, 4664–4678. [Google Scholar] [CrossRef]
  18. Wang, X.; Jiang, G.P.; Su, H.; Zeng, Z. Consensus-based distributed reduced-order observer design for LTI systems. IEEE Trans. Cybern. 2020, 52, 6331–6341. [Google Scholar] [CrossRef]
  19. Wang, X.; Su, H.; Zhang, F.; Chen, G. A Robust Distributed Interval Observer for LTI Systems. IEEE Trans. Autom. Control 2023, 68, 1337–1352. [Google Scholar] [CrossRef]
  20. Wang, X.; Fan, Z.; Wang, L.; Su, H.; Lam, J. Fully distributed observer design for mobile targets. IEEE Trans. Netw. Sci. Eng. 2023, 10, 1696–1708. [Google Scholar] [CrossRef]
  21. Du, Q.; Faber, V.; Gunzburger, M. Centroidal Voronoi tessellations: Applications and algorithms. SIAM Rev. 1999, 41, 637–676. [Google Scholar] [CrossRef]
  22. Haykin, S. Neural Networks: A Comprehensive Foundation; Prentice Hall PTR: Hoboken, NJ, USA, 1998. [Google Scholar]
  23. Yang, X.; Wan, X.; Zunshui, C.; Cao, J.; Liu, Y.; Rutkowski, L. Synchronization of switched discrete-time neural networks via quantized output control with actuator fault. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 4191–4201. [Google Scholar] [CrossRef]
  24. Chen, Z.; Huang, F.; Chen, W.; Zhang, J.; Sun, W.; Chen, J.; Zhu, S. RBFNN-based adaptive sliding mode control design for delayed nonlinear multilateral telerobotic system with cooperative manipulation. IEEE Trans. Ind. Inform. 2019, 16, 1236–1247. [Google Scholar] [CrossRef]
  25. Huang, J.T. Global tracking control of strict-feedback systems using neural networks. IEEE Trans. Neural Netw. Learn. Syst. 2012, 23, 1714–1725. [Google Scholar] [CrossRef] [PubMed]
  26. Chen, Z.; Zhang, H.T. A remark on collective circular motion of heterogeneous multi-agents. Automatica 2013, 49, 1236–1241. [Google Scholar] [CrossRef]
Figure 1. Simulations of system (29) with the distributed observer designed in (20).
Figure 1. Simulations of system (29) with the distributed observer designed in (20).
Mathematics 12 00111 g001
Figure 2. Simulations of system (29) with coverage control algorithm in (15).
Figure 2. Simulations of system (29) with coverage control algorithm in (15).
Mathematics 12 00111 g002
Figure 3. Snapshots of coverage control with dynamic topology. The M = 2 objects and N = 8 sensors are denoted by two yellow spots and 8 solid blue-green dots, respectively.
Figure 3. Snapshots of coverage control with dynamic topology. The M = 2 objects and N = 8 sensors are denoted by two yellow spots and 8 solid blue-green dots, respectively.
Mathematics 12 00111 g003
Figure 4. Simulations of system (32) with the distributed observer designed in (20).
Figure 4. Simulations of system (32) with the distributed observer designed in (20).
Mathematics 12 00111 g004
Figure 5. Simulations of system (32) with coverage control algorithm in (15).
Figure 5. Simulations of system (32) with coverage control algorithm in (15).
Mathematics 12 00111 g005
Figure 6. Snapshots of coverage control with dynamic topology. The 6 objects and 8 sensors are denoted by 6 yellow five-pointed stars and 8 solid blue-green dots, respectively.
Figure 6. Snapshots of coverage control with dynamic topology. The 6 objects and 8 sensors are denoted by 6 yellow five-pointed stars and 8 solid blue-green dots, respectively.
Mathematics 12 00111 g006aMathematics 12 00111 g006b
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, A.; Wang, X. RBFNN-Based Distributed Coverage Control on an Unknown Region. Mathematics 2024, 12, 111. https://doi.org/10.3390/math12010111

AMA Style

Zhang A, Wang X. RBFNN-Based Distributed Coverage Control on an Unknown Region. Mathematics. 2024; 12(1):111. https://doi.org/10.3390/math12010111

Chicago/Turabian Style

Zhang, Ankang, and Xiaoling Wang. 2024. "RBFNN-Based Distributed Coverage Control on an Unknown Region" Mathematics 12, no. 1: 111. https://doi.org/10.3390/math12010111

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop