Next Article in Journal
Study on the Effect of Different Transcranial Pulse Current Stimulation Intervention Programs for Eliminating Physical Fatigue
Previous Article in Journal
A System for a Real-Time Electronic Component Detection and Classification on a Conveyor Belt
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Adaptive and Bounded Controller for Formation Control of Multi-Agent Systems with Communication Break

College of Weapon Engineering, Naval University of Engineering, Wuhan 430034, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2022, 12(11), 5602; https://doi.org/10.3390/app12115602
Submission received: 18 April 2022 / Revised: 24 May 2022 / Accepted: 25 May 2022 / Published: 31 May 2022
(This article belongs to the Topic Motion Planning and Control for Robotics)

Abstract

:
Aiming at maneuvering, input saturation, and communication interference in the controller design for formation control multi-agent systems, a novel nonlinear bounded controller is proposed. Based on coordinates transformation, reference information is processed, and nonlinear effects of maneuvering are analyzed. Then a nonlinear controller is established with graph theory, consensus algorithm, and Lyapunov method, which guarantee the stability of the controller. For input saturation avoidance, adaptive parameters are put forward with the Lyapunov function. Considering the communication breaks, various conditions of the sensing graph are discussed for stable formation control, and a dynamic programming regulator is proposed for unknown position reference needed for formation keeping. Comparison with the traditional consensus method is provided in numerical simulation to verify the stability and feasibility of the proposed strategy.

1. Introduction

Formation control [1,2] is crucial for multi-agent systems. Previously, switching topology [3,4], actuator faults [5], noise resistance [6], time constraints [7,8], connectivity maintenance [9], and communication delay [10] have been widely studied. Inspired by this research, a stable and feasible controller is critical for the formation control of multi-agent systems. Although various controllers have been proposed, nonlinearity caused by maneuvering [11,12,13] and saturation [5,14] is still the challenge. Besides, communication interference will lead to bad control due to the lack of reference information, which means extra strategies should be applied for stable formation [4]. Therefore, a comprehensive survey of bounded controllers considering maneuvering and communication interference is necessary. Previously, optimization-based methods and consensus algorithms with graph theory have been widely studied for controller design.
As for controller design, Ref. [14] proposed the inverse optimality method (IOM) with Hamilton–Jacobi–Bellman (HJB) equations, which provides a feasible bounded controller for formation control theoretically, but HJB equations are difficult to be solved for a simple control law. In [15,16], intelligent algorithms have been used to optimize velocities using centralized computing systems. However, time consumption is a big challenge for intelligent algorithms and saturation constraints are hard to be incorporated into the objective functions. Recently, model predictive control (MPC) [17,18] has been widely used for formation control, various requirements can be converted into inequality constraints. MPC is easy to implement programmatically, the challenge is the central computer will assume a great burden with a large number of agents. Then, distributed MPC (DMPC) has been proposed [19,20] for low time cost. With less computation and the advantage of bounded outputs, DMPC has attracted much attention now. The problem with DMPC is that there must exist nonlinear constraints considering dynamic constraints, such as maximum velocity and angular rate, which might cause complex computation. Furthermore, communication interferences will affect MPC and DMPC due to the lack of sufficient information for objective functions and some constraints.
Recently, consensus algorithms (CA) [12,21,22] with graph theory have been extensively researched. With the Laplacian matrix, relationships among agents can be easy to be incorporated into stability analysis with Lyapunov functions (LF), thus the formation switch and stability can be solved simultaneously. CA can apply measurement besides the interacted data to design control laws; what needs to be done is obtaining reference information with limited sources under communication interference [23]. The practicability of CA lies in linear control with linear feedback technique (LFT), with adaptive parameters [5] and sufficient information, the formation can move stably as required. Therefore, insufficient measuring data [1] might cause instability without cooperation, and strategies should be proposed for sufficient reference. To tackle this problem, rigid theory [1,8,11] has been introduced for limited sensors, the formation is expected to be predefined well for obtaining relative bearings or distances solely. However, communication systems and sensors might be equipped for stability in hostile environments, and strategies for obtaining expected information with limited sources can be useful in this situation. More importantly, agents encountering communication break needs to know how many sensing sources are required at least.
Motivated by the above analysis, this paper focuses on the formation control with the following purposes: (1) Establishing a control law with definitely bounded outputs based on LFT technology; thus, the problem of saturation can be avoided by adjusting parameters. (2) Searching for a strategy with insufficient reference, agents encountering communication breaks will obtain sensing data and obtain reference expectedly. In this paper, information processing is introduced for the nonlinear maneuvers [2,11]. Through the coordinate’s transformation, virtual reference information including velocity and acceleration are estimated. Then, a nonlinear controller is proposed with the Lyapunov method and the CA algorithm. Given the input saturation, parameters tuning principles of the controller are established based on the Lyapunov method. As for communication breaks, a sensing graph is introduced, and the requirements of the sensing graph are given in terms of various situations. Then, the Laplacian matrix is computed, and the interference resistance ability of agents can be enhanced with the Laplacian matrices of the communication graph. Considering the bearing information only [1], a nonlinear dynamic programming regulator for positions estimation is proposed. Finally, three tests are carried out and the proposed law is proved stable and feasible.
The contributions of this paper are listed below.
(1)
A definite bounded control law is proposed for anti-saturation control. The nonlinearity caused by maneuvers is considered compared with [24], then a controller with nonlinear parameters is proposed in this paper. Those parameters are designed as functions of states, and the values space of critical parameters is analyzed subsequently. Differing from the use of auxiliary variables in previous literature, parameters of reference are altered adaptively according to the value of inputs, which will be more applicable and easier to be realized.
(2)
A nonlinear dynamic programming regulator is proposed for reference with bearings only for some agents encountering breaks. Instead of expected locations change for all agents, we expect to change the expected locations of agents encountering saturation only for sensing measurements. Thus, most parts of the formation won’t be altered if they are not expected to be changed.
The remaining sections are organized as follows. Section 2 describes the problems, and a robust controller is proposed in Section 3. Then, Section 4 uses three tests to verify the proposed strategy. The conclusion is given in Section 5.
Notation: we use r × c to indicate the matrix space of which the matrix has r rows and c columns. n denotes the n -dimensional vector space. r c represents the stacked matrix concatenated with elements indicated in the bracket, and r c r × c . D 1 indicates the first-time derivative, and D 2 indicates the second-time derivative. rank represents the rank of the matrix in the bracket. The matrix operator denotes the Hadamard product and indicates the Kronecker product operator. 2 indicates the L2 norm form of the vector, and x M = x T M x represents the M norm form of the vector, where M denotes the symmetric positive definite matrix. Λ L indicates the diagonal matrix with size L d 1 × L d 2 , where d 1 , d 2 is the dimension of the diagonal element in the bracket.

2. Preliminaries

2.1. Graph Theory

For agents i in the agents set 𝒩 a = 1 , 2 , i , N with a predefined formation shape, we define the communication graph as 𝒢 = 𝒱 ,   , where 𝒱 = v i | i 𝒩 a indicates the vertices, and E V   ×   V represents the edges. Then, adjacent matrix A = a i j N N is obtained and we have a i j > 0 if v i , v j   i , j 𝒩 a , otherwise a i j = 0 and a i i = 0 . The Laplacian matrix is denoted as 𝓛 = l i j N N with l i i = i j a i j , and l i j = a i j , i j .

2.2. Numerical Models and Input Saturation

The dynamic model of agents is given by
    X ˙ t i       = f q t i f q t i = v t i c o s ψ t i , v t i s i n ψ t i       q ˙ t i       = u t i       u t i       = a t i , ω t i T  
where X t i = x t i ,   y t i denotes positions, q t i = v t i , ψ t i T represents the velocity expressed with the speed v t i 0 and the direction ψ t i 2 π , 2 π , u t i = a t i , ω t i T indicates the accelerations that will be designed for formation control. The model is nonlinear while considering the maneuvering. Thus, the estimation of u t i is vital.
In this paper, CA is applied for controller design, and the distributed feedback law can be indicated as:
u i = a i j k i j F j Θ j k i i F i Θ i + w i j p i j
where u i indicates the desired acceleration,   F i represents the processing functions of interacted information, Θ i = X t i , q t i , u t i denotes the interacted information, p i j n × 1 indicates the expected relative position, k i j n × m , w i j n   represent the weight matrix, n indicates the number of the controlled variables, and m represents the dimension of F i Θ i . Combining 𝓛 , Equation 2 can be rewritten as:
u = K 𝓛 I n × m F + P A 1 n × 1 1 n N × 1 F = F 1 Θ 1 T , F i Θ i T , , F N Θ N T T P = p 1 T , p 2 T , p i T , , p N T T
where K = k i j N n N m , p i = w i j p i j n N , I n × m = I n n m . Where I n represents the identity matrix, 1 n × 1 and 1 n N × 1 are constant matrices with value 1.
Considering the input saturation, the desired output u in 3 might not be feasible, and agents cannot realize the desired accelerations provided by the controller. Generally, constraints are placed behind the controller, thus the real inputs u f of the actuators are restricted. Defining the velocity boundary as q l i m , then the constraints can be indicated with 4 , where k denotes the k-th element.
u f i k = u i k , u i k + q t i k q l i m i k u f i k = u i k , u i k + q t i k > q l i m i k u f i k = u i k , u i k + q t i k < q l i m i k
Constraints in 4 help restrict the outputs, but the parameters of the controller cannot be adjusted timely without adaptive rules, which will hinder the controller from making correct decisions. In this paper, adaptive rules for parameters are proposed in Section 3.2, and u will be confined by u l i m .

2.3. Effect of Communication Break

Given that communication interferences impact interactions among agents, the expected value of coefficients of 𝓛 could not be kept unchanged. If all agents can obtain the planned reference information with faults when there exist interferences, methods in [5] can be adopted for motion control. However, there might be communication breaks, and the rank of 𝓛 must meet the requirements for formation keeping. Firstly, we give the primary requirements of healthy 𝒢 .
Lemma 1 [6].
If 𝒢 has a directed spanning forest, then we have r a n k 𝓛 = N N l , where N l represents the number of distributed networks.
Proof of Lemma 1 can refer to Remark 2.1 in the literature [6].
Lemma 2.
If there exist communication interferences and r a n k 𝓛 < N N l , where N l represents the number of distributed networks, then agents cannot keep the expected formation without extra reference information, such as the measurement.
Proof. 
According to [6], the Laplacian matrix 𝓛 of 𝒢 can be split as:
𝓛 = 0 N l × N l 0 N l × N N l L R L F
Based on 5 , we have rank 𝓛 N N l . Referring to 3 , each row of 𝓛 must have non-zero elements if we want all agents can obtain sufficient interacted information for formation control. Next, we assume rank 𝓛 < N N l , then there must exist at least one row of 𝓛 1 = L R L F in which all elements equal 0, which contradicts the demand in Lemma 2. Therefore, Lemma 2 is proved. □
Based on Lemma 2, we have rank 𝓛 < N N l if the communication interference (link failure, block, or strike) causes communication breaks, the system will get out of control. Thus, extra strategies should be provided, which will be discussed in Section 3.3.

3. Main Results

3.1. Reference Information Processing

Firstly, transformations between the inertial coordinate system   o x y and the velocity coordinate systems (VOSs) o k x k y k | k 𝒩 a as shown in Figure 1 are introduced for nonlinear controller design. We use 𝓣 0 i | i 𝒩 a to indicate the transformation from o x y to VOS, where 𝓣 0 i = T ψ t i , and T ψ t i is the rotation matrix.
Assume that the positions can be communicated among agents, we can directly compute q t j with D 1 X t j = f q t j . Then u t j can be estimated by D 1 q t j . However, for the measured information, it is more convenient to estimate q t j , u t j in VOSs. The relation between X i j t and q t j can be indicated as:
D 1 𝓣 0 i X i j t = v t j cos ψ r i v t i v t j sin ψ r i + D 1 𝓣 0 i X i j t
where ψ r i = g ψ t i ψ t j , and g φ = φ if φ π , otherwise g φ = g φ s i g n φ 2 π , where s i g n · is the sign function.
Referring to 6 , v t j can be estimated by
v t j = D 1 𝓣 0 i X i j t D 1 𝓣 0 i X i j t + v t i 0 2
Since we obtain v t j , ψ t j can be computed later. Thus q t j is estimated; similarly, we can obtain u t j . If more information can be interacted, such as q t j or u t j , then unbiased estimations of q t j or u t j can be realized with the above differential models and extended Kalman filter (EKF).

3.2. Nonlinear Controller Design and Stability Analysis

As can be seen from 6 , a high-order controller is required considering the maneuvers during formation acquisition and keeping. In this section, we search for a nonlinear controller without input saturation. Using adaptive parameters, model 3 can be rewritten as u = 𝓵 f K , 𝓛 F P A 1 n × 1 1 n N × 1 , where F i Θ i = X t i , q t i , u t i T , and 𝓵 f K , 𝓛 indicates the adaptive adjusting function of K and 𝓛 .
𝓵 f K , 𝓛 = l i j k i j N n × N m
Now, we propose the controller with Proposition 1.
Proposition 1.
For multi-agent systems with strongly connected undirected graph 𝒢 , the principle of 𝓵 f K , 𝓛 and P in 8 , the controller defined in Equation 3 can guarantee the finite-time stability of the system regardless of communication interference.
k i i = k i j 1 C K , 0 2 × 4             + k i j 2 l i i v t i v t i a i j k s , v t i a i j k c , 1 , 0 1 × 3 a i j k c , a i j k s , 0 , 0 1 × 3 k i j = k i j 1 C K , 0 2 , c o s ψ r i , 0 sin ψ r i , k i j 1             + k i j 2 v t i v t i k s , v t i k c , v t i c o s ψ r i , 0 1 × 3 k c , k s , sin ψ r i , 0 1 × 3 C K = 1 v t i v t i c o s ψ t i , v t i s i n ψ t i s i n ψ t i , c o s ψ t i w i j = k i j 1 C K + k i j 2 v t i v t i k s , v t i k c k c , k s
where k s = s i n ψ t i ψ ˙ t j , k c = c o s ψ t i ψ ˙ t j , and k i j 1 and k i j 2 are positive real numbers.
Before proceeding to the proof of Proposition 1, we introduce Lemma 3.
Lemma 3 [25,26].
If quadratic scalar function V t concerning the state of the system satisfies the inequality V t k + 1 V t k V 0 <   t 0 < t k < t k + 1 , the system is progressively stable, where V 0 is the value at the initial time t 0 . The inequality is equivalent to V ˙ t 0 , and the equality holds when t k τ , where τ > t 0 is a positive real number.
Lemma 4 [27,28].
If a power scalar function V x is continuously differentiable along x with x n , and the first-order derivative satisfies V x γ 1 V α x γ 1 V x γ 1 V β x + b V , where k = 1 , 2 , 3 γ k 0 , 0 < α < 1 < β , b V is a bounded positive number. Then x will approach the equilibrium state in a finite time.
Next, we give the proof of Proposition 1.
Proof. 
Firstly, we introduce the combined vector X i t , Z i t , and permutation matrix P e t , where X i t = X i j t T 1 n N T , Z i t = X i j t n N , and P e t satisfies:
Z i t P e t = Z i 𝓛 t Z i 𝒪 t
where Z i 𝓛 t = X i j k t n L , Z i 𝒪 t = X i j k t n N L , k = 1 , 2 , , N . Z i 𝓛 t represents the interacted information from nearby agents, and Z i 𝒪 t denotes the information of agents without interaction which is impossible to obtain. Reshaping Z i t P e t yields:
ϕ Z i t P e t = X i 𝓛 t X i 𝒪 t T
where X i 𝓛 t = X i j k t T 1 n L , X i 𝒪 t = X i j k t n N .
Define the rotation matrices Q i t with 𝓣 0 i 𝓛 = Λ L 𝓣 0 j 1 .
Q i t = 𝓣 0 i 𝓛 0 0 0
Define the Lyapunov function V i t = V 1 i t + V 2 i t with
V 1 i t = 1 2 Q i t ϕ Z i t P e t T R 1 Q i t ϕ Z i t P e t V 2 i t = 1 2 q i t T R 2 q i t
where q i t = D 1 Q i t ϕ Z i t P e t , and matrices R 1 ,   R 2 satisfy:
R 1 = diag Λ L η j k a i j k , 0 N L I 2 R 2 = η 0 diag Λ L η j k a i j k , 0 N L I 2
With diag · being a diagonal matrix, 0 N L being a zero matrix, and η i 0 L + 1 being positive real numbers.
Taking the derivative of 12 , we have:
V ˙ 1 i t = Q i t ϕ Z i t P e t T R 1 q i t                   = 𝓣 0 i 𝓛 X i 𝓛 t T Λ L η j k a i j k I 2 q i 𝓛 t V ˙ 2 i t = η 0 q i 𝓛 t T Λ L η j k a i j k I 2 D 1 q i 𝓛 t
with q i 𝓛 t = D 1 𝓣 0 i 𝓛 X i 𝓛 t .
Then, we obtain
V ˙ i t = 𝓣 0 i 𝓛 X i 𝓛 t T Λ L η j k a i j k I 2 q i 𝓛 t                       + η 0 q i 𝓛 t T Λ L η j k a i j k I 2 D 1 q i 𝓛 t
Before proceeding to the next step, we rewrite u i as
u i = E i 𝓵 f K , 𝓛 F               E i P A 1 n × 1 1 n N × 1 = u i j k t
where E i = e 1 , , e N n × n N , e i = I 2 , e j i = 0 2 .
q i 𝓛 t can be rearranged as
q i 𝓛 t = D 1 𝓣 0 i 𝓛 X i 𝓛 t                   = 𝓣 0 i 𝓛 P e 1 X i 𝓛 t + β i 1 q i 𝓛 t + 1 L × 1 P 1 q t i
With q i 𝓛 t = q t j k T 1 n N T , P e 1 = 0 1 1 0 , P 1 = 1 0 0 0 , β j k 1 = c o s ψ r i j k 0 s i n ψ r i j k 0 , 𝓣 0 i 𝓛 = Λ L 𝓣 0 j k ψ ˙ t j k , β i 1 = Λ L β j k 1 .
Referring to (17) and (18), we have
D 1 q i 𝓛 t = I L P e T i j 𝓛 P 2 u i 𝓛 t + φ i 𝓛 q i 𝓛 t β i 2 D 1 q i 𝓛 t
With u i 𝓛 t = u i j k t T 1 n N T , P e = 0 1 1 0 , P 2 = 0 v t i 1 0 , β j k 2 = 1 0 s i n ψ r i j k v t i c o s ψ r i j k , T i j 𝓛 = Λ L T ψ r i j 1 , β i 2 = Λ L β j k 2 , φ i 𝓛 = Λ L ψ ˙ t j 1 P e 1 .
Subsequently, V ˙ i t can be rewritten as:
V ˙ i t = 𝓣 0 i 𝓛 X i 𝓛 t T Λ L η j k a i j k I 2 q i 𝓛 t                     + γ k I L P e T i j 𝓛 P 2 u i 𝓛 t                     γ k I L P e T i j 𝓛 P e P 1 P e D 1 q i 𝓛 t                     + γ k φ i 𝓛 q i 𝓛 t + I L P 1 D 1 q i 𝓛 t γ k = η 0 q i 𝓛 t T Λ L η j k a i j k I 2
where u i 𝓛 t can be rewritten as
u i 𝓛 t = K i 2 1 L × 1 P 1 k i 2 q t i + K i 3 q i 𝓛 t + K i 4 D 1 q i 𝓛 t       K i 1 λ i 1 𝓣 0 i + T ´ 0 i 𝓛 P e 1 T 𝓣 0 i X i 𝓛 t
With T ´ 0 i 𝓛 = l i i Λ L k i j k 2 λ i j k , K i 1 = Λ L k i j k 1 , K i 2 = k i j k 1 L 1 , K i 3 = Λ L k i j k 2 k i j k 2 , K i 4 = Λ L k i j k 3 .
Redefining k i i and k i j k as k i i = k i i 1 , k i i 2 , 0 3 and k i j k = k i j k 1 , k i j k 2 , k i j k 3 with
k i i 1 = j i k i j 1 λ i 1 𝓣 0 i j i k i j 2 λ i j P e 𝓣 0 i k i j k 1 = k i j 1 λ i 1 𝓣 0 i k i j 2 λ i j P e 𝓣 0 i k i i 2 = k i j 2 0 , k i j k 2 = 1 v t i c o s ψ r i j k v t i 0 sin ψ r i j k 0 k i j k 3 = 1 v t i c o s ψ r i j k v t i 0 sin ψ r i j k v t i
where λ i 1 = 1 l i i v t i v t i 0 0 1 , λ i j = 1 l i i v t i ψ ˙ t j v t i 0 0 ψ ˙ t j .
Next, we have
V ˙ i t = η 0 k = 1 L γ i j k P T k i j k 2 q t j k                       + η 0 q i 𝓛 t T Λ L η j k a i j k φ i 𝓛 q i 𝓛 t                       η 0 k = 1 L γ i j k P T l i i λ i j k P e 1 T 𝓣 0 i X i j k t                         η 0 k = 1 L γ i j k ζ i j k D 1 q t j k + γ i j k P T P 1 q t i                         + k = 1 L η j k η 0 k i j k 1 𝓣 0 j k X i j k t T γ i j k T
where
γ i j k = q i j k t T a i j k I 2 , P T = P e T ψ r i j k P 2 ζ i j k = P T k i j k 3 P e T ψ r i j k P e P 1 P e + P 1
Furthermore, V ˙ i t can be simplified as
V ˙ i t = η 0 k = 1 L γ i j k ζ ¯ i j k D 1 q t j k                       + η 0 q i 𝓛 t T Λ L η j k a i j k I 2 φ i 𝓛 q i 𝓛 t                       + k = 1 L η j k η 0 k i j k 1 𝓣 0 j k X i j k t T γ i j k T                       = k = 1 L η j k η 0 k i j k 1 𝓣 0 j k X i j k t T γ i j k T                       + η 0 q i 𝓛 t T Λ L η j k a i j k I 2 φ i 𝓛 q i 𝓛 t                       η 0 q i 𝓛 t T Λ L η j k a i j k I 2 q i 𝓛 t
where
ζ ¯ i j k = 𝓣 0 j P e 1 X i j k t T ψ r i j k T P 1 q t i + P 1 q t j k
Utilizing the equality q i 𝓛 t T Λ L η j k a i j k I 2 φ i 𝓛 q i 𝓛 t = 0 and choosing η j k = η 0 k i j k 1 , we have V ˙ i t 0 and the equality holds only if q i 𝓛 t equals zero. According to Lemma 3, the system must be asymptotical stable, and the tracking errors are decreasing. Thus, ϕ Z i t P e t is bounded and there exists a positive number B V that satisfies V 1 i t b V . Therefore, we have V ˙ i t k V V i t + k V B V , where k V is a positive real number. Consequently, V i t will converge to an equilibrium state in a finite time based on Lemma 4.
The proof of Proposition 1 is finished. □

3.3. Adaptive Rules for Parameters

Referring to 4 , we have v t v i t v + t   i 𝒩 a , where v t indicates the lower limit of v i t , and v + t > 0 is the upper limit. In addition, v t and v + t are time varying but satisfy the constraint v m v t v + t v m .
Theorem 1.
If V t is the quadratic scalar function with the derivative V ˙ t = π 1 V t ± V t π t , where π 1 > 0 and π t satisfies the inequation π m π t π t π + t π m , then V t is bounded by V t π m .
Proof. 
Handling the derivative, we have V ˙ t = V t π 1 V t ± π t . Using the condition π m π t π t π + t π m yields V ˙ t V t π 1 V t + π m , we obtain the result V t π m referring to Lemma 4. □
Proposition 2.
Define the max acceleration, angular rate, and velocity as a m , ω m , and v m , respectively, with a t i a i a m , ω t i ω i ω m , v t i v i v m . Then, the velocity of agents must be bounded if k i j k 1 = α j k d j k t and k i j k 2 = β j k k i j k 1 hold with α j k , β j k , and d j k satisfying the following equations.
α j k + α j k β j k ω j k 0.5 2 k i j k 2 v m 2 0.5 k i j k 2 v j k + a j k 2
2 k i j k 2 v m k i j k 2 v j k + a j k
Remark 1.
d j k is correlated to X i j t , because V t is asymptotically convergent, which means V j k t = η j k 𝓣 0 j k X i j k t T 𝓣 0 j k X i j k t + η 0 η j k q i j k t T q i j k t is decreasing. Here, we can choose the positive variable d j k t = d j k t 0 , where V j k t 0 d j k t 0 , so we can obtain V j k t d j k t 0 , thus X i j t d j k t 0 1 1 × 2 .
Remark 2.
Since v j k v m , k i j k 2 2 v m v j k a j k can be realized, which means we can obtain positive α j k and β j k for bounded controller design.
Proof of Proposition 2.
Based on the law 3 , we can obtain:
v ˙ t i = k = 1 L k i j k 1 k i j k 2 ψ ˙ t j k 𝓣 0 i X i j k t k = 1 L k i j k 2 v t i v t j k c o s ψ r i j k c o s ψ r i j k a t j k
Introducing Lyapunov function V 3 :
V 3 t = 1 2 v t i 2
Based on Equation 30 , the derivative of V 3 can be obtained:
V ˙ 3 t = 2 k p v v t i 2 + v t i k = 1 L c o s ψ r i j k a t j k                         + v t i k = 1 L k i j k 2 v t j k c o s ψ r i j k                         + v t i k = 1 L k i j k 1 k i j k 2 ψ ˙ t j k 𝓣 0 i X i j k t                         = 2 k p v v t i 2 + v t i k = 1 L π t i j k
where k p v = k = 1 L k i j k 2 , π t i = k i j k 1 k i j k 2 ψ ˙ t j k 𝓣 0 i X i j k t + c o s ψ r i j k a t j k + k i j k 2 v t j k c o s ψ r i j k . Using 𝓣 0 i X i j k t = T ψ r i j k 𝓣 0 i X i j k t and defining 𝓣 0 i X i j k t = x i j k e t , y i j k e t T , we have
π t i j k = s i n ψ r i j k a j k t c o s ψ r i j k b j k t
where
a j k t = k i j k 1 y i j k e t k i j k 2 x i j k e t ψ ˙ t j k b j t = a t j k k i j k 2 v t j k                       k i j k 1 x i j k e t + k i j k 2 y i j k e t ψ ˙ t j k
Firstly, we give the following in Equation (31):
s i n ψ r i j k a j k t c o s ψ r i j k b j k t a j k t 2 + b j k t 2
Assuming x = s i n ψ r i j k yields the function f x = a j k t x + b j k t 1 x 2 , 0 x 1 . Then, taking the derivative of f x , we have:
f ˙ x = a j k t b j k t x 1 x 2
Referring to (32), f x is maximum when x a j k t 2 + b j k t 2 = a j k t . Defining σ j k = α j k + α j k β j k ω j k , we have the following inequation based on (30) and Theorem 1:
π t i j k a j k t 2 + b j k t 2 2 σ j k 2 + k i j k 2 v j k + a j k 2 + 2 k i j k 2 v j k + a j k σ j k
Consequently, π t i j k 2 k p v v m holds and we can obtain v t i v m with π t i j k > 0 or v t i v m with π t i j k > 0 . Based   on   the   equality   a i j k t k + 1 = π t k i j k k i j k 2 v t k i and the inequality v t k i v 0 , we have v t k + 1 i v m with v t k i v m and π t k i j k 0 , or v t k + 1 i v m with v t k i v m and π t k i j k 0 . Thus, the velocity constraint can be satisfied with the condition v t k i v 0 , the challenge is how to make sure that X i j k t T X i j k t is decreasing. Given that v t k + 1 i decreases if X i j k t has a positive value, there must be a i j k t k + 1 0 and v t k + 1 i = v < v t k + 1 j k . The boundedness analysis of the velocity is finished.
As for the angular rate, we have:
ψ ˙ t i = k = 1 L k i j k 2 ψ ˙ t j k k i j k 1 𝓣 0 i X i j k t v t i k = 1 L k i j k 2 v t j k s i n ψ r i j k + s i n ψ r i j k a t j k v t i ψ ˙ t j k
Define ψ ˙ t i = k = 1 L θ t i j k with
v t i θ t i j k = c o s ψ r i j k k i j k 1 y i j k e t k i j k 2 x i j k e t ψ ˙ t j k = s i n ψ r i j k b j k t c o s ψ r i j k a j k t + v t i ψ ˙ t j k
We can obtain:
v t i θ t i j k π t i j k + v t i ψ ˙ t j k 2 k i j k 2 v m + v t i ψ ˙ t j k
Choosing k i j k 2 = v t i 2 v m ω m ψ ˙ t j k , we can obtain a group of α j k and β j k . Thus, Proposition 2 is proved and the input saturation is avoided. □

3.4. Communication Interference Resistance with Sensors

Based on Lemma 2, the nonlinear controller proposed in Section 3.3 might become invalid when agents encounter communication breaks as conditions of Proposition 1 cannot be met. Therefore, extra reference information should be provided, and sensors mounted on agents can complete the mission. In this paper, we discuss the requirements of the sensing graph and propose an estimator with only bearings measurement.
Firstly, we introduce the sensing graph 𝒢 s , which is indicated as 𝒱 s , s , where 𝒱 s = v s i | i 𝒩 a indicates the vertices, and s 𝒱 s × 𝒱 s represents the edges. Similar to 𝒢 , the adjacent matrix of 𝒢 s is defined as A s = a s i j N N , a s i j > 0 if agent i can sense information of agent j , otherwise a i j = 0 . Obviously, A s is not necessary to be symmetric. Subsequently, the Laplacian matrix 𝓛 s is indicated as 𝓛 s = l s i j N N , l s i j = a s i j if i j , otherwise l s i i = a s i j .
Stimulated by [22] restrictions should be put to 𝓛 s with insufficient measured information, otherwise, the proposed controller in Proposition 1 will be invalid. The primary restriction is given with Theorem 2.
Theorem 2.
Multi-agent systems can keep stable with r a n k 𝓛 < N N l and r a n k 𝓛 + 𝓛 s = N N l , if there exist communication interferences.
Theorem 2 can be proved with Lemma 1 and various restrictions for different sensors. Next, we will discuss these restrictions required for stable formation.
As seen from the reference information processing in Section 2.3, distance information [8] is necessary for formation control, even in [1,21,25,29], velocity measurement is required for controller design. Therefore, bearings alone cannot provide sufficient information without information fusion. In this paper, we propose an estimator for relative positions with bearings only, and we give the model of bearings b i j t   i , j s :
b i j t = X i j t X i j t 2
where X i j t = 𝓣 0 i X t j X t i .
Then, take derivatives of b i j t and we can obtain:
D 1 b i j t = D 1 X i j t X i j t 2 2 X i j t X i j t T D 1 X i j t X i j t 2 3
Remark 3.
Equations (37) and (38) are both underdetermined with bearing b i j t only. If the relative positions can be obtained [8], then b i j t is useless. If the velocity can be sensed [21], then Equation (38) is determined and D 1 X i j t can be obtained. The above analysis implies that the controller is valid with bearings information if Equations (37) and (38) can be solved, which means we must obtain X i j t with Equations (37) and (38), which can be considered as restrictions on 𝓛 s in various situations.
Situation 1.
Only bearings can be transmitted among agents or obtained by detecting, then we can obtain  X i j t with Equations(37) and (38) if  r a n k 𝓛 + 𝓛 s = N N l holds, where the number of positive elements must be greater than n for each row of 𝓛 + 𝓛 s .
Situation 2.
Extra reference can be transmitted among agents or obtained by detecting beside bearings, such as distance or velocity, then we can obtain  X i j t with Equations(37) and (38) if  r a n k 𝓛 + 𝓛 s = N N l holds.
Situation 3.
Although extra reference can be transmitted among agents or obtained by detecting besides bearings, systems encounter communication breaks and some agents lose interaction with others, then we can obtain  X i j t with Equations(37) and (38) if  r a n k 𝓛 + 𝓛 s = N N l holds, and the number of positive elements of the corresponding rows of 𝓛 + 𝓛 s must be greater than n for agents without any interaction.
According to the discussion, formation reconstruction might be necessary while encountering communication breaks. The autonomous system should adjust the motion parameters, thus agents without interaction could obtain sufficient measurement. With the above strategy, the estimator is proposed as:
A s i 1 1 × 2 n T H X i j t = 0
where A s i = a s i j 1 N represents the adjacent vector of agent i , H X i j t = h X i j t 2 n N 1 , and h X i j t is indicated as:
h X i j t = b i j t X i j t 2 X i j t D 1 b i j t D 1 X i j t X i j t 2 + X i j t X i j t T D 1 X i j t X i j t 2 3
Considering the nonlinearity and measured deviation, 38 might have no closed-form analytic solution. Therefore, the nonlinear programming method is applied here, and the corresponding regulator is defined as:
J = A s i 1 1 × n T G X Q s s . t .         A s i 1 1 × n T G b = 0
where Q s is a symmetric positive matrix.
G b = b i j t X i j t 2 X i j t n N 1 G X = D 1 b i j t D 1 X i j t X i j t 2 + X i j t X i j t T D 1 X i j t X i j t 2 3 n N 1
For agents that can keep communication with neighbors, the sensing information can be used for unbiased estimations. For agents without interaction (encountering communication breaks), sensing information is crucial for consensus keeping.

4. Numerical Simulation

4.1. Leader-Follower Model-Based Formation Control

Formation control with 8 agents is considered, each agent has a leader, and there is no communication break. The expected relative position is [−69.2820, 40] or [−69.2820, −40] in the reference frame, boundaries are v m a x = 30   m / s , ω m a x = 1   rad / s . Initial states of agents are listed in Table 1.
As seen in Figure 2, all agents can form the expected formation and keep formation with the proposed controller. Fluctuations happen between steps 400 and 600 when the formation switch is performed, and agents conduct maneuvering among step 500 and step 550 with ω = 0.01 rad / s . Compared with the traditional linear controller with constant parameters, results of which are shown in Figure 3, Figure 4, and Figure 5, the proposed controller can bound the velocity and angular rate according to Figure 6 and Figure 7, and frequent shock is avoided.

4.2. Formation Control with Communication Break

Boundaries remain unchanged, agent 2 encountered communication breaks between steps 500 and 550, and distance and velocity information can be sensed. Initial states of agents are listed in Table 2.
Since at step 0 the formation is not complete, each agent should have reasonable reference information before step 200. The maneuvering time is between 700 and 710, thus the velocity fluctuation exists in Figure 8. Agent 2 encountered a communication break between steps 500 and 550; therefore, the errors increase at step 500 for agent 2. Figure 8 and Figure 9 show that the boundaries of velocity and angular rate are satisfied. Figure 10 shows that communication breaks and maneuvering affect the formation to a certain extent, but the formation is kept ultimately. Considering that agent i can obtain relatively sufficient reference information from other nearby agents with sensors while encountering a communication break, agent i can keep pace with others to some extent. Figure 11 displays the track of the formation in this test.

4.3. Formation Keep with Bearings Only

Based on the second simulation, the worse condition is considered, agent i could not obtain distance and velocity referred information, which means agent i must realize formation keeping with only bearings information. In this part, we consider that agent 8 lost all communication from step 500 to 800, and it can detect the relative bearing of agent 2 and agent 6, the maneuvering starts at step 300, and finishes at step 400. The turning rate is changing based on ω = 0.01 t 100 5000 rad / s .
The duration of the communication interruption is longer compared with test two, and no distance and velocity information can be sensed. Therefore, the positional error of agent 8 is larger, which is shown with the bold line in Figure 12. However, fortunately, the positional error of agent 8 is decreasing slowly, although there exists only bearings information, the formation is kept to some extent, which is shown in Figure 13. Due to communication breaks, control of velocity and angular rate are not so smooth, which can be found in Figure 14 and Figure 15, but the boundaries are satisfied in general. As can be seen from Figure 14, the turning maneuvering impacts the formation a lot, the time-varying angular rate will cause shock, and agents behind will suffer more effects. In this paper, no devices damage is considered, but the formation control results of persistent communication interruption show that agents without communication can be capable of roughly tracking refereed agents, although the tracking is not ideal.

5. Conclusions

This paper surveys stable formation control for multi-agent systems with communication breaks due to link failure and communication barriers. Considering the nonlinearity caused by maneuvers during formation acquisition, a novel nonlinear controller is proposed based on the CA algorithm. Using the Lyapunov function method and the distance-based function, an adaptive rule of parameters of the controller is put forward for anti-saturation control. In addition, the conditions of reference information are analyzed. Based on the sensing graph, the feasibility of formation keeping with bearings only is considered, and a dynamic programming regulator is proposed for unknown reference (relative positions and velocity) estimation. The proposed regulator provides auxiliary reference information for the controller, and the formation is kept encountering communication breaks. To addresses collisions among agents, the artificial potential field method is utilized for the law revision. Mathematical and numerical analysis shows that the controller is feasible and stable. Future research will focus on precise and more efficient control with bearings information and formation switch in environments with obstacles.

Author Contributions

Conceptualization, Z.X.; Funding acquisition, J.X.; Investigation, Z.L.; Methodology, Z.X. and Y.L.; Software, Z.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China grant number [51679247].

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Gwihan, K.; Minh, H.T.; Hyo-Sung, A. Bearing-only control of directed cycle formations: Almost global convergence and hardware implementation. Int. J. Robust Nonlinear Control 2020, 30, 4789–4804. [Google Scholar] [CrossRef]
  2. Arijit, S.; Soumya, R.S.; Mangal, K. Nonlinear formation control strategies for agents without relative measurements under heterogeneous networks. Int. J. Robust Nonlinear Control 2017, 28, 1653–1671. [Google Scholar] [CrossRef]
  3. Xiwang, D.; Chao, S.; Guoqiang, H. Time-varying output formation control for linear multi-agent systems with switching topologies. Int. J. Robust Nonlinear Control 2016, 26, 3558–3579. [Google Scholar] [CrossRef]
  4. Daifeng, Z.; Haibin, D. Switching topology approach for UAV formation based on binary-tree network. J. Frankl. Inst. 2019, 356, 835–859. [Google Scholar] [CrossRef]
  5. Iman, S.; Khashayar, K. Actuator fault accommodation strategy for a team of multi-agent systems subject to switching topology. Automatica 2015, 62, 200–207. [Google Scholar] [CrossRef]
  6. JunHao, R.; XiaoFeng, Z. Containment Control of Multi-Agent Systems with Stochastic Multiplicative Noises. J. Syst. Sci. Complex. 2021. [Google Scholar] [CrossRef]
  7. Changduo, L.; Mingfeng, G.; Zhiwei, L.; Guang, L.; Feng, L. Predefined-time formation tracking control of networked marine surface vehicles. Control. Eng. Pract. 2021, 107, 104682. [Google Scholar] [CrossRef]
  8. Farhad, M.; Farzad, H.; Mahdi, B.; Marcio, D.Q. Finite-Time Rigidity-Based Formation Maneuvering of Multiagent Systems Using Distributed Finite-Time Velocity Estimators. IEEE Trans. Cybern. 2019, 49, 4473–4484. [Google Scholar] [CrossRef]
  9. Bong, S.P.; Sung, J.Y. Connectivity-maintaining and collision-avoiding performance function approach for robust leader–follower formation control of multiple uncertain underactuated surface vessels. Automatica 2021, 127, 109501. [Google Scholar] [CrossRef]
  10. Hamed, R.; Farzaneh, A. Motion synchronization in unmanned aircrafts formation control with communication delays. Commun. Nonlinear Sci. Numer. Simul. 2013, 18, 744–756. [Google Scholar] [CrossRef]
  11. Mohammad, A.D.; Mohammad, B.M. Communication free leader–follower formation control of unmanned aircraft systems. Robot. Auton. Syst. 2016, 80, 69–75. [Google Scholar] [CrossRef]
  12. Tagir, Z.M.; Rustem, A.M. Consensus-based cooperative control of parallel fixed-wing UAV formations via adaptive backstepping. Aerosp. Sci. Technol. 2021, 109, 106416. [Google Scholar] [CrossRef]
  13. Yusuf, K.; Kamesh, S.; Nicholas, R.G.; Atilla, D.; Frank, L. Distributed backstepping based control of multiple UAV formation flight subject to time delays. IET Control. Theory Appl. 2020, 14, 1628–1638. [Google Scholar] [CrossRef]
  14. Do, K.D. Bounded and inverse optimal formation stabilization of second-order agents. Automatica 2021, 123, 109367. [Google Scholar] [CrossRef]
  15. ShiKai, S.; Yu, P.; Chenglong, H.; Yu, D. Efficient path planning for UAV formation via comprehensively improved particle swarm optimization. ISA Trans. 2020, 97, 415–430. [Google Scholar] [CrossRef]
  16. AliNoormohammadi, A.; Mohammad, B.M.; Atena, S. Control of leader-follower formation and path planning of mobile robots using asexual reproduction optimization (ARO). Appl. Soft Comput. 2014, 14, 563–576. [Google Scholar] [CrossRef]
  17. Ahmed, T.H.; Anthony, J.M.; Sidney, N.G.; Mohamad, I.; Shahram, Y.; Camille, A.R. Solving multi-UAV dynamic encirclement via model predictive control. IEEE Trans. Control. Syst. Technol. 2015, 23, 2251–2265. [Google Scholar] [CrossRef]
  18. Ahmed, T.H.; Sidney, N.G.; Shahram, Y. Unmanned Aerial Vehicles Formation Using Learning Based Model Predictive Control. Asian J. Control 2018, 20, 1014–1026. [Google Scholar] [CrossRef]
  19. Qishao, W.; Zhisheng, D.; Yuezu, L.; Qingyun, W.; Guanrong, C. Linear quadratic optimal consensus of discrete-time multi-agent systems with optimal steady state: A distributed model predictive control approach. Automatica 2021, 127, 109505. [Google Scholar] [CrossRef]
  20. Qishao, W.; Zhisheng, D.; Jingyao, W.; Guanrong, C. LQ Synchronization of Discrete-Time Multi-Agent Systems: A Distributed Optimization Approach. IEEE Trans. Autom. Control 2019, 64, 5183–5190. [Google Scholar] [CrossRef]
  21. Choi, H.L.; Brunet, L.; How, J.P. Consensus-Based Decentralized Auctions for Robust Task Allocation. IEEE Trans. Robot. 2009, 25, 912–926. [Google Scholar] [CrossRef]
  22. Zhiqi, T.; Rita, C.; Tarek, H.; Carlos, S. Formation control of a leader–follower structure in three dimensional space using bearing measurements. Automatica 2021, 128, 109567. [Google Scholar] [CrossRef]
  23. Guanghui, W.; Zhisheng, D.; Wei, R.; Guanrong, C. Distributed consensus of multi-agent systems with general linear node dynamics and intermittent communications. Int. J. Robust Nonlinear Control 2014, 24, 2438–2457. [Google Scholar] [CrossRef]
  24. Xiaowen, F.; Jing, P.; Haixiang, W.; Xiaoguang, G. A formation maintenance and reconstruction method of UAV swarm based on distributed control. Aerosp. Sci. Technol. 2020, 104, 105981. [Google Scholar] [CrossRef]
  25. Michalska, H.; Mayne, D.Q. Robust receding horizon control of constrained nonlinear systems. IEEE Trans. Autom. Control 1993, 38, 1623–1633. [Google Scholar] [CrossRef]
  26. Sun, Z.; Dai, L.; Liu, K.; Xia, Y.; Johansson, K.H. Robust MPC for tracking constrained unicycle robots with additive disturbances. Automatica 2018, 90, 172–184. [Google Scholar] [CrossRef]
  27. Zongyu, Z.; Lin, T. Distributed robust finite-time nonlinear consensus protocols for multi-agent systems. Int. J. Syst. Sci. 2016, 47, 1366–1375. [Google Scholar] [CrossRef]
  28. Chenfeng, H.; Xianku, Z.; Guoqing, Z. Adaptive neural finite-time formation control for multiple underactuated vessels with actuator faults. Ocean. Eng. 2021, 222, 108556. [Google Scholar] [CrossRef]
  29. Florent, L.B.; Tarek, H.; Robert, M.; Claude, S. Observers for Position Estimation Using Bearing and Biased Velocity Information. In Sensing and Control for Autonomous Vehicles; Lecture Notes in Control and Information Sciences; Springer: Cham, Switzerland, 2017; Volume 474, pp. 3–23. [Google Scholar] [CrossRef]
Figure 1. Coordinate transformation.
Figure 1. Coordinate transformation.
Applsci 12 05602 g001
Figure 2. Position errors with the proposed controller.
Figure 2. Position errors with the proposed controller.
Applsci 12 05602 g002
Figure 3. Position errors with linear controller.
Figure 3. Position errors with linear controller.
Applsci 12 05602 g003
Figure 4. Velocity control result with linear controller.
Figure 4. Velocity control result with linear controller.
Applsci 12 05602 g004
Figure 5. Angular rate control result with linear controller.
Figure 5. Angular rate control result with linear controller.
Applsci 12 05602 g005
Figure 6. Velocity control with the proposed controller.
Figure 6. Velocity control with the proposed controller.
Applsci 12 05602 g006
Figure 7. Angular rate control with the proposed controller.
Figure 7. Angular rate control with the proposed controller.
Applsci 12 05602 g007
Figure 8. Velocity (test 2).
Figure 8. Velocity (test 2).
Applsci 12 05602 g008
Figure 9. Angular rate (test 2).
Figure 9. Angular rate (test 2).
Applsci 12 05602 g009
Figure 10. Position errors (test 2).
Figure 10. Position errors (test 2).
Applsci 12 05602 g010
Figure 11. Track with communication breaks (test 2).
Figure 11. Track with communication breaks (test 2).
Applsci 12 05602 g011
Figure 12. Position errors (test 3).
Figure 12. Position errors (test 3).
Applsci 12 05602 g012
Figure 13. Track with bearings only (test 3).
Figure 13. Track with bearings only (test 3).
Applsci 12 05602 g013
Figure 14. Velocity (test 3).
Figure 14. Velocity (test 3).
Applsci 12 05602 g014
Figure 15. Angular rate (test 3).
Figure 15. Angular rate (test 3).
Applsci 12 05602 g015
Table 1. Initial states of test 1.
Table 1. Initial states of test 1.
i X t 0 i m , m q t 0 i m / s , rad u t 0 i m / s 2 , rad / s
1(28.2, 0.7)(5, 0.6447)(0, 0)
2(49.6, 98.8)(5, 0.3731)(0, 0)
3(73.8, 31.1)(5, 0.7681)(0, 0)
4(60.0, 78.2)(5, 1.2662)(0, 0)
5(11.1, 57.9)(5, 0.5935)(0, 0)
6(87.0, 69.0)(5, 0.8136)(0, 0)
7(24.3, 34.3)(5, 0.1486)(0, 0)
8(54.5, 6.7)(5, 1.4280)(0, 0)
Table 2. Initial states of test 2.
Table 2. Initial states of test 2.
i X t 0 i m , m q t 0 i m / s , rad u t 0 i m / s 2 , rad / s
1(30.3, 81.7)(5, 2.4997)(0, 0)
2(−31.2, −67.5)(5, 2.0965)(0, 0)
3(−58.1, −47.2)(5, 1.4426)(0, 0)
4(−33.5, 28.8)(5, 5.8818)(0, 0)
5(−104.3, 11.8)(5, 4.2926)(0, 0)
6(−38.9, 55.0)(5, 6.0451)(0, 0)
7(−42.8, 0.38)(5, 2.7519)(0, 0)
8(−46.8, −34.1)(5, 5.9083)(0, 0)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Xiong, Z.; Liu, Z.; Luo, Y.; Xia, J. An Adaptive and Bounded Controller for Formation Control of Multi-Agent Systems with Communication Break. Appl. Sci. 2022, 12, 5602. https://doi.org/10.3390/app12115602

AMA Style

Xiong Z, Liu Z, Luo Y, Xia J. An Adaptive and Bounded Controller for Formation Control of Multi-Agent Systems with Communication Break. Applied Sciences. 2022; 12(11):5602. https://doi.org/10.3390/app12115602

Chicago/Turabian Style

Xiong, Zhigang, Zhong Liu, Yasong Luo, and Jiawei Xia. 2022. "An Adaptive and Bounded Controller for Formation Control of Multi-Agent Systems with Communication Break" Applied Sciences 12, no. 11: 5602. https://doi.org/10.3390/app12115602

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop