Next Article in Journal
Analysis of Beam Walk in Inter-Satellite Laser Link: Implications for Differential Wavefront Sensing in Gravitational Wave Detection
Previous Article in Journal
Cybernetic Model Design for the Qualification of Pharmaceutical Facilities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Data-Driven Containment Control for a Class of Nonlinear Multi-Agent Systems: A Model Free Adaptive Control Approach

1
School of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China
2
Division of Optical Communications, China Mobile Group Design Institute Co., Ltd., Beijing 100144, China
3
School of Automation, Being Information Science and Technology University, Beijing 100192, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(13), 5527; https://doi.org/10.3390/app14135527
Submission received: 16 April 2024 / Revised: 14 June 2024 / Accepted: 20 June 2024 / Published: 25 June 2024
(This article belongs to the Topic Distributed Optimization for Control)

Abstract

:
This paper studies the containment control problem of heterogeneous multi-agent systems (MASs) with multiple leaders. The follower agent dynamics are assumed to be unknown and nonlinear. First, each follower is transformed into an incremental data description based on the dynamic linearization technique. Then, a distributed model-free adaptive containment control law is proposed such that all followers will be driven into the convex hull of the leaders. Furthermore, the algorithm is extended to the time-switching and dynamic leaders case. As a data-driven approach, the proposed controller design uses only the received input and output (I/O) data of these agents rather than agent mathematical models. Finally, to test the potential in real applications, three representative examples considering various environment factors, including external disturbances, are simulated to show the effectiveness and resilience of this method.

1. Introduction

Inspired by the coordinated behavior in nature, such as the flocking of birds or migrating of herds, the distributed control of multi-agent systems has been studied with increasing attention [1]. Due to the distributed and flexible structure, the MASs based methods have great potential in solving the coordination problem for complex systems. The applications have been reported in various fields, including vehicle cooperation [2], multi-robot systems [3], and traffic signal control [4]. Until now, numerous algorithms have been designed concerning three types of coordinated tasks, i.e., consensus control, formation control, and containment control [5]. Unlike the single leader setting in consensus or formation control, containment control is regarded as more challenging and requires all the followers to converge to a convex hull spanned by multiple leaders. Containment control also has many potential applications. For instance, a group of robots try to travel across an unsafe tunnel, but only a portion of them are equipped with sensors [6]. Then, the ones with sensors can act as leaders and the others will ensure safety by staying in the area constructed by leaders.
According to the dynamics of agents, the results of containment control can be roughly classified into four categories: (1) the first-order integral system [7,8]; (2) the second-order or high-order integral system [6,9,10,11,12]; (3) the general linear system [13,14,15]; (4) the nonlinear system [16,17,18,19,20]. The pioneering work for a collection of single-integrator agents is investigated in [7], where a “stop-go” rule based hybrid control scheme is designed for leaders, whereas consensus-like local interaction rules are applied for followers. However, the result of [7] is only discussed under a fixed undirected topology. To explore the effect of dynamic communication, the containment problem of single-integrator agents under switching directed network topologies is considered in [8]. In [9], a type of feedback control law with velocity measurements is proposed for double-integrator MASs and an experimental platform consisting of five wheeled mobile robots is used to validate the result. To reduce the communication burden, [10] developed a finite-time containment control algorithm for double-integrator MASs using only position measurements. In [11], the effects of time-varying delays are taken into consideration, where the linear matrix inequality method and the Lyapunov functional method are jointly used to ensure the containment. In [12], the noise in agent information transmission is formulated and corresponding time-varying state feedback containment control laws are proposed for both first-order and second-order integral MASs. As a extension of [12], ref. [6] considers both measurement noise and polynomial disturbance in high-order integral MASs, where a containment controller is designed with a proportional term and the nth-order integral terms.
Recent work begins to focus on the general linear or nonlinear MASs. In [13], based on the relative outputs of neighboring agents, dynamic observer-type containment controllers are built for both continuous-time and discrete-time general linear MASs. The results of time-varying uncertainties and constant time delays for general linear MASs are shown in [14,15], respectively. As a special kind of nonlinear agents, the rigid body MASs formed by Euler–Lagrange expression is studied in [16], in which a sliding-mode estimator based containment controller is given. Considering the parametric uncertainties of Euler–Lagrange MASs, [17] develops a modified sliding mode containment controller based on the one in [16] without using velocity information. Some results of more general nonlinear agents have also been reported. In [18], the fuzzy logic is utilized to approximate the dynamics of Lipschitz nonlinear MASs and an observer-based containment controller is designed. In [19], sliding mode controllers are proposed to address the finite-time containment problem of MASs with Lipschiz-like nonlinearity and external disturbances. Based on neural networks (NNs) and backstepping techniques, a finite-time containment controller is applied to the high-order nonlinear MASs with dynamic uncertainties in [20].
Although massive achievements has been obtained in [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20], some limitations still exist for the current algorithms. The containment control schemes used in [6,7,8,9,10,11,12,13,14,15] are designed for known linear agents. However, in coordination tasks, the practical MASs always suffer from the effects of unmodelled dynamics and nonlinearities. Thus, the control laws in [6,7,8,9,10,11,12,13,14,15] may face performance deterioration in real implementation. The methods presented in [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20] require that the controlled MASs should be the affine nonlinear system, i.e., the control input term is assumed to be known and linear. Furthermore, the NNs based controller needs extra training processes [20], and the selection of fuzzy membership functions in [18] heavily depends on the experience of users. Therefore, the containment control design for unknown nonlinear MASs is still an open and challenging problem.
Model-free adaptive control (MFAC), initially proposed in [21], is an effective approach to deal with unknown nonlinear systems. With 20 years of development, MFAC has now provided a systematic control framework with a wide range of engineering applications [22,23,24,25,26,27,28,29,30,31,32,33]. The partial-form dynamic linearization based MFAC controller is designed for a class of single-input and single-output unknown nonlinear system in [22], and it is extended to the multi-input and multi-output system in [23]. Based on the view of an ideal controller, the controller dynamic linearization based MFAC is proposed in [24] and then enhanced by radial basis function NNs in [25]. To address the repetitive tasks over a finite time interval, model-free adaptive iterative learning control (MFAILC) is presented in [26]. It is reported in [27] that MFAC has been applied to over 150 yields. The latest applications and advances can be found in [28,29,30,31,32], including weld penetration [28], steam-water heat exchanger [29], traffic systems [30,31], and the stability analysis of full-form dynamic linearization based MFAC [32]. For the brief review of MFAC, readers are referred to [27,33].
MFAC has also been employed in the coordination of MASs [34,35,36,37,38,39,40,41,42]. MFAC is first used to address the consensus problem for a group of unknown heterogenous nonlinear MASs in [34]. Based on the controller dynamic linearization technique and Newton-type optimization method, a data-driven ideal leader–follower consensus tracking algorithm is designed in [35]. Then, the consensus tracking issue under repetitive environment is solved by using MFAILC in [36]. In [37], a MFAC based predictive consensus scheme is presented for MASs with communication constraints. In [38], the consensus problem for the MASs under time delays is solved by the Smith estimator enhanced MFAC scheme. For the formation control of MASs, [39] gives a data-driven formation control scheme based on MFAILC. Considering the iterative-varying disturbance, the algorithm in [39] is extended to a robust form formation control law in [40]. In [41], an event-triggered MFAC method is used to perform the containment task where the MASs are suffering from denial-of-service attacks. A model-free adaptive containment control scheme is investigated for a class of multi-input multi-output nonlinear MASs in [42]. However, the MASs in [34,35,36,37,38,39,40] only contain one single leader, while no exploration has been conducted for multileaders case. The work in [41,42] only found the result that the defined local containment error is bounded or converged, which is not exactly equivalent to the containment goal. Inspired by above observations, a novel distributed model-free adaptive containment control scheme is proposed in this paper. Compared with existing work, the main contributions of this work are threefold:
(1)
A novel distributed model-free adaptive containment controller is developed. Compared with previous work [34,35,36,37,38,39,40], the MASs extend to the multiple-leader case. By virtue of the MFAC framework, this controller is endowed with the data-driven feature, i.e., the design process only utilizes I/O data of the agents rather than their mathematical models. Moreover, this method does not need the extra training process that is required in the NNs based algorithm [20].
(2)
The global containment error is proposed to overcome the analysis nonequivalent problem in [41,42]. Then, the Lyapunov function based stability analysis is conducted to guarantee that the containment goal is achieved, which is different from the conventional contraction mapping principle used in MFAC based protocols (e.g., ref. [35]). In addition, the follower dynamics considered in this paper can contain the nonaffine nonlinear property, whereas the follower dynamics appearing in [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20] are either affine or linear.
(3)
The control scheme is further extended to a time-switching topology and dynamic leaders case, which makes the controller more applicable and flexible in real implementation.
The remaining part of the paper is organized as follows. Section 2 presents some preliminary knowledge and gives problem formulation. Section 3 shows the main results, including the controller design and stability analysis. Section 4 extends the existing result to a time-switching topology and dynamic leaders case. In Section 5, three numerical examples are provided. Finally, the conclusion is discussed in Section 6.

2. Preliminaries and Problem Formulation

2.1. Symbol Notations and Graph Theory

Let R , R n and R n × m denote the set of real numbers, n × 1 real vectors, and n × m real matrices, respectively. The diagonal matrix and identity matrix are denoted by diag(·) and I; 1 n = [ 1 , 1 , . . . , 1 ] R n is a vector with all elements are 1; a denotes the Euclidean norm for a vector a R n ; card ( S ) stands for the number of elements in the set S.
The follower and leader agents are presented by nodes in the set F = { 1 , . . . , N } and W = { N + 1 , . . . , N + M } , respectively. The communication between these agents is described by a directed graph G = { V , E , A } whose node set is V = F W . The edges in the set E V × V represent the communication links between agents. The expression A = [ a i j ] R ( N + M ) × ( N + M ) ( i , j V ) is defined as an adjacency matrix with elements a i j = 1 if and only if the directed edge ( j , i ) E , otherwise a i j = 0 . The in-degree matrix of a graph G is defined by a diagonal matrix D = d i a g ( d 1 , . . . , d N + M ) with diagonal entries d i = j = 1 N + M a i j . The neighbors of the ith agent are denoted by the set N i = { j | ( j , i ) E } . A directed path from node i to node j is defined as a sequence of consecutive edges which begin from node i and end in node j. A directed graph is strongly connected if a path exists between any pair of nodes. The Laplacian matrix is defined as L = D A . Due to the fact that leaders do not receive any information from other agents, the Laplacian matrix L can be partitioned as the following block matrix [8]:
L = L 1 L 2 0 M × N 0 M × M
where L 1 R N × N and L 2 R N × M .
In order to give the readers a better understanding of this paper, the major notations are listed in Notations part.

2.2. Problem Formulation

The MASs consist of N follower agents and the dynamics of the ith agent is described as:
y i ( k + 1 ) = f i y i ( k ) , u i ( k )
where y i ( k ) R , u i ( k ) R are the agent output and control input of the ith agent at time instant k, respectively, and f i ( · ) is an unknown nonlinear function.
Denote the output of the leader as w l ( k ) R , l = N + 1 , . . . , N + M , which can only be received by a subset of follower agents.
Definition 1
(Convex Set and Convex Hull [43]). A set C R n is said to be convex if ( 1 α ) x + α y C , for any α [ 0 , 1 ] and x , y C . For a finite set of points W = { w 1 , w 2 , . . . , w q } R n , the convex hull C o ( W ) is the minimal convex set containing all points in W , that is, C o ( W ) = { i = 1 q α i w i | w i W , α i 0 , i = 1 q α i = 1 } . In particular, when W R , C o ( W ) = { w | w [ min i w i , max i w i ] } .
Definition 2
(Containment Control [10]). The MASs achieve containment if all followers are driven by a distributed control law and move into the convex hull spanned by the leader set W , that is, i 1 , . . . , N , there exist an element w ¯ i C o ( W ) , such that
lim k y i ( k ) w ¯ i = 0 .
where w ¯ i is the convergence value for the ith follower agent.
The following assumptions are given before further discussions.
Assumption 1.
The partial derivative of f i ( · ) with respect to u i ( k ) is continuous.
Assumption 2.
For all k with finite exceptions, if Δ u i ( k ) 0 , the follower agent dynamics (2) satisfies the following generalized Lipschitz condition, that is
| Δ y i ( k + 1 ) | b | Δ u i ( k ) |
where Δ y i ( k + 1 ) = y i ( k + 1 ) y i ( k ) , Δ u i ( k ) = u i ( k ) u i ( k 1 ) and b is a positive constant.
Remark 1.
Assumption 1 is a general condition for controller design. Assumption 2 gives a bound of the change rate of the agent’s output, i.e., the agent’s output change should be finite if the control input change is bounded. A wide range of real agent systems satisfy the assumptions. For instance, the velocity change of an autonomous car must be finite with a bounded change of the accelerator. Similar assumptions can also be found in [22,23,24,25,26,27,28,29,32,33].
Under Assumptions 1 and 2, the unknown agent dynamics (2) can be transformed into the following dynamic linearization model and then the distributed control law will be designed based on it.
Lemma 1
([44]). Consider follower agents with dynamics (2) satisfying Assumptions 1 and 2. If Δ u i ( k ) 0 holds, then agent (2) can be transformed into a compact form dynamic linearization (CFDL) data model:
Δ y i ( k + 1 ) = ϕ i ( k ) Δ u i ( k )
where the time-varying variable ϕ i ( k ) R is named as the pseudo partial derivative (PPD), which is bounded for any time instant k.
Remark 2.
By virtual of CFDL technique, the unknown nonlinear agent dynamics (2) are now transformed into an incremental form data description (3) in every operation point with a bounded PPD. In implementation, with proper selected algorithms, PPD can be estimated by using the I/O data of the agents and the estimated value can also be proved to be bounded, which is shown in Theorem 1. Here, the CFDL data model is derived under the condition Δ u i ( k ) 0 . In fact, if the case Δ u i ( k ) = 0 happens at some sampling time, a new CFDL data model can also be established after shifting γ i Z + time steps until u i ( k + γ i ) u i ( k ) holds. In the next section, the data model (3) will be used to design the distributed control law and derive the stability analysis result.

3. Main Results

In this section, the data driven containment control law is first designed under the stationary leaders and fixed topology case.
Let ξ i ( k ) denote the local containment error:
ξ i ( k ) = j = 1 N a i j ( y j ( k ) y i ( k ) ) + l = N + 1 N + M a i l ( w l ( k ) y i ( k ) )
where a i j ( i , j V ) is the entry in the adjacency matrix, and y i ( k ) and w l ( k ) denote the outputs of the follower agent i and leader agent l, respectively. From this definition, ξ j ( k ) can be regarded as the tracking error of the ith agent with all its neighboring agents at time instant k.
The distributed model free adaptive containment control (MFACC) is designed as follows:
  • Distributed PPD updating algorithm:
    ϕ ^ i ( k ) = ϕ ^ i ( k 1 ) + η Δ u i ( k 1 ) μ + | Δ u i ( k 1 ) | 2 × ( Δ y i ( k ) ϕ ^ i ( k 1 ) Δ u i ( k 1 ) )
  • PPD reset mechanism:
    ϕ ^ i ( k ) = ϕ ^ i ( 1 ) , i f | ϕ ^ i ( k ) | ε o r | Δ u i ( k 1 ) | ε o r sign ( ϕ ^ i ( k ) ) sign ( ϕ ^ i ( 1 ) )
  • Distributed containment input law:
    u i ( k ) = u i ( k 1 ) + ρ i ϕ ^ i ( k ) λ + | ϕ ^ i ( k ) | 2 ξ i ( k )
    where ρ i ( 0 , 1 ] , η ( 0 , 2 ] , μ > 0 and λ > 0 are adjustable parameters, ϕ ^ i ( k ) is the estimated value of ϕ i ( k ) , ϕ ^ i ( 1 ) is the initial value of ϕ ^ i ( k ) , and ε > 0 is a preset small constant.
Remark 3.
The PPD is estimated by the distributed updating algorithm (5) and (6), which is the well-known projection algorithm [22,23,24,25,26,27,28,29,30,31,32,33] and derived by minimizing the following performance index J ( ϕ ^ i ( k ) ) = | Δ y i ( k ) ϕ ^ i ( k ) Δ u i ( k 1 ) | 2 μ | ϕ ^ i ( k ) ϕ ^ i ( k 1 ) | 2 . With this distributed algorithm, the PPDs of follower agents are estimated by using their own I/O data. Theoretical results show these estimated values will be bounded if some parameters are appropriately chosen, which is presented in the next section. Furthermore, due to the heterogeneity of the agent dynamics, the trajectories of the estimated PPD ϕ ^ i ( k ) may vary from different follower agents.
Remark 4.
The distributed containment control law (7) is derived by minimizing the following index J ( u i ( k ) ) = | j = 1 N a i j y j ( k ) y i ( k + 1 ) + l = N + 1 N + M a i l w l ( k ) y i ( k + 1 ) | 2 + λ | Δ u i ( k ) | 2 . As we can see, this performance index tries to decrease the errors between the ith agent and its neighbors. It will be proved in the following part that the MASs will finally achieve the containment control goal. Observing the control scheme (5)–(7), one will find it only uses the local I/O data to estimate the PPD and calculate the input for each follower agent, i.e., no model information of f i ( · ) is involved; that is why it is called a data-driven method. The control gain in (7) is also adaptively tuned by ϕ ^ i ( k ) , which is continuously updated by the new received I/O data. In addition, the protocols (5)–(7) will regress to the consensus control law in [34] when there is only one leader.
The following assumptions and lemma are prepared before the main result.
Assumption 3.
For all follower agents, the sign of the PPD ϕ i ( k ) ( i = 1 , . . . , N ) is unchanged, i.e., ϕ i ( k ) > ε ¯ > 0 (or ϕ i ( k ) < ε ¯ < 0 ), where ε ¯ is an arbitrarily small positive constant.
Without loss of generality, it is assumed that ϕ i ( k ) > ε ¯ > 0 in the discussion.
Remark 5.
Assumption 3 indicates that the follower agent’s output incremental direction caused by its input do not change during the containment control process. Similar assumptions can also be found in model-based control theory.
Assumption 4.
The communication graph among followers is strongly connected, and each follower can receive the information through a direct path from at least one leader. Assumption 4 guarantees that there are no isolated followers and the containment control problem is well-posed.
Lemma 2
([45]). For a dynamic system x ( k + 1 ) = f ( x ( k ) ) , V ( k ) = g ( x ( k ) ) is a positive definite and radially unbounded Lyapunov function. If Δ V ( k ) 0 and Δ V ( k ) 0 for any x ( k ) 0 , then the equilibrium x e = 0 is global asymptotic stable.
In the stationary leader case, the trajectory of the lth leader is written as w l ( k ) = w l ( l W ) , where w l is a constant. The main result is summarized as Theorem 1.
Theorem 1.
Assume that the MASs dynamics satisfies Assumptions 1–3 and the communication topology satisfies Assumption 4. In the stationary leaders case, that is, w l ( k ) = w l ( l W ) , let the distributed MFACC (5)–(7) be applied. If the parameter ρ i ( i = 1 , . . . , N ) is selected to satisfy the following convergence condition
0 < ρ i < 1 d i
then there exists a λ m i n > b 2 / 4 , and the MASs will achieve the containment control objective defined in Definition 2 when λ > λ m i n and k .
Proof. 
The proof consists of three steps. Step one is to prove that the PPD estimation is bounded. Step two is to derive the convergence condition. Step three is to prove the convergence of the global containment error.
  • S t e p 1 The boundedness of ϕ ^ i ( k ) is straightforward if ϕ ^ i ( k ) ε or Δ u i ( k ) ε or sign ( ϕ ^ i ( k ) ) sign ( ϕ ^ i ( 1 ) ) .
In the other case, the PPD estimation error is defined as ϕ ˜ i ( k ) = ϕ ^ i ( k ) ϕ i ( k ) . Subtracting ϕ i ( k ) from both sides of the PPD estimation algorithm (5) yields
ϕ ˜ i ( k ) = ϕ ˜ i ( k 1 ) ( ϕ i ( k ) ϕ i ( k 1 ) ) + η Δ u i ( k 1 ) μ + | Δ u i ( k 1 ) | 2 × ( Δ y j ( k 1 ) ϕ ^ i ( k 1 ) ) Δ u i ( k 1 ) )
Define Δ ϕ i ( k ) = ϕ i ( k ) ϕ i ( k 1 ) . Then, substituting CFDL data description (3) into (8) gives
ϕ ˜ i ( k ) = 1 η Δ | u i ( k 1 ) | 2 μ + | Δ u i ( k 1 ) | 2 × ϕ ˜ i ( k 1 ) Δ ϕ i ( k )
From Lemma 1, it can be known that ϕ i ( k ) is bounded, and assume the bound of it as b ¯ . Taking the norm on both sides of (9) leads to
| ϕ ˜ i ( k ) | 1 η | Δ u i ( k 1 ) | 2 μ + | Δ u i ( k 1 ) | 2 ϕ ˜ i ( k 1 ) + | Δ ϕ i ( k ) | 1 η | Δ u i ( k 1 ) | 2 μ + | Δ u i ( k 1 ) | 2 ϕ ˜ i ( k 1 ) + 2 b ¯
Note the fact that the term η | Δ u i ( k 1 ) | 2 / ( μ + | Δ u i ( k 1 ) | 2 ) is monotonically increasing with respect to | Δ u ( k 1 ) | . Thus, there must exist a constant δ 1 to satisfy the following inequality when 0 < η < 2 and μ > 0 :
0 1 η | Δ u i ( k 1 ) | 2 μ + | Δ u i ( k 1 ) | 2 1 η ε 2 μ + ε 2 = δ 1 < 1
Substituting (11) into (10) gives
| ϕ ˜ i ( k ) | δ 1 | ϕ ˜ i ( k 1 ) | + 2 b ¯ δ 1 2 | ϕ ˜ i ( k 2 ) | + 2 δ 1 b ¯ + 2 b ¯ . . . δ 1 k 1 | ϕ ˜ i ( 1 ) | + 2 b ¯ ( 1 δ 1 k 1 ) 1 δ 1
which means ϕ ˜ i ( k ) is bounded. Since the boundedness of ϕ i ( k ) is guaranteed by Lemma 1, ϕ ^ i ( k ) is also bounded.
  • S t e p 2 Define the collective stack vectors as follows:
y ( k ) = [ y 1 ( k ) , y 2 ( k ) , . . . , y N ( k ) ] T u ( k ) = [ u 1 ( k ) , u 2 ( k ) , . . . , u N ( k ) ] T ξ ( k ) = [ ξ 1 ( k ) , ξ 2 ( k ) , . . . , ξ N ( k ) ] T w s = [ w 1 , w 2 , . . . , w N ] T
Now, (4) is rewritten as the collective form:
ξ ( k ) = L 1 y ( k ) L 2 w ( k )
Using the collective stack vectors, (7) is rewritten as the following compact form:
u ( k ) = u ( k 1 ) + ρ H ( k ) ξ ( k )
where
ρ = d i a g ρ 1 , . . . , ρ N
H ( k ) = d i a g ϕ ^ 1 ( k ) λ + | ϕ ^ 1 ( k ) | 2 , . . . , ϕ ^ N ( k ) λ + | ϕ ^ N ( k ) | 2
Similarly, the CFDL model (3) is also transformed as the collective form
y ( k + 1 ) = y ( k ) + Φ ( k ) Δ u ( k )
where Δ u ( k ) = u ( k ) u ( k 1 ) , and Φ ( k ) = d i a g ϕ 1 ( k ) . . . ϕ N ( k ) .
Substituting (14) into (15), one has
y ( k + 1 ) = y ( k ) + ρ Φ ( k ) H ( k ) ξ ( k )
Combining (13) and (16) yields
y ( k + 1 ) = ( I ρ Φ ( k ) H ( k ) L 1 ) y ( k ) ρ Φ ( k ) H ( k ) L 2 w s
Denote Ω ( k ) = [ y ( k ) T w s T ] T and
Γ ( k ) = I ρ Φ ( k ) H ( k ) L 1 ρ Φ ( k ) H ( k ) L 2 0 N × N I
Then, (17) can be rewritten as
Ω ( k + 1 ) = Γ ( k ) Ω ( k )
The ith ( i F ) diagonal entry γ i i ( k ) of Γ ( k ) can be expressed as
γ i i ( k ) = 1 d i · ρ i · ϕ i ( k ) ϕ ^ i ( k ) λ + | ϕ ^ i ( k ) | 2 = 1 d i ρ i ϕ i ( k ) ϕ ^ i ( k ) λ + | ϕ ^ i ( k ) | 2
If the condition 0 < ρ i < 1 d i holds, one has 0 < d i ρ i < 1 . According to Assumption 3 and reset condition (6), ϕ i ( k ) ϕ ^ i ( k ) > 0 , which means ϕ i ( k ) ϕ ^ i ( k ) λ + | ϕ ^ i ( k ) | 2 > 0 when the parameters satisfy λ > 0 . Since ϕ ^ i ( k ) and ϕ i ( k ) are bounded, we choose λ > λ m i n ( λ m i n > b 2 / 4 ) to satisfy the following inequality:
0 < ϕ i ( k ) ϕ ^ i ( k ) λ + | ϕ ^ i ( k ) | 2 b | ϕ ^ i ( k ) | 2 λ | ϕ ^ i ( k ) | < b 2 λ m i n < 1
which means 0 < γ i i ( k ) < 1 when 0 < ρ i < 1 d i and λ > λ m i n .
By definition of the Laplacian matrix, one has L 1 1 N + L 2 1 M = 0 . Note the fact that Γ ( k ) 0 and Γ ( k ) 1 N + M = 1 N + M , i.e., Γ ( k ) is a stochastic matrix.
  • S t e p 3 Prove the convergence of the global containment error.
Define the upper bounds of the followers, leaders, and the MASs as follows:
y + ( k ) = max i y i ( k ) w + = max l w l Ω + ( k ) = max { y + ( k ) , w + }
Note that one can then define y ( k ) = min i y i ( k ) , w = min l w l , and Ω ( k ) = min { y ( k ) , w } for the lower bound.
For the upper bound, Ω + ( k ) = w + means all the followers goes into the convex hull of the leaders. Therefore, define the upper global containment error as follows:
e + ( k ) = Ω + ( k ) w +
(Define e ( k ) = w Ω ( k ) for the lower global containment error). Then, one considers the following Lyapunov function V 1 ( k ) = e + ( k ) 2 for further analysis. The time difference can be written as
Δ V 1 ( k + 1 ) = V 1 ( k + 1 ) V 1 ( k ) = ( e + ( k + 1 ) e + ( k ) ) ( e + ( k + 1 ) + e + ( k ) ) = Ω + ( k + 1 ) Ω + ( k ) ( Ω + ( k + 1 ) + Ω + ( k ) 2 w + )
Rewrite y i ( k + 1 ) as
y i ( k + 1 ) = j : y j ( k ) = y + ( k ) γ i j ( k ) y j ( k ) + j : y j ( k ) < y + ( k ) γ i j ( k ) y j ( k ) + l = N + 1 N + M γ i l ( k ) w l
where γ i j ( k ) is the entries of Γ ( k ) .
According to Lemma 2, two parts need to be proved: (1) e + ( k ) = 0 is an equilibrium point; (2) For any e + ( k ) 0 , Δ V 1 ( k ) 0 and Δ V 1 ( k ) 0 .
(1)
e + ( k ) = 0 , i.e., y + ( k ) w + .
Substituting y j ( k ) w + and w l w + into (22), one derives
y i ( k + 1 ) j = 1 N + M γ i j ( k ) w + = w +
Therefore, Ω + ( k + 1 ) = max { y + ( k + 1 ) , w + } = w + = Ω + ( k ) , i.e., Δ V 1 ( k + 1 ) = 0 in this case. In other words, the e + ( k ) = 0 is the equilibrium point.
(2)
e + ( k ) 0 , i.e., y + ( k ) > w + .
Denote s i ( k ) = j : y j ( k ) = y + ( k ) γ i j ( k ) . For the ith follower, s i ( k ) = 1 means the outputs of the ith follower and its neighbors are all equal to y + ( k ) , which also means only the first term in the right side of (22) exists. Substituting y j ( k ) = y + ( k ) into (22) gives
y i ( k + 1 ) = s i ( k ) y + ( k ) = y + ( k ) if s i ( k ) = 1
Otherwise, if s i ( k ) 1 , the following results can be concluded from Equation (22):
y i ( k + 1 ) < j : y j ( k ) = y + ( k ) γ i j ( k ) y + ( k ) + j : y j ( k ) < y + ( k ) γ i j ( k ) y + ( k ) + l = N + 1 N + M γ i l ( k ) y + ( k ) = s i ( k ) y + ( k ) + ( 1 s i ( k ) ) y + ( k ) = y + ( k ) if s i ( k ) 1
Combining (24) and (25) gives
Ω + ( k + 1 ) Ω + ( k ) max { y + ( k ) , w + } Ω + ( k ) = 0
It is clear that Ω + ( k + 1 ) + Ω + ( k ) 2 w + > 0 . Therefore, Δ V 1 ( k + 1 ) 0 holds. Now, one only needs to prove that Δ V 1 ( k + 1 ) 0 .
Assume that there exists a time instant K 1 such that Δ V 1 ( k + 1 ) 0 when k > K 1 .
Define S 1 ( h ) = { i | y i ( K 1 + h ) = y + ( K 1 ) } and N 1 = card ( S 1 ( 0 ) ) . Since Δ V 1 ( k + 1 ) 0 , it means y + ( K 1 + h ) = y + ( K 1 ) for all h > 0 , i.e., card ( S 1 ( h ) ) > 0 must hold for all h > 0 .
According to Assumption 4, at least one agent in S 1 ( h ) whose neighbor’s output is less than y + ( K 1 ) , i.e., there exist a certain follower y p ( K 1 + h ) satisfying y p ( K 1 + h ) S 1 ( h ) and s p ( K 1 + h ) 1 . From (25), one derives y p ( K 1 + h + 1 ) < y + ( K 1 ) , which implies
card ( S 1 ( h + 1 ) ) card ( S 1 ( h ) ) 1
Repeating the process in (27), one gets
card ( S 1 ( N 1 ) ) card ( S 1 ( N 1 1 ) ) 1 card ( S 1 ( N 1 2 ) ) 2 . . . card ( S 1 ( 0 ) ) N 1 = 0
which contracts to card ( S 1 ( h ) ) > 0 . Therefore, the assumption is violated.
According to Lemma 2, one concludes that e + ( k ) will converge to 0. This completes the proof. □
Remark 6.
According to the analysis in Theorem 1, if there is only one leader, all followers will converge to the leader’s output. In other words, consensus is a special case of the containment control discussed in this paper.

4. Extension to Switching Topologies and Dynamic Leaders

Due to transmission failure and other factors, the communication topology between agents may change in the containment process. On the other hand, the trajectories of leaders usually change over time in practical agent coordination tasks. Therefore, the proposed containment design is extended to the switching topologies and dynamic leaders situation.
A switching graph G ( k ) associated with time instant k is now used to indicate the communication topology among agents. The graph switches from the set G = { G 1 , G 2 , . . . , G ψ } , where ψ Z + is the total number of possible graphs. The time-varying adjacency matrix and the Laplacian matrix are denoted by A ( k ) = [ a i j ( k ) ] R ( N + M ) × ( N + M ) ( i , j V ) and L ( k ) , respectively. The two blocks in (1) are also time-varying and written as L 1 ( k ) and L 2 ( k ) . The in-degree matrix in time instant k is denoted by D ( k ) = d i a g d 1 ( k ) , . . . , d N + M ( k ) with diagonal entries d i ( k ) = j = 1 N + M a i j ( k ) . For a specific interaction graph G l in the set G , the corresponding in-degree matrix is defined as D l = d i a g d 1 l , . . . , d N + M l . The neighbors of the ith agent in time instant k are denoted by the set N i ( k ) .
Assumption 5.
For every graph in G , the communication graph among followers is strongly connected and each follower can receive the information via a directed path from at least one leader.
Next, we consider the time varying leaders. Assume that the trajectories of leaders have lower and upper bounds, that is
w ˜ w l ( k ) w ˜ + , l = N + 1 , . . . , N + M
where w ˜ and w ˜ + are two constants.
Considering the above changes, the local containment error in (4) is modified as
ξ ¯ i ( k ) = j = 1 N a i j ( k ) ( y j ( k ) y i ( k ) ) + l = N + 1 N + M a i l ( k ) ( w l ( k ) y i ( k ) )
The control input (7) is extended as
  • Extended Distributed containment input law:
    u i ( k ) = u i ( k 1 ) + ρ i ϕ ^ i ( k ) λ + | ϕ ^ i ( k ) | 2 ξ ¯ i ( k )
    where ρ i ( 0 , 1 ] , η ( 0 , 2 ] , μ > 0 , λ > 0 are parameters.
Now, Equations (5), (6), and (30) consist the extended algorithm, and the result is given as the following theorem.
Theorem 2.
In the switching topologies and dynamic leaders case, assume that the MASs dynamics satisfy Assumptions 1–3 and the communication topology satisfies Assumption 5. Let the distributed MFACC (5), (6), and (30) be applied. If the parameter ρ i ( i = 1 , . . . , N ) is selected to satisfy the following convergence condition
0 < ρ i < 1 max l = 1 , 2 , . . , ψ d i l
then there exists a λ m i n > b 2 / 4 , and the MASs will converges to [ w ˜ , w ˜ + ] when λ > λ m i n and k .
Proof. 
Define the time-varying leaders’ output collective stack vectors as w ( k ) = [ w 1 ( k ) , w 2 ( k ) , . . . , w N ( k ) ] T .
In this case, Equation (17) becomes
y ( k + 1 ) = Γ ˜ ( k ) y ( k ) w ( k )
where
Γ ˜ ( k ) = I ρ Φ ( k ) H ( k ) L 1 ( k ) ρ Φ ( k ) H ( k ) L 2 ( k ) = [ γ ˜ i j ] R N × ( N + M ) .
The γ ˜ i i ( k ) is now written as
γ ˜ i i ( k ) = 1 d i ( k ) ρ i ϕ i ( k ) ϕ ^ i ( k ) λ + | ϕ ^ i ( k ) | 2
If the condition 0 < ρ i < 1 max l = 1 , 2 , . . , ψ d i l holds, one has 0 < d i ( k ) ρ i < 1 . From the analysis in (18)–(20), it is easy to verify that Γ ˜ ( k ) also is a stochastic matrix when λ > λ m i n ( λ m i n > b 2 / 4 ) .
Define Ω ˜ ( k ) = max { y + ( k ) , w ˜ + } and V 2 ( k ) = e ˜ + ( k ) 2 , where e ˜ + ( k ) = Ω ˜ ( k ) w ˜ + .
Δ V 2 ( k + 1 ) = Ω ˜ ( k + 1 ) Ω ˜ ( k ) ( Ω ˜ ( k + 1 ) + Ω ˜ ( k ) 2 w ˜ + )
Rewrite y i ( k + 1 ) as
y i ( k + 1 ) = j : y j ( k ) = y + ( k ) γ ˜ i j ( k ) y j ( k ) + j : y j ( k ) < y + ( k ) γ ˜ i j ( k ) y j ( k ) + l = N + 1 N + M γ ˜ i l ( k ) w l ( k )
Following the same steps from (23) to (26), one concludes
Δ V 2 ( k + 1 ) = 0 , if e ˜ + ( k ) = 0 .
Δ V 2 ( k + 1 ) 0 , if e ˜ + ( k ) 0 .
One only needs to prove that there does not exist a time instant K 2 such that Δ V 2 ( k + 1 ) 0 .
Assume that Δ V 2 ( k + 1 ) 0 when k > K 2 .
Define S 2 ( h ) = { i | y i ( K 2 + h ) = y + ( K 2 ) } and N 2 = card ( S 2 ( 0 ) ) . Since Δ V 2 ( k + 1 ) 0 , it means y + ( K 2 + h ) = y + ( K 2 ) for all h > 0 , i.e., card ( S 2 ( h ) ) > 0 must hold for all h > 0 .
According to Assumption 5, at least one agent in S 2 ( h ) whose neighbor’s output is less than y + ( K 2 ) , i.e., there exists a certain follower y p ( K 2 + h ) satisfying y p ( K 2 + h ) S 2 ( h ) and s p ( K 2 + h ) 1 . From (27), one gets
card ( S 2 ( N 2 ) ) card ( S 2 ( N 2 1 ) ) 1 card ( S 2 ( N 2 2 ) ) 2 . . . card ( S 2 ( 0 ) ) N 2 = 0
which contracts to card ( S 2 ( h ) ) > 0 . Therefore, the assumption is violated. In summary, one concludes that e ˜ + ( k ) converges to 0. This completes the proof. □
Remark 7.
When the leaders become time-invariant after time instant, one has w ˜ + = max l w l and w ˜ = min l w l , and the result in Theorem 2 will regress to the result in Theorem 1.

5. Simulation

This section gives three numerical examples to verify the theoretical results. The example settings and parameter selections are shown in Table 1. It is noteworthy that the agent dynamics presented in the simulation are only used to generate the I/O data and the controller design process contains no model information.
All tests in this paper are conducted in MATLAB R2018a with the Intel CPU Core (TM) i7-11800H @ 2.30 GHz (Intel, Santa Clara, CA, USA). The sample time is chosen as 0.1 s. The time step is set to be 500 for Example 1 and 1000 for Examples 2–3.

5.1. Example 1

Four follower agents are labeled as agents 1–4 and two leaders are labeled as agents 5–6, i.e., F = { 1 , 2 , 3 , 4 } and W = { 5 , 6 } . The communication network is fixed in this case as shown in Figure 1, in which the topology among followers is strongly connected. Moreover, the information from agents 5–6 can only be received by agent 1 and agent 4, respectively.
The dynamics of four followers is described as
A g e n t 1 : y 1 ( k + 1 ) = 0.7 y 1 ( k ) + u 1 ( k ) A g e n t 2 : y 2 ( k + 1 ) = 0.6 c o s ( y 2 ( k ) ) + 0.3 y 2 ( k ) + u 2 ( k ) A g e n t 3 : y 3 ( k + 1 ) = y 3 ( k ) 1 + y 3 2 ( k ) + u 3 ( k ) A g e n t 4 : y 4 ( k + 1 ) = y 4 ( k ) 1 + y 4 2 ( k ) + u 4 3 ( k )
It is worth noting that the followers are heterogeneous and include linear and nonlinear dynamics. Agent 1 is a linear agent, which is widely used in autonomous vehicle formation control [2]. The cosine nonlinear dynamics of agent 2 is taken from [46]. The nonaffine nonlinear descriptions of agents 3–4 are picked from [36]. The heterogeneity and nonlinearity of followers become a great challenge to the controller design.
The trajectories of two leaders are chosen as piece-wise step signals:
A g e n t 5 : w 5 ( k ) = 1.4 , if k < 165 1.6 , if 165 k < 330 1.3 , if k 330 A g e n t 6 : w 6 ( k ) = 0.7 , if k < 165 1.2 , if 165 k < 330 1.1 , if k 330
The initial conditions y i ( 0 ) ( i F ) are selected as y 1 ( 0 ) = 1.2 , y 2 ( 0 ) = 1.8 , y 3 ( 0 ) = 0.6 , y 4 ( 0 ) = 0.4 . Initial values of the control inputs and estimated PPD are set as u i ( 0 ) = 0 ( i F ) and ϕ ^ i ( 0 ) = 0.5 ( i F ) . Other parameters are listed in Table 1. Observing the communication network in Figure 1, the in-degree matrix is D = d i a g ( 2 , 1 , 2 , 2 , 0 , 0 ) and the reciprocal of its largest diagonal entry is 0.5. Thus, ρ i = 0.3 < 1 / d i ( i F ) satisfies the convergence condition in Theorem 1.
The outputs and inputs of agents are shown in Figure 2 and Figure 3 (with λ = 2), respectively. They show that the follower agents are driven into the convex hull of leaders gradually. At the time instant k = 165 and k = 330 , the convex hull moves to another location due to the change of the leaders’ outputs. The followers can still converge to the new convex hull, which verifies the result in Theorem 1. To further study the influence of the key parameter, the tracking performance with different λ (i.e., λ = 5, 2, 1, and 0.5) is presented in the sub-figures in Figure 2. The results show that, within a certain range, the smaller the λ value, the faster the tracking speed of follower agents. However, one should also note that the overshoots and oscillations may be intensified if the λ value is too small. The PPD estimation (with λ = 2) is shown in Figure 4. Due to the heterogeneous dynamics of followers, the PPD estimations are quite different from each other even though they have the same initial value.

5.2. Example 2

The time-switching topology and dynamic leaders are simulated in this case. The followers dynamics are assumed to be the same with Case 1. The extended algorithm Equations (5), (6), and (30) will be implemented in the MASs. The communication topology is assumed to switch over three graphs, i.e., S G = { G 1 , G 2 , G 3 } , which are presented in Figure 5. An extra time-varying signal σ ( k ) is introduced to indicate the switching process. The definition of σ ( k ) is given as follows:
σ ( k ) = 1 if G ( k ) = G 1 2 if G ( k ) = G 2 3 if G ( k ) = G 3
The dynamic trajectories of the leaders are selected as
A g e n t 5 : w 5 ( k ) = ( 0.5 + 0.001 k ) s i n ( π k / 150 ) A g e n t 6 : w 6 ( k ) = ( 0.5 + 0.001 k ) s i n ( π k / 150 ) 1 A g e n t 7 : w 7 ( k ) = 0.3 s i n ( π k / 100 ) 0.5
From the description, one can see the amplitude of leader 5 and 6 increases gradually over time.
All the initial values and the parameters are the same with Case 1 while λ = 0.5 . From Figure 5, the in-degree matrices are D 1 = d i a g ( 2 , 1 , 3 , 2 , 0 , 0 ) , D 2 = d i a g ( 3 , 1 , 2 , 2 , 0 , 0 ) , and D 3 = d i a g ( 2 , 1 , 2 , 3 , 0 , 0 ) for graphs G 1 , G 2 , and G 3 , respectively. Thus, it is easy to check ρ i = 0.3 < 1 / max l = 1 , 2 , . . , ψ d i l = 1 3 also satisfies the convergence condition in Theorem 2.
Figure 6 depicts the changes of indicative signal σ ( k ) from k = 1 to k = 50 , where one can observe that the communication network randomly switches among three graphs in different time instants. Figure 7, Figure 8 and Figure 9 show the outputs, inputs and estimated PPD of the follower agents, respectively. In Figure 7, the green curves present the envelopes of the leaders’ trajectory, i.e., these two curves are the upper and lower boundaries of the convex hull. It can be found all followers are driven into the space within these two curves, which verifies the result in Theorem 2. Furthermore, the trajectories in Figure 7, Figure 8 and Figure 9 are not smooth, which is caused by the quick switch of the communication topologies and the dynamic changes of the leaders.

5.3. Example 3

A more practical simulation with external disturbance is conducted in this case. The followers’ dynamics and topologies are the same as Example 2. The leaders’ outputs are modified as piece-wise functions to simulate the transition of different types of tasks.
A g e n t 5 : w 5 ( k ) = 1.2 if 0 < k 250 0.6 if 250 < k 500 ( 0.5 + 0.001 k ) s i n ( π k / 150 ) if k > 500 A g e n t 6 : w 6 ( k ) = 1.6 if 0 < k 250 0.8 if 250 < k 500 ( 0.5 + 0.001 k ) s i n ( π k / 150 ) 1 if k > 500 A g e n t 7 : w 7 ( k ) = 1.8 if 0 < k 250 1 if 250 < k 500 0.3 s i n ( π k / 100 ) 0.5 if k > 500
The communication channels are assumed to be intervened with external disturbances.
w ^ l ( k ) = w l ( k ) + β · n l ( k ) ( l = N + 1 , . . . , N + M ) y ^ i ( k ) = y i ( k ) + β · n i ( k ) ( i = 1 , . . . , N )
where w ^ l ( k ) and y ^ i ( k ) are the real information that the follower receives from leaders and its neighboring followers, respectively, and n i ( k ) is the channel disturbance following the normal distribution with intensity β .
Moreover, a widely used protocol [12] is chosen as comparison.
u i ( k ) = a i g ( k ) ξ ¯ i ( k )
where a i g ( k ) is the time-varying gain. It is worth noting that a i g ( k ) is difficult to determine if the agent model is unavailable. In this test, via a trial-and-error method, the gain is tuned similarly to the form in [12] as a 1 g ( k ) = 0.3 / ( 1 + 0.001 k ) , a 2 g ( k ) = 1 / ( 1 + 0.005 k ) , a 3 g ( k ) = 0.3 / ( 1 + 0.001 k ) , a 4 g ( k ) = 0.3 / ( 1 + 0.001 k ) .
Denote the integral square error (ISE) of global containment errors as the performance index:
ISE = k e ˜ + ( k ) 2 + e ˜ ( k ) 2
The received outputs and global containment errors of the MAS with MFACC ( λ = 0.5) are shown in Figure 10. It can be clearly seen that MFACC still works well under communication channel disturbances with different intensities, and the global containment errors converge, which also validates the theoretical results in this paper. The comparison results when using the protocol in [12] are given in Figure 11. One can observe the controller is stable, but the tracking performance is not satisfying facing unknown heterogeneous follower dynamics. In summary, the ISE results of two methods under various channel environment are listed in Table 2.

6. Conclusions

In this paper, the containment control problem of MASs is solved using the MFAC approach. Based on the dynamical linearization technique, the heterogeneous unknown nonlinear agents are transformed into CFDL data models, and then the novel distributed MFAC algorithm was designed to ensure that the followers are driven into the convex hull of the leaders. Further extension includes the results under time-switching topologies and dynamic leaders. The proposed algorithm is data-driven, i.e., only I/O data rather than systems models are used in the controller design. Three numerical examples are given to validate the effectiveness of this method. This paper brings the discussion of coordinated MASs MFAC schemes from the single leader field to multiple leaders field along with a novel stability analysis method.
Although the proposed method acquires several advantages, it still has limitations facing real applications. In the future, the containment control problem in the event of sensor failure and time delay will be investigated. Moreover, the algorithm should be validated by practical experiments.

Author Contributions

Conceptualization, Y.R; methodology, Y.R. and H.J.; software Y.R. and Y.X.; validation, Y.R. and D.L.; formal analysis, Y.R. and S.X.; investigation, S.X.; resources, L.W.; data curation, D.L.; writing—original draft preparation, Y.R., H.J., D.L. and Y.X.; writing—review and editing, Y.R., Y.X., S.X. and L.W.; visualization D.L.; supervision, H.J., D.L. and L.W.; project administration, Y.R.; funding acquisition, H.J. and S.X. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under Grant 62103057, R&D Program of Beijing Municipal Education Commission under Grant KM202210009011.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data generated or analyzed during this study are included in this article.

Conflicts of Interest

Author Deli Li was employed by the company China Mobile Group Design Institute Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

SymbolMeaning
F the set of followers
W the set of leaders
G the communication topology graph
A A ( k ) the adjacency matrix with fixed\switching topologies
D D ( k ) the in-degree matrix with fixed\switching topologies
L L ( k ) the Laplacian matrix with fixed\switching topologies
C o ( W ) the convex hull of the set W
y i ( k ) y ^ i ( k ) the output without\with disturbances of the ith follower agent
u i ( k ) the input of the ith follower agent
f i ( . ) the unknown follower dynamics
w ¯ i the convergence value for the ith follower agent
w l ( k ) w ^ l ( k ) the output without\with disturbances of the lth leader agent
ϕ i ( k ) the pseudo partial derivative
ϕ ^ i ( k ) the estimated pseudo partial derivative
ϕ ˜ i ( k ) the pseudo partial derivative estimation error
ξ i ( k ) ξ ¯ i ( k ) the local containment error with fixed\switching topologies
ρ i , η , λ , μ adjustable parameters of controllers
y ( k ) the vector collecting all follower agents’ outputs
u ( k ) the vector collecting all follower agents’ inputs
ξ ( k ) the vector collecting all local containment errors
w s w ( k ) the vector collecting all stationary\dynamic leaders’ outputs
w + w the upper\lower bound of stationary leaders’ outputs
w ˜ + w ˜ the upper\lower bound of dynamic leaders’ outputs
y + ( k ) y ( k ) the upper\lower bound of followers’ outputs
Ω + ( k ) Ω ( k ) the upper\lower bound of the MASs with stationary leaders
Ω ˜ + ( k ) Ω ˜ ( k ) the upper\lower bound of the MASs with dynamic leaders
e + ( k ) e ˜ + ( k ) the upper global containment error with stationary\dynamic leaders

References

  1. Lewis, F.L.; Zhang, H.; Hengster-Movric, K.; Das, A. Cooperative Control of Multi-Agent Systems: Optimal and Adaptive Control Design; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2014. [Google Scholar]
  2. Ren, W. Consensus strategies for cooperative control of vehicle formations. IET Control Theory Appl. 2007, 1, 505–512. [Google Scholar] [CrossRef]
  3. Dong, S.; Wang, C.; Wen, S.; Gang, F. A synchronization approach to trajectory tracking of multiple mobile robots while maintaining time-varying formations. IEEE Trans. Robot. 2009, 25, 1074–1086. [Google Scholar] [CrossRef]
  4. Oliveira, L.; Camponogara, E. Multi-agent model predictive control of signaling split in urban traffic networks. Transp. Res. C Emerg. Technol. 2010, 18, 120–139. [Google Scholar] [CrossRef]
  5. Ren, W.; Cao, Y.C. Distributed Coordination of Multi-Agent Networks; Springer Science & Business Media: London, UK, 2011. [Google Scholar]
  6. Cheng, L.; Wang, Y.; Ren, W.; Hou, Z.; Tan, M. Containment control of multiagent systems with dynamic leaders based on a pin -type approach. IEEE Trans. Cybern. 2016, 46, 3004–3017. [Google Scholar] [CrossRef] [PubMed]
  7. Ji, M.; Ferrari-Trecate, G.; Egerstedt, M.; Buffa, A. Containment control in mobile networks. IEEE Trans. Autom. Control 2008, 53, 1972–1975. [Google Scholar] [CrossRef]
  8. Cao, Y.; Ren, W.; Egerstedt, M. Distributed containment control with multiple stationary or dynamic leaders in fixed and switching directed networks. Automatica 2012, 48, 1586–1597. [Google Scholar] [CrossRef]
  9. Cao, Y.; Stuart, D.; Ren, W.; Meng, Z. Distributed containment control for multiple autonomous vehicles with double-integrator dynamics: Algorithms and experiments. IEEE Trans. Control Syst. Technol. 2011, 19, 929–938. [Google Scholar] [CrossRef]
  10. Li, J.; Ren, W.; Xu, S. Distributed containment control with multiple dynamic leaders for double-integrator dynamics using only position measurements. IEEE Trans. Autom. Control 2012, 57, 1553–1559. [Google Scholar] [CrossRef]
  11. Liu, K.; Xie, G.; Wang, L. Containment control for secondorder multi-agent systems with time-varying delays. Syst. Control Lett. 2014, 67, 24–31. [Google Scholar] [CrossRef]
  12. Wang, Y.; Cheng, L.; Hou, Z.G.; Tan, M.; Wang, M. Containment control of multi-agent systems in a noisy communication environment. Automatica 2014, 50, 1922–1928. [Google Scholar] [CrossRef]
  13. Li, Z.; Ren, W.; Liu, X.; Fu, M. Distributed containment control of multi-agent systems with general linear dynamics in the presence of multiple leaders. Int. J. Robust Nonlinear Control 2013, 23, 534–547. [Google Scholar] [CrossRef]
  14. Wen, G.; Duan, Z.; Zhao, Y.; Yu, W.; Cao, J. Robust containment tracking of uncertain linear multi-agent systems: A non-smooth controlapproach. Int. J. Control 2014, 87, 2522–2534. [Google Scholar] [CrossRef]
  15. Wang, D.; Zhang, N.; Wang, J.; Wang, W. Cooperative containment control of multiagent systems based on follower observers with time delay. IEEE Trans. Syst. Man Cybern. Syst. 2017, 47, 13–23. [Google Scholar] [CrossRef]
  16. Meng, Z.; Ren, W.; You, Z. Distributed finite-time attitude containment control for multiple rigid bodies. Automatica 2010, 46, 2092–2099. [Google Scholar] [CrossRef]
  17. Mei, J.; Ren, W.; Chen, J.; Ma, G. Distributed adaptive coordination for multiple lagrangian systems under a directed graph without using neighbors’s velocity information. Automatica 2013, 49, 1723–1731. [Google Scholar] [CrossRef]
  18. Wang, W.; Tong, S. Observer-based adaptive fuzzy containment control for multiple uncertain nonlinear systems. IEEE Trans. Fuzzy Syst. 2019, 27, 2079–2089. [Google Scholar] [CrossRef]
  19. Lü, H.; He, W.; Han, Q.; Ge, X.; Peng, C. Finite-time containment control for nonlinear multi-agent systems with external disturbances. Inf. Sci. 2020, 512, 338–351. [Google Scholar] [CrossRef]
  20. Wang, W.; Pan, Y.; Liang, H. Adaptive neural finite-time containment control for nonlower triangular nonlinear multi-agent systems with dynamics uncertainties. Neurocomputing 2020, 391, 157–166. [Google Scholar] [CrossRef]
  21. Hou, Z.; Huang, W. The model-free learning adaptive control of a class of siso nonlinear systems. In Proceedings of the 1997 American Control Conference, Albuquerque, NM, USA, 6 June 1997; pp. 343–344. [Google Scholar]
  22. Hou, Z.; Jin, S. A novel data-driven control approach for a class of discrete-time nonlinear systems. IEEE Trans. Control Syst. Technol. 2011, 19, 1549–1558. [Google Scholar] [CrossRef]
  23. Hou, Z.; Jin, S. Data-driven model-free control for a class of mimo nonlinear discrete-time systems. IEEE Trans. Neural Netw. 2011, 22, 2173–2188. [Google Scholar]
  24. Hou, Z.; Zhu, Y. Controller-dynamic-linearization-based model free adaptive control for discrete-time nonlinear systems. IEEE Trans. Ind. Informat. 2013, 9, 2301–2309. [Google Scholar] [CrossRef]
  25. Zhu, Y.; Hou, Z. Data-driven mfac for a class of discrete-time nonlinear systems with RBFNN. IEEE Trans. Neural Netw. Learn. Syst. 2014, 25, 1013–1020. [Google Scholar] [CrossRef] [PubMed]
  26. Chi, R.; Hou, Z.; Huang, B.; Jin, S. A unified data-driven design framework of optimality-based generalized iterative learning control. Comput. Chem. Eng. 2015, 77, 10–23. [Google Scholar] [CrossRef]
  27. Hou, Z.; Chi, R.; Gao, H. An overview of dynamic linearization based datadriven control and applications. IEEE Trans. Ind. Electron. 2017, 64, 4076–4090. [Google Scholar] [CrossRef]
  28. Wu, D.; Chen, H.; Huang, Y.; Chen, S. Online Monitoring and Model-Free Adaptive Control of Weld Penetration in VPPAW Based on Extreme Learning Machine. IEEE Trans. Ind. Informat. 2019, 15, 2732–2740. [Google Scholar] [CrossRef]
  29. Liu, D.; Yang, G. Data-Driven adaptive sliding mode control of nonlinear discrete-time systems with prescribed Pperformance. IEEE Trans. Syst. Man Cybern. 2019, 49, 2598–2604. [Google Scholar] [CrossRef]
  30. Ren, Y.; Hou, Z.; Sirmatel, I.I.; Geroliminis, N. Data driven model free adaptive iterative learning perimeter control for large-scale urban road networks. Transp. Res. C Emerg. Technol. 2020, 115, 102618. [Google Scholar] [CrossRef]
  31. Ren, Y.; Yin, H.; Wang, L.; Ji, H.H. Data-driven RBFNN-enhanced model-free adaptive traffic symmetrical signal control for a multi-phase intersection with fast-changing traffic flow. Symmetry 2023, 15, 1235–1253. [Google Scholar] [CrossRef]
  32. Hou, Z.; Xiong, S. On model free adaptive control and its stability analysis. IEEE Trans. Autom. Control 2019, 64, 4555–4569. [Google Scholar] [CrossRef]
  33. Hou, Z.; Wang, Z. From model-based control to data-driven control: Survey, classification and perspective. Inf. Sci. 2013, 235, 3–35. [Google Scholar] [CrossRef]
  34. Bu, X.; Hou, Z.; Zhang, H. Data-driven multiagent systems consensus tracking using model free adaptive control. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 1514–1524. [Google Scholar] [CrossRef] [PubMed]
  35. Yu, X.; Jin, S.; Liu, G.; Lei, T.; Ren, Y. A data-driven distributed adaptive control approach for nonlinear multi-agent systems. IEEE Access 2020, 8, 207884–207893. [Google Scholar] [CrossRef]
  36. Bu, X.; Yu, Q.; Hou, Z.; Qian, W. Model free adaptive iterative learning consensus tracking control for a class of nonlinear multiagent systems. IEEE Trans. Syst. Man Cybern. Syst. 2019, 49, 677–686. [Google Scholar] [CrossRef]
  37. Liu, G. Predictive control of networked nonlinear multiagent systems with communication constraints. IEEE Trans. Syst. Man Cybern. 2020, 50, 4447–4457. [Google Scholar] [CrossRef]
  38. Liu, S.; Li, J.; Wang, L.; Hou, Z.; Ji, H. Data-driven consensus control for a class of unknown nonlinear multiagent systems with time delays. IET Control Theory Appl. 2022, 16, 698–717. [Google Scholar] [CrossRef]
  39. Bu, X.; Cui, L.; Hou, Z.; Qian, W. Formation control for a class of nonlinear multiagent systems using model-free adaptive iterative learning. Int. J. Robust Nonlinear Control 2017, 28, 1402–1412. [Google Scholar] [CrossRef]
  40. Ren, Y.; Hou, Z. Robust model-free adaptive iterative learning formation for unknown heterogeneous non-linear multi-agent systems. IET Control Theory Appl. 2020, 14, 654–663. [Google Scholar] [CrossRef]
  41. Bu, X.; Guo, J.; Cui, L.; Hou, Z. Event-triggered Model Free Adaptive Containment Control for Nonlinear Multiagent Systems Under DoS Attacks. IEEE Trans. Control Netw. Syst. 2023, 1–12. [Google Scholar] [CrossRef]
  42. Liu, T.; Hou, Z. Model-Free Adaptive Containment Control for Unknown Multi-Input Multi-Output Nonlinear MASs with Output Saturation. IEEE Trans. Circuits Syst. I Regul. Pap. 2023, 70, 2156–2166. [Google Scholar] [CrossRef]
  43. Rockafellar, R.T. Convex Analysis; Princeton University Press: Princeton, NJ, USA, 1970. [Google Scholar]
  44. Hou, Z.; Jin, S. Model Free Adaptive Control: Theory and Applications; CRC Press, Taylor and Francis Group: Boca Raton, FL, USA, 2013. [Google Scholar]
  45. Ioannou, P.A.; Sun, J. Robust Adaptive Control; PTR Prentice-Hall: Upper Saddle River, NJ, USA, 1995; pp. 75–76. [Google Scholar]
  46. Yang, S.; Xu, J.; Huang, D.; Tan, Y. Optimal iterative learning control design for multi-agent systems consensus tracking. Syst. Control Lett. 2014, 69, 80–89. [Google Scholar] [CrossRef]
Figure 1. Communication graph in Example 1.
Figure 1. Communication graph in Example 1.
Applsci 14 05527 g001
Figure 2. Outputs of agents in Example 1: (a) λ = 5; (b) λ = 2; (c) λ = 1; (d) λ = 0.5.
Figure 2. Outputs of agents in Example 1: (a) λ = 5; (b) λ = 2; (c) λ = 1; (d) λ = 0.5.
Applsci 14 05527 g002
Figure 3. Inputs of agents in Example 1 ( λ = 2).
Figure 3. Inputs of agents in Example 1 ( λ = 2).
Applsci 14 05527 g003
Figure 4. PPD estimations in Example 1 ( λ = 2).
Figure 4. PPD estimations in Example 1 ( λ = 2).
Applsci 14 05527 g004
Figure 5. Communication graphs in Example 2.
Figure 5. Communication graphs in Example 2.
Applsci 14 05527 g005
Figure 6. Topology switching over time.
Figure 6. Topology switching over time.
Applsci 14 05527 g006
Figure 7. Outputs of agents in Example 2.
Figure 7. Outputs of agents in Example 2.
Applsci 14 05527 g007
Figure 8. Inputs of agents in Example 2.
Figure 8. Inputs of agents in Example 2.
Applsci 14 05527 g008
Figure 9. PPD estimations in Example 2.
Figure 9. PPD estimations in Example 2.
Applsci 14 05527 g009
Figure 10. Outputs of agents with MFACC for Example 3: (a) MFACC, β = 0.1; (b) MFACC, β = 0.2; (c) MFACC, β = 0.5; (d) MFACC, β = 0.1; (e) MFACC, β = 0.2; (f) MFACC, β = 0.5.
Figure 10. Outputs of agents with MFACC for Example 3: (a) MFACC, β = 0.1; (b) MFACC, β = 0.2; (c) MFACC, β = 0.5; (d) MFACC, β = 0.1; (e) MFACC, β = 0.2; (f) MFACC, β = 0.5.
Applsci 14 05527 g010
Figure 11. Outputs of agents with the method in [12] for Example 3: (a) the method in [12], β = 0.1; (b) the method in [12], β = 0.1.
Figure 11. Outputs of agents with the method in [12] for Example 3: (a) the method in [12], β = 0.1; (b) the method in [12], β = 0.1.
Applsci 14 05527 g011
Table 1. Example settings and parameter selections.
Table 1. Example settings and parameter selections.
Example 1Example 2Example 3
Leader number233
Leader trajectoriesstationarytime-varyingstationary and time-varying
Topologyfixedtime-switchingtime-switching
Theorem validationTheorem 1Theorem 2Theorems 1–2
Disturbancenonenoneadditive channel disturbance
Parameters ρ 1 = ρ 2 = ρ 3 = ρ 4 = 0.3 , λ [ 0.5 5 ] , η = 0.5 , μ = 1 , ε = 0.0001
Table 2. Performance comparison in Example 3.
Table 2. Performance comparison in Example 3.
MFACC, λ = 0.5MFACC, λ = 1MFACC, λ = 2Method in [12]
ISE, β = 0.1119.8189.2327.8793.5
ISE, β = 0.2120.0188.8327.4794.6
ISE, β = 0.5120.7188.2326.1800.6
ISE, β = 1125.6188.5324.9824.2
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ren, Y.; Ji, H.; Li, D.; Xie, Y.; Xiong, S.; Wang, L. Data-Driven Containment Control for a Class of Nonlinear Multi-Agent Systems: A Model Free Adaptive Control Approach. Appl. Sci. 2024, 14, 5527. https://doi.org/10.3390/app14135527

AMA Style

Ren Y, Ji H, Li D, Xie Y, Xiong S, Wang L. Data-Driven Containment Control for a Class of Nonlinear Multi-Agent Systems: A Model Free Adaptive Control Approach. Applied Sciences. 2024; 14(13):5527. https://doi.org/10.3390/app14135527

Chicago/Turabian Style

Ren, Ye, Honghai Ji, Deli Li, Yongqiang Xie, Shuangshuang Xiong, and Li Wang. 2024. "Data-Driven Containment Control for a Class of Nonlinear Multi-Agent Systems: A Model Free Adaptive Control Approach" Applied Sciences 14, no. 13: 5527. https://doi.org/10.3390/app14135527

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop