Next Article in Journal
TriNet: Exploring More Affordable and Generalisable Remote Phenotyping with Explainable Deep Models
Next Article in Special Issue
Collision-Free Path Planning for Multiple Drones Based on Safe Reinforcement Learning
Previous Article in Journal
Multi-Unmanned Aerial Vehicles Cooperative Trajectory Optimization in the Approaching Stage Based on the Attitude Correction Algorithm
Previous Article in Special Issue
Multiple Unmanned Aerial Vehicle (multi-UAV) Reconnaissance and Search with Limited Communication Range Using Semantic Episodic Memory in Reinforcement Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Robust Hybrid Iterative Learning Formation Strategy for Multi-Unmanned Aerial Vehicle Systems with Multi-Operating Modes

1
Guangdong Laboratory of Artificial Intelligence and Digital Economy (Shenzhen), Shenzhen 518107, China
2
College of Civil and Transportation Engineering, Shenzhen University, Shenzhen 518060, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Drones 2024, 8(8), 406; https://doi.org/10.3390/drones8080406
Submission received: 19 July 2024 / Revised: 14 August 2024 / Accepted: 15 August 2024 / Published: 19 August 2024
(This article belongs to the Special Issue Distributed Control, Optimization, and Game of UAV Swarm Systems)

Abstract

:
This paper investigates the formation control problem of multi-unmanned aerial vehicle (UAV) systems with multi-operating modes. While mode switching enhances the flexibility of multi-UAV systems, it also introduces dynamic model switching behaviors in UAVs. Moreover, obtaining an accurate dynamic model for a multi-UAV system is challenging in practice. In addition, communication link failures and time-varying unknown disturbances are inevitable in multi-UAV systems. Hence, to overcome the adverse effects of the above challenges, a hybrid iterative learning formation control strategy is proposed in this paper. The proposed controller does not rely on precise modeling and exhibits its learning ability by utilizing historical input–output data to update the current control input. Furthermore, two convergence theorems are proven to guarantee the convergence of state, disturbance estimation, and formation tracking errors. Finally, three simulation examples are conducted for a multi-UAV system consisting of four quadrotor UAVs under multi-operating modes, switching topologies, and external disturbances. The results of the simulations show the strategy’s effectiveness and superiority in achieving the desired formation control objectives.

1. Introduction

In recent years, unmanned aerial vehicles (UAVs), which are aircraft capable of autonomous flight and load-carrying, have garnered significant attention from researchers in both the scientific and engineering communities. Compared to the single UAV systems, multi-UAV systems composed of groups of homogeneous/heterogeneous UAVs have the advantage of collaborative task execution and a wide range of applications in complex task scenarios [1], such as multi-satellite systems [2], persistent reconnaissance [3,4], and cargo transportation [5]. Formation control, one of the most fundamental coordinating control problems of multi-UAV systems [6,7], entails ensuring that any neighboring UAVs keep a fixed deviation while tracking the desired trajectory. Hitherto, several formation keeping methods have been well studied, including behavioral-based methods [8], artificial potential field methods [9], and distributed leader–follower methods [10,11]. Additionally, as a typical model-free intelligent algorithm, reinforcement learning is also widely used in motion planning and formation problems of multi-UAV systems [12,13]. Specifically, the design of leader–follower formation control for multi-UAV systems depends on a well-constructed communication graph, where each UAV can only interact with its neighboring UAVs [14].
For the control problem of leader–follower formation tracking, numerous researchers have proposed a variety of control algorithms, considering UAVs’ dynamic models to be nonlinear, strongly coupled, time-varying, and underactuated. In [15], Gong et al. present an event-triggered distributed formation algorithm, which, compared to centralized algorithms, facilitates the more efficient and flexible task execution of multi-satellite systems. In [11], a neural adaptive sliding mode controller is proposed to address the uncertainty and unmodeled dynamic models of multi-UAV systems. In [16], the optimization control technique is utilized in the design of a distributed leader–follower formation tracking scheme, and in [17], for multi-UAV systems subject to multi-constraints, a model predictive control scheme is proposed to achieve a formation flight. Additionally, a recurrent neural network-based leader–follower consensus formation control method for small-size nonlinear UAVs is presented in [18]. Based on the aforementioned methods for multi-UAV systems and considering the conservation of communication resources, the distributed formation control method is generally regarded as efficient because it requires information only from local neighboring UAVs rather than all UAVs [19].
However, in certain application scenarios, UAVs must switch between different operating modes to adapt to various working environments. For example, an amphibious robot described in [20] is designed with multi-mode operations to work flexibly in different environments, such as beaches, grasslands, seabeds, and underwater. In [21], the researchers present a novel self-reconfigurable modular robotic system with multi-mode locomotion and conduct an experiment with two locomotion modes: inchworm-type mode and quadruped-walker mode. In [22], the researchers develop a soft crawling robot with three locomotive modes (inchworm-like motion, creeping, and tumbling) to enhance the locomotion ability of untethered robots in narrow spaces. Therefore, this paper considers a class of multi-UAV systems composed of the individual UAVs with multi-operating modes, such as those described in [20,21], and it significantly enhances the application potential of multi-UAV systems in diverse working environments. Additionally, in [23], the dynamical system with multi-operating modes is described as a switched dynamic system consisting of a finite number of sub-modes, and such switching behavior complicates the designs of formation control algorithms of multi-UAV systems with multi-operating modes.
Meanwhile, the switching behaviors, resulting from the communication link failures or cyber-attacks, may occur in the communication topology, and many researchers have studied the formation control of multi-UAV systems with switching communication topologies. For instance, in [24], Zou et al. assume that the switching topologies are uniformly jointly connected, and design an adaptive controller to address the formation tracking of multiple vertical takeoff and landing UAVs. In [25], the inner–outer distributed second-order controller is presented for the formation tracking of multi-rotors under switching topologies with velocities and acceleration saturation constraints. In [26], under Markovian random switching topologies, a distributed formation control strategy combining the unknown parameter observer is put forward for a group of quadrotor UAVs. However, up to now, no literature has been found that addresses multi-UAV systems with multi-operating modes and switching communication topologies.
In addition, most existing formation tracking studies [26,27] are valid in the infinity time domain. However, this characteristic also imposes limitations on dealing with finite-time practical tasks. Unlike other control algorithms that investigate the convergence/stability of controlled systems in the infinite time domain, iterative learning control (ILC)-based methods aim to improve the tracking performance of repetitive dynamic systems in the iteration domain by learning from previous iterations. In [28], the researchers design a double proportional–derivative-type (PD-type) ILC control strategy for leader–follower formation tracking of a class of linear multiple quadrotor UAV systems. For the nonlinear and non-affine multi-UAV systems, Fu et al. use a data-driven ILC with a dynamic linearization approximation approach to handle the point–point leader–follower formation tracking problem in [29]. In [30], a derivative-type (D-type) ILC-based leader–follower formation control algorithm is constructed for a team of quadrotors. However, existing ILC-based formation control schemes [29,30] are difficult to apply to multi-UAV systems with multi-operating modes due to the presence of switching operating modes.
Motivated by the preceding discussions, we focus on an application scenario of repeat cargo transportation of multi-UAV systems. Thus, UAVs are required to have the ability to carry cargoes, and each UAV has two operating modes: on-load mode (i.e., the UAV is working under carrying cargoes), and no-load mode (i.e., the UAV is working without carrying cargoes). However, some dynamic parameters, including total mass, center of mass, and rotational inertia, undergo substantial variations when releasing heavy payloads. Developing a dynamic model to encompass various payload scenarios and designing a single flight controller throughout the mission are highly challenging [31]. Thus, a class of multi-UAV systems with multi-operating modes are considered. This paper investigates a hybrid iterative learning formation strategy for multi-UAV systems under multi-operating modes, switching communication topologies and unknown external disturbances. The contributions and novelty of this paper are summarized as follows.
  • We propose a hybrid robust iterative learning formation control strategy that combines a state and disturbance iterative learning observer (ILO), and an iterative learning controller for multi-UAV systems under multi-operating modes. In contrast to the existing literature on formation control for multi-UAV systems [32,33], this paper extends the application fields of multi-UAV system formation control from the single operating mode to the multi-operating mode.
  • The proposed method does not depend on the accurate dynamic model of multi-UAV systems, and the control input in the current iteration is updated by utilizing the stored data from previous iterations.
  • Compared to iterative learning works for multi-UAV systems [28,29], an iteration-fixed initial state assumption is not required in this paper. Detailed theoretical results are described to ensure the convergence of state and disturbance estimated errors and the formation tracking error with the proposed hybrid formation control strategy.
  • Three simulation experiments with a multi-UAV system composed of four quadrotor UAVs are conducted to demonstrate the effectiveness and the superiority of the proposed formation strategy in dealing with the finite-time formation tracking.
The rest of this paper is structured as follows. Section 2 presents the communication topology theory and the general second-order nonlinear dynamic model of multi-UAV systems, along with the key assumptions and lemmas. Section 3 proposes the hybrid iterative learning formation control strategy, and the detailed theoretical results are presented in Section 4. The simulation is conducted in Section 5 and Section 6, concluding the paper.
Notation 1.
“⊗” denotes the Kronecker product.  a  and  A  are the vector norm and matrix norm, respectively.  I n = d i a g { 1 , 1 , , 1 } 1 n = { 1 , 1 , , 1 } T  and  0 n = { 0 , 0 , , 0 } T A B  denotes the matrix ( A B ) is negative-definite.  Z +  is the set of all non-negative integers. For two vectors  a = ( a 1 , a 2 , a 3 ) R n  and  b = ( b 1 , b 2 , b 3 ) R n , their cross product is  a × b = ( a 2 b 3 a 3 b 2 , a 3 b 1 a 1 b 3 , a 1 b 2 a 2 b 1 ) R n .

2. Problem Formulation

In this section, we offer some prerequisite knowledge of communication graph and formulate the dynamic model of multi-UAV systems under multi-operating modes.

2.1. Communication Topology

A directed graph reflects the connection relationships among UAVs, and is defined by  G = { V , E , A } , where  V = { v 1 , v 2 , , v N }  is the vertex set,  E  is the edge set, and  A  is the adjacent matrix. If the edge  ( j , i )  belongs to  E , then  a ( i j ) > 0 ; otherwise,  a ( i j ) = 0 . In particular,  a ( i i ) = 0 . A Laplacian matrix is defined as  L = ( D A ) R N × N , where  D = d i a g { α ( 1 ) , α ( 2 ) , , α ( N ) } R N × N  with  α ( i ) = j = 1 N a ( i j ) . In this paper, suppose that the desired trajectory is generated by a leader UAV, and only a portion of the UAVs can exchange information with the leader UAV. Thus, we define  Π = d i a g ( π 1 , π 2 , , π N ) R N × N  to denote the connection between the UAVs and leader. The graph with a leader UAV is  G ¯ = { V { 0 } , E ¯ , A ¯ } . Then, if all UAVs can form a spanning tree with the leader UAV as the root vertex, this means that the graph  G ¯  contains a spanning tree.

2.2. Multi-UAV Systems with Multi-Operating Modes

In practical scenarios, the dynamic models of many UAV systems can be described as second-order dynamical systems.Hence, considering the formation tracking issue of multi-UAV systems under multi-operating modes, a general class of second-order nonlinear dynamic models of the j-th UAV at the h-th iteration is expressed as
x ˙ 1 ( j ) ( t , h ) = x 2 ( j ) ( t , h ) x ˙ 2 ( j ) ( t , h ) = f s ( t ) ( x 2 ( j ) ( t , h ) ) + B s ( t ) u ( j ) ( t , h ) + ω s ( t ) ( j ) ( t , h ) y ( j ) ( t , h ) = C s ( t ) x 1 ( j ) ( t , h )
where  t [ 0 , T ] h Z +  is the iteration number, and  j { 1 , 2 , , N } x 1 ( j ) ( t , h ) , x 2 ( j ) ( t , h ) R n u ( j ) ( t , h ) R p  and  y ( j ) ( t , h ) R m  are the state, input, and output of the UAV.  x ˙ 1 ( j ) ( t , h ) , x ˙ 2 ( j ) ( t , h ) R n  are first-order time derivatives of  x 1 ( j ) ( t , h ) , x 2 ( j ) ( t , h ) B s ( t ) R n × p  is a full-of-row rank input matrix and  C s ( t ) R m × n  is an output matrix.  f s ( t ) ( x 2 ( j ) ( t , h ) ) : R n R n  is a vector-valued nonlinear function.  ω s ( t ) ( j ) ( t , h ) R n  is a time- and iterative-variant finite external disturbance, and  ω s ( t ) ( j ) ( t , h ) b ω , where the positive constant  b ω  is a finite bound.  s ( t ) { 1 , 2 , . . . , M }  is the operating mode switching law; e.g., if  t Θ k = [ t k 1 , t k ) s ( t ) = k .
Remark 1.
For the multi-UAV system (1), the system matrices and vectors exhibit a switching behavior in different operating modes. However, the switching behavior, resulting from communication link failures or cyber-attacks, may occur in the communication topology, referred to as the switching topology  G ¯ s ( t ) { G ¯ 1 , G ¯ 2 , , G ¯ M } , and its corresponding Laplacian matrix is  L s ( t ) . In particular, any sub-topology  G ¯ k , ( k { 1 , 2 , , M } ) can form a spanning tree and the leader is the root vertex.
Here, some assumptions and lemmas contributing to the analysis are firstly given.
Assumption A1.
For any  t Θ k , the nonlinear function  f k ( · )  is assumed to be differentiable with respect to the state x and satisfies the Lipschitz condition  f k ( x ˜ ) f k ( x ^ ) l f x ˜ x ^ , where constant  l f > 0  is the Lipschitz constant.
Remark 2.
Assumption 1 is to provide some prior knowledge for the unknown nonlinear term  f k ( · )  of the multi-UAV system (1), and many practical networked systems are considered to be composed of nonlinear individuals that satisfy Assumption 1, e.g., [34,35].
Lemma 1
([36]). The non-negative real sequences  { r h } R  and  { q h } R  satisfy  r h + 1 ϵ r h + q h ; if the positive constant  ϵ < 1  and  lim h q h = q , then we have  lim h r h q / ( 1 ϵ ) .
Lemma 2
([37]). For a vector-valued function  f ( t ) R n , where  t Θ k , its λ-norm is defined by  f ( · ) Θ k λ = sup t Θ k e x p ( λ t ) f ( t ) . The following two properties associated with λ-norm are valid, i.e.,
(1) For vector-valued functions  f ( t ) , g ( t ) R n , if  f ( t ) = t k 1 t g ( τ ) d τ , then
f ( · ) Θ k λ 1 e x p ( λ ( t k t k 1 ) ) λ g ( · ) Θ k λ
(2) For vector-valued functions  f ( t ) , g ( t ) R n , if  f ( t ) = t k 1 t t k 1 s g ( τ ) d τ , then
f ( · ) Θ k λ ( 1 e x p ( λ ( t k t k 1 ) ) ) 2 λ 2 g ( · ) Θ k λ
Formation objective: Let  d j ( t ) R m  be the desired formation deviation between the j-th UAV and the virtual leader UAV, and let  d j l ( t ) = d j ( t ) d l ( t ) R m  be the desired formation deviation between the j-th UAV and the l-th UAV. For the multi-UAV system (1) with multi-operating modes under the switching law  s ( t ) , and with both the switching communication topology and unknown external disturbance being considered, the formation objective is to design an appropriate control input  u ( j ) ( t , h )  such that each each UAV maintains a desired deviation from its neighbor UAVs over a finite time interval [38], i.e.,
lim h + ( y ( i ) ( t , h ) y ( j ) ( t , h ) ) = d i j ( t ) , i , j V , i j .
Then, we define a leader trajectory tracking error as  e ( j ) ( t , h ) = y d ( t ) + d j ( t ) y ( j ) ( t , h ) . It is noted that the control objective (4) can be ensured if the formation tracking error  e ( j ) ( t , h )  is convergent in the iteration domain. In order to facilitate the analysis later, for any  t Θ k , the compact form of the multi-UAV system (1) is denoted by
x ˙ 1 ( t , h ) = x 2 ( t , h ) x ˙ 2 ( t , h ) = f k ( x 2 ( t , h ) ) + B ˜ k u ( t , h ) + ω k ( t , h ) y ( t , h ) = C ˜ k x 1 ( t , h )
where  x 1 ( t , h ) = [ x 1 ( 1 ) ( t , h ) T , , x 1 ( N ) ( t , h ) T ] T R N n x 2 ( t , h ) = [ x 2 ( 1 ) ( t , h ) T , , x 2 ( N ) ( t , h ) T ] T R N n u ( t , h ) = [ u ( 1 ) ( t , h ) T , , u ( N ) ( t , h ) T ] T R N p y ( t , h ) = [ y ( 1 ) ( t , h ) T , , y ( N ) ( t , h ) T ] T R N m f k ( x 2 ( t , h ) ) = [ f k ( x 2 ( 1 ) ( t , h ) ) T , , f k ( x 2 ( N ) ( t , h ) ) T ] T R N n ω k ( t , h ) = [ ω k ( 1 ) ( t , h ) T , , ω k ( N ) ( t , h ) T ] T R N n B ˜ k = I N B k  and  C ˜ k = I N C k . Thus, the compact form of the tracking error is  e ( t , h ) = 1 N y d ( t ) + d ( t ) y ( t , h ) , where  d ( t ) = [ d 1 ( t ) T , , d N ( t ) T ] T .

3. Methodology

To achieve the formation tracking of multi-UAV systems under switching dynamic modes, switching communication topologies, and external disturbances, this section proposes a hybrid iterative learning formation strategy.

3.1. ILO Design for Multi-UAV Systems

Due to the unmeasurable states and unknown disturbances in the multi-UAV system (1), a hybrid ILO for estimating the system states and disturbances is proposed as
x ^ ˙ 1 ( j ) ( t , h ) = x ^ 2 ( j ) ( t , h ) x ^ ˙ 2 ( j ) ( t , h ) = f s ( t ) ( x ^ 2 ( j ) ( t , h ) ) + B s ( t ) u ( j ) ( t , h ) + ω ^ s ( t ) ( j ) ( t , h ) + K s ( t ) ( j ) ( y ( j ) ( t , h ) y ^ ( j ) ( t , h ) ) y ^ ( j ) ( t , h ) = C s ( t ) x ^ 1 ( j ) ( t , h ) ω ^ s ( t ) ( j ) ( t , h + 1 ) = ω ^ s ( t ) ( j ) ( t , h ) + L s ( t ) ( j ) ( y ¨ ( j ) ( t , h ) y ^ ¨ ( j ) ( t , h ) )
where  x ^ 1 ( j ) ( t , h ) R n x ^ 2 ( j ) ( t , h ) R n y ^ ( j ) ( t , h ) R m , and  ω ^ s ( t ) ( j ) ( t , h ) R n  are the estimated state, output, and disturbance vectors, respectively.  x ^ ˙ 1 ( j ) ( t , h ) , x ^ ˙ 2 ( j ) ( t , h ) R n  are the first-order time derivatives of  x ^ 1 ( j ) ( t , h ) , x ^ 2 ( j ) ( t , h ) y ¨ ( j ) ( t , h ) , y ^ ¨ ( j ) ( t , h ) R m  are the second-order time derivatives of  y ( j ) ( t , h ) , y ^ ( j ) ( t , h ) K s ( t ) ( j ) R n × m  and  L s ( t ) ( j ) R n × m  are the observer gain matrices. The deviation between the initial estimation states and real states are assumed to be finite, i.e.,  x 1 ( j ) ( 0 , h ) x ^ 1 ( j ) ( 0 , h ) b x 1  and  x 2 ( j ) ( 0 , h ) x ^ 2 ( j ) ( 0 , h ) b x 2 .
For any  t Θ k , the compact form of ILO (6) is
x ^ ˙ 1 ( t , h ) = x ^ 2 ( t , h ) x ^ ˙ 2 ( t , h ) = f k ( x ^ 2 ( t , h ) ) + B ˜ k u ( t , h ) + ω ^ s ( t ) ( t , h ) + K k ( y ( t , h ) y ^ ( t , h ) ) y ^ ( t , h ) = C ˜ k x ^ 1 ( t , h ) ω ^ k ( t , h + 1 ) = ω ^ k ( t , h ) + L k ( y ¨ ( t , h ) y ^ ¨ ( t , h ) )
where  x ^ 1 ( t , h ) = [ x ^ 1 ( 1 ) ( t , h ) T , , x ^ 1 ( N ) ( t , h ) T ] T R N n 1 x ^ 2 ( t , h ) = [ x ^ 2 ( 1 ) ( t , h ) T , , x ^ 2 ( N ) ( t , h ) T ] T R N n 2 y ^ ( t , h ) = [ y ^ ( 1 ) ( t , h ) T , , y ^ ( N ) ( t , h ) T ] T R N m , and  ω ^ k ( t , h ) = [ ω ^ k ( 1 ) ( t , h ) T , , ω ^ k ( N ) ( t , h ) T ] T R N n K k = d i a g { K k ( 1 ) , , K k ( N ) }  and  L k = d i a g { L k ( 1 ) , , L k ( N ) } .
The estimated state error and disturbance error are defined as
e x 1 ( t , h ) = x 1 ( t , h ) x ^ 1 ( t , h )
e x 2 ( t , h ) = x 2 ( t , h ) x ^ 2 ( t , h )
e s ( t ) ω ( t , h ) = ω s ( t ) ( t , h ) ω ^ s ( t ) ( t , h )

3.2. ILC Controller Design for Multi-UAV Systems

The distributed formation tracking error of a multi-UAV system (1) is defined as
ψ ( j ) ( t , h ) = i = 1 N a s ( t ) ( i j ) ( y ( i ) ( t , h ) y ( j ) ( t , h ) + d j i ( t ) ) + π s ( t ) ( j ) ( y d ( t ) + d j ( t ) y ( j ) ( t , h ) )
where  a s ( t ) ( i j )  is the element of adjacent matrix  A s ( t ) R N × N , and  π s ( t ) ( j )  is the diagonal element of the leader connected status matrix  Π s ( t ) R N × N . Then, a distributed ILC control algorithm is proposed as
u ( j ) ( t , h + 1 ) = u ( j ) ( t , h ) + K j , s ( t ) x η ^ ( j ) ( t , h ) + Λ j , s ( t ) ψ ˙ ( j ) ( t , h ) + Γ j , s ( t ) ψ ¨ ( j ) ( t , h ) + K j , s ( t ) ω ω ^ ˜ s ( t ) ( j ) ( t , h )
where  η ^ ( j ) ( t , h ) = x ^ ( j ) ( t , h + 1 ) x ^ ( j ) ( t , h ) R n  and  ω ^ ˜ s ( t ) ( j ) ( t , h ) = ω ^ s ( t ) ( j ) ( t , h + 1 ) ω ^ s ( t ) ( j ) ( t , h ) R n ψ ˙ ( j ) ( t , h )  and  ψ ¨ ( j ) ( t , h )  are the first- and second-order time derivatives of  ψ ( j ) ( t , h ) , respectively.  K j , s ( t ) x R p × n Γ j , s ( t ) R p × m Λ j , s ( t ) R p × m , and  K j , s ( t ) ω R p × n  are the gain matrices.
The distributed initial state learning scheme is proposed as
x 1 ( j ) ( 0 , h + 1 ) = x 1 ( j ) ( 0 , h ) + B 1 Γ j , 1 ψ ( j ) ( 0 , h )
x 2 ( j ) ( 0 , h + 1 ) = x 2 ( j ) ( 0 , h ) + B 1 Λ j , 1 ψ ( j ) ( 0 , h ) + B 1 Γ j , 1 ψ ˙ ( j ) ( 0 , h )
The control block diagram of the hybrid iterative learning formation strategy for the multi-UAV system (1) with multi-operating modes are shown in Figure 1.
Remark 3.
In many existing ILC designs for multi-UAV systems [28,29,34], an iteration-fixed initial state assumption is required to make the fixed initial output of the follower UAVs remain identical to the initial output of the leader, (i.e.,  y ( j ) ( 0 , h ) = y d ( 0 ) ). However, not all UAVs have access to the desired information from the leader UAV in the multi-UAV system, and thus the identical initial state assumption is not applicable anymore. To overcome such a non-identical iteration initial state problem, a distributed initial state learning scheme (13) and (14) is proposed, and the formation tracking objective (4) can be reached by taking any arbitrary initial states.
Remark 4.
The proposed ILC control algorithm (12) is a data-driven control method and does not depend on the accurate dynamic model of multi-UAV systems. Specifically, from (12), we observe that the control input at the current iteration is updated by using the stored data from the previous iteration, and the objective of the new control input is computed in such way as to ensure that the formation tracking error will be reduced in the next iteration. In other words, the ILC control algorithm (12) has the capability to learn from the history of iterative tasks, thereby improving the formation tracking performance. This feature makes the iterative learning-based methods well suited to the formation control problem of multi-UAV systems subject to uncertainties and disturbances.

4. Convergence Analysis

The convergence of the estimation error and formation tracking errors are analyzed at length in this section.
Theorem 1.
Consider the multi-UAV system (1) under Assumption 1; let the hybrid ILO (6) be used if there exists a gain matrix  L k  that satisfies
sup k 1 , 2 , , M I N n L k C ˜ k ρ < 1
and then, the estimation errors  e x 1 ( t , h ) e x 2 ( t , h )  and  e s ( t ) ω ( t , h )  are convergent and bounded in the iteration domain.
Proof. 
For any sub-task interval  t Θ k , the estimation state errors along the iteration axis are
e x 1 ( t , h ) = x 1 ( t , h ) x ^ 1 ( t , h ) = e x 1 ( t k 1 , h ) + t k 1 t e x 2 ( τ , h ) d τ e x 2 ( t , h ) = x 2 ( t , h ) x ^ 2 ( t , h ) = e x 2 ( t k 1 , h ) + t k 1 t [ f k ( x 2 ( τ , h ) ) f k ( x ^ 2 ( τ , h ) ) ] d τ
+ t k 1 t [ K k C ˜ k e x 1 ( τ , h ) + e k ω ( τ , h ) ] d τ
We take  λ -norms on both sides of (16) and (17) to obtain
e x 1 ( · , h ) Θ k λ e x 1 ( t k 1 , h ) + Δ k ( λ 1 ) e x 2 ( · , h ) Θ k λ e x 2 ( · , h ) Θ k λ e x 2 ( t k 1 , h )
+ Δ k ( λ 1 ) [ l f e x 2 ( · , h ) Θ k λ + β 1 e x 1 ( · , h ) Θ k λ + e k ω ( · , h ) Θ k λ ]
where  Δ k ( λ 1 ) = 1 e x p ( λ ( t k t k 1 ) ) λ  and  β 1 = s u p 1 K 1 C ˜ 1 . By rearranging (19), we obtain
e x 2 ( · , h ) Θ k λ λ p k ( e x 2 ( t k 1 , h ) + Δ k ( λ 1 ) β 1 e x 1 ( t k 1 , h ) ) + 1 e x p ( λ ( t k t k 1 ) ) p k e k ω ( · , h ) Θ k λ
where  p k = λ l f ( 1 e x p ( λ ( t k t k 1 ) ) ) c k ( 1 e x p ( λ ( t k t k 1 ) ) ) 2 / λ .
On the other hand, the estimation disturbance error along the iteration axis is
e k ω ( t , h + 1 ) = ω k ( t , h + 1 ) ω ^ k ( t , h + 1 ) = e k ω ( t , h ) L k C ˜ k e ˙ x 2 ( t , h ) + ω ˜ k ( t , h ) = ( I N n L k C ˜ k ) e k ω ( t , h ) L k C ˜ k ( f k ( x 2 ( t , h ) ) f k ( x ^ ( t , h ) ) ) + L k C ˜ k K k C ˜ k e x 1 ( t , h ) + ω ˜ k ( t , h )
Then, taking  λ -norms on both sides of (21) yields
e k ω ( · , h + 1 ) Θ k λ I N n L k C ˜ k e k ω ( · , h ) Θ k λ + β 2 l f e x 2 ( · , h ) Θ k λ + β 3 e x 1 ( · , h ) Θ k λ + 2 b ω
where  β 2 = s u p k L k C ˜ k  and  β 3 = L k C ˜ k K k C ˜ k . The rest of this proof is given in the following steps:
Step 1: For  t Θ 1  and  t 0 = 0 , we know  e x 1 ( 0 , h ) b x 1  and  e x 2 ( 0 , h ) b x 2 , and substituting (18) and (20) into (22), there is
e 1 ω ( · , h + 1 ) Θ 1 λ I N n L 1 C ˜ 1 + 1 e x p ( λ t 1 ) p 1 ( β 2 l f + β 3 Δ 1 ( λ 1 ) ) × e k ω ( · , h ) Θ 1 λ + ( β 3 + λ β 1 β 2 l f p 2 Δ 1 ( λ 1 ) ) b x 1 + λ p 1 b x 2 + 2 b ω
The term  1 e x p ( λ ( t 1 ) ) / p 1  can be made negligibly small by choosing a sufficiently large  λ , and according to the convergence condition (15), we have
I N n L 1 C ˜ 1 + 1 e x p ( λ ( t 1 ) ) p 1 ( β 2 l f + β 3 Δ ( λ 1 ) ) ρ < 1
Thus, by applying Lemma 1 to (23), we derive
lim h e 1 ω ( · , h ) Θ 1 λ β ¯ 1 1 ρ e ¯ 1 ω
where  β ¯ 1 = ( β 3 + λ β 1 β 2 l f p 2 Δ 1 ( λ 1 ) ) b x 1 + λ p 1 b x 2 + 2 b ω , and from (20), we have
lim h e x 2 ( · , h ) Θ 1 λ 1 e x p ( λ t 1 ) p 1 lim h e 1 ω ( · , h ) Θ 1 λ = 1 e x p ( λ ( t 1 ) p 1 e ¯ 1 ω e ¯ 1 x 2 .
From (18), we also have
lim h e x 1 ( · , h ) Θ 1 λ Δ 1 ( λ 1 ) lim h e x 1 ( · , h ) Θ 1 λ = Δ 1 ( λ 1 ) e ¯ 1 x 2 e ¯ 1 x 1 .
Step 2: For  t Θ 2 , substituting (18) and (20) into (22) derives
e 2 ω ( · , h + 1 ) Θ 2 λ I N n L 2 C ˜ 2 + 1 e x p ( λ ( t 2 t 1 ) ) p 2 ( β 2 l f + β 3 Δ 2 ( λ 1 ) ) e 2 ω ( · , h ) Θ 2 λ + ( β 3 + λ β 1 β 2 l f p 2 Δ 2 ( λ 1 ) ) e x 1 ( t 1 , h ) + λ p 2 e x 1 ( t 1 , h ) + 2 b ω
Similarly, by choosing a sufficiently large  λ  such that the term  1 e x p ( λ ( t 2 t 1 ) ) / p 2  is made negligibly small, and according to the convergence condition (15), we have
I N n L 2 C ˜ 2 + 1 e x p ( λ ( t 2 t 1 ) ) p 2 ( β 2 l f + β 3 Δ ( λ 1 ) ) ρ < 1
Then, from (27) and (26), we take the limitation on the second term of the right sides of (28) to yield
lim h [ ( β 3 + λ β 1 β 2 l f p 2 Δ 2 ( λ 1 ) ) e x 1 ( t 1 , h ) + λ p 2 e x 1 ( t 1 , h ) + 2 b ω ] ( β 3 + λ β 1 β 2 l f p 2 Δ 2 ( λ 1 ) ) e ¯ 1 x 1 + λ p 2 e ¯ 1 x 2 + 2 b ω β ¯ 2
Thus, applying Lemma 1 to (28), we obtain  lim h e 2 ω ( · , h ) Θ 2 λ = β ¯ 2 1 ρ e ¯ 2 ω . And from (20), we have
lim h e x 2 ( · , h ) Θ 2 λ λ p 2 ( lim h e x 2 ( t 1 , h ) + Δ 2 ( λ 1 ) β 1 lim h e x 1 ( t 1 , h ) ) + 1 e x p ( λ ( t 2 t 1 ) ) p 2 lim h e 2 ω ( · , h ) Θ 2 λ = λ p 2 ( e ¯ 1 x 2 + Δ 2 ( λ 1 ) β 1 e ¯ 1 x 1 ) + 1 e x p ( λ ( t 2 t 1 ) p 2 e ¯ 2 ω e ¯ 2 x 2 .
From (18), we have
lim h e x 1 ( · , h ) Θ 2 λ lim h e x 1 ( t 1 , h ) + Δ 2 ( λ 1 ) lim h e x 1 ( · , h ) Θ 2 λ = e ¯ 1 x 1 + Δ 2 ( λ 1 ) e ¯ 1 x 2 e ¯ 2 x 1 .
By repeating the procedure of Step 2 in time intervals  Θ 3 , Θ 4 , , Θ M , we can conclude that the estimation errors  e x 1 ( t , h ) e x 2 ( t , h ) , and  e s ( t ) ω ( t , h )  are convergent and bounded for all  t [ 0 , T ]  with the iteration number tending to infinity. The proof is completed. □
Theorem 2.
Consider the multi-UAV system (1) under Assumption 1. Let the ILC controller (12) be used with the gain matrix  K k ω = B ˜ k T ( B ˜ k B ˜ k T ) 1 ; if there exists a gain matrix  Γ k  satisfying the following condition, i.e.,
sup k 1 , 2 , , M I N m C ˜ k B ˜ k Γ k Ψ k κ < 1 ,
where  Ψ k = ( L k + Π k ) I m , then the formation tracking error is convergent and bounded in the iteration domain.
Proof. 
For any sub-task interval  t Θ k , the compact form of the ILC algorithm (12) is
u ( t , h + 1 ) = u ( t , h ) + K k x η ^ 2 ( t , h ) + Λ k Ψ k e ˙ ( t , h ) + Γ k Ψ k e ¨ ( t , h ) + K k ω ω ^ ˜ k ( t , h )
where  K k x = d i a g { K 1 , k x , , K N , k x } R N p × N n Λ k = d i a g { Λ 1 , k , , Λ N , k } R N p × N m Γ k = d i a g { Γ 1 , k , , Γ N , k } R N p × N m K k ω = d i a g { K 1 , k ω , , K N , k ω } R N p × N n η ^ ( t , h ) = ( η ^ ( 1 ) ( t , h ) T , , η ^ ( N ) ( t , h ) T ) T R N n  and   e ( t , h ) = ( e ( 1 ) ( t , h ) T , , e ( N ) ( t , h ) T ) T R N m . From the compact form (5) of the multi-UAV system, the system states of multi-UAV systems under any k-th operating mode (i.e.,  t Θ k ) can be expressed as
x 2 ( t , h ) = x 2 ( t k 1 , h ) + t k 1 t f k ( x 2 ( τ , h ) ) d + t k 1 t [ B ˜ k u ( τ , h ) + ω k ( τ , h ) ] d τ
x 1 ( t , h ) = x 1 ( t k 1 , h ) + t k 1 t x 2 ( τ , h ) d τ
Thus, according to (36), the deviation of the formation tracking error between two consecutive iterations is derived as
e ( t , h + 1 ) e ( t , h ) = ( y ( t k 1 , h + 1 ) y ( t k 1 , h ) ) C ˜ k t k 1 t [ x 2 ( τ , h + 1 ) x 2 ( τ , h ) ] d τ
And from (35), the deviation of the state  x 2 ( t , h )  between two consecutive iterations is
x 2 ( t , h + 1 ) x 2 ( t , h ) = x 2 ( t k 1 , h + 1 ) x 2 ( t k 1 , h ) + t k 1 t [ f k ( x 2 ( τ , h + 1 ) ) f k ( x 2 ( τ , h ) ) ] d τ + t k 1 t [ B ˜ k Δ u ( τ , h ) + ω ˜ k ( τ , h ) ] d τ = t k 1 t [ ( A k ( τ , h ) + B ˜ k K k x ) η 2 ( τ , h ) B ˜ k K k x e η ^ 2 ( τ , h ) ] d τ + t k 1 t [ ( I N n + B ˜ k K k ω ) ω ˜ ( τ , h ) B ˜ k K k ω e ω ˜ ( τ , h ) ] d τ + η 2 ( t k 1 , h ) + B ˜ k [ Λ k Ψ k ( e ( t , h ) e ( t k 1 , h ) ) + Γ k Ψ k ( e ˙ ( t , h ) e ˙ ( t k 1 , h ) ) ]
where  A k ( t , h ) = d f k ( x ¯ 2 ( t , h ) ) d x  by using Assumption 1 and the Mean Value Theorem [39].  x ¯ 2 ( t , h ) = x 2 ( t , h + 1 ) + θ ( x 2 ( t , h + 1 ) x 2 ( t , h ) ) θ = d i a g { θ 1 , , θ N n }  and  ( 0 < θ l < 1 , l = 1 , 2 , , N n ) . Then, the substitution of (38) into (37) yields
e ( t , h + 1 ) = [ I N m C ˜ k B ˜ k Γ k Ψ k ] e ( t , h ) C ˜ k B ˜ k Λ k Ψ k t k 1 t e ( τ , h ) d τ C ˜ k t k 1 t t k 1 s A ˜ k ( τ , h ) η 2 ( τ , h ) d τ C ˜ k ( η 1 ( t k 1 , h ) + C ˜ k B ˜ k K k x t k 1 t t k 1 s e η ^ 2 ( τ , h ) d τ B ˜ k Γ k Ψ k e ( t k 1 , h ) ) + C ˜ k B ˜ k K k ω t k 1 t t k 1 s e ω ˜ ( τ , h ) d τ C ˜ k [ η 2 ( t k 1 , h ) + C ˜ k [ B ˜ k Λ k Ψ k e ( t k 1 , h ) + B ˜ k Γ k Ψ k e ˙ ( t k 1 , h ) ] ( t t k 1 )
where  A ˜ k ( t , h ) = A k ( t , h ) + B ˜ k K k x . Next, the rest of this proof is given in the following steps:
Step 1: For  t Θ 1  and  t 0 = 0 , the multi-UAV system (1) is working under the 1-th operating mode and the sub-topology  G ¯ 1  is active. The compact forms of initial state learning protocol (13) and (14) are
η 1 ( 0 , h ) = B ˜ 1 Γ 1 Ψ 1 e ( 0 , h )
η 2 ( 0 , h ) = B ˜ 1 Λ 1 Ψ 1 e ( 0 , h ) + B ˜ 1 Γ 1 Ψ 1 e ˙ ( 0 , h )
Hence, substituting (41) into (38), and choosing the gain matrix  K 1 ω = B ˜ 1 T ( B ˜ 1 B ˜ 1 T ) 1 , we obtain
η 2 ( t , h ) = 0 t [ A ˜ 1 ( t , h ) η 2 ( τ , h ) B ˜ 1 K 1 x e η ^ 2 ( τ , h ) + e ω ˜ ( τ , h ) ] d τ + B ˜ 1 [ Λ 1 Ψ 1 e ( t , h ) + Γ 1 Ψ 1 e ˙ ( t , h ) ]
And we take the  λ -norms on both sides of (42) to also obtain
η 2 ( · , h ) Θ 1 λ b 2 e η ^ 2 ( · , h ) Θ 1 λ + e ω ˜ ( · , h ) Θ 1 λ λ b 1 ( 1 e x p ( λ t 1 ) ) + λ ( b 3 e ( · , h ) Θ 1 λ + b 4 e ˙ ( · , h ) Θ 1 λ ) λ b 1 ( 1 e x p ( λ t 1 ) )
where  b 1 = s u p t , h , k A ˜ k ( t , h ) b 2 = s u p k B ˜ k K k x b 3 = s u p k B ˜ k Λ k Ψ k , and  b 4 = s u p k B ˜ k Γ k Ψ k . It is noted that  lim h e x 2 ( t , h ) e ¯ x 2  and  lim h e ω ( t , h ) λ e ¯ ω  from Theorem 1, and thus we know that
lim h e η ^ 2 ( · , h ) Θ 1 λ lim h e η ^ 2 ( t , h )   = lim h e x 2 ( t , h + 1 ) e x 2 ( t , h ) 2 e ¯ x 2 e ¯ η ^ 2
lim h e k ω ˜ ( · , h ) Θ 1 λ lim h e k ω ˜ ( t , h )   = lim h e k ω ( t , h + 1 ) e k x ( t , h ) 2 e ¯ k ω e ¯ k ω ˜
Meanwhile, in (43), based on  lim λ 1 λ b 1 ( 1 e x p ( λ t 1 ) ) = 0  and  lim λ λ λ b 1 ( 1 e x p ( λ t 1 ) ) = 1 , we can choose a sufficiently large  λ  such that  1 λ b 1 ( 1 e x p ( λ t 1 ) ) 1  and  λ λ b 1 ( 1 e x p ( λ t 1 ) ) 2 . Hence, (43) is further written as
η 2 ( · , h ) Θ 1 λ b 2 e ¯ η ^ 2 + e ¯ ω ˜ + 2 [ b 3 e ( · , h ) λ + b 4 e ˙ ( · , h ) Θ 1 λ ] b 2 e ¯ η ^ 2 + e ¯ ω ˜ + 2 [ ( b 3 + b 4 Δ 1 ( λ 1 ) ) e ( · , h ) Θ 1 λ + b 4 Δ 1 ( λ 1 ) e ( 0 , h ) ]
where the relation  0 t e ˙ ( τ , h ) d τ = e ( t , h ) e ( 0 , h )  is utilized. For  t Θ 1 , with consideration of the initial state learning algorithms (40) and (41), we take the norms on both sides of (39) to obtain
e ( t , h + 1 ) I N m C ˜ 1 B ˜ 1 Γ 1 Ψ 1 e ( t , h ) + b C b 3 0 t e ( τ , h ) d τ + b C 0 t 0 s [ b 1 η 2 ( τ , h ) + b 2 e η ^ 2 ( τ , h ) + e ω ˜ ( τ , h ) ] d τ
where  b C = s u p k C ˜ k . We consider (46) and take the  λ -norms on both sides of (47) to deduce
e ( · , h + 1 ) Θ 1 λ [ I N m C ˜ 1 B ˜ 1 Ψ 1 Γ 1 + b C b 3 Δ 1 ( λ 1 ) + 2 b C b 1 Δ 1 ( λ 2 ) ( b 3 + b 4 Δ 1 ( λ 1 ) ) ] e ( · , h ) Θ 1 λ + b C ( b 1 + 1 ) Δ 1 ( λ 2 ) [ b 2 e ¯ η ^ 2 + e ¯ ω ˜ ] + b C b 1 b 4 Δ 1 ( λ 1 ) e ( 0 , h )
where  Δ k ( λ 2 ) = Δ k ( λ 1 ) 2 . According to the initial learning algorithm (40), we have  e ( 0 , h + 1 ) = e ( 0 , h ) C ˜ 1 B ˜ 1 Γ 1 Ψ 1 e ( 0 , h )  and  e ( 0 , h + 1 ) I N m C ˜ 1 B ˜ 1 Γ 1 Ψ 1 e ( 0 , h ) , considering that condition (33) in  Θ 1  and Lemma 1 derives  lim h e ( 0 , h ) = 0 . Furthermore, we have  lim h η 1 ( 0 , h ) b C b 4 lim h e ( 0 , h ) = 0 .
Hence, in (48), if condition (33) can be satisfied, we can choose a sufficiently large  λ  to make
I N m C ˜ 1 B ˜ 1 Γ 1 Ψ 1 + b C b 3 Δ 1 ( λ 1 ) + 2 b C b 1 Δ ( λ 2 ) ( b 3 + b 4 Δ 1 ( λ 1 ) ) κ < 1
and taking the limitation on the second term of the right side of (48), we have
lim h b C ( b 1 + 1 ) Δ ( λ 2 ) [ b 2 e ¯ η ^ 2 + e ¯ ω ˜ ] + b C b 1 b 4 Δ ( λ 1 ) e ( 0 , h ) = b C ( b 1 + 1 ) Δ ( λ 2 ) [ b 2 e ¯ η ^ 2 + e ¯ ω ˜ ] v 1
Considering (49) and (50), and applying Lemma 1 to (50), we know that  e ( · , h ) Θ 1 λ  is convergent along the iteration axis and its upper bound is  lim h e ( · , h ) Θ 1 λ υ ¯ 1 1 κ e ¯ 1  and
lim h e ˙ ( · , h ) Θ 1 λ 1 Δ 1 ( λ 1 ) ( lim h e ( · , h ) Θ 1 λ + lim h e ( 0 , h ) ) e ¯ 1 Δ 1 ( λ 1 ) e ˙ ¯ 1
In addition, from (46) and (36), we have
lim h η 2 ( · , h ) Θ 1 λ b 2 e ¯ η ^ 2 + e ¯ ω ˜ + lim h 2 e ¯ 1 Δ 1 ( λ 1 ) e ( 0 , h ) + 2 ( b 3 + b 4 Δ 1 ( λ 1 ) ) lim h e ( · , h ) Θ 1 λ
b 2 e ¯ η ^ 2 + e ¯ ω ˜ + 2 ( b 3 + e ¯ 1 Δ 1 ( λ 1 ) ) e ˙ ¯ 1 e ¯ 1 η 2
lim h η 1 ( · , h ) Θ 1 λ lim h η 1 ( 0 , h ) + Δ 1 ( λ 1 ) lim h η 2 ( · , h ) Θ 1 λ Δ 1 ( λ 1 ) e η 2 ¯ 1 e ¯ 1 η 1
Step 2: For  t Θ 2 , the multi-UAV system (1) is working under the 2-th operating mode and the sub-topology  G ¯ 2  is active. Similar to (46) in Step 1, we can obtain
η 2 ( · , h ) Θ 2 λ b 2 e ¯ η ^ 2 + e ¯ ω ˜ + 2 ( b 3 + b 4 Δ 2 ( λ 1 ) ) e ( · , h ) Θ 1 λ + 2 b 4 Δ 2 ( λ 1 ) e ( t 1 , h ) + 2 ( η 2 ( t 1 , h ) + b 3 e ( t 1 , h ) + b 4 e ˙ ( t 1 , h ) )
And similar to (48) and considering (54), we take the  λ -norms on both sides of (39) during the time interval  Θ 2  to obtain
e ( · , h + 1 ) Θ 2 λ I N m C ˜ 2 B ˜ 2 Γ 2 Ψ 2 + b C b 3 Δ 2 ( λ 1 ) + 2 b C b 1 Δ 2 ( λ 2 ) ( b 3 + b 4 Δ 2 ( λ 1 ) ) e ( · , h ) Θ 2 λ + b C ( b 1 + 1 ) Δ ( λ 2 ) ( b 2 e ¯ η ^ 2 + e ¯ ω ˜ ) + b C b 4 ( t 2 t 1 + 2 b 1 Δ 2 ( λ 2 ) ) e ˙ ( t 1 , h ) + 2 b C b 1 Δ 2 ( λ 2 ) ( b 3 + b 4 Δ 2 ( λ 1 ) ) + b C ( b 4 + b 3 ( t 2 t 1 ) ) e ( t 1 , h ) + b C ( η 1 ( t 1 , h ) + ( t 2 t 1 + 2 b 1 Δ 2 ( λ 2 ) ) η 2 ( t 1 , h ) )
If the convergence condition (33) is satisfied, we can choose a sufficiently large  λ  to make
I N m C ˜ 2 B ˜ 2 Γ 2 Ψ 2 + b C b 3 Δ 1 ( λ 1 ) + 2 b C b 1 Δ ( λ 2 ) ( b 3 + b 4 Δ 2 ( λ 2 ) ) κ < 1
Notably, the boundness of the terms  e ( t 1 , h ) e ˙ ( t 1 , h ) η 1 ( t 1 , h )  and  η 2 ( t 1 , h )  in the iteration domain can be ensured according to the results (4)–(53) in Step 1, thus, it is easy to obtain
lim h [ b C ( b 1 + 1 ) Δ ( λ 2 ) ( b 2 e ¯ η ^ 2 + e ¯ ω ˜ ) + b C b 4 ( t 2 t 1 + 2 b 1 Δ 2 ( λ 2 ) ) e ˙ ( t 1 , h ) + 2 b C b 1 Δ 2 ( λ 2 ) ( b 3 + b 4 Δ 2 ( λ 1 ) ) + b C ( b 4 + b 3 ( t 2 t 1 ) ) e ( t 1 , h ) + b C ( η 1 ( t 1 , h ) + ( t 2 t 1 + 2 b 1 Δ 2 ( λ 2 ) ) η 2 ( t 1 , h ) ) ] υ ¯ 2
Consequently, by considering (55) and (57), and applying Lemma 1 to (55), we know that  e ( · , h ) Θ 2 λ  is convergent along the iteration axis and its upper bound is
lim h e ( · , h ) Θ 2 λ υ ¯ 2 1 κ e ¯ 2
And similar to the derivation process of (51)–(53), we further conclude that  lim h e ˙ ( · , h ) Θ 2 λ e ˙ ¯ 2 lim h η 2 ( · , h ) Θ 2 λ e ¯ 2 η 2 , and  lim h η 1 ( · , h ) Θ 2 λ e ¯ 2 η 1 .
By repeating the procedure of Step 2 in time intervals  Θ 3 , Θ 4 , , Θ M , we can conclude that the estimation errors  e ( t , h )  are convergent and bounded for all  t [ 0 , T ] , with the iteration number tending to infinity. The proof is completed. □
Corollary 1.
Consider the multi-UAV system (1) under Assumption 1. Let the ILC (12) be used, and if the following LMI can be solved,
I N m I N m C ˜ k B ˜ k Γ k Ψ k ( I N m C ˜ k B ˜ k Γ k Ψ k ) T I N m 0 N m × N m
then the formation tracking errors  e ( t , h )  are convergent along the iteration axis for all  t [ 0 , T ] .
Proof. 
By applying Lemma 1, (i.e., Schur complement lemma) in [40] to the LMI (59), there is
( I N m C ˜ k B ˜ k Γ k Ψ k ) ( I N m C ˜ k B ˜ k Γ k Ψ k ) T I N m
Then, we take norms on both sides of (60) to yield  I N m C ˜ k B ˜ k Γ k Ψ k 2 < 1 . This implies the convergence condition (33) can be ensured, and the rest of the proof is given in Theorem 2. This completes the proof. □

5. Simulation Example

This section presents two simulation experiments: one considering multi-operating modes and the other addressing switching communication topologies. Additionally, a comparison experiment with a PID controller is conducted. The ILO (6) and ILC algorithm (11) are applied to a multi-UAV system (1) to achieve the formation tracking objective (4). A flowchart of the proposed method is given in Figure 2. The simulations are performed with MATLAB R2022b on a digital computer with Intel Core i7 CPU at 2.10 GHz, 16 GB, and 3000 MHz of RAM under the Windows 11 operating system.

5.1. Simulation I: Formation Tracking with Switching Operating Modes

This subsection gives a simulation experiment that considers a multi-UAV system with four quadrotor UAVs, in which each quadrotor UAV has two operating modes: the on-load and no-load operating modes. The dynamic models of the multi-quadrotor UAV system with two operating modes are given as follows:
(1) The quadrotor UAV under the no-load operating mode [41]:
ξ ˙ = v ϖ ˙ = R α m v ˙ = m A 1 v + F J α ˙ = α × ( J α ) + M
where  m = 8  kg is the mass of each UAV,  ξ = [ p x , p y , p x ] T  is the position vector,  v = [ v x , v y , v z ] T  is the linear velocity vector,  ϖ = [ ϕ , θ , φ ] T  is the three Euler angles, i.e., roll angle  ϕ , pitch angle  θ , and yaw angle  φ α = [ α x , α y , α z ] T  is the angular velocity vector,  F = [ F x , F y , F z ] T  is the projection component of thrust in the  x y z —directions on the body frame, and  M = [ u ϕ , u θ , u φ ] T  is the input torque vector. For simplicity, we omit t and h in (61) (the same as below).
R = 1 t a n θ s i n ϕ t a n θ c o s ϕ 0 c o s ϕ s i n ϕ 0 s i n ϕ / c o s θ c o s ϕ / c o s θ , A 1 = 0 α z α y α z 0 α x α y α x 0 . J = J x J x y J x z J x y J y J y z J x z J y z J z .
Here,  J x = 4 J y = 4 J z = 8.8 J x y = 0.031 J x z = 0.011 , and  J y z = 0.023 . Here, without a loss of generality, we assume that the quadrotor UAVs are always flying with small attitude angles, i.e.,  R I 3 .
According to the dynamic model (1), the system vectors and matrices are
x 1 ( j ) ( t , h ) = [ ξ T , ϖ T ] T , x 2 ( j ) ( t , h ) = [ v T , α T ] T , y ( j ) ( t , h ) = [ ξ T , ϖ T ] T , u j ( t , h ) = [ F T , M T ] T , f 1 ( x ( t , h ) ) = [ ( A 1 v ) T , J 1 ( α × ( J α ) ) T ] T , C 1 = I 6 , ω 1 ( j ) ( t , h ) = 0.2 s i n ( 0.1 h ) s i n ( 2 π t ) 1 6 , B 1 = diag ( I 3 / m , J 1 ) .
(2) The quadrotor UAV under the on-load operating mode [41]:
For each quadrotor UAV, the load is suspended by a massless cable attached at the center of the quadrotor UAV. Thus, the position vector of the load in the body frame of the UAV is  r = [ 0 , 0 , ( l + z h ) ] , where  l = 0.4  m is the cable length, and  z h = 0.1  m is the distance between the suspension point and the center of mass of the quadrotor UAV. The mass of each load is  m 0 = 0.1  kg. Hence, the dynamic mode of the quadrotor UAV in the on-load operating mode is
ξ ˙ = v ϖ ˙ = R α ( m + m 0 ) v ˙ = m A 1 v + F + F p J p α ˙ = A 2 α + M + M p
where additional force  F p  and moment  M p  are related to the load, and the other parameters are
J p = J x + m 0 z h ( z h + l ) J x y J x z J x y J y + m 0 z h ( z h + l ) J y z J x z J y z J z , A 2 = ϵ 11 ϵ 12 ϵ 13 ϵ 21 ϵ 22 ϵ 23 ϵ 31 ϵ 32 ϵ 33 , ϵ 11 = J x z α y + J x y α z , ϵ 12 = m 0 z h α z ( z h + l ) J y z α y ( J z J y ) α z , ϵ 13 = J y z α z , ϵ 21 = α z m 0 z h ( z h + l ) + J x z α x ( J x J z ) α z , ϵ 22 = J y z α x J x y α z , ϵ 23 = J x z α z , ϵ 31 = J x y α x + ( J x J y ) α y , ϵ 32 = J x y α y + ( J x z ) α z , ϵ 33 = J y z α x , M p = ( 0 , 0 , z h ) T × g p , F p = m 0 ( α × ( α × r ) ) + g p , g p = [ 0 , 0 , m 0 g ] T .
According to the dynamic model (1), the system vectors and matrices are
f 2 ( x ( t , h ) ) = J 2 1 [ ( m A 1 v + F p ) T , ( A 2 α + M p ) T ] T , B 2 = J 2 1 , C 2 = I 6 , ω 2 ( j ) ( t , h ) = 0.2 s i n ( 0.01 h ) c o s ( 2 π t ) 1 6 .
where
J 2 = ( m + m 0 ) I 3 J 22 J 23 J p , J 22 = 0 m 0 z h 0 m 0 z h 0 0 0 0 0 , J 23 = 0 m 0 ( z h + l ) 0 m 0 ( z h + l ) 0 0 0 0 0 .
The operating mode switching law  s ( t )  is defined in Figure 3. If  s ( t ) = 1 , then the multi-quadrotor system is working under the on-load operating mode; if  s ( t ) = 2 , then the multi-quadrotor system is working under the on-load operating mode. The total runtime is  T = 1  s and the max iteration number is  h m a x = 100 . The initial states are  x 1 ( j ) ( 0 , 0 ) = [ 0 , 0 , 5 , 0 , 0 , 0 ] T  and  x 2 ( j ) ( 0 , 0 ) = 0 n . The initial iterative input is  u ( j ) ( t , 0 ) = 0 p , ( j = 1 , 2 , 3 , 4 ). The trajectory of the virtual leader quadrotor UAV is  y d ( t ) = [ 2.5 s i n ( 0.2 π t ) , 2.5 c o s ( 0.2 π t ) , 5 + 0.5 s i n ( 0.4 π t ) , 0.2 c o s ( 0.2 π t ) , 0 , 0 ] T . The desired formation deviations between the four quadrotor UAVs and the leader are  d 1 = [ 0.25 , 0.25 , 0 , 0 , 0 , 0 ] d 2 = [ 0.25 , 0.25 , 0 , 0 , 0 , 0 ] d 3 = [ 0.25 , 0.25 , 0 , 0 , 0 , 0 ]  and  d 4 = [ 0.25 , 0.25 , 0 , 0 , 0 , 0 ] , respectively. The initial iterative disturbance is  ω ^ s ( t ) ( j ) ( t , 0 ) = 0 n , ( j = 1 , 2 , 3 , 4 ). The communication topology of the four quadrotors is shown in Figure 4, and we calculate that  Ψ = 1 0 0 0 1 1 0 0 0 1 1 0 0 0 1 1 I 4 . For the ILO (6), we choose the observer gain matrices as  K 1 = K 2 = 0.5 I 24  and  L 1 = L 2 = 0.5 I 24 . For the ILC algorithm (11)–(14), we choose the control gain matrices as  K 1 x = K 2 x = 0.5 I 24 K 1 ω = K 2 ω = 0.5 I 24 , and  Λ 1 = Λ 2 = 0.5 I 24 Γ 1 = 2.1 I 24  and  Γ 2 = 2.4 I 24 . Hence, with the determined gain matrices and a fixed communication topology, the convergence conditions (15) and (33) are verified, i.e.,
If s ( t ) = 1 , then I 24 L 1 C ˜ 1 = 0.5000 < 1 I 24 C ˜ 1 B ˜ 1 Γ 1 Ψ = 0.9612 < 1 If s ( t ) = 2 , then I 24 L 2 C ˜ 2 = 0.5000 < 1 I 24 C ˜ 2 B ˜ 2 Γ 2 Ψ = 0.9570 < 1
The simulation results are shown in Figure 5, Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10.
The formation trajectories of the quadrotor UAVs at the  x y z  coordinate and in the  x o y  plane at different iterations are shown in Figure 5 and Figure 6, respectively. It can be easily seen from Figure 5 and Figure 6 that the four quadrotor UAVs can form a square formation well and track the leader quadrotor UAV over the all-time steps after 70 iterations, although the dynamic models of the quadrotor UAVs are varied at different time intervals. Figure 7 and Figure 8 indicate that all the follower quadrotor UAVs keep the same flight height and yaw angle as the leader quadrotor UAV. The average and maximal formation tracking error of the four quadrotor UAVs along the iteration axis are shown in Figure 9a,b, respectively, where  T s a m = 1000  is the total number of data samples. This demonstrates the convergence of the formation tracking errors in the iteration domain, and this implies that the formation objective (4) is successfully achieved. The maximal state observer errors  e x 1 ( t , h )  and  e x 2 ( t , h )  are shown in Figure 10a, and the maximal disturbance estimation error  e ω ( t , h )  is shown in Figure 10b. From these figures, we can see that the state and disturbance estimation errors converge to a small range around zero with the increasing iteration number. Hence, Theorem 1 and 2 under the switching operating modes are verified.

5.2. Simulation II: Formation Tracking with Switching Topologies

In this subsection, a simulation of a multi-quadrotor UAV system under the on-load operating mode (62) with switching topologies is conducted. According to the switching law in Figure 11 and the possible switching communication topologies are illustrated in Figure 12, we denote that if  s ( t ) = 1 , then the sub-topology  G ¯ 1  is active; if  s ( t ) = 2 , then the sub-topology  G ¯ 2  is active. The initial states are  x 1 ( 1 ) ( 0 , 0 ) = [ 0.25 , 0.25 , 1 , 0 , 0 , 0 ] T x 1 ( 2 ) ( 0 , 0 ) = [ 0.25 , 0.25 , 1 , 0 , 0 , 0 ] T x 1 ( 3 ) ( 0 , 0 ) = [ 0.25 , 0.25 , 1 , 0 , 0 , 0 ] T x 1 ( 4 ) ( 0 , 0 ) = [ 0.25 , 0.25 , 1 , 0 , 0 , 0 ] T , and  x 2 ( j ) ( 0 , 0 ) = 0 n , ( j = 1 , 2 , 3 , 4 ). The trajectory of the virtual leader quadrotor UAV is  y d ( t ) = [ 2 s i n ( 0.2 π t ) , 2 c o s ( 0.1 π t ) , 5 + 0.5 s i n ( 0.1 π t ) , 0.1 c o s ( 0.2 π t ) , 0 , 0 ] T , and the other parameters are set to be the same as in Simulation I. According to Figure 12, we can calculate that  Ψ 1 = 1 0 0 0 1 1 0 0 0 1 1 0 0 0 1 1 I 4 , and  Ψ 2 = 1 0 0 1 1 1 0 0 0 1 1 0 0 0 0 1 I 4 .
Similar to Simulation I, we determine the observer and control gain matrices as  K 1 = K 2 = 0.2 I 24 L 1 = L 2 = 0.9 I 24 K 1 x = K 2 x = 0.5 I 24 K 1 ω = K 2 ω = 0.5 I 24 , and  Λ 1 = Λ 2 = 0.5 I 24 Γ 1 = 2.7 I 24  and  Γ 2 = 2.5 I 24 . Hence, with a fixed on-load operating mode (62), the convergence condition (15) and (33) are verified, i.e.,
If s ( t ) = 1 , then I 24 L 1 C ˜ 2 = 0.9000 < 1 I 24 C ˜ 2 B ˜ 2 Γ 1 Ψ 1 = 0.9508 < 1 If s ( t ) = 2 , then I 24 L 2 C ˜ 2 = 0.9000 < 1 I 24 C ˜ 2 B ˜ 2 Γ 2 Ψ 2 = 0.9557 < 1
The simulation results are shown in Figure 13, Figure 14, Figure 15, Figure 16, Figure 17 and Figure 18. The formation trajectories are shown in Figure 13 and Figure 14. After 70 iterations, a square formation of four quadrotor UAVs can be maintained under switching communication topologies. Meanwhile, Figure 15 and Figure 16 show that all the quadrotor UAVs have the same flight height and yaw angle as the leader UAV. Figure 17 shows the formation tracking error along the iteration axis. We can see that the formation tracking errors are convergent in the iteration domain. The maximal state observer errors  e x 1 ( t , h )  and  e x 2 ( t , h )  and the maximal disturbance estimation error  e ω ( t , h )  are shown in Figure 18a,b. As a result, Theorems 1 and 2 under switching topologies are also verified.

5.3. Simulation III: Compared to the PID Controller

In this subsection, we conduct a comparison simulation experiment with a PID controller that is described as
u P I D ( t ) = K P Ψ ( e ( t ) + e ˙ ( t ) ) + K D Ψ ( e ˙ ( t ) + e ¨ ( t ) ) + K I 0 t Ψ ( e ( τ ) + e ˙ ( τ ) ) d τ
where  K P = 25 I 24 K D = 6 I 24 , and  K I = 20 I 24  are the control gains. The stability of the PID controller (63) for a second-order dynamic system is analyzed in Reference [42]. The multi-quadrotor UAV system is considered to be same as that in Simulation I. The comparison experiment results are shown in Figure 19 and Figure 20. For the simulation of the ILC controller, the results are obtained after 70 iterations. Figure 19 illustrates the formation tracking trajectories with the ILC strategy and PID controller, respectively. Compared to the tracking trajectory with the PID controller, perfect formation tracking during the entire working time interval can be achieved with two distinct operating modes. Meanwhile, Figure 20 gives the formation tracking errors with the ILC strategy and PID controller, respectively. Furthermore, we summarize the average formation tracking errors of the four quadrotor UAVs with the proposed formation strategy and PID controller in Table 1, which also demonstrates the superiority of the proposed formation strategy in this paper.
Consequently, the simulation results indicate that the four quadrotor UAVs can successfully track the leader quadrotor UAVs while maintaining a desired square formation. This further demonstrates the effectiveness of the proposed hybrid iterative learning strategy for the multi-UAV system by considering the multi-operating modes, switching communication topologies, and unknown external disturbances. Additionally, compared to the PID controller, the proposed hybrid iterative learning formation can ensure perfect formation tracking over the entire working time interval and has a higher precision tracking performance. Meanwhile, the calculation time of the proposed formation controller is about 0.009s in an iteration, and this is far less than the actual task cycle time,  T = 10  s, set in the simulations. Thus, the proposed method displays a good real-time performance.

6. Conclusions

In this paper, we propose a hybrid iterative learning formation strategy for the formation control problem of multi-UAV systems with multi-operating modes, switching communication topologies, and unknown external disturbances. Considering the fact that an accurate dynamic model of a multi-UAV system is hard to obtain, the proposed iterative learning formation controller uses the stored historical data to update the control input along the iterative axis without relying on the model’s knowledge of multi-UAV systems. Meanwhile, detailed theoretical results are given to guarantee the convergence of the estimation errors and formation tracking errors under multi-operating modes, switching communication topologies, and unknown external disturbances. Two sufficient convergence theorems can determine the appropriate observer and control gain matrices. At last, three formation flying simulations using four quadrotor UAVs under two different operating modes, switching topologies, and external disturbances demonstrate the effectiveness and superiority of the proposed formation strategy.
The limitation of the proposed method is that identical task trial lengths are required for different iterations. That is to say, the given goal task of multi-UAV systems with the proposed iterative learning strategy must be repeatable and have the same time cycle in different iterations. This repeatable characteristic ensures that the proposed iterative learning strategy has the learning ability to continuously improve the control performance over multiple iterations. However, this also causes some limitations of the proposed iterative learning strategy when extending to the non-strictly repeatable goal tasks. Thus, some researchers have attempted to solve the problem of varying trial lengths of iterative learning formation control [43,44], but the problem of the iterative learning formation control of multi-UAV systems with varying trial lengths is still open for discussion. Therefore, our future work will focus on the coordination control of multi-UAV systems with non-strictly repeatable application scenarios and varying trial lengths. In addition, the implementation of the proposed approach in practical UAVs may be limited by the computational resources of the practical control devices if the total number of UAVs is too large. Thus, the design of practical control units for UAVs should be a trade-off between the computing resources and the number of practical UAVs.

Author Contributions

S.Y.: writing—original draft, software, methodology. W.Y.: writing—reviewing and editing, validation, methodology, conceptualization. Z.L.: supervision, writing—reviewing and editing, project administration, funding acquisition. F.M.: supervision, investigation. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by GuangMing Laboratory, under the project no. 23501002.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Li, Q.; Hua, Y.; Dong, X.; Yu, J.; Ren, Z. Time-varying formation tracking control for unmanned aerial vehicles with the leader’s unknown input and obstacle avoidance: Theories and applications. Electronics 2022, 11, 2334. [Google Scholar] [CrossRef]
  2. Du, B.; Li, S. A new multi-satellite autonomous mission allocation and planning method. Acta Astronaut. 2019, 163, 287–298. [Google Scholar] [CrossRef]
  3. Xiao, P.; Li, N.; Xie, F.; Ni, H.; Zhang, M.; Wang, B. Clustering-based multi-region coverage-path planning of heterogeneous UAVs. Drones 2023, 7, 664. [Google Scholar] [CrossRef]
  4. Zhao, F.; Hua, Y.; Zheng, H.; Yu, J.; Dong, X.; Li, Q.; Ren, Z. Cooperative target pursuit by multiple fixed-wing UAVs based on deep reinforcement learning and artificial potential field. In Proceedings of the 2023 42nd Chinese Control Conference (CCC), Tianjin, China, 24–26 July 2023; pp. 5693–5698. [Google Scholar]
  5. Moein, D.; Kabganian, M.; Azimi, A. Robust adaptive control for formation-based cooperative transportation of a payload by multi quadrotors. Eur. J. Control 2023, 69, 100763. [Google Scholar]
  6. Zhang, J.; Yan, J.; Zhang, P. Multi-UAV formation control based on a novel back-stepping approach. IEEE Trans. Veh. Technol. 2020, 69, 2437–2448. [Google Scholar] [CrossRef]
  7. Muslimov, T.Z.; Munasypov, R.A. Adaptive decentralized flocking control of multi-UAV circular formations based on vector fields and backstepping. ISA Trans. 2020, 107, 143–159. [Google Scholar] [CrossRef]
  8. Balch, T.; Arkin, R.C. Behavior-based formation control for multirobot teams. IEEE Trans. Robot. Autom. 1998, 14, 926–939. [Google Scholar] [CrossRef]
  9. Pan, Z.; Zhang, C.; Xia, Y.; Xiong, H.; Shao, X. An improved artificial potential field method for path planning and formation control of the multi-UAV systems. IEEE Trans. Circuits Syst. II Express Briefs 2021, 69, 1129–1133. [Google Scholar] [CrossRef]
  10. Hua, Y.; Dong, X.; Hu, G.; Li, Q.; Ren, Z. Distributed time-varying output formation tracking for heterogeneous linear multiagent systems with a nonautonomous leader of unknown input. IEEE Trans. Autom. Control 2019, 64, 4292–4299. [Google Scholar] [CrossRef]
  11. Yu, Y.; Guo, J.; Ahn, C.K.; Xiang, Z. Neural adaptive distributed formation control of nonlinear multi-UAVs with unmodeled dynamics. IEEE Trans. Neural Networks Learn. Syst. 2023, 34, 9555–9561. [Google Scholar] [CrossRef]
  12. Cheng, C.; Zhang, H.; Sun, Y.; Tao, H.; Chen, Y. A cross-platform deep reinforcement learning model for autonomous navigation without global information in different scenes. Control Eng. Pract. 2024, 150, 105991. [Google Scholar] [CrossRef]
  13. Li, R.; Wang, L.; Chen, Y.; Ma, P. Adaptive locomotion control of hexapod robot based on deep reinforcement learning and proprioception. In Proceedings of the 2023 6th International Conference on Robotics, Control and Automation Engineering, RCAE, Suzhou, China, 27–29 October 2023; pp. 39–43. [Google Scholar]
  14. Han, D.; Jiang, H.; Wang, L.; Zhu, X.; Chen, Y.; Yu, Q. Collaborative task allocation and optimization solution for unmanned aerial vehicles in search and rescue. Drones 2024, 8, 138. [Google Scholar] [CrossRef]
  15. Gong, B.; Jiang, L.; Ning, X.; Li, S. Study on mission planning algorithm for multi-target passive tracking based on satellite formation. Aerosp. Sci. Technol. 2023, 142, 108660. [Google Scholar] [CrossRef]
  16. Wang, J.; Xin, M. Integrated optimal formation control of multiple unmanned aerial vehicles. IEEE Trans. Control Syst. Technol. 2013, 21, 1731–1744. [Google Scholar] [CrossRef]
  17. Yan, D.; Zhang, W.; Chen, H. Design of a multi-constraint formation controller based on improved MPC and consensus for quadrotors. Aerospace 2022, 9, 94. [Google Scholar] [CrossRef]
  18. Kuo, C.W.; Tsai, C.C.; Lee, C.T. Intelligent leader-following consensus formation control using recurrent neural networks for small-size unmanned helicopters. IEEE Trans. Syst. Man Cybern. Syst. 2019, 51, 1288–1301. [Google Scholar] [CrossRef]
  19. Sun, R.; Zhou, Z.; Zhu, X. Finite-time terminal sliding mode attitude control for tailless full-wing configuration UAVs based on extended state observers and auxiliary compensators. ISA Trans. 2024, 144, 282–307. [Google Scholar] [CrossRef]
  20. Ma, X.; Wang, G.; Liu, K. Design and optimization of a multimode amphibious robot with propeller-leg. IEEE Trans. Robot. 2022, 38, 3807–3820. [Google Scholar] [CrossRef]
  21. Russo, S.; Harada, K.; Ranzani, T.; Manfredi, L.; Stefanini, C.; Menciassi, A.; Dario, P. Design of a robotic module for autonomous exploration and multimode locomotion. IEEE/ASME Trans. Mechatronics 2013, 18, 1757–1766. [Google Scholar] [CrossRef]
  22. Xu, F.; Ma, K.; Gong, T.; Jiang, Z.; Hu, C. Design and testing of a magnetic soft crawling robot with multimodal locomotion driven by 3-D magnetic fields. IEEE Trans. Instrum. Meas. 2024, 73, 1–11. [Google Scholar] [CrossRef]
  23. Xiong, X.; Ruan, X. Non-Smooth bifurcation analysis of multi-structure multi-operating-mode power electronics systems for applications with renewable energy sources. IEEE Trans. Circuits Syst. II Express Briefs 2019, 66, 487–491. [Google Scholar] [CrossRef]
  24. Zou, Y.; Zhou, Z.; Dong, X.; Meng, Z. Distributed formation control for multiple vertical takeoff and landing UAVs with switching topologies. IEEE/ASME Trans. Mechatronics 2018, 23, 1750–1761. [Google Scholar] [CrossRef]
  25. Wang, Z.; Liu, T.; Jiang, Z.P. Cooperative formation control under switching topology: An experimental case study in multirotors. IEEE Trans. Cybern. 2020, 51, 6141–6153. [Google Scholar] [CrossRef]
  26. Zhou, Z.; Wang, H.; Wang, Y.; Xue, X.; Zhang, M. Distributed formation control for multiple quadrotor UAVs under Markovian switching topologies with partially unknown transition rates. J. Frankl. Inst. 2019, 356, 5706–5728. [Google Scholar] [CrossRef]
  27. Xue, M.; Tang, Y.; Ren, W.; Qian, F. Practical output synchronization for asynchronously switched multi-agent systems with adaption to fast-switching perturbations. Automatica 2020, 116, 108917. [Google Scholar] [CrossRef]
  28. Zhao, Z.; Wang, J.; Chen, Y.; Ju, S. Iterative learning-based formation control for multiple quadrotor unmanned aerial vehicles. Int. J. Adv. Robot. Syst. 2020, 17, 1729881420911520. [Google Scholar] [CrossRef]
  29. Fu, X.; Peng, J. Iterative learning control for UAVs formation based on point-to-point trajectory update tracking. Math. Comput. Simul. 2023, 209, 1–15. [Google Scholar] [CrossRef]
  30. Hock, A.; Schoellig, A.P. Distributed iterative learning control for multi-agent systems: Theoretic developments and application to formation flying. Auton. Robot. 2019, 43, 1989–2010. [Google Scholar] [CrossRef]
  31. Han, Y.; Liang, Y.; Zhang, L.; Cai, B.; Li, Y.; Li, B. Bumpless transfer switched control of aircraft for heavy payload dropping missions. Aerosp. Sci. Technol. 2024, 148, 109067. [Google Scholar] [CrossRef]
  32. Dong, X.; Zhou, Y.; Ren, Z.; Zhong, Y. Time-varying formation tracking for second-order multi-agent systems subjected to switching topologies with application to quadrotor formation flying. IEEE Trans. Ind. Electron. 2016, 64, 5014–5024. [Google Scholar] [CrossRef]
  33. Wu, J.; Luo, C.; Luo, Y.; Li, K. Distributed UAV swarm formation and collision avoidance strategies over fixed and switching topologies. IEEE Trans. Cybern. 2022, 52, 10969–10979. [Google Scholar] [CrossRef]
  34. Zhang, S.; Wang, L.; Wang, H.; Xue, B. Consensus control for heterogeneous multivehicle systems: An iterative learning approach. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 5356–5368. [Google Scholar] [CrossRef]
  35. Meng, D.; Zhang, J. Cooperative learning for switching networks with nonidentical nonlinear agents. IEEE Trans. Autom. Control 2021, 66, 6131–6138. [Google Scholar] [CrossRef]
  36. Sun, M.; Wang, D. Sampled-data iterative learning control for nonlinear systems with arbitrary relative degree. Automatica 2001, 37, 283–289. [Google Scholar] [CrossRef]
  37. Sun, M.; Huang, B. Iterative Learning Control; National Defence Industry Press: Beijing, China, 1999. [Google Scholar]
  38. Meng, D.; Jia, Y.; Du, J.; Zhang, J. High-precision formation control of nonlinear multi-agent systems with switching topologies: A learning approach. Int. J. Robust Nonlinear Control 2015, 25, 1993–2018. [Google Scholar] [CrossRef]
  39. Jeffreys, H.; Jeffreys, B.S. Methods of Mathematical Physics; Cambridge University Press: Cambridge, UK, 1999. [Google Scholar]
  40. Shao, Z.; Xiang, Z. Design of an iterative learning control law for a class of switched repetitive systems. Circuits Syst. Signal Process. 2017, 36, 845–866. [Google Scholar] [CrossRef]
  41. Shen, Z.; Tan, L.; Yu, S.; Song, Y. Fault-tolerant adaptive learning control for quadrotor UAVs with the time-varying CoG and full-state constraints. IEEE Trans. Neural Networks Learn. Syst. 2021, 32, 5610–5622. [Google Scholar] [CrossRef] [PubMed]
  42. Zhao, C.; Guo, L. PID controller design for second order nonlinear uncertain systems. Sci. China-Inf. Sci. 2017, 60, 022201:1–022201:13. [Google Scholar] [CrossRef]
  43. Shen, D.; Li, X. A survey on iterative learning control with randomly varying trial lengths: Model, synthesis, and convergence analysis. Annu. Rev. Control 2019, 48, 89–102. [Google Scholar] [CrossRef]
  44. Zeng, C.; Shen, D.; Wang, J. Adaptive learning tracking for uncertain systems with partial structure information and varying trial lengths. J. Frankl. Inst. 2018, 355, 7027–7055. [Google Scholar] [CrossRef]
Figure 1. The control block diagram of the hybrid iterative learning formation strategy for the multi-UAV system (1) with multi-operating modes. The red arrows denote the switching behaviors.
Figure 1. The control block diagram of the hybrid iterative learning formation strategy for the multi-UAV system (1) with multi-operating modes. The red arrows denote the switching behaviors.
Drones 08 00406 g001
Figure 2. The flowchart of the proposed method.
Figure 2. The flowchart of the proposed method.
Drones 08 00406 g002
Figure 3. The switching law  s ( t )  of operating modes.
Figure 3. The switching law  s ( t )  of operating modes.
Drones 08 00406 g003
Figure 4. The communication topology  G ¯  and symbol “1” represents the connection weight.
Figure 4. The communication topology  G ¯  and symbol “1” represents the connection weight.
Drones 08 00406 g004
Figure 5. The formation trajectory of the four quadrotor UAVs at  x y z  coordinates at 1−th, 20−th, 30−th, and 70−th iterations.
Figure 5. The formation trajectory of the four quadrotor UAVs at  x y z  coordinates at 1−th, 20−th, 30−th, and 70−th iterations.
Drones 08 00406 g005
Figure 6. The formation trajectory of the four quadrotor UAVs on the  x o y  plane at 1−th, 20−th, 30−th, and 70−th iterations.
Figure 6. The formation trajectory of the four quadrotor UAVs on the  x o y  plane at 1−th, 20−th, 30−th, and 70−th iterations.
Drones 08 00406 g006
Figure 7. The flight heights of the four quadrotor UAVs at 1−th, 20−th, 30−th, and 70−th iterations.
Figure 7. The flight heights of the four quadrotor UAVs at 1−th, 20−th, 30−th, and 70−th iterations.
Drones 08 00406 g007
Figure 8. The yaw angles of the four quadrotor UAVs at 1−th, 20−th, 30−th, and 70−th iterations.
Figure 8. The yaw angles of the four quadrotor UAVs at 1−th, 20−th, 30−th, and 70−th iterations.
Drones 08 00406 g008
Figure 9. The average and maximal formation tracking errors in the iteration domain.
Figure 9. The average and maximal formation tracking errors in the iteration domain.
Drones 08 00406 g009
Figure 10. The maximal state and disturbance estimation errors in the iteration domain.
Figure 10. The maximal state and disturbance estimation errors in the iteration domain.
Drones 08 00406 g010
Figure 11. The switching law  s ( t )  of communication topologies.
Figure 11. The switching law  s ( t )  of communication topologies.
Drones 08 00406 g011
Figure 12. The possible switching communication topologies and symbol "1" represents the connection weight.
Figure 12. The possible switching communication topologies and symbol "1" represents the connection weight.
Drones 08 00406 g012
Figure 13. The formation trajectory of the four quadrotor UAVs at the  x y z  coordinate at 1−th, 20−th, 30−th, and 70−th iterations.
Figure 13. The formation trajectory of the four quadrotor UAVs at the  x y z  coordinate at 1−th, 20−th, 30−th, and 70−th iterations.
Drones 08 00406 g013
Figure 14. The formation trajectory of the four quadrotor UAVs in the  x o y  plane at 1−th, 20−th, 30−th, and 70−th iterations.
Figure 14. The formation trajectory of the four quadrotor UAVs in the  x o y  plane at 1−th, 20−th, 30−th, and 70−th iterations.
Drones 08 00406 g014
Figure 15. The flight heights of the four quadrotor UAVs at 1−th, 20−th, 30−th, and 70−th iterations.
Figure 15. The flight heights of the four quadrotor UAVs at 1−th, 20−th, 30−th, and 70−th iterations.
Drones 08 00406 g015
Figure 16. The yaw angles of the four quadrotor UAVs at 1−th, 20−th, 30−th, and 70−th iterations.
Figure 16. The yaw angles of the four quadrotor UAVs at 1−th, 20−th, 30−th, and 70−th iterations.
Drones 08 00406 g016
Figure 17. The average and maximal formation tracking error in the iteration domain.
Figure 17. The average and maximal formation tracking error in the iteration domain.
Drones 08 00406 g017
Figure 18. The maximal state and disturbance estimation errors in the iteration domain.
Figure 18. The maximal state and disturbance estimation errors in the iteration domain.
Drones 08 00406 g018
Figure 19. The formation trajectory of four quadrotor UAVs at the  x y z  coordinate with ILC controller and PID controller, respectively.
Figure 19. The formation trajectory of four quadrotor UAVs at the  x y z  coordinate with ILC controller and PID controller, respectively.
Drones 08 00406 g019
Figure 20. The formation tracking and estimation errors with ILC controller and PID controller, respectively.
Figure 20. The formation tracking and estimation errors with ILC controller and PID controller, respectively.
Drones 08 00406 g020
Table 1. Comparative results of the average formation tracking errors of four quadrotor UAVs with the proposed hybrid formation strategy and PID controller.
Table 1. Comparative results of the average formation tracking errors of four quadrotor UAVs with the proposed hybrid formation strategy and PID controller.
Controller p x  (m) p y  (m) p z  (m) φ  (rad)
Controller (12)0.0047 m0.0110 m0.0018 m0.0024 rad
PID controller0.0478 m0.2223 m0.0790 m0.2190 rad
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, S.; Yu, W.; Liu, Z.; Ma, F. A Robust Hybrid Iterative Learning Formation Strategy for Multi-Unmanned Aerial Vehicle Systems with Multi-Operating Modes. Drones 2024, 8, 406. https://doi.org/10.3390/drones8080406

AMA Style

Yang S, Yu W, Liu Z, Ma F. A Robust Hybrid Iterative Learning Formation Strategy for Multi-Unmanned Aerial Vehicle Systems with Multi-Operating Modes. Drones. 2024; 8(8):406. https://doi.org/10.3390/drones8080406

Chicago/Turabian Style

Yang, Song, Wenshuai Yu, Zhou Liu, and Fei Ma. 2024. "A Robust Hybrid Iterative Learning Formation Strategy for Multi-Unmanned Aerial Vehicle Systems with Multi-Operating Modes" Drones 8, no. 8: 406. https://doi.org/10.3390/drones8080406

APA Style

Yang, S., Yu, W., Liu, Z., & Ma, F. (2024). A Robust Hybrid Iterative Learning Formation Strategy for Multi-Unmanned Aerial Vehicle Systems with Multi-Operating Modes. Drones, 8(8), 406. https://doi.org/10.3390/drones8080406

Article Metrics

Back to TopTop