Distributed Dual Closed-Loop Model Predictive Formation Control for Collision-Free Multi-AUV System Subject to Compound Disturbances

Zhang, Mingyao; Yan, Zheping; Zhou, Jiajia; Yue, Lidong

doi:10.3390/jmse11101897

Open AccessArticle

Distributed Dual Closed-Loop Model Predictive Formation Control for Collision-Free Multi-AUV System Subject to Compound Disturbances

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2023, 11(10), 1897; https://doi.org/10.3390/jmse11101897

Submission received: 12 September 2023 / Revised: 25 September 2023 / Accepted: 28 September 2023 / Published: 29 September 2023

(This article belongs to the Special Issue Motion Control and Path Planning of Marine Vehicles)

Download

Browse Figures

Versions Notes

Abstract

:

This paper focuses on the collision-free formation tracking of autonomous underwater vehicles (AUVs) with compound disturbances in complex ocean environments. We propose a novel finite-time extended state observer (FTESO)-based distributed dual closed-loop model predictive control scheme. Initially, a fast FTESO is designed to accurately estimate both model uncertainties and external disturbances for each subsystem. Subsequently, the outer-loop and inner-loop formation controllers are developed by integrating disturbance compensation with distributed model predictive control (DMPC) theory. With full consideration of the input and state constraints, we resolve the local information-based DMPC optimization problem to obtain the control inputs for each AUV, thereby preventing actuator saturation and collisions among AUVs. Moreover, to mitigate the increased computation caused by the control structure, the Laguerre orthogonal function is applied to alleviate the computational burden in time intervals. We also demonstrate the stability of the closed-loop system by applying the terminal state constraint. Finally, based on a connected directed topology, comparative simulations are performed under various control schemes to verify the robustness and superior performance of the proposed scheme.

Keywords:

multi-AUV system; formation tracking; finite-time extended state observer; distributed model predictive control; Laguerre function

1. Introduction

Autonomous underwater vehicles (AUVs) have assumed indispensable roles in various underwater operations, such as ocean exploration and hydrologic surveys [1]. They can autonomously perform appropriate maneuvers to achieve predefined objectives. Compared with the operational capability of a single AUV, collaborative AUVs can respond more reliably and flexibly to complex missions and extended operational ranges, thereby improving the efficiency and robustness of undersea operations. Given this backdrop, numerous application cases about AUV coordinated formation have been triggered in both civilian and industrial fields for decades [2,3]. Irrespective of the specific collaborative missions undertaken by AUVs, the core challenge lies in ensuring motion stability of AUV formations within complex underwater environments and the constraints of their own models. To tackle this problem, several mainstream methodologies have been proposed by engineers and academics. Studies by Chen et al. [4] and Zhen et al. [5] proposed AUV formation control schemes combined with the virtual structure method. However, this approach suffers from limited flexibility and applicability. Wang et al. [6] utilized the leader–follower method to address the AUV formation tracking problem, but this approach relies on the state of the leader, reducing the robustness and fault tolerance of the formation. Conversely, leaderless formations have been proposed promisingly and have received more considerable attention [7]. Munir et al. [8] proposed a new arbitrary-order distributed control strategy based on the novel sliding surfaces of error dynamics, which addresses the cooperative tracking control of uncertain higher-order nonlinear systems. The strategies to mitigate the chattering issue caused by sliding surfaces are discussed in [9]. Despite the abundance of existing research, multiple-AUV formation tracking control remains a significantly challenging project.

One of the main challenges is the various disturbances resulting from the underwater environment and the motion model of the AUVs themselves [10]. On the one hand, unknown disturbances such as waves, tides, and currents, are inevitable in practical marine environments. On the other hand, AUVs exhibit highly nonlinear and coupled dynamics, leading to model uncertainties. These uncertainties are often induced by modeling errors and deviations in hydrodynamic coefficient measurement. According to the research by Cui et al. [11], these external disturbances and model uncertainties that degrade the system performance negatively are referred to as compound disturbances. In response to these challenges, researchers have developed diverse schemes, such as disturbance observers [12], fuzzy logic theory [13], and neural networks [14]. Among these, the extended state observer (ESO) initially proposed by Han [15] is an attractive option to estimate compound disturbances, as it does not rely on an accurate model. Lei et al. [16] designed a high-gain ESO to solve AUV horizontal trajectory tracking problems under the time-varying disturbances. Although many ESOs have been established for different platforms, most only guarantee asymptotic convergence of estimation errors, implying a potentially infinite convergence time. Some research works also lack a rigorous analysis of convergence. Considering the impact of severe underwater environments on estimation accuracy, the concept of finite-time ESO proves more beneficial for improving control performance [17]. Wang et al. [18] implemented a FTESO-based nonsingular terminal sliding mode controller to address unmanned surface vehicle (USV) trajectory tracking in disturbed environments. This approach ensured that the disturbance estimation errors converge within a finite time. However, there remains room for improvement and optimization of the design structure to further enhance observation performance.

AUV formation navigation also presents significant technical challenges due to various complex constraints. For instance, the AUV attitude has a certain desired range and navigation velocities are inherently limited. These intrinsic input and state constraints pose substantial challenges to control performance [19]. In practical applications, actuators often have input saturation constraints due to physical structure limitations. This results in a limitation of the actual active control force of the AUV. If a control signal exceeds this boundary, it may lead to system instability. However, most previous work assumes that the actuators can tolerate any level of control signals. To avoid actuator saturation, a nonlinear auxiliary system for filtering saturation errors was proposed [20]. Additionally, collisions between AUVs are undesirable during the formation configuration phase. Thus, the ability to avoid collisions is vital for AUV formation control. A wealth of solutions have been developed to this end, with Li and Wang [21] proposing a collision-free position consensus algorithm for AUVs based on potential function. Moreover, Xu et al. [22] presented an event-triggered algorithm based on deep reinforcement learning to avoid AUV collisions. However, the above studies disregard the physical constraints of AUVs. From the perspective of safe navigation, it is essential to integrate factors such as input, state restrictions, and collision avoidance into the design scheme.

Model predictive control (MPC) has garnered considerable attention due to its ability to simultaneously handle multiple composite constraints and offer superior dynamic performance. This is widely applied to MIMO systems affected by model distortions and complex constraints. Several MPC-based applications have been integrated into AUV control systems. Zhang et al. [23] proposed an MPC-based AUV trajectory tracking strategy under random disturbances. In [24], a robust model-predictive control scheme based on the active disturbance rejection control approach was developed for the AUV tracking task. The challenge of extending these systems to multi-AUV systems involves coordinating the control behavior of each subsystem and ensuring the closed-loop stability of the local MPC optimization problem under system constraints. This coordination aims to maximize the overall control performance. Hence, DMPC came into being. Zheng et al. [25] proposed a DMPC method based on local state information for MAS formation tracking. To the best of our knowledge, there are few studies that apply DMPC to multi-AUV formations. Wei et al. [26] developed a Lyapunov-based distributed predictive controller for AUV formation tracking, subject to current disturbances. The auxiliary controller was utilized to establish stability constraints to ensure the closed-loop stability of the system. However, this method only considers horizontal formations without uncertainties and state constraints. Furthermore, many works that design predictive controllers result in additional computational loads, which could impair the real-time execution capability of the controller. Shen and Shi [27] managed to reduce the MPC computational burden by decomposing the original AUV trajectory tracking optimization problem into smaller subproblems and then solving them in a distributed manner. Despite these efforts, there has been no research to address the heavy computation of DMPC applied to AUV formations. In order to improve the dynamic response and control accuracy of AUV formation tracking in three-dimensional (3-D) space, we adopt the Laguerre orthogonal function to reduce the computational load. In response to these discussions, it is imperative to develop a safe and efficient formation control scheme to solve the problems of disturbances, parameter uncertainties, and complicated constraints.

Motivated by the above observations, this paper investigates the collision-free formation tracking of multi-AUVs with compound disturbances under complicated constraints. A novel FTESO-based distributed dual closed-loop model predictive control scheme is proposed. This method satisfies the formation constraints and collision avoidance requirements while compensating for model uncertainties and external disturbances. We incorporate the Laguerre function to alleviate the computational burden of the DMPC optimization problem, also giving corresponding stability analysis. Based on the connected directed topology, comparative simulations under different schemes demonstrate the effectiveness and robustness of our proposed scheme. The main contributions of this paper are as follows:

Compared with the FTESO-based controllers presented in works [16,28], the proposed third-order fast FTESO can estimate the compound disturbances and their first derivatives, which effectively suppress the amplification and fluctuation of the generalized uncertainties. It has better estimation accuracy and convergence speed. Hence, the active disturbance rejection capability of AUV formation is enhanced;
Unlike the existing DMPC schemes depicted in works [29,30], a dual closed-loop structure is utilized to enhance the response speed of the DMPC system and the controllability of the AUV speed. The outer-loop controller sets the desired velocity and the inner-loop controller generates the driving force. By solving the constrained quadratic programming (QP) problems, the risks of actuator saturation and collision are reduced. The safety and robustness of formation tracking are improved;
In order to solve the issue of heavy computational burden in traditional predictive control, the Laguerre orthogonal function is incorporated to reconstruct the input matrices, which automatically trades off control performance and computational complexity, thus avoiding possible formation deviation due to slow computational speed. The stability of the closed-loop system is proved by exerting terminal state constraints.

The rest of this paper is organized as follows: Section 2 introduces some notations, lemmas, and graph theory, and describes the AUV model and control objective. Section 3 presents the methodology, including the design of the FTESO and dual closed-loop DMPC scheme, the application of the Laguerre function, and the corresponding stability analysis. Section 4 and Section 5, respectively, provide simulation results and conclusions.

2. Preliminaries

2.1. Notations and Lemmas

Notation.

ℝ^{n}

represents the n-dimensional Euclidean space, and

ℝ^{m \times n}

denotes the set of

(m \times n)

real matrix.

I_{n}

,

0_{n}

, and

0_{p \times q}

signify

(n \times n)

identity matrix,

(n \times n)

, and

(p \times q)

null matrices, respectively.

‖\cdot‖

refers to the Euclidean vector norm and the induced matrix norm, while the infinity norm is denoted by

{‖\cdot‖}_{\infty}

.

λ_{\min} (\cdot)

represents the minimum eigenvalue of the specified matrix

(\cdot)

, with its maximum eigenvalue denoted as

λ_{\max} (\cdot)

. For simplicity, some notations are defined as

s i g^{p} (x) = sign (x) {|x|}^{p}

,

{|x|}^{p} = {[{|x_{1}|}^{p}, {|x_{2}|}^{p}, \dots, {|x_{n}|}^{p}]}^{T}

,

x = {[x_{1}, x_{2}, \dots, x_{n}]}^{T}

,

p \in ℝ

.

sign (\cdot)

symbolizes the signum function with

sign (0) = 0

. Notably,

s i g^{0} (x) = sign (x)

,

s i g^{0} (x) {|x|}^{p} = s i g^{p} (x)

.

Lemma 1

([31]). Consider the system

\dot{x} (t) = f (x (t))

,

x (0) = x_{0}

,

f (0) = 0

,

x \in ℝ^{n}

, where

f : U \to ℝ^{n}

is a continuous function. Suppose that this system has a unique solution in forward time for all initial conditions. If there exists a Lyapunov function

V (x)

, with

V (x_{0})

denoting its initial value, the following can be assumed: (1) The trajectory of this system is finite-time uniformly ultimately bounded stable within the region of

Q_{1} = \{x| V {(x)}^{α_{1} - α_{2}} < \frac{β_{2}}{γ_{1}}\}

, if

\dot{V} (x) \leq - β_{1} V {(x)}^{α_{1}} + β_{2} V {(x)}^{α_{2}}

for

α_{1} > α_{2}

,

β_{1} > 0

,

β_{2} > 0

,

γ_{1} \in (0, β_{1})

. The settling time for the states reaching the stable residual set is subject to the constraint as

T_{1} \leq \frac{V {(x_{0})}^{1 - α_{1}}}{(β_{1} - γ_{1}) (1 - α_{1})}

. (2) The trajectory of this system is fast finite-time uniformly ultimately bounded stable within

Q_{2} = \{x| γ_{1} V {(x)}^{α_{1} - α_{2}} + γ_{2} V {(x)}^{1 - α_{2}} < β_{3}\}

, if

\dot{V} (x) \leq - β_{1} V {(x)}^{α_{1}} - β_{2} V (x) + β_{3} V {(x)}^{α_{2}}

for

β_{3} > 0

,

γ_{2} \in (0, β_{2})

. The convergence time T₂ is bounded as

T_{2} \leq \frac{\ln [(β_{2} - γ_{2}) V {(x_{0})}^{1 - α_{1}} / (β_{1} - γ_{1}) + 1]}{(β_{2} - γ_{2}) (1 - α_{1})}

.

2.2. Graph Theory

We introduce a directed topology graph

G = \{V, ε\}

to describe the information interactions among the AUVs. Let the node set

V = \{V_{1}, V_{2}, \dots, V_{N}\}

to represent the N members in the formation, and an edge set

ε \subseteq V \times V

to represent the communication from the node

V_{i}

to the node

V_{j}

.

A = [a_{i j}] \subset ℝ^{N \times N}

is defined as an adjacency matrix, where

a_{i j}

represents the connection weight and

a_{i j} = 1

if

(i, j) \in ε

, while

a_{i j} = 0

if

(i, j) \notin ε

. It is assumed that the ith vehicle could receive information from the virtual leader and its neighbors

N_{i} = \{j \in V : (j, i) \in ε\}

. The graph is termed an undirected graph if bidirectional communication links exist among all members of the formation. Otherwise, it is referred to as a directed graph. A directed graph is considered strongly connected if a directed path can connect any point in the formation to any other.

2.3. AUV Model

As shown in Figure 1, it is convenient to describe the six-degree-of-freedom (DOF) AUVs with two reference frames: an earth-fixed frame

\{E\}

and a body-fixed frame

\{B\}

. This paper employs a fully actuated torpedo-type AUV, referenced from [32], based on the control objectives. In addition, the AUV uses an ultra-short baseline acoustic positioning system for underwater localization. Since this AUV can be regarded as a highly metacentric stable vehicle with self-stable roll motion, the effect of roll is ignored (roll angle

ϕ_{i} = 0

, roll angular velocity

p_{i} = 0

). The kinematics and dynamics of the ith AUV are described as follows [33]:

{\dot{η}}_{i} = J (η_{i}) v_{i}

(1)

M_{i} {\dot{v}}_{i} + C_{i} (v_{i}) v_{i} + D_{i} (v_{i}) v_{i} + g_{i} (η_{i}) = τ_{i} + τ_{i c}

(2)

where

i = 1, 2, \dots, N

,

η_{i} = {[x_{i}, y_{i}, z_{i}, θ_{i}, ψ_{i}]}^{T} \in ℝ^{5}

, and

v_{i} = {[u_{i}, v_{i}, w_{i}, q_{i}, r_{i}]}^{T} \in ℝ^{5}

denote the states of position, orientation, and velocity of the AUV, respectively.

J (η_{i})

is a rotation transformation matrix from the body-fixed frame to the earth-fixed frame, expressed as:

J (η_{i}) = [\begin{matrix} \cos ψ_{i} \cos θ_{i} & - \sin ψ_{i} & \cos ψ_{i} \sin θ_{i} & 0 & 0 \\ \sin ψ_{i} \cos θ_{i} & \cos ψ_{i} & \sin ψ_{i} \sin θ_{i} & 0 & 0 \\ - \sin θ_{i} & 0 & \cos θ_{i} & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 / \cos θ_{i} \end{matrix}]

(3)

M_{i}

represents the inertial matrix, which includes added mass.

C_{i} (v_{i})

and

D_{i} (v_{i})

denote the Coriolis and centripetal and hydrodynamic damping matrix, respectively, while

g_{i} (η_{i})

represents the restoring force and moment generated by gravity and buoyancy.

τ_{i} = {[τ_{i u}, τ_{i v}, τ_{i w}, τ_{i q}, τ_{i r}]}^{T}

represents the control input, and

τ_{i c}

denotes the external disturbance. Detailed expressions of these matrices are available in [34].

In practical engineering, we may not be able to obtain accurate hydrodynamic coefficients in the model, so the matrices in (2) are typically divided into two parts: the nominal value part and the uncertainty part caused by linear shifts, i.e.,

M_{i} = M_{i}^{*} + Δ M_{i}

,

C_{i} (v_{i}) = C_{i}^{*} (v_{i}) + Δ C_{i} (v_{i})

,

D_{i} (v_{i}) = D_{i}^{*} (v_{i}) + Δ D_{i} (v_{i})

, and

g_{i} (η_{i}) = g_{i}^{*} (η_{i}) + Δ g_{i} (η_{i})

, where

{(\cdot)}_{i}^{*}

denotes the nominal value that can be obtained from the computational fluid dynamics (CFD) or experimental analysis.

Δ {(\cdot)}_{i}

symbolizes the difference between the real value and the nominal value.

Accordingly, the ith AUV dynamic model (2) can be reformulated as:

M_{i}^{*} {\dot{v}}_{i} = - C_{i}^{*} (v_{i}) v_{i} - D_{i}^{*} (v_{i}) v_{i} - g_{i}^{*} (η_{i}) + τ_{i} + τ_{i d}

(4)

where

τ_{i d} = τ_{i c} - Δ M_{i} {\dot{v}}_{i} - Δ C_{i} (v_{i}) v_{i} - Δ D_{i} (v_{i}) v_{i} - Δ g_{i} (η_{i})

is regarded as the compound disturbance, which includes uncertainties and unknown external disturbance. Typically, external disturbances are periodically varying and energy limited. The model uncertainties are related to the actual states and physical properties of the AUV. Based on the constraints of DMPC on the system state, in practice, we give the following reasonable assumption:

Assumption 1

([11]). The ocean current disturbance term

τ_{i c}

and the first time derivative

{\dot{τ}}_{i c}

are bounded, and the model uncertainties

Δ M_{i}

,

Δ C_{i}

,

Δ D_{i}

, and

Δ g_{i}

are unknown and bounded. Hence, the compound disturbance

τ_{i d}

of the ith AUV is bounded and satisfies

‖τ_{i d}‖ \leq {\bar{τ}}_{i d}

, where

{\bar{τ}}_{i d} \in ℝ^{+}

represents the unknown upper bound.

It should be noted that the above assumption is untenable if there are no system state constraints [35,36].

2.4. Control Objective

In this paper, the control objective is to develop a control scheme that enables AUV formation to track a reference trajectory while maintaining a predefined configuration. Initially, a FTESO is designed to compensate for external disturbances and model uncertainties of the AUV formation, so that the estimation errors converge to the origin. Subsequently, a dual closed-loop DMPC controller is designed. In this structure, the outer-loop controller enables the ith AUV to track the reference trajectory

η_{r}

by generating the desired velocity, resulting in the convergence of position tracking errors. The inner-loop controller is used to achieve the convergence of velocity tracking errors. The desired formation is implemented by setting the corresponding formation configuration vector

r_{i f}

and the relative distance vector

r_{i j}

. The task must adhere to various constraints and ensure collision avoidance. Because the navigation trajectory has a limited range and the speed is continuous without abrupt changes, we adopt the following reasonable assumptions to avoid singularities in the reference trajectory:

Assumption 2.

The reference trajectory

η_{r} = {[x_{r}, y_{r}, z_{r}, θ_{r}, ψ_{r}]}^{T}

and its derivatives are smooth and bounded, i.e.,

{‖η_{r}‖}_{\infty} \leq {\bar{η}}_{r}

,

{‖{\dot{η}}_{r}‖}_{\infty} \leq {\bar{η}}_{r 1}

, and

{‖{\ddot{η}}_{r}‖}_{\infty} \leq {\bar{η}}_{r 2}

with positive numbers

{\bar{η}}_{r}

,

{\bar{η}}_{r 1}

, and

{\bar{η}}_{r 2}

.

3. Methodology

This section develops the FTESO-based distributed dual closed-loop model predictive control scheme for the AUV formation to perform trajectory tracking. A novel FTESO is designed to compensate the compound disturbances. Based on the model information reconstructed by FTESO, the DMPC optimization problems are formulated for the outer and inner loops under constraints such as actuator saturation and collision avoidance, respectively. The Laguerre function is applied to alleviate the computational load. The block diagram of proposed control scheme is depicted in Figure 2.

3.1. FTESO Design and Convergence Analysis

The AUV model is fundamental to controller design, but obtaining an accurate model in practice is challenging. Considering the superiority and effectiveness of the ESO technique in estimating and compensating for synthetic uncertainty, a novel fast FTESO is designed to simultaneously reconstruct the external disturbance and model uncertainties of multiple AUVs.

First, define the auxiliary velocity variable as

ω_{i} (v_{i}) = M_{i}^{*} v_{i} + \int v_{i}

, the derivative of

ω_{i} (v_{i})

with respect to time can be obtained from (4)

ω_{i} (v_{i}) = v_{i} - C_{i}^{*} (v_{i}) v_{i} - D_{i}^{*} (v_{i}) v_{i} - g_{i}^{*} (η_{i}) + τ_{i} + τ_{i d} .

(5)

For simplicity, denote

G_{i} (η_{i}, v_{i}) = v_{i} - C_{i}^{*} (v_{i}) v_{i} - D_{i}^{*} (v_{i}) v_{i} - g_{i}^{*} (η_{i})

. Then, a new variable is defined as

z_{i 1} = ω_{i} (v_{i})

, and the order of the system is extended by additional state variables,

z_{i 2}

and

z_{i 3}

, defined as

z_{i 2} = τ_{i d}

and

z_{i 3} = {\dot{z}}_{i 2}

with

{\dot{z}}_{i 3} = σ_{i}

. It should be noted that the compound disturbances

z_{i 2}

are assumed to be bounded and continuously differentiable, and the components of its second derivative satisfies

|σ_{i p}| \leq {\bar{σ}}_{i}

,

p = 1, 2, \dots, 5

. where

{\bar{σ}}_{i}

is an unknown positive constant. Afterward, the dynamic model of the ith AUV can be extended as follows:

\{\begin{array}{l} {\dot{z}}_{i 1} = G_{i} (η_{i}, v_{i}) + τ_{i} + z_{i 2} \\ {\dot{z}}_{i 2} = z_{i 3} \\ {\dot{z}}_{i 3} = σ_{i} . \end{array}

(6)

Denote

{\hat{z}}_{i 1}

,

{\hat{z}}_{i 2}

, and

{\hat{z}}_{i 3}

as the observation values of states

z_{i 1}

,

z_{i 2}

, and

z_{i 3}

in the above extended system, and

e_{i 1} = {\hat{z}}_{i 1} - z_{i 1}

,

e_{i 2} = {\hat{z}}_{i 2} - z_{i 2}

, and

e_{i 3} = {\hat{z}}_{i 3} - z_{i 3}

as the observation errors of the velocity, the compound disturbances, and its first derivatives, respectively. Then, a third-order fast FTESO is proposed as follows:

\{\begin{array}{l} {\dot{\hat{z}}}_{i 1} = {\hat{z}}_{i 2} - β_{i 1} (s i g^{α_{i 1}} (e_{i 1}) + e_{i 1}) + G_{i} (η_{i}, v_{i}) + τ_{i} \\ {\dot{\hat{z}}}_{i 2} = {\hat{z}}_{i 3} - β_{i 2} (s i g^{α_{i 2}} (e_{i 1}) + 2 s i g^{α_{i 1}} (e_{i 1}) + e_{i 1}) \\ {\dot{\hat{z}}}_{i 3} = - β_{i 3} (s i g^{α_{i 3}} (e_{i 1}) + 2 s i g^{α_{i 2}} (e_{i 1}) + s i g^{α_{i 1}} (e_{i 1})) \end{array}

(7)

where the observer gains satisfy

β_{i k} > 0, k = 1, 2, 3

,

α_{i 1} \in (2 / 3, 1)

and

α_{i 2} = 2 α_{i 1} - 1

, and

α_{i 3} = 3 α_{i 1} - 2

. Although the actual value of

z_{i k}

is probably unavailable, its observed value

{\hat{z}}_{i k}

can be obtained by the above FTESO. The analysis and proof that

{\hat{z}}_{i k}

tracks the actual value are described below.

According to the extended system (6) and the proposed FTESO (7), we can obtain the observation error dynamics as follows:

\{\begin{array}{l} {\dot{e}}_{i 1} = - β_{i 1} (s i g^{α_{i 1}} (e_{i 1}) + e_{i 1}) + e_{i 2} \\ {\dot{e}}_{i 2} = - β_{i 2} (s i g^{α_{i 2}} (e_{i 1}) + 2 s i g^{α_{i 1}} (e_{i 1}) + e_{i 1}) + e_{i 3} \\ {\dot{e}}_{i 3} = - β_{i 3} (s i g^{α_{i 3}} (e_{i 1}) + 2 s i g^{α_{i 2}} (e_{i 1}) + s i g^{α_{i 1}} (e_{i 1})) - σ_{i} . \end{array}

(8)

The stability and convergence of the proposed FTESO are stated in the following theorem:

Theorem 1.

Consider the AUV formation control system with the dynamic model (4) under Assumption 1. If the FTESO is proposed in the form of (9), with appropriate observer gains satisfying the prescribed constraints, then the observation errors

e_{i} = {[e_{i 1}^{T}, e_{i 2}^{T}, e_{i 3}^{T}]}^{T}

will converge to the small region

Ω_{i}

in finite time

T_{i f}

. This implies that the error dynamics system (8) is finite-time uniformly ultimately bounded stable.

Proof of Theorem 1.

Consider a Lyapunov candidate function as

V_{i 1} (e) = ε_{i}^{T} P_{i} ε_{i}

, where

P_{i}

is a positive definite symmetric matrix and

ε_{i}^{T} = [{(s i g^{α_{i 1}} (e_{i 1}) + e_{i 1})}^{T}, e_{i 2}^{T}, e_{i 3}^{T}]

is introduced as an auxiliary error variable. It should be noted that

e_{i 1}

,

e_{i 2}

, and

e_{i 3}

will converge to origin in finite time, if the new state

ε_{i}

is finite-time stable. The time derivative of

ε_{i}

, invoking (8), yields:

\begin{array}{l} {\dot{ε}}_{i} = [\begin{matrix} α_{i 1} {|e_{i 1}|}^{α_{i 1} - 1} {\dot{e}}_{i 1} + {\dot{e}}_{i 1} \\ {\dot{e}}_{i 2} \\ {\dot{e}}_{i 3} \end{matrix}] = [\begin{matrix} α_{i 1} {|e_{i 1}|}^{α_{i 1} - 1} (e_{i 2} - β_{i 1} (s i g^{α_{i 1}} (e_{i 1}) + e_{i 1})) \\ \frac{e_{i 3}}{2} - β_{i 2} (s i g^{α_{i 2}} (e_{i 1}) + s i g^{α_{i 1}} (e_{i 1})) \\ - β_{i 3} (s i g^{α_{i 3}} (e_{i 1}) + s i g^{α_{i 2}} (e_{i 1})) \end{matrix}] \\ + [\begin{matrix} e_{i 2} - β_{i 1} (s i g^{α_{i 1}} (e_{i 1}) + e_{i 1}) \\ \frac{e_{i 3}}{2} - β_{i 2} (s i g^{α_{i 1}} (e_{i 1}) + e_{i 1}) \\ - β_{i 3} (s i g^{α_{i 2}} (e_{i 1}) + s i g^{α_{i 1}} (e_{i 1})) \end{matrix}] + [\begin{matrix} 0_{5} \\ 0_{5} \\ - σ_{i} \end{matrix}] = diag ([{|e_{i 1}|}^{α_{i 1} - 1}, {|e_{i 1}|}^{α_{i 1} - 1}, {|e_{i 1}|}^{α_{i 1} - 1}]) A_{i 1} ε_{i} + A_{i 2} ε_{i} + Φ_{i} \end{array}

(9)

where

Φ_{i} = {[\begin{matrix} 0_{5} & 0_{5} & - σ_{i} \end{matrix}]}^{T}

and the coefficient matrices

A_{i 1}

and

A_{i 2}

are expressed as:

A_{i 1} = (\begin{matrix} - α_{i 1} β_{i 1} I_{5} & α_{i 1} I_{5} & 0_{5} \\ - β_{i 2} I_{5} & 0_{5} & {\bar{e}}_{i}^{- 1} I_{5} / 2 \\ - β_{i 3} {\bar{e}}_{i} I_{5} & 0_{5} & 0_{5} \end{matrix}), A_{2 i} = (\begin{matrix} - β_{i 1} I_{5} & I_{5} & 0_{5} \\ - β_{i 2} I_{5} & 0_{5} & I_{5} / 2 \\ - β_{i 3} {\bar{e}}_{i} I_{5} & 0_{5} & 0_{5} \end{matrix})

(10)

with

{\bar{e}}_{i} = {|e_{i 1}|}^{α_{i 1} - 1}

. From the characteristic polynomials of

A_{i 1}

and

A_{i 2}

that all their eigenvalues have negative real parts if the observer gains are set as

β_{i k} > 0

, indicating that

A_{i 1}

and

A_{i 2}

are Hurwitz matrices. Thus, symmetric and positive definite matrices

Q_{i 1}

and

Q_{i 2}

exist that satisfy the following Lyapunov equations:

\{\begin{matrix} A_{i 1}^{T} P_{i} + P_{i} A_{i 1} = - Q_{i 1} \\ A_{i 2}^{T} P_{i} + P_{i} A_{i 2} = - Q_{i 2} . \end{matrix}

(11)

Differentiating

V_{i 1} (e)

with respect to time yields the following:

\begin{array}{l} {\dot{V}}_{i 1} & = ε_{i}^{T} [diag ([{\bar{e}}_{i}, {\bar{e}}_{i}, {\bar{e}}_{i}]) (A_{i 1}^{T} P_{i} + P_{i} A_{i 1})] ε_{i} + ε_{i}^{T} (A_{i 2}^{T} P_{i} + P_{i} A_{i 2}) ε_{i} + 2 ε_{i}^{T} P_{i} Φ_{i} \\ = - ε_{i}^{T} [diag ([{\bar{e}}_{i}, {\bar{e}}_{i}, {\bar{e}}_{i}]) Q_{i 1}] ε_{i} - ε_{i}^{T} Q_{i 2} ε_{i} + 2 ε_{i}^{T} P_{i} Φ_{i} \leq - {\bar{e}}_{i}^{\max} ε_{i}^{T} Q_{i 1} ε_{i} - ε_{i}^{T} Q_{i 2} ε_{i} + 2 ‖ε_{i}‖ ‖P_{i}‖ ‖Φ_{i}‖ \end{array}

(12)

where

{\bar{e}}_{i}^{\max} = {|e_{i 1}|}_{\max}^{α_{i 1} - 1}

and

{|e_{i 1}|}_{\max} = \max \{|e_{i 11}|, \dots, |e_{i 15}|\}

. Given the fact that

{|e_{i 1}|}_{\max} \leq ‖e_{i 1}‖ \leq {‖ε_{i}‖}^{1 / α_{i 1}}

and

α_{i 1} \in (\frac{2}{3}, 1)

, we can obtain the following:

\begin{array}{l} {\dot{V}}_{i 1} & \leq - {‖ε_{i}‖}^{\frac{α_{i 1} - 1}{α_{i 1}}} ε_{i}^{T} Q_{i 1} ε_{i} - ε_{i}^{T} Q_{i 2} ε_{i} + 2 ‖ε_{i}‖ ‖P_{i}‖ ‖Φ_{i}‖ \\ \leq - λ_{\min} (Q_{i 1}) {‖ε_{i}‖}^{3 - \frac{1}{α_{i 1}}} - λ_{\min} (Q_{i 2}) {‖ε_{i}‖}^{2} + 2 ‖ε_{i}‖ ‖P_{i}‖ ‖Φ_{i}‖ . \end{array}

(13)

Since

σ_{i}

is assumed to be bounded reasonably by

|σ_{i p}| \leq {\bar{σ}}_{i}

, we have

2 ‖ε_{i}‖ ‖P_{i}‖ ‖Φ_{i}‖ \leq 2 \sqrt{5} {\bar{σ}}_{i} ‖ε_{i}‖ ‖P_{i}‖ \leq 2 \sqrt{5} {\bar{σ}}_{i} λ_{\min} {(P_{i})}^{- \frac{1}{2}} V_{i 1}^{\frac{1}{2}} ‖P_{i}‖

, by using the inequality

λ_{\min} (P_{i}) {‖ε_{i}‖}^{2} \leq V_{i 1} \leq λ_{\max} (P_{i}) {‖ε_{i}‖}^{2}

(14)

Then, inequality (13) becomes the following:

\begin{array}{l} {\dot{V}}_{i 1} & \leq - λ_{\min} (Q_{i 1}) λ_{\max} {(P_{i})}^{\frac{1}{2 α_{i 1}} - \frac{3}{2}} V_{1 i}^{^{\frac{3}{2} - \frac{1}{2 α_{i 1}}}} - λ_{\min} (Q_{i 2}) λ_{\max} {(P_{i})}^{- 1} V_{i 1} + 2 \sqrt{5} {\bar{σ}}_{i} ‖P_{i}‖ λ_{\min} {(P_{i})}^{- \frac{1}{2}} V_{i 1}^{\frac{1}{2}} \\ \leq - λ_{i 1} V_{i 1}^{^{\frac{3}{2} - \frac{1}{2 α_{i 1}}}} - λ_{i 2} V_{i 1} + λ_{i 3} V_{i 1}^{^{\frac{1}{2}}} \end{array}

(15)

where

λ_{i 1} = - λ_{\min} (Q_{i 1}) λ_{\max} {(P_{i})}^{\frac{1}{2 α_{i 1}} - \frac{3}{2}}

,

λ_{i 2} = - λ_{\min} (Q_{i 2}) λ_{\max} {(P_{i})}^{- 1}

, and

λ_{i 3} = 2 \sqrt{5} {\bar{σ}}_{i} ‖P_{i}‖ λ_{\min} {(P_{i})}^{- \frac{1}{2}}

.

It can be seen that (15) has the same form as the sufficient condition in Lemma 1 2. Thus, the error trajectories of the proposed FTESO (7) are fast finite-time uniformly ultimately bounded stable. The state observation errors

e_{i}

will converge to a small region

Ω_{i}

in the finite time

T_{i f}

. Moreover, the settling time

T_{i f}

is subject to the constraint:

T_{i f} \leq \frac{\ln ((λ_{i 2} - {\bar{λ}}_{i 2}) V_{i 1} {(e_{0})}^{\frac{1}{2 α_{i 1}} - \frac{1}{2}} / (λ_{i 1} - {\bar{λ}}_{i 1}) + 1)}{(λ_{i 2} - {\bar{λ}}_{i 2}) (\frac{1}{2 α_{i 1}} - \frac{1}{2})} .

(16)

And the stable region

Ω_{i}

is denoted as

Ω_{i} = \{e| {\bar{λ}}_{i 1} V_{i 1} {(e)}^{1 - \frac{1}{2 α_{i 1}}} + {\bar{λ}}_{i 2} V_{i 1} {(e)}^{\frac{1}{2}} < λ_{i 3}\}

(17)

where

{\bar{λ}}_{i 1}

and

{\bar{λ}}_{i 2}

are arbitrary constants that meet the conditions

{\bar{λ}}_{i 1} \in (0, λ_{i 1})

and

{\bar{λ}}_{i 2} \in (0, λ_{i 2})

. This completes the proof. □

Remark 1.

Contrasting our proposed FTESO (7) with the FTESO in [37], our approach factors in the dynamics of disturbances and uncertainties to achieve a higher degree of estimation accuracy. Our usage of fractional powers within the FTESO allows for a quick finite-time convergence. It can be noted that the size of the attraction region

Ω_{i}

hinges upon the selection of the observer gains

β_{i k}

and

α_{i 1}

. By increasing

β_{i k}

or decreasing

α_{i 1}

, the attraction region of the observation error system can be expanded and the convergence speed can be improved, but excessive tuning will lead to undesired overshoot and oscillation. As a result, a trade-off should be taken for

β_{i k}

and

α_{i 1}

.

3.2. Outer-Loop Formation Prediction Control Law

In this subsection, we design a DMPC-based outer-loop formation controller. This controller, which draws on the information interaction with neighbors, facilitates the positional tracking of the ith AUV. The controller operates under composite constraints and ensures the avoidance of collisions. Then, we formulate a constrained QP problem in accordance with the control objective to obtain the optimal driving speed.

To facilitate the recursive model prediction and the implementation of the control law, the kinematic model (1) is discretized by using the Forward-Euler method with a sampling period

T_{s}

, resulting in following discrete model:

η_{i} (k + 1) = η_{i} (k) + J_{i} (k) v_{i} (k) T_{s} .

(18)

To smoothen the speed change of the AUV, the velocity increment

Δ u_{i v} (k) = v_{i} (k) - v_{i} (k - 1)

is taken as the control input.

x_{i η} (k) = {[\begin{matrix} η_{i} (k) & v_{i} (k - 1) \end{matrix}]}^{T}

is denoted as the state variable of the prediction model. The augmented state-space model of the outer-loop subsystem can be derived as:

x_{i η} (k + 1) = A_{i η} x_{i η} (k) + B_{i η} Δ u_{i v} (k)

(19)

y_{i η} (k) = C_{i η} x_{i η} (k)

(20)

where

A_{i η} = [\begin{matrix} I_{5} & J_{i} (k) T_{s} \\ 0_{5} & I_{5} \end{matrix}] \in ℝ^{10 \times 10}

,

B_{i η} = [\begin{matrix} J_{i} (k) T_{s} \\ I_{5} \end{matrix}] \in ℝ^{10 \times 5}

, and

C_{i η} = [\begin{matrix} I_{5} & 0_{5} \end{matrix}] \in ℝ^{5 \times 10}

.

According to the state prediction model (19) and (20), we can calculate the predicted state sequence of the system when given an input sequence. Let

N_{p 1}

and

N_{c 1}

denote the prediction and control horizon of the outer-loop controller, respectively. The predicted state sequence and the input incremental sequence are usually represented by compact vectors:

Y_{i η} = [\begin{matrix} y_{i η} (k + 1 | k) \\ y_{i η} (k + 2 | k) \\ ⋮ \\ y_{i η} (k + N_{p 1} | k) \end{matrix}] \in ℝ^{5 N_{p 1}}, x_{i η} = [\begin{matrix} x_{i η} (k + 1 | k) \\ x_{i η} (k + 2 | k) \\ ⋮ \\ x_{i η} (k + N_{p 1} | k) \end{matrix}] \in ℝ^{10 N_{p 1}}

(21)

Δ U_{i v} = [\begin{matrix} Δ u_{i v} (k | k) \\ Δ u_{i v} (k + 1 | k) \\ ⋮ \\ Δ u_{i v} (k + N_{c 1} - 1 | k) \end{matrix}] \in ℝ^{5 N_{c 1}}

(22)

where

y_{i η} (k + l | k)

and

x_{i η} (k + l | k)

are the output vector

y_{i η} (k + l)

and state vector

x_{i η} (k + l)

predicted at time k, respectively.

Δ u_{i v} (k + j | k)

denotes the input increment

Δ u_{i v} (k + j)

predicted at the same time k. Then, we characterize the relationship between the predicted output vector sequence and the control increment sequence through the following prediction equation based on the recurrence relations:

Y_{i η} = H_{i x}^{1} x_{i η} (k) + H_{i u}^{1} Δ U_{i v}

(23)

where

x_{i η} (k)

is the initial state,

H_{i x}^{1} = {[C_{i η} A_{i η}, C_{i η} A_{i η}^{2}, \dots, C_{i η} A_{i η}^{N_{p 1}}]}^{T} \in ℝ^{5 N_{p 1} \times 10}

and

H_{i u}^{1} = [\begin{matrix} C_{i η} B_{i η} & 0_{5} & \dots & 0_{5} \\ C_{i η} A_{i η} B_{i η} & C_{i η} B_{i η} & \dots & 0_{5} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ C_{i η} A_{i η}^{N_{p 1} - 1} B_{i η} & C_{i η} A_{i η}^{N_{p 1} - 2} B_{i η} & \dots & C_{i η} A_{i η}^{N_{p 1} - N_{c 1}} B_{i η} \end{matrix}] \in ℝ^{5 N_{p 1} \times 5 N_{c 1}}

.

Considering the control objective, the constraints within the outer-loop subsystem are considered. First, we set upper and lower boundaries for the amplitude of the control input

u_{i v} (k)

and the input increment

Δ u_{i v} (k)

:

u_{i v}^{\min} \leq u_{i v} (k) \leq u_{i v}^{\max}

(24)

Δ u_{i v}^{\min} \leq Δ u_{i v} (k) \leq Δ u_{i v}^{\max}

(25)

where

u_{i v}^{\min}

and

Δ u_{i v}^{\min}

represent the predefined lower bounds, and

u_{i v}^{\max}

and

Δ u_{i v}^{\max}

represent the predefined upper bounds.

Next, to assure safe navigation throughout the formation construction stage, we need to consider the collision avoidance constraints between AUVs. The primitive collision avoidance constraints of the ith AUV can be transformed into a convex constraint, as follows:

‖S (y_{i η} (k + l | k) - y_{j η} (k + l | k))‖ \geq r_{s}, j \in Ξ_{i}

(26)

where

l = 1, 2, \dots, N_{p 1}

and

r_{s}

is the preset minimum allowable distance between the ith AUV and the jth AUV.

S

denotes a scaling matrix. Let

r_{d}

be the radius of the safe detection zone for the ith AUV.

Ξ_{i}

is the set of those AUVs that contain within

r_{d}

. Let the nominal value

{\bar{y}}_{i η}

represent an initial guess of the actual value

y_{i η}

for convexifying the collision avoidance constraint. It follows from (26) that a sufficient condition for upholding the collision avoidance constraint is the following:

{\bar{d}}_{i j}^{T} (k + l | k) S^{T} S (y_{i η} (k + l | k) - {\bar{y}}_{j η} (k + l | k)) \geq r_{s} ‖S {\bar{d}}_{i j} (k + l | k)‖

(27)

where

{\bar{d}}_{i j} (k + l | k) = {\bar{y}}_{i η} (k + l | k) - {\bar{y}}_{j η} (k + l | k)

. In order to express the constraints in a compact matrix form, define

{\bar{R}}_{l} = r_{s} ‖S {\bar{d}}_{i j} (k + l | k)‖ + {\bar{d}}_{i j}^{T} (k + l | k) S^{T} S {\bar{y}}_{j η} (k + l | k)

,

{\bar{R}}_{i j} = {[{\bar{R}}_{1}, {\bar{R}}_{2}, \dots, {\bar{R}}_{N_{p 1}}]}^{T}

and

{\bar{S}}_{i j} = diag \{{\bar{S}}_{1}, {\bar{S}}_{2}, \dots, {\bar{S}}_{N_{p 1}}\}

, and

{\bar{S}}_{l} = {\bar{d}}_{i j}^{T} (k + l | k) S^{T} S

. Then, (27) can be rewritten as

{\bar{S}}_{i j} Y_{i η} \geq {\bar{R}}_{i j}

. Substitute (23) to derive the collision avoidance constraint as follows:

{\bar{S}}_{i j} H_{i u}^{1} Δ U_{i v} \geq {\bar{R}}_{i j} - {\bar{S}}_{i j} H_{i x}^{1} x_{i η} (k) .

(28)

The input amplitude constraint (24) can be converted to the input incremental constraint, associating (25) and (28), expressed in the compact linear constraint form as follows:

Γ_{i η} Δ U_{i v} \leq γ_{i η}

(29)

where

Γ_{i η} = [\begin{matrix} I_{5 N_{c 1}} \\ - I_{5 N_{c 1}} \\ I_{η 1} \\ - I_{η 1} \\ - {\bar{S}}_{i j} H_{i u}^{1} \end{matrix}]

,

γ_{i η} = [\begin{matrix} Δ U_{i v}^{\max} \\ - Δ U_{i v}^{\min} \\ U_{i v}^{\max} - I_{η 2} u_{i} (k - 1) \\ - U_{i v}^{\min} + I_{η 2} u_{i} (k - 1) \\ {\bar{S}}_{i j} H_{i x}^{1} x_{i η} (k) - {\bar{R}}_{i j} \end{matrix}]

,

I_{η 1} = [\begin{matrix} I_{5} & 0_{5} & \dots & 0_{5} \\ I_{5} & I_{5} & \dots & 0_{5} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ I_{5} & I_{5} & \dots & I_{5} \end{matrix}] \in ℝ^{5 N_{c 1} \times 5 N_{c 1}}

, and

I_{η 2} = {[I_{5}, I_{5}, \dots, I_{5}]}^{T} \in ℝ^{5 N_{c 1}}

.

In order to achieve the control objective of formation positional tracking with low energy requirements, we define the local distributed cost function in the outer-loop subsystem of the ith AUV in a discretized form:

\begin{array}{l} J_{i η} (k) = & \sum_{l = 1}^{N_{p 1}} {‖(y_{i η} (k + l| k) - y_{i f} (k + l))‖}_{Q_{i f}}^{2} + \sum_{l = 0}^{N_{c 1} - 1} {‖Δ u_{i v} (k + l| k)‖}_{R_{i 1}}^{2} \\ + \sum_{l = 1}^{N_{p 1}} \sum_{j \in N_{i}} a_{i j} {‖(y_{i η} (k + l| k) - y_{i j} (k + l))‖}_{Q_{i j}}^{2} \end{array}

(30)

where

Q_{i f}

,

Q_{i j}

, and

R_{i 1}

are the weight matrices.

y_{i f} (k + l) = η_{r} (k + l) + r_{i f} (k + l)

with

r_{i f} (k + l)

represents the formation configuration.

y_{i j} (k + l) = y_{j η} (k + l) + r_{i j} (k + l)

with

r_{i j} (k + l)

represents the predefined relative distance between the ith AUV and its neighbor jth AUV.

N_{p 1}

indicates the degree of prediction of future tracking errors. The larger it is, the better the tracking accuracy and stability. The smaller

N_{c 1}

is, the worse the dynamic response is, and conversely the more maneuverable the control is.

Q_{i f}

is the position tracking matrix, the larger it is, the better the tracking accuracy and dynamic response.

Q_{i j}

is the relative position matrix, the larger it is, the better the ability of the formation to maintain the preset configuration.

R_{i 1}

is the control increment weight matrix, mainly to limit the drastic change of

Δ u_{i v}

.

Based on the above derivations, we can formulate the optimization problem for the outer-loop subsystem of the ith AUV at the sampling instant k within the receding-horizon framework:

\begin{array}{l} \min_{Δ U_{i v}} J_{i η} (k) \\ s . t . Γ_{i η} Δ U_{i v} \leq γ_{i η} . \end{array}

(31)

To simplify the computation of (31), it can be transformed into a convex QP problem. This problem is solved over a finite receding horizon using a QP solver. The standard convex QP form of the DMPC problem (31) can be derived:

\begin{array}{l} Δ U_{i v}^{*} = \underset{Δ U_{i v}}{\arg \min} (\frac{1}{2} Δ U_{i v}^{T} W_{i η} Δ U_{i v} + f_{i η}^{T} Δ U_{i v}) \\ s . t . Γ_{i η} Δ U_{i v} \leq γ_{i η} \end{array}

(32)

where

W_{i η} = {\bar{R}}_{i 1} + H_{i u}^{1 T} {\bar{Q}}_{i f} H_{i u}^{1} + \sum_{j \in N_{i}} a_{i j} H_{i u}^{1 T} {\bar{Q}}_{i j} H_{i u}^{1}

,

f_{i η} = H_{i u}^{1 T} {\bar{Q}}_{i f} (H_{i x}^{1} x_{i η}^{} - Y_{i f}) + \sum_{j \in N_{i}} a_{i j} H_{i u}^{1 T} {\bar{Q}}_{i j} (H_{i x}^{1} x_{i η}^{} - Y_{i j})

, with

Y_{i f} = {[y_{i f} (k + 1), \dots, y_{i f} (k + N_{p 1})]}^{T}

,

Y_{i j} = {[y_{i j} (k + 1), \dots, y_{i j} (k + N_{p 1})]}^{T}

,

{\bar{Q}}_{i f} = diag \{Q_{i f}, Q_{i f}, \dots, Q_{i f}\} \in ℝ^{5 N_{p 1} \times 5 N_{p 1}}

,

{\bar{Q}}_{i j}

, and

{\bar{R}}_{i 1}

are similar to

{\bar{Q}}_{i f}

, both corresponding compact matrices.

By solving the QP optimization problem in (32) online, we obtain the optimal control input increment sequence

Δ U_{i v}^{*}

. Of this sequence, we only utilize the first element

Δ u_{i v}^{*} (k | k)

for receding optimization. Once

Δ u_{i v}^{*} (k)

is determined, we obtain

v_{i} (k)

which serves as the desired driving speed for the inner-loop controller of the ith AUV, i.e.,

v_{i r} (k) = v_{i} (k) = v_{i} (k - 1) + Δ u_{i v}^{*} (k) .

(33)

3.3. Inner-Loop Formation Prediction Control Law

In this subsection, with the aid of the proposed FTESO, we design a DMPC-based formation controller for the inner-loop subsystem to obtain the optimal driving force and moment for the ith AUV to track the desired speed.

The dynamic model (4) is discretized with a sampling period

T_{s}

, yielding the following discretized model:

v_{i} (k + 1) = (I - M_{i}^{*}^{- 1} T_{s} (C_{i}^{*} + D_{i}^{*})) v_{i} (k) + M_{i}^{*}^{- 1} T_{s} τ_{i} (k) + M_{i}^{*}^{- 1} T_{s} {\hat{τ}}_{i d} (k)

(34)

where

{\hat{τ}}_{i d}

represents the compound disturbance compensated by FTESO (7), which is supposed to be invariant over a short period. It should be noted that we assume the center of gravity and buoyancy of the ith AUV to coincide, which allows

g_{i} (η_{i})

to approximate to zero. We select

x_{i v} (k) = {[\begin{matrix} v_{i} (k) & τ_{i} (k - 1) \end{matrix}]}^{T}

as the state variable and take the increment

Δ u_{i τ} (k) = τ_{i} (k) - τ_{i} (k - 1)

as the control input. This allows us to reformulate the inner-loop predictive model as follows:

x_{i v} (k + 1) = A_{i v} x_{i v} (k) + B_{i v} Δ u_{i τ} (k) + D_{i v}

(35)

y_{i v} (k) = C_{i v} x_{i v} (k)

(36)

where

A_{i v} = [\begin{matrix} I_{5} - M_{i}^{*}^{- 1} T_{s} (C_{i}^{*} + D_{i}^{*}) & M_{i}^{*}^{- 1} T_{s} \\ 0_{5} & I_{5} \end{matrix}] \in ℝ^{10 \times 10}

,

B_{i v} = [\begin{matrix} M_{i}^{*}^{- 1} T_{s} \\ I_{5} \end{matrix}] \in ℝ^{10 \times 5}

,

C_{i v} = [\begin{matrix} I_{5} & 0_{5} \end{matrix}] \in ℝ^{5 \times 10}

, and

D_{i v} = [\begin{matrix} M_{i}^{*}^{- 1} T_{s} {\hat{τ}}_{i d} \\ 0_{5 \times 1} \end{matrix}] \in ℝ^{10}

. Similar to our previous approach, we can characterize the relationship between the predicted output vector sequence and the control increment sequence using the following prediction equation:

Y_{i v} = H_{i x}^{2} x_{i v} (k) + H_{i u}^{2} Δ U_{i τ} + {\bar{D}}_{i v}

(37)

where

Y_{i v} = {[y_{i v} (k + 1 | k), y_{i v} (k + 2 | k), \dots, y_{i v} (k + N_{p 2} | k)]}^{T} \in ℝ^{5 N_{p 2}}

,

Δ U_{i τ} = {[Δ u_{i τ} (k | k), Δ u_{i τ} (k + 1 | k), \dots, Δ u_{i τ} (k + N_{c 2} - 1 | k)]}^{T} \in ℝ^{5 N_{c 2}}

,

H_{i x}^{2} = {[C_{i v} A_{i v}, C_{i v} A_{i v}^{2}, \dots, C_{i v} A_{i v}^{N_{p 2}}]}^{T} \in ℝ^{5 N_{p 2} \times 10}

,

H_{i u}^{2} = [\begin{matrix} C_{i v} B_{i v} & 0_{5} & \dots & 0_{5} \\ C_{i v} A_{i v} B_{i v} & C_{i v} B_{i v} & \dots & 0_{5} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ C_{i v} A_{i v}^{N_{p 2} - 1} B_{i v} & C_{i v} A_{i v}^{N_{p 2} - 2} B_{i v} & \dots & C_{i v} A_{i v}^{N_{p 2} - N_{c 2}} B_{i v} \end{matrix}] \in ℝ^{5 N_{p 2} \times 5 N_{c 2}}

, and

{\bar{D}}_{i v} = {[C_{i v} D_{i v}, C_{i v} A_{i v} D_{i v} + C_{i v} D_{i v}, \dots, C_{i v} \sum_{n = 0}^{N_{p 2} - 1} A_{i v}^{n} D_{i v}]}^{T} \in ℝ^{5 N_{p 2}}

.

N_{p 2}

and

N_{c 2}

denote the prediction and control horizon of the inner-loop controller, respectively.

According to the control objective, we assess the constraints on the control input increment and the actuator saturation in the inner-loop subsystem, as follows:

Δ u_{i τ}^{\min} \leq Δ u_{i τ} (k) \leq Δ u_{i τ}^{\max}

(38)

τ_{i}^{\min} \leq τ_{i} (k) \leq τ_{i}^{\max}

(39)

where

τ_{i}^{\min}

and

Δ u_{i v}^{\min}

represent predefined lower bounds, while

τ_{i}^{\max}

and

Δ u_{i v}^{\max}

denote predefined upper bounds. The actuator saturation constraint (39) can be transformed into an input incremental constraint, and we can express the above constraints in a compact linear constraint form:

Γ_{i v} Δ U_{i τ} \leq γ_{i v}

(40)

where

Γ_{i v} = [\begin{matrix} I_{5 N_{c 2}} \\ - I_{5 N_{c 2}} \\ I_{v 1} \\ - I_{v 1} \end{matrix}]

,

γ_{i η} = [\begin{matrix} Δ U_{i τ}^{\max} \\ - Δ U_{i τ}^{\min} \\ {\bar{τ}}_{i}^{\max} - I_{v 2} τ_{i} (k - 1) \\ - {\bar{τ}}_{i v}^{\min} + I_{v 2} τ_{i} (k - 1) \end{matrix}]

, with

I_{v 1} = [\begin{matrix} I_{5} & 0_{5} & \dots & 0_{5} \\ I_{5} & I_{5} & \dots & 0_{5} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ I_{5} & I_{5} & \dots & I_{5} \end{matrix}] \in ℝ^{5 N_{c 2} \times 5 N_{c 2}}

and

I_{v 2} = {[I_{5}, I_{5}, \dots, I_{5}]}^{T} \in ℝ^{5 N_{c 2}}

.

To achieve the convergence of the formation tracking velocity to the desired value, we define the local distributed cost function of the inner-loop subsystem as follows:

J_{i v} (k) = \sum_{l = 1}^{N_{p 2}} {‖(y_{i v} (k + l| k) - v_{i r} (k + l))‖}_{Q_{i v}}^{2} + \sum_{l = 0}^{N_{c 2} - 1} {‖Δ u_{i τ} (k + l| k)‖}_{R_{i 2}}^{2}

(41)

where

y_{i v} (k + l| k)

and

Δ u_{i τ} (k + j| k)

denote the predicted value of

y_{i v} (k + l)

and

Δ u_{i τ} (k + j)

at time k, respectively.

Q_{i v}

and

R_{i 2}

are given weight matrices.

By substituting (37) into (41), we can formulate the DMPC optimization problem for the inner-loop subsystem of the ith AUV at sampling instant k as the following QP form:

\begin{array}{l} Δ U_{i τ}^{*} = \underset{Δ U_{i τ}}{\arg \min} (\frac{1}{2} Δ U_{i τ}^{T} W_{i v} Δ U_{i τ} + f_{i v}^{T} Δ U_{i τ}) \\ s . t . Γ_{i v} Δ U_{i τ} \leq γ_{i v} \end{array}

(42)

where

W_{i v} = {\bar{R}}_{i 2} + H_{i u}^{2 T} {\bar{Q}}_{i v} H_{i u}^{2}

and

f_{i v} = H_{i u}^{2 T} {\bar{Q}}_{i v} (H_{i x}^{2} x_{i v} + {\bar{D}}_{i v} - V_{i r})

, with

v_{i r} = {[v_{i r} (k + 1), \dots, v_{i r} (k + N_{p 2})]}^{T} \in ℝ^{5 N_{p 2}}

,

{\bar{Q}}_{i v} = diag \{Q_{i v}, Q_{i v}, \dots, Q_{i v}\} \in ℝ^{5 N_{p 2} \times 5 N_{p 2}}

, and

{\bar{R}}_{i 2} = diag \{R_{i 2}, R_{i 2}, \dots, R_{i 2}\} \in ℝ^{5 N_{c 2} \times 5 N_{c 2}}

.

The solution of the QP optimization problem (42) yields the optimal control input increment sequence

Δ U_{i τ}^{*}

at time k. However, only the first element

Δ u_{i τ}^{*} (k | k)

of the sequence is used for the ith AUV to obtain the optimal control force and moment

τ_{i}^{*} (k) = τ_{i} (k - 1) + Δ u_{i τ}^{*} (k)

. The

Δ u_{i τ}^{*} (k)

is recalculated at each sampling instant, the ith AUV repeatedly calculates and executes

τ_{i}^{*} (k)

to achieve receding optimization. The predicted state

x_{i v} (k + 1)

and the optimal input

τ_{i}^{*} (k)

are both determined solely by the current state

x_{i v} (k)

.

With the parallel optimization of N AUV subsystems, all local optimization problems are solved simultaneously at each sampling moment. One or more information interactions occur between local controllers to obtain the optimal input sequence for that moment. Thus, the proposed control law can compensate well for the compound disturbances, which consist of model uncertainties and external disturbances. This occurs throughout the iterative optimization process, while simultaneously ensuring collision avoidance and formation tracking control tasks under complex constraints.

3.4. Use of Laguerre Functions in the DMPC Design

This subsection introduces a strategy to handle the computational burden caused by a longer control horizon and dual closed-loop structure. This is the main difficulty in our theoretical analysis. The Laguerre orthogonal functions are leveraged in the DMPC design to decrease the order of the input incremental matrices. This approach permits a reduction in input variables during each control cycle, thereby reducing the computational burden in the time interval and improving real-time performance.

The Laguerre functions are a set of discrete orthogonal polynomial functions, let it be

l_{1} (k), l_{2} (k), \dots, l_{M} (k)

, the z-transfer of the mth Laguerre function is expressed as follows:

Χ_{m} (z) = \frac{\sqrt{1 - a^{2}}}{z - a} {[\frac{1 - a z}{z - a}]}^{m - 1}

(43)

where

0 \leq a < 1

denotes the pole of the Laguerre function, also known as the scaling factor. It can be verified that

Χ_{m}

satisfies the following orthogonality:

\{\begin{array}{l} \frac{1}{2 π} \int_{- π}^{π} Χ_{m} (e^{j ω}) Χ_{n} {(e^{j ω})}^{*} d ω = 1 m = n \\ \frac{1}{2 π} \int_{- π}^{π} Χ_{m} (e^{j ω}) Χ_{n} {(e^{j ω})}^{*} d ω = 0 m \neq n \end{array}

(44)

where

{(\cdot)}^{*}

denotes complex conjugate of

(\cdot)

.

The discrete Laguerre functions are defined by taking the inverse Z-transform of (43), i.e.,

l_{m} (k) = Z^{- 1} \{Χ_{m} (z)\}

. Given the network structure of

Χ_{m} (z)

and the recurrence relation, the set of discrete Laguerre functions satisfies the following difference equation:

L (k + 1) = Ξ L (k)

(45)

where

Ξ = [\begin{matrix} a & 0 & 0 & \dots & 0 \\ β & a & 0 & \dots & 0 \\ - a β & β & a & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ {(- a)}^{M - 2} β & {(- a)}^{M - 3} β & {(- a)}^{M - 4} β & \dots & a \end{matrix}]

and

L (k) = {[l_{1} (k), l_{2} (k), \dots, l_{M} (k)]}^{T}

, with

β = 1 - a^{2}

and initial condition

L (0) = \sqrt{β} {[1, - a, a^{2}, - a^{3}, \dots, {(- a)}^{M - 1}]}^{T}

. Note that at

a = 0

, the Laguerre functions are converted to impulse functions.

Assuming the current moment is k, the input increment of the single-input system at the next time l, represented by the Laguerre function, is:

Δ u (k + l) = \sum_{m = 1}^{M} κ_{m} l_{m} (l) = L {(l)}^{T} Κ

(46)

where

K = {[κ_{1}, κ_{2}, \dots, κ_{M}]}^{T}

. When we extend this to the multi-AUV system, each AUV has five independent control inputs, and the input increment of the ith AUV is as follows:

Δ u_{i} {(k)}^{T} = [L_{i}^{1} {(k)}^{T} Κ_{i}^{1}, L_{i}^{2} {(k)}^{T} Κ_{i}^{2}, \dots, L_{i}^{5} {(k)}^{T} Κ_{i}^{5}] = {\bar{L}}_{i} {(k)}^{T} {\bar{Κ}}_{i}

(47)

where

L_{i}^{p} (k) = {[l_{i 1}^{p} (k), l_{i 2}^{p} (k), \dots, l_{i M}^{p} (k)]}^{T}

and

K_{i}^{p} = {[κ_{i 1}^{p}, κ_{i 2}^{p}, \dots, κ_{i M}^{p}]}^{T}

, with

p = 1, 2, \dots 5 .

{\bar{L}}_{i} (k) = diag \{L_{i}^{1} {(k)}^{T}, L_{i}^{2} {(k)}^{T}, \dots, L_{i}^{5} {(k)}^{T}\}

, and

{\bar{Κ}}_{i} = {[K_{i}^{1 T}, K_{i}^{2 T}, \dots, K_{i}^{5 T}]}^{T}

. Note that within a multi-input structure, the scaling factor

a_{p}

and the number of polynomial terms

M_{p}

can be selected independently for each input signal.

For illustrative purposes, the inner-loop predictive controller of the ith AUV is taken as an example. If we partition the input matrix into

B_{i v} = [\begin{matrix} B_{i v}^{1} & B_{i v}^{2} & \dots & B_{i v}^{5} \end{matrix}]

, the prediction of the system output in the next l steps can be derived as follows:

\begin{array}{l} y_{i v} (k + l| k) = & \sum_{j = 0}^{l - 1} C_{i v} A_{i v}^{l - j - 1} [\begin{matrix} B_{i v}^{1} L_{i}^{1} {(j)}^{T} K_{i τ}^{1} & B_{i v}^{2} L_{i}^{2} {(j)}^{T} K_{i τ}^{2} & \dots & B_{i v}^{5} L_{i}^{5} {(j)}^{T} K_{i τ}^{5} \end{matrix}] \\ + C_{i v} A_{i v}^{l} x_{i v} (k) + \sum_{j = 0}^{l - 1} C_{i v} A_{i v}^{l - j - 1} D_{i v} . \end{array}

(48)

For a compact notation, we denote (48) by the following:

y_{i v} (k + l| k) = C_{i v} A_{i v}^{l} x_{i v} (k) + μ_{i} {(l)}^{T} {\bar{Κ}}_{i τ} + D_{i v}^{l}

(49)

where

μ_{i} {(l)}^{T} = \sum_{j = 0}^{l - 1} C_{i v} A_{i v}^{l - j - 1} [\begin{matrix} B_{i v}^{1} L_{i}^{1} {(j)}^{T} & B_{i v}^{2} L_{i}^{2} {(j)}^{T} & \dots & B_{i v}^{5} L_{i}^{5} {(j)}^{T} \end{matrix}]

and

D_{i v}^{l} = \sum_{j = 0}^{l - 1} C_{i v} A_{i v}^{l - j - 1} D_{i v}

.

{\bar{Κ}}_{i τ}

as the parameter vector that is to be optimized.

First, we employ the Laguerre function to optimize the constraint terms (38) and (39), leading to the following constraint form:

Δ u_{i τ}^{\min} \leq {\bar{L}}_{i τ}^{T} {\bar{Κ}}_{i τ} \leq Δ u_{i τ}^{\max}

(50)

τ_{i}^{\min} \leq {\overset{⌢}{L}}_{i τ} {\bar{Κ}}_{i τ} + τ_{i} (k - 1) \leq τ_{i}^{\max}

(51)

where

{\bar{L}}_{i τ} = d i a g \{L_{i τ}^{1} {(l)}^{T}, \dots, L_{i τ}^{5} {(l)}^{T}\}

and

{\overset{⌢}{L}}_{i τ} = diag \{\sum_{j = 0}^{l - 1} L_{i τ}^{1} {(j)}^{T}, \sum_{j = 0}^{l - 1} L_{i τ}^{2} {(j)}^{T}, \dots, \sum_{j = 0}^{l - 1} L_{i τ}^{5} {(j)}^{T}\}

.

Given that the Laguerre functions are orthonormal for a sufficiently large control horizon

N_{c 2}

. Substituting (47) into (41) and using the orthogonality (44) of the Laguerre function (i.e., the inner product of different terms is 0 and the same term is 1), the following derivation can be performed to obtain the reconstructed form of the cost function (41):

\begin{array}{l} J_{i v} (k) & = \sum_{l = 1}^{N_{p 2}} {‖(y_{i v} (k + l| k) - v_{i r} (k + l))‖}_{Q_{i v}}^{2} + \sum_{l = 0}^{N_{c 2} - 1} Δ u_{i τ} {(k + l| k)}^{T} R_{i 2} Δ u_{i τ} (k + l| k) \\ = \sum_{l = 1}^{N_{p 2}} {‖(y_{i v} (k + l| k) - v_{i r} (k + l))‖}_{Q_{i v}}^{2} + \sum_{l = 0}^{N_{c 2} - 1} ({\bar{L}}_{i τ} {(l)}^{T} {\bar{Κ}}_{i τ}) R_{i 2} {({\bar{L}}_{i τ} {(l)}^{T} {\bar{Κ}}_{i τ})}^{T} \\ = \sum_{l = 1}^{N_{p 2}} {[y_{i v} (k + l| k) - v_{i r} (k + l)]}^{T} Q_{i v} [y_{i v} (k + l| k) - v_{i r} (k + l)] \\ + \sum_{l = 0}^{N_{c 2} - 1} (d i a g \{L_{i τ}^{1} (l), L_{i τ}^{2} (l), \dots, L_{i τ}^{5} (l)\} {\bar{Κ}}_{i τ}) R_{i 2} {(d i a g \{L_{i τ}^{1} (l), L_{i τ}^{2} (l), \dots, L_{i τ}^{5} (l)\} {\bar{Κ}}_{i τ})}^{T} \\ = \sum_{l = 1}^{N_{p 2}} {[y_{i v} (k + l| k) - v_{i r} (k + l)]}^{T} Q_{i v} [y_{i v} (k + l| k) - v_{i r} (k + l)] + {\bar{Κ}}_{i τ}^{T} R_{i 2} {\bar{Κ}}_{i τ} . \end{array}

(52)

By substituting (49) into (52), we can rewrite the DMPC optimization problem (42) for the inner-loop subsystem of the ith AUV as:

\begin{array}{l} \min_{{\bar{Κ}}_{i τ}^{*}} J_{i v} (k) = \min_{{\bar{Κ}}_{i τ}^{*}} (\frac{1}{2} {\bar{Κ}}_{i τ}^{T} W_{i L} {\bar{Κ}}_{i τ} + f_{i L}^{T} {\bar{Κ}}_{i τ}) \\ s . t . (50), (51) \end{array}

(53)

where

W_{i L} = \sum_{l = 1}^{N_{p 2}} μ_{i} (l) Q_{i v} μ_{i} {(l)}^{T} + R_{i 2}

and

f_{i L} = \sum_{l = 1}^{N_{p 2}} μ_{i} (l) Q_{i v} (C_{i v} A_{i v}^{l} x_{i v} (k) + D_{i v}^{l} - v_{i r} (k + l))

.

The QP optimization Equation (53), with constraints, can be solved to obtain the optimal parameter vector

{\bar{Κ}}_{i τ}^{*}

. This vector replaces the conventional DMPC method calculation of

Δ u_{i τ}^{*}

. Thus, the optimal input increment of the inner-loop subsystem is indirectly obtained by the rolling optimized control law,

Δ u_{i τ} {(k)}^{T} = {\bar{L}}_{i τ} {(0)}^{T} {\bar{Κ}}_{i τ}

, until the control variables at the next moment are calculated. This iterative process ensures the achievement of receding horizon optimization. The use of the Laguerre function in the design of the outer-loop predictive controller is not included here, as its analysis parallels that of the inner-loop controller described above.

Remark 2.

By parameterizing the input increment sequence using the Laguerre function, the input matrix order in the prediction horizon can be lowered, thereby reducing the computational load online. This property enables its application in large-scale and real-time AUV control systems. With the employment of the Laguerre function, the coefficients

a_{p}

and

M_{p}

can also be served as tuning parameters, in addition to the control and prediction horizon and weighting matrices. Larger

a_{p}

and

M_{p}

lead to faster closed-loop responses [38].

3.5. Stability Analysis

A notable attribute of the MPC is the potential for establishing the stability of a closed-loop system under certain conditions. Extending this to cases using Laguerre polynomials, a terminal state constraint is utilized to analyze the stability of the closed-loop system. Specifically, for the inner-loop subsystem, an additional constraint is attached to the final state of the receding optimization problem:

x_{i v} (k + N_{p 2}) = 0

, where

x_{i v} (k + N_{p 2})

is the terminal state produced under the effect of the control sequence,

Δ u_{i τ} {(k + l)}^{T} = {\bar{L}}_{i τ} {(l)}^{T} {\bar{Κ}}_{i τ}

.

Theorem 2.

Consider the inner-loop subsystem (35) and (36) of the ith AUV in the formation control system, which has a local cost function (41) with constraints (38) and (39). The inner-loop predictive control subsystem is asymptotically stable if for each sampling instant k, there exists a solution

{\bar{Κ}}_{i τ}

such that the performance index

J_{i v}

is minimized subject to the terminal state constraint.

Proof of Theorem 2.

Constructing an appropriate Lyapunov function is key to ensuring the stability of the DMPC system. Select the cost function

J_{i v} (k)

as the Lyapunov function

V_{i 2} (y (k), k)

:

V_{i 2} (y (k), k) = \sum_{l = 1}^{N_{p 2}} {\tilde{y}}_{i v} {(k + l| k)}^{T} Q_{i v} {\tilde{y}}_{i v} (k + l| k) + \sum_{l = 0}^{N_{c 2} - 1} Δ u_{i τ} {(k + l)}^{T} R_{i 2} Δ u_{i τ} (k + l)

(54)

where

{\tilde{y}}_{i v} (k + l| k) = y_{i v} (k + l| k) - v_{i r} (k + l)

and

y_{i v} (k + l| k) = C_{i v} A_{i v}^{l} x_{i v} (k) + μ_{i} {(l)}^{T} {\bar{Κ}}_{i τ}^{k} + D_{i v}^{l}

,

{\bar{Κ}}_{i τ}^{k}

is the parameter vector solution of the cost function (41) under the original and terminal constraints at moment k, and input increment

Δ u_{i τ} {(k + l)}^{T} = {\bar{L}}_{i τ} {(l)}^{T} {\bar{Κ}}_{i τ}^{k}

. It is clear that

V_{i 2} (y (k), k)

is positive definite and tends to infinity as

y_{i v} (k)

tends to infinity. Similarly, the Lyapunov function at moment

k + 1

can be derived as:

\begin{array}{l} V_{i 2} (y (k + 1), k + 1) = & \sum_{l = 1}^{N_{p 2}} {\tilde{y}}_{i v} {(k + 1 + l| k + 1)}^{T} Q_{i v} {\tilde{y}}_{i v} (k + 1 + l| k + 1) \\ + \sum_{l = 0}^{N_{c 2} - 1} Δ u_{i τ} {(k + 1 + l)}^{T} R_{i 2} Δ u_{i τ} (k + 1 + l) \end{array}

(55)

where

y_{i v} (k + 1 + l| k + 1) = C_{i v} A_{i v}^{l} x_{i v} (k + 1) + μ_{i} {(l)}^{T} {\bar{Κ}}_{i τ}^{k + 1} + D_{i v}^{l}

,

{\bar{Κ}}_{i τ}^{k + 1}

is the parameter vector solution at time

k + 1

, and

Δ u_{i τ} {(k + 1 + l)}^{T} = {\bar{L}}_{i τ} {(l)}^{T} {\bar{Κ}}_{i τ}^{k + 1}

. Given that

y_{i v} (k + 1)

is the response one step ahead of

y_{i v} (k)

and

y_{i v} (k + 1) = C_{i v} A_{i v} x_{i v} (k) + C_{i v} B_{i v} Δ u_{i τ} (k) + C_{i v} D_{i v}

, the feasible solution of

{\bar{Κ}}_{i τ}^{k + 1}

corresponding to the initial output

y_{i v} (k + 1)

in the receding horizon is

{\bar{Κ}}_{i τ}^{k}

. Therefore, the feasible solution sequence at moment

k + 1

is to move the elements in

{\bar{L}}_{i τ} {(0)}^{T} {\bar{Κ}}_{i τ}^{k}

,

{\bar{L}}_{i τ} {(1)}^{T} {\bar{Κ}}_{i τ}^{k}

, …,

{\bar{L}}_{i τ} {(N_{c 2} - 1)}^{T} {\bar{Κ}}_{i τ}^{k}

one step forward and substitute the last element with 0, i.e.,

{\bar{L}}_{i τ} {(1)}^{T} {\bar{Κ}}_{i τ}^{k}

,

{\bar{L}}_{i τ} {(2)}^{T} {\bar{Κ}}_{i τ}^{k}

, …,

{\bar{L}}_{i τ} {(N_{c 2} - 1)}^{T} {\bar{Κ}}_{i τ}^{k}

,

0_{5 \times 1}

. Due to the optimality of the solution

{\bar{Κ}}_{i τ}^{k + 1}

at

k + 1

, it follows that

V_{i 2} (y (k + 1), k + 1) \leq {\hat{V}}_{i 2} (y (k + 1), k + 1)

(56)

where

{\hat{V}}_{i 2} (y (k + 1), k + 1)

is identical to (55) except that the parameter vector solution

{\bar{Κ}}_{i τ}^{k + 1}

in the control sequence is replaced by the feasible solution

{\bar{Κ}}_{i τ}^{k}

. The difference between

V_{i 2} (y (k), k)

and

V_{i 2} (y (k + 1), k + 1)

is then bounded by the following:

V_{i 2} (y (k + 1), k + 1) - V_{i 2} (y (k), k) \leq {\hat{V}}_{i 2} (y (k + 1), k + 1) - V_{i 2} (y (k), k) .

(57)

Eliminate the same terms in the control sequence and output sequence of

{\hat{V}}_{i 2} (y (k + 1), k + 1)

and

V_{i 2} (y (k), k)

at moments

k + 1

,

k + 2

,…,

k + N_{p 2} - 1

, and we can derive the following equation:

\begin{array}{l} {\hat{V}}_{i 2} (y (k + 1), k + 1) - V_{i 2} (y (k), k) = & {\tilde{y}}_{i v} {(k + N_{p 2}| k)}^{T} Q_{i v} {\tilde{y}}_{i v} (k + N_{p 2}| k) \\ - {\tilde{y}}_{i v} {(k + 1| k)}^{T} Q_{i v} {\tilde{y}}_{i v} (k + 1| k) - Δ u_{i τ} {(k)}^{T} R_{i 2} Δ u_{i τ} (k) . \end{array}

(58)

Given the terminal constraint

x_{i v} (k + N_{p 2}) = 0

is applied, equivalent to

y_{i v} (k + N_{p 2}) = 0

, we have the following:

\begin{array}{l} {\hat{V}}_{i 2} (y (k + 1), k + 1) - V_{i 2} (y (k), k) = & - v_{i r} {(k + N_{p 2})}^{T} Q_{i v} v_{i r} (k + N_{p 2}) \\ - {\tilde{y}}_{i v} {(k + 1| k)}^{T} Q_{i v} {\tilde{y}}_{i v} (k + 1| k) - Δ u_{i τ} {(k)}^{T} R_{i 2} Δ u_{i τ} (k) . \end{array}

(59)

This allows inequality (57) to be converted into:

\begin{array}{l} V_{i 2} (y (k + 1), k + 1) - V_{i 2} (y (k), k) \leq & - v_{i r} {(k + N_{p 2})}^{T} Q_{i v} v_{i r} (k + N_{p 2}) \\ - {\tilde{y}}_{i v} {(k + 1| k)}^{T} Q_{i v} {\tilde{y}}_{i v} (k + 1| k) - Δ u_{i τ} {(k)}^{T} R_{i 2} Δ u_{i τ} (k) < 0 . \end{array}

(60)

Namely,

V_{i 2} (y (k + 1), k + 1) < V_{i 2} (y (k), k)

; the Lyapunov function is monotonically decreasing. This proves that the inner loop subsystem is asymptotically stable. □

Next, we analyze the stability of the entire closed-loop system. Analogous to the proof of Theorem 2, we select

J_{i η} (k)

as the Lyapunov function

V_{i 3} (y (k), k)

of the outer-loop subsystem:

\begin{array}{l} V_{i 3} (y (k), k) = & \sum_{l = 1}^{N_{p 1}} {‖(y_{i η} (k + l| k) - y_{i f} (k + l))‖}_{Q_{i f}}^{2} + \sum_{l = 0}^{N_{c 1} - 1} {‖Δ u_{i v} (k + l| k)‖}_{R_{i 1}}^{2} \\ + \sum_{l = 1}^{N_{p 1}} \sum_{j \in N_{i}} a_{i j} {‖(y_{i η} (k + l| k) - y_{i j} (k + l))‖}_{Q_{i j}}^{2} . \end{array}

(61)

According to the idea of the proof of Theorem 2, the following inequality can be obtained:

\begin{array}{l} V_{i 3} (y (k + 1), k + 1) - V_{i 3} (y (k), k) \leq & - y_{i f} {(k + N_{p 1})}^{T} Q_{i f} y_{i f} (k + N_{p 1}) - {\tilde{y}}_{i η} {(k + 1| k)}^{T} Q_{i f} {\tilde{y}}_{i η} (k + 1| k) \\ - N (N - 1) r_{i j} {(k + N_{p 1})}^{T} Q_{i j} r_{i j} (k + N_{p 1}) - Δ u_{i v} {(k)}^{T} R_{i 1} Δ u_{i v} (k) < 0 . \end{array}

(62)

Next, we set the Lyapunov function of the entire closed-loop system as follows:

V_{i 4} (y (k), k) = V_{i 2} (y (k), k) + V_{i 3} (y (k), k) .

(63)

From inequalities (60) and (62), we have the following:

\begin{array}{l} V_{i 4} (y (k + 1), k + 1) - V_{i 4} (y (k), k) & = V_{i 2} (y (k + 1), k + 1) - V_{i 2} (y (k), k) \\ + V_{i 3} (y (k + 1), k + 1) - V_{i 3} (y (k), k) < 0 \end{array}

(64)

As a result, the entire closed-loop system is asymptotically stable.

4. Simulation

In this section, some simulation analyses are conducted to verify the effectiveness and robustness of the proposed control scheme. A formation system consisting of four AUVs

(N = 4, i = 1, 2, 3, 4)

with a virtual leader (AUV0) is considered. The directed communication topology for the simulation is depicted in Figure 3, the meaning of the arrows is the direction of the communication or information flow between the nodes in the formation network. Initial values for

x_{i}

,

y_{i}

, and

z_{i}

are randomly distributed within the intervals

[10, 40]

,

[0, 30]

, and

[- 10, 0]

, respectively, while the attitude angles

θ_{i}

and

ψ_{i}

lie within the intervals

[- π / 18, π / 18]

and

[0, π]

, respectively. The parameters related to the AUVs are based on previous research [39]. A diamond formation was predefined to facilitate omnidirectional exploration, with the desired formation configuration preset to

r_{1 f} = {[0, 0, 6.5, 0, 0]}^{T}

,

r_{2 f} = {[0, - 7.5, 0, 0, 0]}^{T}

,

r_{3 f} = {[0, 0, - 6.5, 0, 0]}^{T}

, and

r_{4 f} = {[0, 7.5, 0, 0, 0]}^{T}

.

r_{12} = - r_{21} = {[0, 7.5, 6.5, 0, 0]}^{T}

,

r_{13} = - r_{31} = {[0, 0, 13, 0, 0]}^{T}

,

r_{14} = - r_{41} = {[0, - 7.5, 6.5, 0, 0]}^{T}

,

r_{23} = - r_{32} = {[0, - 7.5, 6.5, 0, 0]}^{T}

,

r_{24} = - r_{42} = {[0, - 15, 0, 0, 0]}^{T}

and

r_{34} = - r_{43} = {[0, - 7.5, - 6.5, 0, 0]}^{T}

. The safety distance during the formation construction stage is set as

r_{s} = 3 m

, while the detection distance measured using sonar is set as

r_{d} = 6 m

. To reflect model uncertainties, 20% of the nominal values were taken as model errors, meaning that the parameters for the AUVs in the simulation represented only 80% of the nominal system dynamics. External disturbances were applied to each AUV to evaluate the formation robustness, modeled as follows [34]:

\{\begin{array}{l} τ_{i c u} = 0.1 sign (u_{i}) + 0.2 \sin (0.1 t) N \\ τ_{i c v} = 0.1 sign (v_{i}) + 0.3 \sin (0.3 t) N \\ τ_{i c w} = 0.08 sign (w_{i}) + 0.2 \sin (0.5 t) N \\ τ_{i c q} = 0.02 sign (q_{i}) + 0.1 \sin (0.3 t) N \cdot m \\ τ_{i c r} = 0.05 sign (r_{i}) + 0.1 \sin (0.3 t) N \cdot m \end{array}

(65)

Each control parameter has its settings guidelines: Given the low driving speed of the AUV in this paper, smaller

N_{p 1}

and

N_{p 2}

are intended to be used. During debugging, reduce it if the rapidity is not enough, and increase it if the stability is not good; the selection of

N_{c 1}

and

N_{c 2}

is based on a trade-off between performance and computation [40]; since we value the position tracking performance more than the velocity tracking performance,

Q_{i f}

is set slightly larger than

Q_{i v}

; to weaken the interaction of angles between AUVs, the orientation weight in

Q_{i j}

is set slightly smaller; when tuning

R_{i 1}

and

R_{i 2}

, it can be set very small first, and then increase it slightly if the system is stable and the control variable does not change too drastically [41]. By solving the Lyapunov Equation (11), the relationship between the observer gains

β_{i k}

and

α_{i 1}

, such that

A_{i 1}

and

A_{i 2}

are Hurwitz matrices, can be obtained, and tuned to select the appropriate values [42]; the Laguerre parameter

a_{p}

is adjusted within the constraint interval, and a smaller

M_{p}

is selected to coordinate the number of constraints in the optimization problem, and to make a trade-off between response speed and control complexity [38]. Following the above guidelines, we dealt with the main difficulties in the simulation and selected the parameters that produced the optimal simulation results and listed them in Table 1.

Moreover, based on the actual speed limit of the thruster, we provide the state and input constraints as follows:

Δ U_{i v}^{\max} = - Δ U_{i v}^{\min} = {[0.2, 0.1, 0.2, 0.05, 0.05]}^{T}

,

U_{i v}^{\max} = - U_{i v}^{\min} = {[1.5, 1, 1, 0.05, 0.2]}^{T}

, and

Δ U_{i τ}^{\max} = - Δ U_{i τ}^{\min} = {[50, 50, 100, 5, 5]}^{T}

. To avoid actuator saturation for each AUV, the bounds of force and moment are set as

τ_{i}^{\min} = {[- 200, - 500, - 500, - 7, - 10]}^{T}

and

τ_{i}^{\max} = {[300, 500, 500, 7, 10]}^{T}

.The reference trajectory generated by the virtual leader is a 3-D spiral curve, defined as follows:

\{\begin{array}{l} x_{r} (t) = 30 \cos (0.005 π t) \\ y_{r} (t) = 30 \sin (0.005 π t) \\ z_{r} (t) = - 0.05 t - 3 \end{array}

(66)

To verify the disturbance compensation performance of the proposed FTESO (7), we conducted comparative simulations with the ESO (67) from [43] and the FTESO (68) from [18]. Figure 4 shows the norms of the compound disturbance estimation errors

‖e_{i 2}‖ = ‖{\hat{τ}}_{i d} - τ_{i d}‖

for the four AUVs under the three observers, characterizing insights into transient and steady-state responses. We calculated the settling time of the designed FTESO in the simulation and highlighted it on the plots. It is clear from Figure 4 that our proposed third-order fast FTESO can achieve finite-time stabilization, with the estimation errors converging to a small neighborhood of the origin. And the dynamic convergence speed and estimation accuracy of the proposed FTESO are better than ESO (67) and FTESO (68) with less chattering. This shows the advantages of our approach. Thus, each AUV can accurately compensate for model uncertainties and external disturbances of its corresponding subsystem in finite time.

\{\begin{array}{l} {\dot{\hat{z}}}_{i 1} = {\hat{z}}_{i 2} - β_{i 1} a_{i 1} e_{i 1} + G_{i} (η_{i}, v_{i}) + τ_{i} \\ {\dot{\hat{z}}}_{i 2} = - β_{i 1}^{2} a_{i 2} e_{i 1} \end{array} .

(67)

\{\begin{array}{l} {\dot{\hat{z}}}_{i 1} = {\hat{z}}_{i 2} - β_{i 1} s i g^{3 / 4} (e_{i 1}) + G_{i} (η_{i}, v_{i}) + τ_{i} \\ {\dot{\hat{z}}}_{i 2} = - β_{i 2} s i g^{1 / 2} (e_{i 1}) \end{array} .

(68)

The collision avoidance performance of the AUV formation was tested via a set of comparison experiments with and without collision avoidance constraints based on our proposed scheme. Since the initial positions of the four AUVs are randomly distributed, the risk of collision is increased. The formation trajectory without collision avoidance constraints is shown in Figure 5 (top). Here, the four AUVs track the reference trajectory while keeping the preset shape, but AUV3 and AUV4 collide at 10 s, followed by a collision between AUV1 and AUV2 at 20 s. Specifically, as presented in Figure 6 (top), the relative distance between AUV1 and AUV2 during the formation configuration stage exceeds the safe distance, resulting in a collision. The same situation occurs with AUV3 and AUV4. However, when collision avoidance constraints are considered, the formation trajectory (shown in Figure 5 (bottom)) indicates that the four AUVs can perform the formation tracking task while avoiding collision during the configuration stage. The collision avoidance performance is visualized in Figure 6 (bottom), where the distances among AUVs within the detection zone are always greater than the safe distance, indicating that inter-vehicle collision avoidance can be achieved. Therefore, the proposed control scheme can provide real-time collision avoidance capability for AUV formation maneuvers.

In order to assess the feasibility and superiority of the proposed scheme, we conducted three sets of comparative simulations with the same parameters, constraints, and disturbance settings: (a) the proposed FTESO-based dual closed-loop DMPC with Laguerre function; (b) a FTESO-based dual closed-loop DMPC without Laguerre function; (c) a standard DMPC without FTESO. Figure 7, Figure 8, Figure 9, Figure 10, Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16 plot the tracking performance curves of AUV positional and velocity states under the three schemes. It can be easily observed that, in all scenarios, the four AUVs are able to successfully track the desired state despite the differing tracking errors. In scheme (a), full-state stable tracking is achieved within 200 s. Meanwhile, in scheme (b), the process takes about 300 s, which suggests that the use of the Laguerre function improves both the response speed and control accuracy. Although the standard DMPC scheme (c) can also achieve formation tracking, the settling time of the state variables is longer and accompanied by oscillations due to the uncompensated compound disturbance effects. Compared with the other schemes, our proposed method delivers superior formation tracking control performance.

Figure 17 intuitively presents a 3-D formation trajectory tracking. Combined with Figure 7, Figure 8, Figure 9, Figure 10, Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16, it implies that all three schemes can successfully accomplish formation spiral tracking under the specified input and state constraints. However, when the formation faces harsh compound disturbances, the tracking performance of the controller without disturbance compensation performs poorly, demonstrating a tracking error significantly larger than that of the FTESO-based controller. This is because the compound disturbances cause the AUV to deviate from the desired trajectory. By comparing the results of (a,b), it can be further observed that the proposed control scheme with Laguerre function allows the AUV to form the preset formation more quickly and converge to the desired trajectory more smoothly. This implies a faster response at the onset of the task. Thus, the dual closed-loop structure and Laguerre function enable the AUV formation to track the reference trajectory with better speed and accuracy.

Without loss of generality, Figure 18 shows the actual control forces and moments versus time for AUV1 under the three schemes. Without the benefit of FTESO to compensate for compound disturbances, the fluctuations of the control force and moment are relatively drastic and unstable (Figure 18(c1,c2)). This is attributed to the need for the AUV to significantly rectify the driving forces and moments to more rapidly approach the deviated reference trajectory. Under the proposed scheme, as shown in Figure 18(a1,a2), the AUV forces and moments vary relatively smoothly, which makes the AUV track the trajectory steadily when disturbed. Comparing Figure 18(a1,a2) and Figure 18(b1,b2), the Laguerre-based controller has the fastest control signal response with the smallest amplitude when the disturbances are accurately compensated. This confirms that our proposed scheme (a) provides superior control performance. It is worth noting that the variation of control forces and moments always remains within the prescribed limits. This reflects the ability of the DMPC to handle the actuator saturation effectively, ensuring that the control input for each DOF does not exceed the maximum force provided by the actuator, thus reducing actuator losses.

To differentiate between the computational demands among the three schemes, we recorded the emulator execution times under the same configurations. The detailed simulation times corresponding to Figure 17 are given in Table 2. It can be observed that the actual running time of the standard DMPC system is approximately 43.62 s. In contrast, the system with a dual closed-loop DMPC requires 57.91 s, which is about 32.8% longer than the standard DMPC. This increase is due to the greater complexity of the dual closed-loop structure as opposed to the simpler DMPC structure. Although there is improvement in control efficacy, the execution of the dual closed-loop structure is sacrificed to some extent. However, the proposed system, which employs a Laguerre-based dual closed-loop DMPC, the computation time only requires 11.75 s. This suggests that, despite the inclusion of both the dual closed-loop structure and FTESO, the use of the Laguerre function makes the system solution faster. Thus, the proposed scheme can simultaneously improve the computational speed and control performance.

5. Conclusions

In conclusion, this study presents a FTESO-based distributed dual closed-loop model predictive control scheme for the AUV formation subject to compound disturbances. The designed FTESO can compensate for model uncertainties and external disturbances of each AUV faster and more accurately. Control inputs are determined by solving a constrained DMPC optimization problem based on local information, while avoiding both collisions among AUVs and actuator saturation. The Laguerre orthogonal function is applied to alleviate the heavy computational burden, and the corresponding stability proof is provided. Finally, based on a connected directed topology, simulation results of different schemes are investigated under the same compound disturbances and system constraints. It is confirmed that our proposed scheme provides the best tracking effect and superior active disturbance rejection capability. Control signals show smaller oscillations and enhanced stability. In addition, the computation time of our proposed formation control system, which utilizes the Laguerre function, is reduced by 73.1% and 79.7% compared to the standard DMPC system and the dual closed-loop DMPC system, respectively. This verifies that our proposed scheme can respond quickly to minimize control costs and improve real-time execution and dynamic performance of the system.

The proposed method does not consider the impact of communication burden on AUV formation. Therefore, in future work, we will focus on the control scheme based on the event-triggered mechanisms. Considering the limitations of the optimization accuracy of discrete predictive control, we want to carry out research on continuous predictive control with faster control response. In addition, it is essential to conduct formation obstacle avoidance research and real AUV experiments.

Author Contributions

Conceptualization, M.Z. and Z.Y.; Data curation, J.Z. and L.Y.; Funding acquisition, Z.Y.; Investigation, J.Z. and L.Y.; Methodology, M.Z.; Resources, Z.Y.; Software, L.Y.; Validation, M.Z.; Visualization, J.Z.; Writing—original draft, M.Z.; Writing—review & editing, M.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by National Natural Science Foundation of China under grant No. 52071102, and in part by National Natural Science Foundation of China under grant No. 51679057.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Shi, Y.; Shen, C.; Fang, H.; Li, H. Advanced control in marine mechatronic systems: A survey. IEEE ASME Trans. Mechatron. 2017, 22, 1121–1131. [Google Scholar] [CrossRef]
Liu, G.; Chen, L.; Liu, K.; Luo, Y. A swarm of unmanned vehicles in the shallow ocean: A survey. Neurocomputing 2023, 531, 74–86. [Google Scholar] [CrossRef]
Yu, H.; Zeng, Z.; Guo, C. Coordinated formation control of discrete-time autonomous underwater vehicles under alterable communication topology with time-varying delay. J. Mar. Sci. Eng. 2022, 10, 712. [Google Scholar] [CrossRef]
Chen, Y.L.; Ma, X.W.; Bai, G.Q.; Sha, Y.; Liu, J. Multi-autonomous underwater vehicle formation control and cluster search using a fusion control strategy at complex underwater environment. Ocean Eng. 2020, 216, 108048. [Google Scholar] [CrossRef]
Zhen, Q.; Wan, L.; Li, Y.; Jiang, D. Formation control of a multi-AUVs system based on virtual structure and artificial potential field on SE(3). Ocean Eng. 2022, 253, 111148. [Google Scholar] [CrossRef]
Wang, J.; Wang, C.; Wei, Y.; Zhang, C. Sliding mode based neural adaptive formation control of underactuated AUVs with leader-follower strategy. Appl. Ocean Res. 2020, 94, 101971. [Google Scholar] [CrossRef]
He, X.; Geng, Z. Globally convergent leaderless formation control for unicycle-type mobile robots. IET Contr. Theory Appl. 2020, 14, 2651–2662. [Google Scholar] [CrossRef]
Munir, M.; Khan, Q.; Ullah, S.; Syeda, T.M.; Algethami, A.A. Control Design for Uncertain Higher-Order Networked Nonlinear Systems via an Arbitrary Order Finite-Time Sliding Mode Control Law. Sensors 2022, 22, 2748. [Google Scholar] [CrossRef]
Ullah, S.; Khan, Q.; Mehmood, A.; Kirmani, S.A.M.; Mechali, O. Neuro-adaptive fast integral terminal sliding mode control design with variable gain robust exact differentiator for under-actuated quadcopter UAV. ISA Trans. 2022, 120, 293–304. [Google Scholar] [CrossRef]
Zhang, W.; Wu, W.; Li, Z.; Du, X.; Yan, Z. Three-Dimensional Trajectory Tracking of AUV Based on Nonsingular Terminal Sliding Mode and Active Disturbance Rejection Decoupling Control. J. Mar. Sci. Eng. 2023, 11, 959. [Google Scholar] [CrossRef]
Cui, R.; Chen, L.; Yang, C.; Chen, M. Extended state observer-based integral sliding mode control for an underwater robot with unknown disturbances and uncertain nonlinearities. IEEE Trans. Ind. Electron. 2017, 64, 6785–6795. [Google Scholar] [CrossRef]
Ding, S.; Chen, W.H.; Mei, K.; Murray-Smith, D.J. Disturbance observer design for nonlinear systems represented by input-output models. IEEE Trans. Ind. Electron. 2019, 67, 1222–1232. [Google Scholar] [CrossRef]
Liang, X.; Qu, X.; Wan, L.; Ma, Q. Three-dimensional path following of an underactuated AUV based on fuzzy backstepping sliding mode control. Int. J. Fuzzy Syst. 2018, 20, 640–649. [Google Scholar] [CrossRef]
Zhang, G.; Yin, S.; Huang, C.; Zhang, W. Intervehicle Security-Based Robust Neural Formation Control for Multiple USVs via APS Guidance. J. Mar. Sci. Eng. 2023, 11, 1020. [Google Scholar] [CrossRef]
Han, J. From PID to active disturbance rejection control. IEEE Trans. Ind. Electron. 2009, 56, 900–906. [Google Scholar] [CrossRef]
Lei, M.; Li, Y.; Pang, S. Extended state observer-based composite-system control for trajectory tracking of underactuated AUVs. Appl. Ocean Res. 2021, 112, 102694. [Google Scholar] [CrossRef]
Nie, J.; Wang, H.; Lu, X.; Lin, X.; Sheng, C.; Zhang, Z.; Song, S. Finite-time output feedback path following control of underactuated MSV based on FTESO. Ocean Eng. 2021, 224, 108660. [Google Scholar] [CrossRef]
Wang, N.; Zhu, Z.; Qin, H.; Deng, Z.; Sun, Y. Finite-time extended state observer-based exact tracking control of an unmanned surface vehicle. Int. J. Robust Nonlinear Control. 2021, 31, 1704–1719. [Google Scholar] [CrossRef]
Sankaranarayanan, V.N.; Yadav, R.D.; Swayampakula, R.K.; Ganguly, S.; Roy, S. Robustifying payload carrying operations for quadrotors under time-varying state constraints and uncertainty. IEEE Robot. Autom. Lett. 2022, 7, 4885–4892. [Google Scholar] [CrossRef]
Chu, Z.; Xiang, X.; Zhu, D.; Luo, C.; Xie, D. Adaptive fuzzy sliding mode diving control for autonomous underwater vehicle with input constraint. Int. J. Fuzzy Syst. 2018, 20, 1460–1469. [Google Scholar] [CrossRef]
Li, S.; Wang, X. Finite-time consensus and collision avoidance control algorithms for multiple AUVs. Automatica 2013, 49, 3359–3367. [Google Scholar] [CrossRef]
Xu, J.; Huang, F.; Wu, D.; Cui, Y.; Yan, Z.; Du, X. A learning method for AUV collision avoidance through deep reinforcement learning. Ocean Eng. 2022, 260, 112038. [Google Scholar] [CrossRef]
Zhang, Y.; Liu, X.; Luo, M.; Yang, C. MPC-based 3-D trajectory tracking for an autonomous underwater vehicle with constraints in complex ocean environments. Ocean Eng. 2019, 189, 106309. [Google Scholar] [CrossRef]
Arcos-Legarda, J.; Gutiérrez, Á. Robust Model Predictive Control Based on Active Disturbance Rejection Control for a Robotic Autonomous Underwater Vehicle. J. Mar. Sci. Eng. 2023, 11, 929. [Google Scholar] [CrossRef]
Zheng, Y.; Li, S.E.; Li, K.; Borrelli, F.; Hedrick, J.K. Distributed model predictive control for heterogeneous vehicle platoons under unidirectional topologies. IEEE Trans. Control Syst. Technol. 2016, 25, 899–910. [Google Scholar] [CrossRef]
Wei, H.; Shen, C.; Shi, Y. Distributed Lyapunov-based model predictive formation tracking control for autonomous underwater vehicles subject to disturbances. IEEE Trans. Syst. Man Cybern. Syst. 2019, 51, 5198–5208. [Google Scholar] [CrossRef]
Shen, C.; Shi, Y. Distributed implementation of nonlinear model predictive control for AUV trajectory tracking. Automatica 2020, 115, 108863. [Google Scholar] [CrossRef]
Li, B.; Hu, Q.; Yu, Y.; Ma, G. Observer-based fault-tolerant attitude control for rigid spacecraft. IEEE Trans. Aerosp. Electron. Syst. 2017, 53, 2572–2582. [Google Scholar] [CrossRef]
Wei, H.; Sun, Q.; Chen, J.; Shi, Y. Robust distributed model predictive platooning control for heterogeneous autonomous surface vehicles. Control Eng. Pract. 2021, 107, 104655. [Google Scholar] [CrossRef]
Zhao, R.; Miao, M.; Lu, J.; Wang, Y.; Li, D. Formation control of multiple underwater robots based on ADMM distributed model predictive control. Ocean Eng. 2022, 257, 111585. [Google Scholar] [CrossRef]
Hu, Q.; Jiang, B. Continuous finite-time attitude control for rigid spacecraft based on angular velocity observer. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 1082–1092. [Google Scholar] [CrossRef]
Yan, Z.; Gong, P.; Zhang, W.; Li, Z.; Teng, Y. Autonomous underwater vehicle vision guided docking experiments based on L-shaped light array. IEEE Access 2019, 7, 72567–72576. [Google Scholar] [CrossRef]
Fossen, T.I. Handbook of Marine Craft Hydrodynamics and Motion Control; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
Xu, J.; Cui, Y.; Xing, W.; Huang, F.; Yan, Z.; Wu, D.; Chen, T. Anti-disturbance fault-tolerant formation containment control for multiple autonomous underwater vehicles with actuator faults. Ocean Eng. 2022, 266, 112924. [Google Scholar] [CrossRef]
Tao, T.; Roy, S.; De Schutter, B.; Baldi, S. Distributed Adaptive Synchronization in Euler Lagrange Networks with Uncertain Interconnections. IEEE Trans. Autom. Control, 2013; online ahead of print. [Google Scholar]
Roy, S.; Baldi, S.; Fridman, L.M. On adaptive sliding mode control without a priori bounded uncertainty. Automatica 2020, 111, 108650. [Google Scholar] [CrossRef]
Kong, S.; Sun, J.; Wang, J.; Zhou, Z.; Shao, J.; Yu, J. Piecewise Compensation Model Predictive Governor Combined with Conditional Disturbance Negation for Underactuated AUV Tracking Control. IEEE Trans. Ind. Electron. 2022, 70, 6191–6200. [Google Scholar] [CrossRef]
Wang, L. Continuous time model predictive control design using orthonormal functions. Int. J. Control. 2001, 74, 1588–1600. [Google Scholar] [CrossRef]
Yan, Z.; Wang, M.; Xu, J. Integrated guidance and control strategy for homing of unmanned underwater vehicles. J. Frankl. Inst.-Eng. Appl. Math. 2019, 356, 3831–3848. [Google Scholar] [CrossRef]
Hosen, M.A.; Hussain, M.A.; Mjalli, F.S. Control of polystyrene batch reactors using neural network based model predictive control (NNMPC): An experimental investigation. Control Eng. Pract. 2011, 19, 454–467. [Google Scholar] [CrossRef]
Cortes, P.; Kouro, S.; La Rocca, B.; Vargas, R.; Rodriguez, J.; Leon, J.I.; Vazquez, S.; Franquelo, L.G. Guidelines for weighting factors design in Model Predictive Control of power converters and drives. In Proceedings of the IEEE International Conference on Industrial Technology, Churchill, VIC, Australia, 10–13 February 2009; pp. 1–7. [Google Scholar]
Zhang, C.; Zhang, G.; Dong, Q. Multi-variable finite-time observer-based adaptive-gain sliding mode control for fixed-wing UAV. IET Contr. Theory Appl. 2021, 15, 223–247. [Google Scholar] [CrossRef]
Ma, C.; Tang, Y.; Lei, M.; Jiang, D.; Luo, W. Trajectory tracking control for autonomous underwater vehicle with disturbances and input saturation based on contraction theory. Ocean Eng. 2022, 266, 112731. [Google Scholar] [CrossRef]

Figure 1. AUV coordinate system.

Figure 2. The FTESO-based DMPC dual closed-loop structure for the AUV formation.

Figure 3. Structure of communication topology.

Figure 4. Compound disturbance estimation errors

e_{i 2}

of the ith AUV.

Figure 4. Compound disturbance estimation errors

e_{i 2}

of the ith AUV.

Figure 5. 3-D formation trajectories without (top) and with (bottom) collision avoidance constraints.

Figure 6. Relative distances among AUVs without (top) and with (bottom) collision avoidance constraints.