Distributed Lyapunov-Based Model Predictive Control for AUV Formation Systems with Multiple Constraints

Yan, Zheping; Zhang, Mingyao; Zhou, Jiajia; Yue, Lidong

doi:10.3390/jmse12030363

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China

* Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2024, 12(3), 363; https://doi.org/10.3390/jmse12030363

Submission received: 8 January 2024 / Revised: 11 February 2024 / Accepted: 18 February 2024 / Published: 20 February 2024

(This article belongs to the Special Issue Marine Autonomous Vehicles: Design, Test and Operation)

Abstract

This paper focuses on the formation tracking issue of autonomous underwater vehicles (AUVs) subject to multiple constraints in three-dimensional space. We developed a novel distributed Lyapunov-based model predictive controller (DLMPC) with a fast finite-time extended state observer (FFTESO). Initially, the external disturbances and internal uncertainties of each AUV were precisely compensated using the designed FFTESO. Subsequently, we proposed DLMPC-based position tracking and velocity tracking controllers, which solved an online optimization problem to determine optimal velocities and control forces. This hierarchical framework effectively managed system constraints, such as state constraints and actuator saturation. Additionally, the Lyapunov-based backstepping control law was applied to construct stability constraints in the distributed optimization problem, ensuring the recursive feasibility and closed-loop system stability of the proposed scheme. Sufficient conditions and attraction regions to ensure stability were explicitly provided. Finally, the simulation results demonstrated that the proposed method improved both the convergence speed and tracking accuracy by at least 30% compared to other methods.

Keywords:

autonomous underwater vehicles; finite-time extended state observer; distributed Lyapunov-based model predictive control; formation trajectory tracking; multiple constraints

1. Introduction

Due to their critical role in undersea exploration and hydrographic observation, autonomous underwater vehicles (AUVs) have emerged as the most effective tools for ocean development thus far [1]. In contrast to the limitations of individual AUVs, multi-AUV systems offer a broader detection range, heightened operational efficiency, and enhanced redundancy performance. Given these significant advantages, research on AUV formation control has garnered increasing attention [2]. The primary challenge is to ensure the stability of the formation motion in intricate underwater environments with multiple constraints [3]. Addressing this issue, researchers have undertaken extensive studies in recent years. A behavioral decision-making-based path planning method for AUVs was proposed in [4], but the built-in model creates challenges for mathematical analysis. The leader–follower method, introduced for AUV formation tracking control in [5], confronts challenges related to poor robustness and fault tolerance due to the presence of a designated leader. Consequently, there has been a growing focus on the more promising leaderless formation approach [6]. In [7], a virtual structure-based method for AUV formation control was developed, but its flexibility and applicability are constrained. Despite substantial research in this domain, crafting a control scheme that provides optimal control performance while accommodating multiple constraints remains a primary research objective for multi-AUV systems.

The most typical constraints are the various disturbances encountered by AUVs. These include internal disturbances within the system, such as model uncertainties caused by coupled dynamics and varying system parameters [8]. Another significant factor is the unpredictable external disturbances caused by currents in the actual ocean environment [9]. Additionally, system constraints impose considerable challenges on AUV formation motion. For instance, the cruising speed and attitude angle of an AUV are subject to specific limitations [10]. These intrinsic state constraints place substantial demands on the design of the controller. Furthermore, actuator saturation constraints are a pertinent concern in real applications, stemming from limitations in the active drive force due to the physical characteristics of the actuator. If the control inputs violate this limit, it will degrade the control performance of the system [11]. The traditional control methods in the aforementioned studies have difficulty in achieving optimal control performance. In contrast, model predictive control (MPC) offers notable advantages in explicitly managing system constraints and optimizing performance [12]. It has found widespread application in various control systems subject to multiple constraints. In terms of AUV formation control, distributed model predictive control (DMPC) has garnered increased attention among researchers. As a result, there is an urgent need to provide an AUV formation control strategy that guarantees closed-loop system stability under external disturbances, internal model uncertainties, and system constraints.

In response to the above facts and challenges, we propose a novel FFTESO-based hierarchical DLMPC strategy for AUV formation tracking systems subject to multiple constraints in the complex ocean environment. The scheme precisely compensates for lumped disturbances while concurrently accounting for system constraints, such as actuator saturation and state constraints. The Lyapunov-based backstepping method is employed to ensure closed-loop stability.

The subsequent sections of this paper are organized as follows. Section 2 overviews related works on responses to multiple constraints and MPC. Section 3 presents the AUV modeling and problem formulation. Section 4 proposes the methodology, including the design of the FFTESO and DLMPC-based hierarchical tracking controllers, as well as the theoretical analysis of the system stability. In Section 5, the comparative simulation results are demonstrated. Finally, conclusions are drawn in Section 6.

2. Related Work

In recent decades, diverse advanced methods have been explored to enhance the robustness and adaptability of formation control systems against disturbances. These encompass disturbance observers [13], adaptive control methods [14], and strategies involving neural networks [15]. Notably, the extended state observer (ESO) stands out because it does not require precise information about the controlled object. In recent years, the ESO technique proposed in [16] has demonstrated potential for disturbance compensation. It treats internal uncertainties and external environmental disturbances as lumped disturbances, extending them into a new state. In [17], an output feedback motion control method employing a high-gain ESO was developed. This method effectively compensates for measurement errors, external disturbances, and model uncertainties in remotely operated vehicles. Additionally, Ref. [18] introduced AUV tracking controllers based on a generalized ESO and a harmonic ESO, which are intended to ensure path tracking even in the presence of lumped disturbances. Despite the diversity of ESOs developed by researchers, their capability is often limited to ensuring asymptotic convergence of the observation error, without bounding the convergence time. To achieve higher estimation accuracy in complex environments, there is growing interest in finite-time ESOs (FTESOs).

The design strategy for conventional FTESOs is presented in [19] and the estimation of the convergence time under different cases is given for the first time. A robust fault-tolerant controller based on FTESO was designed in [20] to estimate the lumped disturbances of spacecraft. The control algorithm incorporates nonsingular terminal sliding mode and super-twisting methods. The authors of [21] investigated a safety control based on the FTESO adaptive neural network for unmanned aerial vehicles. The double-power FTESO was utilized to compensate for lumped disturbances. Nevertheless, there is room for further optimization of the observer’s structure to enhance compensation performance.

To cope with actuator saturation and state constraints, an adaptive energy-saving trajectory tracking control strategy for AUVs was proposed in [22]. A compensator based on a radial basis function neural network was used to address actuator saturation and multi-objective optimization. In [23], an adaptive super-twisting algorithm-based sliding mode controller (ASTASMC) was introduced for the formation control problem in a multi-AUV recovery system. This controller utilizes a robust adaptive law to estimate uncertain hydrodynamic parameters and unknown environmental disturbances in real time.

DMPC addresses constrained optimization problems based on information from the AUV itself and its neighboring vehicles. In [24], a DMPC-based formation tracking method for AUV systems subject to input constraints was introduced. However, this approach considered only the kinematics of the AUV and did not account for a realistic AUV model. Addressing AUV formation control under compound disturbances, Ref. [25] proposed an FTESO-based dual closed-loop DMPC scheme. However, stability analysis for optimal control problems within a finite horizon has proved challenging, often requiring the addition of suitable terminal constraint sets or the selection of sufficiently large prediction horizons [26]. To overcome these limitations, a Lyapunov-based MPC (LMPC) method was proposed in [27], ensuring the stability of the control system through the construction of Lyapunov contraction constraints. This method inherits the stability and robustness of the Lyapunov-based control law, offering valuable insights for the design of AUV formation controllers. Based on stochastic Lyapunov feedback control strategies, Ref. [28] developed a Lyapunov-based MPC method for nonlinear systems subject to stochastic uncertainties. The authors of [29] studied an iterative DMPC method for large-scale nonlinear systems subject to asynchronous and delayed state feedback. The stability constraints of the iterative DMPC were formulated by utilizing the Lyapunov-based control technique. To the best of our knowledge, a Lyapunov-based distributed predictive control law has been proposed only in [30], to solve the three degrees-of-freedom (DOF) AUV formation tracking issue under time-varying disturbances. However, its applicability is limited to planar motion, and it does not consider model uncertainties. The state-of-the-art methods related to DLMPC are summarized in Table 1.

With respect to existing works, the principal contributions of this scheme include:

(1) Compared with the existing DLMPC method [30], we have considered the lumped disturbances and improved the control structure. A hierarchical structure is employed, comprising a position controller and a velocity controller, aimed at generating the desired velocity and control force. This adaptation not only mitigates the challenge of accessing the optimal solution but also augments the controllability of the velocity, thereby improving the spatial tracking accuracy of the formation system.
(2) Compared to the FTESO utilized in [19,31], we have enhanced convergence speed by augmenting linear terms. This enables faster compensation of the disturbances for online updating of the prediction model. It not only enhances the estimation accuracy and convergence speed but also effectively mitigates the fluctuations of lumped disturbances. Hence, the robustness of the multi-AUV system is enhanced.
(3) The Lyapunov-based backstepping control law is utilized to institute stability constraints within the DMPC problem. This choice ensures the recursive feasibility of the control algorithm and the stability of the closed-loop system. The conditions and attraction regions sufficient to ensure stability are explicitly given. The control performance of formation tracking is substantially improved.

3. Preliminaries

In this section, we present the AUV model and problem formulation, where a new version of the dynamic equation is derived based on the task constraints.

3.1. AUV Modeling

This article adopts a fully actuated torpedo-type AUV from the literature [32] that aligns with the task objectives. As shown in Figure 1, a 6DOF AUV is typically described by two reference frames: an earth-fixed frame

\{E\}

and a body-fixed frame

\{B\}

. Since this AUV can be regarded as a highly metacentric stable vehicle with self-stable roll motion, we ignore the effect of roll; that is, roll angle

ϕ_{i} = 0

, roll angular velocity

p_{i} = 0

, and the spatial motion of the AUV is regarded as a 5DOF motion process. The kinematics and dynamics of the ith AUV are expressed as [33]:

{\dot{η}}_{i} = J_{i} (η_{i}) v_{i}

(1)

M_{i} {\dot{v}}_{i} + C_{i} (v_{i}) v_{i} + D_{i} (v_{i}) v_{i} + g_{i} (η_{i}) = τ_{i} + τ_{i c}

(2)

where

i = 1, 2, \dots, N

,

η_{i} = {[x_{i}, y_{i}, z_{i}, θ_{i}, ψ_{i}]}^{T} \in ℝ^{5}

denotes the position and orientation of the AUV, and

v_{i} = {[u_{i}, v_{i}, w_{i}, q_{i}, r_{i}]}^{T} \in ℝ^{5}

denotes the velocity states of the AUV.

J_{i} (η_{i})

is a rotation transformation matrix from the body-fixed frame to the earth-fixed frame and is assumed to be invertible (i.e.,

|θ_{i}| < π / 2

), expressed as:

J_{i} (η_{i}) = [\begin{matrix} \cos ψ_{i} \cos θ_{i} & - \sin ψ_{i} & \cos ψ_{i} \sin θ_{i} & 0 & 0 \\ \sin ψ_{i} \cos θ_{i} & \cos ψ_{i} & \sin ψ_{i} \sin θ_{i} & 0 & 0 \\ - \sin θ_{i} & 0 & \cos θ_{i} & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 / \cos θ_{i} \end{matrix}]

(3)

M_{i}

denotes the inertial matrix.

C_{i} (v_{i})

and

D_{i} (v_{i})

represent the Coriolis-centripetal matrix and the hydrodynamic damping matrix, respectively. The gravitational and buoyancy forces of this AUV balance each other, so the restoring force

g_{i} (η_{i})

is approximated to be zero.

τ_{i} = {[τ_{i u}, τ_{i v}, τ_{i w}, τ_{i q}, τ_{i r}]}^{T}

denotes the control force and moment, and

τ_{i c} = {[τ_{i c u}, τ_{i c v}, τ_{i c w}, τ_{i c q}, τ_{i c r}]}^{T}

represents the time-varying unknown external disturbance. Specific expressions for these matrices are given in [34].
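The kinematic relation (1) with the transformation matrix (3) can be sketched numerically as follows. This is a minimal illustration, not the authors' code: the function names `rotation_matrix` and `kinematics`, the singularity tolerance, and the sample pose and velocity values are assumptions introduced here.

```python
import numpy as np

def rotation_matrix(eta):
    """Return the 5x5 transformation J(eta) of Eq. (3) for eta = [x, y, z, theta, psi]."""
    theta, psi = eta[3], eta[4]
    ct, st = np.cos(theta), np.sin(theta)
    cp, sp = np.cos(psi), np.sin(psi)
    if abs(ct) < 1e-9:  # tolerance is an illustrative choice
        raise ValueError("J is singular at |theta| = pi/2")
    return np.array([
        [cp * ct, -sp, cp * st, 0.0, 0.0],
        [sp * ct,  cp, sp * st, 0.0, 0.0],
        [-st,     0.0, ct,      0.0, 0.0],
        [0.0,     0.0, 0.0,     1.0, 0.0],
        [0.0,     0.0, 0.0,     0.0, 1.0 / ct],
    ])

def kinematics(eta, nu):
    """Earth-frame pose rate from Eq. (1): eta_dot = J(eta) @ nu."""
    return rotation_matrix(eta) @ nu

# Hypothetical sample: level vehicle heading along +y, 1 m/s surge, 0.1 rad/s yaw rate.
eta = np.array([0.0, 0.0, -5.0, 0.0, np.pi / 2])
nu = np.array([1.0, 0.0, 0.0, 0.0, 0.1])
eta_dot = kinematics(eta, nu)
```

With zero pitch and a heading of π/2, the surge velocity maps entirely onto the earth-fixed y axis, and the yaw rate passes through unchanged because 1 / cos θ = 1.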

In practical applications, acquiring precise hydrodynamic coefficients for the AUV model might be challenging. It can generally be assumed that the coefficients are subject to certain perturbations ranging from −20% to 20% [35]. Thus, the parameter matrices are divided into

M_{i} = M_{i}^{*} + Δ M_{i}

,

C_{i} (v_{i}) = C_{i}^{*} (v_{i}) + Δ C_{i} (v_{i})

, and

D_{i} (v_{i}) = D_{i}^{*} (v_{i}) + Δ D_{i} (v_{i})

, where

{(\cdot)}_{i}^{*}

represents the nominal part that can be determined by the computational fluid dynamics (CFD) [36].

Δ {(\cdot)}_{i}

denotes the difference between the actual and nominal parts, i.e., model uncertainties.

Then, the dynamic model (2) of the ith AUV with the above uncertainties is rewritten as:

M_{i}^{*} {\dot{v}}_{i} + C_{i}^{*} (v_{i}) v_{i} + D_{i}^{*} (v_{i}) v_{i} = τ_{i} + τ_{i d}

(4)

where

τ_{i d} = τ_{i c} - Δ M_{i} {\dot{v}}_{i} - Δ C_{i} (v_{i}) v_{i} - Δ D_{i} (v_{i}) v_{i}

is considered as the lumped disturbance, including environmental disturbances and model uncertainties. In general, external disturbances exhibit limited energy and periodic variations. The model uncertainties are limited by the actual state and physical features of the AUV. Therefore, we make the following rational assumption:

Assumption 1

([37]). The external disturbance

τ_{i c}

and its first derivative

{\dot{τ}}_{i c}

are bounded, and the model uncertainties

Δ M_{i}

,

Δ C_{i}

, and

Δ D_{i}

are unknown and bounded. Therefore, the lumped disturbance

τ_{i d}

is bounded and satisfies

‖τ_{i d}‖ \leq {\bar{τ}}_{i d}

,

{\bar{τ}}_{i d} \in ℝ^{+}

.
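For reference, the step from (2) to (4) is a direct substitution of the nominal-plus-uncertainty decomposition, taking the restoring force as zero as stated above. The grouping below is a worked restatement of the text and introduces no new assumptions:

```latex
\begin{aligned}
(M_i^{*} + \Delta M_i)\,\dot{v}_i + \bigl(C_i^{*}(v_i) + \Delta C_i(v_i)\bigr) v_i + \bigl(D_i^{*}(v_i) + \Delta D_i(v_i)\bigr) v_i &= \tau_i + \tau_{ic} \\
M_i^{*}\dot{v}_i + C_i^{*}(v_i)\, v_i + D_i^{*}(v_i)\, v_i &= \tau_i + \underbrace{\tau_{ic} - \Delta M_i \dot{v}_i - \Delta C_i(v_i)\, v_i - \Delta D_i(v_i)\, v_i}_{\tau_{id}}
\end{aligned}
```

Because the lumped term depends on the velocity and acceleration of the vehicle, its boundedness is an assumption about the operating envelope rather than a consequence of the model, which is why it is stated separately in Assumption 1.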

3.2. Problem Formulation

To better characterize the information exchange between the AUVs, we adopted a directed topology graph

G = \{V, ε\}

to describe the formation communication. The node set

V = \{V_{1}, V_{2}, \dots, V_{N}\}

denotes the N AUVs, and an edge set

ε \subseteq V \times V

describes the information interaction from the node

V_{i}

to the node

V_{j}

. Define

A = [a_{i j}]

as an adjacency matrix, where

a_{i j}

denotes the connection weight and

a_{i j} = 1

if

(i, j) \in ε

, while

a_{i j} = 0

if

(i, j) \notin ε

. Presume that the ith AUV has the capability to receive local information from the virtual leader and the neighbors

N_{i} = \{j \in V : (j, i) \in ε\}

[38].
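The adjacency matrix and neighbor set defined above can be illustrated with a small sketch. The three-vehicle edge set and the helper names `adjacency_matrix` and `neighbors` are hypothetical and chosen only for illustration.

```python
import numpy as np

def adjacency_matrix(n, edges):
    """a_ij = 1 if (i, j) is in the edge set, else 0 (1-based node labels)."""
    A = np.zeros((n, n), dtype=int)
    for i, j in edges:
        A[i - 1, j - 1] = 1
    return A

def neighbors(i, edges):
    """N_i = {j : (j, i) in edges}: the vehicles whose information AUV i receives."""
    return sorted(j for (j, k) in edges if k == i)

# Hypothetical directed communication graph for three AUVs.
edges = {(1, 2), (2, 3), (3, 1), (1, 3)}
A = adjacency_matrix(3, edges)
N3 = neighbors(3, edges)
```

In this example AUV 3 receives information from AUVs 1 and 2, while AUV 1 receives only from AUV 3, which shows that the directed graph need not be symmetric.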

Next, we formulate the AUV formation tracking control problem. In order for the AUV formation to track the reference trajectory

η_{r}

smoothly while maintaining the prescribed shape, the ith AUV is driven to satisfy: (1) Tracking:

\lim_{t \to \infty} ‖η_{i} (t) - η_{r} (t)‖ = d_{i r}

;

\lim_{t \to \infty} ‖v_{i} (t) - v_{i d} (t)‖ = 0

. (2) Formation:

\lim_{t \to \infty} ‖η_{i} (t) - η_{j} (t)‖ = d_{i j}

, with

d_{i r}

denoting the formation configuration vector and

d_{i j}

denoting the relative distance vector between the ith AUV and the jth AUV.

4. Methodology

In this section, to address the AUV formation control issue subject to complex constraints, we develop a novel distributed Lyapunov-based model predictive tracking control scheme. First, because the lumped disturbances cannot be measured directly, a fast FTESO is devised to estimate and compensate for them. Then, we adapt the structure of the existing DLMPC method, adopting a hierarchical design of position and velocity tracking controllers to handle the other constraints. Stability constraints are constructed based on Lyapunov theory. Finally, we analyze the feasibility and stability of the AUV formation system.

4.1. Design and Stability Analysis of FFTESO

Given the efficacy of the extended state observation method in estimating disturbances and model uncertainties, we propose a novel FFTESO to concurrently compensate for the lumped disturbances within the AUV formation.

To facilitate the FFTESO design, the AUV dynamics model (4) with respect to the earth-fixed frame can be further transformed as:

{\ddot{η}}_{i} = - M_{i η}^{- 1} [C_{i η} {\dot{η}}_{i} + D_{i η} {\dot{η}}_{i} - J_{i}^{- T} (η_{i}) (τ_{i} + τ_{i d})]

(5)

where

C_{i η} = J_{i}^{- T} (η_{i}) [C_{i}^{*} (v_{i}) - M_{i}^{*} J_{i}^{- 1} (η_{i}) {\dot{J}}_{i} (η_{i})] J_{i}^{- 1} (η_{i})

,

D_{i η} = J_{i}^{- T} (η_{i}) D_{i}^{*} (v_{i}) J_{i}^{- 1} (η_{i})

,

M_{i η} = J_{i}^{- T} (η_{i}) M_{i}^{*} J_{i}^{- 1} (η_{i})

. Then, define the auxiliary variables

μ_{i} = {\dot{η}}_{i} = J_{i} (η_{i}) v_{i}

,

f_{i} (μ_{i}) μ_{i} = M_{i η}^{- 1} (C_{i η} + D_{i η}) {\dot{η}}_{i}

,

d_{i} = M_{i η}^{- 1} J_{i}^{- T} (η_{i}) τ_{i d}

, so the AUV’s system model (1) and (2) are transformed to

\{\begin{array}{l} {\dot{η}}_{i} = μ_{i} \\ {\dot{μ}}_{i} = - f_{i} (μ_{i}) μ_{i} + M_{i η}^{- 1} J_{i}^{- T} τ_{i} + d_{i} \end{array} .

(6)

Next, we define some new variables

z_{i 1} = η_{i}

,

z_{i 2} = μ_{i}

, and the lumped disturbances are regarded as an extended state

z_{i 3}

, denoted as

z_{i 3} = d_{i}

with

{\dot{z}}_{i 3} = σ_{i}

. Under Assumption 1,

d_{i}

is bounded and continuously differentiable as well, and the components of its first derivative

σ_{i}

satisfy

|σ_{i p}| \leq {\bar{σ}}_{i}

,

p = 1, 2, \dots, 5

, where

{\bar{σ}}_{i}

is an unknown upper bound. Afterward, the mathematical model of the ith AUV can be extended as:

\{\begin{array}{l} {\dot{z}}_{i 1} = z_{i 2} \\ {\dot{z}}_{i 2} = - f_{i} (z_{i 2}) z_{i 2} + M_{i η}^{- 1} J_{i}^{- T} τ_{i} + z_{i 3} \\ {\dot{z}}_{i 3} = σ_{i} \end{array} .

(7)

Denote

{\hat{z}}_{i 1}

,

{\hat{z}}_{i 2}

, and

{\hat{z}}_{i 3}

as the estimation of states

z_{i 1}

,

z_{i 2}

and

z_{i 3}

in the extended system (7), and

e_{i 1} = {\hat{z}}_{i 1} - z_{i 1}

,

e_{i 2} = {\hat{z}}_{i 2} - z_{i 2}

,

e_{i 3} = {\hat{z}}_{i 3} - z_{i 3}

as the estimation errors of the position, velocity, and lumped disturbance, respectively. Then, the FFTESO for the ith AUV is designed as:

\{\begin{array}{l} {\dot{\hat{z}}}_{i 1} = {\hat{z}}_{i 2} - β_{i 1} ({\lceil e_{i 1} \rfloor}^{α_{i 1}} + e_{i 1}) \\ {\dot{\hat{z}}}_{i 2} = {\hat{z}}_{i 3} - β_{i 2} ({\lceil e_{i 1} \rfloor}^{α_{i 2}} + γ_{i} e_{i 1}) - f_{i} ({\hat{z}}_{i 2}) {\hat{z}}_{i 2} + M_{i η}^{- 1} J_{i}^{- T} τ_{i} \\ {\dot{\hat{z}}}_{i 3} = - β_{i 3} ({\lceil e_{i 1} \rfloor}^{α_{i 3}} + γ_{i}^{2} e_{i 1}) \end{array}

(8)

with the observer gains satisfying

β_{i k} > 0

,

k = 1, 2, 3

,

α_{i 1} \in (2 / 3, 1)

and

α_{i 2} = 2 α_{i 1} - 1

,

α_{i 3} = 3 α_{i 1} - 2

,

γ_{i} = {|e_{i 1}|}^{α_{i 1} - 1}

,

{\lceil e_{i 1} \rfloor}^{α_{i k}} = sign (e_{i 1}) {|e_{i 1}|}^{α_{i k}}

with

{|e_{i 1}|}^{α_{i k}} = {[{|e_{i 11}|}^{α_{i k}}, {|e_{i 12}|}^{α_{i k}}, \dots, {|e_{i 1 n}|}^{α_{i k}}]}^{T}

. Based on the model (7) and the designed FFTESO (8), the observation error dynamics are:

\{\begin{array}{l} {\dot{e}}_{i 1} = e_{i 2} - β_{i 1} ({\lceil e_{i 1} \rfloor}^{α_{i 1}} + e_{i 1}) \\ {\dot{e}}_{i 2} = e_{i 3} - β_{i 2} ({\lceil e_{i 1} \rfloor}^{α_{i 2}} + γ_{i} e_{i 1}) + f_{i} (z_{i 2}) z_{i 2} - f_{i} ({\hat{z}}_{i 2}) {\hat{z}}_{i 2} \\ {\dot{e}}_{i 3} = - β_{i 3} ({\lceil e_{i 1} \rfloor}^{α_{i 3}} + γ_{i}^{2} e_{i 1}) - σ_{i} \end{array} .

(9)

The convergence analysis of the FFTESO (8) is presented in the following theorem.

Theorem 1.

Consider the formation control system of the AUV model (7) under Assumption 1. If the FFTESO is designed as in (8) to satisfy the specified observer gain constraints, then the estimation errors

e_{i} = {[e_{i 1}^{T}, e_{i 2}^{T}, e_{i 3}^{T}]}^{T}

will converge to the stability region

Ω_{i}

within a finite time

T_{i f}

.

Proof.

See Appendix A. □
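The right-hand side of the observer (8) can be sketched as a function that an ODE integrator would call at each step. This is a hedged illustration under stated assumptions: the names `sig` and `ffteso_rhs` are introduced here, the known model term −f(ẑ₂)ẑ₂ + M⁻¹J⁻ᵀτ is assumed to be evaluated by the caller and passed in as `known_accel`, and the gain values in the example are arbitrary. Because γ as defined is unbounded at zero error, the sketch evaluates the products γe and γ²e in their algebraically equivalent closed forms.

```python
import numpy as np

def sig(e, alpha):
    """Elementwise sign(e) * |e|**alpha, the bracket operator used in (8)."""
    return np.sign(e) * np.abs(e) ** alpha

def ffteso_rhs(z1, z1_hat, z2_hat, z3_hat, known_accel, beta, alpha1):
    """Right-hand side of observer (8) for one AUV.

    known_accel stands for -f(z2_hat) z2_hat + M_eta^{-1} J^{-T} tau, which the
    caller evaluates from the nominal model (an assumption of this sketch).
    """
    if not (2.0 / 3.0 < alpha1 < 1.0):
        raise ValueError("alpha1 must lie in (2/3, 1)")
    a2, a3 = 2 * alpha1 - 1, 3 * alpha1 - 2
    b1, b2, b3 = beta
    e1 = z1_hat - z1
    # gamma = |e1|**(alpha1 - 1) blows up at e1 = 0, so use the closed forms
    # gamma * e1 = sig(e1, alpha1) and gamma**2 * e1 = sig(e1, 2*alpha1 - 1).
    gamma_e1 = sig(e1, alpha1)
    gamma2_e1 = sig(e1, 2 * alpha1 - 1)
    dz1 = z2_hat - b1 * (sig(e1, alpha1) + e1)
    dz2 = z3_hat - b2 * (sig(e1, a2) + gamma_e1) + known_accel
    dz3 = -b3 * (sig(e1, a3) + gamma2_e1)
    return dz1, dz2, dz3

# Hypothetical gains and a unit position estimation error on every axis.
beta = (3.0, 3.0, 1.0)
ones, zeros = np.ones(5), np.zeros(5)
dz1, dz2, dz3 = ffteso_rhs(zeros, ones, zeros, zeros, zeros, beta, 0.8)
```

With zero estimation error the correction terms vanish and the observer simply propagates the nominal model, so all of the injected feedback is driven by the position estimation error alone.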

4.2. Position Tracking Controller

In this subsection, we explain our design of the DLMPC-based position tracking controller. It makes the ith AUV track the reference trajectory

η_{r}

by outputting the desired driving speed, thereby converging the position tracking error. It supplies the optimal desired speed needed by the velocity tracking controller.

The reference trajectory is defined as

η_{r} = {[x_{r}, y_{r}, z_{r}, θ_{r}, ψ_{r}]}^{T}

. Here, to avoid singularities in the reference trajectory, we make the following assumption:

Assumption 2

([25]). The reference trajectory

η_{r}

and its derivatives are smooth and bounded, satisfying

{‖η_{r}‖}_{\infty} \leq {\bar{η}}_{r}

,

{‖{\dot{η}}_{r}‖}_{\infty} \leq {\bar{η}}_{r 1}

,

{‖{\ddot{η}}_{r}‖}_{\infty} \leq {\bar{η}}_{r 2}

with positive constants

{\bar{η}}_{r}

,

{\bar{η}}_{r 1}

,

{\bar{η}}_{r 2}

.

The kinematic model for the ith AUV position tracking can be established by (1):

{\dot{x}}_{i 1} = J_{i} (η_{i}) v_{i} = f_{i 1} (x_{i 1}, u_{i 1})

(10)

where

x_{i 1} = {[x_{i}, y_{i}, z_{i}, θ_{i}, ψ_{i}]}^{T}

and

u_{i 1} = {[u_{i}, v_{i}, w_{i}, q_{i}, r_{i}]}^{T}

are the state and the control input of the ith AUV, respectively. To fulfill the control objective of each AUV, the DLMPC optimization problem for the position tracking controller can be formulated as:

\min_{u_{i 1} \in C (h)} J_{i 1} = \int_{0}^{T} (\sum_{j \in N_{i}} a_{i j} {‖x_{i j} (s)‖}_{Q_{i j}}^{2} + {‖{\tilde{x}}_{i 1} (s)‖}_{Q_{i 1}}^{2} + {‖u_{i 1} (s)‖}_{R_{i 1}}^{2}) d s

(11a)

s . t . {\dot{\overset{⌢}{x}}}_{i 1} (s) = f_{i 1} ({\overset{⌢}{x}}_{i 1} (s), u_{i 1} (s))

(11b)

{\overset{⌢}{x}}_{i 1} (0) = {x_{i 1}|}_{t = t_{0}}

(11c)

x_{i 1}^{\min} \leq {\overset{⌢}{x}}_{i 1} (s) \leq x_{i 1}^{\max}

(11d)

{‖u_{i 1} (s)‖}_{\infty} \leq u_{i 1}^{\max}

(11e)

{{\dot{V}}_{i p}|}_{u_{i 1} (0)} \leq {{\dot{V}}_{i p}|}_{u_{i 1}^{v i r} (0)}

(11f)

where

{\overset{⌢}{x}}_{i 1} (s)

denotes the predicted state trajectory,

x_{i j} = {\overset{⌢}{x}}_{i 1} - {\overset{⌢}{x}}_{j 1} - d_{i j}

,

{\tilde{x}}_{i 1} = {\overset{⌢}{x}}_{i 1} - η_{r} - d_{i r}

.

C (h)

represents the family of piecewise functions characterized by the sampling period h, and

T = M h

denotes the prediction horizon.

Q_{i j}

,

Q_{i 1}

and

R_{i 1}

represent weighting matrices that are diagonal and positive-definite. (11c) is the initial state condition. (11d) represents the position state constraint. (11e) represents the control input constraint. (11f) is the stability constraint constructed by the Lyapunov-based virtual control law

u_{i 1}^{v i r}

and the relevant Lyapunov function

V_{i p}

, which explicitly characterizes the guaranteed region of attraction. This is designed to circumvent the local linearization of the standard DMPC while guaranteeing the stability of the formation tracking. One should note that

u_{i 1}^{v i r}

does not actually control the vehicle but only ensures system stability.
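To make the structure of the cost (11a) concrete, the sketch below approximates the integral with a rectangle rule over the M sampling periods of the horizon. The discretization, the function names `wnorm2` and `position_cost`, the dictionary layout for neighbor data, and the sample values are assumptions introduced for illustration and are not taken from the paper.

```python
import numpy as np

def wnorm2(x, W):
    """Weighted squared norm ||x||_W^2 = x^T W x."""
    return float(x @ W @ x)

def position_cost(x_i, x_nbrs, u_i, eta_r, d_ir, d_ij, a_ij, Q_ij, Q_i1, R_i1, h):
    """Rectangle-rule approximation of J_i1 in (11a).

    x_i, u_i, eta_r: arrays of shape (M, 5) sampled every h seconds.
    x_nbrs, d_ij, a_ij, Q_ij: dicts keyed by neighbour index j.
    """
    total = 0.0
    for k in range(x_i.shape[0]):
        # Tracking term and control effort term.
        stage = wnorm2(x_i[k] - eta_r[k] - d_ir, Q_i1) + wnorm2(u_i[k], R_i1)
        # Formation-keeping term summed over the neighbours of AUV i.
        for j, x_j in x_nbrs.items():
            stage += a_ij[j] * wnorm2(x_i[k] - x_j[k] - d_ij[j], Q_ij[j])
        total += stage * h
    return total

# Hypothetical two-step horizon with one neighbour (index 2).
I5 = np.eye(5)
x_i, u_i, eta_r = np.ones((2, 5)), np.zeros((2, 5)), np.zeros((2, 5))
cost_alone = position_cost(x_i, {}, u_i, eta_r, np.zeros(5), {}, {}, {}, I5, I5, 0.5)
cost_formation = position_cost(
    x_i, {2: np.zeros((2, 5))}, u_i, eta_r, np.zeros(5),
    {2: np.zeros(5)}, {2: 1}, {2: I5}, I5, I5, 0.5,
)
```

The neighbor term couples the local problem to the predicted trajectories received from other vehicles, which is what makes the optimization distributed rather than decentralized.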

Then, we construct the concrete expression of the stability constraints in (11f), which involves determining an appropriate state-feedback controller and the corresponding Lyapunov function. Various nonlinear control techniques, such as sliding mode control and backstepping, can be employed. For the trajectory tracking problem, we select the backstepping method to develop the Lyapunov-based nonlinear controller.

Let

η_{i}

denote the trajectory of the ith AUV, and

η_{i r} = η_{r} + d_{i r} = {[x_{i r}, y_{i r}, z_{i r}, θ_{i r}, ψ_{i r}]}^{T}

be the desired path;

{\tilde{η}}_{i} = η_{i} - η_{i r}

represents the position tracking error of the ith AUV. Define the following Lyapunov function:

V_{i p} = \frac{1}{2} {\tilde{η}}_{i}^{T} Λ_{i 1} {\tilde{η}}_{i}

(12)

where

Λ_{i 1} > 0

is a specified control gain matrix, diagonal and positive-definite. Taking the time derivative of

V_{i p}

, we can obtain:

{\dot{V}}_{i p} = {\tilde{η}}_{i}^{T} Λ_{i 1} {\dot{\tilde{η}}}_{i} = {\tilde{η}}_{i}^{T} Λ_{i 1} (J_{i} (η_{i}) v_{i} - {\dot{η}}_{r}) .

(13)

To stabilize the position tracking, we choose the following control law:

v_{i}^{v i r} = J_{i}^{- 1} (η_{i}) ({\dot{η}}_{r} - K_{i p} {\tilde{η}}_{i})

(14)

where

K_{i p} > 0

is another specified control gain matrix. Then, the derivative of

V_{i p}

(13) becomes the following form:

{\dot{V}}_{i p} = - {\tilde{η}}_{i}^{T} Λ_{i 1} K_{i p} {\tilde{η}}_{i} .

(15)

From (12) and (15), it can be seen that

V_{i p} > 0

and

{\dot{V}}_{i p} \leq 0

, so based on Lyapunov’s direct method, the position tracking subsystem with virtual control law (14) is globally asymptotically stable with respect to the equilibrium

[{\tilde{η}}_{i}, v_{i}] = [0, 0]

. Therefore, we can obtain the concrete expression of the stability constraint (11f) as follows:

{\tilde{η}}_{i}^{T} (0) Λ_{i 1} (J_{i} (η_{i} (0)) v_{i} (0) - {\dot{η}}_{r} (0)) \leq - {\tilde{η}}_{i}^{T} (0) Λ_{i 1} K_{i p} {\tilde{η}}_{i} (0) .

(16)

The stability constraint (16) facilitates the verification that the DLMPC inherits the stability properties of the state-feedback control law (14) [39]. Moreover, owing to the online optimization process, the DLMPC-based position controller will automatically execute the optimal control performance obeying the system constraints.
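The virtual control law (14) and the contraction constraint (16) can be checked numerically with the sketch below. It is an illustration under assumptions: the function names, the gain values, and the sample pose are hypothetical, and the constraint is returned as a margin (right-hand side minus left-hand side) so that a nonnegative value means a candidate velocity is admissible.

```python
import numpy as np

def rotation_matrix(eta):
    """J(eta) of Eq. (3) for eta = [x, y, z, theta, psi], valid for |theta| < pi/2."""
    ct, st = np.cos(eta[3]), np.sin(eta[3])
    cp, sp = np.cos(eta[4]), np.sin(eta[4])
    J = np.zeros((5, 5))
    J[:3, :3] = [[cp * ct, -sp, cp * st],
                 [sp * ct,  cp, sp * st],
                 [-st,     0.0, ct]]
    J[3, 3] = 1.0
    J[4, 4] = 1.0 / ct
    return J

def virtual_velocity(eta, eta_ir, eta_r_dot, K_p):
    """Backstepping law (14): v_vir = J^{-1} (eta_r_dot - K_p (eta - eta_ir))."""
    return np.linalg.solve(rotation_matrix(eta), eta_r_dot - K_p @ (eta - eta_ir))

def stability_margin(eta, eta_ir, eta_r_dot, v, Lam1, K_p):
    """RHS minus LHS of (16); nonnegative means v(0) satisfies the constraint."""
    e = eta - eta_ir
    lhs = e @ Lam1 @ (rotation_matrix(eta) @ v - eta_r_dot)
    rhs = -(e @ Lam1 @ K_p @ e)
    return float(rhs - lhs)

# Hypothetical gains and states.
eta = np.array([1.0, -2.0, 0.5, 0.1, 0.3])
eta_ir = np.zeros(5)
eta_r_dot = np.array([0.5, 0.0, 0.0, 0.0, 0.0])
K_p, Lam1 = 0.5 * np.eye(5), np.eye(5)
v_vir = virtual_velocity(eta, eta_ir, eta_r_dot, K_p)
margin_vir = stability_margin(eta, eta_ir, eta_r_dot, v_vir, Lam1, K_p)
margin_idle = stability_margin(eta, eta_ir, eta_r_dot, np.zeros(5), Lam1, K_p)
```

The virtual law meets (16) with equality, so whenever it also respects the input bound (11e) it is itself a feasible candidate, while a zero velocity command violates the constraint for this tracking error. The optimizer is therefore free to pick any velocity that decreases the Lyapunov function at least as fast as the backstepping law would.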

4.3. Velocity Tracking Controller

In this subsection, we design a DLMPC-based velocity tracking controller to obtain the optimal control forces and moments of the ith AUV, aiming to track the desired velocity. It is used to stabilize the velocity tracking error in the AUV dynamics subsystem.

The dynamic model for the ith AUV velocity tracking can be modeled by (4):

{\dot{x}}_{i 2} = M_{i}^{* - 1} (τ_{i} + τ_{i d} - C_{i}^{*} (v_{i}) v_{i} - D_{i}^{*} (v_{i}) v_{i}) = f_{i 2} (x_{i 2}, u_{i 2}, τ_{i d})

(17)

where the state is defined as

x_{i 2} = {[u_{i}, v_{i}, w_{i}, q_{i}, r_{i}]}^{T}

and the control input is defined as

u_{i 2} = {[τ_{i u}, τ_{i v}, τ_{i w}, τ_{i q}, τ_{i r}]}^{T}

. Based on the effect of FFTESO (8) and the control objective, the DLMPC optimization problem for the velocity tracking controller can be formulated as:

\min_{u_{i 2} \in C (h)} J_{i 2} = \int_{0}^{T} ({‖{\tilde{x}}_{i 2} (s)‖}_{Q_{i 2}}^{2} + {‖u_{i 2} (s)‖}_{R_{i 2}}^{2}) d s

(18a)

s . t . {\dot{\overset{⌢}{x}}}_{i 2} (s) = f_{i 2} ({\overset{⌢}{x}}_{i 2} (s), u_{i 2} (s), {\hat{τ}}_{i d} (s))

(18b)

{\overset{⌢}{x}}_{i 2} (0) = {x_{i 2}|}_{t = t_{0}}

(18c)

x_{i 2}^{\min} \leq {\overset{⌢}{x}}_{i 2} (s) \leq x_{i 2}^{\max}

(18d)

{‖u_{i 2} (s)‖}_{\infty} \leq u_{i 2}^{\max}

(18e)

{{\dot{V}}_{i v}|}_{u_{i 2} (0)} \leq {{\dot{V}}_{i v}|}_{{\hat{u}}_{i 2}^{v i r} (0)}

(18f)

where

{\overset{⌢}{x}}_{i 2} (s)

is the predicted state trajectory, and

{\tilde{x}}_{i 2} = {\overset{⌢}{x}}_{i 2} - v_{i d}

denotes the tracking error. The desired speed

v_{i d}

is derived from the position tracking controller (11).

Q_{i 2}

and

R_{i 2}

represent positive-definite weighting matrices. Similar to the optimization problem (11), the condition (18c) denotes the initial state. (18d) represents the velocity state constraint. (18e) represents the control input constraint. (18f) is the stability constraint constructed by the virtual control law

{\hat{u}}_{i 2}^{v i r}

and the corresponding Lyapunov function

V_{i v}

. The DLMPC controller (18) inherits the stability and robustness of the virtual controller. Then, we construct the concrete expression of the stability constraints.

Let

v_{i}

denote the velocity of the ith AUV, where

{\tilde{v}}_{i} = v_{i} - v_{i d}

represents the velocity tracking error of the ith AUV. Consider the following Lyapunov function:

V_{i v} = \frac{1}{2} {\tilde{v}}_{i}^{T} Λ_{i 2} {\tilde{v}}_{i} + V_{i p}

(19)

where

Λ_{i 2} > 0

is a positive-definite diagonal matrix. Taking the time derivative of

V_{i v}

, we can derive:

{\dot{V}}_{i v} = {\tilde{v}}_{i}^{T} Λ_{i 2} {\dot{\tilde{v}}}_{i} + {\dot{V}}_{i p} = {\tilde{v}}_{i}^{T} Λ_{i 2} [M_{i}^{* - 1} (τ_{i} + τ_{i d} - C_{i}^{*} (v_{i}) v_{i} - D_{i}^{*} (v_{i}) v_{i}) - {\dot{v}}_{i d}] - {\tilde{η}}_{i}^{T} Λ_{i 1} K_{i p} {\tilde{η}}_{i} .

(20)

To achieve stable velocity tracking, based on the backstepping method and the lumped disturbances compensated by FFTESO, we choose the following control law:

τ_{i}^{v i r} = C_{i}^{*} (v_{i}) v_{i} + D_{i}^{*} (v_{i}) v_{i} + M_{i}^{*} {\dot{v}}_{i d} - M_{i}^{*} K_{i v} {\tilde{v}}_{i} - {\hat{τ}}_{i d}

(21)

where

K_{i v} > 0

is a specified gain matrix. Then, with the lumped disturbance compensated by the FFTESO, (20) becomes the following form:

{\dot{V}}_{i v} = - {\tilde{v}}_{i}^{T} Λ_{i 2} K_{i v} {\tilde{v}}_{i} - {\tilde{η}}_{i}^{T} Λ_{i 1} K_{i p} {\tilde{η}}_{i} .

(22)

From (19) and (22), it can be seen that

V_{i v} > 0

and

{\dot{V}}_{i v} \leq 0

, so according to Lyapunov’s direct method, the velocity tracking subsystem with virtual control law (21) is globally asymptotically stable with respect to the equilibrium

[{\tilde{v}}_{i}, τ_{i}] = [0, 0]

. Thus, the concrete expression of the stability constraint (18f) is:

\begin{array}{l} {\tilde{v}}_{i}^{T} (0) Λ_{i 2} [M_{i}^{* - 1} (τ_{i} (0) + {\hat{τ}}_{i d} (0) - C_{i}^{*} (v_{i} (0)) v_{i} (0) - D_{i}^{*} (v_{i} (0)) v_{i} (0)) - {\dot{v}}_{i d} (0)] \\ \leq - {\tilde{v}}_{i}^{T} (0) Λ_{i 2} K_{i v} {\tilde{v}}_{i} (0) . \end{array}

(23)

Likewise, the DLMPC-based velocity controller exerts excellent control performance thanks to online optimization. Leveraging the designed position controller (11) and velocity controller (18), the distributed Lyapunov-based model predictive formation control will be implemented for each AUV in the receding horizon mode.
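The velocity-layer counterpart, the virtual force law (21) and the constraint (23), can be sketched in the same way. The matrices below are placeholders rather than the hydrodynamic parameters of the vehicle in [32], the Coriolis and damping matrices are assumed to be already evaluated at the current velocity, and the function names are hypothetical.

```python
import numpy as np

def virtual_force(v, v_d, v_d_dot, tau_d_hat, M, C, D, K_v):
    """Backstepping law (21) using the FFTESO estimate tau_d_hat."""
    return C @ v + D @ v + M @ v_d_dot - M @ K_v @ (v - v_d) - tau_d_hat

def velocity_margin(v, v_d, v_d_dot, tau, tau_d_hat, M, C, D, Lam2, K_v):
    """RHS minus LHS of (23); nonnegative means tau(0) satisfies the constraint."""
    e = v - v_d
    accel = np.linalg.solve(M, tau + tau_d_hat - C @ v - D @ v)
    lhs = e @ Lam2 @ (accel - v_d_dot)
    rhs = -(e @ Lam2 @ K_v @ e)
    return float(rhs - lhs)

# Placeholder model matrices and gains (illustrative values only).
M = np.diag([30.0, 30.0, 30.0, 10.0, 10.0])
C = np.zeros((5, 5))
D = 5.0 * np.eye(5)
K_v, Lam2 = np.eye(5), np.eye(5)
v = np.array([1.0, 0.0, 0.0, 0.0, 0.0])
v_d, v_d_dot = np.zeros(5), np.zeros(5)
tau_d_hat = np.array([0.5, 0.0, 0.0, 0.0, 0.0])

tau_vir = virtual_force(v, v_d, v_d_dot, tau_d_hat, M, C, D, K_v)
margin_vir = velocity_margin(v, v_d, v_d_dot, tau_vir, tau_d_hat, M, C, D, Lam2, K_v)
margin_idle = velocity_margin(v, v_d, v_d_dot, np.zeros(5), tau_d_hat, M, C, D, Lam2, K_v)
```

As in the position layer, the virtual force satisfies the constraint with equality. In this example, leaving the thrusters idle relies on damping alone and does not contract the velocity error fast enough, so it is rejected by (23).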

The implementation process of DLMPC is described in Algorithm 1. The core of the algorithm consists of two parts: solving the optimization problem (11) of the position tracking layer and solving the optimization problem (18) of the velocity tracking layer. Firstly, the optimization problem (11) is solved, yielding the optimal control input related to AUV linear and angular velocity variables. Then, the control inputs of the position tracking layer are passed as reference velocity to the velocity tracking layer. By solving the optimization problem (18), the optimal control force and torque are obtained. Finally, the control inputs of the velocity tracking layer are applied to the AUV system. Figure 2 shows the flow diagram of the proposed Algorithm 1.

Algorithm 1. DLMPC Implementation

1: The ith AUV samples the current state

η_{i} (t)

. Input the cost function

J_{i 1}

in (11a).

2: The ith AUV receives the state trajectory of its neighbor AUV

{\overset{⌢}{x}}_{j 1} (t)

,

j \in V

,

j \neq i

.

3: Solve the optimization problem (11) with the initial condition

{x_{i 1}|}_{t = t_{0}} = η_{i} (t)

, generate

κ_{i 1} (s)

, and let it be the (sub-)optimal solution.

4: Implement

κ_{i 1} (s)

for only one sampling period, i.e.,

u_{i 1} (t) = κ_{i 1} (s)

for

s \in [0, h]

.

5: Let

v_{i d} (t) = u_{i 1} (t)

, input the cost function

J_{i 2}

in (18a).

6: Solve the optimization problem (18) with the initial condition

{x_{i 2}|}_{t = t_{0}} = v_{i} (t)

, generating

κ_{i 2} (s)

.

7: Implement

κ_{i 2} (s)

for only one sampling period, i.e.,

u_{i 2} (t) = κ_{i 2} (s)

for

s \in [0, h]

.

8: At the next sampling instant, set

t = t + h

, and repeat from step 1.
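
The receding-horizon structure of Algorithm 1 can be illustrated with a deliberately simplified sketch. The Python code below is not the authors' implementation. It replaces the 5-DOF AUV model with a scalar double integrator, omits neighbor communication (step 2) and disturbances, and uses clipped backstepping-type laws as stand-ins for the solutions of problems (11) and (18). All numerical values, the sinusoidal reference, and the function names are assumptions made for illustration.

```python
import math


def position_layer(eta, eta_r, eta_r_dot, k_p, v_max):
    """Stand-in for problem (11): returns a velocity command within +/- v_max."""
    u1 = eta_r_dot - k_p * (eta - eta_r)
    return max(-v_max, min(v_max, u1))


def velocity_layer(v, v_d, v_d_dot, m, k_v, tau_max):
    """Stand-in for problem (18): returns a force command within +/- tau_max."""
    u2 = m * (v_d_dot - k_v * (v - v_d))
    return max(-tau_max, min(tau_max, u2))


def run_toy_loop(h=0.1, steps=600, k_p=0.5, k_v=2.0, m=10.0,
                 v_max=2.0, tau_max=500.0):
    """Run the two-layer loop and return the final position tracking error."""
    eta, v = 5.0, 0.0          # initial position and velocity (toy values)
    v_d_prev = None
    for n in range(steps):
        t = n * h
        eta_r = math.sin(0.1 * t)              # toy reference position
        eta_r_dot = 0.1 * math.cos(0.1 * t)    # and its rate
        # Steps 1-5: position layer produces the reference velocity v_d.
        v_d = position_layer(eta, eta_r, eta_r_dot, k_p, v_max)
        # Finite-difference estimate of the reference acceleration.
        v_d_dot = 0.0 if v_d_prev is None else (v_d - v_d_prev) / h
        # Steps 6-7: velocity layer produces the force command.
        tau = velocity_layer(v, v_d, v_d_dot, m, k_v, tau_max)
        # Apply the command for one sampling period (explicit Euler step).
        v += h * tau / m
        eta += h * v
        v_d_prev = v_d         # Step 8: shift to the next sampling instant.
    return abs(eta - math.sin(0.1 * steps * h))


final_error = run_toy_loop()
```

The point of the sketch is the data flow: the output of the position layer becomes the reference of the velocity layer, and only the first sampling period of each command is applied before both layers are evaluated again.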

Remark 1.

In the next section, we demonstrate that both system stability and recursive feasibility are not dependent on obtaining the precise solution from the optimization process. Therefore, suboptimal solutions are deemed acceptable in Algorithm 1. The use of iterative methods ensures that DLMPC optimization problems (11) and (18) possess locally optimal solutions. We can trade off numerical efficacy and control effect by setting the maximum iteration number without compromising the stability of the formation control. Additionally, the compensation of lumped disturbances is continuously applied throughout the iterative optimization process, ensuring the success of the formation tracking task under multiple constraints.

4.4. Stability Analysis

In the previous subsection, stability constraints were constructed using the Lyapunov-based backstepping method. In this subsection, we will analyze the recursive feasibility and closed-loop stability of Algorithm 1.

First, we give the following theorem to show the recursive feasibility of the designed position tracking controller.

Theorem 2.

Choose the positive-definite gain matrix as

K_{i p} = diag \{k_{i p 1}, k_{i p 2}, k_{i p 3}, k_{i p 4}, k_{i p 5}\}

. Let

{\bar{K}}_{i p}

denote the largest entry in the control gain

K_{i p}

. For

u_{i 1}^{v i r} (x_{i 1}) = v_{i}^{v i r} (x_{i 1})

, under Assumption 2, if the following relation can be ensured

(1 + \frac{\sqrt{2}}{2}) ({\bar{η}}_{r 1} + {\bar{K}}_{i p} ‖{\tilde{η}}_{i} (0)‖) \leq v_{i}^{\max}

(24)

where

v_{i}^{\max} = {‖v_{i}^{\max}‖}_{\infty}

denotes the maximum generalized velocity, then the DLMPC-based position controller is recursively feasible, i.e.,

{‖u_{i 1}^{v i r} ({\overset{⌢}{x}}_{i 1} (t))‖}_{\infty} \leq u_{i 1}^{\max}

, for all

t \geq 0

.

Proof.

See Appendix B. □
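
Condition (24) is easy to evaluate numerically. The sketch below computes its left-hand side and, for a given initial error, the largest gain entry that still satisfies it. The numbers (reference-rate bound 0.55, initial error of norm 5, velocity limit 2) are hypothetical and serve only to show the trade-off between gain and admissible initial error. The function names are not from the paper.

```python
import math

S = 1 + math.sqrt(2) / 2   # the factor (1 + sqrt(2)/2) appearing in (24)


def condition_24_lhs(eta_r1_bar, K_ip, eta_err0):
    """Left-hand side of (24) for a diagonal gain K_ip and initial error eta_err0."""
    k_bar = max(K_ip)                                  # largest gain entry
    norm0 = math.sqrt(sum(e * e for e in eta_err0))    # Euclidean norm
    return S * (eta_r1_bar + k_bar * norm0)


def largest_admissible_gain(eta_r1_bar, eta_err0, v_max):
    """Largest gain entry for which (24) holds with equality."""
    norm0 = math.sqrt(sum(e * e for e in eta_err0))
    return (v_max / S - eta_r1_bar) / norm0


eta_err0 = [3.0, 4.0, 0.0, 0.0, 0.0]                        # hypothetical, norm 5
lhs_small = condition_24_lhs(0.55, [0.1] * 5, eta_err0)     # gain 0.1
lhs_large = condition_24_lhs(0.55, [0.2] * 5, eta_err0)     # gain 0.2
k_max = largest_admissible_gain(0.55, eta_err0, 2.0)
```

For these assumed values the smaller gain satisfies (24) while the larger one violates it. This is the same trade-off that underlies the attraction region (A21): a smaller gain admits a larger initial error.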

Then, we give the following theorem to show the recursive feasibility of the velocity controller.

Theorem 3.

Choose the positive definite gain matrix as

K_{i v} = diag \{k_{i v 1}, k_{i v 2}, k_{i v 3}, k_{i v 4}, k_{i v 5}\}

. Let

{\bar{K}}_{i v}

denote the largest entry in the control gain

K_{i v}

. For

{\hat{u}}_{i 2}^{v i r} (x_{i 2}) = τ_{i}^{v i r} (x_{i 2})

, under Assumption 1 and Assumption 2, if the following relation can be ensured

({\bar{c}}_{i} + {\bar{d}}_{i}) {\bar{v}}_{i} + {\bar{m}}_{i} {\bar{v}}_{i d} + [{\bar{K}}_{i v} + (1 + \frac{\sqrt{2}}{2})] {\bar{m}}_{i} ‖γ_{i} (0)‖ + {\bar{τ}}_{i d} \leq τ_{i}^{\max}

(25)

where

{\bar{v}}_{i} = (1 + \frac{\sqrt{2}}{2}) ({\bar{η}}_{r 1} + {\bar{K}}_{i p} ‖γ_{i} (0)‖)

,

{\bar{v}}_{i d} = \frac{2 + \sqrt{2}}{h} ({\bar{η}}_{r 1} + {\bar{K}}_{i p} ‖γ_{i} (0)‖)

,

τ_{i}^{\max}

is the maximum possible generalized thrust force,

{\bar{m}}_{i}

is a known constant bound for

M_{i}^{*}

. Then, the velocity controller is recursively feasible, i.e.,

{‖{\hat{u}}_{i 2}^{v i r} ({\overset{⌢}{x}}_{i 2} (t))‖}_{\infty} \leq u_{i 2}^{\max}

, for all

t \geq 0

.

Proof.

See Appendix C. □

Finally, we give the following theorem to show the stability of the closed-loop system.

Theorem 4.

Consider the AUV formation control system described by (10) and (17) with lumped disturbances. If Assumption 1 and Assumption 2 hold, then the DLMPC-based position controller (11) renders the equilibrium

[{\tilde{η}}_{i}, v_{i}] = [0, 0]

asymptotically stable, and the velocity controller (18) renders the equilibrium

[{\tilde{v}}_{i}, τ_{i}] = [0, 0]

asymptotically stable. In other words, the AUV formation tracking task can be realized under the control inputs produced by Algorithm 1.

Proof.

See Appendix D. □

5. Simulation

In this section, we conduct some simulation analyses to verify the effectiveness of the proposed hierarchical DLMPC algorithm for the AUV formation system. The formation network comprises four AUVs

(N = 4, i = 1, 2, 3, 4)

and a virtual leader (AUV0). Figure 3 illustrates the adopted communication topology, with the arrows indicating the communication direction between the AUVs. The simulation results demonstrate the promising formation tracking performance and robustness of the proposed method.

5.1. Simulation Setup

The initial states for each AUV are selected as

η_{1} = {[17 m, 28 m, - 2 m, 0.08 r a d, 2 r a d]}^{T}

,

η_{2} = {[21 m, 23 m, - 8 m, 0.06 r a d, 2 r a d]}^{T}

,

η_{3} = {[20 m, 8 m, - 6 m, - 0.05 r a d, 2.3 r a d]}^{T}

and

η_{4} = {[30 m, 17 m, - 3 m, - 0.06 r a d, 2.6 r a d]}^{T}

, respectively. The model parameters of the homogeneous AUVs were taken from previous work [40]. We selected a diamond-shaped formation conducive to omnidirectional marine exploration, setting the formation configuration vectors as

d_{1 r} = {[0, 0, 8, 0, 0]}^{T}

,

d_{2 r} = {[0, - 6, 0, 0, 0]}^{T}

,

d_{3 r} = {[0, 0, - 8, 0, 0]}^{T}

,

d_{4 r} = {[0, 6, 0, 0, 0]}^{T}

, with the inter-vehicle vectors

d_{12} = - d_{21} = {[0, 6, 8, 0, 0]}^{T}

,

d_{13} = - d_{31} = {[0, 0, 16, 0, 0]}^{T}

,

d_{14} = - d_{41} = {[0, - 6, 8, 0, 0]}^{T}

,

d_{23} = - d_{32} = {[0, - 6, 8, 0, 0]}^{T}

,

d_{24} = - d_{42} = {[0, - 12, 0, 0, 0]}^{T}

and

d_{34} = - d_{43} = {[0, - 6, - 8, 0, 0]}^{T}

. The model uncertainties are reflected by considering 15% of the nominal value as the model error, implying that the AUV parameters in the simulation characterize only 85% of the nominal system. To assess the system’s robustness, the external disturbances are modeled as follows:

\{\begin{array}{l} τ_{i c u} = 0.2 sign (u_{i}) + 0.3 \sin (t / 10) N \\ τ_{i c v} = 0.1 sign (v_{i}) + 0.2 \sin (t / 20) N \\ τ_{i c w} = 0.05 sign (w_{i}) + 0.1 \sin (t / 5) N \\ τ_{i c q} = 0.2 sign (q_{i}) + 0.1 \sin (t / 10) N \cdot m \\ τ_{i c r} = 0.3 sign (r_{i}) + 0.2 \sin (t / 10) N \cdot m \end{array}

(26)
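
For reproducing the tests, the disturbance model (26) can be coded directly. The sketch below is one possible transcription. The coefficient table is copied from (26), while the convention sign(0) = 0 and the function names are assumptions.

```python
import math

# (sign coefficient, sine amplitude, sine time constant) for u, v, w, q, r in (26).
COEFFS = [
    (0.2, 0.3, 10.0),
    (0.1, 0.2, 20.0),
    (0.05, 0.1, 5.0),
    (0.2, 0.1, 10.0),
    (0.3, 0.2, 10.0),
]


def sign(x):
    """Sign function with sign(0) = 0 (assumed convention)."""
    return (x > 0) - (x < 0)


def disturbance(t, vel):
    """External disturbance vector of (26) at time t for velocity [u, v, w, q, r]."""
    return [a * sign(x) + b * math.sin(t / T) for (a, b, T), x in zip(COEFFS, vel)]


def disturbance_bounds():
    """Component-wise magnitude bounds a + b implied by (26)."""
    return [a + b for a, b, _ in COEFFS]


tau_c0 = disturbance(0.0, [1.0, -1.0, 0.0, 1.0, -1.0])
bounds = disturbance_bounds()
```

The first three components are forces in N and the last two are moments in N·m, as in (26). Each component is bounded in magnitude by the sum of its two coefficients.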

The parameters were selected according to the following guidelines. Because AUVs navigate at low speeds, a small T is adopted; during tuning, T is decreased if the response is not fast enough and increased if the stability is poor. As we place more attention on the position tracking performance,

Q_{i 1}

is set as slightly bigger than

Q_{i 2}

; to attenuate the interaction between angles, the angle weights in

Q_{i j}

are set a little smaller;

R_{i 1}

and

R_{i 2}

are set as small as possible while ensuring the stability of the system. The connection among the observer gains

β_{i k}

and

α_{i 1}

is obtained by solving the Lyapunov equation (29), and then tuned to select appropriate values. Following the above guidelines, the simulation parameters of the proposed algorithm in Table 2 were chosen.

Moreover, the prediction horizon is

T = 10

h, and each actuator is limited to 500 N. The upper and lower bounds of the velocity states are set to

\pm 2

m/s,

\pm 1.2

m/s,

\pm 0.7

m/s,

\pm 0.1

rad/s,

\pm 0.3

rad/s. The reference trajectory generated by a virtual leader is a helical curve:

\{\begin{array}{l} x_{r} (t) = 35 \cos (π t / 200) \\ y_{r} (t) = 35 \sin (π t / 200) \\ z_{r} (t) = - 0.06 t - 6 \end{array}

(27)
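
The reference (27) and the configuration vectors listed above can be checked with a few lines of Python. The sketch below evaluates the helix and its rate, and tests whether the inter-vehicle vectors are consistent with the leader-relative ones. The relation d_ij = d_ir - d_jr used in that test is an assumption. It is not stated in this section, but it reproduces every listed value.

```python
import math

OMEGA = math.pi / 200.0   # angular rate of the helix in (27), rad/s


def reference(t):
    """Position (x_r, y_r, z_r) of the virtual leader from (27)."""
    return (35.0 * math.cos(OMEGA * t), 35.0 * math.sin(OMEGA * t), -0.06 * t - 6.0)


def reference_speed(t):
    """Euclidean norm of the time derivative of (27)."""
    dx = -35.0 * OMEGA * math.sin(OMEGA * t)
    dy = 35.0 * OMEGA * math.cos(OMEGA * t)
    dz = -0.06
    return math.sqrt(dx * dx + dy * dy + dz * dz)


# Leader-relative configuration vectors d_ir from the setup.
D_R = {1: [0, 0, 8, 0, 0], 2: [0, -6, 0, 0, 0], 3: [0, 0, -8, 0, 0], 4: [0, 6, 0, 0, 0]}

# Inter-vehicle vectors d_ij from the setup.
D_REL = {
    (1, 2): [0, 6, 8, 0, 0], (1, 3): [0, 0, 16, 0, 0], (1, 4): [0, -6, 8, 0, 0],
    (2, 3): [0, -6, 8, 0, 0], (2, 4): [0, -12, 0, 0, 0], (3, 4): [0, -6, -8, 0, 0],
}


def relative_vectors_consistent():
    """Check the assumed relation d_ij = d_ir - d_jr for every listed pair."""
    return all(
        d_ij == [a - b for a, b in zip(D_R[i], D_R[j])]
        for (i, j), d_ij in D_REL.items()
    )


start = reference(0.0)
speed = reference_speed(123.0)   # constant along the helix
consistent = relative_vectors_consistent()
```

The helix has a period of 400 s and descends 24 m per turn. Its constant speed of about 0.55 m/s lies well inside the ±2 m/s surge bound given above.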

5.2. Performance Tests for Lumped Disturbances Estimation

Firstly, to assess the disturbance rejection capability of the formation system under spatial disturbance constraints, we conducted comparative tests with the conventional FTESO in [19], the improved third-order FTESO in [31], and the designed FFTESO (8). Figure 4 illustrates the disturbance estimation error norms

‖e_{i 3}‖ = ‖{\hat{d}}_{i} - d_{i}‖

for each AUV under the different observers. The figure shows that the conventional FTESO converges slowly and exhibits slight chattering. The third-order FTESO achieves finite-time stabilization, but it starts with a large initial error and also converges slowly. In contrast, the proposed FFTESO drives the estimation error to a small neighborhood of the origin within a finite time and outperforms the other two observers in both dynamic response speed and estimation accuracy. Owing to the fast transient response and high accuracy of the designed FFTESO (8), each AUV can compensate for external disturbances and internal uncertainties more quickly and accurately, which strengthens the active disturbance rejection capability of the formation control system.

5.3. Performance Tests for Formation Trajectory Tracking

To evaluate the tracking control performance of the designed scheme, comparative tests were conducted under uniform parameter and disturbance settings: scheme (a) corresponds to our proposed FFTESO-based hierarchical DLMPC algorithm; scheme (b) corresponds to the conventional DLMPC algorithm for AUV formation in [30]; and scheme (c) corresponds to the ASTASMC algorithm proposed for AUV formation control in [23]. In the following section, we analyze the performance metrics of formation tracking control: convergence speed, tracking accuracy, and smoothness of control inputs.

Figure 5, Figure 6, Figure 7, Figure 8 and Figure 9 depict the trajectories of the position and attitude angles of each AUV during formation tracking under the three schemes. Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14 depict the trajectories of linear and angular velocities. All three schemes successfully guided the four AUVs to the desired state, but their tracking performance differed. In terms of convergence speed, scheme (a) achieved full-state stable tracking within 150 s, scheme (b) required 220 s, and scheme (c) took about 300 s to converge. In terms of tracking accuracy, unlike the observers used in (a) and (b), scheme (c) compensated for lumped disturbances through a robust adaptive law designed to mitigate high-frequency measurement noise. However, the state trajectories show that the state variables in (c) took longer to stabilize and exhibited chattering, which indicates a weaker disturbance rejection capability than that of the proposed scheme. Furthermore, the comparison of (a) and (b) shows that the hierarchical structure enhances the rate of convergence and the controllability of the velocity state. The simulation results affirm that combining the disturbance compensation of the FFTESO with the online optimization of DLMPC substantially improved the formation control performance.

Figure 15 illustrates the formation tracking trajectory in three-dimensional space. Together with Figure 5, Figure 6, Figure 7, Figure 8, Figure 9, Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14, it shows that, under the same initial conditions, all three schemes performed the formation helical dive task. However, scheme (c) was characterized by continuous fluctuations during the tracking process, posing an increased risk of the AUV formation deviating from the desired trajectory. Conversely, the formation members in schemes (a) and (b) smoothly tracked the reference trajectory while maintaining the preset distance. This difference arose from the distinct compensation principles of the disturbance rejection methods: the robust adaptive law in scheme (c) proved less robust to lumped disturbances with fast time-varying characteristics. In addition, the proposed control scheme allowed each AUV to form the predefined configuration more rapidly, showcasing the superior response speed of the designed control system. Consequently, under multiple constraints such as lumped disturbances, state constraints, and stability constraints, the FFTESO-based DLMPC algorithm exhibited greater adaptability to complex underwater environments than the other two algorithms in terms of disturbance rejection, convergence speed, and tracking performance. Figure 16 gives the tracking error for each AUV under the three schemes. Table 3 presents the convergence time of all states and the average of AUV3’s tracking error after 130 s for the three schemes. These results show that the proposed method achieved the fastest convergence and the highest tracking accuracy of the three schemes.
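
The convergence times quoted above (150 s, 220 s and about 300 s) can be turned into relative improvements with a one-line formula. The sketch below assumes the improvement is measured as the reduction in convergence time relative to the baseline scheme. The section does not state how the percentages were computed, so this definition is an assumption, although it reproduces the 31.8% and 50% figures quoted in the Conclusions.

```python
def improvement(baseline, proposed):
    """Relative reduction (in percent) of `proposed` with respect to `baseline`."""
    return 100.0 * (baseline - proposed) / baseline


# Convergence times in seconds, as reported in the text for schemes (a), (b), (c).
T_A, T_B, T_C = 150.0, 220.0, 300.0

gain_vs_dlmpc = improvement(T_B, T_A)     # proposed scheme vs. conventional DLMPC
gain_vs_astasmc = improvement(T_C, T_A)   # proposed scheme vs. ASTASMC
```

The same function could be applied to the average tracking errors in Table 3, which are not reproduced in the text and are therefore left out here.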

Without loss of generality, Figure 17 illustrates the actual control forces and moments applied to AUV1 under the three algorithms. The blue curve represents the ASTASMC scheme, the green curve represents the traditional DLMPC scheme, and the red curve depicts the proposed DLMPC scheme. Compared with schemes (b) and (c), the control signals under the proposed scheme were regulated more swiftly, and the force and moment varied smoothly, allowing for steady trajectory tracking of the AUV formation when subjected to constraints. It is worth mentioning that the AUVs under the ASTASMC scheme required continual correction of the driving force and moment, leading to persistent chattering. This observation underscores the robustness and superiority of the FFTESO-based hierarchical DLMPC algorithm. It is noteworthy that, at the onset of the task, the proposed scheme made use of the propulsion capability to achieve the fastest possible convergence, all while adhering to the physical limitations of the thrusters. In other words, the variation of the control signals continually remained within prescribed limits, effectively avoiding actuator saturation and reducing the failure rate.

6. Conclusions

In summary, this paper proposes an FFTESO-based hierarchical DLMPC scheme for AUV formation tracking under multiple constraints. The scheme leverages the faster and more precise compensation of lumped disturbances by the FFTESO to update the prediction model online. Position tracking and velocity tracking controllers were designed to determine the optimal velocities and control forces for the formation system while adhering to the specified constraints. Lyapunov-based backstepping controllers were then employed to construct stability constraints in the DLMPC optimization problems, ensuring both recursive feasibility and closed-loop stability of the control algorithm. The simulation results demonstrated that, compared with the conventional DLMPC and the ASTASMC method, the proposed scheme improved the convergence speed by 31.8% and 50%, and the tracking accuracy by 38.8% and 62.1%, respectively. This demonstrates that the proposed scheme significantly improved the formation tracking performance and anti-disturbance capability. The theoretical results lay a robust foundation for the practical design and implementation of AUV formation controllers.

The main limitation of DLMPC is that it relies heavily on timely and reliable inter-subsystem communication. In future work, we will focus on the design of a DLMPC controller for AUV formation systems subject to communication delays to overcome the communication challenges in real-world applications. In addition, because finite-time convergence is sensitive to the initial values of the states, we intend to investigate an extended state observer with fixed-time convergence.

Author Contributions

Conceptualization, Z.Y.; data curation, J.Z.; funding acquisition, Z.Y.; investigation, J.Z.; methodology, Z.Y. and M.Z.; resources, L.Y.; software, J.Z.; supervision, Z.Y.; validation, M.Z.; visualization, L.Y.; writing—original draft, M.Z.; writing—review and editing, M.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under grant Nos. 52071102 and 51679057.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Theorem 1.

We introduce an auxiliary error variable

ε_{i}^{T} = [{(「 e_{i 1} 」^{α_{i 1}} + e_{i 1})}^{T}, e_{i 2}^{T}, e_{i 3}^{T}]

. It can be seen that when the estimation error

e_{i}

converges to the neighborhood of the origin in finite time,

ε_{i}

also converges to the neighborhood of the origin. Taking the time derivative of

ε_{i}

, we obtain:

\begin{matrix} {\dot{ε}}_{i} = [\begin{matrix} α_{i 1} {|e_{i 1}|}^{α_{i 1} - 1} {\dot{e}}_{i 1} + {\dot{e}}_{i 1} \\ {\dot{e}}_{i 2} \\ {\dot{e}}_{i 3} \end{matrix}] = [\begin{matrix} α_{i 1} {|e_{i 1}|}^{α_{i 1} - 1} [e_{i 2} - β_{i 1} (「 e_{i 1} 」^{α_{i 1}} + e_{i 1})] \\ \frac{1}{2} [e_{i 3} - β_{i 2} (「 e_{i 1} 」^{α_{i 2}} + γ_{i} e_{i 1})] \\ - \frac{1}{2} β_{i 3} (「 e_{i 1} 」^{α_{i 3}} + γ_{i}^{2} e_{i 1}) \end{matrix}] \\ + [\begin{matrix} e_{i 2} - β_{i 1} (「 e_{i 1} 」^{α_{i 1}} + e_{i 1}) \\ \frac{1}{2} [e_{i 3} - β_{i 2} (「 e_{i 1} 」^{α_{i 2}} + γ_{i} e_{i 1})] \\ - \frac{1}{2} β_{i 3} (「 e_{i 1} 」^{α_{i 3}} + γ_{i}^{2} e_{i 1}) \end{matrix}] + [\begin{matrix} 0_{5} \\ {\tilde{f}}_{i} \\ 0_{5} \end{matrix}] + [\begin{matrix} 0_{5} \\ 0_{5} \\ - σ_{i} \end{matrix}] \\ = diag ([{|e_{i 1}|}^{α_{i 1} - 1}, {|e_{i 1}|}^{α_{i 1} - 1}, {|e_{i 1}|}^{α_{i 1} - 1}]) C_{i 1} ε_{i} + C_{i 2} ε_{i} + F_{i} + Θ_{i} \end{matrix}

(A1)

where

F_{i} = {[\begin{matrix} 0_{5} & {\tilde{f}}_{i} & 0_{5} \end{matrix}]}^{T}

,

Θ_{i} = {[\begin{matrix} 0_{5} & 0_{5} & - σ_{i} \end{matrix}]}^{T}

, and the state coefficient matrices

C_{i 1} = (\begin{matrix} - α_{i 1} β_{i 1} I_{5} & α_{i 1} I_{5} & 0_{5} \\ - β_{i 2} I_{5} / 2 & 0_{5} & γ_{i}^{- 1} I_{5} / 2 \\ - β_{i 3} γ_{i} I_{5} / 2 & 0_{5} & 0_{5} \end{matrix})

,

C_{i 2} = (\begin{matrix} - β_{i 1} I_{5} & I_{5} & 0_{5} \\ - β_{i 2} γ_{i} I_{5} / 2 & 0_{5} & I_{5} / 2 \\ - β_{i 3} γ_{i}^{2} I_{5} / 2 & 0_{5} & 0_{5} \end{matrix})

, defining the state function

{\tilde{f}}_{i} = f_{i} (z_{i 2}) z_{i 2} - f_{i} ({\hat{z}}_{i 2}) {\hat{z}}_{i 2} = - f_{i} (z_{i 2}) e_{i 2} - f_{i} (e_{i 2}) (z_{i 2} + e_{i 2})

. Then, we can further obtain

‖{\tilde{f}}_{i}‖ \leq l_{i 1} ‖z_{i 2}‖ ‖e_{i 2}‖ + l_{i 2} ‖e_{i 2}‖ (‖z_{i 2}‖ + ‖e_{i 2}‖) \leq (l_{i 1} + l_{i 2}) {\bar{μ}}_{i} ‖e_{i 2}‖ + l_{i 2} {‖e_{i 2}‖}^{2}

, with

l_{i 1}

and

l_{i 2}

as positive constants, and

{\bar{μ}}_{i}

as an upper bound of

z_{i 2}

that exists due to limited velocity.

If the designed observer gain is restricted to

β_{i 3} < 2 α_{i 1} β_{i 1} β_{i 2}

, it is known that all eigenvalues of

C_{i 1}

and

C_{i 2}

have negative real parts. This means that the coefficient matrices

C_{i 1}

,

C_{i 2}

are Hurwitz matrices. So, there exist Hermitian matrices

H_{i 1}

and

H_{i 2}

such that the following Lyapunov equations hold:

\{\begin{matrix} C_{i 1}^{T} P_{i} + P_{i} C_{i 1} = - H_{i 1} \\ C_{i 2}^{T} P_{i} + P_{i} C_{i 2} = - H_{i 2} \end{matrix} .

(A2)

where

P_{i}

is a positive-definite symmetric matrix. Then, we select a candidate Lyapunov function as

V_{i 1} (e_{i}) = ε_{i}^{T} P_{i} ε_{i}

. Differentiating

V_{i 1} (e_{i})

with respect to time, one obtains:

\begin{matrix} {\dot{V}}_{i 1} & = ε_{i}^{T} [diag ([{|e_{i 1}|}^{α_{i 1} - 1}, {|e_{i 1}|}^{α_{i 1} - 1}, {|e_{i 1}|}^{α_{i 1} - 1}]) (C_{i 1}^{T} P_{i} + P_{i} C_{i 1})] ε_{i} + ε_{i}^{T} (C_{i 2}^{T} P_{i} + P_{i} C_{i 2}) ε_{i} + 2 ε_{i}^{T} P_{i} (F_{i} + Θ_{i}) \\ \leq - {|e_{i 1}|}_{\max}^{α_{i 1} - 1} ε_{i}^{T} H_{i 1} ε_{i} - ε_{i}^{T} H_{i 2} ε_{i} + 2 ‖ε_{i}‖ ‖P_{i}‖ (‖F_{i}‖ + ‖Θ_{i}‖) \end{matrix}

(A3)

where

{|e_{i 1}|}_{\max} = \max \{|e_{i 11}|, \dots, |e_{i 15}|\}

. Given that

{|e_{i 1}|}_{\max} \leq ‖e_{i 1}‖ \leq {‖ε_{i}‖}^{1 / α_{i 1}}

, and

α_{i 1} \in (2 / 3, 1)

, we can derive the following inequality:

\begin{array}{l} {\dot{V}}_{i 1} \leq - {‖ε_{i}‖}^{\frac{α_{i 1} - 1}{α_{i 1}}} ε_{i}^{T} H_{i 1} ε_{i} - ε_{i}^{T} H_{i 2} ε_{i} + 2 ‖ε_{i}‖ ‖P_{i}‖ ‖F_{i}‖ + 2 ‖ε_{i}‖ ‖P_{i}‖ ‖Θ_{i}‖ \\ \leq - λ_{\min} (H_{i 1}) {‖ε_{i}‖}^{3 - \frac{1}{α_{i 1}}} - λ_{\min} (H_{i 2}) {‖ε_{i}‖}^{2} + 2 {‖ε_{i}‖}^{2} ‖P_{i}‖ [(l_{i 1} + l_{i 2}) {\bar{μ}}_{i} + l_{i 2} ‖ε_{i}‖] + 2 ‖ε_{i}‖ ‖P_{i}‖ ‖Θ_{i}‖ . \end{array}

(A4)

From

λ_{\min} (P_{i}) {‖ε_{i}‖}^{2} \leq V_{i 1} \leq λ_{\max} (P_{i}) {‖ε_{i}‖}^{2}

we have

λ_{\max} {(P_{i})}^{- 1 / 2} V_{i 1}^{1 / 2} \leq ‖ε_{i}‖ \leq λ_{\min} {(P_{i})}^{- 1 / 2} V_{i 1}^{1 / 2}

. Since

σ_{i}

is supposed to be limited by

|σ_{i p}| \leq {\bar{σ}}_{i}

, we can obtain:

2 ‖ε_{i}‖ ‖P_{i}‖ ‖Θ_{i}‖ \leq 2 \sqrt{5} {\bar{σ}}_{i} ‖ε_{i}‖ ‖P_{i}‖ \leq 2 \sqrt{5} {\bar{σ}}_{i} λ_{\min} {(P_{i})}^{- 1 / 2} ‖P_{i}‖ V_{i 1}^{1 / 2}

(A5)

Accordingly, inequality (A4) can be further derived as:

\begin{matrix} {\dot{V}}_{i 1} & \leq - λ_{\min} (H_{i 1}) λ_{\max} {(P_{i})}^{\frac{1}{2 α_{i 1}} - \frac{3}{2}} V_{i 1}^{\frac{3}{2} - \frac{1}{2 α_{i 1}}} + 2 l_{i 2} ‖P_{i}‖ λ_{\min} {(P_{i})}^{- \frac{3}{2}} V_{i 1}^{\frac{3}{2}} + 2 \sqrt{5} {\bar{σ}}_{i} ‖P_{i}‖ λ_{\min} {(P_{i})}^{- \frac{1}{2}} V_{i 1}^{\frac{1}{2}} \\ + (- λ_{\min} (H_{i 2}) λ_{\max} {(P_{i})}^{- 1} + 2 (l_{i 1} + l_{i 2}) {\bar{μ}}_{i} ‖P_{i}‖ λ_{\min} {(P_{i})}^{- 1}) V_{i 1} \\ \leq - λ_{i 1} V_{i 1}^{\frac{3}{2} - \frac{1}{2 α_{i 1}}} + λ_{i 2} V_{i 1} + λ_{i 3} V_{i 1}^{\frac{3}{2}} + λ_{i 4} V_{i 1}^{\frac{1}{2}} \end{matrix}

(A6)

with

λ_{i 1} = λ_{\min} (H_{i 1}) λ_{\max} {(P_{i})}^{\frac{1}{2 α_{i 1}} - \frac{3}{2}}

,

λ_{i 2} = - λ_{\min} (H_{i 2}) λ_{\max} {(P_{i})}^{- 1} + 2 (l_{i 1} + l_{i 2}) {\bar{μ}}_{i} ‖P_{i}‖ λ_{\min} {(P_{i})}^{- 1}

,

λ_{i 3} = 2 l_{i 2} ‖P_{i}‖ λ_{\min} {(P_{i})}^{- \frac{3}{2}}

and

λ_{i 4} = 2 \sqrt{5} {\bar{σ}}_{i} ‖P_{i}‖ λ_{\min} {(P_{i})}^{- \frac{1}{2}}

. Then, we define two new variables as

{\bar{λ}}_{i 2} = λ_{i 2} V_{i 1} {(e_{i}^{0})}^{- \frac{1}{2} + \frac{1}{2 α_{i 1}}}

,

{\bar{λ}}_{i 3} = λ_{i 3} V_{i 1} {(e_{i}^{0})}^{\frac{1}{2 α_{i 1}}}

, and a restriction region

Θ_{i 1}

specified for the initial value

V_{i 1} (e_{i}^{0})

. For

e_{i} \in Θ_{i 1} = \{e_{i}| λ_{i 2} V_{i 1}^{- \frac{1}{2} + \frac{1}{2 α_{i 1}}} + λ_{i 3} V_{i 1}^{\frac{1}{2 α_{i 1}}} < λ_{i 1}\}

, there is

{\dot{V}}_{i 1} < 0

. This indicates that

V_{i 1}

is monotonically decreasing, so that

V_{i 1} (e_{i}^{0}) \geq V_{i 1} (e_{i})

, and

{\bar{λ}}_{i 2} + {\bar{λ}}_{i 3} < λ_{i 1}

. According to the above definition, inequality (A6) can be simplified as:

\begin{matrix} {\dot{V}}_{i 1} & \leq - (λ_{i 1} - {\bar{λ}}_{i 2} - {\bar{λ}}_{i 3}) V_{i 1}^{\frac{3}{2} - \frac{1}{2 α_{i 1}}} + λ_{i 2} V_{i 1} - {\bar{λ}}_{i 2} V_{i 1}^{\frac{3}{2} - \frac{1}{2 α_{i 1}}} + λ_{i 3} V_{i 1}^{\frac{3}{2}} - {\bar{λ}}_{i 3} V_{i 1}^{\frac{3}{2} - \frac{1}{2 α_{i 1}}} + λ_{i 4} V_{i 1}^{\frac{1}{2}} \\ \leq - (λ_{i 1} - {\bar{λ}}_{i 2} - {\bar{λ}}_{i 3}) V_{i 1}^{\frac{3}{2} - \frac{1}{2 α_{i 1}}} + λ_{i 4} V_{i 1}^{\frac{1}{2}} = - λ_{i 5} V_{i 1}^{\frac{3}{2} - \frac{1}{2 α_{i 1}}} + λ_{i 4} V_{i 1}^{\frac{1}{2}} \end{matrix}

(A7)

where

λ_{i 5} = λ_{i 1} - {\bar{λ}}_{i 2} - {\bar{λ}}_{i 3}

. Inequality (A7) has the same structure as Proposition 2 of [41]. Therefore, the error trajectory of the designed FFTESO (8) is finite-time uniformly ultimately bounded, which means that the estimation errors

e_{i}

will converge to a small neighborhood of the origin. Furthermore, the convergence time

T_{i f}

is given by:

T_{i f} \leq \frac{2 α_{i 1} V_{i 1} {(e_{i}^{0})}^{\frac{1}{2 α_{i 1}} - \frac{1}{2}}}{(λ_{i 5} - δ_{i 5}) (1 - α_{i 1})} .

(A8)

The stable region

Ω_{i}

is given by

Ω_{i} = \{e_{i}| V_{i 1} {(e_{i})}^{1 - \frac{1}{2 α_{i 1}}} < λ_{i 4} / δ_{i 5}\}

, where

δ_{i 5} \in (0, λ_{i 5})

is an arbitrary constant. This completes the proof. □
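
The Hurwitz property invoked in the proof can be checked numerically for a given set of gains. Because C_{i1} and C_{i2} are built from 3×3 coefficient patterns multiplied by I_{5}, their eigenvalues are those of the 3×3 patterns, each repeated five times. The sketch below applies the Routh–Hurwitz test for a cubic to these patterns. For C_{i1} the test reduces to the stated condition β_{i3} < 2 α_{i1} β_{i1} β_{i2}. For C_{i2} it gives β_{i3} < 2 β_{i1} β_{i2} / γ_{i}, so the outcome also depends on γ_{i}. The gain values are hypothetical.

```python
def char_poly_3x3(A):
    """Coefficients (a2, a1, a0) of det(sI - A) = s^3 + a2 s^2 + a1 s + a0."""
    trace = A[0][0] + A[1][1] + A[2][2]
    minors = (A[0][0] * A[1][1] - A[0][1] * A[1][0]
              + A[0][0] * A[2][2] - A[0][2] * A[2][0]
              + A[1][1] * A[2][2] - A[1][2] * A[2][1])
    det = (A[0][0] * (A[1][1] * A[2][2] - A[1][2] * A[2][1])
           - A[0][1] * (A[1][0] * A[2][2] - A[1][2] * A[2][0])
           + A[0][2] * (A[1][0] * A[2][1] - A[1][1] * A[2][0]))
    return -trace, minors, -det


def is_hurwitz_3x3(A):
    """Routh-Hurwitz test for a cubic: a2 > 0, a0 > 0 and a2 * a1 > a0."""
    a2, a1, a0 = char_poly_3x3(A)
    return a2 > 0 and a0 > 0 and a2 * a1 > a0


def c1_pattern(alpha1, b1, b2, b3, gamma):
    """3x3 coefficient pattern of C_i1 (each entry multiplies I_5)."""
    return [[-alpha1 * b1, alpha1, 0.0],
            [-b2 / 2, 0.0, 1.0 / (2 * gamma)],
            [-b3 * gamma / 2, 0.0, 0.0]]


def c2_pattern(b1, b2, b3, gamma):
    """3x3 coefficient pattern of C_i2 (each entry multiplies I_5)."""
    return [[-b1, 1.0, 0.0],
            [-b2 * gamma / 2, 0.0, 0.5],
            [-b3 * gamma ** 2 / 2, 0.0, 0.0]]


# Hypothetical gains: alpha1 = 0.8, beta = (6, 11, 6), gamma = 1.
ok_c1 = is_hurwitz_3x3(c1_pattern(0.8, 6.0, 11.0, 6.0, 1.0))
ok_c2 = is_hurwitz_3x3(c2_pattern(6.0, 11.0, 6.0, 1.0))
# Violating beta3 < 2 * alpha1 * beta1 * beta2 (= 105.6 here) breaks the property.
bad_c1 = is_hurwitz_3x3(c1_pattern(0.8, 6.0, 11.0, 200.0, 1.0))
```

A check of this kind is a cheap safeguard when tuning the observer gains, since the Lyapunov equations (A2) admit a positive-definite solution only for Hurwitz coefficient matrices.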

Appendix B

Proof of Theorem 2.

Given the current system state

x_{i 1} (t)

, if

{‖u_{i 1}^{v i r} ({\overset{⌢}{x}}_{i 1})‖}_{\infty} \leq u_{i 1}^{\max}

can be satisfied, then

u_{i 1}^{v i r} ({\overset{⌢}{x}}_{i 1})

is always feasible for the DLMPC optimization problem (11).

Taking the infinity norm on both sides of (14), we have

\begin{matrix} {‖v_{i}^{v i r} ({\overset{⌢}{x}}_{i 1})‖}_{\infty} & \leq {‖J_{i}^{- 1} (η_{i})‖}_{\infty} {‖{\dot{η}}_{r} - K_{i p} {\tilde{η}}_{i}‖}_{\infty} \leq {‖J_{i}^{- 1} (η_{i})‖}_{\infty} ({‖{\dot{η}}_{r}‖}_{\infty} + {‖K_{i p} {\tilde{η}}_{i}‖}_{\infty}) \\ \leq {‖J_{i}^{- 1} (η_{i})‖}_{\infty} ({\bar{η}}_{r 1} + {\bar{K}}_{i p} {‖{\tilde{η}}_{i}‖}_{\infty}) . \end{matrix}

(A9)

From (15),

{\dot{V}}_{i p} \leq 0

. Therefore,

‖{\tilde{η}}_{i} (t)‖ \leq ‖{\tilde{η}}_{i} (0)‖

. Considering that

{‖{\tilde{η}}_{i}‖}_{\infty} \leq ‖{\tilde{η}}_{i}‖

, we have

{‖{\tilde{η}}_{i}‖}_{\infty} \leq ‖{\tilde{η}}_{i} (0)‖

. According to the property of the rotation matrix (3), we obtain

\begin{matrix} {‖J_{i}^{- 1} (η_{i})‖}_{\infty} = \max \{|\cos ψ_{i} \cos θ_{i}| + |\sin ψ_{i} \cos θ_{i}| + |- \sin θ_{i}|, |- \sin ψ_{i}| + |\cos ψ_{i}|, \\ |\cos ψ_{i} \sin θ_{i}| + |\sin ψ_{i} \sin θ_{i}| + |\cos θ_{i}|, 1, |1 / \cos θ_{i}|\} \leq 1 + \frac{\sqrt{2}}{2} . \end{matrix}

(A10)

Accordingly, (A9) can be further derived as:

{‖v_{i}^{v i r} ({\overset{⌢}{x}}_{i 1})‖}_{\infty} \leq (1 + \frac{\sqrt{2}}{2}) ({\bar{η}}_{r 1} + {\bar{K}}_{i p} ‖{\tilde{η}}_{i} (0)‖) .

(A11)

If (24) can be satisfied, then the relation

{‖v_{i}^{v i r} ({\overset{⌢}{x}}_{i 1})‖}_{\infty} \leq v_{i}^{\max}

can hold. This ensures that

{‖u_{i 1}^{v i r} ({\overset{⌢}{x}}_{i 1})‖}_{\infty} \leq u_{i 1}^{\max}

holds at all moments, which concludes the proof. □
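
The row sums listed in (A10) can be evaluated on a grid to see how the bound 1 + √2/2 behaves over the attitude range. The sketch below does this with the standard library only. The pitch limit of 0.4 rad is a placeholder assumption, since the pitch constraint of the paper is not restated in this appendix. The second evaluation shows that for a larger pitch angle the first row sum can reach √3, so the bound is to be read together with a restricted pitch range.

```python
import math

BOUND = 1 + math.sqrt(2) / 2   # right-hand side of (A10)


def jinv_inf_norm(theta, psi):
    """Largest of the five row sums listed in (A10) for pitch theta and yaw psi."""
    c_t, s_t = math.cos(theta), math.sin(theta)
    c_p, s_p = math.cos(psi), math.sin(psi)
    rows = [
        abs(c_p * c_t) + abs(s_p * c_t) + abs(s_t),
        abs(s_p) + abs(c_p),
        abs(c_p * s_t) + abs(s_p * s_t) + abs(c_t),
        1.0,
        abs(1.0 / c_t),
    ]
    return max(rows)


def max_over_grid(theta_max, n=200):
    """Maximum of the norm over |theta| <= theta_max and psi in [-pi, pi]."""
    best = 0.0
    for i in range(n + 1):
        theta = -theta_max + 2.0 * theta_max * i / n
        for j in range(n + 1):
            psi = -math.pi + 2.0 * math.pi * j / n
            best = max(best, jinv_inf_norm(theta, psi))
    return best


norm_small_pitch = max_over_grid(0.4)   # assumed pitch limit of 0.4 rad
norm_large_pitch = jinv_inf_norm(math.atan(1 / math.sqrt(2)), math.pi / 4)
```

With the assumed limit the maximum stays below 1 + √2/2. At θ = arctan(1/√2), ψ = π/4, the first row sum equals √3.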

Appendix C

Proof of Theorem 3.

Given the current system state

x_{i 2} (t)

, if

{‖{\hat{u}}_{i 2}^{v i r} ({\overset{⌢}{x}}_{i 2})‖}_{\infty} \leq u_{i 2}^{\max}

can be satisfied, then

{\hat{u}}_{i 2}^{v i r} ({\overset{⌢}{x}}_{i 2})

is always feasible for the DLMPC optimization problem (18).

Consider the following Lyapunov function:

V_{i} = V_{i v} + V_{i 1} = \frac{1}{2} γ_{i}^{T} Π_{i} γ_{i}

(A12)

where

γ_{i} = {[{\tilde{η}}_{i}^{T}, {\tilde{v}}_{i}^{T}, \sqrt{2} ε_{i}^{T}]}^{T}

,

Π_{i} = diag (Λ_{i 1}, Λ_{i 2}, P_{i})

. From (22) and the proof of Theorem 1, it follows that

{\dot{V}}_{i} \leq 0

. Therefore,

‖γ_{i} (t)‖ \leq ‖γ_{i} (0)‖

. Moreover, we have

{‖{\tilde{η}}_{i}‖}_{\infty} \leq ‖{\tilde{η}}_{i}‖ \leq ‖γ_{i}‖

,

{‖{\tilde{v}}_{i}‖}_{\infty} \leq ‖{\tilde{v}}_{i}‖ \leq ‖γ_{i}‖

,

{‖ε_{i}‖}_{\infty} \leq {‖\sqrt{2} ε_{i}‖}_{\infty} \leq ‖\sqrt{2} ε_{i}‖ \leq ‖γ_{i}‖

, then

{‖{\tilde{η}}_{i}‖}_{\infty} \leq ‖γ_{i} (0)‖

,

{‖{\tilde{v}}_{i}‖}_{\infty} \leq ‖γ_{i} (0)‖

, and

{‖ε_{i}‖}_{\infty} \leq ‖γ_{i} (0)‖

.

Since the desired speed in the velocity tracking controller is derived from the position controller, we have

{\dot{v}}_{i d} = {\dot{u}}_{i 1} = [v_{i} (t + h) - v_{i} (t)] / h

. Then, we take the infinity norm on

{\dot{v}}_{i d}

to obtain:

\begin{matrix} {‖{\dot{v}}_{i d}‖}_{\infty} & = {‖\frac{v_{i} (t + h) - v_{i} (t)}{h}‖}_{\infty} \leq \frac{{‖v_{i} (t + h)‖}_{\infty} + {‖v_{i} (t)‖}_{\infty}}{h} \leq \frac{2 {‖v_{i}‖}_{\infty}}{h} \\ \leq \frac{2 {‖v_{i}^{v i r} ({\overset{⌢}{x}}_{i 1})‖}_{\infty}}{h} \leq \frac{2 + \sqrt{2}}{h} ({\bar{η}}_{r 1} + {\bar{K}}_{i p} ‖γ_{i} (0)‖) = {\bar{v}}_{i d} \end{matrix}

(A13)

where

{‖v_{i}‖}_{\infty} \leq {‖v_{i}^{v i r} ({\overset{⌢}{x}}_{i 1})‖}_{\infty} \leq (1 + \frac{\sqrt{2}}{2}) ({\bar{η}}_{r 1} + {\bar{K}}_{i p} ‖γ_{i} (0)‖) = {\bar{v}}_{i}

. Consequently, the bound of the Coriolis and centripetal matrix

C_{i}^{*} (v_{i})

can be obtained by taking the infinity norm:

{‖C_{i}^{*} (v_{i})‖}_{\infty} \leq (2 + \sqrt{2}) {\bar{m}}_{i} ({\bar{η}}_{r 1} + {\bar{K}}_{i p} ‖γ_{i} (0)‖) = {\bar{c}}_{i}

(A14)

{‖D_{i}^{*} (v_{i})‖}_{\infty}

can be derived by the same principle:

{‖D_{i}^{*} (v_{i})‖}_{\infty} \leq {\bar{d}}_{1} + {\bar{d}}_{2} (1 + \frac{\sqrt{2}}{2}) ({\bar{η}}_{r 1} + {\bar{K}}_{i p} ‖γ_{i} (0)‖) = {\bar{d}}_{i}

(A15)

with

{\bar{d}}_{1} = \max \{|X_{u}|, |Y_{v}|, |Z_{w}|, |M_{q}|, |N_{r}|\}

,

{\bar{d}}_{2} = \max \{X_{|u| u}, Y_{|v| v}, Z_{|w| w}, M_{|q| q}, N_{|r| r}\}

. Taking infinity norms on both sides of the lumped disturbance

{\hat{τ}}_{i d} = M_{i}^{*} J_{i}^{- 1} (η_{i}) {\hat{d}}_{i}

, one obtains:

\begin{matrix} {‖{\hat{τ}}_{i d}‖}_{\infty} & = {‖M_{i}^{*} J_{i}^{- 1} (η_{i}) {\hat{d}}_{i}‖}_{\infty} = {‖M_{i}^{*} J_{i}^{- 1} (η_{i}) (d_{i} + e_{i 3})‖}_{\infty} \leq {‖τ_{i d}‖}_{\infty} + {‖M_{i}^{*} J_{i}^{- 1} (η_{i})‖}_{\infty} {‖e_{i 3}‖}_{\infty} \\ \leq ‖τ_{i d}‖ + (1 + \frac{\sqrt{2}}{2}) {\bar{m}}_{i} {‖ε_{i}‖}_{\infty} \leq {\bar{τ}}_{i d} + (1 + \frac{\sqrt{2}}{2}) {\bar{m}}_{i} ‖γ_{i} (0)‖ . \end{matrix}

(A16)

Based on the above analysis, taking the infinity norms on both sides of (21) yields:

\begin{matrix} {‖τ_{i}^{v i r} ({\overset{⌢}{x}}_{i 2})‖}_{\infty} & \leq {‖C_{i}^{*} (v_{i}) v_{i} + D_{i}^{*} (v_{i}) v_{i}‖}_{\infty} + {‖M_{i}^{*} {\dot{v}}_{i d}‖}_{\infty} + {‖M_{i}^{*} K_{i v} {\tilde{v}}_{i}‖}_{\infty} + {‖{\hat{τ}}_{i d}‖}_{\infty} \\ \leq ({\bar{c}}_{i} + {\bar{d}}_{i}) {‖v_{i}‖}_{\infty} + {\bar{m}}_{i} {‖{\dot{v}}_{i d}‖}_{\infty} + {\bar{m}}_{i} {\bar{K}}_{i v} {‖{\tilde{v}}_{i}‖}_{\infty} + {‖{\hat{τ}}_{i d}‖}_{\infty} \\ \leq ({\bar{c}}_{i} + {\bar{d}}_{i}) {\bar{v}}_{i} + {\bar{m}}_{i} {\bar{v}}_{i d} + [{\bar{K}}_{i v} + (1 + \frac{\sqrt{2}}{2})] {\bar{m}}_{i} ‖γ_{i} (0)‖ + {\bar{τ}}_{i d} . \end{matrix}

(A17)

If (25) can be satisfied, then the relation

{‖τ_{i}^{v i r} ({\overset{⌢}{x}}_{i 2})‖}_{\infty} \leq τ_{i}^{\max}

can hold. This ensures that

{‖{\hat{u}}_{i 2}^{v i r} ({\overset{⌢}{x}}_{i 2})‖}_{\infty} \leq u_{i 2}^{\max}

holds at all moments, which concludes the proof. □
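
The chain of bounds (A13) to (A17) can be collected into a single function that returns the left-hand side of (25). The sketch below does so. All input values are hypothetical and chosen only to illustrate how the terms combine. In particular, the term proportional to 1/h grows as the sampling period shrinks.

```python
import math

S = 1 + math.sqrt(2) / 2   # the factor (1 + sqrt(2)/2)


def condition_25_lhs(eta_r1_bar, k_ip_bar, k_iv_bar, gamma0, h,
                     m_bar, d1_bar, d2_bar, tau_d_bar):
    """Left-hand side of (25), assembled from the bounds (A13) to (A16)."""
    base = eta_r1_bar + k_ip_bar * gamma0
    v_bar = S * base                              # velocity bound below (A13)
    v_d_bar = (2 + math.sqrt(2)) / h * base       # reference acceleration, (A13)
    c_bar = (2 + math.sqrt(2)) * m_bar * base     # Coriolis bound, (A14)
    d_bar = d1_bar + d2_bar * S * base            # damping bound, (A15)
    return ((c_bar + d_bar) * v_bar + m_bar * v_d_bar
            + (k_iv_bar + S) * m_bar * gamma0 + tau_d_bar)


# Hypothetical values; gamma0 stands for the norm of gamma_i(0).
params = dict(eta_r1_bar=0.55, k_ip_bar=0.1, k_iv_bar=1.0, gamma0=1.0,
              m_bar=10.0, d1_bar=5.0, d2_bar=5.0, tau_d_bar=1.0)

lhs_slow = condition_25_lhs(h=0.5, **params)    # longer sampling period
lhs_fast = condition_25_lhs(h=0.05, **params)   # shorter sampling period
feasible = lhs_slow <= 500.0                    # compared with a 500 N limit
```

For these assumed numbers the condition holds with a wide margin at h = 0.5. Reducing h by a factor of ten multiplies the reference-acceleration term by ten and raises the left-hand side accordingly.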

Appendix D

Proof of Theorem 4.

Since we have constructed a Lyapunov function

V_{i p} (x_{i 1})

that is continuously differentiable and radially unbounded, according to converse Lyapunov theorems [42], there exist functions

χ_{i k} (\cdot)

(k = 1, 2, 3) belonging to class

K_{\infty}

that satisfy the following inequalities:

χ_{i 1} (‖x_{i 1}‖) \leq V_{i p} (x_{i 1}) \leq χ_{i 2} (‖x_{i 1}‖)

(A18)

{{\dot{V}}_{i p}|}_{u_{i 1}^{v i r} (x_{i 1})} \leq - χ_{i 3} (‖x_{i 1}‖)

(A19)

In view of the stability constraint (11f) and the optimal solution

κ_{i 1} (s)

implemented at each sampling period, we obtain:

{{\dot{V}}_{i p}|}_{u_{i 1} (x_{i 1})} \leq {{\dot{V}}_{i p}|}_{u_{i 1}^{v i r} (x_{i 1})} \leq - χ_{i 3} (‖x_{i 1}‖) .

(A20)

From the Lyapunov argument of Theorem 4.8 in [42], we conclude that the position tracking subsystem is asymptotically stable within an attraction region

R_{i 1}

.

\{x_{i 1} \in R_{i 1}^{n}| (1 + \frac{\sqrt{2}}{2}) ({\bar{η}}_{r 1} + {\bar{K}}_{i p} ‖{\tilde{η}}_{i} (0)‖) \leq v_{i}^{\max}\} .

(A21)

Similarly, the following conclusion can be obtained: the velocity tracking subsystem under Algorithm 1 is asymptotically stable within an attraction region

R_{i 2}

.

\{x_{i 2} \in R_{i 2}^{n}| ({\bar{c}}_{i} + {\bar{d}}_{i}) {\bar{v}}_{i} + {\bar{m}}_{i} {\bar{v}}_{i d} + [{\bar{K}}_{i v} + (1 + \frac{\sqrt{2}}{2})] {\bar{m}}_{i} ‖γ_{i} (0)‖ + {\bar{τ}}_{i d} \leq τ_{i}^{\max}\} .

(A22)

This ensures the stability of the overall AUV formation system. Since there are no other limitations on

{\bar{K}}_{i p}

and

{\bar{K}}_{i v}

, the attraction regions

R_{i 1}

and

R_{i 2}

can be arbitrarily large as long as the control gains are small enough. □

Remark A1.

It is noteworthy that the tracking performance of the backstepping-based virtual control law depends on the magnitude of the control gains. As seen in (14) and (21), smaller values of $\bar{K}_{ip}$ and $\bar{K}_{iv}$ result in slower convergence. However, thanks to the optimization process of the proposed DLMPC controller, even if smaller control gains are chosen to expand the attraction regions, the controller can still leverage the available thrust capability to achieve optimal control performance with respect to the cost function. This illustrates an advantage of DLMPC: it inherits the stability properties of the virtual controller while optimizing performance.
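The point of this remark can be illustrated on a deliberately simple toy problem. The sketch below is not the paper's DLMPC: it assumes a scalar integrator $\dot{e} = u$ with $|u| \leq u^{\max}$, the Lyapunov function $V = e^{2}/2$, a virtual law $u^{vir} = -Ke$, and a one-step horizon. The constraint $e\,u \leq e\,u^{vir}$ plays the role of the stability constraint, and all names and numbers are hypothetical.

```python
def lmpc_input(e, k, u_max, dt):
    """One-step input minimizing |e + dt*u| subject to
    |u| <= u_max and the Lyapunov constraint e*u <= e*(-k*e)."""
    if e == 0.0:
        return 0.0
    u_vir = -k * e
    # The Lyapunov constraint is u <= u_vir for e > 0 and u >= u_vir for e < 0.
    lo, hi = (-u_max, u_vir) if e > 0 else (u_vir, u_max)
    if lo > hi:
        raise ValueError("outside the attraction region: k*|e| > u_max")
    # Project the unconstrained minimizer -e/dt onto the feasible interval.
    return min(max(-e / dt, lo), hi)


def steps_to_converge(policy, e0, dt=0.1, tol=1e-3, max_steps=10_000):
    """Count forward-Euler steps until |e| < tol under the given policy."""
    e = e0
    for n in range(max_steps):
        if abs(e) < tol:
            return n
        e = e + dt * policy(e)
    return max_steps


# Hypothetical small gain, input bound, sampling period, and initial error.
k, u_max, dt, e0 = 0.2, 1.0, 0.1, 4.0
n_virtual = steps_to_converge(lambda e: -k * e, e0, dt)
n_lmpc = steps_to_converge(lambda e: lmpc_input(e, k, u_max, dt), e0, dt)
print("virtual law steps:", n_virtual)
print("Lyapunov-constrained optimizer steps:", n_lmpc)
```

In this toy setting the small gain keeps $K|e(0)| \leq u^{\max}$, so the initial error is admissible, while the optimizer still uses the full input range and converges in far fewer steps than the virtual law alone.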

References

1. Shi, Y.; Shen, C.; Fang, H.; Li, H. Advanced control in marine mechatronic systems: A survey. IEEE-ASME Trans. Mechatron. 2017, 22, 1121–1131.
2. Wang, C.; Cai, W.; Lu, J.; Ding, X.; Yang, J. Design, modeling, control, and experiments for multiple AUVs formation. IEEE Trans. Autom. Sci. Eng. 2021, 19, 2776–2787.
3. Yu, H.; Zeng, Z.; Guo, C. Coordinated formation control of discrete-time autonomous underwater vehicles under alterable communication topology with time-varying delay. J. Mar. Sci. Eng. 2022, 10, 712.
4. Chen, G.; Shen, Y.; Qu, N.; He, B. Path planning of AUV during diving process based on behavioral decision-making. Ocean Eng. 2021, 234, 109073.
5. Cui, R.; Ge, S.S.; How, B.V.E.; Choo, Y.S. Leader–follower formation control of underactuated autonomous underwater vehicles. Ocean Eng. 2010, 37, 1491–1502.
6. He, X.; Geng, Z. Globally convergent leaderless formation control for unicycle-type mobile robots. IET Control Theory Appl. 2020, 14, 2651–2662.
7. Zhen, Q.; Wan, L.; Li, Y.; Jiang, D. Formation control of a multi-AUVs system based on virtual structure and artificial potential field on SE(3). Ocean Eng. 2022, 253, 111148.
8. Hu, C.D.; Wu, D.F.; Liao, Y.X.; Hu, X. Sliding mode control unified with the uncertainty and disturbance estimator for dynamically positioned vessels subjected to uncertainties and unknown disturbances. Appl. Ocean Res. 2021, 109, 102564.
9. Zhang, W.; Wu, W.; Li, Z.; Du, X.; Yan, Z. Three-Dimensional Trajectory Tracking of AUV Based on Nonsingular Terminal Sliding Mode and Active Disturbance Rejection Decoupling Control. J. Mar. Sci. Eng. 2023, 11, 959.
10. Peng, Z.; Wang, J.; Wang, J. Constrained control of autonomous underwater vehicles based on command optimization and disturbance estimation. IEEE Trans. Ind. Electron. 2018, 66, 3627–3635.
11. Miao, J.; Sun, X.; Chen, Q.; Zhang, H.; Liu, W.; Wang, Y. Robust Path-Following Control for AUV under Multiple Uncertainties and Input Saturation. Drones 2023, 7, 665.
12. Zhang, Y.; Liu, X.; Luo, M.; Yang, C. MPC-based 3-D trajectory tracking for an autonomous underwater vehicle with constraints in complex ocean environments. Ocean Eng. 2019, 189, 106309.
13. Yong, K.; Chen, M.; Wu, Q. Anti-disturbance control for nonlinear systems based on interval observer. IEEE Trans. Ind. Electron. 2019, 67, 1261–1269.
14. Chen, C.; Wen, C.; Liu, Z.; Xie, K.; Zhang, Y.; Chen, C.P. Adaptive consensus of nonlinear multi-agent systems with non-identical partially unknown control directions and bounded modelling errors. IEEE Trans. Autom. Control 2016, 62, 4654–4659.
15. Guo, P.; Lyu, M.R.; Chen, C.L.P. Regularization parameter estimation for feedforward neural networks. IEEE Trans. Syst. Man Cybern. Part B-Cybern. 2003, 33, 35–44.
16. Han, J. From PID to active disturbance rejection control. IEEE Trans. Ind. Electron. 2009, 56, 900–906.
17. Fernandes, D.D.A.; Sørensen, A.J.; Pettersen, K.Y.; Donha, D.C. Output feedback motion control system for observation class ROVs based on a high-gain state observer: Theoretical and experimental results. Control Eng. Pract. 2015, 39, 90–102.
18. Lamraoui, H.C.; Qidan, Z. Path following control of fully-actuated autonomous underwater vehicle in presence of fast-varying disturbances. Appl. Ocean Res. 2019, 86, 40–46.
19. Basin, M.; Yu, P.; Shtessel, Y. Finite-and fixed-time differentiators utilising HOSM techniques. IET Control Theory Appl. 2017, 11, 1144–1152.
20. Li, B.; Hu, Q.; Yang, Y. Continuous finite-time extended state observer based fault tolerant control for attitude stabilization. Aerosp. Sci. Technol. 2019, 84, 204–213.
21. Cai, X.; Zhu, X.; Yao, W. FTESO-adaptive neural network based safety control for a quadrotor UAV under multiple disturbances: Algorithm and experiments. Int. J. Robot. Res. Appl. 2024, 51, 20–33.
22. Xia, Y.; Xu, K.; Huang, Z.; Wang, W.; Xu, G.; Li, Y. Adaptive energy-efficient tracking control of a X rudder AUV with actuator dynamics and rolling restriction. Appl. Ocean Res. 2022, 118, 102994.
23. Xia, G.; Zhang, Y.; Zhang, W.; Zhang, K.; Yang, H. Robust adaptive super-twisting sliding mode formation controller for homing of multi-underactuated AUV recovery system with uncertainties. ISA Trans. 2022, 130, 136–151.
24. Li, H.; Xie, P.; Yan, W. Receding horizon formation tracking control of constrained underactuated autonomous underwater vehicles. IEEE Trans. Ind. Electron. 2016, 64, 5004–5013.
25. Zhang, M.; Yan, Z.; Zhou, J.; Yue, L. Distributed Dual Closed-Loop Model Predictive Formation Control for Collision-Free Multi-AUV System Subject to Compound Disturbances. J. Mar. Sci. Eng. 2023, 11, 1897.
26. Liu, C.; Sun, T.; Hu, Q.Z. Synchronization Control of Dynamic Positioning Ships Using Model Predictive Control. J. Mar. Sci. Eng. 2021, 9, 1239.
27. Shen, C.; Shi, Y.; Buckham, B. Trajectory tracking control of an autonomous underwater vehicle using Lyapunov-based model predictive control. IEEE Trans. Ind. Electron. 2017, 65, 5796–5805.
28. Mahmood, M.; Mhaskar, P. Lyapunov-based model predictive control of stochastic nonlinear systems. Automatica 2012, 48, 2271–2276.
29. Liu, J.; Chen, X.; de la Pena, D.M.M.; Christofides, P.D. Iterative distributed model predictive control of nonlinear systems: Handling asynchronous, delayed measurements. IEEE Trans. Autom. Control 2011, 57, 528–534.
30. Wei, H.; Shen, C.; Shi, Y. Distributed Lyapunov-based model predictive formation tracking control for autonomous underwater vehicles subject to disturbances. IEEE Trans. Syst. Man Cybern. Syst. 2019, 51, 5198–5208.
31. Meng, C.; Zhang, W.; Du, X. Finite-time extended state observer based collision-free leaderless formation control of multiple AUVs via event-triggered control. Ocean Eng. 2023, 268, 113605.
32. Yan, Z.; Gong, P.; Zhang, W.; Li, Z.; Teng, Y. Autonomous underwater vehicle vision guided docking experiments based on L-shaped light array. IEEE Access 2019, 7, 72567–72576.
33. Fossen, T.I. Handbook of Marine Craft Hydrodynamics and Motion Control; John Wiley & Sons: Hoboken, NJ, USA, 2011.
34. Xu, J.; Cui, Y.; Xing, W.; Huang, F.; Yan, Z.; Wu, D.; Chen, T. Anti-disturbance fault-tolerant formation containment control for multiple autonomous underwater vehicles with actuator faults. Ocean Eng. 2022, 266, 112924.
35. Zhang, Z.; Lin, M.; Li, D. A double-loop control framework for AUV trajectory tracking under model parameters uncertainties and time-varying currents. Ocean Eng. 2022, 265, 112566.
36. Chen, C.W.; Lu, Y.F. Computational fluid dynamics study of water entry impact forces of an airborne-launched, axisymmetric, disk-type Autonomous underwater hovering vehicle. Symmetry 2019, 11, 1100.
37. Cui, R.; Chen, L.; Yang, C.; Chen, M. Extended state observer-based integral sliding mode control for an underwater robot with unknown disturbances and uncertain nonlinearities. IEEE Trans. Ind. Electron. 2017, 64, 6785–6795.
38. Majeed, A.; Rauf, I. Graph Theory: A Comprehensive Survey about Graph Theory Applications in Computer Science and Social Networks. Inventions 2020, 5, 10.
39. Liu, J.; de la Peña, D.M.; Christofides, P.D. Distributed model predictive control of nonlinear systems subject to asynchronous and delayed measurements. Automatica 2010, 46, 52–61.
40. Yan, Z.; Wang, M.; Xu, J. Integrated guidance and control strategy for homing of unmanned underwater vehicles. J. Frankl. Inst.-Eng. Appl. Math. 2019, 356, 3831–3848.
41. Hu, Q.; Jiang, B. Continuous finite-time attitude control for rigid spacecraft based on angular velocity observer. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 1082–1092.
42. Khalil, H.K. Nonlinear Systems; Prentice Hall Inc.: Upper Saddle River, NJ, USA, 2002.

Figure 1. AUV coordinate system.

Figure 2. Flow diagram of Algorithm 1.

Figure 3. Structure of communication topology.

Figure 4. The estimation error norm $\|e_{i3}\|$ for the lumped disturbance of the ith AUV.
