Model Predictive Control Technique for Ducted Fan Aerial Vehicles Using Physics-Informed Machine Learning

Manzoor, Tayyab; Pei, Hailong; Sun, Zhongqi; Cheng, Zihuan

doi:10.3390/drones7010004

Open AccessArticle

Model Predictive Control Technique for Ducted Fan Aerial Vehicles Using Physics-Informed Machine Learning

¹

Key Laboratory of Autonomous Systems and Networked Control, Ministry of Education, Unmanned Aerial Vehicle Systems Engineering Technology Research Center of Guangdong, School of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, China

²

School of Automation, Beijing Institute of Technology, Beijing 100081, China

^*

Author to whom correspondence should be addressed.

Drones 2023, 7(1), 4; https://doi.org/10.3390/drones7010004

Submission received: 28 November 2022 / Revised: 18 December 2022 / Accepted: 20 December 2022 / Published: 21 December 2022

Download

Browse Figures

Versions Notes

Abstract

:

This paper proposes a model predictive control (MPC) approach for ducted fan aerial robots using physics-informed machine learning (ML), where the task is to fully exploit the capabilities of the predictive control design with an accurate dynamic model by means of a hybrid modeling technique. For this purpose, an indigenously developed ducted fan miniature aerial vehicle with adequate flying capabilities is used. The physics-informed dynamical model is derived offline by considering the forces and moments acting on the platform. On the basis of the physics-informed model, a data-driven ML approach called adaptive sparse identification of nonlinear dynamics is utilized for model identification, estimation, and correction online. Thereafter, an MPC-based optimization problem is computed by updating the physics-informed states with the physics-informed ML model at each step, yielding an effective control performance. Closed-loop stability and recursive feasibility are ensured under sufficient conditions. Finally, a simulation study is conducted to concisely corroborate the efficacy of the presented framework.

Keywords:

aerial robotics; flight control; machine learning; model predictive control; trajectory tracking; UAV

1. Introduction

1.1. Literature Review

The ducted fan aerial vehicles (DFAVs) are a type of vertical take-off and landing platforms that have the innate capabilities of a helicopter and fixed-wing aircraft [1,2]. These airborne vehicles are used in a wide number of applications in different areas such as transportation, inspection, aerial surveillance, and manipulation [3,4]. Due to their annular fuselage mechanism called duct [5], these aerial robots are known for their safety in congested, cluttered, and hazardous environments [6,7]. Having vector thrust capabilities [8], DFAVs also have several advantages over other configurations, such as fixed-wing platforms, helicopters, open-rotor, and shrouded quadrotors [9]. However, complex flow distribution makes it arduous to model these aircraft accurately [10], consequently making the flight control design more challenging.

Over the years, several flight control systems based on different strategies have been presented to control these vehicles [9]. In this regard, model predictive control (MPC) has proved itself as an advanced control approach that can address many issues associated with aerial applications [11]. One of the simplest techniques is linear MPC, which has been employed for the purpose of attitude control for these aerial platforms [12,13]. An improved unified control framework based on linear MPC for position and attitude control with Kalman filter as a disturbance observer (DOB) has also been studied and validated [14]. Moreover, robust MPC with a DOB and adaptive MPC with two DOBs have been presented to compensate for the effect of model inaccuracies and disturbances [15]. For achieving similar tasks with some improvements compared to robust MPC [15], composite disturbance rejection approaches based on MPC, referred to as MPC-based compound flight control (MPC-CFC), have been proposed [16,17].

In the aforementioned works, some problems can be solved to improve the control performance of future MPC-driven control systems.

The performance of MPC depends on the accurate model of the system dynamics [18,19], which is challenging to obtain. This problem is even more prevalent in flight control, where varying flight conditions, disturbances, faults, and model uncertainties may lead to ineffective control performance.
In the disturbance rejection MPC framework based on DOB (e.g., [16,17]), the dynamics of the disturbance profile cannot be faster than the dynamics of DOB, which implies that this type of composite method is not suitable to estimate and compensate for fast time-varying disturbances.

There are two types of dominant techniques for dynamical system modeling [20]: (1) Physics-informed dynamics, where the equations of motion are derived from governing physical rules, and (2) machine learning (ML) based data-driven system modeling [21]. To construct an end-to-end dynamical model, these techniques are employed separately. There is a growing potential for integrating data-driven techniques based on ML in the aerospace industry [22]. One of the commonly used ML techniques is the neural network (NN) [23,24,25,26]. Nevertheless, NN-based methods need massive training data, which may be uninterpretable and difficult to include constraints. These reasons limit their utilization for online system identification [27]. One alternative is to use sparse identification of nonlinear dynamics (SINDy) developed in [28], which has shown effective control and predictive capabilities with MPC in the presence of noise and is more suitable for low and medium data than the NN technique. Moreover, it has demonstrated strong parametric robustness, effective training, and execution time [27]. While the SINDy algorithm is an effective strategy for multiple learning, an adaptive SINDy technique is more suitable that can efficiently update and correct the dynamics online in a repetitive fashion [29,30]. However, it is challenging to seamlessly capture the correct model with ML techniques alone to satisfy the conservation laws and constraints. On the other hand, physics-informed dynamical models can capture constraints and conservation laws. However, they are either too simple or ultra-complex that may become too expensive to handle online due to high computational costs, particularly for varying system dynamics for MPC. In this aspect, a physics-informed ML hybrid-modeling technique can be employed to seamlessly incorporate physical models and data in all circumstances involving high-dimensional, physically understood, and uncertain contexts [31,32,33,34,35,36].

1.2. Contributions

Motivated by the above discussion, this article presents an MPC-based control strategy for DFAVs via a hybrid modeling approach called physics-informed ML. First, the physics-informed model (nominal system) for the aerial robot is constructed by considering the forces and moments offline. Moreover, it is considered that the nominal system is not affected by any disturbances or uncertainties. Thereafter, a data-driven approach called adaptive SINDy is employed for estimating and compensating for any parametric changes online during the flight. The optimization problem is solved online, where in each step, the physics-informed system is updated by the physics-informed ML. This way, the control action applied to the aerial vehicle is optimal with respect to the current state. In Table 1, the advantages and disadvantages of existing techniques over the proposed scheme are provided. The contributions of this manuscript are expressed in the following manner:

To fully exploit the potential of MPC, an online-based predictive control algorithm using physics-informed ML without the utilization of DOB is presented, unlike [15,16,17]. By employing the methodology in such a way, the inherent issues encountered in the disturbance rejection MPC framework [16,17] can be solved.
For efficient response and to avoid any usual computational complexity problem, only the data-driven part in the hybrid modeling is determined online for model correction. To further enhance the computational efficiency of the developed control algorithm, the physics-informed model is also updated by the physics-informed ML model in each step while solving the optimization problem.
Unlike [12,13,14,15], theoretical properties such as recursive feasibility and closed-loop stability with constraints satisfaction under sufficient conditions are derived.
In contrast to the existing disturbance rejection framework [16], the designed approach demonstrates effective performance. Furthermore, the constructed approach can be implemented in other robotics systems to attain similar goals.

1.3. Organization

In Section 2, the problem formulation is constructed in the physics-informed modeling and physics-informed ML model subsections. Furthermore, control objectives are defined, and a few assumptions with preliminaries are established for smooth control operation. The control approach includes MPC development and its feasibility and stability proofs and is provided in Section 3. Section 4 illustrates the numerical implementation that includes comparative analysis with some discussion. Section 5 concludes the article.

1.4. Notation

N

and

R

denote all non-negative integers and real space, respectively.

∥ (.) ∥ ≜ \sqrt{{(.)}^{T} (.)}

is the Euclidean norm.

diag {{(.)}_{11}, {(.)}_{22}, . . ., {(.)}_{n}}

denotes the diagonal matrix with entries

{(.)}_{11}, \dots, {(.)}_{n} \in R

. Q-weighted norm is referred to as

{∥ (.) ∥}_{Q} ≜ \sqrt{{(.)}^{T} Q (.)}

, where

Q

denotes a positive definite matrix. Consider any given system state/input as ∘,

\circ (τ | t_{l})

means ∘ at

τ

predicted at

t_{l}

. Superscript

\circ^{*}

is utilized to define state/input after optimization. Accents

\tilde{\circ}

and

\hat{\circ}

represent the nominal state/nominal control input and estimated state/input, respectively. In the entire manuscript, physics-informed dynamics and physics-informed ML are referred to as nominal and real systems, and these terms are used interchangeably. A bold font style is employed to represent vectors and matrices.

2. Problem Formulation

The hybrid modeling procedure is divided into two phases, i.e., physics-informed modeling (offline) and data-driven scheme (online), which is based on lots of physics and less data, shown in Figure 1.

2.1. Physics-Informed Modelling

In Figure 2, the ducted fan aerial robot is represented by two coordinate frames, i.e., body-fixed frame

B_{f} ≜ {B_{o}, B_{x}, B_{y}, B_{z}}

and inertial frame

I_{f} ≜ {I_{o}, I_{x}, I_{y}, I_{z}}

.

ξ^{I} = {[ξ_{x}^{I}, ξ_{y}^{I}, ξ_{z}^{I}]}^{T} \in R^{3}

,

V_{I} = {[V_{x}^{I}, V_{y}^{I}, V_{z}^{I}]}^{T} \in R^{3}

,

V_{B} = {[V_{x}^{B}, V_{y}^{B}, V_{z}^{B}]}^{T} \in R^{3}

and

ω_{B} = {[p, q, r]}^{T} \in R^{3}

are position, linear velocity in

I_{f}

, linear velocity in

B_{f}

, and angular velocity, respectively.

Δ_{w i n d} = {[Δ_{x}^{B} (t), Δ_{y}^{B} (t), Δ_{z}^{B} (t)]}^{T}

are disturbances acting on the flying robot.

Θ = {[ϕ, θ, ψ]}^{T} \in R^{3}

denotes the attitude of the aircraft, where

ϕ

,

θ

and

ψ

are roll, pitch, and yaw angle, respectively. From these rotation angles, the rotational matrix

T_{B I} \in S O (3)

from

B_{f}

and

I_{f}

can be defined as (see among others, e.g., [9]):

T_{B I} = [\begin{matrix} C_{ψ} C_{θ} & C_{ψ} S_{θ} S_{ϕ} - S_{ψ} C_{ϕ} & C_{ψ} S_{θ} C_{ϕ} + S_{ψ} S_{ϕ} \\ S_{ψ} C_{θ} & S_{ψ} S_{θ} S_{ϕ} + C_{ψ} C_{ϕ} & S_{ψ} S_{θ} C_{ϕ} - C_{ψ} S_{ϕ} \\ - S_{θ} & C_{θ} S_{ϕ} & C_{θ} C_{ϕ} \end{matrix}],

(1)

where

C_{(*)} = cos (*)

and

S_{(*)} = sin (*)

, with

(*)

denotes any angle. The rotational kinematics of the airborne platform is described in the following form:

\dot{Θ} = Ω ω_{B}, Ω = [\begin{matrix} C_{θ} & 0 & - S_{θ} \\ S_{θ} S_{ϕ} / C_{ϕ} & 1 & C_{θ} S_{ϕ} / C_{ϕ} \\ S_{θ} / C_{ϕ} & 0 & C_{ϕ} C_{ϕ} \end{matrix}] .

(2)

Other flight components in Figure 2 are expressed as follows:

\begin{matrix} V_{w i n d} & = \sqrt{{(V_{x}^{B} - Δ_{x}^{B})}^{2} + {(V_{y}^{B} - Δ_{y}^{B})}^{2} + {(V_{z}^{B} - Δ_{z}^{B})}^{2}}, \\ α & = - arccos (\frac{V_{z}^{B} - Δ_{z}^{B}}{V_{w i n d}}), 0 \leq α \leq π, \\ β & = arctan (\frac{V_{y}^{B} - Δ_{y}^{B}}{V_{x}^{B} - Δ_{x}^{B}}), - \frac{π}{2} \leq β \leq \frac{π}{2}, \end{matrix}

(3)

where

V_{w i n d}

,

α

, and

β

are denoted as airspeed, angle of attack, and side-slip angle, respectively. The translational and rotational dynamics of the VTOL aircraft can be formulated as

\{\begin{matrix} {\ddot{ξ}}_{I} = {\dot{V}}_{I} = T_{B I} F_{B} / m + G_{a}, \\ {\dot{ω}}_{B} = I^{- 1} (M_{B} + I ω_{B} \times ω_{B}), \end{matrix}

(4)

where m,

F_{B}

,

M_{B}

, and

I

are the mass, total force, total moments, and diagonal matrix consisting of the moment of inertia, respectively. Moreover,

G_{a} = {[\begin{matrix} 0 & 0 & g \end{matrix}]}^{T}

, and g is the gravitational acceleration. Velocity of the airflow exhausted from the fan (

V_{a f}

) can be expressed as:

V_{a f} = - \frac{V_{z}^{B} - Δ_{z}^{B}}{2} + \sqrt{{(\frac{V_{z}^{B} - Δ_{z}^{B}}{2})}^{2} + \frac{T}{2 ρ A_{f d}}},

(5)

where

T = A_{ω T} ω_{r}^{2}

is the thrust, and

ω_{r}

and

A_{ω T}

denote the rotation speed and thrust coefficient, respectively. Moreover,

ρ

and

A_{f d}

are the free air density and fan disk area, respectively. Next, we describe the local airspeed on left-wing (

V_{ℓ w}

) and right-wing (

V_{r w}

) with local

α

on left-wing (

α_{ℓ_{w}}

) and right-wing (

α_{r_{w}}

) are given as follows:

\begin{matrix} V_{ℓ_{w}} & = \sqrt{{(V_{x}^{B} + r ℓ_{w} - Δ_{x})}^{2} + {(V_{z}^{B} - Δ_{z})}^{2}}, \\ V_{r_{w}} & = \sqrt{{(V_{x}^{B} - r ℓ_{w} - Δ_{x})}^{2} + {(V_{z}^{B} - Δ_{z})}^{2}}, \\ α_{ℓ_{w}} & = C_{α_{l_{w}}} = - (V_{z}^{B} - Δ_{z}) / V_{ℓ_{w}}, 0 \leq α_{ℓ_{w}} \leq π, \\ α_{r_{w}} & = C_{α_{r_{w}}} = - (V_{z}^{B} - Δ_{z}) / V_{r_{w}}, 0 \leq α_{r_{w}} \leq π, \end{matrix}

(6)

where

ℓ_{w}

represents the lever arm from the

B_{z}

axis to the aerodynamic center of the left and right/left wing. Generally, an aircraft is affected by four main forces during flight (lift, weight, thrust, and drag), depicted in Figure 2c. In the current scenario, the

F_{B}

can be defined as follows:

\begin{matrix} F_{B} & = \underset{Thrust force}{\underset{⏟}{[\begin{matrix} 0 \\ 0 \\ - A_{ω T} ω_{r}^{2} \end{matrix}]}} + \underset{Duct - body force}{\underset{⏟}{V_{w i n d}^{2} [\begin{matrix} - (A_{L_{d}} C_{α} + A_{D_{d}} S_{α}) C_{β} \\ - (A_{L_{d}} C_{α} + A_{D_{d}} S_{α}) S_{β} \\ - A_{L_{d}} S_{α} + A_{D_{d}} C_{α} \end{matrix}]}} \\ + \underset{Wing force}{\underset{⏟}{[\begin{matrix} - (A_{L_{w}} V_{ℓ_{w}}^{2} C_{α_{ℓ_{w}}} + A_{L_{w}} V_{r_{w}}^{2} C_{α_{r_{w}}} + A_{D_{w}} V_{ℓ_{w}}^{2} S_{α_{ℓ_{w}}} + A_{D_{w}} V_{r_{w}}^{2} S_{α_{r_{w}}}) \\ 0 \\ - A_{L_{w}} V_{ℓ_{w}}^{2} S_{α_{ℓ_{w}}} - A_{L_{w}} V_{r_{w}}^{2} S_{α_{r_{w}}} + A_{D_{w}} V_{ℓ_{w}}^{2} C_{α_{ℓ_{w}}} + A_{D_{w}} V_{r_{w}}^{2} C_{α_{r_{w}}} \end{matrix}]}} \\ + \underset{Momentum drag}{\underset{⏟}{ρ A_{f d} V_{a f} ζ_{r a t i o} [\begin{matrix} V_{x}^{B} - Δ_{x}^{B} \\ V_{y}^{B} - Δ_{y}^{B} \\ 0 \end{matrix}]}}, \end{matrix}

(7)

where

A_{L_{d}}

and

A_{D_{d}}

are the lift and drag coefficients, respectively. Moreover,

A_{D_{w}}

and

A_{L_{w}}

represent the drag and lift coefficients of the half-wing section of the aircraft, respectively.

ζ_{r a t i o}

denotes the damping ratio and is defined as

ζ_{r a t i o} = \frac{V_{s x} - V_{s x}^{^{'}}}{V_{s x}} .

(8)

The total moment is expressed in the following manner:

\begin{matrix} M_{B} & = \underset{Control surfaces}{\underset{⏟}{A_{c s} V_{a f}^{2} [\begin{matrix} - l_{1} δ_{1} + l_{1} δ_{3} \\ - l_{1} δ_{2} + l_{1} δ_{4} \\ l_{2} δ_{1} + l_{2} δ_{2} + l_{2} δ_{3} + l_{2} δ_{4} \end{matrix}]}} + \underset{Fan torque}{\underset{⏟}{[\begin{matrix} 0 \\ 0 \\ k_{ω s} ω_{r}^{2} \end{matrix}]}} + \underset{Gyroscopic effect}{\underset{⏟}{I_{f} ω_{r} [\begin{matrix} - q \\ p \\ 0 \end{matrix}]}} \\ + \underset{Aerodynamic pitching moment}{\underset{⏟}{ϵ_{d r a g} ρ A_{f d} V_{a f} ζ_{r a t i o} [\begin{matrix} V_{x}^{B} - Δ_{x}^{B} \\ V_{y}^{B} - Δ_{y}^{B} \\ 0 \end{matrix}] + ϵ_{D} V_{w i n d}^{2} [\begin{matrix} - (A_{L_{d}} C_{α} + A_{D_{d}} S_{α}) C_{β} \\ - (A_{L_{d}} C_{α} + A_{D_{d}} S_{α}) S_{β} \\ - A_{L_{d}} S_{α} + A_{D_{d}} C_{α} \end{matrix}]}} \\ + \underset{Wing moment}{\underset{⏟}{[\begin{matrix} 0 \\ A_{M} V_{ℓ_{w}}^{2} + A_{M} V_{ℓ_{w}}^{2} \\ - ℓ_{w} A_{L_{w}} V_{ℓ_{w}}^{2} + ℓ_{w} A_{L_{w}} V_{r_{w}}^{2} \end{matrix}]}} + \underset{Anti - rotation torque}{\underset{⏟}{A_{a r} V_{a f}^{2}}}, \end{matrix}

(9)

where

k_{ω s}

,

ϵ_{d r a g}

,

ϵ_{D}

,

I_{f}

,

A_{a r}

,

l_{1}

,

l_{2}

,

A_{c s}

,

δ_{i}

, and

A_{M}

represent the rotation speed with respect to fan torque coefficient, lever arms of momentum drag, duct body force, fan inertia, constant coefficient, lever arms related to roll/pitch-axis, and yaw-axis, deflection angle to force coefficient, the deflection angle of i-th control surface, and moment coefficient of the half-wing section of the vehicle, respectively. Consider the drone’s system defined as follows:

{\dot{x}}_{s} = f_{s} (x_{s} (t), u_{s} (t); η_{s}), x_{s} (0) = x_{0},

(10)

where

x_{s} = {[ξ_{I} V_{I} Θ ω_{B}]}^{T} \in R^{12}

denotes the system states.

f_{s}

and

η_{s}

are the dynamics and the aircraft’s parameters. It is considered that the system (10) is directly affected by disturbances and uncertainties.

u_{s} (t) = {[T u_{s 2}^{T}]}^{T} \in R^{4}

represents control input, where

u_{s 2}

denotes the regulated moment input and is provided as follows:

u_{s 2} = [\begin{matrix} u_{11} \\ u_{12} \\ u_{13} \end{matrix}] = [\begin{matrix} - l_{1} δ_{1} + l_{1} δ_{3} \\ - l_{1} δ_{2} + l_{1} δ_{4} \\ l_{2} δ_{1} + l_{2} δ_{2} + l_{2} δ_{3} + l_{2} δ_{4} \end{matrix}] .

Next, by considering only the physics-informed model of the aircraft as a nominal system:

{\dot{\tilde{x}}}_{s} (t) = f_{s} ({\tilde{x}}_{s} (t), {\tilde{u}}_{s} (t)) .

(11)

The discrepancy modeling problem is derived from the difference between the quantity of interest (i.e., forces and moments) from a physics-informed model

ν_{m} (t)

and estimated value

ν_{o} (t)

. Hence, discrepancy

Δ ν (t)

is expressed as:

Δ ν (t) = ν_{o} (t) - ν_{m} (t) .

(12)

Under ideal conditions, the estimated value and physics-informed model may have the same magnitude in (12), i.e.,

Δ ν (t) \approx 0

. Nonetheless, such an outcome in actual flight may not be possible due to several factors.

2.2. Data-Driven ML-Adaptive SINDy

Inspired by the existing work on learning from discrepancy model [20] and integrating it with adaptive SINDy by [30] for compensating for any abrupt changes in the system dynamics during different flight conditions of the DFAV, a hybrid modeling approach based on physics-informed dynamics and adaptive SINDy is constructed, shown in Figure 3. The sudden changes in the adaptive SINDy involve only addition, deletion, and modifications. This is useful as it enables us to effectively determine the new and missing terms with less data than to identify the entire model from scratch. There are three basic kinds of model changes in the adaptive SINDy:

Modification: If the whole model is unchanged except for model parameters, then least square regression will be employed on the known model to find new parameters;
Deletion: If a few terms are removed, then sparse regression can be utilized on the sparse coefficients to identify which terms have been taken out;
Addition: To find the model error, the SINDy regression will identify the sparsest combination of inactive terms.

2.2.1. Baseline Model

By using the standard procedure of SINDy, we measure the required number of snapshots

m_{s}

of

x_{s}

and

u_{s}

and rearrange them into two data matrices

X_{s}

and

U_{s}

:

\begin{matrix} X_{s} = & [\begin{matrix} ξ_{x}^{I} (t_{1}) & ξ_{y}^{I} (t_{1}) & \dots & ψ (t_{1}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ ξ_{x}^{I} (t_{i}) & ξ_{y}^{I} (t_{i}) & \dots & ψ (t_{i}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ ξ_{x}^{I} (t_{m}) & ξ_{y}^{I} (t_{m}) & \dots & ψ (t_{m}) \end{matrix}], \\ U_{s} = & [\begin{matrix} T (t_{1}) & u_{11} (t_{1}) & \dots & u_{13} (t_{1}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ T (t_{i}) & u_{11} (t_{i}) & \dots & u_{13} (t_{i}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ T (t_{m}) & u_{11} (t_{m}) & \dots & u_{13} (t_{m}) \end{matrix}], \end{matrix}

(13)

where

t_{i}

is the sampling time. Simplify Equation (13) and transform it into the following form:

\begin{matrix} X_{s} & = {[x_{s 1} (t_{1}) x_{s 2} (t_{2}) x_{s 3} (t_{3}) . . . x_{m s} (t_{m})]}^{T}, \\ U_{s} & = {[u_{s 1} (t_{1}) u_{s 2} (t_{2}) u_{s 3} (t_{3}) . . . u_{m s} (t_{m})]}^{T}, \end{matrix}

(14)

where

u_{m s}

and

x_{m s}

are control and state vector at the mth sampling time. The value of the aircraft’s state is calculated by the numerical differentiation.

\begin{matrix} {\dot{X}}_{s} & = {[{\dot{x}}_{s 1} (t_{1}) {\dot{x}}_{s 2} (t_{2}) {\dot{x}}_{s 3} (t_{3}) . . . {\dot{x}}_{m s} (t_{m})]}^{T}, \\ = [\begin{matrix} {\dot{ξ}}_{x}^{I} (t_{1}) & {\dot{ξ}}_{y}^{I} (t_{1}) & \dots & \dot{ψ} (t_{1}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\dot{ξ}}_{x}^{I} (t_{i}) & {\dot{ξ}}_{y}^{I} (t_{i}) & \dots & \dot{ψ} (t_{i}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\dot{ξ}}_{x}^{I} (t_{m}) & {\dot{ξ}}_{y}^{I} (t_{m}) & \dots & \dot{ψ} (t_{m}) \end{matrix}] . \end{matrix}

(15)

The vehicle model depends primarily on the physics-informed modeling method with less concentration on the ML technique as it is mainly useful for any discrepancy and abrupt changes in the parametric values. Therefore, a similar assumption to [29] is required, i.e., either the obtained data are prefiltered through relevant frameworks [37,38] or it contains no noise interference. Next, a candidate library function

Ξ (X_{s}, U_{s})

is developed using the data matrices

X_{s}

and

U_{s}

, in which selected function is arbitrary, and is either a trigonometric or polynomial function. In general, polynomial functions are a relatively basic choice. Therefore, the complexity of the library can be increased by including trigonometric functions. Nevertheless, both functions and physics-informed knowledge of the vehicle will be utilized to construct the library function:

Ξ (X_{s}, U_{s}) = [1^{T}, X_{s}^{T}, {(X_{s} \otimes X_{s})}^{T}, {(X_{s} \otimes U_{s})}^{T}, \dots, S_{X_{s}^{T}}, S_{U_{s}^{T}}, \dots],

(16)

where

X_{s} \otimes X_{s}

and

X_{s} \otimes U_{s}

are defined as:

\begin{matrix} X_{s} \otimes X_{s} = [\begin{matrix} ξ_{x}^{I} ξ_{x}^{I} (t_{1}) & ξ_{x}^{I} ξ_{y}^{I} (t_{1}) & \dots & ξ_{y}^{I} ξ_{y}^{I} (t_{1}) & \dots & ψ^{2} (t_{1}) \\ ξ_{x}^{I} ξ_{x}^{I} (t_{2}) & ξ_{x}^{I} ξ_{y}^{I} (t_{2}) & \dots & ξ_{y}^{I} ξ_{y}^{I} (t_{2}) & \dots & ψ^{2} (t_{2}) \\ ⋮ & ⋮ & \dots & ⋮ & \dots & ⋮ \\ ξ_{x}^{I} ξ_{x}^{I} (t_{m}) & ξ_{x}^{I} ξ_{y}^{I} (t_{m}) & \dots & ξ_{y}^{I} ξ_{y}^{I} (t_{m}) & \dots & ψ^{2} (t_{m}) \end{matrix}], \\ X_{s} \otimes U_{s} = [\begin{matrix} ξ_{x}^{I} T (t_{1}) & ξ_{x}^{I} u_{11} (t_{1}) & \dots & ξ_{y}^{I} u_{11} (t_{1}) & \dots & ψ u_{13} (t_{1}) \\ ξ_{x}^{I} T (t_{2}) & ξ_{x}^{I} u_{11} (t_{2}) & \dots & ξ_{y}^{I} u_{11} (t_{2}) & \dots & ψ u_{13} (t_{2}) \\ ⋮ & ⋮ & \dots & ⋮ & \dots & ⋮ \\ ξ_{x}^{I} T (t_{m}) & ξ_{x}^{I} u_{11} (t_{m}) & \dots & ξ_{y}^{I} u_{11} (t_{m}) & \dots & ψ u_{13} (t_{m}) \end{matrix}] . \end{matrix}

(17)

The dynamics of the vehicle model only depend on the few nonlinear terms in practice. The sparse model of

f_{k} (.)

in system (10) can be set as follows:

f_{k} (X_{s}, U_{s}, χ_{k}) = \sum_{j = 0}^{p_{s}} χ_{k j} Λ_{j} (x_{s}, u_{s}), k = 1, 2, 3, \dots, 12,

(18)

where

p_{s} + 1

represents the number of candidate functions.

χ_{k j}

and

Λ_{j}

are the candidate functions of the j-th column in

Ξ (X_{s}, U_{s})

and the weighted coefficient of

Λ_{j} (x_{s}, u_{s})

related to the k-th state, i.e.,

χ_{k} = {[χ_{k 0}, χ_{k 1}, χ_{k 2}, \dots, χ_{k p_{s}}]}^{T}

, respectively. In the third step, sparse optimization is employed to determine a sparse model:

χ_{k} = \underset{χ_{k}}{arg min} {∥{\dot{X}}_{k s} - χ_{k} Ξ {(X_{s}, U_{s})}^{T}∥}_{2} + λ_{k} {∥χ_{k}∥}_{1},

(19)

where

{\dot{X}}_{k s}

and

λ_{k}

denote the k-th row of

\dot{X}

and sparsity-promoting hyper-parameter, respectively. The theoretical results for convergences of SINDy framework have been provided in [39]. The computed value of

χ_{k}

is stored in sparse matrix

Ψ

:

Ψ = [\begin{matrix} χ_{01} & χ_{02} & \dots & χ_{0 k} & \dots & χ_{0 n} \\ χ_{11} & χ_{12} & \dots & χ_{1 k} & \dots & χ_{1 n} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋱ & ⋮ \\ χ_{p 1} & χ_{p 2} & \dots & χ_{p k} & \dots & χ_{p n} \end{matrix}] = [χ_{1} χ_{2} \dots χ_{k} \dots χ_{12}] .

(20)

The baseline model of the DFAV’s dynamics can be written in the following expression:

X_{s} \approx Ξ (X_{s}, U_{s}) Ψ .

(21)

To summarize the entire process, the baseline model is determined, and a grid search is employed to identify the optimal hyper-parameter selection. In grid-search, all the combinations of hyper-parameters are evaluated, and the best available set is chosen. This process is executed one time to find the best set of hyper-parameters for future updates. The baseline model can be expressed as sparse coefficients in

χ_{0}

.

2.2.2. Estimation of Model Divergence

Due to various factors (e.g., flight condition, nonlinearity, turbulence, actuator faults, etc.), the model parameters may vary at different times, resulting in divergence problems. Hence, a prediction framework that can quickly estimate any variation in the vehicle model is needed. In this work, the classical-predictor–correction approach is utilized to detect the variation caused by the aforementioned factors [40]. The predictor step is executed over a time

\bar{τ}

during the interval

t, t + \bar{τ}

using a valid model at time t. The divergence of the predictor

{\hat{x}}_{s}

and measured state

x_{s}

is calculated at

t + τ

as

∥ x_{d i v} ∥ = ∥ {\hat{x}}_{s} (t + τ) - x_{s} (t + τ) ∥

. The main theme is to determine when the model and the related state measurements diverge faster as opposed to the predicted ones by the dynamics of the system. Among many techniques, the divergence of the trajectory can be expressed by the Lyapunov exponent:

λ = lim_{τ \to \infty} lim_{x_{d i v} (t_{0}) \to 0} \frac{〈log (\frac{x_{d i v} (t_{0} + τ)}{x_{d i v} (t_{0})})〉}{τ},

(22)

and its inverse sets of fastest time scale [30]. Since the information about the dynamical system for the prediction step, the Lyapunov exponent is computed via tangent space. The requirement for an effective detection necessitates computing tolerance

λ_{d i v}

and determining the divergence time. If the prediction horizon by the local Lyapunov exponent and provided time scale are inconsistent, a divergence issue will arise between the measurements and the model. In the start, when the predicted and measured values deviate, then the divergence horizon is defined as:

T_{d i v} (t) = \underset{t_{d i v}}{arg min} ∥ {\hat{x}}_{s} (t + t_{d i v}) - x_{s} (t + t_{d i v}) ∥ > ∥λ_{d i v}∥ .

(23)

Hence, it is assumed that, at time t, the real system and prediction model diverge if the model value and measured time scale differ:

{\bar{λ}}_{d i v} (t) > \frac{log (λ_{d i v}) - log (∥ x_{d i v} (t) ∥)}{T_{d i v} (t)},

(24)

where

λ_{d i v}

is the maximum value obtained from (22) during the time scale

[t, t + T_{d i v}]

, &

{\bar{λ}}_{d i v} = {〈 λ_{d i v} (t^{^{'}}) 〉}_{t^{^{'}} \in [t, t + T_{d i v}]}

with

λ_{d i v} (t) = \max (λ_{d i v_{1}}, λ_{d i v_{2}}, λ_{d i v_{3}}, \dots, λ_{d i v_{12}})

.

2.2.3. Adaptive Model Recovery

The following process is implemented for the quick recovery of model parameters. In the beginning, new data onto the existing sparse data

χ_{0}

are regressed to determine varying parameters. Thereafter, the deletion of terms is identified by executing the sparse regression on the columns of

Ξ

that in

χ_{0}

corresponds to the nonzero rows. In the presence of a residual error, a sparse model in the inactive columns of

Ξ

that corresponds to zero rows in

χ_{0}

is fit for this error. Through this process, new terms may be introduced. This procedure is only performed when there is a divergence issue present.

2.3. Control Objective

By considering the physics-informed model as a nominal system

{\tilde{x}}_{s}

and real system

x_{s}

obtained as a result of the physics-informed ML procedure, suppose the aerial vehicle is flying with speed

υ

, and the plane follows a reference trajectory

x_{ref}

in the presence of different factors (e.g., disturbances, uncertainties, actuator faults). From a control perspective, the tracking error of both control input

e_{u} = u_{s} - u_{ref}

and aircraft’s state

e_{x} = x_{s} - x_{ref}

needs to be minimized. Similarly,

{\tilde{e}}_{x}

and

{\tilde{e}}_{u}

represent nominal state and control input tracking error, respectively. Hence, the cost function is designed as follows:

\begin{matrix} J ({\tilde{e}}_{x} (t), {\tilde{e}}_{u} (t)) & = \int_{t}^{t + t_{p}} \underset{Stage \cos t}{\underset{⏟}{S_{c} ({\tilde{e}}_{x} (τ | t), {\tilde{e}}_{u} (τ | t)) d τ}} + \underset{Terminal penalty}{\underset{⏟}{T_{p} ({\tilde{e}}_{x} (t + t_{p} | t))}}, \\ J ({\tilde{e}}_{x} (t), {\tilde{e}}_{u} (t)) & = \int_{t}^{t + t_{p}} \underset{Stage cost}{\underset{⏟}{({∥{\tilde{e}}_{x} (τ | t)∥}_{Q}^{2} + {∥{\tilde{e}}_{u} (τ | t)∥}_{P}^{2}) d τ}} + \underset{Terminal penalty}{\underset{⏟}{{∥{\tilde{e}}_{x} (t + t_{p} | t)∥}_{R}^{2}}}, \end{matrix}

(25)

where

t_{p} = n_{s} t_{i}

represents the prediction horizon with

n_{s}

denoting the predictive step. Moreover,

P

,

Q

, and

R

are the positive definitive weighting matrices in (25).

For the smooth and effective functioning of controller design, some preliminaries are needed before proceeding to the main control framework. Recall the nominal system (11), where only physics-informed modeling is considered.

Assumption 1.

It is assumed that

f_{s} ({\tilde{x}}_{s} (t), {\tilde{u}}_{s} (t))

is Lipschitz continuous with

{\tilde{u}}_{s} \in U_{MPC}

is locally Lipschitz with a constant

L

in

{\tilde{x}}_{s} (t)

. Hence, the following condition

∥f_{s} ({\tilde{x}}_{s 1}, {\tilde{u}}_{s}) - f_{s} ({\tilde{x}}_{s 2}, {\tilde{u}}_{s})∥

⩽ L ∥{\tilde{x}}_{s 1} - {\tilde{x}}_{s 2}∥

holds.

Assumption 2.

We assume that nominal system (11) is known and differentiable with all the initial states available. Thus, one can use Jacobian linearization at the origin, which is assumed to be stabilizable. With this assumption, a linear state feedback control law

u_{s} = K {\tilde{e}}_{x}

can be obtained in such a form that is asymptotically stable if no disturbances are present.

To further elaborate on the nature of the control system, Assumption A2 from [16] with an additional condition can be adopted in the following assumption:

Assumption 3.

(i) For the nominal tracking error, the terminal controller and terminal region

T

are such that, if

{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l}) \in T

, then by applying

{\tilde{u}}_{s} (τ | t_{l + 1}) = K {\tilde{e}}_{x} (τ | t_{l + 1})

to the vehicle over the time interval

τ \in [t_{l} + t_{p}, t_{l + 1} + t_{p})

, it satisfies the following expressions of the locally stabilizing controller:

{\tilde{e}}_{x} (τ | t_{l}) \in T, {\tilde{u}}_{s} = K {\tilde{e}}_{x} \in U_{MPC} .

(26)

(ii) the condition related to stage and terminal cost is provided as follows:

{\dot{T}}_{p} ({\tilde{e}}_{x} (τ | t_{l})) + S_{c} ({\tilde{e}}_{x} (τ | t_{l}), {\tilde{e}}_{u} (τ | t_{l})) \leq 0 .

(27)

Remark 1.

Assumption 3 in (26) is a standard assumption employed in the MPC community [41]. However, in the current scenario, its usage is only restricted to an ideal situation, and the linear state feedback control law in Assumption 2 is only available if the tracking error lies in the terminal region, which is not always the case in the control system for an aircraft.

Several techniques to determine the terminal region have been developed over the years (e.g., [42,43,44]). Inspired by [16], a terminal region is defined as:

T = \{{\tilde{e}}_{x} : {\tilde{F}}_{1} | {\tilde{e}}_{ξ} | + {\tilde{F}}_{2} | {\tilde{e}}_{V} | + {\tilde{F}}_{3} | {\tilde{e}}_{Θ} | + {\tilde{F}}_{4} | {\tilde{e}}_{ω} | < 1 - ϰ\},

(28)

where

ϰ

can be appropriately selected based on the control design.

\tilde{F}

and

{\tilde{e}}_{(•)}

are the feedback gains and state error between reference and actual trajectory with parameters satisfying

p_{ι} q_{ι} \leq 0.25

,

{\tilde{F}}_{ι} \in (\frac{1 \pm \sqrt{1 - 4 p_{ι} q_{ι}}}{p_{ι} q_{ι}})

,

ι \in {1, 2, 3, . . ., 12}

.

Furthermore, disturbances are also present in the system. Therefore, an assumption is needed to prove the recursive feasibility and stability in the subsequent section.

Assumption 4.

It is assumed that turbulence

Δ_{ext}

may also be present along with

Δ_{w i n d}

. Hence, the dynamics of total disturbances acting on the aircraft are defined as:

\begin{matrix} {\dot{Δ}}_{total} & = Δ_{ext} + Δ_{w i n d}, \\ = {[Δ_{v} Δ_{ω}]}^{T} + Δ_{w i n d}, \end{matrix}

(29)

where

Δ_{v}

and

Δ_{ω}

denote the turbulence acting on the translational and rotational components of the vehicle. It is assumed that the total disturbances are bounded, i.e.,

∥ Δ_{total} ∥ \leq ℷ

, which can be fast-time varying unlike [16].

Furthermore, crucial definitions related to stability proof are given as follows:

Definition 1.

[45] The tracking error system is input-to-state stable (ISS), if there exists a

K L

function

Γ (\cdot, \cdot) : R_{\geq 0} \times R_{\geq 0} \to R

&

K

function

Υ (\cdot)

such that

t \geq 0

, it satisfies that

∥e_{x} (t)∥ \leq Γ (∥e_{x} (t_{0})∥, t) + Υ (ℷ) .

(30)

Definition 2.

[45] A function

V (.)

is referred to as ISS-Lyapunov function for tracking error system, if there exist

K_{\infty}

functions

ℶ_{i} (.)

, and

K

function

ℸ (.)

such that

\forall e_{x} \in R^{2}

ℶ_{1} (∥e_{x} (t_{l})∥) \leq V (e_{x} (t_{l})) \leq ℶ_{2} (∥e_{x} (t_{l})∥),

(31)

V (e_{x} (t_{l + 1})) - V (e_{x} (t_{l})) \leq - ℶ_{3} (∥e_{x} (t_{l})∥) + ℸ (ℷ) .

(32)

If the Definitions 1 and 2 satisfy and no disturbances are acting on the aerial vehicle, then the tracking error also vanishes [46].

3. Control Framework

This section explains the presented control approach, illustrated in Figure 4. Reference trajectory

P

is employed to generate a particular flight envelop in different scenarios during the time interval

t \in [0, 10]

, which is expressed as:

P (t) = {[ξ_{ref}, V_{ref}, η_{ref}, ω_{ref}]}^{T} .

(33)

The optimization control problem is solved at a time sequence

\{t_{l} | l \in N, t_{l + 1} - t_{l} = t_{i}\}

:

Problem 1.

\begin{matrix} \min_{{\tilde{u}}_{s} (τ | t_{l})} & J ({\tilde{e}}_{x} (t), {\tilde{e}}_{u} (t)), \end{matrix}

(34a)

\begin{matrix} subject to & {\tilde{x}}_{s} (t_{l} | t_{l}) = x_{s} (t_{l}), \end{matrix}

(34b)

\begin{matrix} {\dot{\tilde{x}}}_{s} (τ | t_{l}) = f_{s} ({\tilde{x}}_{s} (τ | t_{l}), {\tilde{u}}_{s} (τ | t_{l})), \end{matrix}

(34c)

\begin{matrix} {\tilde{u}}_{s} (τ | t_{l}) \in U_{MPC}, \end{matrix}

(34d)

\begin{matrix} {\tilde{x}}_{s} (τ | t_{l}) \in X, \end{matrix}

(34e)

\begin{matrix} ∥{\tilde{e}}_{x} (τ | t_{l})∥ \leq \frac{t_{p} \times ϖ}{τ - t_{l}}, \end{matrix}

(34f)

\begin{matrix} {\tilde{e}}_{x} (t_{l} + t_{p} | t_{l}) \in T_{ϑ}, \end{matrix}

(34g)

where

T_{ϑ} = \{{\tilde{e}}_{x} : ∥{\tilde{e}}_{x}∥ \leq ϑ\}

is a robust terminal region, and

ϖ

is designed as follows:

ϖ = \frac{υ - ϰ}{{(F_{1}^{2} + F_{2}^{2} + F_{3}^{2} + F_{4}^{2})}^{1 / 2}}, ϑ < ϖ,

(35)

Remark 2.

For Problem 1, the nominal system (physics-informed dynamics) is updated by the actual state (physics-informed ML model) at each step. Consequently, the provided optimization problem must be solved online. Nevertheless, such an updating framework produces an optimal state with respect to the current one, and since the disturbances are not considered in Problem 1, the designed optimization problem consumes less computational resources.

Problem 1 determines the minimizing sequence over the interval

[t_{l}, t_{l} + t_{p})

:

{\tilde{u}}_{s}^{*} (t_{l}) = \{{\tilde{u}}_{s}^{*} (t_{l} | t_{l}), {\tilde{u}}_{s}^{*} (t_{l + 1} | t_{l}), \dots, {\tilde{u}}_{s}^{*} (t_{l + N} | t_{l})\} .

(36)

Hence, the drone’s control signal over interval

[t_{l}, t_{l + 1})

is defined as:

u_{s} = {\tilde{u}}_{s}^{*} (t_{l} | t_{l}),

(37)

where

{\tilde{u}}_{s}^{*} (t_{l} | t_{l})

is the initial control action of

{\tilde{u}}_{s}^{*} (t_{l})

.

A systematic procedure to apply the control action to the aircraft is provided in Algorithm 1. Nonetheless, if we assume that unknown disturbances are present, a tracking error between the reference and actual trajectory may exist. Furthermore, in order to satisfy the state constraints in the presence of the disturbances, the error on the upper bound between the actual and predicted nominal state is given as follows:

\begin{matrix} ∥x_{s} (t_{l + 1}) - {\tilde{x}}_{s}^{*} (t_{l + 1} | t_{l})∥ \\ = ∥ x_{s} (t_{l}) + \int_{t_{l}}^{t_{l + 1}} f_{s} (x_{s} (τ), {\tilde{u}}_{s} (τ)) d τ - {\tilde{x}}_{s}^{*} (t_{l} | t_{l}) \\ - \int_{t_{l}}^{t_{l + 1}} f_{s} ({\tilde{x}}_{s}^{*} (τ | t_{l}), {\tilde{u}}_{s}^{*} (t_{l} | t_{l})) d τ ∥, \\ \leq ℷ t_{i} + υ \int_{t_{l}}^{t_{l + 1}} ∥x_{s} (τ) - {\tilde{x}}_{s}^{*} (τ | t_{l})∥ d τ, \\ \leq ℷ t_{i} e^{υ t_{i}} . \end{matrix}

(38)

Algorithm 1: MPC-based control using physics-informed ML

Theorem 1.

Suppose the Assumptions 2 and 3 hold, and the Problem 1 is feasible at initial time

t_{0}

. Then, Problem 1 is feasible for all intervals under Algorithm 1, if the following conditions satisfy:

ℷ \leq \frac{e^{- υ t_{p}}}{t_{i}} (ϖ - ϑ), \tilde{F} t_{i} \geq ln (ϖ) - ln (ϑ),

(39)

where

\tilde{F} = min \{{\tilde{F}}_{i}\}

.

ϑ \geq \frac{ϖ (t_{p} - t_{i})}{t_{p}} .

(40)

Proof.

A feasible control sequence

{\tilde{u}}_{s} (τ | t_{l + 1})

at

t_{l + 1}

is provided as follows:

{\tilde{u}}_{s} (τ | t_{l + 1}) = \{\begin{matrix} {\tilde{u}}_{s}^{*} (τ | t_{l}), τ \in [t_{l + 1}, t_{l} + t_{p}), \\ K {\tilde{e}}_{x} (τ | t_{l}), τ \in [t_{l} + t_{p}, t_{l + 1} + t_{p}) . \end{matrix}

(41)

As the nominal state is updated by the actual state, i.e.,

{\tilde{x}}_{s} (t_{l + 1} | t_{l + 1}) = x_{s} (t_{l + 1})

, and by considering the time interval

τ \in [t_{l + 1}, t_{l} + t_{p})

, we obtain

\begin{matrix} ∥{\tilde{x}}_{s} (τ | t_{l + 1}) - {\tilde{x}}_{s}^{*} (τ | t_{l})∥ \\ = ∥ x_{s} (t_{l + 1}) + \int_{t_{l + 1}}^{τ} f_{s} ({\tilde{x}}_{s} (s | t_{l + 1}), {\tilde{u}}_{s}^{*} (s | t_{l})) d s \\ - {\tilde{x}}_{s}^{*} (t_{l + 1} | t_{l}) - \int_{t_{l + 1}}^{τ} f_{s} ({\tilde{x}}_{s}^{*} (s | t_{l}), {\tilde{u}}_{s}^{*} (s | t_{l})) d s ∥ . \end{matrix}

(42)

Utilizing Grönwall–Bellman inequality, we achieve:

∥{\tilde{x}}_{s} (τ | t_{l + 1}) - {\tilde{x}}_{s}^{*} (τ | t_{l})∥ \leq ℷ t_{i} e^{υ (τ - t_{l + 1} + t_{i})} .

(43)

Employing triangle inequality and substituting

t_{l} + t_{p}

in (43), we have

∥{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l + 1})∥ \leq ∥{\tilde{e}}_{x}^{*} (t_{l} + t_{p} | t_{l})∥ + ℷ t_{i} e^{υ t_{i}} .

(44)

Since the ensuing expressions hold,

∥{\tilde{e}}_{x}^{*} (t_{l} + t_{p} | t_{l})∥ \leq ϑ

and

ℷ \leq \frac{e^{- υ t_{p}}}{t_{i}} (ϖ - ϑ)

. Therefore, one can obtain:

∥{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l + 1})∥ \leq ϖ,

(45)

which implies

{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l + 1}) \in T

. Next, consider the control input

u (τ) = K {\tilde{e}}_{x} \in U_{MPC}

is directly applied to the drone during

τ \in [t_{l} + t_{p}, t_{l + 1} + t_{p})

.

\begin{matrix} \frac{d}{d τ} {∥{\tilde{e}}_{x} (τ | t_{l + 1})∥}^{2} & = - 2 ({\tilde{F}}_{1} {\tilde{e}}_{ξ}^{2} + {\tilde{F}}_{2} {\tilde{e}}_{V}^{2} + {\tilde{F}}_{3} {\tilde{e}}_{Θ}^{2} + {\tilde{F}}_{4} {\tilde{e}}_{ω}^{2}), \\ \leq - 2 \tilde{F} {∥{\tilde{e}}_{x} (τ | t_{l + 1})∥}^{2} . \end{matrix}

(46)

By using the comparison principle,

∥{\tilde{e}}_{x} (t_{l + 1} + t_{p} | t_{l + 1})∥ \leq ∥{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l + 1})∥ e^{- \tilde{F t_{i}}} .

Through

\tilde{F} t_{i} \geq ln (ϖ) - ln (ϑ)

:

∥{\tilde{e}}_{x} (t_{l + 1} + t_{p} | t_{l + 1})∥ \leq ϑ .

(47)

This comprehensively ensures that

{\tilde{u}}_{s} (τ | t_{l + 1})

is able to force the nominal tracking error into the terminal region

T_{ϑ}

, which satisfies the constraint (34g).

To prove the constraint satisfaction of (34f), we initially consider the time interval

τ \in [t_{l + 1}, t + t_{p})

. From (43), it follows that:

∥{\tilde{e}}_{x} (τ | t_{l + 1})∥ \leq ∥{\tilde{e}}_{x}^{*} (τ | t_{l})∥ + ℷ t_{i} e^{υ t_{i}} .

(48)

Because of (39) and

∥{\tilde{e}}_{x} (τ | t_{l})∥ \leq \frac{t_{p}}{τ - t_{l}}

, we obtain

∥{\tilde{e}}_{x} (τ | t_{l + 1})∥ \leq \frac{t_{p} \times ϖ}{τ - t_{l}} + (ϖ - ϑ) .

(49)

From (40), we have

ϖ - ϑ \leq \frac{t_{i} ϖ}{t_{p} - t_{i}} \leq \frac{t_{i} ϖ t_{p}}{(τ - t_{l + 1}) (τ - t_{l})} .

(50)

Substituting (50) into (49), we achieve

∥{\tilde{e}}_{x} (τ | t_{l + 1})∥ \leq \frac{t_{p} \times ϖ}{τ - t_{l + 1}},

(51)

which comprehensively proves that that state constraint (34f) is satisfied. Next, we consider the time interval

τ \in [t_{k} + t_{p}, t_{l + 1} + t_{p})

. Through the proof of (45) and (47), and since

\frac{ϖ t_{i}}{τ - t_{l + 1}} \geq ϖ

,

∥{\tilde{e}}_{x} (τ | t_{l + 1})∥ \leq \frac{t_{p} \times ϖ}{τ - t_{l + 1}}

is satisfied over

τ \in [t_{k} + t_{p}, t_{l + 1} + t_{p})

. This completes the proof of constraint satisfaction of (34f). The aforementioned mathematical analysis conclusively proves that the designed candidate solution (41) is a feasible solution. This establishes that Problem 1 is feasible for the entire time and concludes the feasibility proof. □

Theorem 2.

Supposing that all the conditions in Theorem 1 meet, then the aircraft will satisfy constraints in the closed loop control operation.

Proof.

For the entire time interval

t \geq 0

, the constraints satisfied in Theorem 1, then the actual system will ensure constraint satisfaction in the close loop scenario despite the presence of disturbances. □

The following theorem provides the stability analysis of the proposed technique.

Theorem 3.

Suppose the drone is controlled through (37) and by Algorithm 1 with all conditions holding in Theorems 1 and 2, then the tracking error system is ISS if

\underset{̲}{q} ϑ^{2} > \frac{1}{2} ℷ e^{υ t_{p}} (ϖ + ϑ) + \frac{{\bar{q}}^{2} ℷ^{2} t_{i}}{2 υ} (e^{2 υ t_{p}} - e^{2 υ t_{i}}) + \frac{2 {\bar{q}}^{2} ℷ ϖ}{\sqrt{2} υ} {(\frac{t_{p}^{2}}{t_{i}} - t_{p})}^{\frac{1}{2}} {(e^{2 υ t_{p}} - e^{2 υ t_{i}})}^{\frac{1}{2}},

(52)

where

\underset{̲}{q} = min {q_{k}}

and

\bar{q} = max {q_{k}}

with

k = {1, 2, 3, \dots, 12}

.

Proof.

Choose a Lyapunov function

V (e_{x} (t_{l})) = J ({\tilde{e}}_{x}^{*} (t_{l}), {\tilde{e}}_{u}^{*} (t_{l})) .

(53)

Through the Riemann integral principle, a constant

r_{1}

exists such that the subsequent expression satisfies

V (e_{x}) \geq r_{1} S_{c} (e_{x}, e_{u}) ≜ ℶ_{1} (∥e_{x}∥)

. From (27),

V (e_{x} (t_{l})) \leq T_{p} (e_{x} (t_{l})) + T_{p} (e_{x} (t_{l} + t_{p} ∣ t_{l})), \forall e_{x} \in T_{ϑ}

. Due to

T_{p} (.)

in the terminal region with respect to time. Hence,

V (e_{x} (t_{l})) \leq 2 g (e_{x} (t_{l})), \forall e_{x} \in T_{ϑ}

. As the origin lies in the

T_{ϑ}

, &

2 g (e_{x} (t_{l})) \leq ϑ

\forall e_{x} \in T_{ϑ}

, it satisfies that

2 T_{p} (e_{x} (t_{l})) \geq T_{ϑ}

. Because of the feasibility of Problem 1, an upper bound

r_{2} > ϑ

for

V (e_{x} (t_{l}))

exists. Thus,

ℶ_{2} (∥e_{x} (t_{l})∥) = \frac{r_{2}}{ϑ} T_{p} (e_{x} (t_{l}))

is a

K_{\infty}

function such that

ℶ_{2} (∥e_{x} (t_{l})∥) \geq r_{2}

, thereby satisfying

ℶ_{2} (∥e_{x} (t_{l})∥) \geq V (e_{x} (t_{l}))

. This confirms the existence of

K_{\infty}

functions

ℶ_{1}

,

ℶ_{2}

satisfying (31).

Lyapunov function at

t_{l + 1}

with the actual system

V (e_{x} (t_{l + 1})) = J ({\tilde{e}}_{x} (t_{l + 1}), {\tilde{e}}_{u} (t_{l + 1})),

(54)

so

\begin{matrix} Δ V & = V (e_{x} (t_{l + 1})) - V (e_{x} (t_{l})) \\ \leq J ({\tilde{e}}_{x} (t_{l + 1}), {\tilde{e}}_{u} (t_{l + 1})) - J ({\tilde{e}}_{x}^{*} (t_{l}), {\tilde{e}}_{u}^{*} (t_{l})) \\ ≜ Δ V_{1} + Δ V_{2} + Δ V_{3}, \end{matrix}

(55)

where

\begin{matrix} Δ V_{1} & = \int_{t_{l + 1}}^{t_{l} + t_{p}} ({∥{\tilde{e}}_{x} (τ | t_{l + 1})∥}_{Q}^{2} - {∥{\tilde{e}}_{x}^{*} (τ | t_{l})∥}_{Q}^{2}) d τ, \\ Δ V_{2} & = \int_{t_{l} + t_{p}}^{t_{l + 1} + t_{p}} ({∥{\tilde{e}}_{x} (τ | t_{l + 1})∥}_{Q}^{2} + {∥{\tilde{e}}_{u} (τ | t_{l + 1})∥}_{P}^{2}) d τ \\ + {∥{\tilde{e}}_{x} (t_{l + 1} + t_{p} | t_{l + 1})∥}_{R}^{2} - {∥{\tilde{e}}_{x}^{*} (t_{l} + t_{p} | t_{l})∥}_{R}^{2}, \\ Δ V_{3} & = - \int_{t_{l}}^{t_{l + 1}} ({∥{\tilde{e}}_{x}^{*} (τ | t_{l})∥}_{Q}^{2} + {∥{\tilde{e}}_{u}^{*} (τ | t_{l})∥}_{P}^{2}) d τ . \end{matrix}

Let

Δ V_{1}

\begin{matrix} Δ V_{1} & \leq \int_{t_{l + 1}}^{t_{l} + t_{p}} ({∥{\tilde{e}}_{x} (τ | t_{l + 1}) - {\tilde{e}}_{x}^{*} (τ | t_{l})∥}_{Q}) ({∥{\tilde{e}}_{x} (τ | t_{l + 1})∥}_{Q} + {∥{\tilde{e}}_{x}^{*} (τ | t_{l})∥}_{Q}) d τ \\ \leq \int_{t_{l + 1}}^{t_{l} + t_{p}} [{\bar{q}}^{2} ℷ t_{i} e^{υ (τ + t_{i} - t_{l + 1})} \times (2 ∥{\tilde{e}}_{x}^{*} (τ | t_{l})∥ + ℷ t_{i} e^{υ (τ + t_{i} - t_{l + 1})})] d τ \\ = \int_{t_{l + 1}}^{t_{l} + t_{p}} [2 {\bar{q}}^{2} e^{υ (τ + t_{i} - t_{l + 1})} ∥{\tilde{e}}_{x}^{*} (τ | t_{l})∥ + {\bar{q}}^{2} e^{2 υ (τ + t_{i} - t_{l + 1})}] d τ \\ \leq \int_{t_{l + 1}}^{t_{l} + t_{p}} 2 \bar{q} e^{υ (τ + t_{i} - t_{l + 1})} ∥{\tilde{e}}_{x}^{*} (τ | t_{l})∥ d τ + \frac{{\bar{q}}^{2}}{2 υ} (e^{2 υ t_{p}} - e^{2 υ t_{i}}) . \end{matrix}

(56)

Applying Hölder inequality to the first term,

Δ V_{1} \leq {(\int_{t_{l + 1}}^{t_{l} + t_{p}} {∥{\tilde{e}}_{x}^{*} (τ | t_{l})∥}^{2} d τ)}^{\frac{1}{2}} \frac{2 {\bar{q}}^{2} ℷ t_{i}}{\sqrt{2} υ} {(e^{2 υ t_{p}} - e^{2 υ t_{i}})}^{\frac{1}{2}} + \frac{{\bar{q}}^{2} ℷ^{2} t_{i}^{2}}{2 υ} (e^{2 υ t_{p}} - e^{2 υ t_{i}}) .

(57)

Suppose

Δ V_{2}

\begin{matrix} Δ V_{2} = & \int_{t_{l} + T}^{t_{l + 1} + t_{p}} {∥{\tilde{e}}_{x} (τ | t_{l + 1})∥}_{Q}^{2} + {∥{\tilde{e}}_{u} (τ | t_{l + 1})∥}_{P}^{2} d τ + {∥{\tilde{e}}_{x} (t_{l + 1} + t_{p} | t_{l + 1})∥}_{R}^{2} \\ - {∥{\tilde{e}}_{x}^{*} (t_{l} + t_{p} | t_{l})∥}_{R}^{2} + {∥{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l + 1})∥}_{R}^{2} - {∥{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l + 1})∥}_{R}^{2} . \end{matrix}

(58)

Integrating (27) from

t_{l} + t_{p}

into

t_{l + 1} + t_{p}

and substituting it into (58)

\begin{matrix} Δ V_{2} \leq & {∥{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l + 1})∥}_{R}^{2} - {∥{\tilde{e}}_{x}^{*} (t_{l} + t_{p} | t_{l})∥}_{R}^{2} \\ \leq & (\frac{1}{2} ∥{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l + 1}) - {\tilde{e}}_{x}^{*} (t_{l} + t_{p} | t_{l})∥) \\ \times (∥{\tilde{e}}_{x} (t_{l} + t_{p} | t_{l + 1})∥ + ∥{\tilde{e}}_{x}^{*} (t_{l} + t_{p} | t_{l})∥) \\ \leq & \frac{1}{2} ℸ ℶ e^{υ t_{p}} (ϑ + ϖ) . \end{matrix}

(59)

Consider

Δ V_{3}

\begin{matrix} Δ V_{3} & < - \int_{t_{l}}^{t_{l + 1}} {∥{\tilde{e}}_{x}^{*} (τ | t_{l})∥}_{Q}^{2} d τ, \\ \leq - \underset{̲}{q} t_{i} ϑ^{2} . \end{matrix}

(60)

By the addition of (57), (59) and (60),

Δ V ≜ Δ V_{1} + Δ V_{2} + Δ V_{3}

satisfies

\begin{matrix} Δ V < & - \underset{̲}{q} t_{i} ϑ^{2} + \frac{1}{2} ℷ t_{i} e^{υ t_{p}} (ϑ + ϖ) + \frac{2 {\bar{q}}^{2} ℷ t_{i}}{\sqrt{2} υ} {(e^{2 υ t_{p}} - e^{2 υ t_{i}})}^{\frac{1}{2}} \\ + \frac{{\bar{q}}^{2} ℷ^{2} t_{i}^{2}}{2 υ} (e^{2 υ t_{p}} - e^{υ t_{i}}) . \end{matrix}

(61)

From the condition (52),

Δ V < 0

satisfies. When

e_{x} \in T_{ϑ}

, re-assume

Δ V_{1}

and

Δ V_{3}

:

\begin{matrix} Δ V_{1} & \leq \int_{t_{l + 1}}^{t_{l} + t_{p}} 2 {\bar{q}}^{2} ℷ t_{i} ϑ (e^{υ τ - t_{l + 2}} d τ + \frac{{\bar{q}}^{2} ℷ^{2} t_{i}^{2}}{2 υ} (e^{2 υ t_{p}} - e^{2 υ t_{i}}) \\ = \frac{2 {\bar{q}}^{2} ℶ t_{i} ϑ}{υ} (e^{υ t_{p}} - e^{υ t_{i}}) + \frac{\bar{q} {\bar{q}}^{2} ℶ^{2} t_{i}^{2}}{2 υ} (e^{υ t_{p}} - e^{υ t_{i}}) . \end{matrix}

(62)

Because of the decreasing property of

{∥{\tilde{e}}_{x} (τ | t_{l})∥}_{Q}^{2}

in

T_{ϑ}

, then

Δ V_{3}

follows

Δ V_{3} \leq - \underset{̲}{q} t_{i} {∥{\tilde{e}}_{x}^{*} (t_{l + 1} | t_{l})∥}^{2} .

(63)

From (48), we obtain

{∥e_{x} (t_{l + 1})∥}^{2} \leq {∥e_{x}^{*} (t_{l + 1} | t_{l})∥}^{2} + ℷ^{2} t_{i}^{2} e^{2 υ t_{i}} + 2 ϑ ℷ t_{i}^{2} e^{ϑ t_{i}}

. As a result,

Δ V_{3} \leq - \underset{̲}{q} t_{i} {∥{\tilde{e}}_{x} (t_{l + 1})∥}^{2} \underset{̲}{q} ℷ^{2} t_{i}^{3} e^{2 ϑ ℷ} + 2 \underset{̲}{q} ϑ ℷ t_{i}^{2} e^{ϑ t_{i}} .

Consequently, it satisfies that

Δ V \leq - \underset{̲}{q} t_{i} {∥{\tilde{e}}_{x} (t_{l + 1})∥}^{2} + ℸ (ℷ)

(64)

where

ℸ (ℷ) = \frac{2 {\bar{q}}^{2} ℶ t_{i} ϑ}{υ} (e^{υ t_{p}} - e^{υ t_{i}}) + \frac{{\bar{q}}^{2} ℶ^{2} t_{i}^{2}}{2 υ} (e^{υ t_{p}} - e^{υ t_{i}}) + \frac{1}{2} ℶ ℸ e^{υ t_{p}} (ϑ + ϖ) + \underset{̲}{q} ℶ^{2} t_{i}^{3} e^{2 υ t_{i}} + 2 \underset{̲}{q} ϑ ℶ t_{i}^{2} e^{υ t_{i}}

is a

K

function w.r.t. ℶ. Hence, the stability is comprehensively proved. □

4. Comparative Analysis

4.1. Simulation Studies

The parameters of the aerial robot are provided as follows:

m = 1.6

kg,

I_{x x} = I_{y y} = 0.025

kg.m

^{2}

,

I_{z z} = 0.005

kg.m

^{2}

,

l_{1} = 0.17

m,

l_{2} = 0.06

m,

g = 9.8

m/s

^{2}

. 30% parametric uncertainty for the entire flight is considered during the simulation study. To validate the reliability of the controller, a high-speed

υ = 20

m/s is selected for the comparative analysis. In the cost function,

R = diag {0.1 0.1 0.1 0.1 1 1}

, and the values of

q_{i}

and

p_{i}

are 1 and 0.25, respectively. In Problem 1, the numerical values of

t_{p}

,

t_{i}

,

ϑ

,

F_{i}

,

ϖ

and

ϰ

are chosen as 10 s, 0.05 s, 0.04, 4, 0.08 and 0.2, respectively.

λ

is selected as 0.02, 0.03, 0.04, 0.042, 0.05, 0.051, 0.06, 0.07, 0.08, 0.085, 0.09, and 0.095. The constraints are

- 1 \leq ξ^{I} \leq 1

,

- 2.5 \leq V_{I} \leq 2.5

,

0 \leq T \leq 4

,

- π / 4 \leq Θ \leq π / 4

, and

- 2 \leq ω_{B} \leq 2

.

In order to corroborate the control performance, different trajectories are defined that show that the DFAV has the properties of fixed-wing aircraft and helicopters. Suppose the drone is flying with an angular velocity of 2 rad/s. In the first trajectory (65), the aircraft takes off, and after attaining a certain altitude, it performs a bank-turn motion:

\begin{matrix} ξ_{x} & = sin (2 t), \\ ξ_{y} & = 5 cos (2 t), \\ ξ_{z} & = 50 - e^{- 3 t} . \end{matrix}

(65)

In the second instance, we validate the control performance with a helical 3D trajectory:

\begin{matrix} ξ_{x} & = sin (2 t), \\ ξ_{y} & = 5 cos (2 t), \\ ξ_{z} & = 50 t - cos (- 2 t) . \end{matrix}

(66)

Moreover, since the aerial robot is flying with high speed

υ

, we validate the performance with a straight and level flight (SLF), a kind of horizontal flight phase of DFAVs [14]. During this flight, the drone must keep a constant altitude (e.g., 40 m) during the entire time, while the aircraft should fly at the desired angles, i.e.,

α =

5

^{\circ}

and

θ =

5

^{\circ}

. In this case, some initial conditions are required to be considered, e.g.,

ξ_{x} (0) = 0

m,

ξ_{z} (0) = 39

m,

α (0) = 4 . 8^{\circ}

, and

θ (0) = 4 . 8^{\circ}

.

Three different cases are considered:

Case (1): Performance in the presence of parametric uncertainties;
Case (2): Performance in the presence of parametric uncertainties and disturbances;
Case (3): Performance in the presence of faults, model uncertainties and disturbances.

Before proceeding to the aforementioned case studies, the correlation between flight performance and endurable wind speed is provided. In the previous work based on this aerial vehicle [47], aircraft was flown at various speeds under different weather conditions. From these real-time experiments, it was found that the maximum endurable wind speed is approximately 8 m/s. Therefore, it can be concluded that effective flight performance can be achieved if the wind speed is within the maximum endurable wind speed limit. However, the flight performance degrades as the wind speed passes the maximum endurable wind speed threshold. If the aircraft keeps flying over this maximum endurable wind speed, it encounters a structural failure, or a severe fault may occur. Nonetheless, the severity of this structural failure or fault depends on the magnitude and the total duration of the wind gust it encounters. To keep in view the contributions and main objective of this article, we deal with the control performance only under three case studies, under calm weather (Case 1), and wind gust is present (Case 2). However, it is assumed that it may not cause any structural faults because the magnitude of the wind gust is mostly under 8 m/s during the entire flight. Lastly, it is considered in the third case study that the partial fault exists associated with the unavailability of the dynamical information of the wings whose probability of occurrence is less than 3%. This occurrence level is uncommon, but to precisely verify the efficacy of the control approach, the worst-case situation (less than 3%) shall be analyzed in the third case study.

4.1.1. Case(1): Trajectory Tracking Response under Model Uncertainties

A comparative analysis of both strategies under model uncertainties is shown in Figure 5. In Figure 5a, the simulation results based on 3D trajectory (65) and (66) are illustrated. Furthermore, the tracking response under model inaccuracies derived from SLF flight is depicted in Figure 5b. Both controllers are primarily constructed on the principle of nominal MPC with similar attributes, so they exhibit adequate tracking performance. Nevertheless, the proposed technique demonstrates effective control performance with fast response and rapid convergence time.

4.1.2. Case(2): Trajectory Tracking Response under Uncertainties and Disturbances

Recall disturbance model in (29): the continuous Dryden wind turbulence model MIL-HDBK-1797B is employed using Simulink with default parameters chosen. Due to some minor limitations in this turbulence model, wind components are added, depicted in Figure 6. These wind components are composed of sinusoidal waveform

Δ_{wind} = {[sin (8 t) sin (6.01 t) sin (4.02 t)]}^{T}

. The trajectory tracking performance under model uncertainties and disturbances are shown in Figure 7. Note that the MPC-CFC performance, as opposed to the developed approach, deteriorates despite having a dedicated DOB mechanism. This conclusively indicates that DOB in MPC-CFC [16,17] is only suitable for slow time-varying disturbances.

4.1.3. Case(3): Tracking Response in the Presence of Model Uncertainties, Disturbances, and Fault

In the third case study, suppose a fault occurs during the initial few seconds of the simulations due to the presence of a strong wind gust, considered in (29), and this fault is considered as a partial malfunction in one of the wings of the aircraft, whose role is depicted in Figure 2, is now either unavailable or unknown in this case. The tracking performance subject to disturbances, uncertainties, and fault is shown in Figure 8. More specifically, this fault has an adverse effect while controlling the attitude of the aerial platform illustrated in Figure 8b. In this aspect, as adaptive SINDY is equipped to handle and compensate for model divergence and correction, unlike DOB developed offline in [16,17], the presented approach by exploiting the full potential of MPC reveals effective control performance compared to the MPC-CFC technique [16,17].

4.1.4. Performance Analysis Based on Tracking Error and Computations

To analyze the effectiveness of both techniques, mean absolute error (MAE) is utilized and is defined as follows:

MAE = \sum_{j = 1}^{n_{obs}} | e (t_{j}) | / n_{obs},

(67)

where

e (t_{j})

and

n_{obs}

are the tracking error in each time step and the sum of a number of observations, respectively. In Table 2, a comparative analysis is illustrated. The trajectory error suggests that the designed approach shows effective tracking performance.

The numerical simulations are executed on Windows-driven Intel-based dual-core processor (2.2 GHz) with 8 GB of RAM. The optimization Problem 1 is written in MATLAB 2021a via CasADi [48], 3.5.5 version, and computed by NLP solver IPOPT [49], 3.12.3 version. The computational cost of these schemes is determined by the time required to compute the control action during the trajectory tracking response, depicted in Figure 9. Based on these findings, the average normalized computational time for the entire simulation study for the developed algorithm and MPC-CFC strategy is 18.62 ms and 19.51 ms, respectively.

4.2. Discussion

Different case studies have shown that the developed algorithm demonstrates an efficient tracking performance compared to the MPC-CFC technique. It is primarily due to the hybrid modeling approach integrated with the control mechanism that enables the MPC-based controller to perform effectively despite uncertainties, disturbances, and faults. This also confirms our earlier assertion that physics-informed dynamics and DOB in the disturbance rejection MPC approaches (e.g., [16,17]) limited the capability of MPC due to inadequate modeling and ineffective estimation. Nevertheless, the computational complexity may be affected if a large amount of data are utilized online rather than with less data. With regard to the ML technique related to adaptive SINDy, this work has not addressed any shortcomings concerning this method (e.g., noise sensitivity and long-term memory), as they are beyond the scope of this article. Regarding flight robustness, the proposed framework is designed based on a more suitable selection of the terminal region, which yields adequate stability margins as opposed to [16,17]. In spite of all the aforementioned benefits, the presented MPC-based technique using a physics-informed ML model still ensures sub-optimal performance.

5. Conclusions and Future Directions

This paper has presented an MPC-based control design for the ducted fan aircraft utilizing a hybrid modeling scheme known as physics-informed ML. In the beginning, the physics-informed model was derived offline from the drone with sufficient capabilities. Thereafter, an online-based data-driven modeling technique is integrated with physics-informed dynamics to determine the actual model. Afterwards, an MPC-based control algorithm was developed by updating the physics-informal dynamics (nominal system) with real states. ISS stability and recursive feasibility were proven under adequate conditions. Finally, simulations were conducted under three different scenarios starting from challenging (case 1), worse (case 2), and worst-case situation (case 3), which revealed that the developed framework exhibits effective control performance.

For future studies, the worst-case scenario can be utilized as a baseline to effectively construct a more refined fault-tolerant controller. Nevertheless, it is noteworthy that the developed framework is not designed as a fault-tolerant controller. Moreover, the worst-case scenario is only verified under a partial fault occurrence. Furthermore, the designed strategy also ensures sub-optimal performance like [16,17] for aerial applications.

In the end, several future directions can be pursued to improve the efficacy of the proposed strategy:

Physics-informed modeling: the designed model is basically derived from the principle of the Newton–Euler method. Other mathematical models, such as the Lagrange-Euler approach, can be employed;
Data-driven ML: the developed ML scheme was inspired by adaptive SINDy [30], where different ways of improving the ML model’s capability can be explored. Moreover, further investigation is required to find an effective technique to enhance computational efficiency with more data and less physics;
Real-time implementation: future work will involve real-time testing of the presented framework. For this purpose, a more suitable solver can be used for code generation especially developed for real-time embedded optimization.

Author Contributions

Conceptualization, T.M.; methodology, T.M., Z.S. and Z.C.; software, T.M.; validation, T.M.; formal analysis, T.M., Z.S. and H.P.; investigation, T.M.; resources, H.P.; data curation, T.M. and Z.C.; writing—original draft preparation, T.M.; writing—review and editing, T.M., Z.S., Z.C. and H.P.; visualization, T.M., Z.S., Z.C. and H.P.; supervision, H.P.; project administration, H.P.; funding acquisition, T.M. and H.P. All authors have read and agreed to this version of the manuscript.

Funding

This work was supported in part by the Scientific Instruments Development Program of NSFC of China: 615278010, in part by Fundamental Research Funds for the Central Universities, in part by Science and Technology Planning Project of Guangdong, China: 2017B010116005, and in part by 2022 Foreign Expert Program (Foreign Youth Talent Program) of Ministry of Science and Technology of China: QN2022163002.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this article are available on reasonable request from the authors.

Acknowledgments

The authors would like to thank all the funders who supported this work.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

DFAV	Ducted Fan Aerial Vehicle
DOB	Disturbance Observer
MPC	Model Predictive Control
MPC-CFC	MPC-based Compound Flight Control
MAE	Mean Absolute Error
ML	Machine Learning
NN	Neural Network
SINDy	Sparse Identification of Nonlinear Dynamics
SLF	Straight and Level Flight

References

Cheng, Z.; Pei, H. Control Effectiveness Enhancement for the Hovering/Cruising Transition Control of a Ducted Fan UAV. J. Intell. Robot. Syst. 2022, 105. [Google Scholar] [CrossRef]
Cheng, Z.; Pei, H. Transition Analysis and Practical Flight Control for Ducted Fan Fixed-Wing Aerial Robot: Level Path Flight Mode Transition. IEEE Robot. Autom. Lett. 2022, 7, 3106–3113. [Google Scholar] [CrossRef]
Cheng, Z.; Pei, H. Flight Transition Control for Ducted Fan UAV with Saturation on Control Surfaces. In Proceedings of the 2021 International Conference on Unmanned Aircraft Systems (ICUAS), Athens, Greece, 5–18 June 2021; pp. 439–446. [Google Scholar] [CrossRef]
Marconi, L.; Naldi, R.; Gentili, L. Modelling and control of a flying robot interacting with the environment. Automatica 2011, 47, 2571–2583. [Google Scholar] [CrossRef] [Green Version]
Naldi, R.; Macchelli, A.; Mimmo, N.; Marconi, L. Robust Control of an Aerial Manipulator Interacting with the Environment. IFAC-PapersOnLine 2018, 51, 537–542. [Google Scholar] [CrossRef]
Marconi, L.; Naldi, R. Control of Aerial Robots: Hybrid Force and Position Feedback for a Ducted Fan. IEEE Control Syst. Mag. 2012, 32, 43–65. [Google Scholar] [CrossRef]
Naldi, R.; Torre, A.; Marconi, L. Robust Control of a Miniature Ducted-Fan Aerial Robot for Blind Navigation in Unknown Populated Environments. IEEE Trans. Control Syst. Technol. 2015, 23, 64–79. [Google Scholar] [CrossRef]
Roberts, A.; Tayebi, A. Adaptive position tracking of VTOL UAV. IEEE Trans. Robot. 2011, 27, 129–142. [Google Scholar] [CrossRef] [Green Version]
Manzoor, T.; Xia, Y.; Ali, Y.; Hussain, K. Flight control techniques and classification of ducted fan aerial vehicles. Kongzhi Lilun Yu Yingyong/Control Theory Appl. 2022, 39, 201–221. [Google Scholar] [CrossRef]
Hua, M.; Hamel, T.; Morin, P.; Samson, C. Introduction to feedback control of underactuated VTOL vehicles: A review of basic control design ideas and principles. IEEE Control Syst. Mag. 2013, 33, 61–75. [Google Scholar] [CrossRef]
Eren, U.; Prach, A.; Kocer, B.; Rakovic, S.V.; Kayacan, E.; Acikmese, B. Model Predictive Control in Aerospace Systems: Current State and Opportunities. J. Guid. Control. Dyn. 2017, 40, 1541–1566. [Google Scholar] [CrossRef]
Banazadeh, A.; Emami, S.A. Control effectiveness investigation of a ducted-fan aerial vehicle using model predictive controller. In Proceedings of the 2014 International Conference on Advanced Mechatronic Systems, Kumamoto, Japan, 10–12 August 2014; pp. 532–537. [Google Scholar] [CrossRef]
Emami, A.; Banazadeh, A. Robustness investigation of a ducted-fan aerial vehicle control, using linear, adaptive, and model predictive controllers. Int. J. Adv. Mechatron. Syst. 2015, 6, 108–117. [Google Scholar] [CrossRef]
Manzoor, T.; Xia, Y.; Zhai, D.H.; Ma, D. Trajectory tracking control of a VTOL unmanned aerial vehicle using offset-free tracking MPC. Chin. J. Aeronaut. 2020, 33, 2024–2042. [Google Scholar] [CrossRef]
Emami, A.; Rezaeizadeh, A. Adaptive model predictive control-based Attitude and Trajectory Tracking of a VTOL Aircraft. IET Control Theory Appl. 2018, 12, 2031–2042. [Google Scholar] [CrossRef]
Manzoor, T.; Sun, Z.; Xia, Y.; Ma, D. MPC based compound flight control strategy for a ducted fan aircraft. Aerosp. Sci. Technol. 2020, 107, 106264. [Google Scholar] [CrossRef]
Manzoor, T.; Pei, H.; Cheng, Z. Composite observer-based robust model predictive control technique for ducted fan aerial vehicles. Nonlinear Dyn. 2022. [Google Scholar] [CrossRef]
Hewing, L.; Wabersich, K.P.; Menner, M.; Zeilinger, M.N. Learning-Based Model Predictive Control: Toward Safe Learning in Control. Annu. Rev. Control. Robot. Auton. Syst. 2020, 3, 269–296. [Google Scholar] [CrossRef]
Brunke, L.; Greeff, M.; Hall, A.W.; Yuan, Z.; Zhou, S.; Panerati, J.; Schoellig, A.P. Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning. Annu. Rev. Control. Robot. Auton. Syst. 2022, 5, 411–444. [Google Scholar] [CrossRef]
Kaheman, K.; Kaiser, E.; Strom, B.; Kutz, J.N.; Brunton, S.L. Learning Discrepancy Models From Experimental Data. arXiv 2019, arXiv:1909.08574. [Google Scholar] [CrossRef]
Brunton, S.L.; Kutz, J.N. Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control; Cambridge University Press: Cambridge, UK, 2019. [Google Scholar] [CrossRef] [Green Version]
Brunton, S.L.; Nathan Kutz, J.; Manohar, K.; Aravkin, A.Y.; Morgansen, K.; Klemisch, J.; Goebel, N.; Buttrick, J.; Poskin, J.; Blom-Schieber, A.W.; et al. Data-Driven Aerospace Engineering: Reframing the Industry with Machine Learning. AIAA J. 2021, 59, 2820–2847. [Google Scholar] [CrossRef]
Zhang, W.; Shen, J.; Ye, X.; Zhou, S. Error model-oriented vibration suppression control of free-floating space robot with flexible joints based on adaptive neural network. Eng. Appl. Artif. Intell. 2022, 114, 105028. [Google Scholar] [CrossRef]
Hosseini, S.; Poormirzaee, R.; Hajihassani, M. Application of reliability-based back-propagation causality-weighted neural networks to estimate air-overpressure due to mine blasting. Eng. Appl. Artif. Intell. 2022, 115, 105281. [Google Scholar] [CrossRef]
Floriano, B.R.; Vargas, A.N.; Ishihara, J.Y.; Ferreira, H.C. Neural-network-based model predictive control for consensus of nonlinear systems. Eng. Appl. Artif. Intell. 2022, 116, 105327. [Google Scholar] [CrossRef]
Park, B.S.; Yoo, S.J. Quantized-communication-based neural network control for formation tracking of networked multiple unmanned surface vehicles without velocity information. Eng. Appl. Artif. Intell. 2022, 114, 105160. [Google Scholar] [CrossRef]
Kaiser, E.; Kutz, J.N.; Brunton, S.L. Sparse identification of nonlinear dynamics for model predictive control in the low-data limit. Proc. R. Soc. A Math. Phys. Eng. Sci. 2018, 474, 20180335. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Brunton, S.L.; Proctor, J.L.; Kutz, J.N. Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proc. Natl. Acad. Sci. USA 2016, 113, 3932–3937. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Cao, R.; Lu, Y.; He, Z. System identification method based on interpretable machine learning for unknown aircraft dynamics. Aerosp. Sci. Technol. 2022, 126, 107593. [Google Scholar] [CrossRef]
Quade, M.; Abel, M.; Nathan Kutz, J.; Brunton, S.L. Sparse identification of nonlinear dynamics for rapid model recovery. Chaos: Interdiscip. J. Nonlinear Sci. 2018, 28, 063116. [Google Scholar] [CrossRef]
Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
Arnold, F.; King, R. State–space modeling for control based on physics-informed neural networks. Eng. Appl. Artif. Intell. 2021, 101, 104195. [Google Scholar] [CrossRef]
Zobeiry, N.; Humfeld, K.D. A physics-informed machine learning approach for solving heat transfer equation in advanced manufacturing and engineering applications. Eng. Appl. Artif. Intell. 2021, 101, 104232. [Google Scholar] [CrossRef]
Shen, S.; Lu, H.; Sadoughi, M.; Hu, C.; Nemani, V.; Thelen, A.; Webster, K.; Darr, M.; Sidon, J.; Kenny, S. A physics-informed deep learning approach for bearing fault detection. Eng. Appl. Artif. Intell. 2021, 103, 104295. [Google Scholar] [CrossRef]
Liu, X.; Peng, W.; Gong, Z.; Zhou, W.; Yao, W. Temperature field inversion of heat-source systems via physics-informed neural networks. Eng. Appl. Artif. Intell. 2022, 113, 104902. [Google Scholar] [CrossRef]
Nascimento, R.G.; Fricke, K.; Viana, F.A. A tutorial on solving ordinary differential equations using Python and hybrid physics-informed neural network. Eng. Appl. Artif. Intell. 2020, 96, 103996. [Google Scholar] [CrossRef]
Ahnert, K.; Abel, M. Numerical differentiation of experimental data: Local versus global methods. Comput. Phys. Commun. 2007, 177, 764–774. [Google Scholar] [CrossRef]
Chartrand, R. Numerical Differentiation of Noisy, Nonsmooth Data. Int. Sch. Res. Netw. 2011, 2011, 1023–1033. [Google Scholar] [CrossRef] [Green Version]
Zhang, L.; Schaeffer, H. On the Convergence of the SINDy Algorithm. Multiscale Model. Simul. 2019, 17, 948–972. [Google Scholar] [CrossRef] [Green Version]
Gershenfeld, N.A. The Nature of Mathematical Modeling; Cambridge University Press: Cambridge, UK, 1999. [Google Scholar] [CrossRef]
Xue, R.; Dai, L.; Huo, D.; Xie, H.; Sun, Z.; Xia, Y. Compound tracking control based on MPC for quadrotors with disturbances. J. Frankl. Inst. 2022, 359, 7992–8013. [Google Scholar] [CrossRef]
Chen, H.; Allgöwer, F. A Quasi-Infinite Horizon Nonlinear Model Predictive Control Scheme with Guaranteed Stability. Automatica 1998, 34, 1205–1217. [Google Scholar] [CrossRef]
Althoff, M.; Stursberg, O.; Buss, M. Reachability analysis of nonlinear systems with uncertain parameters using conservative linearization. In Proceedings of the 47th IEEE Conference on Decision and Control, Cancun, Mexico, 9–11 December 2008; pp. 4042–4048. [Google Scholar] [CrossRef] [Green Version]
Sun, Z.; Xia, Y. Receding horizon tracking control of unicycle-type robots based on virtual structure. Int. J. Robust Nonlinear Control 2016, 26, 3900–3918. [Google Scholar] [CrossRef]
Sontag, E.D. Input to State Stability: Basic Concepts and Results. In Nonlinear and Optimal Control Theory: Lectures Given at the C.I.M.E. Summer School Held in Cetraro, Italy June 19–29, 2004; Springer: Berlin/Heidelberg, Germany, 2008; pp. 163–220. [Google Scholar] [CrossRef]
Sun, Z.; Dai, L.; Liu, K.; Xia, Y.; Johansson, K.H. Robust MPC for tracking constrained unicycle robots with additive disturbances. Automatica 2018, 90, 172–184. [Google Scholar] [CrossRef]
Cheng, Z.; Pei, H.; Li, S. Neural-Networks Control for Hover to High-Speed-Level-Flight Transition of Ducted Fan UAV With Provable Stability. IEEE Access 2020, 8, 100135–100151. [Google Scholar] [CrossRef]
Andersson, J.A.E.; Gillis, J.; Horn, G.; Rawlings, J.B.; Diehl, M. CasADi–A software framework for nonlinear optimization and optimal control. Math. Program. Comput. 2019, 11, 1–36. [Google Scholar] [CrossRef]
Wächter, A.; Biegler, L.T. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Math. Program. 2006, 106, 25–57. [Google Scholar] [CrossRef]

Figure 1. Physics and data scenario in the presented scheme.

Figure 2. (a) Inertial and body-fixed frame; (b) other components during flight; (c) aerodynamic effects on the duct; (d) aerodynamic effects on the wings.

Figure 3. Physics-informed ML model for DFAVs.

Figure 4. Schematic diagram of the MPC-based control approach using physics-informed ML.

Figure 5. Tracking response under model uncertainties (a) 3D trajectory tracking (b) SLF flight.

Figure 6. Disturbance profile involving Dryden wind turbulence (

Δ_{w}, Δ_{v}

) and other time-varying wind disturbances

Δ_{wind}

.

Figure 6. Disturbance profile involving Dryden wind turbulence (

Δ_{w}, Δ_{v}

) and other time-varying wind disturbances

Δ_{wind}

.

Figure 7. Tracking response under uncertainties and disturbances (a) 3D trajectory tracking; (b) SLF flight.

Figure 8. Tracking response under model uncertainties, disturbances, and fault (a) 3D trajectory tracking; (b) SLF flight.

Figure 9. Normalized computational time of both frameworks.

Table 1. A concise comparison between existing MPC-based control methods and the proposed technique for DFAVs (order by year).

References	Features ¹	Advantages	Disadvantages
Linear MPC [12,13]	AMS	Simple design, effective	Lack adequate tracking and robustness.
		computations.
1. Disturbance rejection RMPC ²,	ACDMPS	Considered time-delays, discrete time optimization problem	1. RMPC ²: lack effective control performance,
2. Disturbance rejection adaptive			2. Adaptive MPC: design intricacy may not be effective, feasibility analysis is not available for both schemes.
Observer-based MPC [14]	ACDMPS	Effective computations, easy real-time implementation, considered time delays.	Ineffective control performance, recursive easibility cannot be established for the entire flight.
Compound RMPC ² [16,17]	ACDMPS	-	Inadequate performance, suitable if the DOB’s dynamics is faster than disturbance dynamics.

¹ A: attitude tracking, C: composite technique, D: DOB, M: physics-informed modeling, P: position tracking, S: simulation study. ² RMPC: Robust MPC.

Table 2. Performance analysis based on tracking error (MAE).

	Scenario	MPC-CFC	Proposed		Scenario	MPC-CFC	Proposed
$ξ_{y}$ (m)	Case(1)-Figure 5a	1.5752	0.0016	$ξ_{z}$ (m)	Case(1)-Figure 5b	0.1711	0.1424
	Case(1)-Figure 5a	0.1569	0.0157		Case(1)-Figure 5a	0.0011	1.09 $\times 10^{- 6}$
	Case(2)-Figure 7a	2.3279	0.0141		Case(2)-Figure 7b	0.0999	0.0987
	Case(2)-Figure 7a	0.2357	0.0235		Case(2)-Figure 7a	0.0016	9.82 $\times 10^{- 6}$
	Case(3)-Figure 8a	2.8687	0.0157		Case(3)-Figure 8b	0.122	0.0782
	Case(3)-Figure 8a	0.4729	0.0314		Case(3)-Figure 8a	0.002	1.091 $\times 10^{- 5}$
$ξ_{x}$ (m)	Case(1)-Figure 5a	0.03066	0.0003	$θ$ ( $^{°}$ )	Case(1)-Figure 5b	0.0482	0.0378
	Case(1)-Figure 5a	0.0325	0.0033		Case(2)-Figure 7b	0.0376	0.0349
	Case(2)-Figure 7a	0.4429	0.0029		Case(3)-Figure 8b	0.0748	0.0284
	Case(2)-Figure 7a	0.0486	0.049	$α$ ( $^{°}$ )	Case(1)-Figure 5b	0.0466	0.0460
	Case(3)-Figure 8a	0.5446	0.0033		Case(2)-Figure 7b	0.0371	0.0344
	Case(3)-Figure 8a	0.0966	0.0065		Case(3)-Figure 8b	0.069	0.0389

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Manzoor, T.; Pei, H.; Sun, Z.; Cheng, Z. Model Predictive Control Technique for Ducted Fan Aerial Vehicles Using Physics-Informed Machine Learning. Drones 2023, 7, 4. https://doi.org/10.3390/drones7010004

AMA Style

Manzoor T, Pei H, Sun Z, Cheng Z. Model Predictive Control Technique for Ducted Fan Aerial Vehicles Using Physics-Informed Machine Learning. Drones. 2023; 7(1):4. https://doi.org/10.3390/drones7010004

Chicago/Turabian Style

Manzoor, Tayyab, Hailong Pei, Zhongqi Sun, and Zihuan Cheng. 2023. "Model Predictive Control Technique for Ducted Fan Aerial Vehicles Using Physics-Informed Machine Learning" Drones 7, no. 1: 4. https://doi.org/10.3390/drones7010004

Article Menu

Model Predictive Control Technique for Ducted Fan Aerial Vehicles Using Physics-Informed Machine Learning

Abstract

1. Introduction

1.1. Literature Review

1.2. Contributions

1.3. Organization

1.4. Notation

2. Problem Formulation

2.1. Physics-Informed Modelling

2.2. Data-Driven ML-Adaptive SINDy

2.2.1. Baseline Model

2.2.2. Estimation of Model Divergence

2.2.3. Adaptive Model Recovery

2.3. Control Objective

3. Control Framework

4. Comparative Analysis

4.1. Simulation Studies

4.1.1. Case(1): Trajectory Tracking Response under Model Uncertainties

4.1.2. Case(2): Trajectory Tracking Response under Uncertainties and Disturbances

4.1.3. Case(3): Tracking Response in the Presence of Model Uncertainties, Disturbances, and Fault

4.1.4. Performance Analysis Based on Tracking Error and Computations

4.2. Discussion

5. Conclusions and Future Directions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI