Next Article in Journal
Diffusion Kinetics Theory of Removal of Assemblies’ Surface Deposits with Flushing Oil
Previous Article in Journal
The Complexity of the Super Subdivision of Cycle-Related Graphs Using Block Matrices
Previous Article in Special Issue
A Parametric Family of Triangular Norms and Conorms with an Additive Generator in the Form of an Arctangent of a Linear Fractional Function
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quadrotor Trajectory Tracking Using Model Reference Adaptive Control, Neural Network-Based Parameter Uncertainty Compensator, and Different Plant Parameterizations

by
Anton Glushchenko
*,† and
Konstantin Lastochkin
V.A. Trapeznikov Institute of Control Sciences of Russian Academy of Sciences, 117997 Moscow, Russia
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Computation 2023, 11(8), 163; https://doi.org/10.3390/computation11080163
Submission received: 20 June 2023 / Revised: 7 August 2023 / Accepted: 15 August 2023 / Published: 18 August 2023
(This article belongs to the Special Issue Control Systems, Mathematical Modeling and Automation II)

Abstract

:
A quadrotor trajectory tracking problem is addressed via the design of a model reference adaptive control (MRAC) system. As for real-world applications, the entire quadrotor dynamics is typically unknown. To take that into account, we consider a plant model, which contains uncertain nonlinear terms resulting from aerodynamic friction, blade flapping, and the fact that the mass and inertia moments of the quadrotor may change from their nominal values. Unlike many known studies, the explicit equations of the parameter uncertainty for the position control loop are derived in two different ways using the differential flatness approach: the control signals are (i) used and (ii) not used in the parametric uncertainty parameterization. After analysis, the neural network (NN) is chosen for both cases as a compensator of such uncertainty, and the set of NN input signals is justified for each of them. Unlike many known MRAC systems with NN for quadrotors, in this study, we use the k x x + k r r baseline controller, which follows from the control system derivation, with both time-invariant (parameterization (i)) and adjustable (parameterization (ii)) parameters instead of an arbitrarily chosen non-tunable PI/PD/PID-like one. Adaptive laws are derived to adjust the parameters of NN uncertainty compensator for both parameterizations. As a result, the position controller ensures the asymptotic stability of the tracking error for both cases under the assumption of perfect attitude loop tracking, which is ensured in the system previously developed by the authors. The results of the numerical experiments support the theoretical conclusions and provide a comparison of the effectiveness of the derived parameterizations. They also allow us to make conclusions on the necessity of the baseline controller adjustment.

1. Introduction

This study is an extended version of a previously published conference paper [1].
The problem of quadrotor control quality improvement under the condition of parameter uncertainty has gained considerable attention from the control community in the recent years [2] because of the numerous outdoor and indoor civilian applications of such devices in the following: agriculture, traffic monitoring, delivery, mapping coverage, etc.
The main advantages of the quadrotors are their small size, light weight, vertical take-off and landing, and dynamical maneuverability. However, on the other hand, their dynamics is non-linear with couplings between translational and rotational motions; we have only four control signals (thrust and rotational torques) for six degrees of freedom. Moreover, some parameters of such a plant can not be known precisely: aerodynamical coefficients, its mass, and inertia moments. For instance, if the quadrotor is spraying pesticides on crops, its mass is getting lower like a smooth function, while in the case of cargo delivery, such variation happens in a step-like manner. It should be noted that the inertia moments change their values in both cases.
Despite all the above-mentioned difficulties, the quadrotor position control is required to be of high quality, especially if the indoor missions are considered. To solve such a complicated problem, the control system is usually implemented in accordance with the cascade principle [3] with the inner attitude and outer position loops. The plants of both have their own parameter uncertainties to be compensated. The scope of this research is the position loop only, but interested readers are referred to [4], in which a short review of the existing methods of attitude adaptive control is given, and an effective solution of such problem is proposed.
As far as the trajectory tracking is concerned, the following different techniques have been applied to solve the control problem under consideration: the conventional PID-control [5], LQ-regulator [5,6], backstepping [7], and feedback linearization [8]. However, all of them require the plant dynamics (model parameters) to be known a priori.
To relax such a strict requirement, two solutions should be mentioned. The first is an active disturbance rejection control [9], which is based on the idea of the disturbance (uncertainty) estimation with the help of the extended state observer (ESO). It is a linear high-gain observer, in which the large gain may induce the well-known peaking phenomenon. So, such a scheme is not considered further.
Secondly, the adaptive control techniques can be applied. In particular, the model reference adaptive control (MRAC) with the conventional adaptive law of the form of Γ e r e f P B x is used in [10]. Unfortunately, it is not explained why the baseline controller is chosen as the PD instead of k x x + k r r , as well as the absence of the σ or e robust modification in the adaptive law to compensate for the parameter uncertainty, given that its basis functions are approximated using the set of radial-basis ones. The presented stability analysis of such a scheme holds only in case the uncertainty parameters are time invariant.
As the parameter uncertainty is both time varying and nonlinear, it is worth applying the universal approximators such as neural networks (NN) to compensate for it. NN is used to implement compensators in the MRAC-like robust scheme with a PD-based baseline controller [11] and the schemes of backstepping in [12,13]. The solution from [11] requires the known upper bounds of some constituent parts of the parameter uncertainty, while the application of backstepping results in complex controllers of a high dynamic order. Concerning NNs in [11,12,13], the input vector components are chosen without analysis of what signals form the uncertainty to be compensated, and only the output layer parameters are adjusted. So, the higher number of hidden neurons is to be chosen to obtain the required approximation error [14]. More detailed analyses of NN applications to solve the problem under consideration can be found in recent studies [15,16]. As for [15,16], the proposed solutions use the PI/PID controller with time-invariant parameters as a baseline, which contradicts the classical principles of MRAC system synthesis.
All in all, despite a great number of studies on the application of MRAC + NN to control quadrotor trajectory, all of them contains at least some of the following disadvantages:
(d1)
A simplified model of the quadrotor with time-invariant parameters of mass and inertia moments is used.
(d2)
The explicit equation of the parameter uncertainty to be compensated is not presented, as well as the numerical simulation results, which compare the outputs of the proposed compensator with the ones of such equation. This makes it difficult to understand to what extent the uncertainty has been compensated.
(d3)
The baseline controller is chosen arbitrarily but not as a result of the synthesis procedure.
(d4)
Baseline controller has time-invariant parameters, and plant parameterization, under which it can also be adjustable, is not considered.
This study fills this gap and designs the direct MRAC schemes with the NN-based compensator of matched parameter uncertainty for a quadrotor, which are free from the above-mentioned disadvantages.
The following contributions of this study are that simultaneously:
(c1)
The explicit equations of the parameter uncertainty are derived for the quadrotor trajectory tracking problem for two cases, when control signals are (i) used and (ii) not used in the parametric uncertainty parameterization;
(c2)
Using (c1), the application of NN-based uncertainty compensator and signals included into its input vector are justified;
(c3)
The MRAC-based schemes are implemented with the baseline controller of a k x x + k r r type (in two variants, with time-invariant and adjustable k x and k r ) and the NN-based compensator of the paramtric uncertainty, in which the parameters of both output and hidden layers are adjusted in real time.

2. Problem Statement and Methods

2.1. Mathematical Model of Quadrotor

Schematic representation of quadrotor is shown in Figure 1.
The equations of the quadrotor model [8] are written as follows (the brackets and the time argument are omitted whenever it is clear from the context and causes no confusion):
p ¯ ˙ = V ¯ = R V b , ξ ˙ = ω = J ω b , V ˙ b = 1 m F e x t ω b × V b , I b ω ˙ b = τ e x t ω b × I b ω b ,
where m is a plant mass, ψ is roll, θ stands for pitch, and ϕ denotes yaw, ξ = ψ , θ , ϕ T . The center of gravity of the quadrotor is the origin of a body-fixed frame (BFF). Considering the inertial frame (IF), p ¯ = X , Y , Z is the coordinate vector, V ¯ = X ˙ Y ˙ Z ˙ T stands for the linear velocity, and w = ψ ˙ θ ˙ ϕ ˙ T is the plant angular velocity. As far as the BFF is concerned, the same vectors are denoted as p ¯ b = ( x , y , z ) , V b = u , v , w T , and ω b = p , q , r T , respectively. F e x t = f x , f y , f z T and τ e x t = τ ψ , τ θ , τ ϕ T are the external forces and moments in BFF. τ ψ , τ θ , τ ϕ are calculated via ψ , θ , and ϕ controllers. R and J are transformation matrices, which are expressed as follows:
R = c θ c ϕ s ψ s θ c ϕ c ψ s ϕ c ψ s θ c ϕ + s ψ s ϕ c θ s ϕ s ψ s θ s ϕ + c ψ c ϕ c ψ s θ s ϕ s ψ c ϕ s θ s ψ c θ c ψ c θ , J = 1 s ψ t θ c ψ t θ 0 c ψ s ψ 0 s ψ / c θ c ψ / c θ ,
Hereafter, s ( . ) stands for s i n ( . ) , c ( . ) is c o s ( . ) , and t ( . ) denotes t a n ( . ) .
Assumption 1.
The inertia matrix I b = d i a g ( J x , J y , J z ) is diagonal as the quadrotor is symmetric in x B z B and y B z B planes.
Taking into account the drag force [8], Equation (1) are represented as follows:
X ˙ Y ˙ Z ˙ = R u v w , ψ ˙ θ ˙ ϕ ˙ = J p q r ,
p ˙ q ˙ r ˙ = q r J y J z J x p r J z J x J y p q J x J y J z + q ω r J r J x ψ ˙ k ψ l J x s ψ K c J x p ω r J r J y θ ˙ k θ l J y s θ K c J y ϕ ˙ k ϕ l J z Δ p q r + τ ψ J x τ θ J y τ ϕ J z ,
u ˙ v ˙ w ˙ = r v q w p w r u q u p v + g s θ g c θ s ψ g c θ c ψ + T m + C x ρ S x u u 2 m C y ρ S y v v 2 m C z ρ S z w w 2 m Δ u v w ,
where K c , C x , C y , C z stand for aerodynamic coefficients; k ψ , k θ are the air resistance parameters; ρ is the air density; S x , S y , S z are the quadrotor surface areas corresponding to the respective axis; l is the distance between the gravity center and each motor rotor; J r stands for the inertia moment of the motor; and ω r is the total motors speed.
The output of the altitude controller is T. Lift forces T 1 , T 2 , T 3 , T 4 and rotational torques M 1 , M 2 , M 3 , M 4 are calculated from T , τ ψ , τ θ , τ ϕ , using (13) from [8].

2.2. Trajectory Tracking Control Problem

Assumption 2.
The parameters values of the quadrotor model are unknown. p , q , r and u , v , w are measurable. Then, all other states from (3)–(5) can be obtained.
Assumption 3.
The inner attitude control system that includes ψ , θ , ϕ loops has already been designed using the approach from [4]. That means that despite (a) J x , J y , J z being unknown and time-varying and (b) having Δ p q r in (4) and J in the right-hand part of (3), the NN-based compensator of the the inner loop proposed in [4] ensures ψ , θ , ϕ control quality defined using the attitude reference model.
Goal. To solve the quadrotor trajectory tracking problem via development of MRAC system (green blocks in Figure 2), which is capable of the parameter uncertainty (see (5)) compensation and ensures asymptotic convergence of the tracking error to a compact set.

3. Main Result

3.1. Uncertainty Parameterization

The left equation in (3) is differentiated with regards to time. The substitution of (5) into the obtained result yields the following:
X ¨ Y ¨ Z ¨ = R u ˙ v ˙ w ˙ + R ˙ u v s . w = R 0 0 1 T m + R r v q w p w r u q u p v + R g s θ g c θ s ψ g c θ c ψ + R Δ u v w + R ˙ u v s . w = c ψ s θ c ϕ + s ψ s ϕ c ψ s θ s ϕ s ψ c ϕ c ψ c θ T m + R r v q w p w r u q u p v + R g s θ g c θ s ψ g c θ c ψ + R Δ u v w + R ˙ u v s . w Δ ¯ ,
R ˙ = R · 0 r q r 0 p q p 0 .
The actual quadrotor mass m is unknown and time varying. However, we know its nominal constant value m ¯ .
The aim is to ensure trajectory tracking independent of the value of the yaw angle, i.e., X d , Y d , Z d are to be followed with the required quality at any value of ϕ . To achieve this, the differential flatness approach [17] is applied:
θ d = arctan 1 u z u x c ϕ + u y s ϕ , ψ d = arctan c θ 1 u z u x s ϕ u y c ϕ , T = m ¯ u z c ψ c θ .
This equation is actually an implementation of the converter block in Figure 2.
Further parameterization depends on whether u x , u y , u z is directly used in the uncertainty parameterization or not.

3.1.1. Case I: Control Signals Are Directly Used in Uncertainty Parameterization

Equation (6) is rewritten as follows:
X ¨ Y ¨ Z ¨ = c ψ s θ c ϕ + s ψ s ϕ c ψ s θ s ϕ s ψ c ϕ c ψ c θ T m ¯ + c ψ s θ c ϕ + s ψ s ϕ c ψ s θ s ϕ s ψ c ϕ c ψ c θ T m ¯ m m m ¯ + Δ ¯ .
The only constant in (8) is m ¯ .
Equation (7) is substituted into (8) to obtain the following:
X ¨ Y ¨ Z ¨ = u x u y u z + u x u y u z m ¯ m m + Δ ¯ Δ .
Therefore, Δ = Δ X Δ Y Δ Z T R 3 × 1 is the matched parameter uncertainty to be compensated. It can be rewritten as a linear regression with unknown m (time varying), ρ , C x , C y , C z , S x , S y , S z and measurable nonlinear regressor, which depends on the twelve known signals u x , u y , u z , ψ , θ , ϕ , u , v , w , p , q , r . The obtained parameterization has two main features: (i) the input gain matrix is known and identified, and (ii) control signals u x , u y , u z are included into uncertainty parameterization.

3.1.2. Case II: Control Signals Are Not Used in Uncertainty Parameterization

Equation (6) is used, and (7) is substituted into it as follows:
X ¨ Y ¨ Z ¨ = u x u y u z m ¯ m + Δ ¯ = m ¯ m u x u y u z + Δ .
In such a case, the matched parametric uncertainty again can be rewritten as a linear regression, but its regressor does not include u x , u y , u z . Furthermore, the system input gain matrix is unknown and represented as a diagonal matrix with m ¯ m on its main diagonal.

3.1.3. Representation of Plant in State-Space Form

The state x p = X , X ˙ , Y , Y ˙ , Z , Z ˙ T and control vectors u = u x , u y , u z T are introduced. Then, (9) and (10) are written as follows:
x ˙ p ( t ) = A p x p ( t ) + B p u ( t ) + Δ ( t ) , u ( t ) = u b a s e ( t ) u a d ( t ) ,
Considering (9), we have the following:
B p = I 3 × 3 0 1 , A p = I 3 × 3 0 1 0 0 ,
while it is written for (10) as follows:
B p = m ¯ m Λ I 3 × 3 0 1 , A p = I 3 × 3 0 1 0 0 .
Λ is unknown. So, in the course of the following design, we will need u b a s e with time-invariant parameters for Case I and adjustable ones for Case II.
The uncertainty Δ is time varying and has a nonlinear regressor for both cases, so NN is chosen to compensate for it.

3.2. MRAC System with NN-Based Compensator

3.2.1. Reference Model

The reference model is introduced as follows:
x ˙ r e f ( t ) = A r e f x r e f ( t ) + B r e f r t , B r e f = I 3 × 3 0 w n 2 , A r e f = I 3 × 3 0 1 a 0 a 1 .
The following definitions are used: x r e f = X r e f , X ˙ r e f , Y r e f , Y ˙ r e f , Z r e f , Z ˙ r e f T , r = X d Y d Z d T , a 0 = w n 2 , a 1 = 2 g w n , g > 0 , w n > 0 .

3.2.2. Plant Representation: Neural Network Description

According to the proved approximation theorem [18], Δ is expressed as NN with the sigmoid activation function σ in the hidden layer (Figure 3) as follows:
x ˙ p = A p x p + B p u + W T σ V T x n n Δ + ε , ε ε N , x n n = b 1 ξ T ( V b ) T ( ω b ) T T , σ V T x n n = b 2 σ 1 σ 2 σ N 2 T , V = β 1 1 β N 2 1 ω 1 , 1 1 ω 1 , N 2 1 ω N 1 , 1 1 ω N 1 , N 2 1 , W = β 1 2 β N 3 2 ω 1 , 1 2 ω 1 , N 3 2 ω N 2 , 1 2 ω N 2 , N 3 2 .
On the basis of the obtained parameter uncertainty parameterization, it is concluded that the number of the input neurons is N 1 = 12 for Case I ( x n n = { u x , u y , u z , ψ , θ , ϕ , u , v , w , p , q , r } ) and N 1 = 9 ( x n n = { ψ , θ , ϕ , u , v , w , p , q , r } ) for Case II, the number of output neurons is N 3 = 3 for both cases. ε is the approximation error, of which its upper bound ( ε ε N ) can be estimated using the results from [14]. It depends on the number of hidden neurons N 2 , the higher N 2 , and the lower ε N .
As ideal weights W and V are unknown, NN with adjustable parameters V ^ and W ^ is introduced as follows [19]:
Δ ^ = W ^ T σ V ^ T x n n ,
The adaptive laws for V ^ and W ^ will be derived further for both cases under consideration.

3.2.3. MRAC System Design

The approximation and estimation errors are introduced as follows:
e = Δ Δ ε , V ˜ = V ^ V , W ˜ = W ^ W .
The dependence of e from V is nonlinear due to the sigmoid function, so it is linearized via the Taylor series expansion as follows:
σ V T x n n = σ V ^ T x n n σ ˙ V ^ T x n n V ˜ T x n n + O V ˜ T x n n 2 .
Introducing e r e f = x p x r e f , the error equation is obtained as follows:
e ˙ r e f = A r e f e r e f + B p u a d + Δ + ε + B p u b a s e B p B r e f r + B p A p A r e f x p ,
The summands of u ( t ) are chosen as follows:
u b a s e = B p A p A r e f k x x p + B p B r e f k r r , u a d = Δ ^ .
As far as Case I is concerned, k x and k r can be directly calculated, while in Case II, this is impossible as B p is unknown. In such a case, the baseline control law with adjustable parameters should be introduced as follows:
u b a s e = k ^ x x p + k ^ r r = θ ^ T ω , θ ^ T = k ^ x k ^ r , ω = x p r .
As for Case I, the substitution of (15) and (17) into (16) yields the following:
e ˙ r e f = A r e f e r e f + B p ( W T O V ˜ T x n n 2 + ε ) δ B p W ˜ T σ V ^ T x n n σ ˙ V ^ T x n n V ^ T x n n B p W ^ T σ ˙ V ^ T x n n V ˜ T x n n .
Considering Case II, we have the following:
e ˙ r e f = A r e f e r e f + B p θ ˜ T ω + B p ( W T O V ˜ T x n n 2 + ε ) δ B p W ˜ T σ V ^ T x n n σ ˙ V ^ T x n n V ^ T x n n B p W ^ T σ ˙ V ^ T x n n V ˜ T x n n .
We are in a position to present the main result of this research.
If the parametric uncertainty parameterization includes the control signals u x , u y , u z , the following theorem holds.
Theorem 1.
Let the laws to adjust NN parameters be defined as follows:
V ^ ˙ = Γ v x n n e r e f T P B p W ^ T σ ˙ V ^ T x n n σ v V ^ , W ^ ˙ = Γ w σ V ^ T x n n σ ˙ V ^ T x n n V ^ T x n n e r e f T P B p σ w W ^ ,
where Γ v and Γ w are adaptive gains, σ v and σ w are σ-modification parameters, P = P T > 0 stands for the solution of the Lyapunov equation A r e f T P + P A r e f = Q , the augmented error ζ = e r e f T ve c T W ˜ ve c T V ˜ T is uniformly ultimately bounded ( ζ UUB) with the ultimate bound R, and e r e f asymptotically converges to a compact set with the bound R 1 .
Proof. 
The proof of this theorem, as well as the definitions of R and R 1 are presented in Appendix A. □
If the uncertainty parameterization does not include control signals u x , u y , u z , the following theorem holds. It is noted that in such a case, we can write B p = m ¯ m I 3 × 3 0 1 . Furthermore, s g n ( m ¯ m ) = 1 , so we understand s g n ( B p ) as B s g n = I 3 × 3 0 1 .
Theorem 2.
Let the laws to adjust the baseline controller and NN parameters be defined as follows:
θ ^ ˙ = Γ θ ω e r e f T P B s g n σ θ θ ^ , V ^ ˙ = Γ v x n n e r e f T P B s g n W ^ T σ ˙ V ^ T x n n σ v V ^ , W ^ ˙ = Γ w σ V ^ T x n n σ ˙ V ^ T x n n V ^ T x n n e r e f T P B s g n σ w W ^ ,
where Γ θ , Γ v , and Γ w are adaptive gains, σ θ , σ v , and σ w are σ-modification parameters, P = P T > 0 stands for the solution of the Lyapunov equation A r e f T P + P A r e f = Q , the augmented error ζ = e r e f T ve c T θ ˜ ve c T W ˜ ve c T V ˜ T is uniformly ultimately bounded ( ζ UUB) with the ultimate bound R 2 , and e r e f asymptotically converges to a compact set with the bound R 3 .
Proof. 
The proof of this theorem, as well as the definitions of R 2 and R 3 , are verbatim to the one of Theorem 1 up to one additional term in the Lyapunov function for the baseline controller adjustable parameters. □

3.3. Numerical Experiments and Discussion

On the basis of Equations (1)–(5), the mathematical model of the quadrotor has been implemented in Matlab/Simulink. The values of its parameters were taken from the Parrot Mambo model [4]. The Euler solver with fixed step of 0.001 s was used for modeling.
The same flight plan was chosen for all experiments as follows: (1) to take-off to reach 1.1 m height (from start to fifth second), (2) to track Gerono lemniscate (figure eight) trajectory from the 5th to 65th seconds: X d = cos ( 2 π t ) and Y d = sin ( 4 π t ) .
The parameters of MRAC system were picked as follows: ω n = 5 , γ = 1.1 , N 2 = 50 , V ^ 0 = 0 N 1 × N 2 , σ v = 10 3 , σ w = 10 3 , Γ w = 10 · I 3 × 3 , Γ v = 10 8 I 50 × 50 , W ^ 0 = 0 N 2 × N 3 , k x = I 3 × 3 25 11 , k r = d i a g ( 25 , 25 , 25 ) , Q = I 6 × 6 , C x = C y = C z = 1 , ρ = 1.184 kg/m 3 , K c = 0.001 , k r e s = 0.001 , l = 0.0441 m, k ψ = k θ = 2 · k r e s · l 2 , k ϕ = k r e s · l . For Case II, we first picked σ θ = 10 5 , Γ θ = 100 · I 9 × 9 , k ^ x ( 0 ) = 0 3 × 6 , k ^ r ( 0 ) = 0 3 × 3 .
The known nominal values of inertia matrix and mass were chosen as follows: m ¯ = 0.063 2 kg, I ¯ b = d i a g ( 5.8 2 ) × 10 5 , ( 7.17 2 ) × 10 5 , ( 1 2 ) × 10 4 kg·m 2 .
The quadrotor model parameters were unknown to the control system. In the course of experiments, I b and m were changed as follows: (1) from start to 35th second I b = d i a g ( 1.2 × 5.8 × 10 5 , 1.2 × 7.17 × 10 5 , 1.2 × 1 × 10 4 kg·m 2 , m = 1.2 × 0.063 kg; (2) from 35th to 65th second I b = d i a g ( 5.8 × 10 5 , 7.17 × 10 5 , 10 4 kg·m 2 , m = 0.063 kg.
The attitude control was implemented in accordance with [4], while the system proposed in this study was used for the position control. The yaw setpoint was picked as ϕ d = π 4 .
Here, we outline some comments on how the parameters for simulation were chosen. All quadrotor (and environment) parameters were taken from [4]. The initial values of the controllers parameters were picked as zeros. The reference model parameters were calculated on the basis of the analysis of the results from [4]. The values of k x and k r were calculated using (17). The value of ϕ d could be chosen arbitrarily. The values of the adaptive gains and σ -modification parameters were picked as a result of trial and error. The trade-off of such a choice was as follows. The only way to reduce the tracking error was to increase the adaptive gain. However, this increased the L 2 norm of the control signal, which resulted in oscillations [20]. Therefore, the trade-off was between the values of the tracking error and the oscillations. The values of the σ -modification parameters were in relation with the corresponding adaptive gain. Their lower values resulted in higher controller parameters and lower tracking error, and vice versa (please see proof of Theorem 1 in Appendix A).
The experiments were conducted as follows: (i) with NN-based compensator designed for Case I, (ii) with NN-based compensator designed for Case II, and (iii) without compensator.
First of all, the compensator for Case I was compared with the system without uncertainty compensation. Figure 4 presents the behavior of the Z coordinate. When the control system included only the baseline controller, a steady-state error occurred due to the difference between m and m ¯ . Such an error was compensated for in the neural network.
The behavior of the coordinates X and Y is shown in Figure 5. The sum of squared errors (the difference between the reference figure-eight trajectory and the real one) was reduced via the NN-compensator by 16.62 % .
The obtained equation for Δ was used to calculate the ideal value of the parameter uncertainty for each time instant of the numerical experiment. Figure 6 illustrates that the NN-based compensator approximated such ideal Δ and verifies the theoretical result. The fact that m value was changed at 35th second did not cause the peaking phenomenon.
As the attitude control loop synthesized according to [4] also included its own NN-based compensator, Figure 7 and Figure 8 are shown to demonstrate that the parameter uncertainty of the ψ , θ , ϕ control loops and J x , J y , J z switch were approximated as well.
The same results were obtained using different values of ϕ d . Therefore, the validity of the converter synthesis procedure for Case I was corroborated.
The next step was to test the adaptive control system designed for Case II. As mentioned above, the initial values of the baseline controller adjustable parameters were zeros. The results obtained for X , Y , Z coordinates are shown in Figure 9 and Figure 10.
They demonstrate that the adaptive control system stabilized the quadrotor and ensured the asymptotic convergence of the tracking error to a compact set, but the steady-state error is too large for X and Y. This was mainly caused by the fact that both baseline and adaptive controllers started from zero initial conditions, which did not provide stabilization. Figures with curves to assess the quality of uncertainty approximation are not provided, as in Case II, both the NN and baseline controller have adjustable parameters and try to compensate for the whole uncertainty together.
To overcome the above-faced problems, it was decided to set the baseline controller initial conditions equal to the parameters of the characteristic polynomial of the reference model as follows: k ^ x ( 0 ) = I 3 × 3 25 11 , k ^ r ( 0 ) = d i a g ( 25 , 25 , 25 ) . Furthermore, the experiment was conducted again. The obtained results are shown in Figure 11 and Figure 12.
Having compared the curves in Figure 4 and Figure 11 and Figure 5 and Figure 12, it was concluded that the design in Case I provided better transients quality for X, Y, and Z coordinates, so the additional adjustment of the baseline controller did not allow us to obtain any advantages. Therefore, according to the conducted experiments, Case I parameterization is a better solution to the stated problem.
Additionally, the system on basis of the PID-controllers of altitude, attitude and position from the “ParrotMinidroneHover” project was also applied to the implemented quadrotor model. The comparison of such system with the proposed solution is considered to be fair, as we did not adjust PID-controllers by ourselves but used the best values of their parameters obtained by the developers of the “ParrotMinidroneHover” project, which were successfully applied not only for simulations but also for a real minidrone. The obtained behavior of the X, Y, and Z coordinates are shown in Figure 13 and Figure 14. It was concluded that compared to the MRAC system with NN, the PID-controllers were not capable of the parameter uncertainty compensation.
Furthermore, finally, we compared the proposed solution with the ones in which the PI/PD/PID controllers with time-invariant parameters are used as the baseline. To do so, the scheme with PID-controllers was augmented with the NN-based compensator for the X, Y, and Z coordinates. The obtained results are presented in Figure 15 and Figure 16. Comparing Figure 5, Figure 14 and Figure 16, it was concluded that the simple combination of some PID-controllers and NN-based compensators did not allow us to obtain results similar to the proposed solution. So, as in [15,16], the PID-controllers has to be adjusted via a trial and error procedure. Therefore, in [15,16], such a procedure is much more difficult, as both baseline controller parameters and adaptive controller hyperparameters have to be chosen manually. As far as the proposed solution (Case I) is concerned, the baseline controller parameters are calculated using mathematically-sound equations, and only adaptive gains and σ -modification parameters for NN are chosen via trial and error.
All in all, the obtained experimental results can be summarized as follows:
(r1)
Despite the fact that systems designed on the basis of Case I and Case II parameterizations had the same theoretical properties, the MRAC system with the NN-compesator with time-invariant baseline controller parameters allowed us to obtain better results in comparison with the adjustable baseline controller.
(r2)
The system on the basis of PID-controllers was not able to fully compensate for the parametric uncertainty.
(r3)
The simple combination of the PID-based control system with the NN-based compensator did not allow us to obtain the same results as the proposed approach. So, the baseline controller should be derived on the basis of the MRAC design procedure and have the form k x x + k r r . Moreover, as far as Case I parameterization is considered, the values of k x and k r can be directly computed.

4. Conclusions

Considering the trajectory tracking problem, the equations of the quadrotor parameter uncertainty were obtained for two cases: control signals are (i) used and (ii) not used in the parametric uncertainty parameterization. Their analysis allowed us to choose the following: (1) the neural networks to implement the compensators of Case I and Case II uncertainties and (2) the signals to form their input layers. For Case I, the adaptive laws of NN parameters were derived. For Case II, the adaptive laws of both NN and baseline controller parameters were obtained, which ensured that the augmented tracking error was UUB. The conducted experiments demonstrated that, as far as transient quality is concerned, Case I parameterization seems to be a better solution.
The main problem of the proposed solution is the above-mentioned trade-off between the value of the augmented error and the transient quality, particularly, oscillations. The scope of further research is to (i) derive a parameterization of the quadrotor to apply an identification-based adaptive control and some extension and mixing schemes to ensure some transient quality, (ii) guarantee exponential convergence instead of the assymptotic one, and (iii) consider cases when the quadrotor dynamics is affected by the wind and gravity center displacement due to load.

Author Contributions

Conceptualization, A.G. and K.L.; methodology, A.G. and K.L.; software, K.L.; validation, K.L.; formal analysis, K.L.; investigation, A.G. and K.L.; writing original draft preparation, K.L.; writing—review and editing, A.G. and K.L.; visualization, K.L.; supervision, A.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by the Grants Council of the President of the Russian Federation (project MD-1787.2022.4).

Data Availability Statement

Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof. 
Let a Lyapunov function candidate be chosen as follows:
L e r e f , V ˜ , W ˜ = e r e f T P e r e f + tr V ˜ T Γ v 1 V ˜ + tr W ˜ T Γ w 1 W ˜ , A r e f T P + P A r e f = Q , λ m ζ 2 V ζ λ M ζ 2 , λ m = min λ min P , λ min Γ w 1 , λ min Γ v 1 , λ M = max λ max P , λ max Γ w 1 , λ max Γ v 1
Its derivative is written as follows:
L ˙ = e r e f T Q e r e f + 2 tr W ˜ T σ V ^ T x n n σ ˙ V ^ T x n n V ^ T x n n × e r e f T P B p + Γ w 1 W ˜ ˙ + 2 tr V ˜ T x n n e r e f T P B p W ^ T σ ˙ V ^ T x n n + Γ v 1 V ˜ ˙ + 2 e r e f T P B p δ .
Substituting (21) into (A2), it is obtained as follows:
L ˙ λ min Q e r e f 2 + 2 λ max P B p e r e f δ 0 2 B p σ v V ˜ 2 + 2 B p σ v V ˜ V 0 2 B p σ w W ˜ 2 + 2 B p σ w W ˜ W 0 , δ 0 = max δ , V 0 = V , W 0 = W .
The first two terms of the right hand side of (A3) are added to full square. Then, after routine operations, the upper bound of L ˙ is rewritten as follows:
L ˙ 1 2 λ min Q e r e f 2 + 2 B p σ v γ 1 V ˜ 2 + 2 B p σ w γ 2 W ˜ 2 + 2 λ max 2 P B p 2 δ 0 2 λ min Q + B p σ v V 0 2 + B p σ w W 0 2 γ 3 κ L + γ 3 , κ = 1 2 λ M min λ min Q , γ 1 , γ 2
Applying the comparison lemma, the inequality is solved as follows:
ζ t λ M λ m e κ t ζ 0 + γ 3 κ λ m = R .
So, ζ t is uniformly ultimately bounded. To prove the asymptotic convergence of the tracking error e r e f , we use (A5) and take into account the definitions of γ 2 and t . As a result, the upper bound of the tracking error is obtained as follows:
e r e f 2 λ max P λ min Q λ min P γ 3 = R 1
This completes the proof. □

References

  1. Glushchenko, A.; Lastochkin, K. Neural Network Based Parameter Uncertainty Compensation to Solve Quadrotor Trajectory Tracking Problem. In Proceedings of the 2022 4th International Conference on Control Systems, Mathematical Modeling, Automation and Energy Efficiency (SUMMA), Lipetsk, Russia, 9–11 November 2022; pp. 443–448. [Google Scholar]
  2. Nascimento, T.P.; Saska, M. Position and attitude control of multi-rotor aerial vehicles: A survey. Annu. Rev. Control 2019, 48, 129–146. [Google Scholar] [CrossRef]
  3. Fan, F.; Lin, M.; Ding, R.; Zheng, Z.; Liu, Y. Augmented-MRAC for quadrotor UAVs with parameter change. In Proceedings of the 2016 IEEE International Conference on Advanced Intelligent Mechatronics (AIM), Banff, AB, Canada, 12–15 July 2016; pp. 501–506. [Google Scholar]
  4. Glushchenko, A.; Lastochkin, K. Neural Network-based Direct Model Reference Adaptive Control of Quadrotor Attitude. In Proceedings of the 2022 16th International Conference on Stability and Oscillations of Nonlinear Control Systems (Pyatnitskiy’s Conference), Moscow, Russia, 1–3 June 2022; pp. 1–4. [Google Scholar]
  5. Liu, C.; Pan, J.; Chang, Y. PID and LQR trajectory tracking control for an unmanned quadrotor helicopter: Experimental studies. In Proceedings of the 2016 35th Chinese Control Conference (CCC), Chengdu, China, 27–29 July 2016; pp. 10845–10850. [Google Scholar]
  6. Voos, H. Nonlinear state-dependent Riccati equation control of a quadrotor UAV. In Proceedings of the 2006 IEEE International Conference on Control Applications, Munich, Germany, 4–6 October 2006; pp. 2547–2552. [Google Scholar]
  7. Madani, T.; Benallegue, A. Control of a quadrotor mini-helicopter via full state backstepping technique. In Proceedings of the 45th IEEE Conference on Decision and Control, San Diego, CA, USA, 13–15 December 2006; pp. 1515–1520. [Google Scholar]
  8. Altug, E.; Ostrowski, J.P.; Taylor, C.J. Control of a quadrotor helicopter using dual camera visual feedback. Int. J. Robot. Res. 2005, 24, 329–341. [Google Scholar] [CrossRef]
  9. Zhang, Y.; Chen, Z.; Zhang, X.; Sun, Q.; Sun, M. A novel control scheme for quadrotor UAV based upon active disturbance rejection control. Aerosp. Sci. Technol. 2018, 79, 601–609. [Google Scholar] [CrossRef]
  10. Abdul Ghaffar, A.F.; Richardson, T.S. Position tracking of an underactuated quadrotor using model reference adaptive control. In Proceedings of the AIAA Guidance, Navigation, and Control Conference, AIAA SciTech, San Diego, CA, USA, 4–8 January 2016; pp. 1–13. [Google Scholar]
  11. Jin, X.Z.; He, T.; Wu, X.M.; Wang, H.; Chi, J. Robust adaptive neural network-based compensation control of a class of quadrotor aircrafts. J. Frankl. Inst. 2020, 357, 12241–12263. [Google Scholar] [CrossRef]
  12. Dierks, T.; Jagannathan, S. Output feedback control of a quadrotor UAV using neural networks. IEEE Trans. Neural Netw. 2010, 21, 50–66. [Google Scholar] [CrossRef] [PubMed]
  13. Wang, M.; Chen, B.; Lin, C. Fixed-time Backstepping control of quadrotor trajectory tracking based on neural network. IEEE Access 2020, 8, 177092–177099. [Google Scholar] [CrossRef]
  14. Barron, A.R. Universal approximation bounds for superpositions of a sigmoidal function. IEEE Trans. Inf. Theory 1993, 39, 930–945. [Google Scholar] [CrossRef]
  15. Lopez-Sanchez, I.; Perez-Alcocer, R.; Moreno-Valenzuela, J. Trajectory tracking double two-loop adaptive neural network control for a Quadrotor. J. Frankl. Inst. 2023, 360, 3770–3799. [Google Scholar] [CrossRef]
  16. Doukhi, O.; Lee, D.J. Neural network-based robust adaptive certainty equivalent controller for quadrotor UAV with unknown disturbances. Int. J. Control Autom. Syst. 2019, 17, 2365–2374. [Google Scholar] [CrossRef]
  17. Choutri, K.; Lagha, M.; Dala, L.; Lipatov, M. Quadrotors trajectory tracking using a differential flatness-quaternion based approach. In Proceedings of the 2017 7th International Conference on Modeling, Simulation, and Applied Optimization (ICMSAO), Sharjah, United Arab Emirates, 4–6 April 2017; pp. 1–5. [Google Scholar]
  18. Funahashi, K.; Nakamura, Y. Approximation of Dynamical Systems by Continuous Time Recurrent Neural Networks. Neural Netw. 1993, 6, 801–806. [Google Scholar] [CrossRef]
  19. Nguyen, N.T. Model-Reference Adaptive Control—A Primer; Springer International Publishing: London, UK, 2018. [Google Scholar]
  20. Na, J.; Herrmann, G.; Zhang, K. Improving transient performance of adaptive control via a modified reference model and novel adaptation. Int. J. Robust Nonlinear Control 2017, 27, 1351–1372. [Google Scholar] [CrossRef]
Figure 1. Schematic representation of quadrotor.
Figure 1. Schematic representation of quadrotor.
Computation 11 00163 g001
Figure 2. Block scheme of quadrotor control system.
Figure 2. Block scheme of quadrotor control system.
Computation 11 00163 g002
Figure 3. Neural network structure.
Figure 3. Neural network structure.
Computation 11 00163 g003
Figure 4. Transients of Z for control systems with (Case I) and without NN-based compensator.
Figure 4. Transients of Z for control systems with (Case I) and without NN-based compensator.
Computation 11 00163 g004
Figure 5. Transients of X and Y for control systems with (Case I) and without NN-based compensator.
Figure 5. Transients of X and Y for control systems with (Case I) and without NN-based compensator.
Computation 11 00163 g005
Figure 6. Approximation of the ideal parameter uncertainty in neural network for X , Y , Z .
Figure 6. Approximation of the ideal parameter uncertainty in neural network for X , Y , Z .
Computation 11 00163 g006
Figure 7. Approximation of the ideal parameter uncertainty by neural network for ψ , θ , ϕ .
Figure 7. Approximation of the ideal parameter uncertainty by neural network for ψ , θ , ϕ .
Computation 11 00163 g007
Figure 8. The behavior of ψ , θ , ϕ for MRAC system with NN.
Figure 8. The behavior of ψ , θ , ϕ for MRAC system with NN.
Computation 11 00163 g008
Figure 9. Transients of Z coordinate for control system with NN-based compensator for Case II using baseline controller with zero initial conditions.
Figure 9. Transients of Z coordinate for control system with NN-based compensator for Case II using baseline controller with zero initial conditions.
Computation 11 00163 g009
Figure 10. Transients of X and Y coordinates for control system with NN-based compensator for Case II using baseline controller with zero initial conditions.
Figure 10. Transients of X and Y coordinates for control system with NN-based compensator for Case II using baseline controller with zero initial conditions.
Computation 11 00163 g010
Figure 11. Transients of Z coordinate for control system with NN-based compensator for Case II using baseline controller with non-zero initial conditions.
Figure 11. Transients of Z coordinate for control system with NN-based compensator for Case II using baseline controller with non-zero initial conditions.
Computation 11 00163 g011
Figure 12. Transients of X and Y coordinates for control system with NN-based compensator for Case II using baseline controller with non-zero initial conditions.
Figure 12. Transients of X and Y coordinates for control system with NN-based compensator for Case II using baseline controller with non-zero initial conditions.
Computation 11 00163 g012
Figure 13. Transients of Z coordinate for PID-based control system.
Figure 13. Transients of Z coordinate for PID-based control system.
Computation 11 00163 g013
Figure 14. Transients of X and Y coordinates for PID-based control system.
Figure 14. Transients of X and Y coordinates for PID-based control system.
Computation 11 00163 g014
Figure 15. Transients of Z coordinate for PID+NN control system.
Figure 15. Transients of Z coordinate for PID+NN control system.
Computation 11 00163 g015
Figure 16. Transients of X and Y coordinates for PID+NN control system.
Figure 16. Transients of X and Y coordinates for PID+NN control system.
Computation 11 00163 g016
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Glushchenko, A.; Lastochkin, K. Quadrotor Trajectory Tracking Using Model Reference Adaptive Control, Neural Network-Based Parameter Uncertainty Compensator, and Different Plant Parameterizations. Computation 2023, 11, 163. https://doi.org/10.3390/computation11080163

AMA Style

Glushchenko A, Lastochkin K. Quadrotor Trajectory Tracking Using Model Reference Adaptive Control, Neural Network-Based Parameter Uncertainty Compensator, and Different Plant Parameterizations. Computation. 2023; 11(8):163. https://doi.org/10.3390/computation11080163

Chicago/Turabian Style

Glushchenko, Anton, and Konstantin Lastochkin. 2023. "Quadrotor Trajectory Tracking Using Model Reference Adaptive Control, Neural Network-Based Parameter Uncertainty Compensator, and Different Plant Parameterizations" Computation 11, no. 8: 163. https://doi.org/10.3390/computation11080163

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop