Next Article in Journal
Leveraging Cooperative Intent and Actuator Constraints for Safe Trajectory Planning of Autonomous Vehicles in Uncertain Traffic Scenarios
Previous Article in Journal
L1 Adaptive Fault-Tolerant Control for Nonlinear Systems Subject to Input Constraint and Multiple Faults
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Adaptive Fuzzy Fault-Tolerant Attitude Control for a Hypersonic Gliding Vehicle: A Policy-Iteration Approach

1
204 Unit, Xi’an Research Institute of High-Tech, Xi’an 710025, China
2
Engineering Training Center, Xi’an University of Science and Technology, Xi’an 710054, China
3
School of Automation, Northwestern Polytechnical University, Xi’an 710072, China
*
Author to whom correspondence should be addressed.
Actuators 2024, 13(7), 259; https://doi.org/10.3390/act13070259
Submission received: 7 June 2024 / Revised: 1 July 2024 / Accepted: 4 July 2024 / Published: 9 July 2024
(This article belongs to the Section Control Systems)

Abstract

:
In this paper, adaptive fuzzy fault-tolerant control (AFFTC) for the attitude control system of a hypersonic gliding vehicle (HGV) experiencing an actuator fault is proposed. Actuator faults of the HGV are considered with respect to its actual structure and actuator characteristics. The HGV’s attitude system is firstly represented by a T–S fuzzy model, and then a normal T–S fuzzy controller is designed. A reinforcement learning (RL)-based policy iterative solution algorithm is proposed for the solving of the T-S fuzzy controller. Then, based on the normal T–S controller, a fuzzy FTC controller is proposed in which the control matrices can improve themselves according to the special fault. An integral reinforcement learning (IRL)-based solving algorithm is proposed to reduce the dependence of the design methods on the HGV model. Simulations on three different kinds of actuator faults show that the designed IRL-based FTC can ensure a reliable flight by the HGV.

1. Introduction

A hypersonic gliding vehicle (HGV) is a vehicle with high flight speed and a large flight envelope, so its flight dynamics are really complex. As a basic element of aircraft control, the modeling and realization of HGV attitude control have received extensive attention in recent years. Various control methods are used to solve this problem, such as trajectory-linearization-based active disturbance rejection control [1], fuzzy logic-based adaptive control [2], reinforcement-learning-based nonlinear control [3], and so on. Linear parameter varying-based attitude control for the attitude system of an HGV is proposed in [4], and the sources of uncertainties in HGV control are analyzed, which are uncertainty of aerodynamic parameters and external disturbance. Robust nonlinear controllers for the attitude system of an HGV suffering from model disturbance have also received widespread attention.
In practice, faults may exist in the actuator of the HGV, and such failures necessarily reduce the control accuracy of the HGV, so FTC is an important research area in the attitude control of HGVs. FTC of HGVs has been widely studied in recent years. In ref. [5], actuator faults of an HGV’s attitude control system are considered, and adaptive FTC is proposed. In ref. [6], actuator faults of an HGV’s attitude control system are given in a more general model, and then FTC that can ensure the robustness of the system under actuator fault is given. In ref. [7], fixed-time-quantized fault-tolerant attitude control is presented for an HGV. In the above literature, the model of the HGV’s attitude system is assumed to be known, while in practice the nonlinear dynamics of the HGV may be unknown. In ref. [8], adaptive FTC is proposed for an HGV’s attitude system with an unknown inertial matrix and state constraints. In ref. [9], time-varying fault control of an HGV together with an adaptive FTC are proposed. Actuator faults and model uncertainties are considered synchronously and then sliding-mode-control-based FTC is proposed in ref. [10].
In the above literature, the specific form and occurrence time of the faults are all supposed to be already known, and then FTC is designed according to these known faults. This is unreasonable in practice, as the specific form and occurrence time of faults are difficult to obtain. In ref. [11], online computing-method-based FTC is proposed for the attitude control of HGV, but this method can deal with, only fixed-form faults. In ref. [12], iterative learning fault-tolerant control is proposed for time-varying industrial processes with actuator faults, and in ref. [13], stochastic actuator failures are considered for Markovian jump time-delayed systems. The FTC of the constrained system is solved by model predictive control in ref. [14]. The proposed controller design strategy can deal with some given faults but cannot adjust itself according to a specific fault online, so this is still an open problem. It should be noted that in order to facilitate practical use, the form of a given adaptive FTC should be as simple as possible.
The Takagi–Sugeno (T–S) fuzzy-based control design strategy can approach a nonlinear system with arbitrary accuracy, and the resulting T–S model is a combination of linear systems, so classical linear control methods can still be utilized by a parallel-distributed compensation (PDC) scheme [15]. T–S fuzzy technology is viewed as an efficient way for analyzing and designing the control of nonlinear systems and has been widely applied, such as type-2 T–S fuzzy-based tracking control of a saturation system [16] and T–S fuzzy control for semi-Markov jumps [17]. T–S fuzzy FTC of HGVs has also been utilized in [18]. In ref. [18], the upper and lower bounds of the faults are already known. How to design FTC for HGVs with faults of unknown specific form and unknown occurrence time is still an open problem.
Furthermore, in order to improve the self-learning ability of the FTC, an intelligent algorithm is needed. Commonly used intelligent algorithms include deep learning [19] and reinforcement learning (RL) [20]. The learning process of RL is similar to that of human beings. By interacting with the environment and updating the reward value in time, RL can obtain an optimal control law. RL does not need a specific model of the nonlinear system and can handle high-dimensional task scenarios, so it has been widely utilized by all trades and professions [21].
As a branch of RL, integral reinforcement learning (IRL) obtains its reinforcement signal by integrating the value function, and thereby can be utilized by unknown systems. Using only data from a completely unknown system, IRL can complete the assigned studying task, so it has been successfully applied to the optimal control of discrete-time multiagent systems [22], motion planning of autonomous vehicles [23], nuclear systems [24], and linear systems with input delay [25]. IRL-based fault-tolerant adaptive tracking control of Euler–Lagrange systems are also presented to improve the tracking performance of fault-tolerant control [26]. In a word, IRL has proved to be an effective means of solving FTC design of complex nonlinear systems [27].
Based on the above understanding, in this paper, an RL-based policy-iteration (PI) algorithm is utilized for the design of FTC of an HGV’s attitude system. A nonlinear model of the HGV’s attitude system is firstly, represented by a T–S fuzzy model, and then, the actuator fault model is built. A PI-based normal T–S fuzzy controller solving method is proposed without considering the actuator fault model. Based on the normal fuzzy controller, IRL-based fuzzy adaptive FTC is proposed. The control gains of the adaptive fuzzy FTC controller can self-adjust according to the special model of actuator fault. Finally, three simulation results are given to prove the effectiveness of the presented controller under different faults.
To sum up, the paper’s contributions are:
(1) A policy-iteration (PI) algorithm is utilized for the optimal controller design of a T–S fuzzy system.
(2) IRL-based adaptive fuzzy FTC is built for a T–S fuzzy system with actuator faults. With this FTC controller, the controller can be utilized online.
(3) This method is successfully applied to the attitude-tracking control of an HGV’s attitude system.

2. Problem Description

In this section, the nonlinear attitude model of an HGV is discussed, and then the nonlinear mode is represented by a T–S fuzzy system. Based on the attitude tracking control objective and the actuator fault model of the HGV, the control objective of the proposed T–S fuzzy model is discussed.

2.1. Nonlinear Model of an HGV’s Attitude System

The attitude model of an HGV used in this paper is
α ˙ = ω z ω x cos α tan β + ω y sin α tan β sec β Y cos γ v m V D sin γ v m V m g cos θ m V cos γ v + Y sin γ v m V cos θ D cos γ v m V cos θ cos θ sin γ v β ˙ = ω x sin α + ω y cos α Y cos γ v m V D sin γ v m V m g cos θ m V sin γ v + Y sin γ v m V cos θ D cos γ v m V cos θ cos θ cos γ v γ v ˙ = sec β ( ω x cos α ω y sin α ) + Y cos γ v m V D sin γ v m V m g cos θ m V tan β cos γ v + Y sin γ v m V cos θ D cos γ v m V cos θ ( sin θ + tan β cos θ sin γ v ) J x d ω x d t + ( J x J y ) ω z ω y = M x J y d ω y d t + ( J x J z ) ω x ω z = M y J z d ω z d t + ( J y J x ) ω y ω x = M z
where V is the velocity of the HGV, θ is the flight path angle and ψ v is the heading angle, L , D , and Y are the lift force, drag force, and side force, respectively. α is the attack angle, β is the sideslip angle, and γ v is the velocity inclination angle, ω x , ω y , and ω z are angular velocity of three axes, respectively. M x , M y , M z are the corresponding aerodynamic torques, and they are functions of α , β , and γ v , and δ e , δ r , and δ a , where δ e , δ r , and δ a are the elevator angle, yaw angle, and aileron angle, respectively. More details of the utilized mode can be found in ref. [5].
For the convenience of description, the nonlinear mode (1) can be rewritten as
x ˙ ( t ) = F ( x ( t ) , u t )
where
x ( t ) = α , β , γ v , ω x , ω y , ω z T , u t = δ e , δ r , δ a T
and the expression of F ( x ( t ) , u t ) can be found in (1).

2.2. T–S Fuzzy Modeling of an HGV

Since the parameter variation range of an HGV is big, a simple linear model derived by linearization of the equilibrium points is unsuitable for controller design. Considering the application convenience of the control system, T–S fuzzy modeling technology is utilized here.
For the application of T–S fuzzy modeling to an HGV’s attitude system, the premise variable is chosen as α . Then three levels for α are chosen, the specific values of which are listed in Table 1. Then, the HGV attitude system T–S model is built. The details are as follows:
(Rule 1) If  α  is  α S , then
x ˙ ( t ) = A 1 x ( t ) + B 1 u ( t ) ,
(Rule 2) If  α  is  α M , then
x ˙ ( t ) = A 2 x ( t ) + B 2 u ( t ) ,
(Rule 3) If  α  is  α B , then
x ˙ ( t ) = A 3 x ( t ) + B 3 u ( t ) ,
A i and B i ( i = S , M , B ) are system matrices and
A i = F x | x = x i , u = u i , B i = F u | x = x i , u = u i ,
where x i = ( α i , β i , γ v i ) , i = ( S , M , B ) , u i is the equilibrium point associated with x i .
Based on the T–S fuzzy model, the overall T–S fuzzy model of the HGV’s attitude model can be represented by:
x ˙ ( t ) = i = 1 3 μ i ( t ) A i x ( t ) + B i u ( t ) = A ¯ x ( t ) + B ¯ u ( t )
where
A ¯ = i = 1 3 μ i ( t ) A i , B ¯ = i = 1 3 μ i ( t ) B i
and μ i ( t ) is the membership function, μ i ( t ) 0 , i = 1 , 2 , 3 and i = 1 3 μ i ( t ) = 1 .

2.3. Actuator Fault Model

In considering the actuator faults of an HGV, the following actuator fault is utilized:
u F ( t ) = Γ F u ( t ) Λ F ,
where Γ F is the failure matrix, which represents the failure coefficient and is unknown, and Λ F is the loss matrix, which denotes a bias fault, and is also unknown. u ( t ) is the normal control signal, and
Γ F = ρ δ e 0 0 0 ρ δ r 0 0 0 ρ δ a , Λ F = χ δ e χ δ r χ δ a
Γ F and Λ F have different values, which means that different fault cases occur. ρ j ( j = δ e , δ r , δ a ) represents the actual available proportion of the normal actuator, χ j ( j = δ e , δ r , δ a ) represents the actuator float variable caused by friction, piezoelectric effects, and so on. χ j ( j = δ e , δ r , δ a ) is a slowly changed value or is invariant; so it can be regarded as a constant in the controller design.
Expression (3) represents almost all fault models, more specifically:
(1) If Γ F = I , Λ F = 0 , the actuator fault is in free fault mode;
(2) If 0 < Γ F < I ,   Λ F = 0 , the actuator fault is loss of effectiveness;
(3) If Γ F = I ,   Λ F 0 , the actuator fault is a drift fault;
(4) If 0 < Γ F < I ,   Λ F 0 , the actuator fault is a combined loss of effectiveness and drift fault fault.
Remark 1.
The faults described by (3) are a general form of HGV actuator faults. In the previous literature, one or more faults were considered, but there was no unified description of HGV actuator faults. This paper gives a general description as shown in (3) and then presents a controller design method according to this fault mode.

2.4. Control Objective

The problem mainly considered in this paper is the design of an attitude-tracking controller for an HGV, so the main objective of the optimal tracking problem is to seek the control policy u ( t ) , so as to make the system (2) track a desired trajectory r d ( t ) , where r d ( t ) = α d , β d , γ v d , 0 , 0 , 0 T . Then, defining the tracking error as
e ( t ) x ( t ) r d ( t )
by (2) and (4), e ( t ) can be rewritten as:
(Rule i) If  α  is  α i , then
e ˙ ( t ) = A i e ( t ) + A i r ˙ d ( t ) + B i u F ( t )
where r ˙ d ( t ) is the the differential of the reference command r d ( t ) . Then, the whole system is:
e ˙ ( t ) = i = 1 k μ i ( t ) A i e ( t ) + B i u F ( t ) + A i r ˙ d ( t ) = A ¯ e ( t ) + B ¯ u F ( t ) + A ¯ r ˙ d ( t )
where r ˙ d ( t ) is assumed to be bounded. Then, considering the actuator faults proposed in (3), the designed controller for (6) should track a command r d ( t ) under unknown actuator faults u F ( t ) .

3. Main Results

In this section, a PI-based fuzzy adaptive FTC strategy is proposed based on only a little information about the actuator faults.

3.1. PI-Based Normal Controller

For the T–S fuzzy model (2) without actuator faults (3), the comprehensive control performance index for the ith rule is
J i ( e ( t ) , u i ( t ) , r ˙ d ( t ) ) = t u i T ( τ ) R u i ( τ ) + e ( τ ) T Q e ( τ ) γ 2 r ˙ d ( t ) T r ˙ d ( t ) d τ ,
where R R 3 and Q R 6 are given positive definite symmetric matrices, and γ is a prescribed constant that respects the performance of interference suppression for the designed controller. Defining the following value function for the ith rule:
V i ( e ( t ) , u i ( t ) , r ˙ d ( t ) ) = t e ( τ ) T Q e ( τ ) + u i ( τ ) T R i u i ( τ ) γ 2 r ˙ d ( t ) T r ˙ d ( t ) d τ
then, the above value function can be viewed as a zero-sum game between the control policy u i ( t ) and the derivation of the reference command r ˙ d ( t ) [28]. In this zero-sum game, the control policy u i ( t ) wants to seek an optimal controller to minimize the performance index (7), while r ˙ d ( t ) wants to maximize it. According to the definition of a zero-sum game, we make the following definition of the Nash equilibrium.
Definition 1.
Nash equilibrium: The zero-sum game (8) has a unique Nash equilibrium if the following conditions are satisfied:
1. For each fixed control u i ( t ) , there is always a unique r ˙ d ( t ) that can maximize V i ( e ( t ) , u i ( t ) , r ˙ d ( t ) ) ; which means that, there exists a r ˙ d * ( t ) such that
V i ( e ( t ) , u i ( t ) , r ˙ d * ( t ) ) V i ( e ( t ) , u i ( t ) , r ˙ d ( t ) ) ;
2. The optimal game solution u i * ( t ) can minimize V i ( x , u , r ˙ d * ( t ) ) , which means that
V i ( e ( t ) , u i ( t ) , r ˙ d * ( t ) ) V i ( e ( t ) , u i * ( t ) , r ˙ d * ( t ) ) ;
3. u i * ( t ) is the optimal control input, and r ˙ d * ( t ) is the optimal disturbance policy.
Remark 2.
In the above zero-sum game, we assume that the reference command r ˙ d ( t ) is a player and can be changed freely. In practice, the reference command must combine some flight requirements, but this assumption is more conducive to us obtaining the optimal strategy of the game. This assumption can also cope with various changes in actual instructions, so this assumption is necessary.
For the above game, according to the Bellman equation, we choose the Hamiltonian function for the ith rule to be
H i ( e ( t ) , u i ( t ) , r ˙ d ( t ) ) = e ( t ) T Q e ( t ) + u i ( t ) T R i u i ( t ) γ 2 r ˙ d ( t ) T r ˙ d ( t ) + V i e ( t ) e ( t )
For convenience of description, using the fuzzy state feedback controller, we use the following rule
(Rule i) If  α  is  α i , then
u i ( t ) = K i e ( t ) , r ˙ d ( t ) = L i e ( t )
to describe the control policy and derivative policy of the reference command. Then, the game value function for the ith rule can be respected as
V i e ( t ) , u i ( t ) , r ˙ d ( t ) = t e ( τ ) T P i e ( τ ) d τ
where P i is a solution of the following Algebraic Riccati Equation (10):
P i ( A i + B i K i ) + ( A i + B i K i ) T P i + Q + K i T R i K i + γ 2 P i A i A i T P i = 0
According to the Pontryagin minimum principle, the optimal control policy for (9) is
H i ( e ( t ) , u i * ( t ) , r ˙ d ( t ) ) u i * ( t ) = 0 H i ( e ( t ) , u i ( t ) , r ˙ d * ( t ) ) r ˙ d * ( t ) = 0
which implies that
u i * ( t ) = R i 1 B i T P i * ζ ( t ) = K i e ( t )
r ˙ d * ( t ) = γ 2 A i T P i * ζ ( t ) = L i e ( t )
where u i * ( t ) is the optimal control for the ith rule of the T–S model (2),
K i = R i 1 B i T P i *
r ˙ d * ( t ) is the optimal derivation of the reference command, and
L i = γ 2 A i T P i *
With the above optimal control u i * ( t ) and r ˙ d * ( t ) , the optimal game value function for the ith rule is
V i * e ( t ) , u i * ( t ) , r ˙ d * ( t ) = t e ( τ ) T P i * e ( τ ) d τ
where P i * is the optimal value of P i . Then, for the T–S model of the HGV, the total controller is constructed as:
u ( t ) = j = 1 3 μ j ( t ) K j e ( t )
Theorem 1.
For system (2) without actuator faults, if there exist positive-definite matrices P i > 0 , and for the ith rule of the T–S fuzzy system, under controller (11) and (12), satisfying
P i ( A i + B i K i ) + ( A i + B i K i ) T P i + Q + K i T R i K i + γ 2 P i A i A i T P i = 0
then, the T–S fuzzy system (2) is stable.
Proof. 
The overall performance of (6) is
J T = t e ( τ ) T P e ( τ ) d τ = t u ( τ ) T R u ( τ ) + e ( τ ) T Q e ( τ ) d τ
where P is a positive definite symmetric matrix for the overall system. Then,
J ˙ T = u ( t ) T R u ( t ) e ( t ) T Q e ( t ) = e T ( t ) i = 1 3 μ i ( t ) j = 1 3 μ j ( t ) P ( A i + B i K j ) + ( A i + B i K j ) T P e ( t ) + γ 2 r ˙ d ( t ) T r ˙ d ( t ) = 2 e T ( t ) i = 1 3 μ i ( t ) j = 1 3 μ j ( t ) P A i P B i R 1 B j T P j e ( t ) + γ 2 r ˙ d ( t ) T r ˙ d ( t )
Since P and P j are all positive definite symmetric matrices, considering the matrices B i for controller design, there exist positive constants ϱ M ϱ m > 0 ,
ϱ m e T ( t ) P j B j R 1 B j T P j e ( t ) e T ( t ) P B i R 1 B j T P j e ( t ) ϱ M e T ( t ) P j B j R 1 B j T P j e ( t )
and positive definite symmetric matrix M j , and positive constant η > 0 , such that
P = η P j M j .
Then,
J ˙ T = 2 e T ( t ) j = 1 3 μ j ( t ) η P j A ¯ j η P j B i R 1 B j T P j e ( t ) + γ 2 r ˙ d ( t ) T r ˙ d ( t ) = 2 η e T ( t ) j = 1 3 μ j ( t ) P j A ¯ j P j B i R 1 B j T P j e ( t ) + γ 2 r ˙ d ( t ) T r ˙ d ( t ) 2 η e T ( t ) j = 1 3 μ j ( t ) P j A ¯ j ϱ M P j B j R 1 B j T P j e ( t ) + γ 2 r ˙ d ( t ) T r ˙ d ( t ) η 0 t e T ( t ) i = 1 3 μ j ( t ) Q + 2 ϱ M 1 K j T R K j e ( τ ) d τ + γ 2 r ˙ d ( t ) T r ˙ d ( t )
In this case, J ˙ T < 0 if ϱ M > 1 2 and e ( t ) > γ 2 r ˙ d ( t ) T r ˙ d ( t ) / λ min i = 1 3 μ j ( t ) ( Q + 2 ϱ M 1 K j T R K j ) . The proof process is completed.    □
Based on Theorem 1, the online policy-iteration algorithm is as Algorithm 1:   
Algorithm 1: Model-based PI algorithm
1. Initialization: Select i = 0 , choose any reasonable policy u i ( t ) = K i 0 e ( t ) ;
2. Policy evaluation: Solve the following equations for P i ( i + 1 )
P i ( i + 1 ) ( A i + B i K i i ) + ( A i + B i K i i ) T P i ( i + 1 ) + Q + K i T i R i K i i + γ 2 P i A i A i T P i = 0
3. Policy improvement:
K i i + 1 = R i B ¯ i T P i ( i + 1 )
L i ( i + 1 ) = γ 2 A i T P i ( i + 1 )
4. If the convergence condition is satisfied, stop; else, go to step 2.
In Algorithm 1, the superscript i represents the ith iteration.
Remark 3.
In the proof of the above Theorem, overall performance J T in (14) is not equal to the simple sum of J i proposed in (7). This proof process for Theorem 1 has taken into account the PDC of fuzzy systems.

3.2. PI-Based Fuzzy FTC Control

Just as described in (3), Γ F and Λ F are unknown, but they are all bounded. Then, adaptive fuzzy FTC is constructed: for the T–S fuzzy model (6) with actuator faults (3), the fuzzy state feedback FTC controller is:
(Rule i) If  α  is  α i , then
u i ( t ) = K i e ( t ) + Γ ^ F i 1 Λ ^ F i ,
where Λ ^ F i is the estimation of Λ F at Rule i, and Γ ^ F i is the estimation of Γ F at Rule i.
Theorem 2.
For system (2) with actuator fault (3), if there exist positive-definite matrices P i > 0 , and for the ith rule of the T–S fuzzy system and a FTC controller K i satisfying
P i ( A i + B i K i ) + ( A i + B i K i ) T P i + Q + K i T R i K i + γ 2 P i A i A i T P i = 0
where
K i = Γ ^ F i 1 R i B i T P i , u i ( t ) = K i e ( t ) + Γ ^ F i 1 Λ ^ F i ,
and the updating law Γ ^ F i and Λ ^ F i are
Λ ^ ˙ F i = 2 q 1 B i T P i e ( t )
Γ ^ ˙ F i = d i a g ( 2 q 2 e ( t ) T P i B i u i ( t ) )
where q 1 and q 2 are given constants, then system (2) is semi-globally uniformly ultimately bounded.
Proof. 
Choosing a Lyapunov function for tracking system (6) as
V F ( t ) = V 1 ( t ) + V 2 ( t ) + V 3 ( t ) ,
where
V 1 ( t ) = 0 t e ( τ ) T P e ( τ ) d τ , V 2 ( t ) = 1 2 q 1 t r ( Γ ˜ F i T Γ ˜ F i ) , V 3 ( t ) = 1 2 q 2 Λ ˜ F i Λ ˜ F i ,
and
Γ ˜ F i = Γ F i Γ ^ F i , Λ ˜ F i = Λ F i Λ ^ F i
Then, considering the differential of the proposed Lyapunov function
V ˙ 1 ( t ) = 2 e T ( t ) i = 1 3 μ i ( t ) j = 1 3 μ j ( t ) P A ¯ i e ( t ) + P B ¯ i u j F e ( t ) + γ 2 r ˙ d ( t ) T r ˙ d ( t ) = 2 e T ( t ) i = 1 3 μ i ( t ) j = 1 3 μ j ( t ) P A ¯ i e ( t ) + P B ¯ i Γ F j u j ( t ) Λ F j + γ 2 r ˙ d ( t ) T r ˙ d ( t ) = 2 e T ( t ) i = 1 3 μ i ( t ) j = 1 3 μ j ( t ) P A ¯ i e ( t ) + P B ¯ i Γ F i Γ ^ F i u j ( t ) + Γ ^ F i u j ( t ) Λ F j + γ 2 r ˙ d ( t ) T r ˙ d ( t ) = 2 e T ( t ) i = 1 3 μ i ( t ) j = 1 3 μ j ( t ) P A ¯ i P B ¯ i R 1 B j T P j e ( t ) + 2 e T ( t ) i = 1 3 μ i ( t ) j = 1 3 μ j ( t ) P B ¯ i Γ ˜ F i u j ( t ) Λ ˜ F i + γ 2 r ˙ d ( t ) T r ˙ d ( t )
and considering the updating law of Γ ^ F i and Λ ^ F i ,
V ˙ 2 ( t ) = t r ( Γ ˜ F i T Γ ^ F i ) .
V ˙ 3 ( t ) = Λ ˜ F i Λ ^ F i .
and
e T ( t ) i = 1 3 μ i ( t ) j = 1 3 μ j ( t ) P B ¯ i Γ ˜ F i u j ( t ) Λ ˜ F i = e T ( t ) P i = 1 3 μ i ( t ) j = 1 3 μ j ( t ) B ¯ i Γ ˜ F i u j ( t ) Λ ˜ F i
then,
V ˙ 1 ( t ) + V ˙ 2 ( t ) + V ˙ 3 ( t ) η 0 t e T ( t ) i = 1 3 μ j ( t ) Q + 2 ϱ M 1 K j T R K j e ( τ ) d τ + γ 2 r ˙ d ( t ) T r ˙ d ( t ) + 2 ϱ M e T ( t ) P j B j Γ ˜ F i u j ( t ) t r ( Γ ˜ F i T Γ ^ F i ) 2 ϱ m e T ( t ) P j B j Λ ˜ F i Λ ˜ F i Λ ^ F i η 0 t e T ( t ) i = 1 3 μ j ( t ) Q + 1 2 ( 1 η ) K j T R K j e ( τ ) d τ + γ 2 r ˙ d ( t ) T r ˙ d ( t )
Based on the analysis of Theorem 1, the whole system is semi-globally uniformly ultimately bounded.    □

3.3. IRL-Based Fuzzy FTC Control

For solving Theorem 2, all information on A i and B i are needed, and the ARE is difficult to solve. To reduce the dependence on the system model in the solution process, and give a more easily solved method, a new data-based PI fuzzy FTC controller is developed from Theorem 2 and Algorithm 1, in which the system matrices A i can be unknown and only B i are utilized for the proposed FTC [25].
The derivative of the ith value function is
V ˙ i ( e ( t ) , u i ( t ) , r ˙ d ( t ) ) = e T ( t ) P i e ˙ ( t ) = e ( t ) T Q e ( t ) u i ( t ) T R i u i ( t ) + γ 2 r ˙ d ( t ) T r ˙ d ( t )
Integrating both sides of (21) from t to t + T , results in
e ( t + T ) T P i ( i + 1 ) e ( t + T ) e ( t ) T P i ( i + 1 ) e ( t ) = t t + T e ( τ ) T Q + K i i T R i K i i e ( τ ) γ 2 r ˙ d ( t ) T r ˙ d ( t ) d τ .
Since the value of e ( t + T ) ,   e ( t ) and the integral of the right side can all be computed, the positive defined matrices P i can be solved by the above equation.
Then, the following theorem is proposed (Algorithm 2):    
Algorithm 2: IRL-based fuzzy FTC control algorithm
1. Initialization: Select i = 0 , choose any reasonable policy u i ( t ) = K i 0 e ( t ) , P ( i ) > 0 ;
2. Policy evaluation: Solve the following equations for P i ( i + 1 )
e ( t + T ) T P i ( i + 1 ) e ( t + T ) e ( t ) T P i ( i + 1 ) e ( t ) = 0 t e ( τ ) T Q + K i i T R i K i i e ( τ ) γ 2 r ˙ d ( t ) T r ˙ d ( t ) d τ .
3. Policy improvement:
K i i + 1 = Γ ^ F i 1 R i B i T P i ( i + 1 ) .
and
Λ ^ F i = 2 q 1 B i T P i e ( t ) Γ ^ F i = d i a g ( 2 q 2 e ( t ) T P i B i u i ( t ) )
4. If the convergence condition is satisfied, stop; else, go to step 2.
A flowchart of the proposed AFFTC is given in Figure 1. From Figure 1, one can see that, the desired trajectory r d ( t ) is firstly added to system (1), and then, the tracking error e ( t ) is generated. After that, the adaptive parameter estimator updates the parameters according to (19) and (20), and the controller updates the controller according to (18). Finally, the overall updating of the fuzzy controller is completed.
Remark 4.
The proposed FTC method is an IRL-based adaptive fuzzy FTC controller. Although adaptive control methods can adjust the parameters of a controller, the efficiency of self-adaptation adjustment is too low for time-varying faults or sudden faults. Reinforcement learning can quickly obtain new control parameters according to the changes in HGV states. Combining reinforcement learning with self-adaptation, the proposed method can quickly adjust the parameters of the controller when the HGV has sudden or time-varying faults.

4. Simulation Results

According to the T–S approximation method proposed in Section 2, the fuzzy model of the HGV’s attitude system can be constructed as follows:
(Rule 1) If  α  is  α S , then
x ˙ ( t ) = A 1 x ( t ) + B 1 u ( t ) ,
(Rule 2) If  α  is  α M , then
x ˙ ( t ) = A 2 x ( t ) + B 2 u ( t ) ,
(Rule 3) If  α  is  α B , then
x ˙ ( t ) = A 3 x ( t ) + B 3 u ( t ) ,
Then, based on the adaptive fuzzy FTC strategy given in Section 3.2, the FTC controller can be driven from different faults. The parameters and initial states of the proposed simulation are similar to ref. [5]. For testing the effectiveness of the proposed controller, three simulations under three different fault modes are given as follows:
Case I: loss of effectiveness. In this simulation, the actuator faults are
ρ F = 0.6 0 0 0 0.6 0 0 0 0.6 , β F = 0 0 0 .
According to the proposed method, the loss of effectiveness fault is carried on the nonlinear attitude model of the HGV. The fault is set to occur 40 s after the simulation starts. For comparison, the T–S controller proposed in ref. [29], marked as u T S , is applied to the attitude control of the HGV, and the proposed adaptive fuzzy control is marked as u F T C . The tracking simulation results are presented in Figure 2 and Figure 3. Specifically, the tracking results of the given command are presented in Figure 2, and the input is presented in Figure 3. From the simulation figures, we can see that, when a fault occurs, the tracking error of u F T C is smaller than that of u T S , and the input of u F T C is also smoother than that of u T S . So, the proposed adaptive fuzzy FTC can adjust itself and obtain a better performance.
Case II: Drift fault. In this simulation, the following faults are considered:
ρ F = 1 0 0 0 1 0 0 0 1 , β F = 0.1 0.1 0.1 .
The fault is set to occur 40 s after the simulation starts. The tracking simulation results are presented in Figure 4 and Figure 5. Specifically, the tracking results of the given command are presented in Figure 4, and the input is presented in Figure 5. From the simulation figures, we can see that, under drift fault, the proposed adaptive fuzzy FTC can also guarantee the tracking performance of the HGV, and the smoothness of the input under u F T C is better than that under u T S .
Case III: Combined fault. In this simulation, the following faults are considered:
ρ F = 0.6 0 0 0 0.6 0 0 0 0.6 , β F = 0.1 0.1 0.1 .
The faults are set to occur 40 s after the simulation starts. The tracking simulation results are presented in Figure 6 and Figure 7. Specifically, the tracking results of the given command are presented in Figure 6, and the input to the HGV is presented in Figure 7. Under the combined fault, the input of u F T C and u T S are all non-smooth, but the oscillation amplitude of u F T C is much smaller than that of u T S .

5. Conclusions

FTC of an HGV’s attitude control system is discussed in this paper. Actuator faults are considered and the mode of the faults is the general model of the actuator. Based on the T–S fuzzy approach, a nonlinear attitude model is firstly modeled using a T–S fuzzy model, and then, a normal T–S controller without considering the actuator fault is designed utilizing RL technology. Then, based on the normal fuzzy controller, an improved adaptive FTC is designed in which the FTC can be adjusted online according to the failure mode. Finally, simulation results on different kinds of actuator faults are given to show the good performance of the proposed FTC.
The simulation results given in this paper are based on a mathematical model. Future work will consist of engineering the proposed algorithm and verification of the proposed method by combining the algorithm with an actual HGV.

Author Contributions

Conceptualization, C.H. and M.L.; methodology, X.H.; software, M.L.; validation, C.H., H.L. and M.L.; investigation, X.H.; resources, C.H.; data curation, H.P. and H.L.; writing—original draft preparation, M.L.; writing—review and editing, H.P. and C.H.; visualization, X.H.; supervision, X.H.; project administration, H.L.; funding acquisition, C.H. and X.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by National Natural Science Foundation of China under Grant 62073265.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Shao, X.; Wang, H. Active disturbance rejection based trajectory linearization control for hypersonic reentry vehicle with bounded uncertainties. ISA Trans. 2015, 54, 27–38. [Google Scholar] [CrossRef] [PubMed]
  2. Zhao, S.; Wang, J.; Xu, H.; Wang, B. Composite observer-based optimal attitude-tracking control with reinforcement learning for hypersonic vehicles. IEEE Trans. Cybern. 2022, 53, 913–926. [Google Scholar] [CrossRef]
  3. Wang, Y.; Chen, M.; Wu, Q.; Zhang, J. Fuzzy adaptive non-affine attitude tracking control for a generic hypersonic flight vehicle. Aerosp. Sci. Technol. 2018, 80, 56–66. [Google Scholar] [CrossRef]
  4. Huang, Y.; Sun, C.; Qian, C.; Wang, L. Linear parameter varying switching attitude control for a near space hypersonic vehicle with parametric uncertainties. Int. J. Syst. Sci. 2015, 46, 3019–3031. [Google Scholar] [CrossRef]
  5. Yu, X.; Li, P.; Zhang, Y. Fixed-time actuator fault accommodation applied to hypersonic gliding vehicles. IEEE Trans. Autom. Sci. Eng. 2020, 18, 1429–1440. [Google Scholar] [CrossRef]
  6. Chao, D.; Qi, R.; Jiang, B. Adaptive fault-tolerant attitude control for hypersonic reentry vehicle subject to complex uncertainties. J. Frankl. Inst. 2022, 359, 5458–5487. [Google Scholar] [CrossRef]
  7. Lv, J.; Wang, C.; Kao, Y. Adaptive fixed-time quantized fault-tolerant attitude control for hypersonic reentry vehicle. Neurocomputing 2023, 520, 386–399. [Google Scholar] [CrossRef]
  8. Wang, L.; Meng, Y.; Hu, S.; Peng, Z.; Shi, W. Adaptive fault-tolerant attitude tracking control for hypersonic vehicle with unknown inertial matrix and states constraints. IET Control Theory Appl. 2023, 17, 1397–1412. [Google Scholar] [CrossRef]
  9. Hu, Q.; Wang, C.; Li, Y.; Huang, J. Adaptive control for hypersonic vehicles with time-varying faults. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 1442–1455. [Google Scholar] [CrossRef]
  10. Li, P.; Yu, X.; Zhang, Y.; Peng, X. Adaptive multivariable integral TSMC of a hypersonic gliding vehicle with actuator faults and model uncertainties. IEEE ASME Trans. Mechatron. 2007, 22, 2723–2735. [Google Scholar] [CrossRef]
  11. Li, A.; Liu, S.; Hu, X. Fault-tolerant Attitude Control for Hypersonic Flight Vehicle Subject to Actuators Constrain: A Model Predictive Static Programming Approach. IEEE J. Miniaturization Air Space Syst. 2023, 4, 136–145. [Google Scholar] [CrossRef]
  12. Wang, L.; Liu, F.; Yu, J.; Li, P.; Zhang, R.; Gao, F. Iterative learning fault-tolerant control for injection molding processes against actuator faults. J. Process Control 2017, 59, 59–72. [Google Scholar] [CrossRef]
  13. Poongodi, T.; Saravanakumar, T.; Mishra, P.P.; Zhu, Q. Extended Dissipative Control for Markovian Jump Time-Delayed Systems with Bounded Disturbances. Math. Probl. Eng. 2020, 2020, 5685324. [Google Scholar] [CrossRef]
  14. Wang, L.; Li, H.; Li, H.; Zhang, R.; Gao, F. Constrained model predictive fault-tolerant control for nonlinear batch processes with time delay by integrating a LRF method and a switching strategy. Chem. Eng. Sci. 2024, 287, 119762. [Google Scholar] [CrossRef]
  15. Qiu, J.; Ji, W.; Chadli, M. A novel fuzzy output feedback dynamic sliding mode controller design for two-dimensional nonlinear systems. IEEE Trans. Fuzzy Syst. 2020, 29, 2869–2877. [Google Scholar] [CrossRef]
  16. Zeng, Y.; Lam, H.K.; Xiao, B.; Wu, L. Tracking Control for Nonlinear Systems with Actuator Saturation via Interval Type-2 TS Fuzzy Framework. IEEE Trans. Cybern. 2022, 53, 7085–7094. [Google Scholar] [CrossRef]
  17. Ren, B.; Karimi, H.R.; Yin, T.; Fu, S. Asynchronous H filtering for semi-Markov jump TS fuzzy systems within partial state delay and deception attack: Applied to aircraft-pilot state estimation. J. Frankl. Inst. 2022, 360, 9265–9289. [Google Scholar] [CrossRef]
  18. Wang, J.; Hu, L.; Chen, F.; Wen, C. Multiple-step fault estimation for interval type-II TS fuzzy system of hypersonic vehicle with time-varying elevator faults. Int. J. Adv. Robot. Syst. 2017, 14, 1729881417699149. [Google Scholar] [CrossRef]
  19. Li, Y.; Gao, W.; Huang, S.; Wang, R.; Yan, W.; Gevorgian, V.; Gao, D.W. Data-driven optimal control strategy for virtual synchronous generator via deep reinforcement learning approach. J. Mod. Power Syst. Clean Energy 2021, 9, 919–929. [Google Scholar] [CrossRef]
  20. Wei, Q.; Li, H.; Yang, X.; He, H. Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems. IEEE Trans. Cybern. 2021, 51, 2372–2383. [Google Scholar] [CrossRef]
  21. Luo, B.; Liu, D.; Huang, T.; Liu, J. Output Tracking Control Based on Adaptive Dynamic Programming with Multistep Policy Evaluation. IEEE Trans. Syst. Man Cybern. Syst. 2019, 49, 2155–2165. [Google Scholar] [CrossRef]
  22. Li, H.; Wu, Y.; Chen, M. Adaptive Fault-Tolerant Tracking Control for Discrete-Time Multiagent Systems via Reinforcement Learning Algorithm. IEEE Trans. Cybern. 2021, 51, 1163–1174. [Google Scholar] [CrossRef]
  23. Zhang, L.; Zhang, R.; Wu, T.; Weng, R.; Han, M.; Zhao, Y. Safe Reinforcement Learning With Stability Guarantee for Motion Planning of Autonomous Vehicles. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 5435–5444. [Google Scholar] [CrossRef]
  24. Zhong, W.; Wang, M.; Wei, Q.; Lu, J. A new neuro-optimal nonlinear tracking control method via integral reinforcement learning with applications to nuclear systems. Neurocomputing 2022, 483, 361–369. [Google Scholar] [CrossRef]
  25. Wang, G.; Luo, B.; Xue, S. Integral reinforcement learning-based optimal output feedback control for linear continuous-time systems with input delay. Neurocomputing 2021, 460, 31–38. [Google Scholar] [CrossRef]
  26. Chen, Q.; Jin, Y.; Song, Y. Fault-tolerant adaptive tracking control of Euler-Lagrange systems—An echo state network approach driven by reinforcement learning. Neurocomputing 2022, 484, 109–116. [Google Scholar] [CrossRef]
  27. Wang, L.; Li, X.; Zhang, R.; Gao, F. Reinforcement Learning-Based Optimal Fault-Tolerant Tracking Control of Industrial Processes. Ind. Eng. Chem. Res. 2023, 62, 16014–16024. [Google Scholar] [CrossRef]
  28. Wang, L.; Jia, L.; Zhang, R.; Gao, F. H∞ output feedback fault-tolerant control of industrial processes based on zero-sum games and off-policy Q-learning. Comput. Chem. Eng. 2023, 179, 108421. [Google Scholar] [CrossRef]
  29. Hu, X.; Wu, L.; Hu, C.; Gao, H. Fuzzy guaranteed cost tracking control for a flexible air-breathing hypersonic vehicle. IET Control Theory Appl. 2012, 6, 1238–1249. [Google Scholar] [CrossRef]
Figure 1. Flowchart of the proposed method.
Figure 1. Flowchart of the proposed method.
Actuators 13 00259 g001
Figure 2. Tracking performance: Case I.
Figure 2. Tracking performance: Case I.
Actuators 13 00259 g002
Figure 3. Input of HGV: Case I.
Figure 3. Input of HGV: Case I.
Actuators 13 00259 g003
Figure 4. Tracking performance: Case II.
Figure 4. Tracking performance: Case II.
Actuators 13 00259 g004
Figure 5. Input of HGV: Case II.
Figure 5. Input of HGV: Case II.
Actuators 13 00259 g005
Figure 6. Tracking performance: Case III.
Figure 6. Tracking performance: Case III.
Actuators 13 00259 g006
Figure 7. Input of HGV: Case III.
Figure 7. Input of HGV: Case III.
Actuators 13 00259 g007
Table 1. Fuzzy Rules.
Table 1. Fuzzy Rules.
Rule NO.Premise Variables α
1S
2M
3B
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, M.; Hu, C.; Pei, H.; Li, H.; Hu, X. Adaptive Fuzzy Fault-Tolerant Attitude Control for a Hypersonic Gliding Vehicle: A Policy-Iteration Approach. Actuators 2024, 13, 259. https://doi.org/10.3390/act13070259

AMA Style

Liu M, Hu C, Pei H, Li H, Hu X. Adaptive Fuzzy Fault-Tolerant Attitude Control for a Hypersonic Gliding Vehicle: A Policy-Iteration Approach. Actuators. 2024; 13(7):259. https://doi.org/10.3390/act13070259

Chicago/Turabian Style

Liu, Meijie, Changhua Hu, Hong Pei, Hongzeng Li, and Xiaoxiang Hu. 2024. "Adaptive Fuzzy Fault-Tolerant Attitude Control for a Hypersonic Gliding Vehicle: A Policy-Iteration Approach" Actuators 13, no. 7: 259. https://doi.org/10.3390/act13070259

APA Style

Liu, M., Hu, C., Pei, H., Li, H., & Hu, X. (2024). Adaptive Fuzzy Fault-Tolerant Attitude Control for a Hypersonic Gliding Vehicle: A Policy-Iteration Approach. Actuators, 13(7), 259. https://doi.org/10.3390/act13070259

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop