1. Introduction
In safety-critical fields such as spacecraft engineering, fault-tolerant control (FTC) has long attracted the attention of many scholars and engineers [1,2]. In [
1], a novel discontinuous adaptive failure compensation controller with system chattering rejection based on a common approximate function was designed for a rigid satellite with actuator failure. In [
2], an adaptive dynamic programming (ADP)-based tracking FTC with adaptive fault compensation was given for a spacecraft attitude system with additive faults. In the FTC community, sliding-mode control (SMC) is a common and important method that has been widely applied in industrial and academic fields owing to its simple design and strong disturbance rejection [
3]. The equivalent control of SMC inherently copes with matched uncertainties, but handling mismatched uncertainties, including faults or failures, remains an open problem [
4,
5]. In [
4], an SMC was designed with a new sliding surface (SS) that incorporates information on the mismatched uncertainties obtained through a disturbance observer. In [
5], an invariant manifold-based SS was applied to transform multiple mismatched disturbances into matched ones. Finding more and better ways to solve this open problem remains of great interest.
ADP approaches have developed into a relatively complete paradigm for optimal control [
6]. An online adaptive algorithm was proposed in [
7], which involves the simultaneous updating of both the critic and actor neural networks (NNs). The classical idea [
8] of transforming a robust control problem into an optimal control problem was used in robust ADP (RADP) methods [
9] for adaptive control problems subject to matched or mismatched uncertainties. Specifically, in [
10], ADP was introduced into integral sliding-mode control (ISMC) for systems with mismatched disturbances, but the SS design was not considered. Hence, combining SMC with ADP to cope with mismatched uncertainties attracts our attention.
Using another control method to design the SS is one way to combine control schemes, and it is an effective way to deal with nonlinear FTC problems. In [
11], a classical control-based SS design combined two control algorithms: when the system states are in the sliding mode, the classical control takes effect. The method in [
12] also adopted this idea and used a pseudo control input in the SS design process. In [
13], a novel fast terminal sliding mode control without the reaching phase was designed, which makes the derivative of an augmented state satisfy an optimal cost function. In [
14], an optimal SS similar to that in [
15] was given for a linear system. The robustness of the SS to mismatched uncertainties was not considered in these two papers [
13,
14]. In references [
16,
17,
18,
19], optimal SSs were developed based on the first subsystem of the normal form [
19]. In [
16], an optimal sliding manifold was designed based on the kinematics without uncertainties, and the angular velocity was treated as the control input in this design process. In [
17], an optimal design of the reaching phase and the sliding phase of the SMC was given. In [
18], the successive approximation approach was used to solve a novel optimal sliding mode that was designed by the same method. In [
19], after an optimal equivalent control was first solved, a novel SS was derived from the above equivalent control. Recently, the ADP algorithm has also been introduced into the integral SMC design [
10,
20]. In [
10], ADP was used to solve the optimal guaranteed cost control problem of sliding mode dynamics. In [
20], ADP was employed to design the nominal control, which aimed at stabilizing sliding mode dynamics. However, in these two papers [
10,
20], ADP was adopted to stabilize sliding mode dynamics rather than to design SS directly.
In this paper, we explore a unified design of sliding-mode FTC with an RADP-based SS for mismatched uncertain nonlinear cascade systems with actuator faults. To this end, the RADP-based SS is developed to obtain a sliding mode that is robust to mismatched uncertainties, and an adaptive sliding-mode FTC is designed to guarantee the reachability of this sliding mode in the presence of faults. The main contributions are summarized as follows.
An RADP-based SS that is robust to the mismatched uncertainties is designed, and an adaptive fault compensation is used to deal with the actuator faults. Compared with the traditional sliding-mode FTC [
21], the proposed scheme achieves optimized performance in a certain sense by introducing the RADP-based SS. To the best of the authors' knowledge, an ADP-based SS has not appeared in previous reports, and, more importantly, neither has an SMC that tackles mismatched uncertainties based on such an SS.
The ADP-based SS design method may open the door to many kinds of sliding modes with properties inherited from ADP methods, such as a finite-horizon optimal SS [
22] or safety SS [
23]. These provide more methods or possibilities for solving FTC problems.
The rest of this article is organized as follows. Section 2 gives the problem formulation. In Section 3, an adaptive sliding-mode FTC with the RADP-based SS for uncertain cascade systems is developed. Simulation examples are provided in Section 4, followed by the conclusions in Section 5.
3. Sliding Surface and Control Design
Concretely, for the system (
1) and (
2), an RADP-based SS is designed as
where
s is the sliding-mode variable for the mismatched uncertain cascade system, the matrix
R is a penalty matrix in the utility function, the
is an activation function vector of critic neural network (CNN) in ADP, the
is gradient with respect to
denoted as
, and
is an estimate of the ideal weight vector in ADP. The following assumption is standard in the ADP-related literature, such as [
2,
10,
31].
Assumption 3. The ideal NN weights are bounded, i.e., . The NN activation functions and their gradients are bounded, i.e., , and . The NN approximation error and its gradient are bounded on the compact set Ω, i.e., , and .
3.1. Optimal Control-Based Sliding Surface Design
In what follows, the
is regarded as the control input of subsystem (
3) and is assumed to be admissible. For the continuously differentiable cost function
, the nonlinear Lyapunov equation is
where
. The Hamiltonian function of this optimal control problem is
Define the optimal cost function (OCF) as
where
is the admissible control set on the
, and the Hamilton-Jacobi-Bellman (HJB) equation of this system is given as
Therefore, by differentiating (
10) with respect to
, the optimal control policy is
The control policy (
11) is unbounded; if input constraints are to be studied, an alternative approach is to use a generalized non-quadratic utility function [
28]. Substituting the optimal control (
11) into Equation (
10), the HJB equation can be expressed as
which means that
.
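The chain from the Hamiltonian to the optimal policy and the reduced HJB equation follows the standard pattern for an input-affine system with a quadratic control penalty. The sketch below uses assumed notation that may differ from the paper's symbols: nominal subsystem $\dot{x}_1 = f(x_1) + g(x_1)v$ with virtual input $v$, utility $Q(x_1) + v^{\top} R v$ with $R = R^{\top} > 0$, and optimal cost $V^{*}$. The state-dependent term $Q(x_1)$ is a placeholder for the paper's own utility term.

```latex
\begin{align}
  % Hamiltonian for the assumed utility Q(x_1) + v^T R v
  H(x_1, v, \nabla V) &= Q(x_1) + v^{\top} R v
      + \nabla V^{\top}\bigl(f(x_1) + g(x_1)\,v\bigr), \\
  % HJB equation
  0 &= \min_{v} H(x_1, v, \nabla V^{*}), \\
  % Stationarity: dH/dv = 2 R v + g^T \nabla V^* = 0
  v^{*} &= -\tfrac{1}{2} R^{-1} g^{\top}(x_1)\,\nabla V^{*}, \\
  % Substituting v^* back into the HJB equation
  0 &= Q(x_1) + \nabla V^{*\top} f(x_1)
      - \tfrac{1}{4}\,\nabla V^{*\top} g(x_1) R^{-1} g^{\top}(x_1)\,\nabla V^{*}.
\end{align}
```

The last line combines $v^{*\top} R v^{*} = \tfrac{1}{4}\nabla V^{*\top} g R^{-1} g^{\top} \nabla V^{*}$ with $\nabla V^{*\top} g v^{*} = -\tfrac{1}{2}\nabla V^{*\top} g R^{-1} g^{\top} \nabla V^{*}$.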
The design of
is a key point in the robust stabilization scheme. In what follows, the term
is specified as
and
meets. With the help of this form, the following lemma on the stability of the sliding-mode dynamics of the system (
1) and (
2) is derived.
Lemma 1. Consider the nominal subsystem (3) and the cost function (5) with the term (13). Assume that the HJB Equation (12) has a solution , that is given by (11), and that the SS designed as exists. Then, on this SS, the sliding-mode dynamics of the mismatched uncertain nonlinear system described by (1) and (2) are asymptotically stable.
Proof of Lemma 1. According to Theorem 1 of [
30], we know that the optimal control policy (
11) can ensure asymptotic stability of the uncertain nonlinear subsystem (
1). On the SS, we have
and hence the state
also converges asymptotically to the equilibrium point. □
3.2. RADP-Based Sliding Surface Design
In what follows, RADP with NN approximation is used to find this SS by numerically solving the above HJB equation, whose analytical solution is difficult to obtain.
Based on the universal approximation property of NNs, we approximate
by an NN with a single hidden layer as
where
is the ideal weight,
is the activation function,
p is the number of neurons, and
is the unknown approximation error of this NN. Next, the partial derivative of the OCF is written as
However, since the ideal NN weights are unknown, the estimated NN weight vector
is used to approximate the OCF.
The partial derivative of the OCF by approximation is
From (
15), the optimal control can be given as
From (
17), the approximate optimal control policy can be expressed as
For the following derivation, two non-negative matrices are denoted as follows
By using the cost function and control policy denoted by the NN, the Hamiltonian function could be given by
where the term
is the residual error of the NN expression.
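The approximate policy obtained from the critic weights can be illustrated numerically. The following is a minimal sketch under stated assumptions, not the paper's implementation: it assumes the usual form -0.5 * R^{-1} * g^T * (grad sigma)^T * w_hat, a two-dimensional state, a scalar control input, and a hypothetical quadratic activation vector `[x1^2, x1*x2, x2^2]`. The names `sigma_grad` and `approx_control` are illustrative only.

```python
def sigma_grad(x):
    """Jacobian of the assumed activation vector sigma(x) = [x1^2, x1*x2, x2^2]."""
    x1, x2 = x
    return [
        [2.0 * x1, 0.0],   # d(x1^2)/dx
        [x2, x1],          # d(x1*x2)/dx
        [0.0, 2.0 * x2],   # d(x2^2)/dx
    ]


def approx_control(x, w_hat, g, r_inv):
    """Approximate policy -0.5 * R^{-1} * g^T * (grad sigma)^T * w_hat.

    Assumes a scalar control input, so g is a length-2 list and r_inv a float.
    """
    jac = sigma_grad(x)
    # Gradient of the approximate cost w_hat^T sigma(x) with respect to x.
    dv_dx = [sum(jac[i][j] * w_hat[i] for i in range(3)) for j in range(2)]
    return -0.5 * r_inv * sum(g[j] * dv_dx[j] for j in range(2))


# Sanity check against a quadratic cost V(x) = x1^2 + x2^2 (weights [1, 0, 1]).
v_hat = approx_control([1.0, 2.0], [1.0, 0.0, 1.0], [0.0, 1.0], 1.0)
```

For the quadratic cost used in the sanity check, the result coincides with the linear-quadratic feedback -R^{-1} g^T P x with P equal to the identity, which is a quick way to validate the gradient bookkeeping.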
The approximate Hamiltonian by the estimated weight is described as
letting
, and
. Based on (
20) and (
22), the
with
is formulated by
Now, the next step is to train the CNN and obtain an update law of weights by minimizing the simple cost criterion
. To avoid the difficulty of finding an admissible initial control policy, the improved weight updating rule of [
30] is adopted. The following general assumption [
2,
32] is given before the training process and the stability analysis.
Assumption 4. Consider the nominal subsystem (3) with the cost function (5), and substitute the optimal feedback control (11) into the subsystem (3). Let be a continuously differentiable Lyapunov function candidate. It can be written as The is a positive definite matrix ensuring that is true, where is the minimal eigenvalue of the matrix Γ.
For implementing this algorithm,
can be given by some polynomials of the system states, such as
. Then, a weight update law of the CNN is constructed
where
is the basic update rate of the critic network, and
is the learning rate of the additional stabilization term. Substituting the approximate optimal control (
19) into (
26), the update law is expressed as
According to the above formula,
and dynamics of the weight estimation error (WEE) could be expressed as
where
Thus, dynamics of WEE could be expressed as
According to this design, the following Lemma 2 shows the property of the system on the novel robust SS.
Lemma 2. Assume that the sliding mode exists, and consider the nominal system described by (3) and (4) under the proposed RADP-based SS (6). Then the states of system (1) and (2) on this SS and the weight error dynamics are both uniformly ultimately bounded (UUB).
Proof of Lemma 2. Based on Theorem 2 of [
30], we know that the state
on this SS and the dynamics of the WEE are both UUB. For brevity, the proof is not repeated here. Additionally, according to the SS
, state
of the system on this SS is also UUB. □
Remark 3. The use of ADP faces some practical problems, such as the difficulty of choosing an initial admissible control, the choice of activation functions for the neural network, and long training times. To limit the impact of these problems, off-line training and other empirical methods are used to ensure the real-time performance of the overall control system as far as possible.
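The off-line critic training described in this subsection can be illustrated on a toy problem. The sketch below is a simplified variant and not the update law of the paper: it keeps only the basic gradient term that reduces the squared Hamiltonian residual and omits the additional stabilization term. The scalar dynamics `x_dot = -x + v`, the utility `x^2 + v^2`, the single activation `sigma(x) = x^2`, the normalization of the gradient, and all function names are assumptions made for illustration. For this toy problem the HJB equation has the closed-form solution `w = sqrt(2) - 1`, which gives a reference value for the trained weight.

```python
import math


def residual_and_regressor(x, w):
    """Hamiltonian residual e and regressor phi for the assumed toy problem."""
    dsigma = 2.0 * x            # gradient of sigma(x) = x^2
    v = -0.5 * dsigma * w       # approximate policy with R = 1 and g = 1
    x_dot = -x + v              # assumed toy dynamics
    phi = dsigma * x_dot        # time derivative of sigma along the trajectory
    e = x * x + v * v + w * phi # utility plus derivative of the approximate cost
    return e, phi


def train_critic(w0=0.0, alpha=1.0, epochs=500, samples=(-1.0, -0.5, 0.5, 1.0)):
    """Normalized gradient descent on 0.5 * e^2 over a fixed set of sample states."""
    w = w0
    for _ in range(epochs):
        for x in samples:
            e, phi = residual_and_regressor(x, w)
            w -= alpha * phi / (1.0 + phi * phi) ** 2 * e
    return w


w_final = train_critic()
w_exact = math.sqrt(2.0) - 1.0  # closed-form HJB solution of the toy problem
```

Sampling a fixed grid of states stands in for the excitation that an online scheme would need; the zero initial weight corresponds to a zero initial control, which happens to be admissible here because the toy dynamics are open-loop stable.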
3.3. Sliding-Mode Control Design
After this novel SS is obtained through the above off-line design process, an adaptive fault-tolerant sliding-mode control is developed.
Considering the system (
1) and (
2) and the approximate optimal control policy (
19), the sliding-mode variable can be expressed as
Note that the control
is differentiable according to (
19), since
is a constant vector after convergence. Therefore, the derivative of the sliding-mode variable is written as
The novel SMC with the RADP-based SS and the adaptive fault compensation law are designed as
and
respectively, where
are the control gain and the learning rate of the adaptive law, respectively, and
is the estimate of the actuator faults. Define the fault estimation error as
.
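The interplay between a switching term and an adaptive fault estimate can be illustrated on a scalar toy model. This is a minimal sketch under stated assumptions, not the control law (32) or the adaptive law (33): it assumes sliding dynamics `s_dot = u + f` with an unknown constant additive fault `f`, a boundary-layer saturation in place of the sign function, and the adaptive law `f_hat_dot = gamma * s`. All names and numerical values are hypothetical.

```python
def sat(z):
    """Saturation used as a boundary-layer replacement for sign(z)."""
    return max(-1.0, min(1.0, z))


def simulate_smc(f_true=0.5, k=2.0, gamma=20.0, width=0.1,
                 s0=1.0, dt=1e-3, t_end=20.0):
    """Forward-Euler simulation of s_dot = u + f_true with adaptive compensation."""
    s, f_hat = s0, 0.0
    for _ in range(int(t_end / dt)):
        u = -k * sat(s / width) - f_hat  # switching term plus fault compensation
        s_dot = u + f_true               # assumed scalar sliding dynamics
        f_hat_dot = gamma * s            # adaptive law driven by the sliding variable
        s += dt * s_dot
        f_hat += dt * f_hat_dot
    return s, f_hat


s_final, f_hat_final = simulate_smc()
```

With the candidate V = 0.5 * s^2 + (f - f_hat)^2 / (2 * gamma), the derivative along this toy system is -k * s * sat(s / width), which is non-positive, so the sliding variable converges and the fault estimate settles at the true value in this constant-fault case.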
Theorem 1. Consider the mismatched uncertain and faulty system described by (1) and (2) under the proposed RADP-based sliding-mode variable (6), the control policy (32), and the adaptive fault compensation law (33). Then the fault estimation error is bounded and the closed-loop system dynamics are UUB.
Proof of Theorem 1. Choose the following Lyapunov function candidate
The derivative of the Lyapunov function candidate (
34) with respect to time along the sliding mode variable dynamics is
Substituting the control policy (
32) and the actuator fault compensation law (
33) into the above formula (
35), we obtain
From the above derivation, all signals of the closed-loop system are bounded. Based on the Lyapunov theorem and Lemma 2, the sliding mode exists and the states of the whole system are uniformly ultimately bounded. The derivation also shows that the reaching time of the SMC can be estimated by , where is the reaching time, is the initial value of the Lyapunov function candidate, and is a positive design parameter. Therefore, the reaching time can be reduced by adjusting the parameter . This completes the proof. □
Novelty and comparison: Making the designed sliding surface (SS) insensitive to the mismatched uncertainties is crucial for the SMC design and the SMC-based FTC.
In [
4], a mismatched disturbance is estimated by a disturbance observer, and the resulting estimate is used to design the SS. The disturbance observer makes the system more complex, which increases the difficulty of controller design. In contrast, our SS deals with this problem directly through optimal control, without any estimator. Although the optimal control solution is difficult to find analytically, it can be computed off-line using the ADP method.
In [
10], the integral SMC (ISMC) law is divided into a continuous part and a discontinuous part. The continuous part is used to reject the mismatched uncertainties, and consequently the SS is insensitive to them. In this process, the continuous part relies on complex zero-sum games and complex online neural network training, which are not conducive to practical application or engineering implementation. Our method has a simpler design process and is implemented off-line.
In the papers [
13,
14], optimal control is used to design the SS, but the SS design process does not make the SS insensitive to the mismatched uncertainties.
4. Simulation Examples
In this section, the novel sliding-mode FTC with the RADP-based SS is applied to a mismatched uncertain spacecraft attitude control system (ACS), which is a benchmark nonlinear cascade system. The model of this ACS [
2,
33] is presented as
and
where the inertia matrix
J and
are characterized by
the mismatched uncertainties
and the actuator faults
are given as
furthermore,
,
, and
are pitch angle, yaw angle, and roll angle of a spacecraft, simply denoted as a vector
, and
,
, and
are angular velocities with respect to an inertial reference frame expressed in the body-fixed frame. In addition, the uncertainty
is bounded, i.e.,
. The initial values of the states of this ACS are
, and
.
4.1. Traditional SMC-Based Fault-Tolerant Controller
To compare with the proposed method in this paper, a traditional SMC-based FTC is designed as follows
and the adaptive fault compensation is
where the designed sliding surface is
with
and let
and
. The simulation results are given as follows, and comments are given in the next subsection in a comparative manner.
4.2. RADP-Based Sliding-Mode Fault-Tolerant Controller
The total utility function is chosen as
with
and
for the uncertain ACS. A CNN is constructed to approximate the OCF as
Note that and are the activation function and the estimation of the ideal weight of NN, respectively. In these simulations, we set and . Substitute the constant weight after convergence into the novel SS and let and .
From
Figure 1,
Figure 2 and
Figure 3, it can be seen that, under essentially the same parameter settings, the proposed control scheme possesses better convergence characteristics and a shorter settling time and, thanks to the boundary layer technique, exhibits no chattering. The simulation results of the proposed control scheme also show that the system states are asymptotically stable rather than merely UUB as stated in Theorem 1. The reason for this behaviour is that
is an invertible constant matrix, which makes the error
in (
18) matched with
. Therefore, if
is large enough, the SMC (
32) can guarantee robustness against this error. In contrast, the SS design of the traditional SMC-based FTC does not account for the mismatched uncertainties, so the system states do not converge to zero even though the SMC guarantees reachability of the SS. The evolution of the weight
is shown in
Figure 4, and the weight vector eventually converges to
.
4.3. RADP-Based Sliding-Mode Fault-Tolerant Controller in Presence of Input Delay
Many factors cause delays in the ACS, such as the response time of actuators and sensors, faults of actuators and sensors, and signal sampling and transmission. Their effects are reflected in the input delay [
34]. An input delay degrades control performance and can even destroy the stability of the system. To verify whether the proposed controller can cope with time-delay uncertainty, an unknown constant time delay is added to the control system of the above simulation example:
where the time delay is set as
and the other parameters are the same as the values set above. The simulation results are shown in
Figure 5,
Figure 6 and
Figure 7.
Comparing the simulation results before and after adding the input delay shows that the proposed control scheme keeps the time-delay system bounded, but the control performance degrades. If better control performance is required, a dedicated design for the time delay is essential.
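A constant input delay of the kind added in this experiment can be reproduced in a fixed-step simulation with a first-in, first-out buffer. The sketch below is illustrative only and does not use the ACS model or the delay value of the paper: it assumes a scalar integrator plant `x_dot = u(t - tau)` with the hypothetical feedback `u = -k * x`, and it compares the integrated absolute error with and without a hypothetical delay of 0.5 s.

```python
from collections import deque


def simulate(delay_steps, k=2.0, x0=1.0, dt=1e-3, t_end=20.0):
    """Forward-Euler simulation of x_dot = u(t - tau) with u = -k * x.

    delay_steps is the delay expressed in integration steps (tau = delay_steps * dt).
    Returns the final state and the integrated absolute error.
    """
    # Commands issued but not yet applied; zero input acts before the first arrives.
    buffer = deque([0.0] * delay_steps)
    x, iae = x0, 0.0
    for _ in range(int(t_end / dt)):
        buffer.append(-k * x)         # command computed from the current state
        u_applied = buffer.popleft()  # command issued delay_steps steps earlier
        iae += abs(x) * dt
        x += dt * u_applied
    return x, iae


x_nominal, iae_nominal = simulate(delay_steps=0)
x_delayed, iae_delayed = simulate(delay_steps=500)  # hypothetical 0.5 s delay
```

In this toy setting the delayed loop remains bounded and still converges, but its integrated absolute error is larger than in the delay-free case, which mirrors the qualitative observation above.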