Actor-Critic Neural-Network-Based Fractional-Order Sliding Mode Control for Attitude Tracking of Spacecraft with Uncertainties and Actuator Faults

Jing, Chenghu; Ma, Xiaole; Zhang, Kun; Wang, Yanfeng; Yan, Bingsheng; Hui, Yanbo

doi:10.3390/fractalfract8070385

by Chenghu Jing *, Xiaole Ma, Kun Zhang, Yanfeng Wang, Bingsheng Yan and Yanbo Hui

School of Mechanical and Electronic Engineering, Henan University of Technology, Zhengzhou 450001, China

* Author to whom correspondence should be addressed.

Fractal Fract. 2024, 8(7), 385; https://doi.org/10.3390/fractalfract8070385

Submission received: 1 June 2024 / Revised: 25 June 2024 / Accepted: 26 June 2024 / Published: 28 June 2024

Abstract:

This paper investigates the attitude control of rigid spacecraft in the presence of uncertainties, disturbances, and actuator faults. To address these challenges and improve system performance, a novel actor-critic neural-network-based fractional-order sliding mode control (ACNNFOSMC) is developed for spacecraft. The integration of an actor-critic neural network, fractional-order theory, and sliding mode control enables dual functionality: the actor-critic neural network approximates the aggregate of uncertain parameters, disturbances, and actuator faults so that it can be compensated, while the fractional-order sliding mode control mechanism improves the system’s tracking precision and overall robustness against uncertainties. A stability analysis of the proposed control framework is presented. Simulation experiments confirm the effectiveness and attitude control precision of the proposed strategy, even in complex operational scenarios.

Keywords:

spacecraft; attitude tracking; neural network; sliding mode control; fractional-order

1. Introduction

In the pursuit of advanced space missions, spacecraft systems serve as crucial components, requiring exceptional performance attributes like high-precision tracking, robustness, rapid responsiveness, and operational efficiency, all while enduring diverse and challenging environmental conditions [1,2,3]. It is a widely accepted fact that spacecraft confront inherent uncertainties and experience extrinsic disturbances, compounded by potential actuator malfunctions, which pose significant hurdles to optimal control [4]. Considering the intricacies involved, the attitude control of spacecraft has emerged as a critical research focus due to its centrality in ensuring accurate navigation and maneuverability [5].

Over the past years, considerable effort has been dedicated to the development of attitude tracking control methodologies, such as RISE control [6], adaptive control [7], backstepping control [8], iterative learning control [9], and robust H∞ control [10]. Nonetheless, these approaches guarantee only asymptotic error convergence or uniform ultimate boundedness of the attitude tracking error. In response, finite-time controllers have been vigorously explored [11,12,13,14,15,16], underscoring the quest for enhanced performance in practical scenarios. Among these methods, terminal sliding mode control has emerged as a potent tool, representing an advancement over conventional sliding mode control (SMC) [17]. It is robust and enables finite-time convergence, although its implementation is hampered by singularity and chattering [18,19]. Pukdeboon designed a second-order SMC approach for the attitude control of spacecraft [20], while Lu and Xia introduced an adaptive super-twisting algorithm addressing the uncertainties and external disturbances in spacecraft attitude control [21]. High-order sliding mode control effectively overcomes the chattering of classical sliding mode control while retaining good robustness and fast response.

The integration of fractional-order calculus with sliding mode control methodologies has provided new means to advance controller performance [22]. By transcending traditional integer-order derivatives, fractional-order differentiation transforms the control signal’s sharp transitions into smoother fractional derivatives, effectively suppressing the chattering induced by discontinuous control actions [23]. This innovation has fostered the application of fractional-order methodologies in a multitude of nonlinear control systems, including tailored solutions for spacecraft attitude control. Alipour et al.’s [24] work introduced an adaptive fractional-order nonsingular terminal SMC, ingeniously designed to stabilize spacecraft attitude while concurrently minimizing reaction wheel momentum errors. Zhang et al. [25] introduced a novel fault-tolerant control scheme, integrating fractional-order nonsingular terminal sliding mode control with backstepping techniques, tailored to tackle spacecraft attitude control amidst faults. This strategy ensures the finite-time stability of the tracking error in a faulty closed-loop attitude control system. Qian et al. [26] employed an adaptive sliding mode observer for fault estimation and combined fractional-order SMC with adaptive fuzzy approximation techniques, which ensured enhanced robustness and fault tolerance of the system. Fractional-order control, as elucidated in various seminal works, augments the parameter tuning range, thereby enhancing the flexibility in controller design and refining system response precision [27]. These studies have devised fractional-order SMC for spacecraft applications, demonstrating the efficacy of this approach.

All the previously mentioned methods have demonstrated their effectiveness. However, they all necessitate prior knowledge of system dynamics, thereby precluding their application in realizing optimal tracking control. Due to this limitation, the interest in reinforcement learning (RL) as a means to achieve control optimization has been rejuvenated. Among the prominent and efficacious RL methodologies, the actor-critic RL stands out [28,29]. The integration of neural networks (NNs) with the actor-critic RL framework has spurred significant advancements in intelligent control techniques and shed light on the approaches to address tracking control challenges [30,31,32,33,34]. Wang et al. [35] proposed an RL-oriented optimal control method that effectively reduces vibration and improves the operational efficiency of unmanned ground vehicles operating in complex and uncertain environments. Similarly, Zheng et al. [36] integrated the actor-critic RL framework into an SMC strategy for precise trajectory tracking in spacecraft, emphasizing the adaptability of RL to intricate aerospace engineering applications.

However, a comprehensive method that combines SMC, fractional-order control, and an actor-critic NN (ACNN) has not been investigated. Therefore, this paper proposes ACNNFOSMC for the attitude control of spacecraft amidst uncertainties and actuator faults. The core novelties and advancements of this study can be encapsulated as follows:

(1) The developed framework includes an ACNN framework to approximate the aggregate of uncertain parameters, disturbances, and actuator faults of spacecraft, thereby facilitating their compensation. Unlike traditional neural network (NN) approximation [37,38], an ACNN-based control scheme enables the derivation of an optimized control policy in real-time, utilizing state information. By leveraging neural networks to learn and compensate for system uncertainties, our method reduces the reliance on computationally intensive model development and fine tuning. The ability to adaptively estimate and respond to uncertainties in real time through neural-network-based learning fosters greater robustness in the control system.

(2) A fractional-order super-twisting control based on a fractional-order sliding mode surface is proposed for spacecraft. Compared with SMC, the proposed fractional-order SMC converts the sharp transformation of control signals into smoother fractional derivatives, effectively suppressing the chattering caused by discontinuous control actions. Compared with super-twisting control, it augments the parameter tuning range, thereby enhancing the flexibility in controller design and improving system response precision.

(3) By merging ACNN with a fractional-order super-twisting SMC, we significantly expedite the system’s convergence process while concurrently boosting steady-state accuracy. This fusion represents a pivotal advancement in control system dynamics and responsiveness.

2. Model Description and Preliminaries

2.1. Spacecraft Dynamics

Consider the following notation: Let

q^{T} = [q_{1}, q_{2}, q_{3}]

and

ω = {[ω_{1}, ω_{2}, ω_{3}]}^{T} \in R^{3}

denote attitude and the angular velocity vector, respectively; let

q_{u} = {[q_{0}, q^{T}]}^{T} \in R^{4}

with

q^{T} q + q_{0}^{2} = 1

denote the unit quaternion,

I_{3} \in R^{3 \times 3}

denote the identity matrix; let

q_{u d} = {[q_{d 0}, q_{d}^{T}]}^{T} \in R^{4}

with

q_{d}^{T} = [q_{d 1}, q_{d 2}, q_{d 3}]

and

ω_{d} = {[ω_{d 1}, ω_{d 2}, ω_{d 3}]}^{T} \in R^{3}

denote the desired attitude and angular velocity, respectively; let

{\tilde{q}}_{u} = {[{\tilde{q}}_{0}, {\tilde{q}}^{T}]}^{T}

and

\tilde{ω}

denote the attitude tracking error and velocity tracking error, respectively. Throughout this document, the index

i

spans the set

Ω_{i} ≜ \{i : 1, 2, 3\}

; any vector

x \in R^{3}

means

x = {[x_{1}, x_{2}, x_{3}]}^{T}

, and

x^{\times} = [0, - x_{3}, x_{2}; x_{3}, 0, - x_{1}; - x_{2}, x_{1}, 0]

means its skew-symmetric matrix.

The dynamics of spacecraft are formally described by [39]

\{\begin{array}{l} {\dot{q}}_{0} = - \frac{1}{2} q^{T} ω, \dot{q} = \frac{1}{2} (q_{0} I_{3} + q^{\times}) ω \\ J \dot{ω} = - ω^{\times} J ω + u - E u + E \bar{u} + d (t) \end{array},

(1)

where

J = J^{T} \in R^{3 \times 3}

denotes the inertia matrix,

d (t) = {[d_{1}, d_{2}, d_{3}]}^{T} \in R^{3}

denotes the disturbance,

u \in R^{3}

denotes the control torque,

\bar{u} \in R^{3}

denotes bounded fault, and

E = diag \{E_{1}, E_{2}, E_{3}\} \in R^{3 \times 3}

denotes a failure indicator matrix, where its element

0 \leq E_{i} \leq 1

signifies the failure indicator for each respective actuator.
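The structure of Equation (1) can be sketched numerically as follows. This is a minimal illustration, not the authors' implementation; the function names `skew` and `attitude_dynamics` are hypothetical, and all quantities are assumed to be NumPy arrays in consistent units.

```python
import numpy as np

def skew(x):
    """Return the skew-symmetric matrix x^x of a 3-vector x."""
    return np.array([[0.0, -x[2], x[1]],
                     [x[2], 0.0, -x[0]],
                     [-x[1], x[0], 0.0]])

def attitude_dynamics(q0, q, w, u, J, E, u_bar, d):
    """Right-hand side of Equation (1).

    q0, q : scalar and vector parts of the unit quaternion
    w     : angular velocity, u : commanded torque
    E     : diagonal failure-indicator matrix, u_bar : bounded fault
    d     : external disturbance torque
    """
    q0_dot = -0.5 * q @ w
    q_dot = 0.5 * (q0 * np.eye(3) + skew(q)) @ w
    torque = u - E @ u + E @ u_bar + d          # effective torque under faults
    w_dot = np.linalg.solve(J, -skew(w) @ J @ w + torque)
    return q0_dot, q_dot, w_dot
```

With `E` set to the zero matrix the effective torque reduces to `u + d`, which corresponds to fault-free actuators in this formulation.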

Due to the inherent uncertainty of the inertia matrix, it can be represented as the sum of a nominal inertia matrix

J_{0}

and an uncertain component

Δ J

, expressed as

J = J_{0} + Δ J

. The dynamics (1) are given by

\dot{ω} = - J_{0}^{- 1} ω^{\times} J_{0} ω + J_{0}^{- 1} u - f_{N},

(2)

where

f_{N} = Δ \tilde{J} ω^{\times} J ω + J_{0}^{- 1} ω^{\times} Δ J ω - Δ \tilde{J} u - J^{- 1} (d (t) + E u - E \bar{u})

,

Δ \tilde{J} = - J_{0}^{- 1} Δ J {(I_{3} + J_{0}^{- 1} Δ J)}^{- 1} J_{0}^{- 1}

.
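The role of the correction term can be checked numerically. The sketch below assumes that the correction term is defined so that the inverse of the true inertia equals the nominal inverse plus the correction, which by the matrix inversion identity requires an inverse on the middle factor. The perturbation `dJ` is a hypothetical constant matrix chosen only for illustration.

```python
import numpy as np

J0 = np.diag([21.0, 18.0, 15.0])           # nominal inertia (value used in Section 5)
dJ = np.array([[1.0, 1.3, 0.8],            # hypothetical constant perturbation
               [1.0, 0.5, 1.5],
               [0.9, 1.5, 0.7]])

J0_inv = np.linalg.inv(J0)
# Correction term, with the inverse applied to (I + J0^{-1} dJ)
dJ_tilde = -J0_inv @ dJ @ np.linalg.inv(np.eye(3) + J0_inv @ dJ) @ J0_inv

# Largest entrywise gap between inv(J0 + dJ) and J0^{-1} + dJ_tilde
identity_gap = np.max(np.abs(np.linalg.inv(J0 + dJ) - (J0_inv + dJ_tilde)))
```

Under this assumption `identity_gap` is at the level of floating-point round-off.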

From Equation (1), it follows that

\begin{array}{l} \tilde{q} = q_{d 0} q - q_{d}^{\times} q - q_{0} q_{d} \\ {\tilde{q}}_{0} = q_{d}^{T} q + q_{0} q_{d 0} \\ \tilde{ω} = ω - C ω_{d} \end{array},

(3)

where

C = ({\tilde{q}}_{0}^{2} - {\tilde{q}}^{T} \tilde{q}) I_{3} + 2 \tilde{q} {\tilde{q}}^{T} - 2 {\tilde{q}}_{0} {\tilde{q}}^{\times}

.
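A small numerical sketch of the error quantities in Equation (3) follows. It assumes that C is the standard rotation matrix associated with the unit error quaternion; the helper names are hypothetical.

```python
import numpy as np

def skew(x):
    """Skew-symmetric matrix of a 3-vector."""
    return np.array([[0.0, -x[2], x[1]],
                     [x[2], 0.0, -x[0]],
                     [-x[1], x[0], 0.0]])

def attitude_error(q0, q, qd0, qd):
    """Error quaternion (scalar part, vector part) between actual and desired attitude."""
    q_err = qd0 * q - skew(qd) @ q - q0 * qd
    q0_err = qd @ q + q0 * qd0
    return q0_err, q_err

def rotation_matrix(q0_err, q_err):
    """Rotation matrix C built from the error quaternion (assumed standard form)."""
    return ((q0_err**2 - q_err @ q_err) * np.eye(3)
            + 2.0 * np.outer(q_err, q_err)
            - 2.0 * q0_err * skew(q_err))
```

When the actual and desired quaternions coincide, the error quaternion is the identity and C reduces to the identity matrix.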

By employing Equations (1)–(3), the description of the tracking error’s dynamic behavior is thereby established as

\left\{ \begin{array}{l} {\dot{\tilde{q}}}_{0} = - \frac{1}{2} {\tilde{q}}^{T} \tilde{ω}, \dot{\tilde{q}} = \frac{1}{2} ({\tilde{q}}_{0} I_{3} + {\tilde{q}}^{\times}) \tilde{ω} \\ \dot{\tilde{ω}} = - J_{0}^{- 1} ω^{\times} J_{0} ω + {\tilde{ω}}^{\times} C ω_{d} - C {\dot{ω}}_{d} + J_{0}^{- 1} u - f_{N} \end{array} \right.,

(4)

Assumption 1

([20,21]). The overall perturbation

f_{N}

exhibits continuous differentiability in relation to time.

2.2. Preliminaries

Within this section, the fundamental concepts pertinent to fractional-order calculus are outlined.

Definition 1

([22]). The Riemann–Liouville

β

-order fractional derivative and integral are expressed as

{}_{a}𝒟_{t}^{β} f (t) = \frac{d^{β} f (t)}{d t^{β}} = \frac{1}{Γ (r - β)} \frac{d^{r}}{d t^{r}} \int_{a}^{t} \frac{f (τ)}{{(t - τ)}^{β - r + 1}} d τ,

(5)

{}_{a}𝒟_{t}^{- β} f (t) = {}_{a}ℐ_{t}^{β} f (t) = \frac{1}{Γ (β)} \int_{a}^{t} \frac{f (τ)}{{(t - τ)}^{1 - β}} d τ,

(6)

where

f (t)

denotes any function,

𝒟^{β}

denotes the fractional derivative,

ℐ^{β}

represents the fractional integral,

r - 1 < β < r

,

Γ (\cdot)

is Euler’s gamma function, which is given by

Γ (β) = \int_{0}^{\infty} e^{- t} t^{β - 1} d t,

(7)

Shifting focus to the realm of engineering and control, the prevalent Caputo interpretation for

β

-order fractional calculus is encapsulated by:

{}_{a}𝒟_{t}^{β} f (t) = \frac{1}{Γ (r - β)} \int_{a}^{t} {(t - τ)}^{r - β - 1} f^{(r)} (τ) d τ,

(8)

Notably, it is important to recognize

{}_{a}𝒟_{t}^{β} f (t) = {}_{a}ℐ_{t}^{1 - β} \dot{f} (t)

in this context. Nonetheless, the versatility of operator (8) resides in its broad applicability to an extensive set of continuous functions, which may not be restricted to exhibiting solely integer-order derivatives.
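For numerical work, fractional operators are often approximated on a uniform grid. The sketch below uses the Grünwald–Letnikov scheme, which is a swapped-in discretization and not a method stated in this paper; it agrees with the Riemann–Liouville and Caputo definitions for sufficiently smooth functions that vanish at the lower terminal a = 0. The function names are hypothetical.

```python
import numpy as np
from math import gamma

def gl_weights(beta, n):
    """Grunwald-Letnikov weights w_k = (-1)^k * binom(beta, k), k = 0..n."""
    w = np.empty(n + 1)
    w[0] = 1.0
    for k in range(1, n + 1):
        w[k] = w[k - 1] * (1.0 - (beta + 1.0) / k)
    return w

def gl_derivative(f, beta, t, h=1e-3):
    """Approximate the beta-order derivative of f on [0, t]; beta < 0 gives an integral."""
    n = int(round(t / h))
    w = gl_weights(beta, n)
    samples = f(t - h * np.arange(n + 1))   # f(t), f(t - h), ..., f(0)
    return h ** (-beta) * np.dot(w, samples)

# Two closed-form checks at t = 1; both equal 1 / Gamma(3/2)
half_derivative_of_t = gl_derivative(lambda x: x, 0.5, 1.0)
half_integral_of_one = gl_derivative(lambda x: np.ones_like(x), -0.5, 1.0)
exact = 1.0 / gamma(1.5)
```

The scheme is first-order accurate in the step `h`, so the two approximations match `exact` only to a few decimal places for the step used here.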

3. Controller Design

The control scheme framework proposed in this paper is shown in Figure 1. The integration of an ACNN with a super-twisting SMC with fractional-order theory enables dual functionality: the ACNN serves to approximate the aggregate of uncertain parameters, disturbances, and actuator faults, thereby facilitating their compensation, while the fractional-order super-twisting SMC significantly improves the system’s tracking precision and overall robustness against uncertainties.

3.1. Uncertainty Estimation Using Actor-Critic NN

The unknown and time-varying nature of

f_{N}

has an impact on control efficacy. To address this,

f_{N}

can be approximated using an ACNN.

3.1.1. Critic NN

Regarding the critic NN, a long-term cost function can be formulated as [28]

l (t) = \int_{t}^{\infty} e^{- \frac{τ - t}{ψ}} η (τ) d τ,

(9)

where

ψ

serves as a constant factor for discounting the future cost, and

η (t)

denotes an instant cost function given by

η (t) = {\tilde{q}}^{T} M \tilde{q} + u^{T} G u,

(10)

where

M

and

G

denote the designed positive definite matrices. The pinnacle of control performance is attained upon achieving the minimal accumulated cost, referred to as the cost-to-go function.

Based on the approximation property of NNs,

l (t)

can be represented as

l (t) = ω_{c}^{* T} σ_{c} (\tilde{q}) + ε_{c}

, where

ω_{c}^{*} = {[ω_{c 1}^{*}, ω_{c 2}^{*}, \dots, ω_{c h}^{*}]}^{T}

signifies the vector of optimal weights,

h

indicates the quantity of nodes comprising the hidden layer,

\tilde{q} = {[{\tilde{q}}_{1}, {\tilde{q}}_{2}, {\tilde{q}}_{3}]}^{T}

characterizes the input vector,

ε_{c}

is referred to as the least approximation error, and

σ_{c} (\tilde{q})

utilizes a Gaussian function structured as

σ_{c j} (\tilde{q}) = \exp [- {(\tilde{q} - μ_{c j})}^{T} (\tilde{q} - μ_{c j}) / b_{c j}^{2}], j = 1, 2, \dots, h,

(11)

where

μ_{c j} = {[μ_{c j 1}, μ_{c j 2}, μ_{c j 3}]}^{T}

signifies the receptive field’s centroid, and

b_{c j}

denotes the breadth of the Gaussian function.

The approximation for

l (t)

is formalized by

\hat{l} (t) = {\hat{ω}}_{c}^{T} σ_{c} (\tilde{q}),

(12)

With respect to Equations (10) and (12), the approximate error of the cost function is expressed as

λ (t) = η (t) - \frac{1}{ψ} \hat{l} (t) + \dot{\hat{l}} (t),

(13)

To outline the update rule for the critic NN,

E_{c} = (1 / 2) λ^{2}

is defined. Based on the principle of gradient descent, the adjustment rule for the critic network’s weights is devised as

{\dot{\hat{ω}}}_{c} = - δ_{c} \frac{\partial E_{c}}{\partial {\hat{ω}}_{c}},

(14)

Upon integrating Equation (13) into Equation (14), the resultant expression becomes

\begin{array}{rl} {\dot{\hat{ω}}}_{c} & = - δ_{c} λ \frac{\partial λ}{\partial {\hat{ω}}_{c}} \\ & = - δ_{c} λ \frac{\partial [η (t) - (1 / ψ) \hat{l} (t) + \dot{\hat{l}} (t)]}{\partial {\hat{ω}}_{c}} \\ & = - δ_{c} λ [- \frac{1}{ψ} \frac{\partial \hat{l}}{\partial {\hat{ω}}_{c}} + \frac{\partial}{\partial {\hat{ω}}_{c}} (\frac{\partial \hat{l}}{\partial \tilde{q}} \dot{\tilde{q}})] \end{array},

(15)

where

δ_{c}

signifies the learning rate of the critic NN.

Defining

ζ_{c} ({\hat{ω}}_{c}, \tilde{q}, \dot{\tilde{q}}) = (η (t) + {\hat{ω}}_{c}^{T} Λ) Λ

with

Λ = - (σ_{c} / ψ) + \nabla σ_{c} \dot{\tilde{q}}

, Equation (15) is further written as

{\dot{\hat{ω}}}_{c} = - δ_{c} ζ_{c} ({\hat{ω}}_{c}, \tilde{q}, \dot{\tilde{q}}),

(16)
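The critic update of Equations (10)–(16) can be sketched as a single explicit-Euler step. This is an illustrative discretization under assumptions: the paper states the law in continuous time, all Gaussian widths are taken equal, and the function names and the step size `dt` are hypothetical.

```python
import numpy as np

def rbf(q_err, centers, width):
    """Gaussian basis vector sigma_c of Equation (11); centers has shape (h, 3)."""
    diff = q_err - centers
    return np.exp(-np.sum(diff**2, axis=1) / width**2)

def rbf_jacobian(q_err, centers, width):
    """Jacobian of sigma_c with respect to the attitude error, shape (h, 3)."""
    diff = q_err - centers
    return rbf(q_err, centers, width)[:, None] * (-2.0 * diff / width**2)

def critic_step(w_c, q_err, q_err_dot, u, M, G, centers, width, psi, delta_c, dt):
    """One explicit-Euler step of the critic law (16); returns new weights and lambda."""
    eta = q_err @ M @ q_err + u @ G @ u                      # instant cost (10)
    Lam = (-rbf(q_err, centers, width) / psi
           + rbf_jacobian(q_err, centers, width) @ q_err_dot)
    lam = eta + w_c @ Lam                                    # approximation error (13)
    return w_c - dt * delta_c * lam * Lam, lam
```

For a sufficiently small `dt`, repeating the step with the same inputs shrinks the magnitude of the approximation error, which is the intended effect of gradient descent on E_c.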

A constant vector

ω_{c \max} = {[ω_{c \max 1}, ω_{c \max 2}, \dots, ω_{c \max h}]}^{T}

is selected to adhere to the condition

|{\hat{ω}}_{c j}| \leq |ω_{c \max j}|, j = 1, 2, \dots, h

. To ensure that the critic NN’s weights are bounded, an update rule grounded in the projection methodology for the critic NN is formulated as follows:

{\dot{\hat{ω}}}_{c} = \left\{ \begin{array}{ll} - δ_{c} ζ_{c}, & ‖{\hat{ω}}_{c}‖ < ‖ω_{c \max}‖ \text{ or } (‖{\hat{ω}}_{c}‖ = ‖ω_{c \max}‖ \text{ and } {\hat{ω}}_{c}^{T} ζ_{c} > 0) \\ - δ_{c} ζ_{c} + δ_{c} ξ_{c}, & ‖{\hat{ω}}_{c}‖ = ‖ω_{c \max}‖ \text{ and } {\hat{ω}}_{c}^{T} ζ_{c} \leq 0 \end{array} \right.,

(17)

where

ξ_{c} = ({\hat{ω}}_{c}^{T} ζ_{c} / {‖{\hat{ω}}_{c}‖}^{2}) {\hat{ω}}_{c}

. Provided that

‖{\hat{ω}}_{c} (0)‖ \leq ‖ω_{c \max}‖

holds true, the implementation of the projection-based update law in Equation (17) consistently upholds the constraint

‖{\hat{ω}}_{c}‖ \leq ‖ω_{c \max}‖

.
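The projection logic of Equation (17) can be sketched as below. The function name is hypothetical, and the boundary test uses `>=` rather than exact equality as a floating-point safeguard, which is an assumption of this sketch.

```python
import numpy as np

def projected_rate(w_hat, zeta, delta, w_max_norm):
    """Weight rate following the structure of Equation (17)."""
    rate = -delta * zeta
    on_boundary = np.linalg.norm(w_hat) >= w_max_norm
    if on_boundary and w_hat @ zeta <= 0.0:
        # Remove the outward radial component so the norm stops growing
        xi = (w_hat @ zeta) / (w_hat @ w_hat) * w_hat
        rate = rate + delta * xi
    return rate
```

On the boundary the projected rate is orthogonal to the weight vector, so the weight norm does not increase; in the interior the unmodified gradient rate is returned.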

3.1.2. Actor NN

The output vector

{\hat{f}}_{N}

of radial basis function NN, which serves to approximate modeling uncertainties

f_{N}

, is formulated as

{\hat{f}}_{N} = [\begin{matrix} {\hat{ω}}_{a 1}^{T} σ_{a 1} (𝒵_{1}) \\ {\hat{ω}}_{a 2}^{T} σ_{a 2} (𝒵_{2}) \\ {\hat{ω}}_{a 3}^{T} σ_{a 3} (𝒵_{3}) \end{matrix}],

(18)

where

ω_{a i} = {[ω_{a i 1}, ω_{a i 2}, \dots, ω_{a i m}]}^{T}

,

σ_{a i} (𝒵_{i})

and

𝒵_{i} = {[{\tilde{q}}_{1 i}, {\tilde{ω}}_{1 i}]}^{T}

represent the weight vector, the Gaussian function, and the input vector, respectively.

The approximate error of the actor NN is expressed as

{\tilde{f}}_{N i} = {\tilde{ω}}_{a i}^{T} σ_{a i} (𝒵_{i}),

(19)

where

{\tilde{ω}}_{a i} = ω_{a i}^{*} - {\hat{ω}}_{a i}

. Considering Equations (9) and (12), let

l_{d} (t) = 0

represent the expected cost function for future states; the error associated with actor NN is given by

e_{a} = {\tilde{f}}_{N i} + k_{l} (\hat{l} (t) - l_{d} (t)),

(20)

where

k_{l} > 0

is a coefficient. Upon defining

E_{a} = \ln (\cosh (e_{a}))

, the updating rule of the actor NN’s weights is devised as

\begin{array}{rl} {\dot{\hat{ω}}}_{a i} & = - δ_{a} \frac{\partial E_{a}}{\partial {\hat{ω}}_{a i}} \\ & = - δ_{a} \frac{d E_{a}}{d e_{a}} \frac{\partial e_{a}}{\partial {\tilde{f}}_{N i}} \frac{\partial {\tilde{f}}_{N i}}{\partial {\hat{ω}}_{a i}} \\ & = - δ_{a} \tanh ({\tilde{f}}_{N i} + k_{l} \hat{l}) σ_{a i} \end{array},

(21)

where

δ_{a}

is the designed updating rate. In light of the unavailability of

{\tilde{f}}_{N i}

, Equation (21) is modified as follows:

\begin{matrix} {\dot{\hat{ω}}}_{a i} & = - δ_{a} \tanh (\sum_{i = 1}^{n} {\hat{ω}}_{a i}^{T} σ_{a i} + k_{l} \hat{l}) σ_{a i} \\ = - δ_{a} ζ_{a i} ({\hat{ω}}_{a i}, \tilde{q}, \dot{\tilde{q}}) \end{matrix},

(22)

To ensure that the actor NN’s weights are bounded, parameter projection is employed. A set of constant vectors

ω_{a i \max} = {[ω_{a i 1 \max}, ω_{a i 2 \max}, \dots, ω_{a i m \max}]}^{T}

are tailored to fulfill the condition

|{\hat{ω}}_{a i j}| \leq |ω_{a i j \max}| (j = 1, 2, \dots, m)

, leading to the design of a projection-based update law for the actor NN as follows:

{\dot{\hat{ω}}}_{a i} = \left\{ \begin{array}{ll} - δ_{a} ζ_{a i}, & ‖{\hat{ω}}_{a i}‖ < ‖ω_{a i \max}‖ \text{ or } (‖{\hat{ω}}_{a i}‖ = ‖ω_{a i \max}‖ \text{ and } {\hat{ω}}_{a i}^{T} ζ_{a i} > 0) \\ - δ_{a} ζ_{a i} + δ_{a} ξ_{a i}, & ‖{\hat{ω}}_{a i}‖ = ‖ω_{a i \max}‖ \text{ and } {\hat{ω}}_{a i}^{T} ζ_{a i} \leq 0 \end{array} \right.,

(23)

where

ξ_{a i} = ({\hat{ω}}_{a i}^{T} ζ_{a i} / {‖{\hat{ω}}_{a i}‖}^{2}) {\hat{ω}}_{a i}

. Provided that

‖{\hat{ω}}_{a i} (0)‖ \leq ‖ω_{a i \max}‖

holds true, the projection updating law (23) consistently guarantees the constraint

‖{\hat{ω}}_{a i}‖ \leq ‖ω_{a i \max}‖

.

3.2. Fractional-Order Super-Twisting Sliding Mode Control

A fractional-order sliding mode surface is configured as

s = \tilde{ω} + ι_{1} 𝒟^{α} \tilde{q} + ι_{2} 𝒟^{β - 1} \tilde{q},

(24)

where

ι_{1}

and

ι_{2}

denote coefficients,

0 \leq α \leq 1

, and

0 \leq β \leq 1

.

Derived from Equation (24), the dynamics of

s

are delineated by

\begin{matrix} \dot{s} & = \dot{\tilde{ω}} + ι_{1} 𝒟^{α + 1} \tilde{q} + ι_{2} 𝒟^{β} \tilde{q} \\ = - J_{0}^{- 1} ω^{\times} J_{0} ω + {\tilde{ω}}^{\times} C ω_{d} - C {\dot{ω}}_{d} + J_{0}^{- 1} u \\ - {\hat{f}}_{N} + ε + ι_{1} 𝒟^{α + 1} \tilde{q} + ι_{2} 𝒟^{β} \tilde{q} \end{matrix},

(25)

From Equation (25), the control law is thereby formulated as

\begin{array}{rcl} u & = & u_{e q} + u_{e s} \\ u_{e q} & = & ω^{\times} J_{0} ω - J_{0} ({\tilde{ω}}^{\times} C ω_{d} - C {\dot{ω}}_{d}) + J_{0} {\hat{f}}_{N} - J_{0} (ι_{1} 𝒟^{α + 1} \tilde{q} + ι_{2} 𝒟^{β} \tilde{q}) \\ u_{e s} & = & - J_{0} k_{1} {sig}^{\frac{1}{2}} (s) - J_{0} k_{2} ℐ^{γ} sign (s) \end{array},

(26)

where

0 < γ \leq 1

.

Remark 1.

When

γ = 1

, the fractional-order control scheme (26) is transformed into the classical integer-order super-twisting controller, whose control performance and stability characteristics have been extensively investigated in many prior studies. Consequently, the present discussion intentionally excludes the case where

γ = 1

, focusing exclusively on the cases where

0 < γ < 1

.
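The switching part of the control law (26) can be sketched in discrete time as follows. Two assumptions are made here that the text does not state: the operator sig^{1/2}(s) is read as |s|^{1/2} sign(s) applied elementwise, and the fractional integral is approximated with Grünwald–Letnikov weights over a finite memory window. The class and function names are hypothetical.

```python
import numpy as np

def sig_half(s):
    """Assumed definition: sig^{1/2}(s) = |s|^{1/2} sign(s), elementwise."""
    return np.sqrt(np.abs(s)) * np.sign(s)

class FractionalIntegrator:
    """Grunwald-Letnikov approximation of the gamma-order integral of a sampled signal."""

    def __init__(self, gamma_order, dt, max_steps):
        w = np.empty(max_steps)
        w[0] = 1.0
        for k in range(1, max_steps):
            w[k] = w[k - 1] * (1.0 - (1.0 - gamma_order) / k)
        self.w, self.dt, self.gamma = w, dt, gamma_order
        self.history = []

    def update(self, value):
        self.history.append(np.asarray(value, dtype=float))
        self.history = self.history[-len(self.w):]   # keep a finite memory window
        past = np.array(self.history[::-1])          # newest sample first
        return self.dt ** self.gamma * (self.w[:len(past)] @ past)

def switching_torque(s, integrator, J0, k1, k2):
    """Sketch of u_es in Equation (26) for scalar gains k1 and k2."""
    return -J0 @ (k1 * sig_half(s)) - J0 @ (k2 * integrator.update(np.sign(s)))
```

With `gamma_order = 1.0` all weights equal one and the integrator reduces to a rectangular-rule integer-order integral, mirroring the reduction to the classical super-twisting controller noted in Remark 1.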

Incorporating Equation (26) into Equation (25) leads to

\dot{s} = - k_{1} {sig}^{\frac{1}{2}} (s) - k_{2} ℐ^{γ} sign (s) + f_{d},

(27)

where

f_{d} = {\tilde{f}}_{N} + ε

,

{\tilde{f}}_{N} = f_{N} - {\hat{f}}_{N}

.

Subsequently, Equation (27) can be reformulated as

\begin{array}{l} \dot{s} = - k_{1} {sig}^{\frac{1}{2}} (s) + s_{I} \\ 𝒟^{γ} s_{I} = - k_{2} sign (s) + {\dot{f}}_{d} \end{array},

(28)

For clarity and comprehension, Equation (28) is recast in scalar format as:

\begin{array}{l} {\dot{s}}_{i} = - k_{1 i} {sig}^{\frac{1}{2}} (s_{i}) + s_{I i} \\ 𝒟^{γ} s_{I i} = - k_{2 i} sign (s_{i}) + {\dot{f}}_{d i} \end{array},

(29)

The sequence

{(t_{n})}_{n \in N \cup \{0\}}

, which is characterized by strict monotonic increase, comprises every instant fulfilling the criterion

s_{i} (t_{n}) = 0

. The presence of solutions to Equation (29) is elucidated following Garrappa’s [40] framework, thereby augmenting the Filippov regularization methodology to encompass fractional-order scenarios. Capitalizing on the implications derived from Equation (29), particularly with regard to

{\dot{s}}_{i} (t_{n}) = s_{I i} (t_{n})

, Equation (25) undergoes a transformation into its subsequent form

{\dot{s}}_{i} (t) = {\dot{s}}_{i} (t_{n}) - k_{1 i} {sig}^{\frac{1}{2}} (s_{i}) - k_{2 i} ℐ^{γ} sign (s_{i}) + ℐ^{γ} {\dot{f}}_{d i} (t),

(30)

Based on Assumption 1 and the characteristics of neural networks, we can assume that

{\dot{f}}_{d i} (t)

is bounded.

4. Stability Analysis

Theorem 1.

Given the dynamics during the reaching phase described in Equation (30) with

γ / (γ + 1) < 0.5

and

γ \in (0, 1)

, if the parameters are selected to meet the following constraints

\begin{array}{l} k_{1 i} > 0 \\ k_{2 i} > \max (\frac{3 + γ}{1 - γ} δ_{i m}, {(\frac{k_{1 i}}{1 - ς})}^{γ / 0.5} {|s_{I i} (t_{0})|}^{1 - γ} + δ_{i m}) \\ ς = (1 + γ) (k_{2 i} + δ_{i m}) / (k_{2 i} - δ_{i m}) - 1 < 1 \end{array},

(31)

then there exists a finite time

t = t_{f}

where

s_{i} (t) = s_{I i} (t) = 0

holds for

t \geq t_{f},

where

t_{f} \leq t_{0} + \frac{1}{1 - μ_{s}^{1 / γ}} {[\frac{Γ (2 + γ)}{k_{2 i} - δ_{i m}} |s_{I i} (t_{0})|]}^{1 / γ},

which applies for

μ_{s} = ς + \frac{k_{1 i} {|s_{I i} (t_{0})|}^{0.5 (1 + 1 / γ) - 1}}{{(1 + 1 / γ)}^{0.5}} {[\frac{Γ (1 + γ)}{k_{2 i} - δ_{i m}}]}^{0.5 / γ} < 1

and commences with the given initial conditions

(s_{i} (t), s_{I i} (t)) = (0, 0) .
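The gain conditions in (31) can be evaluated for a candidate set of gains as sketched below. The function name and the numerical values in the usage note are hypothetical; `delta_m` stands for the bound on the derivative of the lumped perturbation on the axis considered.

```python
def gain_conditions(gamma_order, k1, k2, delta_m, s_I0):
    """Evaluate the inequalities of (31) for one axis; returns (varsigma, ok)."""
    if not 0.0 < gamma_order < 1.0 or k2 <= delta_m:
        return float("nan"), False
    varsigma = (1.0 + gamma_order) * (k2 + delta_m) / (k2 - delta_m) - 1.0
    if k1 <= 0.0 or varsigma >= 1.0:
        return varsigma, False
    bound_a = (3.0 + gamma_order) / (1.0 - gamma_order) * delta_m
    bound_b = ((k1 / (1.0 - varsigma)) ** (gamma_order / 0.5)
               * abs(s_I0) ** (1.0 - gamma_order) + delta_m)
    return varsigma, k2 > max(bound_a, bound_b)
```

For example, `gain_conditions(0.5, 0.1, 10.0, 1.0, 0.5)` reports the conditions as satisfied, whereas lowering `k2` to 6.0 violates them. For k2 larger than the perturbation bound, the first lower bound on k2 is algebraically equivalent to requiring ς < 1, so those two conditions always agree.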

Proof of Theorem 1.

For a detailed proof, readers are referred to [40]. Consider an open interval

(t_{0}, t_{1})

and assume

s_{i} (t) = s_{I i} (t) = 0

, then

{\dot{ϕ}}_{1} (t) \leq {\dot{s}}_{i} (t) \leq {\dot{ϕ}}_{2} (t)

,

ϕ_{1} (t) \leq s_{i} (t) \leq ϕ_{2} (t)

.

ϕ_{1} (t)

, and

ϕ_{2} (t)

are given by

\begin{array}{l} ϕ_{1} (t) = s_{I i} (t_{0}) (t - t_{0}) - λ_{M} ϕ_{2 M}^{0.5} (t - t_{0}) - \frac{k_{2 i} + δ_{i m}}{Γ (2 + γ)} {(t - t_{0})}^{1 + γ} \\ ϕ_{2} (t) = s_{I i} (t_{0}) (t - t_{0}) - \frac{k_{2 i} - δ_{i m}}{Γ (2 + γ)} {(t - t_{0})}^{1 + γ} \end{array},

(32)

where

ϕ_{2 M} = \sup ϕ_{2} (t)

.

To ascertain an upper limit for time

t_{1}

, for which

s_{i} (t_{1}) = 0

, consider

t = t_{ϕ_{2}}

as the time at

ϕ_{2} (t_{ϕ_{2}}) = 0

since

t_{1} \leq t_{ϕ_{2}}

, which leads to

{(t_{1} - t_{0})}^{γ} \leq {(t_{ϕ_{2}} - t_{0})}^{γ} = \frac{Γ (2 + γ)}{k_{2 i} - δ_{i m}} s_{I i} (t_{0}),

(33)

Additionally, in the scenario where

ϕ_{2} ({t^{'}}_{ϕ_{2}}) = ϕ_{2 M}

, with

{t^{'}}_{ϕ_{2}}

marking the instance when

{\dot{ϕ}}_{2} ({t^{'}}_{ϕ_{2}}) = 0

, the following holds

{({t^{'}}_{ϕ_{2}} - t_{0})}^{γ} = \frac{Γ (1 + γ)}{k_{2 i} - δ_{i m}} s_{I i} (t_{0}),

(34)

One obtains

ϕ_{2 M}^{0.5} = \frac{{[s_{I i} (t_{0})]}^{0.5 (1 + 1 / γ)}}{{(1 + 1 / γ)}^{0.5}} {[\frac{Γ (1 + γ)}{k_{2 i} - δ_{i m}}]}^{0.5 / γ},

(35)
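The closed form (35) can be cross-checked numerically by maximizing the upper comparison function of (32) on a dense grid. The values of γ, of the gain margin, and of the initial value of the integral state below are hypothetical and chosen only for illustration.

```python
import numpy as np
from math import gamma

g, k2_minus_delta, sI0 = 0.5, 9.0, 2.0       # hypothetical values
tau = np.linspace(0.0, 0.2, 200001)          # tau = t - t0
phi2 = sI0 * tau - k2_minus_delta / gamma(2.0 + g) * tau ** (1.0 + g)

phi2_max_numeric = phi2.max()
# Square of the right-hand side of Equation (35)
phi2_max_closed = (sI0 ** (1.0 + 1.0 / g) / (1.0 + 1.0 / g)
                   * (gamma(1.0 + g) / k2_minus_delta) ** (1.0 / g))
```

The two values agree to within the grid resolution, which supports the use of Γ(1 + γ) in (34) and (35).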

t_{1} \leq t_{ϕ_{2}}

and the monotonically decreasing nature of

{\dot{ϕ}}_{1} (t)

ensure that

{\dot{ϕ}}_{1} (t_{ϕ_{2}}) \leq s_{I i} (t_{1})

, where

{\dot{ϕ}}_{1} (t_{ϕ_{2}}) = - μ_{s} s_{I i} (t_{0})

. Therefore,

- μ_{s} s_{I i} (t_{0}) \leq s_{I i} (t_{1}) < 0

holds, equivalently

|s_{I i} (t_{1})| \leq μ_{s} |s_{I i} (t_{0})|,

(36)

Next, with the assumption

|s_{I i} (t_{m})| \leq μ_{s}^{m} |s_{I i} (t_{0})|

being valid for the first

m \in \{1, \dots, n\}

and solving in

(t_{n}, t_{n + 1})

, it follows that

\begin{matrix} \frac{|s_{I i} (t_{n + 1})|}{|s_{I i} (t_{n})|} & \leq ς + \frac{k_{1 i} {|s_{I i} (t_{n})|}^{0.5 (1 + 1 / γ) - 1}}{{(1 + 1 / γ)}^{0.5}} {[\frac{Γ (1 + γ)}{k_{2 i} - δ_{i m}}]}^{0.5 / γ} \\ \leq ς + \frac{k_{1 i} {|s_{I i} (t_{0})|}^{0.5 (1 + 1 / γ) - 1}}{{(1 + 1 / γ)}^{0.5}} {[\frac{Γ (1 + γ)}{k_{2 i} - δ_{i m}}]}^{0.5 / γ} \\ = μ_{s} \end{matrix},

(37)

From Equation (37), one has

|s_{I i} (t_{n + 1})| \leq μ_{s}^{n + 1} |s_{I i} (t_{0})|

, which leads to

\lim_{n \to \infty} s_{I i} (t_{n}) = 0

. Considering

{(t_{n + 1} - t_{n})}^{γ} = μ_{s}^{n} \frac{Γ (2 + γ)}{k_{2 i} - δ_{i m}} s_{I i} (t_{0})

, an estimation for the time to convergence is derived as

\begin{array}{rl} t_{f} & = t_{0} + \sum_{n = 0}^{\infty} (t_{n + 1} - t_{n}) \\ & \leq t_{0} + \sum_{n = 0}^{\infty} {[μ_{s}^{n} \frac{Γ (2 + γ)}{k_{2 i} - δ_{i m}} |s_{I i} (t_{0})|]}^{1 / γ} \\ & = t_{0} + \frac{1}{1 - μ_{s}^{1 / γ}} {[\frac{Γ (2 + γ)}{k_{2 i} - δ_{i m}} |s_{I i} (t_{0})|]}^{1 / γ} \end{array},

(38)
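The closed-form bound on the convergence time rests on a geometric series. A short derivation, assuming 0 < μ_s < 1 and writing T_0 for the bracketed constant (a symbol introduced here only for brevity):

```latex
\sum_{n=0}^{\infty}\left[\mu_s^{n}\,T_0\right]^{1/\gamma}
  = T_0^{1/\gamma}\sum_{n=0}^{\infty}\left(\mu_s^{1/\gamma}\right)^{n}
  = \frac{T_0^{1/\gamma}}{1-\mu_s^{1/\gamma}},
\qquad
T_0 = \frac{\Gamma(2+\gamma)}{k_{2i}-\delta_{im}}\,\lvert s_{Ii}(t_0)\rvert .
```

Because μ_s < 1 and 1/γ > 1, the ratio of the series is strictly less than one, so the sum is finite and the reaching time is bounded.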

Consequently, at

t = t_{f}

,

s_{I i} (t) = 0

holds and persists for every instant after

t_{f}

. Moreover, assuming the existence of a time

t^{'} > t_{f}

where

s_{i} (t^{'}) \neq 0

leads to a contradiction, which confirms that

s_{i} (t) = s_{I i} (t) = 0 \forall t \geq t_{f}

.□

5. Simulations

In this study, we present a novel ACNNFOSMC aimed at enhancing the performance of spacecraft systems under disturbances, uncertainties, and faults. To validate the efficacy and superiority of our proposed method, comprehensive simulations were conducted, with fractional-order PID (FOPID) and smooth super-twisting SMC (SSTSMC) [20] selected as benchmarks for comparison. For the selection of system parameters, we mainly referred to [20]. The main parameter of the system is inertia. The nominal inertia of the spacecraft was

J_{0} = diag \{21, 18, 15\}

kg·m²; inertia uncertainties were set as

Δ J = [2 \sin (0.1 t), 1.3, 0.8; 1.0, 2 \sin (0.2 t), 1.5; 0.9, 1.5, \sin (0.3 t)]

kg·m². Within these simulations, the disturbances were set as

d (t) = [0.8 \sin (0.1 t), 0.4 \sin (0.2 t), 0.3 \sin (0.5 t)]

N·m, fault signals were set as

{\bar{u}}_{i} = \{\begin{array}{l} 0, t \leq 25 s \\ 0.2 + 0.1 \sin (5 t), t > 25 s \end{array}

and

E_{i} = \{\begin{array}{l} 0, t \leq 20 s \\ 0.2, t > 20 s \end{array}

, the desired signals were given by

q_{d} (0) = {[1, 0, 0, 0]}^{T}

and

ω_{d} = 0.06 \times {[\sin (π t / 80), \cos (2 π t / 80), \sin (3 π t / 80)]}^{T}

, and the initial states were set as

q = {[0.8832, 0.3, - 0.2, - 0.3]}^{T}

and

ω = {[0.1, - 0.04, 0.03]}^{T}

. The control gains of ACNNFOSMC were set as

ι_{1} = ι_{2} = 0.1

,

μ_{c j} = - 5 : 0.5 : 5

,

b_{c j} = 0.1

,

δ_{c} = 2

,

μ_{a j} = - 5 : 0.2 : 5

,

b_{a j} = 1

,

δ_{a} = 4

,

k_{1} = 260

, and

k_{2} = 10

.
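The simulation scenario described above can be collected into a small configuration sketch. The function and variable names are hypothetical; the numerical values are those listed in the text.

```python
import numpy as np

J0 = np.diag([21.0, 18.0, 15.0])                       # nominal inertia, kg·m^2

def delta_J(t):
    """Time-varying inertia uncertainty, kg·m^2."""
    return np.array([[2.0 * np.sin(0.1 * t), 1.3, 0.8],
                     [1.0, 2.0 * np.sin(0.2 * t), 1.5],
                     [0.9, 1.5, np.sin(0.3 * t)]])

def disturbance(t):
    """External disturbance torque, N·m."""
    return np.array([0.8 * np.sin(0.1 * t), 0.4 * np.sin(0.2 * t), 0.3 * np.sin(0.5 * t)])

def fault_bias(t):
    """Bounded fault signal, identical on the three axes; active after 25 s."""
    return np.full(3, 0.2 + 0.1 * np.sin(5.0 * t)) if t > 25.0 else np.zeros(3)

def failure_matrix(t):
    """Failure indicator matrix E; active after 20 s."""
    return 0.2 * np.eye(3) if t > 20.0 else np.zeros((3, 3))

def omega_d(t):
    """Desired angular velocity, rad/s."""
    return 0.06 * np.array([np.sin(np.pi * t / 80.0),
                            np.cos(2.0 * np.pi * t / 80.0),
                            np.sin(3.0 * np.pi * t / 80.0)])

q_init = np.array([0.8832, 0.3, -0.2, -0.3])           # initial quaternion [q0, q1, q2, q3]
w_init = np.array([0.1, -0.04, 0.03])                  # initial angular velocity, rad/s
```

The initial quaternion has unit norm to within rounding of the listed digits, and both fault terms vanish during the first 20 s.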

The simulation results, depicted in Figure 2, Figure 3 and Figure 4, illustrate the attitude and angular velocity tracking performances of the three control strategies. In Figure 2, Figure 3 and Figure 4, the spacecraft under all three controllers can quickly converge to a stable state in about 15 s. Specifically, the attitude tracking errors for the proposed controller, fractional-order PID, and smooth super-twisting controllers were recorded as

|{\tilde{q}}_{i}| \leq 7.8 \times 10^{- 4}

,

|{\tilde{q}}_{i}| \leq 6.2 \times 10^{- 3}

, and

|{\tilde{q}}_{i}| \leq 4 \times 10^{- 3}

, respectively. Correspondingly, the angular velocity tracking errors stood at

|{\tilde{ω}}_{i}| \leq 1 \times 10^{- 3}

rad/s,

|{\tilde{ω}}_{i}| \leq 3 \times 10^{- 3}

rad/s, and

|{\tilde{ω}}_{i}| \leq 9 \times 10^{- 3}

rad/s. These results indicate that the proposed controller achieves higher tracking precision than the two benchmark controllers in both attitude and angular velocity.

Figure 5 shows the control inputs. All three controllers keep the control inputs within the predefined bounds of ±2 N·m, and the fractional-order PID controller produces the smoothest control signal of the three, which matters in practice because a smooth input reduces actuator wear. A distinctive feature of the proposed method is the online adaptation of the actor-critic neural network weights, illustrated in Figure 6: the weights adjust during operation to improve control performance, which contributes to the robustness and adaptability of ACNNFOSMC. Overall, the proposed ACNNFOSMC achieves high tracking accuracy and robustness against the operational challenges considered.

To further validate the adaptive capability and strong robustness of the proposed control method, the system inertia uncertainty was set to

Δ J = [8 \sin (0.1 t), 2.3, 1.8; 1.0, 6 \sin (0.2 t), 2.5; 2.9, 1.5, 8 \sin (0.3 t)]

, and the disturbance was configured as

d (t) = [1.2 \sin (0.2 t), 0.8 \sin (0.3 t), 0.7 \sin (0.5 t)]

. Figure 7 illustrates the corresponding attitude and angular velocity tracking errors of the system under the proposed method and classical sliding mode control, while Figure 8 presents the control inputs.

Comparing Figure 2 and Figure 7, it is evident that while increases in uncertainties and disturbances lead to a longer time for the system to reach the steady state, under the action of the proposed controller, the system still exhibits minimal attitude tracking errors and angular velocity tracking errors. This demonstrates that the proposed controller possesses strong adaptive capabilities and robustness against disturbances. As shown in Figure 8, compared to sliding mode control, the proposed method effectively suppresses chattering, further highlighting its advantages.

6. Conclusions

This study addressed attitude control of rigid spacecraft subject to model uncertainties, external disturbances, and actuator faults. The proposed strategy combines an actor-critic neural network with fractional-order super-twisting sliding mode control. The actor-critic neural network estimates the combined influence of uncertainties, external disturbances, and actuator faults in real time so that it can be compensated, while the fractional-order control component improves the system's responsiveness, tracking accuracy, and overall robustness. The stability analysis substantiates the theoretical foundations of ACNNFOSMC and confirms that the system remains stable under varying operational dynamics. The simulation studies support these results, showing improved control accuracy and resilience compared with the other control methods considered. Given these results, ACNNFOSMC is a promising candidate for future spacecraft control systems operating in complex space environments with uncertainties and hazards.

The proposed control strategy assumes that the lumped uncertainties are continuous and bounded. Its current limitation is that tuning the controller parameters requires knowledge of the upper bound of these uncertainties, which may be difficult to obtain in some real-world scenarios. Future work will adopt adaptive mechanisms that adjust the controller parameters without relying on predefined uncertainty bounds. Such an approach would broaden the applicability of the control scheme to environments where the uncertainty characteristics are poorly known or change over time.

Author Contributions

Methodology, C.J. and X.M.; software, X.M., K.Z. and B.Y.; validation, C.J., K.Z. and B.Y.; formal analysis, C.J., K.Z. and Y.H.; investigation, C.J., X.M., K.Z. and Y.W.; data curation, C.J., X.M. and K.Z.; writing—original draft preparation, C.J., X.M., K.Z. and Y.W.; writing—review and editing, C.J., B.Y. and Y.H.; funding acquisition, C.J. and Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the Foundation of Henan University of Technology (No. 2021BS071), the Foundation of the Science and Technology Department of Zhengzhou (No. 22ZZRDZX17), and the Key Scientific and Technological Research Projects in Henan Province (No. 242102240039 and No. 232102220085).

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Zhang, S.; Zhou, Y.; Cai, S. Fractional-Order PD Attitude Control for a Type of Spacecraft with Flexible Appendages. Fractal Fract. 2022, 6, 601.
Hasan, M.N.; Haris, M.; Qin, S. Fault-tolerant spacecraft attitude control: A critical assessment. Prog. Aerosp. Sci. 2022, 130, 100806.
Jin, T.; Kang, G.; Cai, J.; Jia, S.; Yang, J.; Zhang, X.; Liu, F. Disturbance Attenuation and Pointing Control System Design for an Improved Disturbance-Free Payload Spacecraft. Aerospace 2023, 10, 530.
Golestani, M.; Zhang, W.; Yang, Y.; Xuan-Mung, N. Disturbance observer-based constrained attitude control for flexible spacecraft. IEEE T. Aero. Elec. Sys. 2022, 59, 963–972.
Jing, C.; Du, H.; Liu, Y. Immersion and invariance based adaptive robust control for attitude tracking of spacecraft with input saturation. Adv. Space Res. 2023, 72, 3606–3618.
Guo, Y.; Zhou, J.; Liu, Y. Distributed RISE control for spacecraft formation reconfiguration with collision avoidance. J. Franklin I. 2019, 356, 5332–5352.
Tiwari, M.; Prazenica, R.; Henderson, T. Direct adaptive control of spacecraft near asteroids. Acta Astronaut. 2023, 202, 197–213.
Nadafi, R.; Kabganian, M. Robust backstepping attitude tracking control of an underactuated spacecraft with saturation and time-variant perturbations. P. I. Mech. Eng. G-J Aer. 2022, 236, 502–516.
Yao, Q. Robust adaptive iterative learning control for high-precision attitude tracking of spacecraft. J. Aerospace Eng. 2021, 34, 04020108.
Wang, Z.; Li, Y. Rigid spacecraft nonlinear robust H∞ attitude controller design under actuator misalignments. Nonlinear Dynam. 2023, 111, 15037–15054.
Pukdeboon, C.; Siricharuanun, P. Nonsingular terminal sliding mode-based finite-time control for spacecraft attitude tracking. Int. J. Control Autom. 2014, 12, 530–540.
Moradi, R.; Alikhani, A.; Fathi Jegarkandi, M. Spacecraft attitude fault tolerant control based on multi-objective optimization. J. Theor. App. Mech-Pol. 2020, 58, 983–996.
Hajnorouzali, Y.; Malekzadeh, M.; Ataei, M. Finite-time disturbance observer based-control of flexible spacecraft. J. Vib. Control 2023, 29, 346–361.
Esmaeilzadeh, S.M.; Golestani, M.; Fekih, A. Adaptive attitude stabilization of flexible spacecraft with fast fixed-time convergence. IJST-T. Mech. Eng. 2022, 46, 195–208.
Han, Z.; Wang, M.; Yan, X.; Qian, H. Adaptive fixed-time nonsingular terminal sliding mode attitude tracking control for spacecraft with actuator saturations and faults. Int. J. Aerospace Eng. 2021, 1, 8838784.
Eshghi, S.; Varatharajoo, R. Nonsingular terminal sliding mode control technique for attitude tracking problem of a small satellite with combined energy and attitude control system (CEACS). Aerosp. Sci. Technol. 2018, 76, 14–26.
Xuan-Mung, N.; Golestani, M. Energy-efficient disturbance observer-based attitude tracking control with fixed-time convergence for spacecraft. IEEE T. Aero. Elec. Sys. 2022, 59, 3659–3668.
Esmaeilzadeh, S.M.; Golestani, M.; Mobayen, S. Chattering-free fault-tolerant attitude control with fast fixed-time convergence for flexible spacecraft. Int. J. Control Autom. 2021, 19, 767–776.
Xuan-Mung, N.; Golestani, M.; Hong, S.K. Constrained nonsingular terminal sliding mode attitude control for spacecraft: A funnel control approach. Mathematics 2023, 11, 247.
Pukdeboon, C. Finite-Time Second-Order Sliding Mode Controllers for Spacecraft Attitude Tracking. Math. Probl. Eng. 2013, 930269.
Lu, K.F.; Xia, Y.Q. Finite-time attitude control for rigid spacecraft-based on adaptive super-twisting algorithm. IET Control Theory A. 2014, 8, 1465–1477.
Ahmed, S.; Azar, A.T.; Tounsi, M.; Ibraheem, I.K. Adaptive Control Design for Euler–Lagrange Systems Using Fixed-Time Fractional Integral Sliding Mode Scheme. Fractal Fract. 2023, 7, 712.
Yu, Y.; Liu, X. Model-free fractional-order sliding mode control of electric drive system based on nonlinear disturbance observer. Fractal Fract. 2022, 6, 603.
Alipour, M.; Malekzadeh, M.; Ariaei, A. Practical fractional-order nonsingular terminal sliding mode control of spacecraft. ISA T. 2022, 128, 162–173.
Zhang, X.; Gao, Z.; Qian, M.; Bai, L. Integrated fault estimation and fault-tolerant control for rigid spacecraft attitude system with multiple actuator faults. Int. J. Innov. Comput. I. 2019, 15, 1255–1270.
Qian, M.; Shi, Y.; Gao, Z.; Zhang, X. Integrated fault tolerant tracking control for rigid spacecraft using fractional order sliding mode technique. J. Franklin I. 2020, 357, 10557–10583.
Fang, Y.; Li, S.; Fei, J. Adaptive Intelligent High-Order Sliding Mode Fractional Order Control for Harmonic Suppression. Fractal Fract. 2022, 6, 482.
Cao, S.; Sun, L.; Jiang, J.; Zuo, Z. Reinforcement learning-based fixed-time trajectory tracking control for uncertain robotic manipulators with input saturation. IEEE T. Neur. Net. Lear. 2021, 34, 4584–4595.
Chen, R.Z.; Li, Y.X.; Ahn, C.K. Reinforcement-learning-based fixed-time attitude consensus control for multiple spacecraft systems with model uncertainties. Aerosp. Sci. Technol. 2023, 132, 108060.
Wang, X.; Shi, P.; Wen, C.; Zhao, Y. Design of parameter-self-tuning controller based on reinforcement learning for tracking noncooperative targets in space. IEEE T. Aero. Elec. Sys. 2020, 56, 4192–4208.
Muduli, R.; Jena, D.; Moger, T. Application of Reinforcement Learning-Based Adaptive PID Controller for Automatic Generation Control of Multi-Area Power System. IEEE T. Autom. Sci. Eng. 2024, 1–12.
Nohooji, H.R.; Zaraki, A.; Voos, H. Actor–critic learning based PID control for robotic manipulators. Appl. Soft Comput. 2024, 151, 111153.
Ouyang, Y.; Dong, L.; Wei, Y.; Sun, C. Neural network-based tracking control for an elastic joint robot with input constraint via actor-critic design. Neurocomputing 2020, 409, 286–295.
Patel, B.M.; Dwivedy, S.K. Manoeuvring of underwater snake robot with tail thrust using the actor-critic neural network super-twisting sliding mode control in the uncertain environment and disturbances. Neural. Comput. Appl. 2023, 1–15.
Wang, N.; Gao, Y.; Zhao, H.; Ahn, C.K. Reinforcement learning-based optimal tracking control of an unknown unmanned surface vehicle. IEEE T. Neur. Net. Lear. 2020, 32, 3034–3045.
Zheng, M.; Wu, Y.; Li, C. Reinforcement learning strategy for spacecraft attitude hyperagile tracking control with uncertainties. Aerosp. Sci. Technol. 2021, 119, 107126.
Cao, J.; Zhang, Y.; Ju, C.; Xue, X.; Zhang, J. A New Force Control Method by Combining Traditional PID Control with Radial Basis Function Neural Network for a Spacecraft Low-Gravity Simulation System. Aerospace 2023, 10, 520.
Yu, B.; Du, H.; Ding, L.; Wu, D.; Li, H. Neural network-based robust finite-time attitude stabilization for rigid spacecraft under angular velocity constraint. Neural. Comput. Appl. 2022, 34, 5107–5117.
Jing, C.; Du, H.; Liu, Y.; Yan, B.; Liu, C. Immersion and invariance sliding mode control with time-delay estimation for rigid spacecraft with uncertainties and actuator faults. T. I. Meas. Control 2023, 45, 3138–3146.
Muñoz-Vázquez, A.J.; Sánchez-Torres, J.D.; Parra-Vega, V.; Sánchez-Orta, A.; Martínez-Reyes, F. A fractional super-twisting control of electrically driven mechanical systems. T. I. Meas. Control 2020, 42, 485–492.

Figure 1. The framework of the proposed control scheme.

Figure 2. The tracking performance of spacecraft with ACNNFOSMC.

Figure 3. The tracking performance of spacecraft with FOPID.

Figure 4. The tracking performance of spacecraft with SSTSMC.

Figure 5. Comparison of control inputs for (a) ACNNFOSMC; (b) FOPID; (c) SSTSMC.

Figure 6. The adaptive tuning of the actor-critic NN’s weights.

Figure 7. Comparison of tracking performance: (a) attitude tracking errors of ACNNFOSMC; (b) angular velocity errors of ACNNFOSMC; (c) attitude tracking errors of SMC; (d) angular velocity errors of SMC.

Figure 8. Comparison of control inputs for (a) ACNNFOSMC; (b) SMC.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

