Next Article in Journal
Investigating the Effects of Center of Gravity (CoG) Shift Due to a Simulated Exploration Extravehicular Mobility Unit (xEMU) Suit on Balance
Previous Article in Journal
Research on Multi-Objective Flexible Job Shop Scheduling Problem with Setup and Handling Based on an Improved Shuffled Frog Leaping Algorithm
Previous Article in Special Issue
Dynamic Modeling and Altitude Control for Flying Cars Based on Active Disturbance Rejection Control
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification and Control of Flexible Joint Robots Based on a Composite-Learning Optimal Bounded Ellipsoid Algorithm and Prescribe Performance Control Technique

1
School of Automation, Beijing Institute of Technology, Beijing 100811, China
2
China North Artificial Intelligence & Innovation Research Institute, Beijing 100072, China
3
School of Mechanical Engineering, Shandong University, Jinan 250061, China
*
Authors to whom correspondence should be addressed.
Appl. Sci. 2024, 14(10), 4030; https://doi.org/10.3390/app14104030
Submission received: 29 February 2024 / Revised: 13 April 2024 / Accepted: 7 May 2024 / Published: 9 May 2024
(This article belongs to the Special Issue Research and Development of Intelligent Robot)

Abstract

:
This paper presents an indirect adaptive neural network (NN) control algorithm tailored for flexible joint robots (FJRs), aimed at achieving desired transient and steady-state performance. To simplify the controller design process, the original higher-order system is decomposed into two lower-order subsystems using the singular perturbation technique (SPT). NNs are then employed to reconstruct the aggregated uncertainties. An adaptive prescribed performance control (PPC) strategy and a continuous terminal sliding mode control strategy are introduced for the reduced slow subsystem and fast subsystem, respectively, to guarantee a specified convergence speed and steady-state accuracy for the closed-loop system. Additionally, a composite-learning optimal bounded ellipsoid algorithm (OBE)-based identification scheme is proposed to update the NN weights, where the tracking errors of the reduced slow and fast subsystems are integrated into the learning algorithm to enhance the identification and tracking performance. The stability of the closed-loop system is rigorously established using the Lyapunov approach. Simulations demonstrate the effectiveness of the proposed identification and control schemes.

1. Introduction

In today’s complex work environment, where human–robot interaction has become increasingly frequent, a diverse range of robots has emerged to cater to distinct implementation scenarios. Among these robots, FJRs have garnered significant attention due to their inherent compliance, which enhances the safety of human interactions [1]. Typically, FJRs comprise motors with load links, connected via low-stiffness flexible couplings. This joint elasticity allows FJRs to be modeled as a two-mass system. To effectively control these systems, numerous control methods have been proposed in the academic literature. For instance, Zhu et al. [2] combined command filtering with adaptive fuzzy control methods to control FJRs with time-varying full-state constraints. Xu et al. [3] elaborated on how to integrate event-triggered control with adaptive neural networks to design a control strategy, aiming to achieve precise tracking control of FJRs. Jerónimo et al. [4] combined a PID-type control strategy with motor position measurement information to achieve precise control of FJRs.
In practical scenarios, it is crucial to confine the tracking error within a predefined range. To this end, PPC has garnered significant attention from researchers [5,6,7]. Ma, for instance, introduced an adaptive fuzzy control approach for single-link flexible-joint robotic manipulators, leveraging PPC to ensure transient performance guarantees [8]. Furthermore, FJRs can achieve desired impedance against external contact forces and superior position tracking in free motion, thanks to robust prescribed performance position tracking controllers [9]. However, it is worth noting that the aforementioned control methods often rely on backstepping or dynamic surface control techniques, which are renowned for their intricate and repetitive design procedures. Consequently, their application in real high-order FJRs remains limited.
In FJRs, the position and velocity of links exhibit slower time-scale variations, whereas the position and velocity of motors undergo faster dynamics. This characteristic allows the overall system dynamic equations to be recast into a standard singularly perturbed form, facilitating the utilization of SPT to simplify controller design. Kim et al. [10], for instance, applied SPT to decompose the higher-order model of a serial elastic actuator into two lower-order models. Subsequently, they developed a second-order sliding mode control law for each lower-order model, ensuring semi-global exponential stability. Another study by Sun et al. [11] introduced a nonlinear hybrid control scheme for a redundant parallel robot system, leveraging SPT and Tikhonov’s theorem to achieve both trajectory tracking and vibration suppression. Chen et al. proposed a prescribed performance controller for FJRs with exogenous disturbances, incorporating SPT and a time-varying tangent barrier Lyapunov function to expand the initial error constraint boundary [12]. However, these methods fail to identify and compensate for unknown dynamics in FJRs, resulting in relatively large control errors.
To address the challenges posed by unknown nonlinear dynamics in practical systems, NNs have been extensively employed due to their remarkable universal approximation capabilities [13,14,15,16,17]. In the development of NN-based adaptive control methods, the algorithm for identifying NN weights plays a crucial role, as it profoundly affects both the tracking performance and the robustness of the closed-loop system [18]. NN-based adaptive control methods can be categorized into two main branches: direct adaptive control methods [19,20,21] and indirect adaptive control methods [22]. Direct adaptive control primarily relies on gradient-descent-based updating laws for online identification of unknown controller parameters. However, these methods may not yield satisfactory identification results [23]. Consequently, indirect adaptive control has garnered significant attention since it offers more advanced identification algorithms. A traditional approach in this domain is the identification algorithm based on the Kalman filter, which characterizes uncertainty through a noise signal with a predefined probability density function [24]. However, the limited capability of this algorithm in handling nonlinear systems restricts its utilization in specific application scenarios [25].
The set-membership identification approach ensures that the parameters belong to specific sets, offering increased flexibility in addressing intricate problems. Among the various set-membership identification methods, the OBE algorithm stands out as the most prominent. The OBE algorithm has significant importance in the realms of system identification and adaptive control, serving as an effective and precise tool for the estimation of unknown system parameters [26]. This iterative technique incrementally shrinks the volume of an ellipsoid to yield the optimal parameter estimate. Notably, ellipsoids are employed in this context to demarcate predefined sets [27,28]. The learning gain matrix in the OBE algorithm exhibits dynamic characteristics, with an upper bound that prevents explosions in scenarios of inadequate excitation and a lower bound that guarantees robust identification performance for time-varying parameters. The OBE algorithm has been studied extensively in the literature. For example, Davila et al. proposed a novel dynamic sliding mode control design scheme by combining sliding mode control with the attracting ellipsoid method, enabling the system state to rapidly enter and maintain the sliding surface during the dynamic process [29]. Cao et al. introduced a directional forgetting algorithm that discards outdated data based on the informational content across various directions, above and below the bounded information matrix [30]. Ordaz et al. developed an adaptive state estimator and a robust output control for a class of uncertain nonlinear systems, which enables the system to maintain stability and performance in the presence of uncertainty and disturbances, and the feedback gains K, L can be updated online based on the attractive ellipsoid method [31]. Guo et al. proposed an efficient OBE identification algorithm that ensures deterministic upper and lower bounds for the learning gain matrix [13,32]. Applications of OBE-based algorithms for the identification of singularly perturbed systems have been detailed in [33,34]. However, it is crucial to note that these papers rely solely on identification errors for updating NN weights. As highlighted in [22], incorporating both identification and tracking errors in weight updating laws enhances NN training outcomes. Therefore, it is advisable to incorporate the tracking error into a composite learning algorithm to further bolster identification performance.
To tackle the aforementioned challenges, this paper introduces a novel identification and control algorithm tailored for the FJR. The key contributions of this work are outlined below:
  • An adaptive prescribed performance controller for the FJR is formulated, leveraging the SPT framework and continuous terminal sliding mode control technique. This simplifies the controller design process and ensures that the transient and steady-state behavior meets the pre-established specifications.
  • A composite-learning OBE-based identification algorithm is introduced, aiming to boost NN training performance by incorporating both identification and tracking errors.
The subsequent sections of this paper are organized as follows: Section 2 presents preliminary information and demonstrates how the original high-order FJRs can be decoupled into two lower-order subsystems using SPT. Section 3 introduces an NN-based terminal sliding mode PPC for the simplified subsystems. The identification algorithm is presented in Section 4. Simulation results are presented in Section 5 to demonstrate the efficacy of the proposed control method, and Section 6 concludes the paper with final remarks.

2. Preliminaries and Problem Formulation

An FJR can be described by the following dynamic model [35]:
M ( q ) q ¨ + C ( q , q ˙ ) q ˙ + D ( q ˙ ) + G ( q ) + τ d = K ( θ q )
J θ ¨ + K ( θ q ) = u ,
where q ( t ) R n , q ˙ ( t ) R n are the angular position and velocity of the joints, and θ R n and θ ˙ R n represent the angular position and velocity of the motors. M ( q ) R n × n , C ( q , q ˙ ) R n , D ( q ˙ ) R n , G ( q ) R n , u R n represents the inertia matrix, the centripetal/Coriolis matrix, the friction torque vector, the gravity torque vector, and the system input vector, respectively. τ d R n is a bounded aggregate disturbance term that encompasses various sources of uncertainty, including inaccuracies in the model, uncaptured dynamics, and external perturbations. K R n × n is a diagonal matrix representing the stiffness of the spring between the motor and the link, and J R n × n is a diagonal matrix representing the inertia of the motors.
Denote H r ( q , q ˙ ) = C ( q , q ˙ ) q ˙ + D q ˙ + G ( q ) , τ 1 = K ( θ ( t ) q ( t ) ) . Considering the fact that the torque τ 1 generated by the spring changes much faster than the position of the link q, then by defining K = K 0 / ε 2 , where ε is a small parameter, and K 0 R n × n is a positive-definite diagonal matrix, we can use a singular perturbation model to describe the FJR as
M ( q ) q ¨ + H r ( q , q ˙ ) + τ d = τ 1
ε 2 J τ ¨ 1 + K 0 τ 1 = K 0 ( u J q ¨ ) .
Remark 1.
In control systems theory, the singular perturbation model is extensively employed to describe dynamical systems, where certain state derivatives are scaled by a small positive parameter ε. These states scaled by ε are known as fast states, while the remaining states are classified as slow. A decrease in ε typically correlates with accelerated changes in the fast states. For such systems, the singular perturbation technique serves as a valuable tool for order reduction and controller simplification [36]. Although there is no standardized approach for selecting ε, for flexible joint robots, it is customary to choose ε inversely proportional to the square root of joint stiffness [37]. As joint stiffness K increases, ε decreases, resulting in faster torque τ 1 changes. In particular, as K approaches infinity, ε tends to zero, causing instantaneous torque τ 1 changes, effectively transforming the flexible joint robot into a rigid joint robot.
Substituting (2a) into (2b), one obtains
ε 2 J τ ¨ 1 + K 0 τ 1 = K 0 u K 0 J M 1 ( τ 1 τ d H r ) .
Multiplying both sides of (3) with J 1 , one obtains
ε 2 τ ¨ 1 = K 0 J 1 ( u τ 1 ) + M 1 ( H r + τ d τ 1 ) .
Let x 1 : = q , x 2 : = q ˙ , τ 2 : = ε τ ˙ 1 , then, (2) can be written as
x ˙ 1 = x 2
x ˙ 2 = M 1 ( τ 1 H r τ d )
ε τ ˙ 1 = τ 2
ε τ ˙ 2 = K 0 J 1 ( u τ 1 ) + M 1 ( H r + τ d τ 1 ) .
Because τ d is unknown, a three-layer neural network is adopted to approximate M 1 τ d as:
M 1 τ d = W * T Φ ( Z ) + ζ ,
where W * = [ W 1 * , W 2 * , , W n * ] R N × n is a matrix of NN weights, W i * = [ w i * 1 , w i * 2 , , w i * N ] T R N , N Z + is the number of neurons used in the hidden layer, Z = [ z 1 , , z N ] = V x , with x = [ x 1 T , x 2 T ] T being the input of the NNs, and V R N × 2 n the weights between the input and the hidden layer, Φ = [ ϕ 1 , ϕ 2 , , ϕ N ] T R N is a vector of regression functions, ϕ k : R R + with k = 1 , , N is a sigmoid function modeled by ϕ k = 1 / ( 1 + e z k ) , and ζ R n is the approximation error of the NN.
Using (6), the system (5) can be written as
x ˙ 1 = x 2
x ˙ 2 = M 1 ( τ 1 H r ) + W * T Φ + ζ
ε τ ˙ 1 = τ 2
ε τ ˙ 2 = K 0 J 1 ( u τ 1 ) + M 1 ( H r τ 1 ) W * T Φ ζ .
To apply the singular perturbation technique, one can first set ε = 0 in (7d) and neglect the unknown W * T Φ and ζ ; then, τ 1 has a unique isolated root
τ 1 * = h ( x , t ) = ( M 1 + J 1 ) 1 ( J 1 u s + M 1 H r ) = a u s + b H r ,
where u s is the slow control component of u when ε = 0 , and a = ( M 1 + J 1 ) 1 J 1 , b = ( M 1 + J 1 ) 1 M 1 .
Replace τ 1 with τ 1 * in (7), the reduced slow subsystem can be obtained as
x ˙ 1 = x 2
x ˙ 2 = M 1 a u s + ( b I ) H r + W * T Φ + ζ .
Define a new state variable Λ = τ 1 h ( x , t ) ; the fast dynamics in (7) can be written as
ε Λ ˙ = τ 2 ε h ˙ ( x , t )
ε τ ˙ 2 = K 0 J 1 ( u τ 1 ) + M 1 ( H r τ 1 ) W * T Φ ζ .
Define a “stretched” time variable t ε : = t ε . Setting ε = 0 freezes the variables x and t at x 0 and 0, respectively, so that h ( x , t ) is constant at the fast time scale t ε . Then, substituting τ 1 = Λ + h ( x , t ) with h ˙ = 0 into (10) and changing the consequent expression into the fast time scale t ε , one obtains
d Λ d t ε = τ 2
d τ 2 d t ε = K 0 J 1 u + K 0 M 1 H r K 0 ( W * T Φ + ζ ) K 0 ( M 1 + J 1 ) ( Λ + h ( x , t ) ) .
Applying (8) into the above equations and setting ε = 0 , the reduced fast subsystem can be obtained as
d Λ d t ε = τ 2
d τ 2 d t ε = K 0 J 1 u f ( M 1 + J 1 ) Λ ( W * T Φ + ζ ) ,
where u f : = u u s .

3. Continuous Terminal Sliding Mode Control of the Flexible Joint Robotic Manipulator

This paper aims to ensure the angular position of joints, q, accurately tracking a designated reference, q d , while meeting pre-specified transient and steady-state performance criteria. Leveraging the singular perturbation technique (SPT), the high-order singular perturbation model (7) is successfully decomposed into two lower-order subsystems: (9) and (12). According to Tikhonov’s theory [36], controlling these reduced-order subsystems suffices to solve the trajectory tracking problem for the original system, thereby simplifying the controller design process. In the following subsections, we delve into the controller design for both the reduced slow subsystem and the reduced fast subsystem.

3.1. Controller Design of the Reduced Slow Subsystem

Firstly, the position tracking error, denoted as e 1 ( t ) is defined as the difference between the actual position q ( t ) and the desired position q d ( t ) , that is, e 1 ( t ) = q ( t ) q d ( t ) . Based on the given Equation (9), the error dynamics can be obtained as
e ˙ 1 = e 2
e ˙ 2 = M 1 a u s + ( b I ) H r + W * T Φ + ζ q ¨ d .
To attain exceptional transient and steady-state tracking capabilities in robotic systems, it is imperative for the tracking error to remain confined within predefined boundaries, as emphasized in [8,38]
σ ̲ ρ ( t ) < e i 1 < σ ¯ ρ ( t ) , i = 1 , , n ,
where the prescribed performance function ρ ( t ) can be expressed as follows:
ρ ( t ) = ( ρ 0 ρ ) e h t + ρ
with ρ 0 R + and ρ R + being designed constants satisfying the condition 0 < ρ < ρ 0 , while h R + is another designed constant. Additionally, σ ̲ R + , σ ¯ R + represent designed parameters.
To guarantee that the tracking error e i 1 is bounded by the predefined boundaries, the following error transformation is first introduced [39]:
e i 1 ( t ) = ρ S i ( ϵ i ) ,
where ϵ i ( t ) represents the transformed tracking error, and the function S i ( · ) is given by
S i ( ϵ i ) = σ ¯ e ϵ i ( t ) σ ̲ e ϵ i ( t ) e ϵ i ( t ) + e ϵ i ( t ) .
Utilizing the inverse transformation of S i ( · ) , ϵ i ( t ) can be derived from (16) as:
ϵ i ( t ) = S i 1 ( e i 1 ( t ) ρ ( t ) ) = 1 2 ln Ψ i ( t ) + σ ̲ σ ¯ Ψ i ( t ) ,
where S i 1 ( ) is the inverse function of S i ( ϵ i ) , and Ψ i ( t ) = e i 1 ( t ) / ρ ( t ) represents the intermediate variable. Let ϵ = [ ϵ 1 , ϵ 2 , , ϵ n ] T , Ψ = [ Ψ 1 , Ψ 2 , , Ψ n ] T . The derivative of ϵ i can be derived as
ϵ ˙ i = 1 2 z i Ψ ˙ i ,
where
z i = 1 Ψ i + σ ̲ 1 Ψ i σ ¯ , Ψ ˙ i = e i 2 ρ e i 1 ρ ˙ ρ 2 , ρ ˙ = h ( ρ 0 ρ ) e h t .
Subsequently, the derivative of ϵ ˙ i can be computed as:
ϵ ¨ i = 1 2 z ˙ i Ψ ˙ i + 1 2 z i Ψ ¨ i
= 1 2 z ˙ i Ψ ˙ i + 1 2 z i ( e ˙ i 2 ρ + 2 e i 1 ρ ˙ 2 ρ 3 2 e i 2 ρ ˙ ρ 2 e i 1 ρ ¨ ρ 2 )
= 1 2 z ˙ i Ψ ˙ i + 1 2 z i ( e ˙ i 2 ρ + ν i ) ,
where
z ˙ i = 1 ( Ψ i σ ¯ ) 2 1 ( Ψ i + σ ̲ ) 2 Ψ ˙ i , ν i = 2 e i 1 ρ ˙ 2 ρ 3 2 e i 2 ρ ˙ ρ 2 e i 1 ρ ¨ ρ 2 , ρ ¨ = h 2 ( ρ 0 ρ ) e h t .
Denote Ξ = diag { z 1 , z 2 , , z n } , ν = [ ν 1 , ν 2 , , ν n ] T . Then, we can respectively rewrite (19) and (21) as
ϵ ˙ = 1 2 Ξ Ψ ˙
ϵ ¨ = 1 2 Ξ ˙ Ψ ˙ + 1 2 Ξ ( e ˙ 2 ρ + ν ) .
Define a sliding manifold s s as
s s ( t ) = ϵ ˙ ( t ) ϵ ˙ ( 0 ) + 0 t ( λ 1 s sig ( ϵ ) α 1 + λ 2 s sig ( ϵ ˙ ) α 2 ) d τ ,
where λ 1 s = diag { λ 11 s , λ 12 s , , λ 1 n s } , λ 2 s = diag { λ 21 s , λ 22 s , , λ 2 n s } are positive definite diagonal matrices selected by the designer with λ i j s R + , i = 1 , 2 , j = 1 , , n such that the polynomial π 2 + λ 2 j s π + λ 1 j s is Hurwitz, α 1 = 1 α 0 1 + α 0 , α 2 = 1 α 0 , with 0 < α 0 < 1 being a positive real number, sig ( z ) α i : = [ sign ( z 1 ) | z 1 | α i , , sign ( z n ) | z n | α i ] T for a vector z R n , i = 1 , 2 . Using (13) and (23), one has
s ˙ s = λ 1 s sig ( ϵ ) α 1 + λ 2 s sig ( ϵ ˙ ) α 2 + ϵ ¨ = Ξ 2 ρ M 1 a u s + ( b I ) H r + W * T Φ + ζ q ¨ d + ,
where
= 2 ρ Ξ 1 λ 1 s sig ( ϵ ) α 1 + λ 2 s sig ( ϵ ˙ ) α 2 + 1 2 Ξ ˙ Ψ ˙ + Ξ 2 ν .
To guarantee the convergence of the sliding manifold s s , we can design the control command u s for the reduced slow subsystem as
u s = a 1 M ( W ^ T Φ + q ¨ d M 1 ( b I ) H r K 1 s s s K 2 s sig ( s s ) ϱ s K 3 s tanh ( s s ϵ s ) ) ,
where K j s = diag { k j 1 s , k j 2 s , , k j n s } R n × n , j = 1 , 2 , 3 are the designed positive definite feedback gain matrix, ϱ s R + , ϵ s R + are two designed parameters satisfying 0 < ϱ s < 1 , 0 < ϵ s < 1 , and W ^ R N × n is the estimation of W * . Substituting (26) into (25), one has
s ˙ s = g s ( K 1 s s s K 2 s sig ( s s ) ϱ s K 3 s tanh ( s s ϵ s ) + W ˜ T Φ + ζ ) ,
where g s = Ξ 2 ρ , W ˜ = W * W ^ .

3.2. Controller Design of the Reduced Fast Subsystem

Define a sliding manifold s f for the reduced fast subsystem as
s f ( t ε ) = τ 2 ( t ε ) τ 2 ( 0 ) + 0 t ε ( λ 1 f sig ( Λ ) β 1 + λ 2 f sig ( τ 2 ) β 2 ) d τ ε ,
where λ 1 f = diag { λ 11 f , λ 12 f , , λ 1 n f } R n × n , λ 2 f = diag { λ 21 f , λ 22 f , , λ 2 n f } R n × n are positive definite diagonal matrices selected by the designer with λ i j f R + , i = 1 , 2 , j = 1 , , n such that the polynomial π 2 + λ 2 j f π + λ 1 j f is Hurwitz, β 1 = 1 β 0 1 + β 0 , β 2 = 1 β 0 , with 0 < β 0 < 1 being a positive real number. The derivative of s f with respect to t ε is
d s f d t ε = λ 1 f sig ( Λ ) β 1 + λ 2 f sig ( τ 2 ) β 2 + K 0 J 1 u f K 0 ( M 1 + J 1 ) Λ K 0 ( W * T Φ + ζ ) .
To guarantee the convergence of the sliding manifold s f , the control command u f can be designed as
u f = J K 0 1 ( λ 1 f sig ( Λ ) β 1 λ 2 f sig ( τ 2 ) β 2 + K 0 ( M 1 + J 1 ) Λ + K 0 W ^ T Φ K 1 f s f K 2 f sig ( s f ) ϱ f K 3 f tanh ( s f ϵ f ) ) ,
where K j f = diag { k j 1 f , k j 2 f , , k j n f } , j = 1 , 2 , 3 R n × n are the designed positive definite feedback gain matrix, and ϱ f R + , ϵ f R + are positive real numbers satisfying 0 < ϱ f < 1 , 0 < ϵ f < 1 . Substituting (30) into (29), one has
d s f d t ε = K 1 f s f K 2 f s f ϱ f K 3 f tanh ( s f ϵ f ) K 0 ( W ˜ T Φ + ζ ) .
Using the controllers (26) and (30) designed for the reduced slow and fast subsystems, respectively, the overall control signal for the original high-order nonlinear FJR system (5) can be obtained as
u = u s + u f .

4. Training of NN Weights

As the true nominal NN weights W * remain unknown, in this section, we devise a composite-learning OBE-based identification algorithm to estimate W ^ in real time. To lay the groundwork, we present the following definition of an ellipsoid:
Definition 1.
An N-dimensional real ellipsoid set Ω, centered at point z * , is mathematically represented as:
Ω ( z * , ϝ ) : = { z R N : ( z z * ) T ϝ ( z z * ) 1 } ,
where ϝ R N × N denotes a positive-definite symmetric matrix.
To guarantee the boundedness of the control signal and the stability of the closed-loop system, the estimation result W ^ is expected to be bounded by preset compact sets Ω c w i as
Ω c w i : = { W ^ i W ^ i < c w i } .
To facilitate the learning law design, a set of filtered variables, x 2 f , H ¯ f , Φ f , τ ¯ 1 f and ζ f are first defined by using a first-order filter F f ( s ) = 1 k s + 1 ( k > 0 ) as
k x ˙ 2 f + x 2 f = x 2
k H ¯ ˙ f + H ¯ f = M 1 H r
k Φ ˙ f + Φ f = Φ
k τ ¯ ˙ 1 f + τ ¯ 1 f = M 1 τ 1
k ζ ˙ f + ζ f = ζ
Applying the filter F f ( s ) to both sides of (7b) and subsequently rearranging, we obtain
ψ f ( t ) = x ˙ 2 f H ¯ f τ ¯ 1 f = W * T Φ f + ζ f .
Since ζ is bounded, then the filtered optimal NN approximation error ζ f is also bounded; that is, | ζ f i | ζ ¯ f i , where ζ f i is the ith element of ζ f , ζ ¯ f i R + is a constant [40].
Employing the ith column of the matrix W * , denoted W i * , from (35), we derive
ψ f i ( t ) = W i * T Φ f + ζ f i ,
where ψ f i is the ith elements of ψ f , i = 1 to n. Define the identification error as
ψ ˜ f i = ψ f i ψ ^ f i ,
where ψ ^ f i ( t ) = W ^ i T Φ f is the output of the NN. Then, we propose the following composite-learning OBE algorithm to update the NN weights estimation W ^ i online:
W ^ ˙ i = Ψ i ζ ¯ f i 2 b i ϝ i Φ f ψ ˜ f i + γ s g i s ϝ i Φ s i s γ f K 0 ϝ i Φ s i f
ϝ ˙ i = L i ϝ ˜ i j i ( ϝ i Φ f Φ f T ϝ i ϝ i l Φ f Φ f T ϝ i l )
with
ς i = λ i ζ ¯ f i 2 1 + Φ f T ϝ i Φ f
j i = ς i L i ( 1 ς i ) ζ ¯ f i 2 + ς i Φ f T ϝ i Φ f
L i = L i 0 b i b p ( 1 ϝ i u 1 | | ϝ i | | )
ϝ ˜ i = ϝ i ϝ i l
b i = 1 , if | ψ ˜ f i | > ζ ¯ f i & Φ f T ϝ i Φ f > ζ ¯ f i 2 0 , if | ψ ˜ f i | ζ ¯ f i or Φ f T ϝ i Φ f ζ ¯ f i 2
for i = 1 n , let g i s be the ith element of g s , λ i R + be a design parameter satisfying λ i < 1 / ζ ¯ f i 2 , and o i o R + be a forgetting factor satisfying 0 < o i o < 1 . ϝ i l R N × N is a positive definite diagonal matrix, and ϝ i u R + is a positive constant that satisfies ϝ i l < ϝ i u I , where I R N × N is the identity matrix.
The projection operator, denoted by ( ) , is defined as follows:
( ) = , if | | W ^ i | | < c w i or | | W ^ i | | = c w i & W ^ i T 0 W ^ i W ^ i T / | | W ^ i | | 2 , otherwise
and b p ( ) is given by
b p ( ) = 1 , if | | W ^ i | | < c w i or | | W ^ i | | = c w i & W ^ i T 0 0 , otherwise
Define an ellipsoid set Ω E i centered at W i * as
Ω E i ( W i * , ϝ i ) = { W ^ i | W ˜ i T ϝ i 1 W ˜ i 1 } .
Now, we propose the following theorem.
Theorem 1.
Consider the singularly perturbed model of an FJR (5), by using the weights updating law (39) and the NN-based adaptive controller u = u s + u f with u s given in (26) and u f given in (30), it can be guaranteed that: (1) The NN weights W ^ i ( t ) remain inside the ellipsoid set defined in (43); that is, W ^ i ( t ) Ω E i , t 0 . (2) The identification error ψ ˜ f i is bounded. (3) The sliding manifolds s s and s f are bounded, and the tracking error e 1 is bounded by the predefined performance boundaries.
A detailed proof of Theorem 1 is presented in Appendix A.
Remark 2.
In this study, we propose an adaptive PPC tailored for FJRs. Departing from conventional approaches that employ back-stepping or dynamic surface control methodologies [6,7,8,9], our design leverages the SPT to reduce system complexity and simplify the controller synthesis process, thereby significantly enhancing its practical applicability to actual FJR systems. Furthermore, unlike other research efforts [10,11] that solely rely on SMC or PPC to counteract disturbances, this work introduces an online-trained NN to approximate and mitigate unknown perturbations, significantly enhancing control precision. This innovative utilization of SPT, PPC and NNs constitutes the first key contribution of our paper.
Remark 3.
This paper presents a novel composite-learning OBE algorithm for online updating of NN weights. Diverging from conventional OBE methods in the literature [13,32,33,34], which solely utilize identification error, our approach uniquely integrates tracking errors from both the slow and fast subsystems into the weight updating process. This innovative strategy yields substantial enhancements in identification and tracking precision, constituting the second major contribution of this study.

5. Simulation Study

In this section, Matlab simulations of a two-degrees-of-freedom FJR are reported to verify the effectiveness of the identification and control algorithm proposed in this study. The schematic diagram of the ith link of the FJR is depicted in Figure 1. The matrices M ( q ) , C ( q , q ˙ ) , G ( q ) of the FJR model (1) are given by
M ( q ) = M 11 M 12 M 21 M 22 , C ( q , q ˙ ) = C 1 C 2 , G ( q ) = G 1 G 2 , M 11 = m 1 l c 1 2 + I 1 + I 2 + m 2 ( l 1 2 + l c 2 2 + 2 l 1 l c 2 cos q 2 ) , M 12 = m 2 l c 2 2 + I 2 + m 2 l 2 l c 2 cos q 2 M 21 = m 2 l c 2 2 + I 2 + m 2 l 2 l c 2 cos q 2 , M 22 = m 2 l c 2 2 + I 2 C 1 = m 2 l 1 l c 2 sin q 2 q ˙ 2 m 2 l 1 l c 2 sin q 2 ( q ˙ 1 + q ˙ 2 ) , C 2 = m 2 l 1 l c 2 sin q 2 q ˙ 1 G 1 = m 2 g l 1 cos ( q 1 + q 2 ) + ( m 1 + m 2 ) g l 1 cos q 1 , G 2 = m 2 g l 2 cos ( q 1 + q 2 )
and the nominal system parameters used in the simulation are listed in Table 1. For simplicity, no friction is considered in the simulation; thus, D = [ 0 , 0 ] T . To verify the effectiveness of the proposed methods for systems with model uncertainties, the real system parameters m 1 * , m 2 * , I 1 * , I 2 * , l 1 * , l 2 * , l c 1 * , l c 2 * are set to be 1.2 times of their nominal values. The reference signal is q d = [ sin ( 1.6 π t ) , sin ( 1.6 π t + π / 3 ) ] T . The initial values for x 1 and x 2 are x 1 ( 0 ) = 0.8 and x 2 ( 0 ) = 0 . To transform the original system model (1) into a singular perturbation model (2), K 0 = diag { 0.01 , 0.01 } is used. The prescribed performance continuous terminal sliding mode controller based on the singular perturbation technique and the composite-learning optimal bounded ellipsoid algorithm (SOBE) is used to control this system, and the following parameters are chosen for the controller: K 1 s = 5 , K 2 s = 1 , K 3 s = 1 , ϵ s = 0.1 , ϱ s = 0.9 , K 1 f = 250 , K 2 f = 1 , K 3 f = 1 , ϵ f = 0.1 , ϱ f = 0.9 . To compensate the uncertainties, an NN with 30 neurons is employed, and the identification algorithm parameters are chosen as: γ s = 0.02 , γ f = 0.02 , L i 0 = 0.2 , ϝ i l = 20 I , ϝ i u = 200 , λ i = 5 , ζ ¯ f i = 0.001 .
For the purpose of comparison, a modified OBE algorithm (MOBE)-based adaptive controller [13] is also tested in the simulation. The choice of this particular control scheme is motivated by the fact that our identification methodology builds upon the MOBE framework. While MOBE exclusively utilizes the identification error for updating NN weights, our proposed method incorporates both identification and tracking errors in the NN training process, thereby extending the MOBE approach. Additionally, whereas [13] employs a conventional sliding mode controller, our study develops a terminal sliding mode PPC. Consequently, [13] constitutes an apt and informative reference point against which to assess the performance of our novel controller. For the MOBE method, all the shared parameters are the same as those used for SOBE. It should be pointed out that the control method proposed in [13] is only used to control the reduced slow subsystem. The controller for the reduced fast subsystem is the same as the one used in SOBE.
The simulation outcomes are presented in Figure 2, Figure 3, Figure 4, Figure 5, Figure 6, Figure 7, Figure 8, Figure 9, Figure 10, Figure 11, Figure 12, Figure 13, Figure 14, Figure 15, Figure 16 and Figure 17. As is evident from Figure 2 and Figure 3, both the SOBE and MOBE methods enable the system state x 1 to track the designated reference x r . However, the angular position of the robot converges faster to the reference when employing the SOBE controller, resulting in smaller disparities between the system state and the reference signal.
Additionally, Figure 4 and Figure 5 clearly illustrate that the SOBE method yields smaller position tracking errors, which converge to 0 more rapidly. Notably, the predefined performance boundaries are consistently upheld throughout the entire identification and control process. Conversely, the MOBE method exhibits a larger tracking error and a slower convergence rate.
Furthermore, Figure 6 and Figure 7 demonstrate that when using the SOBE method, the angular velocity rapidly approaches its reference signal x ˙ r and maintains close alignment thereafter. In contrast, the MOBE method leads to slower convergence and larger disparities. Similarly, Figure 8 and Figure 9 show that the SOBE method achieves reduced velocity tracking errors, which converge to a small residual set centered around 0 more quickly.
It is worth mentioning that since both the SOBE and MOBE methods employ the same controller for the reduced fast controller, τ 1 tracks h ( x ) well regardless of the method used, as depicted in Figure 10 and Figure 11.
The identification results are presented in Figure 12 and Figure 13, while the identification errors are depicted in Figure 14 and Figure 15. Both figures reveal that the integration of tracking error into the learning law enhances the performance of the composite-learning OBE algorithm. Specifically, the estimation results ψ ^ f i converge to ψ f i more rapidly. Conversely, when the MOBE algorithm is employed, the estimation results take longer to converge to the true values.
It is noteworthy that in Figure 14 and Figure 15, the steady-state NN identification errors associated with the MOBE exhibit smaller magnitudes. This observation can be attributed to the fact that, in the MOBE framework, only the identification error ψ ˜ f i serves as the basis for updating NN weights, with the aim of having ψ ^ f i = W ^ i T Φ f closely approximate ψ f i . By contrast, the SOBE approach introduces tracking errors from the reduced subsystems into the weight updating mechanism. Consequently, the NN not only strives to approximate ψ f i but also compensates for model discrepancies arising from the SPT-induced order reduction, ultimately minimizing tracking errors. Thus, while the discrepancy between W ^ i T Φ f and ψ f i may be larger when employing the SOBE method, the associated tracking errors are concurrently smaller.
The control signals generated by both the SOBE and MOBE methodologies are illustrated in Figure 16 and Figure 17. Evidently, during the transient phase, the amplitude of u f is substantial, facilitating the swift convergence of the fast state τ 1 to its equilibrium value h ( x , t ) . Subsequently, u f diminishes to near-zero levels at steady state. This characteristic response exemplifies the inherent behavior of a singularly perturbed system.
The figures presented in this study demonstrate that the SOBE method, which incorporates tracking errors into the NN weight updating law and utilizes the PPC to constrain error boundaries, significantly outperforms the MOBE method. Specifically, the SOBE method exhibits superior identification and control performance, as compared to the MOBE method, which relies solely on a regular sliding mode control approach and updates NN weights based solely on identification errors.

6. Conclusions

This paper introduces a novel identification and adaptive control algorithm tailored for FJRs, aimed at enhancing NN training efficiency and trajectory tracking accuracy. We begin by applying the SPT method to decompose the intricate high-order system into two easily-managed low-order subsystems. This simplifies controller design by focusing on these reduced-order components instead of the full high-order system. To ensure desired transient and steady-state characteristics, we integrate the PPC technique into the slow subsystem’s controller design. For faster convergence, continuous terminal sliding mode controllers are developed for both slow and fast subsystems. Additionally, we boost identification performance by merging the tracking error with the conventional OBE algorithm, creating a novel composite-learning OBE-based learning rule.
Simulations reveal that using the proposed identification and control algorithm leads to closer tracking of reference states, quicker convergence of tracking errors to zero, and smaller steady-state tracking errors. This improved performance is attributed to the capture of model uncertainties resulting from system order reduction, as tracking errors are also utilized to train NN weights. Furthermore, NN training speed is accelerated.
The immediate next step involves implementing the proposed algorithm on a real multi-link FJR to further validate its practical efficacy. Given that many practical FJR systems experience input saturation nonlinearity due to limited motor drive capacity, investigating SPT-based PPC under input saturation constraints is warranted. Lastly, while this paper establishes the closed-loop system’s boundedness, proving its finite-time convergence would be a valuable future extension.

Author Contributions

Conceptualization, D.Z. and X.R.; methodology, D.Z. and K.G.; software, X.L. and D.Z.; validation, D.Z., K.G. and X.R.; formal analysis, X.L., D.Z. and X.R.; investigation, X.L. and D.Z.; resources, K.G. and X.R.; data curation, X.L. and D.Z.; writing—original draft preparation, X.L.; writing—review and editing, D.Z., K.G. and X.R.; visualization, X.L.; supervision, D.Z.; project administration, X.R.; funding acquisition, D.Z. and K.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported in part by the National Nature Science Foundation under Grant No. 62103051, 52205035, in part by the the National Key R&D Program of China under Grant No. 2023YFB4703900, and in part by the Beijing Institute of Technology Research Fund Program for Young Scholars.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to privacy.

Conflicts of Interest

The authors declare no conflicts of interest.

Notations

The following notations are used in this manuscript:
q, q ˙ angular position and velocity of the joint
θ , θ ˙ angular position and velocity of the motor
M ( q ) the inertia matrix
C ( q , q ˙ ) the centripetal/Coriolis matrix
D ( q ˙ ) the friction torque vector
G ( q ) the gravity torque vector
τ d the aggregate disturbance
u, u s , u f the system input vector, control command of the reduced slow subsystem and control command of the reduced fast subsystem
Kthe stiffness matrix of the spring between the motor and the link
Jthe inertia matrix of the motors
x 1 , x 2 q and q ˙
τ 1 , τ 2 the elastic torque and its derivative with respect to t ε
h ( x , t ) the equilibrium of τ 1
Λ the difference between τ 1 and h ( x , t )
W * , Φ the NN weight matrix and regression functions vector
Nthe number of neurons used in the hidden layer
ζ the NN approximation error
t ε the fast time scale
e 1 , e 2 the position tracking error and the velocity tracking error
ρ ( t ) the prescribed performance function
ϵ i ( t ) the transformed tracking error of e 1 i
s s , s f the slow sliding manifold and the fast sliding manifold
K 1 s , K 2 s , K 3 s the feedback gain matrices of u s
K 1 f , K 2 f , K 3 f the feedback gain matrices of u f
Ω c w i preset compact sets for W ^ i
Ω E i the ellipsoid set of W ˜ i
x 2 f , H ¯ f , Φ f , τ ¯ 1 f and ζ f filtered results of x 2 , M 1 H r , Φ , M 1 τ 1 and ζ
ϝ i the learning gain matrix of the NN weight’s updating law
ψ ^ f i the filtered output of the NN

Appendix A. Proof of Theorem 1

Proof. 
To facilitate the stability analysis, several basic lemmas and definitions are given first.
Lemma A1.
[13] By using the updating law (39b), it can be guaranteed that the learning gain ϝ i ( t ) has deterministic and predeterminable bounds, that is, ϝ i l < ϝ i ( t ) ϝ i u I , t 0 .
Lemma A2.
[41] The following inequality holds for any ϵ 0 > 0 and x 0 R :
0 | x 0 | x 0 tanh ( x 0 ϵ 0 ) κ 0 ϵ 0 ,
where κ 0 is a constant satisfying κ 0 = e ( κ 0 + 1 ) , i.e., κ 0 = 0.2785 .
The proof of Theorem 1 comprises the following steps:
Step 1, the boundedness of the NN weights W ^ i are proved.
Choose a Lyapunov function candidate
V o i = γ s s i s 2 + W ˜ i T ϝ i 1 W ˜ i + ε γ f s i f 2 ,
where s i s and s i f are the ith elements of s s and s f , respectively. The time derivative of V o i is given as
V ˙ o i = 2 γ s s i s s ˙ i s 2 W ˜ i T ϝ i 1 W ^ ˙ i + W ˜ i T ϝ ˙ i 1 W ˜ i + 2 γ f s i f d s i f d t ε .
Noting the result of the projection operation, if W ^ i ( 0 ) Ω c w i , then the parameter learning law ensures W ^ i ( t ) Ω c w i and
W ˜ i T ϝ i 1 W ^ ˙ i W ˜ i T ϝ i 1 ς i ζ ¯ f i 2 b i ϝ i Φ f ψ ˜ f i + γ s g i s ϝ i Φ s i s γ f K 0 ϝ i Φ s i f .
Applying the foregoing inequality to (A3), one obtains
V ˙ o i 2 ς i ζ ¯ f i 2 b i W ˜ i T Φ f ψ ˜ f i 2 γ s g i s ϝ i W ˜ i T ϝ i 1 Φ s i s + 2 γ f K 0 ϝ i W ˜ i T ϝ ˙ i 1 Φ s i f + 2 γ s s i s s ˙ i s + W ˜ i T ϝ ˙ i 1 W ˜ i + 2 γ f s i f d s i f d t ε .
Case 1a: When | ψ ˜ f i | > ζ ¯ f i & Φ f T ϝ i Φ f > ζ ¯ f i 2 hold, and | | W ^ i | | < c w i or | | W ^ i | | = c w i & W ^ i T 0 hold, that is, b i = 1 and b p = 1 in this case. Combining (A5) and (39a), one obtains
V ˙ o i ς i ζ ¯ f i 2 W ˜ i T Φ f ( W ˜ i T Φ f + ζ f i ) ς i ζ ¯ f i 2 ( ψ ˜ f i ζ f i ) ψ ˜ f i 2 γ s g i s ϝ i W ˜ i T ϝ i 1 Φ s i s + 2 γ f K 0 ϝ i W ˜ i T ϝ i 1 Φ s i f L i W ˜ i T ϝ i 1 ϝ ˜ i ϝ i 1 W ˜ i + j i W ˜ i T ϝ i 1 ( ϝ i Φ f Φ f T ϝ i ϝ i l Φ f Φ f T ϝ i l ) ϝ i 1 W ˜ i + 2 γ s s i s g i s k 1 i s s i s k 2 i s sig ( s i s ) ϱ s k 3 i s tanh ( s i s ϵ s ) + W ˜ i T Φ + ζ i + 2 γ f s i f k 1 i f s i f k 2 i f sig ( s i f ) ϱ f k 3 i f tanh ( s i f ϵ f ) K 0 ( W ˜ i T Φ + ζ i ) ( ς i ζ ¯ f i 2 j i ) W ˜ i T Φ f Φ f T W ˜ i ς i ζ ¯ f i 2 ( ψ ˜ f i 2 ζ f i 2 ) + 2 γ s s i s g i s k 1 i s s i s k 2 i s sig ( s i s ) ϱ s k 3 i s tanh ( s i s ϵ s ) + ζ i + 2 γ f s i f k 1 i f s i f k 2 i f sig ( s i f ) ϱ f k 3 i f tanh ( s i f ϵ f ) K 0 ζ i .
Using Lemma A2, the above equation can be further rewritten as:
| | Φ f | | 2 ς i ζ ¯ f i 2 j i | | W ˜ i | | 2 ς i ζ ¯ f i 2 ( ψ ˜ f i 2 ζ f i 2 ) 2 γ s g i s k 1 i s | s i s | 2 2 γ s g i s k 2 i | s i s | 1 + ϱ s 2 γ f k 1 i f | s i f | 2 2 γ f k 2 i f | s i f | 1 + ϱ f ( 2 γ s g i s | ζ i | + 2 γ s g i s k 3 i ) | s i s | ( 2 γ f k 3 i f + 2 K 0 γ f | ζ i | ) | s i f | + 2 γ s g i s k 3 i s κ 0 ϵ s + 2 γ f k 3 i f κ 0 ϵ f | | Φ f | | 2 ς i ζ ¯ f i 2 j i | | W ˜ i | | 2 2 γ s g i s k 1 i s | s i s | 2 2 γ f k 1 i f | s i f | 2 + 2 γ s g i s k 3 i s κ 0 ϵ s + 2 γ f k 3 i f κ 0 ϵ f α v 1 i V o i + β v 1 i ,
where α v 1 i = min { | | Φ f | | 2 ( ς i ζ ¯ f i 2 j i ) , 2 γ s g i s k 1 i s , 2 γ f k 1 i f } , β v 1 i = 2 γ s g i s k 3 i s κ 0 ϵ s + 2 γ f k 3 i f κ 0 ϵ f . From 0 < ς i < 1 and ϕ f T ϝ i ϕ f > ζ ¯ f i 2 , one obtains ( 1 ς i ) ζ ¯ f i 2 + ς i ϕ f T ϝ i ϕ f > ζ ¯ f i 2 . Noting 0 < L i 0 < 1 , it can be obtained ( ς i / ζ ¯ f i 2 ) j i > 0 . It is clear that V ˙ o i < 0 is true as long as V > β v 1 i α v 1 i .
Case 1b: When | ψ ˜ f i | > ζ ¯ f i & Φ f T ϝ i Φ f > ζ ¯ f i 2 hold, and | | W ^ i | | > c w i or | | W ^ i | | = c w i & W ^ i T > 0 hold, that is, b i = 1 and b p = 0 in this case, which implies ϝ ˙ i = 0 . Combining (A5) and (39a), it yields:
V ˙ o i ς i ζ ¯ f i 2 W ˜ i T Φ f ( W ˜ i T Φ f + ζ f i ) ς i ζ ¯ f i 2 ( ψ ˜ f i ζ f i ) ψ ˜ f i 2 γ s g i s ϝ i W ˜ i T ϝ i 1 Φ s i s + 2 γ f K 0 ϝ i W ˜ i T ϝ i 1 Φ s i f + 2 γ s s i s g i s k 1 i s s i s k 2 i s sig ( s i s ) ϱ s k 3 i s tanh ( s i s ϵ s ) + W ˜ i T Φ + ζ i + 2 γ f s i f k 1 i f s i f k 2 i f sig ( s i f ) ϱ f k 3 i f tanh ( s i f ϵ f ) K 0 ( W ˜ i T Φ + ζ i ) | | Φ f | | 2 ς i ζ ¯ f i 2 | | W ˜ i | | 2 ς i ζ ¯ f i 2 ( ψ ˜ f i 2 ζ f i 2 ) 2 γ s g i s k 1 i s | s i s | 2 2 γ f k 1 i f | s i f | 2 + 2 γ s g i s k 3 i s κ 0 ϵ s + 2 γ f k 3 i f κ 0 ϵ f α v 2 i V o i + β v 2 i ,
where α v 2 i = min { | | Φ f | | 2 ς i ζ ¯ f i 2 , 2 γ s g i s k 1 i s , 2 γ f k 1 i f } , β v 2 i = 2 γ s g i s k 3 i s κ 0 ϵ s + 2 γ f k 3 i f κ 0 ϵ f . Noting | ψ ˜ f i | > ζ ¯ f i , ζ ¯ f i | ζ f i | . It is clear that V ˙ o i < 0 is true as long as V > β v 2 i α v 2 i .
Case 1c: When | ψ ˜ f i | ζ ¯ f i & Φ f T ϝ i Φ f ζ ¯ f i 2 hold, that is, b i = 0 in this case, which implies ϝ ˙ i = 0 . Combining (A5) and (39a), one obtains:
V ˙ o i 2 γ s g i s ϝ i W ˜ i T ϝ i 1 Φ s i s + 2 γ f K 0 ϝ i W ˜ i T ϝ i 1 Φ s i f + 2 γ s s i s g i s k 1 i s s i s k 2 i s sig ( s i s ) ϱ s k 3 i s tanh ( s i s ϵ s ) + W ˜ i T Φ + ζ i + 2 γ f s i f k 1 i f s i f k 2 i f sig ( s i f ) ϱ f k 3 i f tanh ( s i f ϵ f ) K 0 ( W ˜ i T Φ + ζ i ) 2 γ s g i s k 1 i s | s i s | 2 2 γ f k 1 i f | s i f | 2 + 2 γ s g i s k 3 i s κ 0 ϵ s + 2 γ f k 3 i f κ 0 ϵ f α v 3 i V o i + β v 3 i ,
where α v 3 i = min { 2 γ s g i s k 1 i s , 2 γ f k 1 i f } , β v 3 i = 2 γ s g i s k 3 i s κ 0 ϵ s + 2 γ f k 3 i f κ 0 ϵ f . It is clear that V ˙ o i < 0 is true as long as V > β v 3 i α v 3 i .
Combining the above analysis in cases 1a-1c, it can be concluded that W ^ i ( t ) Ω E i , t 0 if W ^ i ( 0 ) Ω E i .
Step 2, the boundedness of the identification error ψ ˜ f i is proved.
Since W ^ i ( t ) Ω E i holds, it can be obtained that W ˜ i ( t ) remain in the ellipsoid set defined (43), that is
W ˜ i T ϝ i u W ˜ i W ˜ i T ϝ i T W ˜ i 1 .
Therefore, one obtains
| | W ˜ i | | ϝ i u .
Moreover, Φ is bounded e Ω c e , that is, | | Φ f | | ϕ ¯ , where ϕ ¯ R + is a constant. Therefore, it can be obtained that | | ψ ˜ f i | | = | | W T ˜ Φ f | | ϝ i u ϕ ¯ .
Step 3, the boundedness of the tracking errors is proved.
Since the Lyapunov function candidate V o i is bounded, it can be further inferred that s s , s f also converges to a small residual set around 0, which also means the tracking error e 1 of the reduced slow subsystem is bounded by the predefined performance boundaries and converges to a small residual set around 0. Since the stability of both subsystems is guaranteed, then, according to Tikhonov’s theorem, the stability of the original high-order system is also guaranteed. □

References

  1. Iskandar, M.; van Ommeren, C.; Wu, X.; Albu-Schäffer, A.; Dietrich, A. Model predictive control applied to different time-scale dynamics of flexible joint robots. IEEE Rob. Autom. Lett. 2023, 8, 672–679. [Google Scholar] [CrossRef]
  2. Zhu, Y.; Liu, J.; Yu, J.; Wang, Q.G. Command filtering-based adaptive fuzzy control of flexible-joint robots with time-varying full-state constraints. IEEE Trans. Circuits Syst. II 2024, 71, 682–686. [Google Scholar] [CrossRef]
  3. Xu, X.; Xu, S. Event-triggered adaptive neural tracking control of flexible-joint robot systems with input saturation. IEEE Access 2022, 10, 43367–43375. [Google Scholar] [CrossRef]
  4. Moyrón, J.; Moreno-Valenzuela, J.; Sandoval, J. Nonlinear PI “D”-type control of flexible joint robots by using motor position measurements is globally asymptotically stable. IEEE Trans. Autom. Control 2023, 68, 3648–3655. [Google Scholar] [CrossRef]
  5. Bu, X.; Qi, Q.; Jiang, B. A simplified finite-time fuzzy neural controller with prescribed performance applied to waverider aircraft. IEEE Trans. Fuzzy Syst. 2022, 30, 2529–2537. [Google Scholar] [CrossRef]
  6. Li, J.; Ma, K.; Wu, Z. Prescribed performance control for uncertain flexible-joint robotic manipulators driven by DC motors. Int. J. Control Autom. 2021, 19, 1640–1650. [Google Scholar] [CrossRef]
  7. Liu, L.; Yao, W.H.; Guo, Y. Prescribed performance tracking control of a free-flying flexible-joint space robot with disturbances under input saturation. J. Franklin Inst. 2021, 358, 4571–4601. [Google Scholar] [CrossRef]
  8. Ma, H.; Zhou, Q.; Li, H.; Lu, R. Adaptive prescribed performance control of a flexible-joint robotic manipulator with dynamic uncertainties. IEEE Trans. Cybern. 2022, 52, 12905–12915. [Google Scholar] [CrossRef]
  9. Atawnih, A.; Doulgeri, Z.; Rovithakis, G.A. Operational space prescribed tracking performance and compliance in flexible joint robots. ASME J. Dyn. Syst. Meas. Control 2015, 137, 074503. [Google Scholar] [CrossRef]
  10. Kim, D.; Koh, K.; Cho, G.R.; Zhang, L.Q. A robust impedance controller design for series elastic actuators using the singular perturbation theory. IEEE/ASME Trans. Mechatronics 2020, 25, 164–174. [Google Scholar] [CrossRef]
  11. Sun, T.; Liang, D.; Song, Y. Singular-perturbation-based nonlinear hybrid control of redundant parallel robot. IEEE Trans. Ind. Electron. 2018, 65, 3326–3336. [Google Scholar] [CrossRef]
  12. Chen, H.Y.; Dong, X.C.; Yang, Y.; Liu, J.T. Fixed-time tracking control for flexible joint manipulator with prescribed performance constraint. IEEE Access 2021, 9, 99388–99397. [Google Scholar] [CrossRef]
  13. Guo, K.; Zheng, D.D.; Li, J. Optimal bounded ellipsoid identification with deterministic and bounded learning gains: Design and application to Euler-Lagrange systems. IEEE Trans. Cybern. 2022, 52, 10800–10813. [Google Scholar] [CrossRef]
  14. Dai, W.; Zhang, L.; Fu, J.; Chai, T.; Ma, X. Dual-rate adaptive optimal tracking control for dense medium separation process using neural networks. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 4202–4216. [Google Scholar] [CrossRef]
  15. Shen, J.; Khodak, M.; Talwalkar, A. Efficient architecture search for diverse tasks. Adv. Neural Inf. Process. Syst. 2022, 35, 16151–16164. [Google Scholar]
  16. Shen, J.; Yang, L.F. Theoretically principled deep RL acceleration via nearest neighbor function approximation. Proc. AAAI Conf. Artif. Intell. 2021, 35, 9558–9566. [Google Scholar] [CrossRef]
  17. Shen, J.; Li, L.; Dery, L.M.; Staten, C.; Khodak, M.; Neubig, G.; Talwalkar, A. Cross-modal fine-tuning: Align then refine. In Proceedings of the 40th International Conference on Machine Learning, Honolulu, HI, USA, 23–29 July 2023; Volume 202, pp. 31030–31056. [Google Scholar]
  18. Ioannou, P.A.; Sun, J. Robust Adaptive Control; Prentice-Hall, Inc.: Upper Saddle River, NJ, USA, 1996. [Google Scholar]
  19. Liu, Y.J.; Zeng, Q.; Tong, S.; Chen, C.P.; Liu, L. Actuator failure compensation-based adaptive control of active suspension systems with prescribed performance. IEEE Trans. Ind. Electron. 2020, 67, 7044–7053. [Google Scholar] [CrossRef]
  20. Yang, Q.; Zhang, F.; Sun, Q.; Wang, C. Dynamic learning from adaptive neural control for full-state constrained strict-feedback nonlinear systems. Neural Netw. 2024, 170, 596–609. [Google Scholar] [CrossRef]
  21. Rout, R.; Cui, R.; Yan, W. Sideslip-compensated guidance-based adaptive neural control of marine surface vessels. IEEE Trans. Cybern. 2022, 52, 2860–2871. [Google Scholar] [CrossRef]
  22. Pan, Y.; Guo, K.; Bobtsov, A.; Yang, C.; Yu, H. Composite error learning robot control using discontinuous Lyapunov analysis. IEEE Trans. Autom. Control 2023, 69, 1705–1712. [Google Scholar] [CrossRef]
  23. Patre, P.M.; MacKunis, W.; Johnson, M.; Dixon, W.E. Composite adaptive control for Euler–Lagrange systems with additive disturbances. Automatica 2010, 46, 140–147. [Google Scholar] [CrossRef]
  24. Zhang, H.; Zhou, X.; Wang, Z.; Yan, H.; Sun, J. Adaptive consensus-based distributed target tracking with dynamic cluster in sensor networks. IEEE Trans. Cybern. 2018, 49, 1580–1591. [Google Scholar] [CrossRef]
  25. Du, G.L.; Zhang, P.; Liu, X. Markerless human-manipulator interface using leap motion with interval Kalman filter and improved particle filter. IEEE Trans. Ind. Inform. 2016, 12, 694–704. [Google Scholar] [CrossRef]
  26. de Jesús Rubio, J.; Yu, W.; Ferreyra, A. Neural network training with optimal bounded ellipsoid algorithm. Neural Comput. Appl. 2009, 18, 623–631. [Google Scholar] [CrossRef]
  27. Ge, X.H.; Han, Q.L.; Wang, Z. A dynamic event-triggered transmission scheme for distributed set-membership estimation over wireless sensor networks. IEEE Trans. Cybern. 2019, 49, 171–183. [Google Scholar] [CrossRef]
  28. Alexander, P.; Andrey, P.; Vadim, A. Attractive Ellipsoids in Robust Control; Birkhauser: Boston, MA, USA, 2014. [Google Scholar]
  29. Jorge, D.; Alexander, P. Dynamic sliding mode control design using attracting ellipsoid method. Automatica 2011, 47, 1467–1472. [Google Scholar]
  30. Cao, L.Y.; Schwartz, H. A directional forgetting algorithm based on the decomposition of the information matrix. Automatica 2000, 36, 1725–1731. [Google Scholar] [CrossRef]
  31. Ordaz, P.; Poznyak, A. ‘KL’-gain adaptation for attractive ellipsoid method. IMA J. Math. Control. Inf. 2015, 32, 447–469. [Google Scholar] [CrossRef]
  32. Guo, K.; Zhang, Z.; Zheng, D.D.; Sun, J. Set-membership adaptive robot control with deterministically bounded learning gains. IEEE Trans. Ind. Informat. 2023, 19, 8564–8574. [Google Scholar] [CrossRef]
  33. Zheng, D.D.; Xie, W.F.; Chai, T. Identification and trajectory tracking control of nonlinear singularly perturbed system. IEEE Trans. Ind. Electron. 2017, 64, 3737–3747. [Google Scholar] [CrossRef]
  34. Zheng, D.D.; Xie, W.F.; Ren, X.; Na, J. Identification and control for singularly perturbed systems using multitime-scale neural networks. IEEE Trans. Neural Netw. Learn. Syst. 2017, 28, 321–333. [Google Scholar] [CrossRef] [PubMed]
  35. Sun, W.; Su, S.; Xia, J.; Nguyen, V. Adaptive fuzzy tracking control of flexible-joint robots with full-state constraints. IEEE Trans. Syst. Man Cybern. Syst. 2019, 49, 2201–2209. [Google Scholar] [CrossRef]
  36. Khalil, H.K. Nonlinear Systems, 3rd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2002. [Google Scholar]
  37. Chang, Y.Z.; Daniel, R.W. On the adaptive control of flexible joint robots. Automatica 1992, 28, 969–974. [Google Scholar] [CrossRef]
  38. Zhang, C.; Na, J.; Wu, J.; Chen, Q.; Huang, Y. Proportional-integral approximation-free control of robotic systems with unknown dynamics. IEEE/ASME Trans. Mechatronics 2021, 26, 2226–2236. [Google Scholar] [CrossRef]
  39. Bechlioulis, C.P.; Rovithakis, G.A. Robust adaptive control of feedback linearizable MIMO nonlinear systems with prescribed performance. IEEE Trans. Autom. Control 2008, 53, 2090–2099. [Google Scholar] [CrossRef]
  40. Guo, K.; Pan, Y.; Yu, H. Composite learning robot control with friction compensation: A neural network-based approach. IEEE Trans. Ind. Electron. 2019, 66, 7841–7851. [Google Scholar] [CrossRef]
  41. Wu, Z.; Guo, J.; Liu, B.; Ni, J.; Bu, X. Composite learning adaptive dynamic surface control for uncertain nonlinear strict-feedback systems with fixed-time parameter estimation under sufficient excitation. Int. J. Robust Nonlinear Control 2021, 31, 5865–5889. [Google Scholar] [CrossRef]
Figure 1. Schematic diagram of link i of the FJR. In the diagram, m i , I i are the mass and moment of inertia of the link i, l i is the length of the link i, and l c i denotes the position of the center of mass of link i, i = 1 , 2 .
Figure 1. Schematic diagram of link i of the FJR. In the diagram, m i , I i are the mass and moment of inertia of the link i, l i is the length of the link i, and l c i denotes the position of the center of mass of link i, i = 1 , 2 .
Applsci 14 04030 g001
Figure 2. Angular position for joint 1.
Figure 2. Angular position for joint 1.
Applsci 14 04030 g002
Figure 3. Angular position for joint 2.
Figure 3. Angular position for joint 2.
Applsci 14 04030 g003
Figure 4. Angular position error for joint 1.
Figure 4. Angular position error for joint 1.
Applsci 14 04030 g004
Figure 5. Angular position error for joint 2.
Figure 5. Angular position error for joint 2.
Applsci 14 04030 g005
Figure 6. Angular velocity for joint 1.
Figure 6. Angular velocity for joint 1.
Applsci 14 04030 g006
Figure 7. Angular velocity for joint 2.
Figure 7. Angular velocity for joint 2.
Applsci 14 04030 g007
Figure 8. Velocity tracking error for joint 1.
Figure 8. Velocity tracking error for joint 1.
Applsci 14 04030 g008
Figure 9. Velocity tracking error for joint 2.
Figure 9. Velocity tracking error for joint 2.
Applsci 14 04030 g009
Figure 10. Tracking results of τ 1 for joint 1.
Figure 10. Tracking results of τ 1 for joint 1.
Applsci 14 04030 g010
Figure 11. Tracking results of τ 1 for joint 2.
Figure 11. Tracking results of τ 1 for joint 2.
Applsci 14 04030 g011
Figure 12. NN estimation results for joint 1.
Figure 12. NN estimation results for joint 1.
Applsci 14 04030 g012
Figure 13. NN estimation results for joint 2.
Figure 13. NN estimation results for joint 2.
Applsci 14 04030 g013
Figure 14. NN estimation errors for joint 1.
Figure 14. NN estimation errors for joint 1.
Applsci 14 04030 g014
Figure 15. NN estimation errors for joint 2.
Figure 15. NN estimation errors for joint 2.
Applsci 14 04030 g015
Figure 16. Control signal for joint 1.
Figure 16. Control signal for joint 1.
Applsci 14 04030 g016
Figure 17. Control signal for joint 2.
Figure 17. Control signal for joint 2.
Applsci 14 04030 g017
Table 1. Nominal parameters of the manipulator.
Table 1. Nominal parameters of the manipulator.
ParameterValueUnit
m 1 , m 2 0.1kg
l 1 , l 2 0.4m
I 1 , I 2 0.02kg·m2
l c 1 , l c 2 0.2m
J 1 , J 2 0.01kg·m2
K9N·m/rad
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, X.; Zheng, D.; Guo, K.; Ren, X. Identification and Control of Flexible Joint Robots Based on a Composite-Learning Optimal Bounded Ellipsoid Algorithm and Prescribe Performance Control Technique. Appl. Sci. 2024, 14, 4030. https://doi.org/10.3390/app14104030

AMA Style

Li X, Zheng D, Guo K, Ren X. Identification and Control of Flexible Joint Robots Based on a Composite-Learning Optimal Bounded Ellipsoid Algorithm and Prescribe Performance Control Technique. Applied Sciences. 2024; 14(10):4030. https://doi.org/10.3390/app14104030

Chicago/Turabian Style

Li, Xianyan, Dongdong Zheng, Kai Guo, and Xuemei Ren. 2024. "Identification and Control of Flexible Joint Robots Based on a Composite-Learning Optimal Bounded Ellipsoid Algorithm and Prescribe Performance Control Technique" Applied Sciences 14, no. 10: 4030. https://doi.org/10.3390/app14104030

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop