Adaptive Dynamic Programming-Based Cross-Scale Control of a Hydraulic-Driven Flexible Robotic Manipulator

Xiaohua Wei; Jiangang Ye; Jianliang Xu; Zhiguo Tang

doi:10.3390/app13052890

¹ School of Mechanical and Electrical Engineering, Quzhou College of Technology, Quzhou 324000, China

² Quzhou Special Equipment Inspection Center, Quzhou 324000, China

³ College of Communication Engineering, Jilin University, Changchun 130022, China

* Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(5), 2890; https://doi.org/10.3390/app13052890

This article belongs to the Special Issue Adaptive Dynamic Programming and Control Application in Intelligent Systems

Featured Application

The proposed method is suitable for application in robotic arm systems.

Abstract

This paper focuses on adaptive dynamic programming (ADP)-based tracking control of a hydraulic-driven flexible robotic manipulator system (HDFRMS) with varying payloads and uncertainties via singular perturbation theory (SPT). First, the dynamics are derived using a driven Jacobian matrix, which represents the coupling between the hydraulic servo-driven system and the rigid–flexible manipulator established using the assumed mode method and the Lagrange principle. Then, the whole dynamic model of the manipulator system is decoupled into a second slow subsystem (SSS), a second fast subsystem (SFS) and a first fast subsystem (FFS). The three subsystems describe the large-range motion, the flexible vibration and the electro-hydraulic servo control, respectively. Next, an ADP trajectory tracking control law with a critic-only policy iteration algorithm is presented in the second slow timescale, while robust optimal control (ROC) in the second fast timescale and adaptive sliding mode control (ASMC) in the first fast timescale are designed using Lyapunov stability theory. Finally, numerical simulations are carried out to illustrate the correctness and robustness of the singular perturbation decomposition and the proposed composite control algorithm.

Keywords:

adaptive dynamic programming; rigid–flexible manipulator; optimal control; singular perturbation theory; assumed mode method

1. Introduction

The hydraulic-driven flexible robotic manipulator system (HDFRMS) has wide application potential in many engineering fields that humans can hardly handle directly, such as forest exploitation, heavy payload motion control, construction work, mobile equipment applications and other industrial areas [,,], since its energy consumption and weight are lower while its speed is higher.

Unlike rigid manipulators, flexible manipulators are described by a nonlinear rigid–flexible coupled model of infinite order [,]. Many control methods have been presented for flexible multibody dynamic systems, in particular flexible link manipulators and flexible joint manipulators, including PID control [,], neural network control [], sliding mode control [], adaptive control [], optimal control [], etc. Nevertheless, ill-conditioned numerical problems and inaccurate or unreliable control can occur, because these model-based control methods are designed directly without accounting for the multiple timescales in the dynamics. Singular perturbation theory (SPT) is used to deal with control problems under different timescales.

The second-order dynamic model of flexible systems can be decoupled, based on SPT, into a slow subsystem (SS) in the slow timescale, which is the traditional timescale, and a fast subsystem (FS) in a new, fast timescale. In [,] the flexible link manipulator system is decomposed through a perturbation parameter defined as a small time constant, and two sub-controllers are designed. A composite control consisting of fuzzy sliding mode control in the SS and linear quadratic regulator (LQR) optimal control in the FS is proposed in []. Additionally, in [], the slow controller is replaced by a learning controller and a disturbance observer to improve the trajectory tracking precision. An adaptive non-saturated control is considered to handle actuator saturation involving dynamics parameters in the flexible joint manipulator system in the presence of parametric uncertainties [,]. Additionally, adaptive neural backstepping control in the SS is presented as an uncertain nonlinearity compensator []. Furthermore, fuzzy control [], robust fuzzy sliding mode control [] and an adaptive high-gain Kalman filter [] are investigated to improve robustness.

Compared with electrically driven manipulators, hydraulic-driven manipulators have higher power to handle heavier payloads, with weights greater than 100 kg, and they have lower stiffness and response time. However, their dynamic equations are more complicated, and the order is raised from the second order to the third order. They are difficult to control because of the electro-hydraulic coupling and the inherent elastic vibration. A hybrid controller combining feedforward and feedback control [] and a model-based adaptive energy shaping controller [] have been proposed for hydraulic manipulator systems. Many observers, such as the adaptive observer and the variable structure disturbance observer, have also been designed to handle uncertainties and vibrations [,,]. However, the description of the dynamics is not yet complete; that is, the flexible vibration or the hydraulic drive has been ignored in past controller designs. Therefore, a control method that can track desired trajectories precisely and suppress flexible vibrations based on the complete electro-hydraulic coupling model needs to be presented.

In this work, we discuss an ADP-based trajectory tracking control law and singular perturbation decoupling for the HDFRMS with uncertainties and varying payloads. The main contributions include the following three aspects.

(1) The whole third-order dynamic model, including the three-link rigid–flexible manipulator, the asymmetric four-way valve-controlled hydraulic cylinder and the hydraulic actuator, is derived. It is more complete and closer to the actual engineering system, since both the flexible vibration and the hydraulic drive are considered in the dynamic model.

(2) A novel singular perturbation decoupling method for the electro-hydraulic servo flexible multibody dynamics system is presented. Unlike the traditional method that contains one perturbation parameter and two timescales, two singular perturbation parameters are used to establish three timescales.

(3) An optimal control method based on the critic-only policy iteration algorithm for tracking control is proposed, which simplifies the controller structure of ADP. The controller can improve the control precision and optimize the energy consumption, because the model network and control network of traditional ADP are not considered.

The rest of the paper is organized as follows. We derive the whole dynamics of the manipulator system in Section 2. Three decoupled subsystems are obtained based on SPT in Section 3. Section 4 presents the design of the composite control scheme together with the closed-loop stability analysis. Some simulation results are exhibited in Section 5. Finally, Section 6 draws some conclusions.

2. Dynamic Modeling of HDFRMS

We consider the mechanical structure involving the first rigid link with a rotary joint, the second rigid link with a revolute joint and the last flexible link with another revolute joint (see Figure 1).

Figure 1. Model of HDFRMS.

2.1. Mechanical Subsystem Model

Links 1 and 2 of the HDFRMS are rigid, whereas the third link, a slender beam, is flexible. It vibrates because the end-effector carries a heavy payload and the manipulator system undergoes large overall motions. Only the transverse vibration is considered; axial deformation and shear deformation are neglected. Therefore, Euler–Bernoulli beam theory is sufficient to describe the flexible deformation and vibration of the third link.

Concretely, the dynamic modeling of HDFRMS consists of two parts: mechanical subsystem modeling and hydraulic subsystem modeling.

We derive the dynamics of the mechanical subsystem using the assumed mode method (AMM), based on vibration analysis results, together with Lagrange’s equation. The following assumptions are made based on the actual situation. Table 1 shows the nomenclature of the mechanical subsystem.

Table 1. Notation, frequently used symbols of mechanical subsystem.

Assumption 1.

The payload is held tightly by the manipulator end-effector, and their contact is static.

Assumption 2.

The mass of each actuator is concentrated at the corresponding joint of the manipulator.

Let

X O Y

and

X_{i} O Y_{i}

be the inertial Cartesian coordinate system and the moving coordinate systems fixed on the robotic manipulator, respectively. Figure 2 illustrates the schematic diagram of the four primary coordinate systems for the HDFRMS.

Figure 2. Schematic diagram of rigid–flexible manipulator.

Considering AMM, we find the elastic displacement

ω (r, t)

of the third link as:

ω (r, t) = \sum_{j = 1}^{n} Φ_{j} (r) q_{j} (t)

(1)

In practical engineering applications, the higher-order modes are ignored and only the first few modes are retained, because the low-order modes have the greatest impact on system performance.

When flexible link 3 is treated as a cantilever beam model, one end of the link is simply supported and the other end is free []. Retaining the first two modes, we obtain the elastic displacement

ω (r, t) = Φ_{1} (r) q_{1} (t) + Φ_{2} (r) q_{2} (t)

(2)

Φ_{j} (r) = \sinh (β_{j} r) + ζ_{j} \sin (β_{j} r)

(3)

where

ζ_{j} = \sinh (β_{j} L_{3}) / \sin (β_{j} L_{3})

,

β_{j} = (j + 0.25) π / L_{3}

,

j = 1, 2

.

r

is position of any point on the third link, and

0 \leq r \leq L_{3}

.
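As a quick numerical check of Equations (2) and (3), the following Python sketch evaluates the two assumed mode shapes and confirms that they give zero displacement at r = 0 and zero curvature (zero bending moment) at the free end r = L_3. The link length L3 = 1.2 and the modal coordinates are illustrative assumptions, not values from the paper.

```python
import math

def mode_shape(j, r, L3):
    """Assumed mode shape Phi_j(r) of Eq. (3) for mode index j = 1, 2."""
    beta = (j + 0.25) * math.pi / L3
    zeta = math.sinh(beta * L3) / math.sin(beta * L3)
    return math.sinh(beta * r) + zeta * math.sin(beta * r)

def mode_shape_curvature(j, r, L3):
    """Second spatial derivative of Phi_j(r), from differentiating Eq. (3) twice."""
    beta = (j + 0.25) * math.pi / L3
    zeta = math.sinh(beta * L3) / math.sin(beta * L3)
    return beta**2 * (math.sinh(beta * r) - zeta * math.sin(beta * r))

def elastic_displacement(r, q, L3):
    """Two-mode elastic displacement omega(r, t) of Eq. (2) for q = (q1, q2)."""
    return sum(mode_shape(j, r, L3) * qj for j, qj in enumerate(q, start=1))

L3 = 1.2  # hypothetical length of link 3
for j in (1, 2):
    # Displacement at the supported end and curvature at the free end
    print(j, mode_shape(j, 0.0, L3), mode_shape_curvature(j, L3, L3))
print(elastic_displacement(0.5 * L3, (0.01, 0.002), L3))
```

Both printed boundary values should be zero up to floating-point round-off, which is consistent with the supported-free boundary conditions stated above.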

Considering the coordinate system and the kinematics of three link rigid–flexible manipulators, the kinetic energy of the mechanical subsystem can be obtained as

\begin{matrix} T & = (\frac{1}{6} ρ_{2} L_{2}^{3} + \frac{1}{2} m_{3} L_{2}^{2} + \frac{1}{2} ρ_{3} L_{3} L_{2}^{2} + \frac{1}{2} M L_{2}^{2}) ({\dot{θ}}_{2}^{2} + s_{2}^{2} {\dot{θ}}_{1}^{2}) + [\frac{1}{6} ρ_{1} (r_{2}^{3} - r_{1}^{3}) + \frac{1}{2} m_{2} r_{1}^{2}] {\dot{θ}}_{1}^{2} \\ + ρ_{3} L_{2} \int_{0}^{L_{3}} \dot{ω} d r {\dot{θ}}_{2} c_{3} + \frac{1}{2} M L_{3}^{2} [{({\dot{θ}}_{2} + {\dot{θ}}_{3})}^{2} + {\dot{θ}}_{1}^{2} s_{23}^{2}] + \frac{1}{2} ρ_{3} \int_{0}^{L_{3}} {\dot{ω}}^{2} d r + (\frac{1}{2} ρ_{3} L_{2} L_{3}^{2} \\ + M L_{2} L_{3}) [c_{3} {\dot{θ}}_{2} ({\dot{θ}}_{2} + {\dot{θ}}_{3}) + s_{2} s_{23} {\dot{θ}}_{1}^{2}] + (\frac{1}{2} ρ_{3} \int_{0}^{L_{3}} ω^{2} d r + \frac{1}{2} M ω_{L_{3}}^{2}) [{({\dot{θ}}_{2} + {\dot{θ}}_{3})}^{2} + c_{23}^{2} {\dot{θ}}_{1}^{2}] \\ + L_{2} (ρ_{3} \int_{0}^{L_{3}} ω d r + M ω_{L_{3}}) [s_{2} c_{23} {\dot{θ}}_{1}^{2} - s_{3} {\dot{θ}}_{2} ({\dot{θ}}_{2} + {\dot{θ}}_{3})] + (ρ_{3} \int_{0}^{L_{3}} \dot{ω} r d r + M L_{3} {\dot{ω}}_{L_{3}}) ({\dot{θ}}_{2} + {\dot{θ}}_{3}) \\ + \frac{1}{2} M {\dot{ω}}_{L_{3}}^{2} + (ρ_{3} \int_{0}^{L_{3}} ω r d r + M L_{3} ω_{L_{3}}) s_{23} c_{23} {\dot{θ}}_{1}^{2} + M L_{2} {\dot{ω}}_{L_{3}} {\dot{θ}}_{2} c_{3} + \frac{1}{6} ρ_{3} L_{3}^{3} [{({\dot{θ}}_{2} + {\dot{θ}}_{3})}^{2} + {\dot{θ}}_{1}^{2} s_{23}^{2}] \end{matrix}

(4)

Including the elastic potential energy of flexible link 3, gravitational potential energy of manipulators and payloads, the potential energy of the mechanical subsystem can be derived as

\begin{matrix} V & = (\frac{1}{2} ρ_{1} L_{1} + ρ_{2} L_{2} + ρ_{3} L_{3} + m_{2} + m_{3} + M) g L_{1} + (\frac{1}{2} ρ_{2} L_{2} + ρ_{3} L_{3} + m_{3} + M) g L_{2} c_{2} \\ + \frac{1}{2} E I \int_{0}^{L_{3}} {\frac{\partial^{2} ω}{\partial r^{2}}}^{2} d r + (M g L_{3} + \frac{1}{2} ρ_{3} g L_{3}^{2} - ρ_{3} g \int_{0}^{L_{3}} ω d r) c_{23} - M g ω_{L_{3}} s_{23} \end{matrix}

(5)

where

c_{1} = \cos θ_{1}

,

c_{2} = \cos θ_{2}

,

c_{3} = \cos θ_{3}

,

s_{1} = \sin θ_{1}

,

c_{23} = \cos (θ_{2} + θ_{3})

,

s_{3} = \sin θ_{3}

,

s_{2} = \sin θ_{2}

,

s_{23} = \sin (θ_{2} + θ_{3})

,

ω = ω (r, t)

,

\dot{ω} = Φ_{1} (r) {\dot{q}}_{1} (t) + Φ_{2} (r) {\dot{q}}_{2} (t)

,

{\dot{ω}}_{L_{3}} = Φ_{1} (L_{3}) {\dot{q}}_{1} (t) + Φ_{2} (L_{3}) {\dot{q}}_{2} (t)

and

\frac{\partial^{2} ω}{\partial r^{2}} = {\ddot{Φ}}_{1} (r) q_{1} (t) + {\ddot{Φ}}_{2} (r) q_{2} (t)

.

Defining a Lagrange function

L

, which consists of total potential energy

V

and total kinetic energy

T

, we get

L = T - V

.

According to the following Lagrange equation []

\frac{d}{d t} (\frac{\partial L}{\partial {\dot{p}}_{i}}) - \frac{\partial L}{\partial p_{i}} = τ_{i}

(6)

the dynamic model of the rigid–flexible manipulator without the viscous damping and the external interference can be expressed as

M (θ, q) [\begin{matrix} \ddot{θ} \\ \ddot{q} \end{matrix}] + K [\begin{matrix} θ \\ q \end{matrix}] + G (θ, \dot{θ}, q, \dot{q}) = [\begin{matrix} τ \\ 0 \end{matrix}]

(7)

where

M (θ, q)

is the

5 \times 5

positive definite, symmetric, time-varying generalized inertia matrix,

K = \mathrm{diag} (k_{1}, k_{2})

is the stiffness matrix of the flexible link 3,

G (θ, \dot{θ}, q, \dot{q}) =

{[g_{1} g_{2} g_{3} g_{4} g_{5}]}^{T}

consists of Coriolis force, centrifugal force and gravity,

θ = {[θ_{1} θ_{2} θ_{3}]}^{T}

is the joint angles,

q = {[q_{1} q_{2}]}^{T}

is the mode coordinates and

τ = {[τ_{1} τ_{2} τ_{3}]}^{T}

is the control torque.

Remark 1.

Utilizing the Lagrange equation, p contains joint angles

θ

and elastic displacement

ω

. The joint angles

θ

and control torque

τ

are considered when

i = 1, 2, 3

. Otherwise, we have

\frac{d}{d t} (\frac{\partial L}{\partial \dot{ω}}) - \frac{\partial L}{\partial ω} = 0

2.2. Hydraulic Subsystem Model

Joint 1 is a rotary joint, which is driven by a rotary hydraulic motor. Joints 2 and 3 are revolute joints, which are driven by a linear asymmetric hydraulic cylinder. The manipulator needs to move the payload along some predefined trajectory in practice.

The hydraulic-driven subsystem consists of a hydraulic cylinder and hydraulic motor. Additionally, Table 2 shows the nomenclature of the hydraulic subsystem. The asymmetric four-way valve-controlled hydraulic cylinder is a kind of hydraulic actuator converting hydraulic energy into linear motion mechanical energy as shown in Figure 3.

Table 2. Notation, frequently used symbols of hydraulic subsystem.

Figure 3. Schematic diagram of asymmetric hydraulic cylinder.

In practice, the time constant of the mechanical system is much greater than that of the hydraulic servo valve. Therefore, the dynamics of the servo valve are ignored, and the servo valve spool displacement

x_{v}

is proportional to the control input current

I

, with gain

K_{i} = K_{v} K_{a}

, where

K_{v}

is the proportional valve gain and

K_{a}

is the amplifier gain:

x_{v} = K_{i} I

(8)

Considering the law of cosines, depending on the installation location of hydraulic cylinder, we have

\begin{array}{l} y_{2} = \sqrt{L_{21}^{2} + L_{11}^{2} - 2 L_{11} L_{21} \cos (180^{\circ} - θ_{2})} \\ y_{3} = \sqrt{L_{22}^{2} + L_{31}^{2} - 2 L_{22} L_{31} \cos (180^{\circ} - θ_{3})} \\ \dot{y} = J \dot{θ} \end{array}

(9)

where

J = \mathrm{diag} (J_{2}, J_{3})

is the matrix relating the actuator linear displacement to the link angular displacement.

Additionally, the details of this matrix

J

are described below.

\begin{array}{l} J_{2} = \frac{\partial y_{2}}{\partial θ_{2}} = \frac{- L_{21} L_{11} \sin θ_{2}}{\sqrt{L_{21}^{2} + L_{11}^{2} + 2 L_{11} L_{21} \cos θ_{2}}} \\ J_{3} = \frac{\partial y_{3}}{\partial θ_{3}} = \frac{- L_{22} L_{31} \sin θ_{3}}{\sqrt{L_{22}^{2} + L_{31}^{2} + 2 L_{22} L_{31} \cos θ_{3}}} \end{array}

(10)
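The closed-form Jacobian entries in (10) follow from differentiating the law-of-cosines lengths in (9). The Python sketch below checks this numerically for one joint by comparing the closed form with a central finite difference. The geometry values La and Lb (standing in for a pair such as L_{11}, L_{21}) are hypothetical, and angles are taken in radians, so 180° becomes π.

```python
import math

def cylinder_length(theta, La, Lb):
    """Actuator length from the law of cosines, Eq. (9); theta in radians."""
    return math.sqrt(La**2 + Lb**2 - 2.0 * La * Lb * math.cos(math.pi - theta))

def drive_jacobian(theta, La, Lb):
    """Closed-form dy/dtheta as written in Eq. (10)."""
    denom = math.sqrt(La**2 + Lb**2 + 2.0 * La * Lb * math.cos(theta))
    return -La * Lb * math.sin(theta) / denom

def numeric_jacobian(theta, La, Lb, h=1e-6):
    """Central finite-difference approximation of dy/dtheta."""
    y_plus = cylinder_length(theta + h, La, Lb)
    y_minus = cylinder_length(theta - h, La, Lb)
    return (y_plus - y_minus) / (2.0 * h)

# Hypothetical mounting distances (m) and joint angle (rad)
La, Lb, theta = 0.4, 0.3, 0.7
print(drive_jacobian(theta, La, Lb), numeric_jacobian(theta, La, Lb))
```

The two printed values should agree to several decimal places, which supports the sign and the form of (10).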

According to the relationships of force balances and the basic theory of the hydraulic system [], we obtain

\begin{cases} Q_{L} = K_{q} x_{v} - K_{c} P_{L} \\ Q_{L} = C_{t m} P_{L} + \frac{V_{t}}{4 β_{e}} {\dot{P}}_{L} + A_{p} \dot{y} \\ x_{v} = K_{i} I \\ τ = A_{p} J P_{L} \end{cases} \qquad A_{p} = \begin{cases} A_{1}, & x_{v} \geq 0 \\ A_{2}, & x_{v} < 0 \end{cases}

(11)

Applying (9) and (11), we can derive that

\dot{τ} + [\frac{4 β_{e}}{V_{t}} (K_{c} + C_{t m}) - \frac{\dot{J}}{J}] τ + \frac{4 β_{e} A_{p}^{2}}{V_{t}} J^{2} \dot{θ} = \frac{4 β_{e} A_{p} K_{q} K_{i}}{V_{t}} J I

(12)

where

V_{t} = 2 L A_{p}^{4} / (A_{1}^{3} + A_{2}^{3})

.

Similarly, the kinetics of the valve-controlled hydraulic motor can be derived as

\begin{cases} x_{v} = K_{i} I \\ τ = P_{L} D \\ Q_{L} = K_{q} x_{v} - K_{c} P_{L} \\ Q_{L} = D \dot{θ} + C_{t m} P_{L} + \frac{V_{t}}{4 β_{e}} {\dot{P}}_{L} \end{cases}

(13)

where

K_{q} = C_{d} w \sqrt{(P_{s} - P_{L}) / ρ}

.

By simple calculation of (13), we can obtain the relationship between current and torque as follows.

\dot{τ} + \frac{4 β_{e}}{V_{t}} (C_{t m} + K_{c}) τ + \frac{4 β_{e} D^{2}}{V_{t}} \dot{θ} = \frac{4 β_{e} D K_{q} K_{i}}{V_{t}} I

(14)

From (12) and (14), the model of the hydraulic subsystem can be given as

\dot{τ} + A τ + B \dot{θ} = C I

(15)
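For the motor-driven joint 1, comparing (14) with (15) term by term gives the scalar coefficients A, B and C directly. The following Python sketch reads them off and evaluates the torque rate. All numerical parameter values are hypothetical and chosen only to keep the arithmetic simple; they are not taken from the paper.

```python
def motor_coefficients(beta_e, V_t, C_tm, K_c, D, K_q, K_i):
    """Coefficients (A, B, C) of Eq. (15), read off from Eq. (14) for the hydraulic motor."""
    k = 4.0 * beta_e / V_t
    A = k * (C_tm + K_c)
    B = k * D**2
    C = k * D * K_q * K_i
    return A, B, C

def torque_rate(tau, theta_dot, current, A, B, C):
    """Eq. (15) solved for the torque derivative: dtau/dt = -A*tau - B*theta_dot + C*I."""
    return -A * tau - B * theta_dot + C * current

# Hypothetical, dimensionless parameter values (assumptions for illustration)
A, B, C = motor_coefficients(beta_e=8.0, V_t=2.0, C_tm=0.25, K_c=0.25,
                             D=0.5, K_q=2.0, K_i=1.0)
print(A, B, C)
print(torque_rate(tau=1.0, theta_dot=0.5, current=1.0, A=A, B=B, C=C))
```

For the cylinder-driven joints, the same comparison with (12) would additionally involve the Jacobian J and its time derivative.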

From the previous derivations, (7) and (15) together constitute the dynamic model of the HDFRMS.

\{\begin{array}{l} M (θ, q) [\begin{matrix} \ddot{θ} \\ \ddot{q} \end{matrix}] + K [\begin{matrix} θ \\ q \end{matrix}] + G (θ, \dot{θ}, q, \dot{q}) = [\begin{matrix} τ \\ 0 \end{matrix}] \\ \dot{τ} + A τ + B \dot{θ} = C I \end{array}

(16)

3. Cross-Scale Decoupling of Manipulator System

The dynamic model of the HDFRMS is essentially a strongly coupled, third-order, time-varying nonlinear differential equation, which makes the control law design difficult. In addition, the coupling makes it difficult to guarantee control performance when uncertainties and disturbances are present in the system.

In this section, the complete dynamics of the HDFRMS are decomposed twice for control design based on singular perturbation. The resulting subsystems are the second slow subsystem, the second fast subsystem and the first fast subsystem.

3.1. First Singular Perturbation Decomposition

Firstly, the complete dynamic model can be decoupled into the first stage SS that characterizes the rigid–flexible manipulator, as well as the first stage FS characterizing the hydraulic servo-driven part.

Defining the first singular perturbation parameter

ε_{1} = \frac{1}{β_{e}}

, it satisfies

0 < ε_{1} \ll 1

. Then, the dynamic model (16) can be rewritten as

\{\begin{array}{l} M (θ, q) [\begin{matrix} \ddot{θ} \\ \ddot{q} \end{matrix}] + K [\begin{matrix} θ \\ q \end{matrix}] + G (θ, \dot{θ}, q, \dot{q}) = [\begin{matrix} τ \\ 0 \end{matrix}] \\ ε_{1} \dot{τ} = - \tilde{A} τ - \tilde{B} \dot{θ} + \tilde{C} I \end{array}

(17)

where

\tilde{A} = ε_{1} A

,

\tilde{B} = ε_{1} B

,

\tilde{C} = ε_{1} C

.

When perturbation parameter

ε_{1}

is small enough, the higher order infinitesimal

o (ε_{1})

can be ignored. According to multiple timescale theories, the variables

τ

and

I

can be represented as

τ = τ_{f 1} + τ_{s 1}, \quad I = I_{f 1} + I_{s 1}

(18)

where subscript

f 1

and subscript

s 1

represent the first level of fast timescales and slow timescales, respectively.

Setting

ε_{1} = 0

, we obtain the first stage SS, namely FSS

\{\begin{array}{l} M (θ, q) [\begin{matrix} \ddot{θ} \\ \ddot{q} \end{matrix}] + K [\begin{matrix} θ \\ q \end{matrix}] + G (θ, \dot{θ}, q, \dot{q}) = [\begin{matrix} τ_{s 1} \\ 0 \end{matrix}] \\ τ_{s 1} = - {\tilde{A}}^{- 1} \tilde{B} \dot{θ} + {\tilde{A}}^{- 1} \tilde{C} I_{s 1} \end{array}

(19)

We now introduce the first level of fast timescale

σ_{1}

in the boundary layer, and we have

σ_{1} = \frac{t}{ε_{1}}

(20)

The first-level slow variable

τ_{s 1}

is considered constant near the boundary layer

ε_{1} \to 0

. According to (17) and (19), the first stage FS, namely FFS, can be expressed as

\frac{d τ_{f 1}}{d σ_{1}} = - \tilde{A} τ_{f 1} + \tilde{C} I_{f 1}

(21)
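The separation between the quasi-steady torque in (19) and the boundary-layer dynamics in (21) can be illustrated with a scalar toy version of the second equation in (17). In the Python sketch below, the coefficients a, b, c (scalar stand-ins for the scaled matrices), the frozen joint velocity and the current are all hypothetical. The torque is integrated with explicit Euler on the fast timescale and compared with the quasi-steady value.

```python
def quasi_steady_torque(a, b, c, theta_dot, current):
    """Scalar form of the slow relation in Eq. (19): tau_s1 = (c*I - b*theta_dot) / a."""
    return (c * current - b * theta_dot) / a

def simulate_fast_layer(eps, a, b, c, theta_dot, current, tau0, n_steps=2000):
    """Explicit Euler integration of eps * dtau/dt = -a*tau - b*theta_dot + c*I."""
    dt = eps / 100.0  # step chosen to resolve the fast timescale sigma_1 = t / eps
    tau = tau0
    for _ in range(n_steps):
        tau += dt * (-a * tau - b * theta_dot + c * current) / eps
    return tau

# Hypothetical scalar values; theta_dot and current are frozen during the fast transient
a, b, c = 2.0, 0.5, 3.0
theta_dot, current = 0.4, 1.0
tau_s = quasi_steady_torque(a, b, c, theta_dot, current)
tau_end = simulate_fast_layer(eps=1e-3, a=a, b=b, c=c,
                              theta_dot=theta_dot, current=current, tau0=0.0)
print(tau_s, tau_end)
```

After a time of about 20 ε_1 the simulated torque is indistinguishable from the quasi-steady value, so on the slow timescale only τ_{s1} matters, while the deviation τ_{f1} decays as described by (21).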

3.2. Second Singular Perturbation Decomposition

The FSS (19) is decoupled again. We obtain the SSS, which characterizes the large overall motion of the manipulator without any flexibility, and the SFS, which describes only the flexible vibration of the third link.

We define the inverse matrix

D

of matrix

M

as

D = M^{- 1} = {[\begin{matrix} M_{1} & M_{2} \\ M_{3} & M_{4} \end{matrix}]}^{- 1} = [\begin{matrix} D_{1} & D_{2} \\ D_{3} & D_{4} \end{matrix}]

(22)

where

M_{1} \in R^{3 \times 3}

,

D_{1} \in R^{3 \times 3}

.

Additionally, we make

G (θ, \dot{θ}, q, \dot{q}) = [\begin{array}{l} G_{1} \\ G_{2} \end{array}]

(23)

Substituting (22) and (23) into the first stage of the slow subsystem (19), we can obtain

\begin{array}{l} \ddot{θ} = & - D_{2} (θ, q) K_{x} q - D_{1} (θ, q) G_{1} (θ, \dot{θ}, q, \dot{q}) - D_{2} (θ, q) G_{2} (θ, \dot{θ}, q, \dot{q}) \\ + D_{1} (θ, q) (- {\tilde{A}}^{- 1} \tilde{B} \dot{θ} + {\tilde{A}}^{- 1} \tilde{C} I_{s 1}) \end{array}

(24)

\begin{array}{l} \ddot{q} = & - D_{4} (θ, q) K_{x} q - D_{3} (θ, q) G_{1} (θ, \dot{θ}, q, \dot{q}) - D_{4} (θ, q) G_{2} (θ, \dot{θ}, q, \dot{q}) \\ + D_{3} (θ, q) (- {\tilde{A}}^{- 1} \tilde{B} \dot{θ} + {\tilde{A}}^{- 1} \tilde{C} I_{s 1}) \end{array}

(25)

where

K_{x} = [\begin{matrix} k_{1} \\ k_{2} \end{matrix}]

.

We define the second singular perturbation parameter

ε_{2} = \frac{1}{k}

with

k = \min (k_{1}, k_{2})

.

Comparing with the first singular perturbation parameter, they satisfy the following inequality:

0 < ε_{1} \ll ε_{2} \ll 1

.

We denote

\tilde{K} = ε_{2} K_{x}, \quad q = ε_{2} y

(26)

Setting

ε_{2} = 0

, substituting (26) into (24) and (25), we can derive

\begin{array}{l} \ddot{θ} = & - D_{2, s 2} (θ, 0) \tilde{K} y_{s 2} - D_{1, s 2} (θ, 0) G_{1, s 2} (θ, \dot{θ}, 0, 0) - D_{2, s 2} (θ, 0) G_{2, s 2} (θ, \dot{θ}, 0, 0) \\ + D_{1, s 2} (θ, 0) (- {\tilde{A}}^{- 1} \tilde{B} \dot{θ} + {\tilde{A}}^{- 1} \tilde{C} I_{s 2}) \end{array}

(27)

\begin{matrix} 0 = & - D_{4, s 2} (θ, 0) \tilde{K} y_{s 2} - D_{3, s 2} (θ, 0) G_{1, s 2} (θ, \dot{θ}, 0, 0) - D_{4, s 2} (θ, 0) G_{2, s 2} (θ, \dot{θ}, 0, 0) \\ + D_{3, s 2} (θ, 0) (- {\tilde{A}}^{- 1} \tilde{B} \dot{θ} + {\tilde{A}}^{- 1} \tilde{C} I_{s 2}) \end{matrix}

(28)

By solving Equation (28), we obtain

\begin{matrix} y_{s 2} = & - {\tilde{K}}^{- 1} D_{4, s 2}^{- 1} (θ, 0) D_{3, s 2} (θ, 0) G_{1, s 2} (θ, \dot{θ}, 0, 0) - {\tilde{K}}^{- 1} D_{4, s 2}^{- 1} (θ, 0) D_{4, s 2} (θ, 0) G_{2, s 2} (θ, \dot{θ}, 0, 0) \\ + {\tilde{K}}^{- 1} D_{4, s 2}^{- 1} (θ, 0) D_{3, s 2} (θ, 0) (- {\tilde{A}}^{- 1} \tilde{B} \dot{θ} + {\tilde{A}}^{- 1} \tilde{C} I_{s 2}) \end{matrix}

(29)

where subscript

s 2

represents the second stage slow timescale, and

I_{s 2}

is the control current under the second stage slow timescale.

Substituting (29) into Equation (27), the SSS can be derived as

M_{1, s 2} (θ, 0) \ddot{θ} + G_{1, s 2} (θ, \dot{θ}, 0, 0) = - {\tilde{A}}^{- 1} \tilde{B} \dot{θ} + {\tilde{A}}^{- 1} \tilde{C} I_{s 2}

(30)
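The step from (27) and (29) to (30) relies on the block-inverse identity D_1 - D_2 D_4^{-1} D_3 = M_1^{-1}. The Python sketch below checks this identity with exact rational arithmetic on a hypothetical symmetric positive definite 5 × 5 matrix (three joint coordinates plus two modal coordinates). The matrix entries are assumptions for illustration only and do not come from the manipulator model.

```python
from fractions import Fraction

def mat_inv(A):
    """Gauss-Jordan inverse with exact rational arithmetic."""
    n = len(A)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(A)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [vr - factor * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def mat_mul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def mat_sub(A, B):
    return [[a - b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def block(A, rows, cols):
    return [[A[i][j] for j in cols] for i in rows]

# Hypothetical inertia matrix: symmetric and strictly diagonally dominant, hence positive definite
M = [[6, 1, 0, 1, 0],
     [1, 5, 1, 0, 1],
     [0, 1, 4, 1, 0],
     [1, 0, 1, 4, 1],
     [0, 1, 0, 1, 3]]
rigid, flex = [0, 1, 2], [3, 4]

D = mat_inv(M)
D1, D2 = block(D, rigid, rigid), block(D, rigid, flex)
D3, D4 = block(D, flex, rigid), block(D, flex, flex)

schur = mat_sub(D1, mat_mul(mat_mul(D2, mat_inv(D4)), D3))
M1_inv = mat_inv(block(M, rigid, rigid))
print(schur == M1_inv)
```

Because the arithmetic is exact, the comparison holds entry by entry, which is why the rigid-body inertia block M_{1,s2} appears alone on the left-hand side of (30).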

Similarly, ignoring

o (ε_{2})

, the variables

y

and

I_{s 1}

can be represented as

y = y_{f 2} + y_{s 2}, \quad I_{s 1} = I_{f 2} + I_{s 2}

(31)

where the subscript

f 2

represents the second stage fast timescale.

We introduce the second level of fast timescale

σ_{2}

in the boundary layer, and we have

σ_{2} = \frac{t}{\sqrt{ε_{2}}}

(32)

The second level slow variable

y_{s 2}

is considered constant near the boundary layer

ε_{2} \to 0

. According to Equations (25), (29) and (31), SFS can be obtained as

\frac{d^{2} y_{f 2}}{d σ_{2}^{2}} = - D_{4, s 2} (θ, ε_{2} y) \tilde{K} y_{f 2} + D_{3, s 2} (θ, ε_{2} y) {\tilde{A}}^{- 1} \tilde{C} I_{f 2}

(33)

where

I_{f 2}

is the control current under the second fast timescale

σ_{2}

.

4. Control Design

4.1. ADP Trajectory Control

ADP is an emerging approximate optimal control method, which is an important branch of machine learning. It has been applied in blast furnace gas systems [], reusable launch vehicles [], manipulators [], road intersection path planning [] and other fields, and has achieved good results, especially in improving the tracking accuracy of reusable dynamic systems.

The structure of the traditional ADP control system, consisting of an action network, a critic network and a model network, is shown in Figure 4; the model network is not needed when the dynamic model has been derived. In the SSS, the trajectory tracking controller is designed based on ADP with a critic network only: its optimal feedback control depends only on the gradient of the optimal cost function, which is output by the critic network and obtained by online iteration [,,]. This controller simplifies the training process and eliminates the approximation error between the two networks, which helps improve the tracking accuracy.

Figure 4. Basic idea of traditional ADP.

Considering the uncertainties and disturbances of the actual system, let

x = {[x_{1}, x_{2}]}^{T} = {[θ, \dot{θ}]}^{T}

, then SSS (30) is organized as

\begin{cases} {\dot{x}}_{1} = x_{2} \\ {\dot{x}}_{2} = f (x) + g (x) I_{s 2} \\ y = x_{1} \end{cases}

(34)

where

f (x) = - M_{1, s 2}^{- 1} (θ, 0) [G_{1, s 2} (θ, \dot{θ}, 0, 0) + {\tilde{A}}^{- 1} \tilde{B} \dot{θ} + ψ]

,

g (x) = M_{1, s 2}^{- 1} (θ, 0) {\tilde{A}}^{- 1} \tilde{C}

, and

f (x)

and

g (x)

are both Lipschitz functions.

Assumption 3.

The sum of uncertainties and disturbances

ψ

in the second slow subsystem has an unknown upper bound

ψ^{*}

, which is

‖ψ‖ \leq ψ^{*}

.

Let the desired trajectory and actual trajectory of the subsystem be

x_{d}

and

x

, respectively, then the trajectory tracking error is

e = x - x_{d}

(35)

We define the performance index as follows

J (e (τ)) = \int_{0}^{\infty} N (e (τ), I_{s 2} (e (τ))) d τ

(36)

where

N (e (τ), I_{s 2} (e (τ))) = e^{T} Q e + I_{s 2}^{T} R I_{s 2}

is the utility function. There is

N (0, 0) = 0

,

N (e, I_{s 2}) > 0

holds for all

e

and

I_{s 2}

, and

Q \in R^{n \times n}

,

R \in R^{m \times m}

are positive definite matrices [].

Let

I_{s 2 d}

denote the desired control law, then

I_{s 2 d} = g^{+} (x_{d}) ({\dot{x}}_{d} - f (x_{d}))

(37)

Therefore,

\dot{e} = \dot{x} - {\dot{x}}_{d} = f (x) - f (x_{d}) + g (x) I_{s 2} - g (x_{d}) I_{s 2 d}

(38)

For SSS, the control law

I_{s 2}

includes two parts, the desired control

I_{s 2 d}

and the optimal control

I_{s 2 v}

, that is,

I_{s 2} = I_{s 2 d} + I_{s 2 v}

(39)

Therefore, denoting

f_{e} = f (x) - f (x_{d})

, we have

\dot{e} = f_{e} + [g (x) - g (x_{d})] I_{s 2} + g (x_{d}) I_{s 2 v}

(40)
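The rearrangement from (38) to (40) uses only the definition of the desired control in (37) and the split in (39). A minimal Python sketch of this check on a scalar system is given below. The drift f, the input gain g and all numerical values are hypothetical; in the scalar case the pseudo-inverse g^{+} reduces to 1/g.

```python
import math

def f(x):
    return -math.sin(x) - 0.5 * x      # hypothetical drift term

def g(x):
    return 1.0 + 0.1 * math.cos(x)     # hypothetical input gain, bounded away from zero

def desired_control(x_d, x_d_dot):
    """Eq. (37) in the scalar case: I_d = (x_d_dot - f(x_d)) / g(x_d)."""
    return (x_d_dot - f(x_d)) / g(x_d)

def error_rate_direct(x, x_d_dot, I):
    """e_dot = x_dot - x_d_dot with x_dot = f(x) + g(x) * I."""
    return f(x) + g(x) * I - x_d_dot

def error_rate_eq40(x, x_d, I, I_v):
    """Right-hand side of Eq. (40), with f_e = f(x) - f(x_d)."""
    f_e = f(x) - f(x_d)
    return f_e + (g(x) - g(x_d)) * I + g(x_d) * I_v

# Hypothetical actual state, desired state and desired velocity
x, x_d, x_d_dot = 0.8, 0.5, 0.2
I_d = desired_control(x_d, x_d_dot)
I_v = -0.3                 # hypothetical feedback part of the control
I = I_d + I_v              # Eq. (39)
lhs = error_rate_direct(x, x_d_dot, I)
rhs = error_rate_eq40(x, x_d, I, I_v)
print(lhs, rhs)
```

The two values coincide up to round-off, so the feedback part of the control enters the error dynamics only through the last term of (40) and through the gain mismatch g(x) - g(x_d).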

The optimal control

I_{s 2 v}

can ensure that the trajectory tracking error of the subsystem converges to the steady state in an optimal manner.

Accordingly, the performance index (36) can be rewritten as

J (e (τ)) = \int_{0}^{\infty} N (e (τ), I_{s 2 v} (e (τ))) d τ

(41)

where

N (e, I_{s 2 v}) = e^{T} Q e + I_{s 2 v}^{T} R I_{s 2 v}

is the utility function,

N (0, 0) = 0

,

N (e, I_{s 2 v}) > 0

holds for all e and

I_{s 2 v}

.

I_{s 2 v} \in Φ (Ω)

and

Φ (Ω)

is a set of allowable control sequences.

Definition 1.

For the second slow tracking error subsystem (40), if there exists a set of admissible controls

I_{s 2} (e) \in Φ (Ω)

that are continuous and satisfy

I_{s 2} (e) = 0

when

\forall e \in Ω

, then the subsystem is guaranteed by

I_{s 2} (e)

to converge on a compact set with a finite performance index function [,].

If the performance index (41) is continuously differentiable, then its infinitesimal form can be expressed as

0 = N (e, I_{s 2 v}) + {(\nabla J (e))}^{T} (f_{e} + [g (x) - g (x_{d})] I_{s 2} + g (x_{d}) I_{s 2 v})

(42)

where

J (0) = 0

,

N (0, 0) = 0

,

\nabla J (e)

is the partial derivative of

J (e)

with respect to e, that is,

\nabla J (e) = \frac{\partial J (e)}{\partial e}

.

We define the Hamiltonian function and the optimal performance index as

H (e, I_{s 2 v}, \nabla J (e)) = N (e, I_{s 2 v}) + {(\nabla J (e))}^{T} (f_{e} + [g (x) - g (x_{d})] I_{s 2} + g (x_{d}) I_{s 2 v})

(43)

J^{*} (e) = \min_{I_{s 2 v}} \int_{0}^{\infty} N (e (τ), I_{s 2 v} (e (τ))) d τ

(44)

Clearly,

J^{*} (e)

satisfies the Hamilton–Jacobi–Bellman (HJB) equation

0 = \min_{I_{s 2 v}} H (e, I_{s 2 v}, \nabla J^{*} (e))

(45)

where

\nabla J^{*} (e) = \frac{\partial J^{*} (e)}{\partial e}

.

If

J^{*} (e)

exists and is continuously differentiable, the optimal feedback control law can be solved by the critic-only policy iteration algorithm as

I_{s 2 v}^{*} = - \frac{1}{2} R^{- 1} g^{T} (x) \nabla J^{*} (e)

(46)

Combining (42) and (45), we have

{(\nabla J (e))}^{T} (f_{e} + [g (x) - g (x_{d})] I_{s 2} + g (x_{d}) I_{s 2 v}) = - e^{T} Q e - I_{s 2 v}^{T} R I_{s 2 v}

(47)

Figure 5 shows the critic-only policy iteration process. The control policy is evaluated using Equation (42), and the optimal feedback control law is then obtained from the evaluation result using Equation (45). In the algorithm, to improve the regulation performance of the system, the performance index function is approximated by a neural network as

J (e) = w_{τ}^{T} σ_{τ} (e) + ε_{τ}

(48)

where

l

is the number of neurons in the hidden layer,

w_{τ} \in R^{l}

is the ideal neural network weight,

σ_{τ} (e)

is the neural network activation function and

ε_{τ}

is the evaluation network approximation error.

Figure 5. Flowchart of critic-only policy iteration algorithm.

Then, we calculate performance index function gradient

\nabla J (e)

\nabla J (e) = {(\nabla σ_{τ} (e))}^{T} w_{τ} + \nabla ε_{τ}

(49)

where

\nabla σ_{τ} (e) = \frac{\partial σ_{τ} (e)}{\partial e} \in R^{l \times n}

and

\nabla ε_{τ}

denote the gradients of

σ_{τ} (e)

and

ε_{τ}

, respectively.

Substituting (49) into (42), we can obtain the Hamiltonian function

H (e, I_{s 2 v}, w_{τ}) = N (e, I_{s 2 v}) + (w_{τ}^{T} \nabla σ_{τ} (e)) \dot{e} = - \nabla ε_{τ}^{T} \dot{e} ≜ e_{P H}

(50)

where

e_{P H}

is the residual error of the approximation neural network.

Let

{\hat{w}}_{τ}

and

{\tilde{w}}_{τ}

denote, respectively, the estimate and the estimation error of the evaluation neural network weights

w_{τ}

. Therefore, the output of the evaluation network

\hat{J} (e)

and its gradient are

\hat{J} (e) = {\hat{w}}_{τ}^{T} σ_{τ} (e), \quad \nabla \hat{J} (e) = {(\nabla σ_{τ} (e))}^{T} {\hat{w}}_{τ}

(51)

The approximate Hamiltonian function is

H (e, I_{s 2 v}, {\hat{w}}_{τ}) = N (e, I_{s 2 v}) + ({\hat{w}}_{τ}^{T} \nabla σ_{τ} (e)) \dot{e} ≜ e_{P}

(52)

The performance criterion that needs to be minimized for the neural network training process [] is

E_{P} = \frac{1}{2} e_{P}^{T} e_{P}

(53)

The weights are then updated using the gradient descent method, with

{\dot{\hat{w}}}_{τ} = - α_{τ} e_{P} η

(54)

where

η = \nabla σ_{τ} (e) \dot{e}

,

α_{τ}

is the evaluation network learning rate and

α_{τ} > 0

.
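The update law (54) is a gradient descent step on E_P in (53) with respect to the weight estimate. The Python sketch below applies one explicit Euler step of (54) for a two-dimensional error and a hypothetical quadratic basis σ(e) = [e_1^2, e_1 e_2, e_2^2]. The basis, the sample values of e and its derivative, the utility value, the learning rate and the step size are all assumptions for illustration.

```python
def grad_sigma(e):
    """Gradient (l x n = 3 x 2) of the hypothetical basis sigma(e) = [e1^2, e1*e2, e2^2]."""
    e1, e2 = e
    return [[2.0 * e1, 0.0], [e2, e1], [0.0, 2.0 * e2]]

def eta(e, e_dot):
    """eta = grad_sigma(e) * e_dot, as defined after Eq. (54)."""
    return [row[0] * e_dot[0] + row[1] * e_dot[1] for row in grad_sigma(e)]

def residual(w_hat, e, e_dot, utility):
    """Approximate Hamiltonian e_P of Eq. (52): N + w_hat^T grad_sigma(e) e_dot."""
    return utility + sum(w * h for w, h in zip(w_hat, eta(e, e_dot)))

def critic_step(w_hat, e, e_dot, utility, alpha, dt):
    """One explicit Euler step of the weight update law (54)."""
    e_p = residual(w_hat, e, e_dot, utility)
    return [w - dt * alpha * e_p * h for w, h in zip(w_hat, eta(e, e_dot))]

# Hypothetical sample of error, error rate and utility N(e, I_s2v)
e, e_dot, utility = (0.3, -0.1), (-0.1, 0.2), 0.12
w0 = [1.0, 0.0, 1.0]
w1 = critic_step(w0, e, e_dot, utility, alpha=5.0, dt=0.01)
before = residual(w0, e, e_dot, utility)
after = residual(w1, e, e_dot, utility)
print(before, after)
```

For a fixed sample, one step scales the residual by 1 - α_τ Δt ‖η‖², so the magnitude of e_P decreases whenever this factor lies between -1 and 1. Δt denotes the integration step of the sketch and is not a quantity from the paper.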

Since

{\tilde{w}}_{τ} = w_{τ} - {\hat{w}}_{τ}

(55)

we obtain

e_{P} = e_{P H} - {\tilde{w}}_{τ}^{T} \nabla σ_{τ} (e) \dot{e}

(56)

Therefore, the update law of the weight estimation error is

{\dot{\tilde{w}}}_{τ} = - {\dot{\hat{w}}}_{τ} = α_{τ} (e_{P H} - {\tilde{w}}_{τ}^{T} \nabla σ_{τ} (e) \dot{e}) \nabla σ_{τ} (e) \dot{e}

(57)

The ideal optimal feedback control law and the corresponding iterative control law are

\begin{array}{l} I_{s 2 v} = - \frac{1}{2} R^{- 1} g^{T} (x) ({(\nabla σ_{τ} (e))}^{T} w_{τ} + \nabla ε_{τ}) \\ {\hat{I}}_{s 2 v} = - \frac{1}{2} R^{- 1} g^{T} (x) {(\nabla σ_{τ} (e))}^{T} {\hat{w}}_{τ} \end{array}

(58)
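Once the critic weights are available, the iterative control law in (58) is a direct function of the critic gradient. A minimal Python sketch for a single-input case with a two-dimensional error is given below. The quadratic basis, the input column g(x), the scalar weight R and the weight estimate are hypothetical values chosen for illustration.

```python
def grad_sigma(e):
    """Gradient (3 x 2) of the hypothetical basis sigma(e) = [e1^2, e1*e2, e2^2]."""
    e1, e2 = e
    return [[2.0 * e1, 0.0], [e2, e1], [0.0, 2.0 * e2]]

def critic_gradient(w_hat, e):
    """grad J_hat(e) = grad_sigma(e)^T w_hat, the second relation in Eq. (51)."""
    rows = grad_sigma(e)
    return [sum(rows[k][i] * w_hat[k] for k in range(len(w_hat))) for i in range(2)]

def approx_optimal_control(w_hat, e, g_col, R):
    """Single-input form of the iterative control law in Eq. (58)."""
    grad_J = critic_gradient(w_hat, e)
    return -0.5 / R * sum(gi * dJ for gi, dJ in zip(g_col, grad_J))

# Hypothetical weights, tracking error, input column g(x) and control weight R
w_hat = [1.0, 0.0, 1.0]
e = (0.3, -0.1)
I_v_hat = approx_optimal_control(w_hat, e, g_col=(0.0, 2.0), R=0.5)
print(I_v_hat)
```

With these weights the approximate cost is e_1^2 + e_2^2, so the control acts against the velocity error component, which is the only state component the input reaches directly in (34).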

Assumption 4.

The desired control law

I_{s 2 d}

and

η

both have unknown upper bounds, that is,

‖I_{s 2 d}‖ \leq w_{u d}, \quad ‖η‖ \leq η_{M}

(59)

Theorem 1.

Under the conditions of Assumptions 3 and 4, if the solution of the neural network-based

HJB

equation exists, considering the state space model (30) in SSS and the evaluation network weight update rate (57), if the optimal control law of the system trajectory tracking is chosen as

I_{s 2} = I_{s 2 d} + {\hat{I}}_{s 2 v} = g^{+} (x_{d}) ({\dot{x}}_{d} - f (x_{d})) - \frac{1}{2} R^{- 1} g^{T} (x) {(\nabla σ_{τ} (e))}^{T} {\hat{w}}_{τ}

(60)

then both the weight approximation error and the system trajectory tracking error are uniformly ultimately bounded.

Proof.

Considering the Lyapunov theory, we define a positive definite energy function

V_{1} = \frac{1}{2} e^{T} e + J^{*} (e) + \frac{1}{2 α_{τ}} {\tilde{w}}_{τ}^{T} {\tilde{w}}_{τ}

(61)

Taking the time derivative of (61), we obtain

\begin{matrix} {\dot{V}}_{1} & = \frac{1}{α_{τ}} {\tilde{w}}_{τ}^{T} {\dot{\tilde{w}}}_{τ} + {(\nabla J^{*} (e))}^{T} \dot{e} + e^{T} \dot{e} \\ & = \frac{1}{α_{τ}} {\tilde{w}}_{τ}^{T} {\dot{\tilde{w}}}_{τ} + {(\nabla J^{*} (e))}^{T} \dot{e} + e^{T} [f_{e} + [g (x) - g (x_{d})] I_{s 2} + g (x_{d}) I_{s 2 v}] \\ & = \frac{1}{α_{τ}} {\tilde{w}}_{τ}^{T} {\dot{\tilde{w}}}_{τ} - e^{T} Q e - I_{s 2 v}^{T} R I_{s 2 v} + e^{T} [f_{e} + [g (x) - g (x_{d})] I_{s 2} + g (x_{d}) I_{s 2 v}] \end{matrix}

(62)

Because

f (x)

is a

Lipschitz

function, there exists a constant

L_{f} > 0

such that

‖f_{e}‖ \leq L_{f} ‖e‖

. According to Assumption 1,

g (x)

and

g (x_{d})

are bounded, so we can set

‖g (x)‖ \leq w_{g}, ‖g (x_{d})‖ \leq w_{g d}

(63)

and then

‖g (x) - g (x_{d})‖ \leq Δ w_{g}

(64)

Using the triangle inequality and Young's inequality, we have

\begin{matrix} {\dot{V}}_{1} & \leq L_{f} {‖e‖}^{2} + Δ w_{g} ‖I_{s 2}‖ ‖e‖ + w_{g d} ‖I_{s 2 v}‖ ‖e‖ - e^{T} Q e - I_{s 2 v}^{T} R I_{s 2 v} + \frac{1}{α_{τ}} {\tilde{w}}_{τ}^{T} {\dot{\tilde{w}}}_{τ} \\ & \leq L_{f} {‖e‖}^{2} + Δ w_{g} ‖I_{s 2 d} + I_{s 2 v}‖ ‖e‖ + w_{g d} ‖I_{s 2 v}‖ ‖e‖ - e^{T} Q e - I_{s 2 v}^{T} R I_{s 2 v} + {\tilde{w}}_{τ}^{T} (e_{P H} - {\tilde{w}}_{τ}^{T} η) η \\ & \leq L_{f} {‖e‖}^{2} + Δ w_{g} ‖I_{s 2 d}‖ ‖e‖ + \frac{1}{2} Δ w_{g}^{2} {‖I_{s 2 v}‖}^{2} + \frac{1}{2} {‖e‖}^{2} + {\tilde{w}}_{τ}^{T} e_{P H} η \\ & \quad + \frac{1}{2} w_{g d}^{2} {‖e‖}^{2} - {‖{\tilde{w}}_{τ}^{T} η‖}^{2} + \frac{1}{2} {‖I_{s 2 v}‖}^{2} - λ_{\min} (R) {‖I_{s 2 v}‖}^{2} - λ_{\min} (Q) {‖e‖}^{2} \\ & \leq - [(λ_{\min} (Q) - L_{f} - \frac{1}{2} w_{g d}^{2} - \frac{1}{2}) ‖e‖ - Δ w_{g} ‖I_{s 2 d}‖] ‖e‖ \\ & \quad - (λ_{\min} (R) - \frac{1}{2} Δ w_{g}^{2} - \frac{1}{2}) {‖I_{s 2 v}‖}^{2} + \frac{1}{2} e_{P H}^{2} - \frac{1}{2} {‖{\tilde{w}}_{τ}^{T} η‖}^{2} \\ & \leq - [(λ_{\min} (Q) - L_{f} - \frac{1}{2} w_{g d}^{2} - \frac{1}{2}) ‖e‖ - Δ w_{g} w_{u d}] ‖e‖ \\ & \quad - (λ_{\min} (R) - \frac{1}{2} Δ w_{g}^{2} - \frac{1}{2}) {‖I_{s 2 v}‖}^{2} + \frac{1}{2} e_{P H}^{2} - \frac{1}{2} {‖{\tilde{w}}_{τ}^{T} η‖}^{2} \end{matrix}

(65)
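As a reading aid, the cross terms in (65) can be bounded by the following elementary estimates. They are spelled out here under the assumption that the residual e_{P H} and the product of the weight error with η are scalars, as a single-output critic suggests; each follows from ab ≤ a²/2 + b²/2.

Δ w_{g} ‖I_{s 2 d} + I_{s 2 v}‖ ‖e‖ \leq Δ w_{g} ‖I_{s 2 d}‖ ‖e‖ + \frac{1}{2} Δ w_{g}^{2} {‖I_{s 2 v}‖}^{2} + \frac{1}{2} {‖e‖}^{2}

w_{g d} ‖I_{s 2 v}‖ ‖e‖ \leq \frac{1}{2} w_{g d}^{2} {‖e‖}^{2} + \frac{1}{2} {‖I_{s 2 v}‖}^{2}

{\tilde{w}}_{τ}^{T} η e_{P H} \leq \frac{1}{2} e_{P H}^{2} + \frac{1}{2} {‖{\tilde{w}}_{τ}^{T} η‖}^{2}

The quadratic forms are bounded from below by their smallest eigenvalues:

e^{T} Q e \geq λ_{\min} (Q) {‖e‖}^{2}, \quad I_{s 2 v}^{T} R I_{s 2 v} \geq λ_{\min} (R) {‖I_{s 2 v}‖}^{2}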

Clearly, when the weight estimation error and the tracking error lie outside the sets

Ω_{1} = \{{\tilde{w}}_{τ} : ‖{\tilde{w}}_{τ}‖ \leq \frac{e_{P H}}{η_{M}}\}, \quad Ω_{2} = \{e : ‖e‖ \leq \frac{Δ w_{g} w_{u d}}{λ_{\min} (Q) - L_{f} - \frac{1}{2} w_{g d}^{2} - \frac{1}{2}}\}

(66)

and the weighting matrices satisfy the conditions

λ_{\min} (Q) \geq L_{f} + \frac{1}{2} w_{g d}^{2} + \frac{1}{2}, \quad λ_{\min} (R) \geq \frac{1}{2} Δ w_{g}^{2} + \frac{1}{2}

(67)

we have

{\dot{V}}_{1} \leq 0

(68)

Thus, by the Lyapunov stability theory, both the joint angle tracking error in SSS and the neural network weight approximation error are uniformly ultimately bounded.

This completes the proof. □

4.2. Robust Optimal Vibration Control

To design an optimal control law with a quadratic performance index, the model of SFS is rewritten as

{\dot{X}}_{k} = A_{k} X_{k} + B_{k} I_{f 2}

(69)

where

X_{k} = [\begin{matrix} y_{f 2} \\ \frac{d y_{f 2}}{d σ_{2}} \end{matrix}]

,

A_{k} = [\begin{matrix} 0 & E \\ - D_{4, s 2} (θ, ε_{2} y) \tilde{K} & 0 \end{matrix}]

,

B_{k} = [\begin{matrix} 0 \\ D_{3, s 2} (θ, ε_{2} y) {\tilde{A}}^{- 1} \tilde{C} \end{matrix}]

.

The control objective is to drive the system state to zero, that is, to suppress elastic vibrations in SFS. Since SFS is a linear system and it is easy to verify that it is completely controllable, the optimal control method can be used.

The quadratic performance index function is selected as follows:

J_{k} = \frac{1}{2} \int_{0}^{\infty} [X_{k}^{T} Q X_{k} + I_{f 2}^{T} R I_{f 2}] d t

(70)

The algebraic Riccati equation can be written as

A_{k}^{T} P + P A_{k} - P B_{k} R^{- 1} B_{k}^{T} P + Q = 0

(71)

Then, the optimal feedback control law is

I_{f 2} = - K_{f} X_{k} = - R^{- 1} B_{k}^{T} P X_{k}

(72)
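The steps (70) to (72) can be made concrete on a single vibration mode. The following is a minimal sketch under stated assumptions, not the authors' code: it takes a hypothetical one-mode system with A = [[0, 1], [-k, 0]], B = [[0], [b]], Q = diag(q1, q2) and scalar R = r, for which the algebraic Riccati equation (71) can be solved in closed form entry by entry. The numeric values and the function names are illustrative only.

```python
import math


def lqr_single_mode(k, b, q1, q2, r):
    """Closed-form stabilizing solution of (71) for one undamped mode.

    Assumes A = [[0, 1], [-k, 0]], B = [[0], [b]], Q = diag(q1, q2), R = r.
    Returns the Riccati matrix P and the gain K_f = R^{-1} B^T P of (72).
    """
    g = b * b / r
    p12 = (-k + math.sqrt(k * k + q1 * g)) / g   # from the (1,1) entry of (71)
    p22 = math.sqrt((2.0 * p12 + q2) / g)        # from the (2,2) entry of (71)
    p11 = k * p22 + g * p12 * p22                # from the (1,2) entry of (71)
    P = [[p11, p12], [p12, p22]]
    K = [b * p12 / r, b * p22 / r]
    return P, K


def are_residual(k, b, q1, q2, r, P):
    """Entries (1,1), (1,2), (2,2) of A^T P + P A - P B R^{-1} B^T P + Q."""
    p11, p12, p22 = P[0][0], P[0][1], P[1][1]
    g = b * b / r
    return (-2.0 * k * p12 - g * p12 * p12 + q1,
            p11 - k * p22 - g * p12 * p22,
            2.0 * p12 - g * p22 * p22 + q2)


if __name__ == "__main__":
    P, K = lqr_single_mode(k=100.0, b=2.0, q1=55.0, q2=65.0, r=15.0)
    print("P =", P)
    print("K_f =", K)  # control: I_f2 = -K_f X_k
    print("residual =", are_residual(100.0, 2.0, 55.0, 65.0, 15.0, P))
```

For the full SFS with several modes and inputs, no such closed form is available and a numerical Riccati solver would normally be used instead.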

When parameter uncertainties are present in SFS, the optimal control law must still guarantee the stability of the system while retaining good dynamic performance. Writing the system matrix as the sum of a nominal part A_{k 0} and an uncertain part Δ A_{k}, the dynamic equation with uncertainties becomes

{\dot{X}}_{k} = (A_{k 0} + Δ A_{k}) X_{k} + B_{k} I_{f 2}

(73)

Theorem 2.

For the second fast subsystem with parameter uncertainty, if the uncertainty satisfies

{‖Δ A_{k}‖}_{2} < \frac{λ_{\min}}{2 ρ (P)}

where

λ_{\min}

is the minimum eigenvalue of

Q + P B_{k} R^{- 1} B_{k}^{T} P

and

ρ (P)

is the spectral radius of

P

, and

I_{f 2}

is chosen as the optimal control law (72), then the closed-loop asymptotic stability of SFS is guaranteed.

Proof.

Considering the Lyapunov theory, define the second positive definite energy function

V_{2} = X_{k}^{T} P X_{k}

(74)

Substituting (69), (72) and (73) into the time derivative of Equation (74), we obtain

\begin{matrix} {\dot{V}}_{2} & = {\dot{X}}_{k}^{T} P X_{k} + X_{k}^{T} P {\dot{X}}_{k} \\ & = X_{k}^{T} (A_{k 0}^{T} P + P A_{k 0} - 2 P B_{k} R^{- 1} B_{k}^{T} P) X_{k} + 2 X_{k}^{T} P Δ A_{k} X_{k} \\ & = - X_{k}^{T} (Q + P B_{k} R^{- 1} B_{k}^{T} P) X_{k} + 2 X_{k}^{T} P Δ A_{k} X_{k} \end{matrix}

(75)

Because both

P

and

Q + P B_{k} R^{- 1} B_{k}^{T} P

are symmetric matrices, we have

2 X_{k}^{T} P Δ A_{k} X_{k} < 2 ρ (P) \cdot {‖Δ A_{k}‖}_{2} \cdot {‖X_{k}‖}_{2}^{2}, \quad \frac{X_{k}^{T} (Q + P B_{k} R^{- 1} B_{k}^{T} P) X_{k}}{X_{k}^{T} X_{k}} \geq λ_{\min}

(76)

Substituting inequality (76) into Equation (75), we have

{\dot{V}}_{2} < [2 ρ (P) \cdot {‖Δ A_{k}‖}_{2} - λ_{\min}] {‖X_{k}‖}_{2}^{2}

(77)

When the inequality

{‖Δ A_{k}‖}_{2} < \frac{λ_{\min}}{2 ρ (P)}

holds,

{\dot{V}}_{2}

is negative for every nonzero state. Hence, the closed-loop second fast subsystem is asymptotically stable.

This completes the proof. □
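As a quick sanity check of the bound in Theorem 2, consider a hypothetical scalar instance that is not taken from the manipulator model: A_{k 0} = 0, B_{k} = 1 and Q = R = 1. The Riccati equation (71) and the gain in (72) reduce to

- P^{2} + 1 = 0 \Rightarrow P = 1, \quad K_{f} = R^{- 1} B_{k}^{T} P = 1

so that

λ_{\min} = Q + P B_{k} R^{- 1} B_{k}^{T} P = 2, \quad ρ (P) = 1, \quad \frac{λ_{\min}}{2 ρ (P)} = 1

The perturbed closed loop is

{\dot{X}}_{k} = (Δ A_{k} - 1) X_{k}

which is asymptotically stable exactly when Δ A_{k} < 1. The sufficient condition |Δ A_{k}| < 1 given by the theorem is therefore consistent with the direct calculation, and in this toy case it is not conservative for positive perturbations.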

4.3. Adaptive Sliding Mode Servo Control

In practical operation of the hydraulic servo control system, uncertain factors such as perturbation of the elastic modulus of the hydraulic oil, hydraulic oil leakage and friction of moving parts directly affect the stability and dynamic characteristics of the hydraulic servo control subsystem. Therefore, the FFS dynamics (21) with uncertainties are rewritten as

\frac{d τ_{f 1}}{d σ_{1}} + {\tilde{A}}_{0} τ_{f 1} + F = {\tilde{C}}_{0} I_{f 1}

(78)

where

F = Δ \tilde{A} τ_{f 1} - Δ \tilde{C} I_{f 1}

is the overall uncertainty, and it is bounded.

If

\hat{F}

is the estimated value of

F

, define the estimation error

\tilde{F} = F - \hat{F}

(79)

We define the tracking error vector

e_{f} = τ_{f 1 d} - τ_{f 1}, \quad {\dot{e}}_{f} = {\dot{τ}}_{f 1 d} - {\dot{τ}}_{f 1}

(80)

then select a sliding surface

s_{f} = e_{f}

(81)

Theorem 3.

Considering the hydraulic servo FFS, the ASMC is presented as

I_{f 1} = {\tilde{C}}_{0}^{- 1} [\frac{d τ_{f 1 d}}{d σ_{1}} + {\tilde{A}}_{0} τ_{f 1} + \hat{F} + ξ sgn (s_{f})]

(82)

\dot{\hat{F}} = Λ^{T} s_{f}

(83)

Then, the closed-loop FFS is uniformly ultimately stable.

Proof.

Considering the Lyapunov theory, we define the third positive definite energy function

V_{3} = \frac{1}{2} s_{f}^{T} s_{f} + \frac{1}{2} {\tilde{F}}^{T} Λ^{- 1} \tilde{F}

(84)

Differentiating (84) with respect to time yields

\begin{matrix} {\dot{V}}_{3} & = s_{f}^{T} {\dot{s}}_{f} + {\dot{\tilde{F}}}^{T} Λ^{- 1} \tilde{F} \\ & = s_{f}^{T} (\frac{d τ_{f 1 d}}{d σ_{1}} + {\tilde{A}}_{0} τ_{f 1} + F - {\tilde{C}}_{0} I_{f 1}) - {\dot{\hat{F}}}^{T} Λ^{- 1} \tilde{F} \\ & = s_{f}^{T} (F - \hat{F} - ξ sgn (s_{f})) - s_{f}^{T} Λ Λ^{- 1} \tilde{F} \\ & = - s_{f}^{T} ξ sgn (s_{f}) \\ & \leq 0 \end{matrix}

(85)

From (85),

V_{3} (s_{f} (t)) \leq V_{3} (s_{f} (0))

, so

s_{f} (t)

is bounded. Defining the auxiliary function

V_{4} = - s_{f}^{T} k s_{f}

, its integral satisfies

\int_{0}^{\infty} V_{4} (τ) d τ \leq V_{3} (s_{f} (0)) - V_{3} (s_{f} (\infty)) < \infty

. Therefore, by Barbalat's lemma, the tracking error converges asymptotically to zero as

t \to \infty

.

This completes the proof. □
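The behaviour predicted by Theorem 3 can be reproduced on a scalar toy plant. The following is a minimal sketch under stated assumptions rather than the authors' simulation: it uses a hypothetical scalar version of (78) with constants `a0`, `c0`, a constant unknown uncertainty `F`, a sinusoidal desired signal, and explicit Euler integration. The feedforward term uses the derivative of the desired signal, which is what the cancellation in the Lyapunov derivative requires. All numeric values are illustrative.

```python
import math


def sign(x):
    """Sign function returning -1, 0 or 1."""
    return (x > 0) - (x < 0)


def simulate_asmc(a0=2.0, c0=4.0, F=0.5, xi=2.0, lam=5.0, dt=1e-3, t_end=5.0):
    """Simulate a scalar plant d(tau)/dt + a0*tau + F = c0*I under ASMC.

    Returns a list of (time, sliding variable, uncertainty estimate).
    """
    tau, F_hat = 0.5, 0.0                      # initial state and initial estimate
    history = []
    for step in range(int(round(t_end / dt))):
        t = step * dt
        tau_d, dtau_d = math.sin(t), math.cos(t)   # hypothetical desired signal
        s = tau_d - tau                             # sliding surface s_f = e_f, cf. (81)
        current = (dtau_d + a0 * tau + F_hat + xi * sign(s)) / c0  # control law, cf. (82)
        dtau = -a0 * tau - F + c0 * current         # plant dynamics, cf. (78)
        tau += dt * dtau
        F_hat += dt * lam * s                       # adaptive law, cf. (83)
        history.append((t, s, F_hat))
    return history


if __name__ == "__main__":
    t, s, F_hat = simulate_asmc()[-1]
    print(f"t = {t:.3f} s, s_f = {s:.5f}, F_hat = {F_hat:.4f}")
```

With the sign function, the discrete-time sliding variable chatters in a band whose width is proportional to the step size, which is the effect the saturation function is later used to reduce.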

4.4. Composite Control

According to the theoretical analysis above, the composite control law for the HDFRMS (16) consists of (60), (72), (82) and (83). Figure 6 shows the block diagram of the closed-loop manipulator control system.

Figure 6. Block diagram of HDFRMS with controllers.

In Figure 6, the desired angles and angular velocities

x_{d}

after trajectory planning are provided as input to the manipulator system. Meanwhile, three control outputs

I_{s 2}

,

I_{f 2}

and

I_{f 1}

can be calculated individually using information measured by sensors such as photoelectric encoders, pressure sensors and strain gauges. These control outputs are then summed after timescale transformation, which closes the loop of the HDFRMS with the proposed control schemes.

Theorem 4.

The stabilities of SSS (30), SFS (33) and FFS (21) are guaranteed under adaptive dynamic programming trajectory control law (60), robust optimal vibration control law (72) and adaptive sliding mode servo control law (82) and (83), based on SPT. Therefore, the closed-loop stability of the whole manipulator system (16) can be guaranteed under the composite controller.

5. Simulations

In this section, simulation results (implemented in MATLAB) are analyzed to demonstrate the effectiveness and robustness of the singular perturbation decoupling method and of the composite control laws proposed in the previous sections. All tests are conducted on the HDFRMS, whose dynamics are given in (16), with the structure parameters listed in Table 3.

Table 3. HDFRMS structure parameters.

In ADP trajectory control, the critic neural network is a back-propagation (BP) neural network with a 6-8-1 structure, that is, 6 input neurons, 8 hidden neurons and 1 output neuron (see Figure 7). Its activation function, learning rate and initial weight values are selected as the sigmoid function,

α_{τ} = 0.02

, and

{\hat{w}}_{τ 0} = {[\begin{matrix} 1.4 & 1.7 & 0.6 & 1.3 & 0.4 & 1.9 & 0.8 & 0.9 \end{matrix}]}^{T}

, respectively.

Figure 7. The structure of BP neural network.
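To make the 6-8-1 structure concrete, the following minimal sketch evaluates a critic of the form Ĵ(e) = ŵᵀσ(Ve) and its gradient with respect to e. It is an illustration under assumptions, not the authors' network: the output weights are the initial values quoted above, while the input-layer weights `V` are hypothetical (drawn from a seeded random generator) because the text does not list them, and treating σ as a sigmoid hidden layer with fixed input weights is itself an assumption.

```python
import math
import random

# Initial output-layer weights quoted in the text.
W0 = [1.4, 1.7, 0.6, 1.3, 0.4, 1.9, 0.8, 0.9]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def make_input_weights(n_hidden=8, n_in=6, seed=0):
    """Hypothetical fixed input-layer weights (not given in the text)."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_hidden)]


def critic(e, V, w):
    """Return (J_hat, grad_J_hat) for J_hat = w^T sigmoid(V e)."""
    hidden = [sigmoid(sum(v * x for v, x in zip(row, e))) for row in V]
    j_hat = sum(wi * hi for wi, hi in zip(w, hidden))
    # Chain rule: dJ/de_j = sum_i w_i * h_i * (1 - h_i) * V[i][j]
    grad = [sum(w[i] * hidden[i] * (1.0 - hidden[i]) * V[i][j] for i in range(len(w)))
            for j in range(len(e))]
    return j_hat, grad


if __name__ == "__main__":
    V = make_input_weights()
    e = [0.1, -0.2, 0.05, 0.0, 0.3, -0.1]   # hypothetical 6-dimensional tracking error
    print(critic(e, V, W0))
```

The gradient returned here plays the role of the term multiplying the weights in the approximate control law, so it is worth checking against finite differences before using it in a controller.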

In addition, the parameters of robust optimal vibration control and adaptive sliding mode servo control can be given as

Q = \mathrm{diag} (\begin{matrix} 55 & 65 & 40 & 60 \end{matrix})

,

R = \mathrm{diag} (\begin{matrix} 15 & 10 & 24 \end{matrix})

,

ξ = \mathrm{diag} (\begin{matrix} 27 & 36 & 28 \end{matrix})

,

Λ = \mathrm{diag} (\begin{matrix} 60 & 90 & 85 \end{matrix})

.

The initial positions of the manipulator joints are selected as

θ_{d 0} = {[\begin{matrix} 1 & 0.1 & 0.2 \end{matrix}]}^{T}

, and their desired trajectories are chosen as

θ_{d} = {[\begin{matrix} 0.75 \cos 2 t & - 0.1 + 0.8 \sin 1.5 t & 0.5 \cos t \end{matrix}]}^{T}

.

The sampling period is 0.1 ms and the simulation duration is 10 s. The simulation results are shown in Figures 8–13.

Figure 8. Tracking performances and vibration suppression of the manipulator without uncertainties.

Figure 9. Weights of critic neural network.

Figure 10. Tracking performances and vibration suppression of the manipulator with uncertainties.

Figure 11. Control current comparisons with full payload and uncertainties.

Figure 12. Tracking performances and vibration suppression of the manipulator with uncertainties in different conditions with varying payload.

Figure 13. Comparison with the ASMC in SSS.

5.1. Reference-Tracking Performance in System without Uncertainties

When there is no uncertainty in HDFRMS, the tracking performances of the joints (see Figure 8a–c) and the vibration suppression of flexible link 3 (see Figure 8d–f) are tested. All simulations use the same control parameters under three working conditions: the end-effector without payload (

M = 0 kg

), with half payload (

M = 300 kg

) and with full payload (

M = 600 kg

).

Figure 8a–c indicates that all joint angles track the desired trajectories within 3 s. In particular, joint 1 tracks its trajectory faster than the other joints because it is a rotary joint, which is insensitive to vibrations. Figure 8d–f shows that both the first-order and second-order modes are controlled effectively, although vibration suppression takes about 3 s with full payload. Note that only forced vibration caused by large-range joint motion exists in the system. In addition, Figure 9 illustrates that the weights of the critic neural network converge to

{[\begin{matrix} 0.994 & 0.003 & 1.001 & 1.013 & 0.499 & - 0.007 & 0.002 & 1.007 \end{matrix}]}^{T}

.

5.2. Reference-Tracking Performance in System with Uncertainties

To verify the robustness of HDFRMS closed-loop system, as shown in Figure 10 and Figure 11, the total uncertainties

ψ

in SSS and the uncertainties

F

in FFS are chosen as

ψ = {[\begin{matrix} 0.001 \sin 1.5 t + 0.0004 \mathrm{rand} (1) & 0.0005 \cos t + 0.0005 \mathrm{rand} (1) & 0.0005 \sin 0.5 t + 0.0005 \mathrm{rand} (1) \end{matrix}]}^{T}

and

F = {[\begin{matrix} 0.001 \cos 5 t + 0.001 \mathrm{rand} (1) & 0.001 \sin 3.5 t + 0.001 \mathrm{rand} (1) & 0.001 \sin 4 t + 0.001 \mathrm{rand} (1) \end{matrix}]}^{T}

, respectively. Here, the sine and cosine terms represent parameter uncertainties in the manipulator system, and the random terms represent external disturbances and noise.
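For readers who want to reproduce the disturbance signals outside MATLAB, the following minimal sketch generates ψ(t) and F(t) with the coefficients quoted above. It assumes that rand(1) denotes one uniform draw on the unit interval per evaluation, which Python's `random.random()` approximates; the function names and the optional seeded generator are illustrative.

```python
import math
import random


def psi(t, rng=random):
    """Total uncertainty in SSS at time t (three joints)."""
    return [
        0.001 * math.sin(1.5 * t) + 0.0004 * rng.random(),
        0.0005 * math.cos(t) + 0.0005 * rng.random(),
        0.0005 * math.sin(0.5 * t) + 0.0005 * rng.random(),
    ]


def f_uncertainty(t, rng=random):
    """Uncertainty F in FFS at time t (three channels)."""
    return [
        0.001 * math.cos(5.0 * t) + 0.001 * rng.random(),
        0.001 * math.sin(3.5 * t) + 0.001 * rng.random(),
        0.001 * math.sin(4.0 * t) + 0.001 * rng.random(),
    ]


if __name__ == "__main__":
    rng = random.Random(0)   # seeded generator for repeatable runs
    for t in (0.0, 0.5, 1.0):
        print(t, psi(t, rng), f_uncertainty(t, rng))
```

Passing a seeded `random.Random` instance makes a simulation run repeatable, which helps when comparing controllers under identical disturbances.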

Figure 10 shows the tracking performances of the joints and the vibration suppression of the third flexible arm in HDFRMS with uncertainties. Compared with the previous cases, joint angle 2 has the longest response time of the system, nearly 3.5 s (see Figure 10a–c). Furthermore, the tracking error of joint angle 3 fluctuates between 9.2 s and 9.5 s because of the combined effects of system uncertainties, accumulated errors, full payload and motion coupling, but the fluctuation remains below 0.01 rad. Even the largest error fluctuations therefore satisfy the steady-state requirements. Figure 10d–f shows that the vibration is also small and is suppressed quickly.

The chattering of the control currents with full payload and uncertainties is compared in Figure 11. Clearly, chattering is weakened when the sign function is replaced by a saturation function with a boundary layer of 0.01.
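The difference between the two switching functions can be sketched in a few lines. This is a minimal illustration that assumes the usual definition of the saturation function, linear inside the boundary layer and clipped to ±1 outside it; the text does not spell out the exact form used, and the names `sgn` and `sat` are illustrative.

```python
def sgn(s):
    """Discontinuous sign function used in the ideal sliding mode law."""
    return (s > 0) - (s < 0)


def sat(s, phi=0.01):
    """Saturation function with boundary layer thickness phi."""
    return max(-1.0, min(1.0, s / phi))


if __name__ == "__main__":
    for s in (-0.5, -0.005, 0.0, 0.005, 0.5):
        print(s, sgn(s), sat(s))
```

Inside the boundary layer the switching term is proportional to the sliding variable, so the control current no longer jumps between its extreme values at every sign change.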

5.3. Reference-Tracking Performance in System with Varying Payloads and Uncertainties

To test whether the composite controller adapts to varying payloads, three working conditions are considered. In the first, the payload mass changes gradually from 600 kg to 0 kg between 0 s and 6 s. In the other two, the payload mass changes suddenly from 600 kg to 200 kg, at 6 s and at 2 s, respectively.

Figure 12 shows the tracking performances and vibration suppression of the manipulator with uncertainties in different conditions with varying payloads. Table 4 compares the performance of the composite controller with uncertainties, using three evaluation criteria for the trajectory tracking errors: the root mean squared error (RMSE), integral squared error (ISE) and integral absolute error (IAE) [].
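The three criteria can be computed from a sampled error signal as follows. This is a minimal sketch that assumes uniformly sampled errors and a simple rectangle rule for the integrals; the text does not state which quadrature rule or averaging window was used, and the function name is illustrative.

```python
import math


def tracking_metrics(errors, dt):
    """Return RMSE, ISE and IAE of a uniformly sampled tracking error."""
    n = len(errors)
    sum_sq = sum(e * e for e in errors)
    return {
        "RMSE": math.sqrt(sum_sq / n),            # root mean squared error
        "ISE": sum_sq * dt,                       # integral of e^2 (rectangle rule)
        "IAE": sum(abs(e) for e in errors) * dt,  # integral of |e| (rectangle rule)
    }


if __name__ == "__main__":
    # Hypothetical decaying error sampled every 1 ms over 10 s.
    dt = 1e-3
    errors = [0.25 * math.exp(-2.0 * k * dt) for k in range(10000)]
    print(tracking_metrics(errors, dt))
```

RMSE is insensitive to the length of the record, whereas ISE and IAE grow with the integration horizon, so the latter two are only comparable between runs of equal duration.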

Table 4. Controller performance comparisons with uncertainties.

Figure 12a–c indicates that the tracking error converges faster when the payload mass changes gradually than when it changes suddenly. The settling time is about 1 s. In addition, the convergence rate of the tracking error is almost the same whether the sudden change occurs at 2 s or at 6 s. The error curves nearly coincide (see Figure 12a–c) and their RMSE, ISE and IAE are almost the same (see Table 4). Therefore, a sudden change in mass does not noticeably affect the tracking performance of the manipulator system, whether in the transient process or in the steady state.

5.4. The Comparison with the ASMC in SSS

We choose an ASMC, which can handle uncertain systems, as a baseline for comparison with ADP in SSS. The ASMC is designed as

\begin{matrix} I_{s 2} = {\tilde{C}}^{- 1} \tilde{A} (M_{1, s 2} (θ, 0) {\ddot{θ}}_{r} + {\tilde{A}}^{- 1} \tilde{B} {\dot{θ}}_{r} + G_{1, s 2} (θ, \dot{θ}, 0, 0) + \hat{ψ} + ε \mathrm{sat} (s_{s})) \\ {\dot{θ}}_{r} = {\dot{θ}}_{d} + λ (θ_{d} - θ) \\ \dot{\hat{ψ}} = Υ^{T} s_{s} \end{matrix}

(86)

where

ε = \mathrm{diag} (40, 35, 20)

,

λ = \mathrm{diag} (25, 25, 45)

and

Υ = \mathrm{diag} (240, 185, 200)

. The closed-loop stability proof for the ASMC is similar to that in [].

The payload mass is selected as 450 kg. The uncertainties and other controller parameters remain unchanged, consistent with previous simulations.

The simulation results are shown in Figure 13, which indicates that the transient response of the ADP in SSS is significantly faster than that of the ASMC. The ASMC shows little advantage for trajectory tracking, and its longer settling time inevitably leads to larger average errors. In particular, the settling time of joint angle 2 is more than 4 s (see Figure 13b). Moreover, the vibration suppression of the first-order mode becomes worse during the transient process when the ASMC is used in SSS and the other controllers remain the same as before.

In summary, the composite controller proposed in this paper achieves satisfactory trajectory tracking of the manipulator under six different cases while the controller parameters are held constant. Additionally, the controller is robust to system uncertainties regardless of how the payload mass changes. Compared with the ASMC, which offers the same robustness, the ADP has a shorter settling time.

6. Conclusions

This paper discusses the dynamics modeling and trajectory tracking control of HDFRMS in practical engineering applications. The dynamics of the manipulator system, which are modeled using the assumed mode method (AMM) and the Lagrange principle, are decomposed, based on SPT, into a second slow, a second fast and a first fast subsystem, which describe the rigid motion, flexible vibration and servo-hydraulic-driven control, respectively. The control laws of the subsystems, which have independent state variables, can be designed separately. The adaptive dynamic programming trajectory tracking control with a critic-only policy iteration algorithm, the robust optimal vibration control and the adaptive sliding mode servo control are established using the Lyapunov stability theory, even though the bound of the total uncertainty in HDFRMS is unknown. Finally, numerical simulations demonstrate that the composite controller is both effective and robust.

Although it was not the focus of this paper, we believe that the cross-scale decoupling method can be extended to rigid–flexible, macro–micro, and electro–hydraulic complex multi-body dynamics systems. Moreover, the selection of perturbation parameters and their influence on the system should be studied further. In future research, we will also investigate how to improve the robustness of ADP-based control when friction in the hydraulic cylinders is present in the actual manipulator servo system.

Author Contributions

Conceptualization, X.W. and Z.T.; methodology, Z.T.; software, J.X.; validation, X.W., J.Y. and Z.T.; formal analysis, J.X.; investigation, Z.T.; resources, Z.T.; data curation, X.W.; writing—original draft preparation, X.W. and Z.T.; writing—review and editing, J.Y. and Z.T.; visualization, J.X.; supervision, X.W.; project administration, Z.T.; funding acquisition, J.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by the Natural Science Foundation of Jilin Province under Grant 20220101120JC, in part by the Science Research Planning Project of Jilin Province Department of Education under Grant JJKH20221007KJ, in part by the Guiding Project for Tackling Key Scientific and Technological Problems in Quzhou under Grant 2021Z219 and in part by the Zhejiang Province Basic Public Welfare Research Project under Grant LGC22E050006.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Yang, H.J.; Liu, J.K. Distributed piezoelectric vibration control for a flexible-link manipulator based on an observer in the form of partial differential equations. J. Sound Vib. 2016, 363, 77–96.
Lu, E.; Li, W.; Yang, X.F.; Wang, Y.Q.; Liu, Y.F. Dynamic modeling and analysis of a rotating piezoelectric smart beam. Int. J. Struct. Stab. Dyn. 2018, 18, 1850003.
Liu, Y.F.; Li, W.; Yang, X.F. Coupled dynamic model and vibration responses characteristic of a motor-driven flexible manipulator system. Mech. Sci. 2015, 6, 235–244.
Xu, B.; Yuan, Y. Two performance enhanced control of flexible-link manipulator with system uncertainty and disturbances. Sci. China Inf. Sci. 2017, 60, 050202.
Ju, J.Y.; Li, W.; Wang, Y.Q. Vibration observation for a translational flexible-link manipulator based on improved Luenberger observer. J. Vibroeng. 2016, 18, 238–249.
Su, Y.; Mueller, P.C.; Zheng, C. Global asymptotic saturated PID control for robot manipulators. IEEE Trans. Control Syst. Technol. 2010, 18, 1280–1288.
Zamora-Gomez, G.; Zavala-Río, A.; López-Araujo, D.; Santibáñez, V. Further results on the global continuous control for finite-time and exponential stabilisation of constrained-input mechanical systems: Desired conservative force compensation and experiments. IET Control Theory Appl. 2019, 13, 159–170.
Riad, A.; Meddahi, H. A fast adaptive artificial neural network controller for flexible link manipulators. Int. J. Adv. Comput. Sci. Appl. 2016, 7, 298–308.
Li, F.; Zhang, Z.; Wu, Y.; Chen, Y.; Liu, K.; Yao, J. Improved fuzzy sliding mode control in flexible manipulator actuated by PMAs. Robotica 2022, 40, 2683–2696.
Dog, M.; Istefanopulos, Y. Optimal nonlinear controller design for flexible robot manipulators with adaptive internal model. IET Control Theory Appl. 2007, 1, 770–778.
Bastos, G. A non-inherent parametric estimation for dynamical equivalence of flexible manipulators. Optim. Control Appl. Methods 2022, 43, 825–841.
Mehrez, M.W.; El-Badawy, A.A. Effect of the joint inertia on selection of under-actuated control algorithm for flexible-link manipulators. Mech. Mach. Theory 2010, 45, 967–980.
Lu, E.; Li, W.; Yang, X.; Liu, Y.F. Modelling and composite control of single flexible manipulators with piezoelectric actuators. Shock Vib. 2016, 7, 2689178.
Xu, B. Composite learning control of flexible-link manipulator using NN and DOB. IEEE Trans. Syst. 2018, 48, 1979–1985.
Siciliano, B.; Book, W.J. A singular perturbation approach to control of lightweight flexible manipulators. Int. J. Robot. Res. 1988, 7, 79–90.
Cheng, X.; Zhang, Y.J.; Liu, H.S.; Wollherr, D.; Buss, M. Adaptive neural backstepping control for flexible-joint robot manipulator with bounded torque inputs. Neurocomputing 2021, 458, 70–86.
Xu, B.; Shi, Z.K.; Yang, C.G. Composite fuzzy control of a class of uncertain nonlinear systems with disturbance observer. Nonlinear Dyn. 2015, 80, 341–351.
Cheng, X.; Liu, H.S.; Zeng, Z.; Lu, W.K. Robust fuzzy sliding mode control and vibration suppression of free-floating flexible-link and flexible-joints space manipulator with external interference and uncertain parameter. J. Dyn. Syst. Meas. Control Trans. ASME 2022, 144, 1004–1015.
Xie, L.M.; Yu, X.Y.; Chen, L. Saturated Output Feedback Control for Robot Manipulators With Joints of Arbitrary Flexibility. Robotica 2022, 40, 997–1019.
Dindorf, R.; Wos, P. A Case Study of a Hydraulic Servo Drive Flexibly Connected to a Boom Manipulator Excited by the Cyclic Impact Force Generated by a Hydraulic Rock Breaker. IEEE Access 2022, 10, 7734–7752.
Franco, E.; Garriga-Casanovas, A.; Tang, J.; Baena, F.R.; Astolfi, A. Adaptive energy shaping control of a class of nonlinear soft continuum manipulators. IEEE/ASME Trans. Mechatron. 2022, 27, 280–291.
Zeng, H.; Sepehri, N. On tracking control of cooperative hydraulic manipulators. Int. J. Control 2007, 80, 454–469.
Zhang, X.; Shi, G. Dual extended state observer-based adaptive dynamic surface control for a hydraulic manipulator with actuator dynamics. Mech. Mach. Theory 2022, 169, 104647.
Li, S.; Zhu, K.; Chen, L.; Yan, Y.; Guo, Q. Variable Structure Disturbance Observer Based Dynamic Surface Control of Electrohydraulic Systems with Parametric Uncertainty. Energies 2022, 15, 1671.
Irani, A.N.; Talebi, H.A. Tip tracking control of a rigid-flexible manipulator based on deflection estimation using neural network. In Proceedings of the ISSNIP Biosignals and Biorobotics Conference 2011, Vitoria, Brazil, 6–8 January 2011; pp. 1–6.
Merritt, H.E. Hydraulic Control Systems; Wiley: New York, NY, USA, 1967.
Zhao, J.; Wang, T.; Pedrycz, W.; Wang, W. Granular prediction and dynamic scheduling based on adaptive dynamic programming for the blast furnace gas system. IEEE Trans. Cybern. 2019, 51, 2201–2214.
Wang, X.; Quan, Z.; Zhang, J. Optimal 3-dimension trajectory-tracking guidance for reusable launch vehicle based on back-stepping adaptive dynamic programming. Neural Comput. Appl. 2022, 35, 5319–5334.
Ren, X.; Li, H. Adaptive dynamic programming-based feature tracking control of visual servoing manipulators with unknown dynamics. Complex Intell. Syst. 2022, 8, 255–269.
Hu, C.; Zhao, L.; Qu, G. Event-Triggered Model Predictive Adaptive Dynamic Programming for Road Intersection Path Planning of Unmanned Ground Vehicle. IEEE Trans. Veh. Technol. 2021, 70, 11228–11243.
Xia, H.; Zhao, B.; Li, Y. Optimal tracking control for reconfigurable manipulators based on critic-only policy iteration algorithm. In Proceedings of the 2017 36th Chinese Control Conference (CCC), Dalian, China, 26–28 July 2017; pp. 2616–2621.
Liu, D.; Wang, D.; Li, H. Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach. IEEE Trans. Neural Netw. Learn. Syst. 2014, 25, 411–428.
Zhao, B.; Liu, D.; Li, Y. Online fault compensation control based on policy iteration algorithm for a class of affine nonlinear systems with actuator failures. IET Control Theory Appl. 2016, 10, 1816–1823.
Zhang, H.; Cui, L.; Zhang, X.; Luo, Y. Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Trans. Neural Netw. 2011, 22, 2226–2236.
Abu-Khalaf, M.; Lewis, F.L. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 2005, 4, 779–791.
Kuo, C.W.; Tsai, C.C.; Lee, C.T. Intelligent leader-following consensus formation control using recurrent neural networks for small-size unmanned helicopters. IEEE Trans. Syst. Man Cybern. Syst. 2019, 51, 1288–1301.
Tang, Z.G.; Shan, X.H.; Li, S.T. Dynamic Modeling and Extension Adaptive Sliding Mode Control of Onboard Craning Manipulator. IFAC-PapersOnLine 2020, 53, 530–535.

Table 1. Notation, frequently used symbols of mechanical subsystem.

Symbol	Definition	Symbol	Definition
$L_{i}$	length of ith link	$r_{1}$	upper surface radius of link 1
$θ_{i}$	angle of ith joint	$r_{2}$	lower surface radius of link 1
$m_{i}$	mass of the ith actuator	$M$	mass of payload
$ρ_{i}$	mass density per unit length of ith link	$E I$	elastic stiffness
$L_{11}$ $L_{21}$	installation location of the first hydraulic cylinder	$ω (r, t)$	elastic deformation of flexible link 3
$L_{22}$ $L_{31}$	installation location of the second hydraulic cylinder	$Φ_{j} (r)$	jth mode shape function of flexible link 3
$g$	acceleration of gravity	$q_{j} (t)$	jth modal displacement of flexible link 3

Table 2. Notation, frequently used symbols of hydraulic subsystem.

Symbol	Definition	Symbol	Definition
$P_{s}$	supply pressure	$x_{v}$	spool displacement
$P_{r}$	return pressure	$y$	piston displacement
$P_{L}$	payload pressure	$L$	hydraulic cylinder stroke
$β_{e}$	bulk modulus	$w$	area gradient of the valve
$C_{d}$	flow coefficient of the servo-valve port	$I$	servo valve current
$V_{t}$	equivalent volume	$K_{q}$	flow gain coefficient
$K_{c}$	flow/pressure coefficient	$Q_{L}$	load flow of hydraulic cylinder/motor
$P_{1}$ $P_{2}$	left and right actuator chamber pressure	$Q_{1}$ $Q_{2}$	left and right actuator chamber fluid flow
$D$	volume displacement of the motor	$V_{1}$ $V_{2}$	left and right actuator chamber volume
$C_{t m}$	equivalent leakage coefficient	$A_{1}$ $A_{2}$	left and right effective piston area

Table 3. HDFRMS structure parameters.

Notation	Value	Unit	Notation	Value	Unit	Notation	Value	Unit
$L_{1}$	1.8	m	$ρ_{1}$	20	kgm⁻¹	$C_{d i}$	0.85	----
$L_{2}$	2.5	m	$ρ_{2}$	40	kgm⁻¹	$C_{d}$	0.8	----
$L_{3}$	6	m	$ρ_{3}$	40	kgm⁻¹	$C_{t m 1}$	$7 \times 10^{- 13}$	m⁵/N·s
$L_{11}$	1	m	$A_{1}$	0.015	m²	$C_{t m i}$	$5 \times 10^{- 13}$	m⁵/N·s
$L_{21}$	0.2	m	$A_{2}$	0.02	m²	$P_{s}$	$7 \times 10^{6}$	Pa
$L_{22}$	1	m	$E I$	$10^{7}$	Nm²	$w_{i}$	$0.08$	m
$L_{31}$	0.2	m	$V_{t}$	0.002	m³	$w$	0.04	m
$r_{1}$	0.2	m	$K_{c i}$	$6 \times 10^{- 13}$	m³/pa·s	$β_{e}$	$7 \times 10^{8}$	Nm⁻²
$r_{2}$	0.4	m	$K_{c}$	$4 \times 10^{- 13}$	m³/pa·s	$D$	0.2	m²
$m_{2}$	5	kg	$K_{i i}$	$10$	cm/A	$L$	1.2	m
$m_{3}$	8	kg	$K_{i}$	$8$	cm/A	$ρ$	$870$	kgm⁻³

Table 4. Controller performance comparisons with uncertainties.

Error Criterion	Joint	Without Payload	Half Payload	Full Payload	Gradual Change	Sudden Change (2 s)	Sudden Change (6 s)
RMSE ($10^{- 3}$)	1	1.29	1.16	0.97	1.03	1.12	1.12
RMSE ($10^{- 3}$)	2	1.76	1.63	1.34	1.39	1.61	1.59
RMSE ($10^{- 3}$)	3	2.24	2.07	1.79	1.92	2.03	2.01
ISE ($10^{- 4}$)	1	5.43	5.18	4.91	4.99	5.05	5.05
ISE ($10^{- 4}$)	2	5.88	5.50	5.37	5.43	5.50	5.50
ISE ($10^{- 4}$)	3	6.96	6.74	6.44	6.62	6.69	6.67
IAE ($10^{- 3}$)	1	3.15	2.85	2.74	2.59	2.77	2.77
IAE ($10^{- 3}$)	2	3.87	3.16	3.39	3.24	3.52	3.51
IAE ($10^{- 3}$)	3	4.02	3.66	3.93	3.69	3.80	3.80

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).