Featured Application
The proposed method is suitable for application in robotic arm systems.
Abstract
This paper focuses primarily on adaptive dynamic programming (ADP)-based tracking control of the hydraulic-driven flexible robotic manipulator system (HDFRMS) with varying payloads and uncertainties via singular perturbation theory (SPT). Firstly, the dynamics is derived using a driven Jacobin matrix, which represents the coupling between the hydraulic servo-driven system and rigid–flexible manipulator established using the assumed mode method and Lagrange principle. Furthermore, the whole dynamic model of the manipulator system is decoupled into a second slow subsystem (SSS), a second fast subsystem (SFS) and a first fast subsystem (FFS). The three subsystems can describe a large range of movement, flexible vibration and electro-hydraulic servo control, respectively. Hereafter, an adaptive dynamic programming trajectory tracking control law with a critic-only policy iteration algorithm is presented in the second slow timescale, while both robust optimal control (ROC) in the second first timescale and adaptive sliding mode control (ASMC) in the first fast timescale are also designed using the Lyapunov stability theory. Finally, the numerical simulations are carried out to illustrate the rightness and robustness of the singular perturbation decomposition and proposed composite control algorithm.
1. Introduction
The hydraulic-driven flexible robotic manipulator system (HDFRMS) has wide implementation potentials in a large number of engineering fields human can hardly deal with directly, such as forest exploitation, heavy payload motion control, construction work, mobile equipment applications and other industrial areas [,,], since their energy consumption and weight are lower, but the speed is higher.
Unlike rigid manipulators, the dynamics of flexible manipulators is the nonlinear rigid–flexible coupling infinite-order model [,]. Indeed, many control methods have been presented for flexible multibody dynamics system, in particular flexible link manipulators and flexible joint manipulators, involving PID control [,], neural network control [], sliding mode control [], adaptive control [], optimal control [], etc. Nevertheless, ill-conditioned numerical issues and inaccurate or unreliable control will occur, because these dynamic model-based control methods are directly designed ignoring multiple timescales in dynamics. Singular perturbation theory (SPT) is used to deal with the problem of control under different timescales.
The two-order dynamic model of flexible systems can be decoupled into the slow subsystem (SS) in slow timescale, which is the traditional timescale, and the fast subsystem (FS) in new timescale, namely fast timescale based on SPT. In [,] the flexible link manipulator system is decomposed through the perturbation parameter that the small-time constant is defined as, and two sub-controllers are designed. The composite control consisting of fuzzy sliding mode control in SS and linear quadratic regulator (LQR) optimal control in FS are proposed in []. Additionally, in [], the slow controller is substituted by the learning controller and the disturbance observer to improve the trajectory tracking precision. An adaptive non-saturated control is preferentially considered to handle actuator saturation involving dynamics parameters in the flexible joint manipulator system, where there are any parametric uncertainties [,]. Additionally, the adaptive neural backstepping control in SS is presented as the uncertain nonlinearity compensator []. Furthermore, the fuzzy control [], the robust fuzzy sliding mode control [] and the adaptive high-gain Kalman filter [] are investigated to improve robustness.
Compared with the electrical-driven manipulator, hydraulic-driven manipulators have higher power to handle heavier payloads whose weights are greater than 100 kg, and they have lower stiffness and response time. However, the order of their dynamic equations that are more complicated is raised from the second order to the third order. It is so difficult to control them because of electro-hydraulic coupling and inherent elastic vibration. A hybrid controller including a feedforward control and feedback control [], and model-based adaptive energy shaping controller [] are proposed in the hydraulic manipulator system. Many observers, such as the adaptive observer and variable structure disturbance observer, are also designed to handle uncertainties and vibrations [,,]. However, the description of dynamics is not complete yet. That is, flexible vibration or hydraulic drive has always been ignored in past controller designs. Therefore, the control method which can track desired trajectories precisely and suppress flexible vibrations based on the complete electro-hydraulic coupling model needs be presented.
In the work, we discuss an ADP-based trajectory tracking control law and singular perturbation decoupling for the HDFRMS with uncertainties and varying payloads. The main contributions include the following three aspects.
(1) The whole third-order dynamic model including three link rigid–flexible manipulator, asymmetric four-way valve controlled hydraulic cylinder and hydraulic actuator is derived. It is more beneficial and complete than the actual engineering system, since both the flexible vibration and hydraulic drive are considered in the dynamic model.
(2) A novel singular perturbation decoupling method for the electro-hydraulic servo flexible multibody dynamics system is presented. Unlike the traditional method that contains one perturbation parameter and two timescales, two singular perturbation parameters are used to establish three timescales.
(3) An optimal control method based on the critic-only policy iteration algorithm for tracking control is proposed, which simplifies the controller structure of ADP. The controller can improve the control precision and optimize the energy consumption, because the model network and control network of traditional ADP are not considered.
The rest of the paper is organized as follows. We derive the whole dynamics of the manipulator system in Section 2. Three decoupled subsystems are obtained based on SPT in Section 3. Section 4 presents two tasks that are the design of composite control scheme and the closed-loop stability analysis at the same time. Some simulation results are exhibited in Section 5. Finally, Section 6 draws some conclusions.
2. Dynamic Modeling of HDFRMS
We consider the mechanical structure involving the first rigid link with a rotary joint, the second rigid link with a revolute joint and the last flexible link with another revolute joint (see Figure 1).
 
      
    
    Figure 1.
      Model of HDFRMS.
  
2.1. Mechanical Subsystem Model
Link 1 and 2 of HDFRMS are rigid. However, the third link, like a slender beam, is flexible. Its vibration will occur because the end-effector with heavy payload and manipulator system have large overall motions. Only the transverse vibration is considered, except for the axial deformation and shear deformation. Therefore, the Euler–Bernoulli beam theory is sufficient to describe the flexible deformation and vibration of the third link.
Concretely, the dynamic modeling of HDFRMS consists of two parts: mechanical subsystem modeling and hydraulic subsystem modeling. 
We derive the dynamics of the mechanical subsystem using vibration analysis result-based AMM via Lagrange’s equation. The following assumptions are made based on the actual situation. Table 1 shows the nomenclature of the mechanical subsystem.
 
       
    
    Table 1.
    Notation, frequently used symbols of mechanical subsystem.
  
Assumption 1. 
The payload is held tightly by the manipulator end-effector, and their contact is static.
Assumption 2. 
The mass of actuators focuses on joints of the manipulator.
Let  and  be the inertial Cartesian coordinate and the moving coordinates fixed on the robotic manipulator, respectively. Figure 2 illustrates the schematic diagram of four primary coordinate systems for HDRMES.
 
      
    
    Figure 2.
      Schematic diagram of rigid–flexible manipulator.
  
Considering AMM, we find the elastic displacement  of the third link as:
      
        
      
      
      
      
    
The higher order modes are ignored, and only previous order modes are considered in practical engineering application, because they can make greater impacts on system performances.
When flexible link 3 is conceived as cantilever beam model, one end of this link is supported simply, and the other end is free []. Therefore, we can obtain the elastic displacement
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where , , .  is position of any point on the third link, and .
Considering the coordinate system and the kinematics of three link rigid–flexible manipulators, the kinetic energy of the mechanical subsystem can be obtained as
		
      
        
      
      
      
      
    
Including the elastic potential energy of flexible link 3, gravitational potential energy of manipulators and payloads, the potential energy of the mechanical subsystem can be derived as
        
      
        
      
      
      
      
    
        where , , , , , , , , , ,  and .
Defining a Lagrange function , which consists of total potential energy  and total kinetic energy , we get .
According to the following Lagrange equation []
        
      
        
      
      
      
      
    
        the dynamic model of the rigid–flexible manipulator without the viscous damping and the external interference can be expressed as
        
      
        
      
      
      
      
    
        where  is the  positive definite, symmetric, time-varying generalized inertia matrix,  is the stiffness matrix of the flexible link 3,   consists of Coriolis force, centrifugal force and gravity,  is the joint angles,  is the mode coordinates and  is the control torque. 
Remark 1. 
Utilizing the Lagrange equation, p contains joint angles and elastic displacement . The joint angles and control torque are considered when . Otherwise, we have
      
        
      
      
      
      
    
2.2. Hydraulic Subsystem Model
Joint 1 is a rotary joint, which is driven by a rotary hydraulic motor. Joints 2 and 3 are revolute joints, which are driven by a linear asymmetric hydraulic cylinder. The manipulator needs to move the payload along some predefined trajectory in practice.
The hydraulic-driven subsystem consists of a hydraulic cylinder and hydraulic motor. Additionally, Table 2 shows the nomenclature of the hydraulic subsystem. The asymmetric four-way valve-controlled hydraulic cylinder is a kind of hydraulic actuator converting hydraulic energy into linear motion mechanical energy as shown in Figure 3.
 
       
    
    Table 2.
    Notation, frequently used symbols of hydraulic subsystem.
  
 
      
    
    Figure 3.
      Schematic diagram of asymmetric hydraulic cylinder.
  
Actually, the time constant of mechanical systems is much greater than the hydraulic serve-valve time constant; therefore, the servo valve spool displacement  is proportional to the control input by ignoring the dynamics of the servo valve. We obtain , where  is the proportional valve gain and  is the amplifier gain.
Considering the law of cosines, depending on the installation location of hydraulic cylinder, we have
        
      
        
      
      
      
      
    
        where  is the matrix relating the actuator linear displacement to the link angular displacement. 
Additionally, the details of this matrix  are described below.
        
      
        
      
      
      
      
    
According to the relationships of force balances and the basic theory of the hydraulic system [], we obtain
        
      
        
      
      
      
      
    
Applying (9) and (11), we can derive that
        
      
        
      
      
      
      
    
        where .
Similarly, the kinetics of the valve-controlled hydraulic motor can be derived as
        
      
        
      
      
      
      
    
        where .
By simple calculation of (13), we can obtain the relationship between current and torque as follows.
        
      
        
      
      
      
      
    
From (12) and (14), the model of the hydraulic subsystem can be given as
        
      
        
      
      
      
      
    
From the previous derivations, both (7) and (9) are the dynamic model of the HDFRMS.
        
      
        
      
      
      
      
    
3. Cross-Scale Decoupling of Manipulator System
The dynamic model of the HDFRMS is essentially a strongly coupled third-order time-varying nonlinear differential equation. Therefore, it makes system control law design more difficult. In addition to this, it is difficult to guarantee system control performances due to the coupling when there are uncertainties and disturbances in system.
In this section, the complete dynamics for HDFRMS will be decomposed twice for control design based on singular perturbation. They are second slow subsystem, second fast subsystem and fast subsystem, respectively.
3.1. First Singular Perturbation Decomposition
Firstly, the complete dynamic model can be decoupled into the first stage SS that characterizes the rigid–flexible manipulator, as well as the first stage FS characterizing the hydraulic servo-driven part.
Defining the first singular perturbation parameter , it satisfies . Then, the dynamic model (16) can be rewritten as
        
      
        
      
      
      
      
    
        where , , .
When perturbation parameter  is small enough, the higher order infinitesimal  can be ignored. According to multiple timescale theories, the variables  and  can be represented as
        
      
        
      
      
      
      
    
        where subscript  and subscript  represent the first level of fast timescales and slow timescales, respectively.
Setting , we obtain the first stage SS, namely FSS
        
      
        
      
      
      
      
    
We now introduce the first level of fast timescale  in the boundary layer, and we have
        
      
        
      
      
      
      
    
The first-level slow variable  is considered constant near the boundary layer . According to (17) and (19), the first stage FS, namely FFS, can be expressed as
        
      
        
      
      
      
      
    
3.2. Second Singular Perturbation Decomposition
The FSS (19) will be decoupled again. We can have the SSS characterizing the large overall motion of the manipulator without the whole flexibilities, and the SFS is only about the flexible vibration of the third link.
We define the inverse matrix  of matrix  as
        
      
        
      
      
      
      
    
        where , .
Additionally, we make
        
      
        
      
      
      
      
    
Substituting (22) and (23) into the first stage of the slow subsystem (19), we can obtain
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
        where .
We define the second singular perturbation parameter  with . 
Comparing with the first singular perturbation parameter, they satisfy the following inequality: . 
We denote
        
      
        
      
      
      
      
    
Setting , substituting (26) into the (24) and (25), we can derive
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
By solving Equation (28), we obtain
        
      
        
      
      
      
      
    
		where subscript  represents the second stage slow timescale, and  is the control current under the second stage slow timescale.
Substituting (29) into Equation (27), the SSS can be derived as
        
      
        
      
      
      
      
    
Similarly, ignoring , the variables  and  can be represented as
        
      
        
      
      
      
      
    
        where the subscript  represents the second stage fast timescale.
We introduce the second level of fast timescale  in the boundary layer, and we have
        
      
        
      
      
      
      
    
The second level slow variable  is considered constant near the boundary layer . According to Equations (25), (29) and (31), SFS can be obtained as
        
      
        
      
      
      
      
    
        where  is the control current under the second fast timescale .
4. Control Design
4.1. ADP Trajectory Control
ADP is a newly emerging approximate optimum method in the field of optimal control, which is an important branch of machine learning. It has been applied in blast furnace gas systems [], reusable launch vehicles [], manipulators [], road intersection path planning [] and other fields, and achieved good results, especially in improving the tracking accuracy of reusable dynamic systems.
The structure of the traditional ADP control system consisting of action network, critic network and model network is shown in Figure 4, and the model network is not used when dynamic model is derived. In SSS, the trajectory tracking controller, whose optimal feedback control depends only on the gradient of the optimal cost function output from the critic network obtained by online iteration [,,], is designed based on ADP with critic network only. The controller that not only simplifies the training process, but also eliminates the approximation error between the two networks, can always improve the tracking accuracy.
 
      
    
    Figure 4.
      Basic idea of traditional ADP.
  
Considering the uncertainties and disturbances of the actual system, let , then SSS (30) is organized as
        
      
        
      
      
      
      
    
        where , , and  and  are both Lipschitz functions.
Assumption 3. 
The sum of uncertainties and disturbances  in the second slow subsystem has an unknown upper bound  , which is .
Let the desired trajectory and actual trajectory of the subsystem be  and , respectively, then the trajectory tracking error is
        
      
        
      
      
      
      
    
We define the performance index as follows
        
      
        
      
      
      
      
    
        where  is the utility function. There is ,  holds for all  and , and ,  are positive definite matrixes [].
Let  denote the desired control law, then
        
      
        
      
      
      
      
    
Therefore,
        
      
        
      
      
      
      
    
For SSS, the control law  includes two parts, desired control  and optimal control , which is
        
      
        
      
      
      
      
    
Therefore,
        
      
        
      
      
      
      
    
The optimal control  can ensure that the trajectory tracking error of the subsystem converges to the steady state in an optimal manner. 
Equation (30) can be rewritten as
        
      
        
      
      
      
      
    
        where  is the utility function, ,  holds for all e and .  and  is a set of allowable control sequences. 
Definition 1. 
For the second slow tracking error subsystem (40), if there exists a set of tolerance controls that are continuous and satisfy  when , then the subsystem is guaranteed byto converge on a compact set with a finite performance index function [,].
If the performance index (41) is continuously differentiable, then its infinitesimal form can be expressed as
        
      
        
      
      
      
      
    
        where , ,  is the partial derivatives of with respect to  which is .
We define the Hamiltonian function and the optimal performance index as
        
      
        
      
      
      
      
    
      
        
      
      
      
      
    
Clearly, with  meeting the
        
      
        
      
      
      
      
    
        where .
If  exists and is continuously differentiable, the optimal feedback control law can be solved by a single network evaluation structure strategy iterative algorithm with circular iterations as
        
      
        
      
      
      
      
    
Combining (42) and (45), we have
        
      
        
      
      
      
      
    
Figure 5 is the single network evaluation structure strategy iteration process. With the control strategy evaluation using Equation (42), based on the evaluation results using Equation (45) to find the optimal feedback control law, in the algorithm, to improve the system regulation effect and performance index function using neural network approximation, there are
        
      
        
      
      
      
      
    
        where  is the number of neurons in the hidden layer,  is the ideal neural network weight,  is the neural network activation function and  is the evaluation network approximation error. 
 
      
    
    Figure 5.
      Flowchart of critic-only policy iteration algorithm.
  
Then, we calculate performance index function gradient 
      
        
      
      
      
      
    
        where ,  denotes the gradients of  and , respectively. 
Substituting (49) into (42), we can obtain the Hamiltonian function
        
      
        
      
      
      
      
    
        where  is the residual error of the approximation neural network.
The definitions  and  are, respectively, the estimates and estimation errors of the evaluation neural network weights . Therefore, the output of the evaluation network  and its gradient are
        
      
        
      
      
      
      
    
The approximate Hamiltonian function is
        
      
        
      
      
      
      
    
The performance criterion that needs to be minimized for the neural network training process [] is
        
      
        
      
      
      
      
    
The weights are then updated using the gradient descent method, with
        
      
        
      
      
      
      
    
        where ,  is the evaluation network learning rate and .
Due to
        
      
        
      
      
      
      
    
        we obtain
        
      
        
      
      
      
      
    
Therefore, the error update rate of the weight estimation is
        
      
        
      
      
      
      
    
The ideal optimal feedback control law and the corresponding iterative control law are
        
      
        
      
      
      
      
    
Assumption 4. 
The desired control law  and  both have unknown upper bounds, which are
      
        
      
      
      
      
    
Theorem 1. 
Under the conditions of Assumptions 3 and 4, if the solution of the neural network-based  equation exists, considering the state space model (30) in SSS and the evaluation network weight update rate (57), if the optimal control law of the system trajectory tracking is chosen as
      
        
      
      
      
      
    this ensures that both the weight approximation error and the system trajectory tracking error are eventually consistent and bounded.
Proof.  
Considering the Lyapunov theory, we define a positive definite energy function
        
      
        
      
      
      
      
    
Derivative of time, we obtain
        
      
        
      
      
      
      
    
Because  is a  function, there must exist . When , it holds . According to assumption 1, we know that  and  are bounded, so we can set
        
      
        
      
      
      
      
    
        and then
        
      
        
      
      
      
      
    
Using the trigonometric inequality, we have
        
      
        
      
      
      
      
    
Clearly, in the collections
        
      
        
      
      
      
      
    
        in addition to them, and they must satisfy the conditions
        
      
        
      
      
      
      
    
Then, there is
        
      
        
      
      
      
      
    
Thus, considering the Lyapunov stability theory, both the joint angle tracking error in SSS and the neural network weight approximation error are eventually consistent and bounded.
This completes the proof. □
4.2. Robust Optimal Vibration Control
Aiming at designing the optimal control law using the quadratic form, the model of SFS is rewritten as
        
      
        
      
      
      
      
    
        where , , .
The control purpose is that the appropriate control law is proposed to adjust the system state to zero, that is, suppress elastic vibrations in SFS. It is easy to verify that it is completely controllable, and the optimal control method can be used, since SFS is a linear system. 
The quadratic performance index function is selected as follows:
      
        
      
      
      
      
    
The algebraic Riccati equation can be written as
        
      
        
      
      
      
      
    
Then, the optimal feedback control law is
        
      
        
      
      
      
      
    
A robust optimal control law with great dynamic performance is given below to ensure the stability of the system while there are uncertainties in SFS. Therefore, we obtain the dynamic equation with uncertainties
        
      
        
      
      
      
      
    
Theorem 2. 
For the second fast subsystem with parameter uncertainty, if the uncertainty satisfies
      
        
      
      
      
      
    where  is the minimum eigenvalue of  and  is the spectral radius of , and  is elected optimal control law (72), the closed loop asymptotic stability of SFS can be guaranteed.
Proof.  
Considering the Lyapunov theory, define the second positive definite energy function
		
      
        
      
      
      
      
    
Substituting (69), (72) and (73) into the time derivative of Equation (74), we obtain
        
      
        
      
      
      
      
    
Because both  and  are symmetric matrices, we have
        
      
        
      
      
      
      
    
Substituting inequality (76) into Equation (75), we have
        
      
        
      
      
      
      
    
When the inequality  holds,  will be less than 0. Hence, the second fast closed-loop subsystem is stable. 
This completes the proof. □
4.3. Adaptive Sliding Mode Servo Control
In the actual work of the hydraulic servo control system, there are some uncertain factors such as the perturbation of the elastic modulus of hydraulic oil, hydraulic oil leakage and friction of moving parts. They will directly affect the stability and dynamic characteristics of the hydraulic servo control subsystem. Therefore, the dynamics of FFS with uncertainties will be from Equation (21) to
        
      
        
      
      
      
      
    
        where  is the overall uncertainty, and it is bounded.
If  is the estimated value of , define the estimated error
        
      
        
      
      
      
      
    
We define the tracking error vector
        
      
        
      
      
      
      
    
        then select a sliding surface
        
      
        
      
      
      
      
    
Theorem 3. 
Considering the hydraulic servo FFS, the ASMC is presented as
      
        
      
      
      
      
    
      
        
      
      
      
      
    
Then, the closed-loop FFS is uniformly ultimately stable.
Proof.  
Considering the Lyapunov theory, we define the third positive definite energy function
		
      
        
      
      
      
      
    
Differentiate (84) with respect to time, and we can have
        
      
        
      
      
      
      
    
From (85), , so  is bounded. Considering the Lyapunov function as , the integration of  can be obtained as . Therefore, it is concluded that error can asymptotically converge to zero as  based on Barbalat Lemma. 
This completes the proof. □
4.4. Composite Control
According to the theoretical analysis above, the control law for the HDFRMS by (16) can be expressed as (60), (72), (82) and (83). Additionally, Figure 6 is the whole block diagram for the manipulator control closed-loop system. 
 
      
    
    Figure 6.
      Block diagram of HDFRMS with controllers.
  
In Figure 6, the desired angles and angle velocities  after trajectory planning are provided as input to the manipulator system. Meanwhile, three control outputs ,  and  can be individually calculated using information measured by the sensors such as photoelectric encoders, pressure sensors and strain gauges. However, these control outputs are then summed after timescale transformation. At last, the closed-loop HDFRMS with control schemes can be realized.
Theorem 4. 
The stabilities of SSS (30), SFS (33) and FFS (21) are guaranteed under adaptive dynamic programming trajectory control law (60), robust optimal vibration control law (72) and adaptive sliding mode servo control law (82) and (83), based on SPT. Therefore, the closed-loop stability of the whole manipulator system (16) can be guaranteed under the composite controller.
5. Simulations
In this section, some simulation results (implemented in MATLAB) are analyzed to indicate the effectiveness and robustness of the singular perturbation decoupling method and proposed composite control laws in previous parts. All the tests are conducted on the HDFRMS, whose dynamics is given in (16) with these structure parameters (see Table 3).
 
       
    
    Table 3.
    HDFRMS structure parameters.
  
In ADP trajectory control, the back propagation (BP) neural network, the critic neural network, is selected as 6-8-1 with 6 input neurons, 8 hidden neurons and 1 output neuron (see Figure 7). Additionally, its activation function, learning rate and initial weight values are selected as sigmoid function, , and , respectively. 
 
      
    
    Figure 7.
      The structure of BP neural network.
  
In addition, the parameters of robust optimal vibration control and adaptive sliding mode servo control can be given as , , , . 
The initial positions of the manipulator joints are selected as , and their desired trajectories as chosen as .
The sampled time instant is 0.1 ms. Time taken is 10 s. The simulations are illustrated in Figure 8, Figure 9, Figure 10, Figure 11, Figure 12 and Figure 13.

 
      
    
    Figure 8.
      Tracking performances and vibration suppression of the manipulator without uncertainties.
  
 
      
    
    Figure 9.
      Weights of critic neural network.
  

 
      
    
    Figure 10.
      Tracking performances and vibration suppression of the manipulator with uncertainties.
  

 
      
    
    Figure 11.
      Control current comparisons with full payload and uncertainties.
  

 
      
    
    Figure 12.
      Tracking performances and vibration suppression of the manipulator with uncertainties in different conditions with varying payload.
  

 
      
    
    Figure 13.
      Comparison with the ASMC in SSS.
  
5.1. Reference-Tracking Performance in System without Uncertainties
When there is no uncertainty in HDFRMS, the tracking performances of joints (see Figure 8a–c) and vibration suppression of the flexible link 3 (see Figure 8d–f) are tested. Both simulations and numerical results use the same control parameters under several different working conditions, including the end-effector without payload (), with half payload () and with full payload ().
Figure 8a–c indicates that all joint angles can quickly track to the prospective trajectories in 3 s or less. In particular, the trajectory tracking speed of joint 1 is faster than other joints because of the rotary joint, which is insensitive to vibrations. Figure 8d–f shows that both first-order mode and second-order mode can be controlled effectively, even if the vibration suppression needs to spend 3 s handling full payload. However, only forced vibration caused by a large range of joint motion exists in system. In addition, Figure 9 illustrates that the weights of the critic neural network converge to .
5.2. Reference-Tracking Performance in System with Uncertainties
To verify the robustness of HDFRMS closed-loop system, as shown in Figure 10 and Figure 11, the total uncertainties  in SSS and the uncertainties  in FFS are chosen as  and , respectively. Then, both the sine and cosine function are used to express parameter uncertainties in the manipulator system, and the random function represents external disturbances and noises.
Figure 10 shows the tracking performances of joints and vibration suppression of the third flexible arm in HDFRMS with uncertainties. Compared to the previous cases, the tracking time of joint angle 2, nearly 3.5 s, is the longest response time of the system according to Figure 10a–c. Furthermore, the tracking error fluctuation of joint angle 3 is less than 0.01 rad between 9.2 s and 9.5 s, because of the combined effects of system uncertainties, accumulated errors, full payload and motion coupling. However, the largest error fluctuating ranges can satisfy the requirements of steady state. Figure 10d–f shows that the vibration is also so small and can be suppressed quickly.
The chattering comparisons of control currents with full payload and uncertainties are shown in Figure 11. Clearly, the chattering can be weakened by the saturation function whose boundary layer is 0.01 instead of the sign function.
5.3. Reference-Tracking Performance in System with Varying Payloads and Uncertainties
To test if the composite controller is more adaptive in the condition of varying payloads, three working conditions are considered. The first working condition is that the payload mass is gradual changed from 600 kg to 0 kg, starting from 0 s to 6 s. The other working conditions are that the payload mass is suddenly changed from 600 kg to 200 kg at 6 s and 2 s.
Figure 12 shows the tracking performances and vibration suppression of the manipulator with uncertainties in different conditions with varying payloads. Table 4 is the composite controller performance comparisons with uncertainties, where three evaluation criteria for the trajectory tracking errors are adopted: the root mean squared error (RMSE), integral squared error (ISE), and integral absolute error (IAE) []. 
 
       
    
    Table 4.
    Controller performance comparisons with uncertainties.
  
Figure 12a–c indicates that the tracking error converges faster than the sudden change of mass, when the payload mass changes gradually. The settling time is about 1 s. In addition, the convergence rate of tracking error is almost the same whether the sudden time is 2 s or 6 s. Their error curves nearly coincide (see Figure 12a–c) and their RMSE, ISE and IAE are almost the same (see Table 4). Therefore, the sudden change in mass does not affect the tracking performance of manipulator system, whether in the transition process or in the steady state.
5.4. The Comparison with the ASMC in SSS
We chose an ASMC that can deal with uncertain systems to compare with ADP in SSS. The ASMC can be designed as
        
      
        
      
      
      
      
    
        where ,  and . The closed-loop stability proof with ASMC is similar to this paper [].
The payload mass is selected as 450 kg. The uncertainties and other controller parameters remain unchanged, consistent with previous simulations.
The simulation results can be seen in Figure 13, indicating that the transient response of the ADP in SSS is significantly faster than that of the ASMC. The ASMC has few superiorities for trajectory tracking, and the longer settling time will inevitably lead to larger average errors. In particular, the settling time of joint angle 2 is more than 4 s (see Figure 13b). Moreover, the vibration suppression of the first-order mode becomes worse in the transition process when ASMC is used in SSS and other controllers remain the same as before.
In summary, the composite controller proposed in this paper can adequately represent the manipulator trajectory tracking performances under six different cases because controller parameters always remain constant. Additionally, the control has greater robustness with system uncertainties, no matter how the payload mass is changed. Compared with the ASMC with the same robustness, the settling time of ADP is shorter.
6. Conclusions
This paper discusses the dynamics modeling and trajectory tracking control of HDFRMS in practical engineering applications. The dynamics of the manipulator system, which are modeled as AMM and the Lagrange principle, are decomposed into second slow, second fast and first fast subsystem describing the rigid motion, flexible vibration and servo-hydraulic-driven control, based on SPT. The control laws of all subsystems with independent state variables can be designed respectively. Additionally, the adaptive dynamic programming trajectory tracking control with critic-only policy iteration algorithm, robust optimal vibration control and adaptive sliding mode servo control are established by the Lyapunov stability theory, while the total uncertainty boundary in HDFRMS is unknown. Finally, numerical simulations successfully demonstrate that the composite controller is not only effective but also robust. 
Although it was not the focus of this paper, we believe that the cross-scale decoupling method can be extended to rigid–flexible, macro–micro, and electro–hydraulic complex multi-body dynamics systems. Moreover, the selection of perturbation parameters and their influences on the system should be further studied. Additionally, we will also discuss how to improve robustness for further research based on ADP, when frictions of hydraulic cylinder exist in the manipulator servo actual system.
Author Contributions
Conceptualization, X.W. and Z.T.; methodology, Z.T.; software, J.X.; validation, X.W., J.Y. and Z.T.; formal analysis, J.X.; investigation, Z.T.; resources, Z.T.; data curation, X.W.; writing—original draft preparation, X.W. and Z.T.; writing—review and editing, J.Y. and Z.T.; visualization, J.X.; supervision, X.W.; project administration, Z.T.; funding acquisition, J.X. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded in part by the Natural Science Foundation of Jilin Province under Grant 20220101120JC, in part by the Science Research Planning Project of Jilin Province Department of Education under Grant JJKH20221007KJ, in part by the Guiding Project for Tackling Key Scientific and Technological Problems in Quzhou under Grant 2021Z219 and in part by the Zhejiang Province Basic Public Welfare Research Project under Grant LGC22E050006.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Yang, H.J.; Liu, J.K. Distributed piezoelectric vibration control for a flexible-link manipulator based on an observer in the form of partial differential equations. J. Sound Vib. 2016, 363, 77–96. [Google Scholar] [CrossRef]
- Lu, E.; Li, W.; Yang, X.F.; Wang, Y.Q.; Liu, Y.F. Dynamic modeling and analysis of a rotating piezoelectric smart beam. Int. J. Struct. Stab. Dyn. 2018, 18, 1850003. [Google Scholar] [CrossRef]
- Liu, Y.F.; Li, W.; Yang, X.F. Coupled dynamic model and vibration responses characteristic of a motor-driven flexible manipulator system. Mech. Sci. 2015, 6, 235–244. [Google Scholar] [CrossRef]
- Xu, B.; Yuan, Y. Two performance enhanced control of flexible-link manipulator with system uncertainty and disturbances. Sci. China Inf. Sci. 2017, 60, 050202. [Google Scholar] [CrossRef]
- Ju, J.Y.; Li, W.; Wang, Y.Q. Vibration observation for a translational flexible-link manipulator based on improved Luenberger observer. J. Vibroeng. 2016, 18, 238–249. [Google Scholar]
- Su, Y.; Mueller, P.C.; Zheng, C. Global asymptotic saturated PID control for robot manipulators. IEEE Trans. Control Syst. Technol. 2010, 18, 1280–1288. [Google Scholar] [CrossRef]
- Zamora-Gomez, G.; Zavala-Río, A.; Lpez-Araujo, D.; Santibáñez, V. Further results on the global continuous control for finite-time and exponential stabilisation of constrained-input mechanical systems: Desired conservative force compensation and experiments. IET Control Theory Appl. 2019, 13, 159–170. [Google Scholar] [CrossRef]
- Riad, A.; Meddahi, H. A fast adaptive artificial neural network controller for flexible link manipulators. Int. J. Adv. Comput. Sci. Appl. 2016, 7, 298–308. [Google Scholar] [CrossRef]
- Li, F.; Zhang, Z.; Wu, Y.; Chen, Y.; Liu, K.; Yao, J. Improved fuzzy sliding mode control in flexible manipulator actuated by PMAs. Robotica 2022, 40, 2683–2696. [Google Scholar] [CrossRef]
- Dog, M.; Istefanopulos, Y. Optimal nonlinear controller design for flexible robot manipulators with adaptive internal model. IET Control Theory Appl. 2007, 1, 770–778. [Google Scholar]
- Bastos, G. A non-inherent parametric estimation for dynamical equivalence of flexible manipulators. Optim. Control Appl. Methods 2022, 43, 825–841. [Google Scholar] [CrossRef]
- Mehrez, M.W.; El-Badawy, A.A. Effect of the joint inertia on selection of under-actuated control algorithm for flexible-link manipulators. Mech. Mach. Theory 2010, 45, 967–980. [Google Scholar] [CrossRef]
- Lu, E.; Li, W.; Yang, X.; Liu, Y.F. Modelling and composite control of single flexible manipulators with piezoelectric actuators. Shock Vib. 2016, 7, 2689178. [Google Scholar] [CrossRef]
- Xu, B. Composite learning control of flexible-link manipulator using NN and DOB. IEEE Trans. Syst. 2018, 48, 1979–1985. [Google Scholar] [CrossRef]
- Siciliano, B.; Book, W.J. A singular perturbation approach to control of lightweight flexible manipulators. Int. J. Robot. Res. 1988, 7, 79–90. [Google Scholar] [CrossRef]
- Cheng, X.; Zhang, Y.J.; Liu, H.S.; Wollherr, D.; Buss, M. Adaptive neural backstepping control for flexible-joint robot manipulator with bounded torque inputs. Neurocomputing 2021, 458, 70–86. [Google Scholar] [CrossRef]
- Xu, B.; Shi, Z.K.; Yang, C.G. Composite fuzzy control of a class of uncertain nonlinear systems with disturbance observer. Nonlinear Dyn. 2015, 80, 341–351. [Google Scholar] [CrossRef]
- Cheng, X.; Liu, H.S.; Zeng, Z.; Lu, W.K. Robust fuzzy sliding mode control and vibration suppression of free-floating flexible-link and flexible-joints space manipulator with external interference and uncertain parameter. J. Dyn. Syst. Meas. Control Trans. Asme 2022, 144, 1004–1015. [Google Scholar]
- Xie, L.M.; Yu, X.Y.; Chen, L. Saturated Output Feedback Control for Robot Manipulators With Joints of Arbitrary Flexibility. Robotica 2022, 40, 997–1019. [Google Scholar] [CrossRef]
- Dindorf, R.; Wos, P. A Case Study of a Hydraulic Servo Drive Flexibly Connected to a Boom Manipulator Excited by the Cyclic Impact Force Generated by a Hydraulic Rock Breaker. IEEE Access 2022, 10, 7734–7752. [Google Scholar] [CrossRef]
- Franco, E.; Garriga-Casanovas, A.; Tang, J.; Baena, F.R.; Astolfi, A. Adaptive energy shaping control of a class of nonlinear soft continuum manipulators. IEEE/ASME Trans. Mechatron. 2022, 27, 280–291. [Google Scholar]
- Zeng, H.; Sepehri, N. On tracking control of cooperative hydraulic manipulators. Int. J. Control 2007, 80, 454–469. [Google Scholar] [CrossRef]
- Zhang, X.; Shi, G. Dual extended state observer-based adaptive dynamic surface control for a hydraulic manipulator with actuator dynamics. Mech. Mach. Theory 2022, 169, 104647. [Google Scholar]
- Li, S.; Zhu, K.; Chen, L.; Yan, Y.; Guo, Q. Variable Structure Disturbance Observer Based Dynamic Surface Control of Electrohydraulic Systems with Parametric Uncertainty. Energies 2022, 15, 1671. [Google Scholar]
- Irani, A.N.; Talebi, H.A. Tip tracking control of a rigid-flexible manipulator based on deflection estimation using neural network. In Proceedings of the ISSNIP Biosignals and Biorobotics Conference 2011, Vitoria, Brazil, 6–8 January 2011; pp. 1–6. [Google Scholar]
- Merritt, H.E. Hydraulic Control Systems; Wiley: New York, NY, USA, 1967. [Google Scholar]
- Zhao, J.; Wang, T.; Pedrycz, W.; Wang, W. Granular prediction and dynamic scheduling based on adaptive dynamic programming for the blast furnace gas system. IEEE Trans. Cybern. 2019, 51, 2201–2214. [Google Scholar] [CrossRef]
- Wang, X.; Quan, Z.; Zhang, J. Optimal 3-dimension trajectory-tracking guidance for reusable launch vehicle based on back-stepping adaptive dynamic programming. Neural Comput. Appl. 2022, 35, 5319–5334. [Google Scholar]
- Ren, X.; Li, H. Adaptive dynamic programming-based feature tracking control of visual servoing manipulators with unknown dynamics. Complex Intell. Syst. 2022, 8, 255–269. [Google Scholar] [CrossRef]
- Hu, C.; Zhao, L.; Qu, G. Event-Triggered Model Predictive Adaptive Dynamic Programming for Road Intersection Path Planning of Unmanned Ground Vehicle. IEEE Trans. Veh. Technol. 2021, 70, 11228–11243. [Google Scholar] [CrossRef]
- Xia, H.; Zhao, B.; Li, Y. Optimal tracking control for reconfigurable manipulators based on critic-only policy iteration algorithm. In Proceedings of the 2017 36th Chinese Control Conference (CCC), Dalian, China, 26–28 July 2017; pp. 2616–2621. [Google Scholar]
- Liu, D.; Wang, D.; Li, H. Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach. IEEE Trans. Neural Netw. Learn. Syst. 2014, 25, 411–428. [Google Scholar]
- Zhao, B.; Liu, D.; Li, Y. Online fault compensation control based on policy iteration algorithm for a class of affine nonlinear systems with actuator failures. IET Control Theory Appl. 2016, 10, 1816–1823. [Google Scholar] [CrossRef]
- Zhang, H.; Cui, L.; Zhang, X.; Luo, Y. Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Trans. Neural Netw. 2011, 22, 2226–2236. [Google Scholar] [CrossRef]
- Abu-Khalaf, M.; Lewis, F.L. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 2005, 4, 779–791. [Google Scholar] [CrossRef]
- Kuo, C.W.; Tsai, C.C.; Lee, C.T. Intelligent leader-following consensus formation control using recurrent neural networks for small-size unmanned helicopters. IEEE Trans. Syst. Man Cybern. Syst. 2019, 51, 1288–1301. [Google Scholar] [CrossRef]
- Tang, Z.G.; Shan, X.H.; Li, S.T. Dynamic Modeling and Extension Adaptive Sliding Mode Control of Onboard Craning Manipulator. IFAC-PapersOnLine 2020, 53, 530–535. [Google Scholar]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
