Similar to most flexible actuators, PAM actuators exhibit significant hysteresis characteristics between displacement and pressure. This hysteresis effect impacts the accuracy of the PAM position modeling and its application in bionics. Therefore, it is necessary to establish a high-precision hysteresis mathematical model that is widely applicable under different input conditions to comprehensively and accurately describe this complex hysteresis behavior. To improve the applicability and accuracy of the PAM hysteresis model, this section proposes an FN–QUPI model to establish the hysteresis relationship between pneumatic pressure and displacement in PAMs. First, the QUPI (quadratic unparallel Prandtl–Ishlinskii) operator is introduced based on the UPI operator, where the QUPI operator replaces the linear envelope function with a quadratic function to enhance modeling accuracy. Then, the QUPI operator is incorporated into the nonlinear autoregressive moving average model to describe the complex hysteresis of the PAM, and a feedforward neural network is used to build the nonlinear function relationship of the NARMAX (nonlinear autoregressive moving average) model, forming the FN–QUPI model. Since the model reduces the number of unknown parameters, the particle swarm optimization algorithm is employed to solve all model parameters, enabling the model to adapt to dynamic system changes and achieve higher accuracy. Finally, the inverse model of QUPI is introduced.
2.1. The FN–QUPI Model
Currently, the classic PI (Prandtl–Ishlinskii) model is widely used in the field of pneumatic artificial muscle hysteresis modeling. Its advantages lie in having a specific inverse model that supports analytical solutions while maintaining low structural complexity and ease of application. However, this model struggles to describe the asymmetric hysteresis of the PAM. The shortcomings of the classic PI model can be addressed through appropriate optimization methods, such as introducing a nonlinear envelope function within the play operator. This enhancement improves the model’s adaptability to different hysteresis phenomena, providing higher robustness when handling complex inputs. A representative example is the CPI model [
18], where the descending edge of the CPI (classical Prandtl–Ishlinskii) operator is multiplied by a weight coefficient (greater than zero) to yield the UPI (unparallel Prandtl–Ishlinskii) operator [
19], making it easier to apply in the modeling of complex hysteresis phenomena. The UPI operator can be expressed as:
In the equation, r represents the dead-zone threshold, u(t) represents the input signal at time t, and αj represents the slope of the descending edge of the UPI operator.
Considering that the quadratic function and the hysteresis curve of the pneumatic artificial muscle have similar shapes, and given its basic properties of being invertible, bounded, and continuous, a quadratic function is adopted as the envelope function for the UPI model in this study. This model is named the QUPI model. By integrating this function with the UPI operator, the descending and ascending edges of complex hysteresis phenomena can be obtained. The schematic diagram of the QUPI operator is shown in
Figure 1 (the horizontal axis represents the pneumatic muscle pressure
p, and the vertical axis represents the pneumatic muscle displacement
y). The envelope function of the basic operator in the QUPI model consists of two quadratic parabolas, with the axis of symmetry aligned with the starting points of the ascending and descending branches of the hysteresis curve, making it more consistent with hysteresis characteristics. Based on the analytical expression of a quadratic function, the analytical expressions for a single QUPI operator can be derived when the pressure increases and decreases, as shown in Equations (2) and (3). Subsequently, the final QUPI model is formed by stacking Na × Nb small QUPI operators, and the structure of the QUPI model is shown in
Figure 2. The computational formula of the QUPI model is given in Equation (4).
In the equation, p represents the pneumatic pressure of the pneumatic muscle, and y represents the displacement of the pneumatic muscle. p0 and pm represent the minimum and maximum values of the pressure data, respectively; y0 and ym represent the minimum and maximum values of the displacement data, respectively; r1 and r2 represent the lengths of the edges of the line segments. dij denotes the weight of the operator. In this study, Na and Nb are set to 10.
Based on the maximum and minimum lengths, as well as the minimum and maximum pressures from the collected hysteresis data, the parameters y0, ym, p0, and pm in the QUPI model are determined, thereby reducing the number of parameters required for model identification. The only parameters that require identification in the entire QUPI model are the weight terms of the QUPI operators, which reduces the computational load of the model.
Since the QUPI model can only describe the static hysteresis behavior of the PAM under specific conditions, an additional nonlinear model is needed to capture the dynamic characteristics of the PAM. The NARMAX model, a nonlinear form of the autoregressive moving average model, is a commonly used black-box model for describing nonlinear systems. It effectively addresses system nonlinearity and dynamic issues without relying on the internal structure or parameters of the system. For the PAM actuator, its output displacement is mainly influenced by two factors: the input voltage at previous time steps and the current driving voltage. The NARMAX model can accurately describe the memory effect and rate-dependent characteristics of the PAM’s hysteresis behavior. Considering the impact of load on the PAM’s dynamic characteristics, load variations are introduced into the input part of the NARMAX model. Its specific expression is as follows:
In the equation, represents the output of the NARMAX model, f(t) and represent the output of the QUPI model at time t, and represents the PAM load. na, nb, and nc are the maximum delay times of , f(t) and m(t), respectively. Accordingly, the input vector of the dynamic submodel contains the present and historical values of the output and load of the hysteresis submodel, as well as the historical output displacement values, which can capture the dynamic information of the driving frequency and load.
In this study, the nonlinear function of the NARMAX model is established through a feedforward neural network (FNN), which ensures that the model can meet the requirements of dynamic system changes. The FNN has been widely applied in complex nonlinear systems, which is directly related to its excellent learning capability. Aside from these characteristics, the FNN performs well in simulating dynamic systems and is, therefore, widely used in nonlinear system analysis. Given that hysteresis properties exhibit multi-valued mapping, to improve the accuracy of the hysteresis model, the FN–QUPI model, i.e., the FNN–NARMAX model, is designed in this study. Its structure is shown in
Figure 3, where
u(
t) and
y(
t) represent the system’s input and output, respectively, and
y(
t) is the output of the NARMAX model. Z
−1 denotes a unit delay. Its magnitude represents the output result from the previous iteration, corresponding to the parameter
. The expression for the FNN is as follows:
In the equation, wij and wj are the weights between the input and hidden layers, along with between the hidden and output layers, separately; I and O are the inputs and outputs of the FNN, respectively; Sj(k) is the input of the jth neuron within the hidden layer. Furthermore, m and n indicate the number of neurons in the input and hidden layers of the FNN separately. In this paper, the activation function φ is chosen as an upward-opening univariate quadratic function. This is because the increase in the function value as the independent variable increases correspond roughly to the trend in the first half of the hysteresis, while the decrease in the function value as the independent variable decreases aligns with the trend in the latter half of the hysteresis.
The FN–QUPI model is a composite model that combines a neural network with an improved UPI operator to address the hysteresis nonlinearity present in PAM systems. This model offers significant advantages in modeling complex hysteresis effects, enhancing control accuracy, and improving system adaptability. (1) High-precision hysteresis compensation: The hysteresis phenomenon in PAM systems often leads to reduced control accuracy. The FN–QUPI model, with its refined hysteresis modeling capabilities, can effectively compensate for this issue. (2) Strong robustness and adaptability under complex conditions: The robustness of the FN–QUPI model is reflected in its ability to adapt well to various operating conditions. Whether under sinusoidal input, complex harmonic input, or step signal input, the FN–QUPI model can maintain high prediction accuracy and stability. This robustness is especially beneficial in dynamic industrial or medical environments. When faced with sudden load changes, pressure variations, or other disturbance factors, the FN–QUPI model can quickly adapt, ensuring the system continues to operate normally. (3) Reduced system lag and improved dynamic response speed: Due to the inherent lag in PAM systems, there is often a noticeable response delay under rapid dynamic input. The FN–QUPI model, by accurately predicting the relationship between displacement and pressure, reduces the lag effect, enabling the system to quickly follow input signals.
2.2. The Inverse Model of QUPI
In order to address the issue of low-trajectory tracking control accuracy caused by hysteresis characteristics in the operation of pneumatic artificial muscles (PAMs), this paper designs and applies an inverse model based on the QUPI model, aiming to compensate for the hysteresis effect in the control system, thereby improving the system’s control accuracy and response speed. In complex dynamic systems, hysteresis often leads to delays and errors between input and output, and traditional control methods show limitations when dealing with such nonlinear effects. The introduction of the inverse model effectively resolves this issue. The core function of the inverse model in the control system lies in its ability to predict the input signal based on known output values, allowing the system to adjust in advance, thus achieving precise feedforward control. This control strategy enables the pre-adjustment of input parameters before system operation, reducing the negative impact of hysteresis, achieving smoother dynamic responses, and higher control accuracy. Additionally, the system can quickly adjust to the correct operating point, significantly improving its response speed. Based on the detailed introduction of the QUPI model earlier, this paper further applies the inverse model of the QUPI operator to system control to compensate for the hysteresis characteristics of the PAM. The output expression of the QUPI operator is simplified as follows:
In the equation, q is a constant, [u](k) represents the input at time k, dj is the weight coefficient, and j is the number of QUPI operators.
Since the relationship between pressure and displacement forms a hysteresis model, its inverse model is also a hysteresis model. The inverse model of the QUPI operator is:
In the equation,
represents the threshold of the inverse model, and
g is the density function of the inverse model. The expressions are as follows:
In the equation, q−1 is a constant equal to 1/q. Therefore, the key to solving the QUPI inverse model lies in identifying the parameter q−1 and the vector. In this paper, particle swarm optimization (PSO) is used for parameter identification to obtain the optimal solution. The QUPI inverse model demonstrates excellent real-time compensation capabilities in practical system operations. By introducing the density function and the recursive update formula for the threshold, the input signal can be dynamically adjusted in real time according to the changes in the system’s state.
2.3. Parameter Optimization
The parameter optimization of the FN–QUPI model includes two parts, namely the weight coefficients dij of the QUPI hysteresis operator and the neural network parameters wij and the three coefficients of the activation function (quadratic function) in the FNN–NARMAX. The parameters to be optimized in the QUPI inverse model are q−1 and g in the formula.
The parameter identification of the PAM system involves multiple nonlinear characteristics, and traditional optimization methods may easily fall into local optima. However, particle swarm optimization (PSO), by simulating the cooperation and competition among individuals in a swarm, can effectively avoid this issue. Additionally, PSO has a fast convergence speed and relatively low computational complexity, making it highly suitable for parameter optimization in PAM system models, ensuring more accurate and stable model parameters [
20]. PSO relies on information exchange among particles in the swarm, referred to as a “swarm”. The swarm is composed of individuals known as particles, and it is initialized with a population of random solutions. Each particle is also assigned a random velocity function. Particles adjust their trajectories toward their current best solution, P
best. Moreover, each particle modifies its trajectory based on the current best position, G
best, achieved by other particles in its neighborhood. The fitness function measures the performance of each particle to determine whether the best-fitting solution has been achieved. Throughout this process, the fitness of the best individual improves over time and eventually stabilizes when the process concludes, with the final stabilized parameters aligning with the globally optimal parameters.
In this paper, the particle swarm optimization (PSO) is selected for system model recognition. In terms of the FN–QUPI model and its inverse model, the parameters to be discerned have been greatly reduced compared with other hysteresis models. Therefore, the least-squares method shown in Formula (12) can be directly used as the fitness function. The specific objective is to achieve a satisfactory fitness value for the model through iterative optimization or to terminate the calculation when the maximum number of iterations is reached. By combining particle swarm optimization, the optimal parameter values for the FN–QUPI model are ultimately obtained. The process of parameter identification for the FN–QUPI model using particle swarm optimization is shown in Algorithm 1.
In the equation,
represents the actual value of the PAM’s driving mechanism,
represents the model output value, and
H represents the number of samples in identification.
Algorithm 1. The parameter identification process using particle swarm optimization |
Input: Training dataset of pressure and displacement. |
Parameter settings: maximum iterations ite = 100; number of QUPI operators Nr = 10; |
number of particles in the swarm N = 50; polynomial function order Np = 2. |
Initialization of parameters: weights in the QUPI model w~(−0.02,0.02); |
Parameters of the activation function in the hidden layer: a~(−1,0); b~(0,0.5); c~(0,0.5). |
The weights of the two-layer neural network wij~(0,0.5). |
If i ≤ ite (where i represents the current iteration count) |
Calculate the output of the FN-QUPI model O(k). |
Calculate the particle swarm fitness function S. |
Update the parameters. |
End |
Output: Final computed results of the model. |