A Simple Control Strategy for Planar 2R Underactuated Robot via DEA Optimization

Huang, Zixin; Gong, Xiangyu; Wan, Xiao; Zhou, Hongjian

doi:10.3390/act14030156

Open AccessArticle

A Simple Control Strategy for Planar 2R Underactuated Robot via DEA Optimization

¹

School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan 430205, China

²

Yunnan Key Laboratory of Unmanned Autonomous Systems, Yunnan Minzu University, Yunnan 650504, China

³

Fujian Key Laboratory of Special Intelligent Equipment Safety Measurement and Control, Fujian Special Equipment Inspection and Research Institute, Fuzhou 350008, China

⁴

School of Mechanical & Electrical Engineering, Wuhan Institute of Technology, Wuhan 430205, China

^*

Author to whom correspondence should be addressed.

Actuators 2025, 14(3), 156; https://doi.org/10.3390/act14030156

Submission received: 20 February 2025 / Revised: 12 March 2025 / Accepted: 18 March 2025 / Published: 20 March 2025

(This article belongs to the Special Issue Modeling and Nonlinear Control for Complex MIMO Mechatronic Systems)

Download

Browse Figures

Versions Notes

Abstract

In various fields, planar 2R underactuated robots have garnered significant attention due to their numerous applications. To guarantee the stable control of these robots, a simple control strategy is presented in this paper, and we utilize the intelligent optimization algorithm to enhance the controller parameters. Initially, a comprehensive dynamic model is developed for the robot with its control properties described. Subsequently, we design a PD controller to control the movement of the planar 2R underactuated robot. The differential evolution algorithm (DEA) is used to optimize the parameters of the PD controller to obtain the best control effect and make each link reach the target state. The findings from the simulation demonstrate the efficacy of the approach, and the designed strategy shows a higher stability and convergence rate, highlighting its important contribution to the field of underactuated robots.

Keywords:

planar underactuated robot; differential evolution algorithm; underactuated system

1. Introduction

Systems with multiple inputs and degrees of freedom are common in everyday life. An underactuated system results when the number of control inputs of the original system is less than the system freedom number [1,2,3,4,5,6,7,8,9]. The benefits of an underactuated system include low weight, low cost, and low energy consumption. When some devices of the fully actuated system fail, the underactuated control method can improve reliability and ensure continuous operation. The underactuated robot is a typical system in an underactuated system. Underactuated robots can be categorized into two cases based on whether or not they are affected by gravity: vertical underactuated robots [10,11,12] and planar underactuated robots [13,14,15]. This paper focuses on planar underactuated robots.

Planar underactuated robots have several potential applications in space research [16] and deep-sea exploration [17,18]. Since deep sea and space are microgravity environments, robots have more advantages than humans in terms of operation and safety when performing some command operations. Therefore, researching this type of system’s control method has enormous practical implications. The motion process of the planar robot is through the control method designed to make the robot arrive from any position in the plane to any position we need. According to prior studies of planar underactuated robots, the nonlinear approximation model is found to be uncontrollable at any equilibrium point during the control process. Therefore, the realization of this kind of robot position control is more complicated and difficult than other robot systems. As a matter of fact, the characteristics of control for the underactuated robot are closely related to the number of links and the passive joint positions. The control characteristics of an underactuated robot will vary depending on the position and number of passive joints. According to whether there is a complete integrable property and constraint relationship between underactuated links, they can be divided into holonomic systems and non-holonomic systems [19,20].

Planar 2R underactuated robots are divided into planar Acrobot robots and planar Pendubot robots. In planar Acrobot robots, if the first joint fails or is damaged, it becomes a passive joint. It possesses holonomic characteristics and a relationship defined by angular constraints. Lai [21] et al. deduced the angular constraint of the link based on integrability, and achieved position control by regulating the active joint with the aid of the angle constraint. In [22], good position control was achieved by using the method of model reduction and energy attenuation. In [23], a position control strategy is proposed for the planar Acrobot robot. Under the coupling constraint between the systems, the goal position of the robot is realized by using two stages of control. The planar Pendubot robot, a 2R underactuated robot whose second joint is passive, is a member of the nonholonomic system that is subject to an angular acceleration restriction [24]. It is challenging to intuitively determine the mathematical connection between these links, which are active and passive due to the intricate nonlinear features of such systems. To accomplish stable control of the system, an open-loop iterative control approach was created by Luca [25] on account of the nilpotent approximation model. In addition, an innovative optimization technique based on Fourier transform was put forth in [26] to realize the stability control for planar Pendubot. The above research focuses on planar Acrobot and planar Pendubot. The following is the unified control method. In [27], the unified control of the planar 2R underactuated robot is studied, and a control strategy that incorporates trajectory planning alongside tracking control is introduced, which successfully realizes the control objectives of the system. For 2R underactuated robots, Wang [28] proposes a trajectory planning tracking control strategy based on the basis function superposition method and intelligent optimization algorithm, and only relied on the second-order nonholonomic constraints commonly possessed by 2-DOF rotary underactuated robots to achieve unified control of such systems, simulation tests verified the efficacy of the strategy.

Based on the above analysis of previous studies, the research of planar 2R underactuated robots lays a foundation for the control theory research of underactuated robots. Currently, most previous studies have only proposed control methods for robots with specific structures and functions. Due to the uncertainty of passive joint position, it is necessary to explore a unified control strategy for both planar Acrobot and planar Pendubot in order to realize effective control of planar 2R underactuated robots. However, when a joint of a planar full actuated joint fails and becomes a passive joint, if the position of the passive joint cannot be directly determined, the present control methods are relatively simple and the control stability time is longer. Therefore, in order to realize the effective control of the planar 2R underactuated robot, in this case, it is essential to investigate a more simple and general method to realize the unified control strategy of the two robots above.

Based on the above analysis, this paper presents a simple planar 2R underactuated robot control strategy. By optimizing the PD parameters of the controller, the time required for the 2R underactuated robot to stabilize to the ideal state is shortened. The structure of the subsequent chapters of this paper is elaborated from the following aspects: A dynamic model of a planar 2R underactuated robot system is established in Section 2.1, and the characteristics of the system are described in Section 2.2. Then, in Section 3, a PD controller with adjustable parameters is designed. To attain the desired control objective, in Section 4, the differential evolution algorithm (DEA) [29] is used to continuously enhance the parameters to obtain the optimal control effect, so as to ensure that each link can best achieve the control objective. Finally, in Section 5, three sets of simulation experiments with different initial conditions and target conditions are carried out for the 2R underactuated robot Acrobot and Pendubot, respectively, to assess the efficacy and universal applicability of the method.

2. Model and Characteristics

This section is mainly to establish a planar 2R underactuated robot model and analyze its underactuated control characteristics, so as to pave the way for the subsequent controller design.

2.1. Model

The mechanical architecture of the planar 2R underactuated robot is shown in Figure 1.

θ_{i}

is the angle of i-th (i = 1, 2) link,

m_{i}

is the mass of this link,

L_{i}

and

L_{c i}

, respectively, are the length of the link and the distance from the midpoint of the link to the center of the joint axis, and

J_{i}

stands for the moment of inertia.

τ_{r}

is the applied torque at the i-th joint and r = 1, 2.

When an active joint of the planar robot fails or is damaged, the joint will lose its driving ability and become a passive joint, and the whole system will become an underactuated system. Moreover, due to the different positions of damaged or failed joints, the planar underactuated robot will exhibit different characteristics.

The object of this paper is the planar 2R underactuated robot, whose dynamic model is constructed as follows

M (θ) \ddot{θ} + H (θ, \dot{θ}) = τ

(1)

In this model,

θ = {[θ_{1}, θ_{2}]}^{T}

is the vector of angle,

\dot{θ} = {[{\dot{θ}}_{1}, {\dot{θ}}_{2}]}^{T}

is the angular velocity, and

\ddot{θ} = {[{\ddot{θ}}_{1}, {\ddot{θ}}_{2}]}^{T}

is the angular acceleration of the planar robot. In the dynamic model, the

M (θ)

contains the Coriolis and centrifugal forces and it belongs to

R^{2 \times 2}

.

M (θ)

and

H (θ, \dot{θ})

are expressed in detail, as follows in the dynamic model [30].

M (θ) = [\begin{matrix} M_{11} (θ) & M_{12} (θ) \\ M_{21} (θ) & M_{22} (θ) \end{matrix}], H (θ, \dot{θ}) = [\begin{matrix} H_{1} (θ, \dot{θ}) \\ H_{2} (θ, \dot{θ}) \end{matrix}]

(2)

According to the position of the passive joint, the planar 2R underactuated robot can be categorized into two distinct scenarios:

Situation 1: the planar Acrobot, the passive joint is the first joint and torque vector is $τ = {[0, τ_{2}]}^{T}$ ;
Situation 2: the planar Pendubot, the passive joint is the second joint and torque vector is $τ = {[τ_{1}, 0]}^{T}$ .

The following is the precise representation of every element in the matrix above:

\{\begin{matrix} M_{11} (θ) = n_{1} + n_{2} + 2 n_{3} cos θ_{2} \\ M_{12} (θ) = M_{21} (θ) = n_{2} + n_{3} cos θ_{2} \\ M_{22} (θ) = n_{2} \\ H_{1} (θ, \dot{θ}) = - n_{3} (2 {\dot{θ}}_{1} {\dot{θ}}_{2} + {\dot{θ}}_{2}^{2}) sin θ_{2} \\ H_{2} (θ, \dot{θ}) = n_{3} {\dot{θ}}_{1}^{2} sin θ_{2} \end{matrix}

(3)

where

\{\begin{matrix} n_{1} = m_{1} l_{c 1}^{2} + m_{2} l_{1}^{2} + J_{1} \\ n_{2} = m_{2} l_{c 2}^{2} + J_{2} \\ n_{3} = m_{2} l_{1} l_{c 2} \end{matrix}

(4)

2.2. Control Characteristic Analysis

The passive part of (1) is

M_{p 1} {\ddot{θ}}_{1} + M_{p 2} {\ddot{θ}}_{2} + H_{p} = 0

(5)

when p = 1, for the underactuated planar Acrobot constraint relationship; when p = 2, for underactuated planar Pendubot constraint relationship.

Equation shown below could be obtained by Equation (5) and the constraint relation of underactuated characteristics.

\{\begin{matrix} {\dot{θ}}_{p} = - \int_{0}^{T} (\frac{\sum_{r = 1, r \neq p}^{2} M_{p r} (θ) {\ddot{θ}}_{r} + H_{p}}{M_{p p}}) d t - {\dot{θ}}_{p 0} \\ θ_{p} = \int_{0}^{T} {\dot{θ}}_{p} d t - θ_{p 0} \end{matrix}

(6)

where the initial angle and angular velocity of the underactuated link are denoted by

θ_{p 0}

and

{\dot{θ}}_{p 0}

, respectively. The Equation (6) specifically expresses that the underactuated link is constrained by the actuated link state.

3. Controller Design

We design a PD controller in this subsection in order to steer all links toward their desired states. Let the

x = {[x_{1} x_{2} x_{3} x_{4}]}^{T}

, in which

x_{1}, x_{2}

represent

θ_{1}, θ_{2}

, the angles of the two links of the robot and

x_{3}, x_{4}

are

{\dot{θ}}_{1}, {\dot{θ}}_{2}

, the angular acceleration at the two linking joints. The following state space expression can be further obtained as

\dot{x} = f (x) + g (x) τ

(7)

In this state space Equation (7),

f (x) = [\begin{matrix} f_{1} \\ f_{2} \end{matrix}] = - M^{- 1} (θ) H (θ, \dot{θ})

(8)

g (x) = M^{- 1} (θ) = [\begin{matrix} g_{11} (x) & g_{12} (x) \\ g_{21} (x) & g_{22} (x) \end{matrix}]

(9)

The active link’s control goal determines the selection of the Lyapunov function, and it is given in Equation (10).

\begin{matrix} V (x) = \frac{P_{r}}{2} {(x_{i} - x_{i d})}^{2} + \frac{1}{2} M_{r p} x_{m}^{2} \end{matrix}

(10)

where

P_{r}

represents the positive constant, and

r = 1, 2

;

x_{i}

represents the angular state of the link,

x_{i d}

represents the angular target state of the link,

i = 1, 2

;

x_{m}

represents the angular velocity of the link,

m = 3, 4

.

Then, the derivative of

V (x)

is going to be

\begin{matrix} \dot{V} (x) = x_{m} (P_{r} (x_{i} - x_{i d}) + τ_{r}) \end{matrix}

(11)

Therefore, the controller design of the system is as follows

τ_{r} = - P_{r} (x_{i} - x_{i d}) - D_{r} x_{m}

(12)

where

D_{r} > 0

,

g_{r r}

is the principal diagonal element of the matrix of Equation (9), which is

g_{11}

and

g_{22}

. Since

M (θ)

is a positive definite matrix, the elements of the principal diagonal are not zero, that is,

g_{r r}

is nozero, which effectively avoids the control torque singularity problem.

τ_{r}

is divided into

τ_{1}

and

τ_{2}

, which correspond to the control torque of planar Pendubot and planar Acrobot, respectively.

Bringing Equation (12) into Equation (11) gives

\dot{V} (x) = - D_{r} x_{m}^{2} \leq 0

(13)

When Equation (12) is substituted for Equation (7), the system which is closed loop will be

\dot{x} = F (x) = [\begin{matrix} f_{1} \\ f_{2} \end{matrix}] + [\begin{matrix} g_{11} & g_{12} \\ g_{21} & g_{22} \end{matrix}] [\begin{matrix} - P_{1} (x_{2} - x_{2 d}) - D_{1} x_{4} \\ - P_{2} (x_{1} - x_{1 d}) - D_{2} x_{3} \end{matrix}]

(14)

Equation (14) is a closed-loop control system with a PD controller. It is obvious from Equation (13) that

V (x)

is bounded. Then, we define the following conditions

Ω_{1} = \{x \in R^{4} |V (x) \leq λ_{1}\}

(15)

in which

λ_{1} > 0

. Any solution x of Equation (15) coming from

Ω_{1}

still remains in

Ω_{1}

for all

t \geq 0

. By setting

Φ_{1}

as the invariant of Equation (14), we obtain

Φ_{1} = \{x (t) \in Ω_{1} |\dot{V} (x) = 0\}

(16)

Set

\dot{V} (x) \equiv 0

,

x_{m} \equiv 0

, then substitute it into Equation (7) and obtain the following result.

f_{r} = - g_{r r} τ_{r}

(17)

According to the different positions of the passive joints,

f_{r}

can be specifically divided into

f_{1}

and

f_{2}

in Equation (8). The passive joint is

f_{1}

in the first link, and the passive joint is

f_{2}

in the second link. Substituting Equation (17) into Equation (12) obtains

x_{i} = x_{i d}

. Then, the maximum invariant set is

W_{1} = \{x \in R^{4} |x_{i} = θ_{i d}, x_{m} = 0\}

(18)

According to the LaSalle invariability principle [31], when Equation (16) reaches the predetermined control goal, the state of the actuated link is

θ_{i} = θ_{id}

,

{\dot{θ}}_{i} = 0

.

Therefore, when the following conditions are satisfied, the linkage of the planar 2R underactuated robot system is controlled to the target state.

\{\begin{matrix} |x_{i} - x_{i d}| \leq e_{1} \\ |x_{m}| \leq e_{2} \end{matrix}

(19)

where

e_{1}

and

e_{2}

are small positive error coefficients.

According to the above analysis, the underactuated link is still rotating when the actuated link stabilizes to the desired state, and the underactuated link is the one containing the underactuated joint. Therefore, this paper designed a simple PD controller for the actuated link, optimized the controller parameters through DEA, and successfully realized the stability of the whole system.

4. Controller Parameter Optimization

The active link’s control objectives could be achieved by the intended controller because the controller (12) forces the actuated link to remain stable at its goal position. Therefore, the DEA is used to calculate

P_{r}

and

D_{r}

to guarantee that the unactuated link successfully attains its intended state.

The DEA’s evaluation function, as explained in [29].

h = |θ_{p} - θ_{p d}| + |{\dot{θ}}_{p}|

(20)

where

p = 1, 2

, and

p \neq r

.

The following are the steps of the designed controller calculated by DEA.

Process 1: Random initialization of $\tilde{P_{r}}$ and $\tilde{D_{r}}$ .
Process 2: Submit $\tilde{P_{r}}$ and $\tilde{D_{r}}$ into Equations (12) and (5). Then, calculate ${\tilde{θ}}_{p}$ and ${\dot{\tilde{θ}}}_{p}$ by means of numerically integrating Equation (6) according to ${\tilde{θ}}_{2}$ , ${\dot{\tilde{θ}}}_{2}$ and ${\ddot{\tilde{θ}}}_{2}$ , $θ_{2 d}$ , ${\dot{θ}}_{2} = 0$ , and ${\ddot{θ}}_{2} = 0$ .
Process 3: If h is less than $η_{1}$ (a tiny positive constant), the optimization operation is completed. The result of the promotion are $P_{r} = {\tilde{P}}_{r}$ and $D_{r} = {\tilde{D}}_{r}$ . If not, the program moves on to the next phase.
Process 4: Update ${\tilde{P}}_{r}$ and ${\tilde{D}}_{r}$ through mutation, crossover, and selection procedures by using $p_{m}$ (mutation factor) and $p_{c}$ (crossed factor). The procedure then turns to Process 2.

The basic operation flow of DEA is shown in Figure 2.

Based on the optimized controller parameters

\tilde{P_{r}}

and

\tilde{D_{r}}

, the link controller can be determined, and then the initial and target state parameters can be set to verify the designed controller by simulation.

5. Simulations

In this section, the proposed control strategy is verified by Matlab simulation, and three sets of simulations are carried out on the plane Acrobat and plane Pendubot for a planar 2R underactuated robot under different initial and target states, respectively, to assess the efficacy of the proposed control strategy. We choose the same length, weight, and moment of inertia for both links. Table 1 shows the unified parameters of the planar 2R underactuated robot model.

5.1. The Simulation for Planar Acrobot

When the first link fails or is damaged, the planar 2R underactuated robot has underactuated characteristics. In this case, the main drive is the second link, and torque is applied to link 2, i.e.,

τ_{2}

. How to use the motion of the second actuated link to control the first link to stabilize the target state is a problem that the simulation experiment must take into account. The model structure is transformed into a planar underactuated robot Acrobot. The converted Acrobot model is shown in Figure 3.

Therefore, we consider that we will first move the second link to the target state under the action of torque. At the same time, according to the above constraints of the planar Acrobot underactuated robot, under the action of a PD controller, the appropriate parameters are optimized to control the first underactuated connecting rod to move to the target state and finally realize the stability control of the whole planar Acrobot.

5.1.1. Simulation Result 1 for Planar Acrobot

The parameters of the DEA are selected as follows: maximum number of iterations

G_{m a x}

= 100, the

G_{m a x}

is generally used as the termination condition of the evolutionary process; population number N = 50, the greater the population number, the greater the probability of obtaining the optimal solution, but the calculation time is longer; mutation factor

p_{m}

= 0.5, the mutation factor

p_{m}

is an important parameter for controlling population diversity and convergence; crossover factor

p_{c}

= 0.7, the crossover factor

p_{c}

can control the degree of participation of each dimension of the individual parameters in the crossover, as well as the balance of global and local search capabilities;

η

= 0.005,

η

is used as the judgment threshold for the final optimization success. DEA is used to control parameters calculated and optimized in Equation (12) so that it can attain optimal control efficacy on the planar Acrobot. The results of parameter optimization are as follows

\{\begin{matrix} P_{1} = 11.9401 \\ D_{1} = 11.1636 \end{matrix}

(21)

The initial and target states of our chosen simulation 1 are shown in Equation (22).

\{\begin{matrix} [θ_{10} θ_{20} {\dot{θ}}_{10} {\dot{θ}}_{20}] = [0 0 0 0] \\ [θ_{1 d} θ_{2 d} {\dot{θ}}_{1 d} {\dot{θ}}_{2 d}] = [- 1.999 14.106 0 0] \end{matrix}

(22)

where

θ_{10}

,

θ_{20}

,

θ_{1 d}

,

θ_{2 d}

represent the starting angle and desired angle of the first link and the second link, respectively, and

{\dot{θ}}_{10}

,

{\dot{θ}}_{20}

,

{\dot{θ}}_{1 d}

,

{\dot{θ}}_{2 d}

represent the initial angular velocity and target angular velocity of the two links, respectively.

The control of planar Acrobot has achieved the target shown in the simulation results 1, and each state variable is primly convergent to the predetermined value. As shown in Figure 4a,b, both the angle and angular velocity of the two links converge steadily from the initial state set by Equation (22) to the target state. Figure 4c shows the torque variation of the second actuated link.

As can be seen from the Figure 4, the planar Acrobot has reached the target state and remained stable before t = 5 s, the maximum angular velocity does not exceed

\dot{θ} = \pm 15

rad/s, and the torque range is maintained at

τ_{2} = \pm 20 N \cdot m

. The control strategy has good rapidity and system stability.

5.1.2. Simulation Result 2 for Planar Acrobot

Under the condition that the algorithm parameters remain unchanged, we transform different starting conditions and desired condition parameters to continue the optimization. DEA parameters are as follows: maximum number of iterations

G_{m a x}

= 100, population number N = 50, mutation factor

p_{m}

= 0.3, crossover factor

p_{c}

= 0.7, convergence judgment threshold

η

= 0.005. The optimization parameters of simulation result 2 are shown as follows.

\{\begin{matrix} P_{1} = 19.5405 \\ D_{1} = 5.3866 \end{matrix}

(23)

In simulation result 2, the starting state and the desired state are

\{\begin{matrix} [θ_{10} θ_{20} {\dot{θ}}_{10} {\dot{θ}}_{20}] = [0 0 0 0] \\ [θ_{1 d} θ_{2 d} {\dot{θ}}_{1 d} {\dot{θ}}_{2 d}] = [0.28 - 4.20 0 0] \end{matrix}

(24)

As shown in Figure 5a,b, both the angle and angular velocity of the two links converge steadily from the initial state set by Equation (24) to the target state. Figure 5c shows the torque variation of the second actuated link. The planar Acrobot before t = 4 s achieves the target state and maintains stability, the maximum absolute angular speed does not exceed

\dot{θ}

= 10 rad/s, the value of the torque is still maintained in the range of

τ_{2} = \pm 10 N \cdot m

, and planar Acrobot fast stability control requirements are achieved.

5.1.3. Simulation Result 3 for Planar Acrobot

To assess the validity of the suggested control strategy, we set up a set of simulations, set state parameters, and optimized PD controller parameters. DEA parameters are as follows: maximum number of iterations

G_{m a x}

= 100, population number N = 50, mutation factor

p_{m}

= 0.3, crossover factor

p_{c}

= 0.8, convergence judgment threshold

η

= 0.005. The parameters of simulation result 3 are

\{\begin{matrix} P_{1} = 18.5003 \\ D_{1} = 9.8357 \end{matrix}

(25)

In simulation 3, the starting and desired states of the planar Acrobot are

\{\begin{matrix} [θ_{10} θ_{20} {\dot{θ}}_{10} {\dot{θ}}_{20}] = [- 0.2998 0.6000 0 0] \\ [θ_{1 d} θ_{2 d} {\dot{θ}}_{1 d} {\dot{θ}}_{2 d}] = [0 0 0 0] \end{matrix}

(26)

As shown in Figure 6a,b, both the angle and angular velocity of the two links converge steadily from the initial state set by Equation (26) to the target state. Figure 6c shows the torque variation of the second actuated link. The final results indicate that the control method successfully converges to the target state within t = 3 s, the angular velocity range is

\dot{θ} = \pm 1

rad/s, and the range of torque is maintained at

τ_{2} = \pm 1.2 N \cdot m

. The control goal can still be achieved.

In simulations 2 and 3 of the planar Acrobot, we set different parameters and different target states, but in the end, it is ideal in terms of speed and stability. In addition, compared with reference [27], this method is simpler and can control the target state quickly without the need for trajectory planning.

5.2. The Simulation for Planar Pendubot

When the second link of the planar 2R underactuated robot fails or is damaged, then the model has underactuated characteristics, and the model structure is transformed into the planar Pendubot underactuated robot. The specific structure is shown in Figure 7, and simulation experiments are carried out on the planar Pendubot model.

Also, three sets of different starting state parameters and desired state parameters are selected to carry out simulation on the planar Pendubot underactuated robot. For the planar Pendubot underactuated robot model, in this case, the main driver is the first link, and torque is applied to link

τ_{1}

. The first link moves to the target state under the action of torque. At the same time, according to the constraint relation of the planar Pendubot underactuated robot, the second underactuated link is controlled to move to the desired state, and the stability control of the entire planar Pendubot is finally realized.

5.2.1. Simulation Result 1 for Planar Pendubot

Similarly, the parameters of the DEA are selected as follows: maximum number of iterations

G_{m a x}

= 100, population number N = 50, mutation factor

p_{m}

= 0.5, crossover factor

p_{c}

= 0.7, convergence judgment threshold

η_{1}

= 0.005. Using DEA for the controller parameters, we calculated and optimized Equation (12), so that it can attain optimal control efficacy on the planar Pendubot. The results of parameter optimization are as follows

\{\begin{matrix} P_{2} = 16.1889 \\ D_{2} = 10.0598 \end{matrix}

(27)

In simulation result 1, the starting state and the desired state are

\{\begin{matrix} [θ_{10} θ_{20} {\dot{θ}}_{10} {\dot{θ}}_{20}] = [0 0 0 0] \\ [θ_{1 d} θ_{2 d} {\dot{θ}}_{1 d} {\dot{θ}}_{2 d}] = [5.00 - 2.65 0 0] \end{matrix}

(28)

The control of planar Pendubot reaches the target shown in simulation results 1, and each state variable basically converges to the predetermined value. As shown in Figure 8a,b, both the angle and angular velocity of the two links converge steadily from the initial state set by Equation (28) to the target state. Figure 8c shows the control torque variation of the first link. Before t = 5 s, the planar Pendubot has reached the target state and maintained stability. The angular velocity range is

\dot{θ} = \pm 15 rad / s

, and the control torque range is within

τ_{1} = \pm 50 N \cdot m

, which meets the control requirements of the system. Therefore, this control strategy has good rapidity and system stability for planar Pendubot.

5.2.2. Simulation Result 2 for Planar Pendubot

In the same case that the algorithm parameters are unchanged, different starting state parameters and desired state parameters are transformed to continue the optimization. DEA parameters are as follows: maximum number of iterations

G_{m a x}

= 100, population number N = 50, mutation factor

p_{m}

= 0.3, crossover factor

p_{c}

= 0.7, convergence judgment threshold

η

= 0.005. Optimization parameters of simulation result 2 are as follows:

\{\begin{matrix} P_{2} = 15.5203 \\ D_{2} = 9.7898 \end{matrix}

(29)

In simulation result 2, the starting state and the desired state are

\{\begin{matrix} [θ_{10} θ_{20} {\dot{θ}}_{10} {\dot{θ}}_{20}] = [4.5 3.8 0 0] \\ [θ_{1 d} θ_{2 d} {\dot{θ}}_{1 d} {\dot{θ}}_{2 d}] = [- 3 8.8 0 0] \end{matrix}

(30)

The planar Pendubot control achieves the target shown in simulation result 2, and each state variable basically converges to the predetermined value. As shown in Figure 9a,b, both the angle and angular velocity of the two links converge steadily from the initial state set by Equation (30) to the target state. Figure 9c shows the control torque variation of the first link. Under the second set of state parameters, the changes of state variables of the planar Pendubot double links mechanism and the changes of torque of the movable link mechanism are analyzed. Before t = 4 s, the planar Pendubot has reached the target state and remains stable. The angular velocity range is

\dot{θ} = \pm 22 rad / s

, and the control torque range is within

τ_{1} = \pm 100 N \cdot m

, which reaches the control goal of the system.

5.2.3. Simulation Result 3 for Planar Pendubot

To confirm the general applicability of the strategy to the planar Pendubot underactuated robot, the third set of simulation experiments was conducted, and the following state parameters were selected to optimize the PD controller by differential evolution algorithms. DEA parameters are as follows: maximum number of iterations

G_{m a x}

= 100, population number N = 50, mutation factor

p_{m}

= 0.3, crossover factor

p_{c}

= 0.8, convergence judgment threshold

η

= 0.005. The parameters of simulation result 3 are

\{\begin{matrix} P_{2} = 14.8870 \\ D_{2} = 6.4524 \end{matrix}

(31)

In simulation result 3, the initial and target states of each state variable of the planar Pendubot are

\{\begin{matrix} [θ_{10} θ_{20} {\dot{θ}}_{10} {\dot{θ}}_{20}] = [- 8 5 0 0] \\ [θ_{1 d} θ_{2 d} {\dot{θ}}_{1 d} {\dot{θ}}_{2 d}] = [0 4.3 0 0] \end{matrix}

(32)

Under the third set of state parameters, the planar Pendubot control reaches the target shown in the simulation results, and each state variable basically converges to the predetermined value. As shown in Figure 10a,b, both the angle and angular velocity of the two links converge steadily from the initial state set by Equation (32) to the target state. Figure 10c shows the control torque variation of the first link. Before t = 3 s, the planar Pendubot has reached the target state and remains stable. The angular velocity range is

\dot{θ} = \pm 25 rad / s

, and the control torque range is within

τ_{1} = \pm 500 N \cdot m

.

In simulations 2 and 3 of planar Pendubot, different parameters and different target states are set, and the simulation results indicate that the results are good in terms of speed and stability. In addition, compared with reference [27], this method is simpler and can control the target state quickly without the need for trajectory planning.

5.3. Analysis of Simulation Results

For planar Arcobot and planar Pendubot underactuated robots, three groups of different state parameters were selected to design a simple PD controller, and the parameters of the PD controller were successfully optimized by the differential evolution algorithm. Different mutation factor

p_{m}

and crossover factor

p_{c}

parameters are set to verify the effect on the convergence rate and show the sensitivity of the algorithm to the parameters. The proposed control strategy’s effectiveness was validated through simulation. The following are three different groups of parameter indicators in the control process of planar Arcobot and planar Pendubot underactuated robots, as shown in Table 2 and Table 3.

The results of Table 2 and Table 3 show that the rotation angle of all the links in the simulation successfully converges, and the stable control of the target state is achieved. The change in angular velocity is proportional to the change in torque. The larger the variation factor pm, the slower the convergence rate and the increased stabilization time required. The larger the crossover factor pc, the faster the convergence rate, and the less time it takes to reach stability.

In optimization design, DEA has the following main characteristics compared with traditional optimization methods:

(1) The DEA searches from a group of multiple points rather than from a single point, which is the main reason why it can find the overall optimal solution with a high probability;

(2) The evolutionary criterion of DEA is based on adaptive information and does not need to rely on other auxiliary information (such as requiring the function to be derivable or continuous), which greatly expands its application scope;

(3) DEA has inherent parallelism, which makes it very suitable for large-scale parallel distributed processing and reduces the overhead of inter-cost;

(4) DEA adopts probabilistic transition rules and does not require deterministic rules.

6. Conclusions

A simple PD control strategy is proposed for a planar 2R underactuated robot. The appropriate controller parameters are obtained by DEA, and the PD controller makes the whole system move from the initial state to the target state under the actuated link. Finally, the effectiveness and universality of the proposed strategy are verified by simulation, and the potential relationship between angular velocity variation and torque magnitude, as well as the influence of mutation factor and crossover factor on the convergence rate, are obtained.

In future work, we will use this simple and effective control method to deal with multi-degrees of freedom underactuated robots, cluster underactuated robot control, and multi-passive joint underactuated robots. This control method expands the theory of nonlinear control and underactuated control and provides a theoretical basis and reference for subsequent research.

Author Contributions

Planning research programs and making comments, Z.H.; collect data for simulation and paper writing, X.G.; model building and literature collection, X.W.; provide technical support and advice, H.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Nature Science Foundation of Hubei Province (Grant No. 2023AFB380), the Hubei Key Laboratory of Intelligent Robot (Grant No. HBIRL202301), the Foundation of Yunnan Key Laboratory of Unmanned Autonomous Systems (Grant No. 202408YB06), the Open Project Program of Fujian Key Laboratory of Special Intelligent Equipment Measurement and Control, Fujian Special Equipment Inspection and Research Institute (Grant No.FJIES2024KF12).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Lu, B.; Fang, Y.C. Online trajectory planning control for a class of underactuated mechanical systems. IEEE Trans. Autom. 2024, 69, 442–448. [Google Scholar] [CrossRef]
Yang, T.; Sun, N.; Chen, H.; Fang, Y.C. Adaptive optimal motion control of uncertain underactuated mechatronic systems with actuator constraints. IEEE/ASME Trans. Mech. 2023, 28, 210–222. [Google Scholar] [CrossRef]
Yang, T.; Sun, N.; Fang, Y.C. Neuroadaptive control for complicated underactuated systems with simultaneous output and velocity constraints exerted on both actuated and unactuated states. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 4488–4498. [Google Scholar] [CrossRef] [PubMed]
Han, F.; Yi, J.G. Stable learning-based tracking control of underactuated balance robots. IEEE Rob. Autom. 2021, 6, 1543–1550. [Google Scholar] [CrossRef]
Yang, T.; Chen, H.; Sun, N. Adaptive neural network output feedback control of uncertain underactuated systems with actuated and unactuated state constraints. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 7027–7043. [Google Scholar] [CrossRef]
Pucci, D.; Romano, F.; Nori, F. Collocated adaptive control of underactuated mechanical systems. IEEE Trans. Robot. 2015, 31, 1527–1536. [Google Scholar] [CrossRef]
Yang, T.; Sun, N.; Fang, Y.C. Adaptive fuzzy control for a class of MIMO underactuated systems with plant uncertainties and actuator dead zones: Design and experiments. IEEE Trans. Cybern. 2022, 52, 8213–8226. [Google Scholar] [CrossRef]
Huang, Z.X.; Lai, X.Z.; Zhang, P.; Meng, Q.X.; Wu, M. A general control strategy for planar 3-DoF underactuated manipulators with one passive joint. Inform. Sci. 2020, 534, 139–153. [Google Scholar] [CrossRef]
Xu, F.; Wang, H.H.; Liu, Z.; Chen, W.D. Adaptive visual servoing for an underwater soft robot considering refraction effects. IEEE Trans. Ind. Electron. 2020, 67, 10575–10586. [Google Scholar] [CrossRef]
Wang, L.J.; Lai, X.Z.; Meng, Q.X.; Wu, M. Effective control method based on trajectory optimization for three-link vertical underactuated manipulators with only one active joint. IEEE Trans. Cybern. 2023, 53, 3782–3793. [Google Scholar] [CrossRef]
Zou, Y.; Zhou, Z.Q.; Dong, X.W.; Meng, Z.Y. Distributed formation control for multiple vertical takeoff and landing UAVs with switching topologies. IEEE/ASME Trans. Mech. 2018, 23, 1750–1761. [Google Scholar] [CrossRef]
Wang, L.J.; Lai, X.Z.; Zhang, P.; Wu, M. A control strategy based on trajectory planning and optimization for two-link underactuated manipulators in vertical plane. IEEE Trans. Syst. Man Cybern. 2022, 52, 3466–3475. [Google Scholar] [CrossRef]
Li, D.W.; Wei, Z.A.; Huang, Z.X. Two-stage control strategy based on motion planning for planar prismatic–rotational underactuated robot. Actuators 2024, 13, 278. [Google Scholar] [CrossRef]
Wang, Y.W.; Lai, X.Z.; Zhang, P.; Wu, M. Control strategy based on model reduction and online intelligent calculation for planar n-link underactuated manipulators. IEEE Trans. Syst. Man Cybern. 2017, 50, 1046–1054. [Google Scholar] [CrossRef]
Li, J.; Wang, L.J.; Chen, Z.; Huang, Z.X. Drift suppression control based on online intelligent optimization for planar underactuated manipulator with passive middle joint. IEEE Access 2021, 9, 38611–38619. [Google Scholar] [CrossRef]
Ma, Z.Q.; Huang, P.F.; Lin, Y.X. Learning-based sliding-mode control for underactuated deployment of tethered space robot with limited input. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 2026–2038. [Google Scholar] [CrossRef]
Wang, K.; Liu, Y.H.; Li, L.Y. Vision-based tracking control of underactuated water surface robots without direct position measurement. IEEE Trans. Control Syst. Technol. 2015, 23, 2391–2399. [Google Scholar] [CrossRef]
Wang, J.S.; Wang, J.; Han, Q.L. Receding-horizon trajectory planning for under-actuated autonomous vehicles based on collaborative neurodynamic optimization. IEEE/CAA J. Autom. Sin. 2022, 9, 1909–1923. [Google Scholar] [CrossRef]
Oriolo, G.; Nakamura, Y. Control of mechanical systems with second-order nonholonomic constraints: Underactuated manipulators. In Proceedings of the 30th IEEE Conference on Decision and Control, Brighton, UK, 11–13 December 1991; pp. 2398–2403. [Google Scholar] [CrossRef]
Wang, Z.Y.; Xin, X.; Liu, Y.N. Strong stabilization of two-link underactuated planar robots. In Proceedings of the 2022 China Automation Congress (CAC), Xiamen, China, 25–27 November 2022; pp. 4125–4129. [Google Scholar] [CrossRef]
Lai, X.Z.; She, J.H.; Cao, W.H.; Yang, S.X. Stabilization of underactuated planar acrobot based on motionstate constraints. Int. J. Non. Linear Mech. 2015, 77, 342–347. [Google Scholar] [CrossRef]
Xiong, P.Y.; Feng, G.F.; Zeng, H.F.; Pan, C.Z. Position control for a planar underactuated manipulator based on model reduction and energy attenuation. In Proceedings of the 2022 41st Chinese Control Conference (CCC), Hefei, China, 25–27 July 2022; pp. 884–889. [Google Scholar] [CrossRef]
Luo, Y.B.; Lai, X.Z.; Wu, M. A positioning control strategy of planar Acrobot. In Proceedings of the 30th Chinese Control Conference, Yantai, China, 22–24 July 2011; pp. 743–747. [Google Scholar]
Shoji, T.; Katsumata, S.; Nakaura, S.; Sampei, M. Throwing motion control of the springed Pendubot. IEEE Trans. Control Syst. Technol. 2013, 21, 950–957. [Google Scholar] [CrossRef]
Luca, A.D.; Mattone, R.; Oriolo, G. Stabilization of an underactuated planar 2R manipulator. Int. J. Robust Nonlinear Control 2000, 10, 181–198. [Google Scholar] [CrossRef]
Urrea, C.; Kern, J.; Alvarez, E. Design of a generalized dynamic model and a trajectory control and position strategy for n-link underactuated revolute planar robots. Control Eng. Pract. 2022, 128, 5316–5329. [Google Scholar] [CrossRef]
Huang, Z.X.; Hou, M.Y.; Wei, S.Q.; Wang, L.J. The unified control strategy for planar Acrobot and Pendubot. J. Shenzhen Univ. Sci. Eng. 2023, 40, 275–283. [Google Scholar] [CrossRef]
Wang, H.N.; Hua, Y.; Chen, L. Unified control strategy of 2-DOF rotary underactuated manipulator. Modul. Mach. Tool Autom. Manuf. Technol. 2024, 11, 152–155. [Google Scholar] [CrossRef]
Storn, R.; Price, K. Differential evolution—A simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
Cao, S.Q.; Lai, X.Z.; Wu, M. Motion control method of planar Acrobot based on trajectory characteristics. In Proceedings of the 31st Chinese Control Conference, Hefei, China, 25–27 July 2012; pp. 4910–4915. [Google Scholar]
LaSalle, J.P. Stability theory for ordinary differential equations. J. Differ. Equ. 1968, 4, 57–65. [Google Scholar] [CrossRef]

Figure 1. Model of a planar 2R underactuated robot.

Figure 2. Basic operational flow of DEA.

Figure 3. The model of the planar Acrobot.

Figure 4. Simulation result 1 for planar Acrobot: (a) angle; (b) angular velocity; (c) control torque.

Figure 5. Simulation result 2 for planar Acrobot: (a) angle; (b) angular velocity; (c) control torque.

Figure 6. Simulation result 3 for planar Acrobot: (a) angle; (b) angular velocity; (c) control torque.

Figure 7. The model of the planar Pendubot.

Figure 8. Simulation result 1 for planar Pendubot: (a) angle; (b) angular velocity; (c) control torque.

Figure 9. Simulation result 2 for planar Pendubot: (a) angle; (b) angular velocity; (c) control torque.

Figure 10. Simulation result 3 for planar Pendubot: (a) angle; (b) angular velocity; (c) control torque.

Table 1. Planar 2R underactuated robot model parameters.

Link i	$L_{i}$ (m)	$L_{ci}$ (m)	$m_{i}$ (kg)	$J_{i}$ (kg · m²)
1	1.0	0.5	1.0	0.0833
2	1.0	0.5	1.0	0.0833

Table 2. Comparison of three simulation groups of planar Arcobot.

Simulation i	Mutation Factor $p_{m}$	Crossover Factor $p_{c}$	Stable Time	Angular Velocity	Control Torque
1	0.5	0.7	$\leq 5 s$	$\pm 15 rad / s$	$\pm 20 N \cdot m$
2	0.3	0.7	$\leq 4 s$	$\pm 10 rad / s$	$\pm 10 N \cdot m$
3	0.3	0.8	$\leq 3 s$	$\pm 1 rad / s$	$\pm 1.2 N \cdot m$

Table 3. Comparison of three simulation groups of planar Pendubot.

Simulation i	Mutation Factor $p_{m}$	Crossover Factor $p_{c}$	Stable Time	Angular Velocity	Control Torque
1	0.5	0.7	$\leq 5 s$	$\pm 15 rad / s$	$\pm 50 N \cdot m$
2	0.3	0.7	$\leq 4 s$	$\pm 22 rad / s$	$\pm 100 N \cdot m$
3	0.3	0.8	$\leq 3 s$	$\pm 25 rad / s$	$\pm 500 N \cdot m$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Huang, Z.; Gong, X.; Wan, X.; Zhou, H. A Simple Control Strategy for Planar 2R Underactuated Robot via DEA Optimization. Actuators 2025, 14, 156. https://doi.org/10.3390/act14030156

AMA Style

Huang Z, Gong X, Wan X, Zhou H. A Simple Control Strategy for Planar 2R Underactuated Robot via DEA Optimization. Actuators. 2025; 14(3):156. https://doi.org/10.3390/act14030156

Chicago/Turabian Style

Huang, Zixin, Xiangyu Gong, Xiao Wan, and Hongjian Zhou. 2025. "A Simple Control Strategy for Planar 2R Underactuated Robot via DEA Optimization" Actuators 14, no. 3: 156. https://doi.org/10.3390/act14030156

APA Style

Huang, Z., Gong, X., Wan, X., & Zhou, H. (2025). A Simple Control Strategy for Planar 2R Underactuated Robot via DEA Optimization. Actuators, 14(3), 156. https://doi.org/10.3390/act14030156

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Simple Control Strategy for Planar 2R Underactuated Robot via DEA Optimization

Abstract

1. Introduction

2. Model and Characteristics

2.1. Model

2.2. Control Characteristic Analysis

3. Controller Design

4. Controller Parameter Optimization

5. Simulations

5.1. The Simulation for Planar Acrobot

5.1.1. Simulation Result 1 for Planar Acrobot

5.1.2. Simulation Result 2 for Planar Acrobot

5.1.3. Simulation Result 3 for Planar Acrobot

5.2. The Simulation for Planar Pendubot

5.2.1. Simulation Result 1 for Planar Pendubot

5.2.2. Simulation Result 2 for Planar Pendubot

5.2.3. Simulation Result 3 for Planar Pendubot

5.3. Analysis of Simulation Results

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI