1. Introduction
The telerobotics system combines the intelligence of the operator and the operation ability of the robot, it can achieve a variety of operational tasks in a hazardous and dangerous environment. Based on these excellent characteristics, teleoperation systems have been widely used in nuclear radiation, space, deep sea, and complex industrial environments, and play an important role in these engineering applications [
1,
2]. In a typical teleoperation system, the operator controls the movement of the master robot, while the slave robot is regulated to achieve the position-tracking movement of the master robot [
3]. Generally, two controllers are introduced at the master end and the slave end, respectively, to realize position-tracking control. This control structure is also defined as the bilateral control. The operator can sense the interaction condition between the robot and task environment based on bilateral control. The position/force signals of the master robot and the slave robot need to be transmitted to each other’s controller through the communication channel to achieve position tracking and force feedback. Compared with the unilateral control mode, the bilateral control mode provides the operator with better telepresence feedback of the operation force, which improves the operation’s precision and efficiency.
In bilateral control of teleoperation systems, the position/force signals of the master robot and the slave robot are always transmitted through the communication channel to each other’s controller to achieve position tracking and force feedback. Nevertheless, the communication delay cannot be inevitable. This fact creates an inconsistent factor, i.e., the position signals with time delay will reduce the tracking performance and stability of the teleoperation system, and may even cause system instability [
3]. In general, stability is regarded as the most fundamental performance for the communication delay teleoperation system [
4]. Currently, passive theory and the Lyapunov method are commonly used to analyze the stability of a teleoperation system and design an appropriate structure for position control. For a teleoperation system, communication delay factor can be equivalent to injecting additional energy. Therefore, the passivity theory is to reduce the energy of the communication process by designing an effective controller, which can ensure the passivity of the closed-loop system and maintain the stability of system. The passive theory-based control methods of the telerobotics system have the scattering method [
5,
6], the traditional wave method, and improved wave variable method [
7,
8]. However, the passivity assumptions of the master and slave sides are insufficient for engineering practice conditions. Then, numerous scholars have employed the Lyapunov theory to analyze the stability of the time-delay teleoperation system and to develop the control strategy. Some typical control strategies based on the Lyapunov method include
control [
9], adaptive control [
10,
11,
12], sliding mode control [
13], output feedback control [
14], feedforward-feedback position control [
15], and so on. These methods have produced effective position tracking controllers for teleoperation systems with fixed or time-varying communication delays.
Comparative to other control methods, the proportional damping injection (P+d) is a control strategy for time-delayed telerobotics with a simple structure and good control performance. The main component of damping injection control is to utilize damping terms to eliminate superfluous energy and maintain the stability of the time-delay system. Proportional plus damping and proportional differential plus damping (PD+d) are often part of the damping injection controller’s fundamental construction [
16]. The Lyapunov—Krasovskii method is usually employed for the design of damping injection controllers and the analysis of closed-loop system stability. Some improved damping injection controllers have been proposed and designed. In [
17], a PD controller without speed measurement is proposed for the teleoperation system, and a first-order filter is designed for speed estimation. For a teleoperation system with passive and nonpassive injected force, Islam [
18] developed a novel PD+d control approach, and symmetric and asymmetric communication delays were also taken into consideration. In [
19,
20,
21], three novel P+d-like controllers are presented to guarantee the position/force-tracking performances for the network telerobitics system. In [
22], the authors present three strategies that adjust the damping and stiffness of the bilateral teleoperation system to maintain the stable teleoperation. In [
23], a simplified P+d control with gravity compensation is presented, and the effectiveness of the controller is also verified by experiments. The gravity torque of the robot is commonly defined as the known information to construct the controller in the above damping injection control approach. For practical systems, gravity torque may be difficult to obtain accurately, and the effect of friction torque and other additional injection forces could weaken the control performance of the teleoperation system. Adaptive control is a very effective way to solve the problem of model uncertainty and has been successfully applied in teleoperation systems. In addition, adaptive control can contribute to ensure the passivity and stability of the delay teleoperation system. Scholars have proposed some new damping injection control methods combining adaptive strategy. In [
24], a nonlinear proportional plus nonlinear damping (nP+nD) controller based on adaptive environmental force term is presented. In [
25], an adaptive fuzzy logic system is utilized to estimate uncertain torques, and an improved PD+d controller is proposed. In our previous work [
26], the adaptive method combined with RBF neural network was used to build an improved damping injection controller. However, the control scheme designed based on the above work only achieves asymptotic stability of the closed-loop system. It implies that the synchronization error between the master and slave robots can converge to zero when the time becomes infinite. The stability performance of finite time can ensure the system converges to a small neighborhood in a finite time, which has faster convergence speed, higher convergence accuracy, and better anti-interference ability. It can be seen that finite-time stability has more engineering application value than asymptotic stability.
The finite-time control method has also been researched and developed by some scholars for the time-delay teleoperation system. Yang [
27] presented an adaptive fuzzy finite-time control strategy to address the control issue of a teleoperation system with uncertain dynamics. In [
28], an adaptive finite-time control method is proposed to address the actuator saturation problem. In some work [
29,
30,
31,
32,
33], several finite-time control structures are proposed to realize the position tracking control of telerobotics system with the factors of model parameter uncertainty, varying communication delay, actuator saturation, output constraints, etc. These finite-time controllers are basically based on the nonsingular terminal sliding mode (NTSM) and auxiliary variables combined with speed and position errors. The research work is still relatively insufficient in spite of the development of finite-time control methods for teleoperation systems. There are still some challenges and problems that need to be solved.
The first issue is how to simplify the finite-time control scheme to reduce the complexity of controller design. The existing finite-time controller has some disadvantages, such as the complex design process and many controller gain parameters. What is far more important is that the impact of communication delay on stability cannot be obtained in the above finite time approaches. Yang [
34] proposed a new continuous nonsmooth proportional damping injection finite-time control method, which successfully simplified the finite-time controller for the teleoperation system. However, the author did not consider the unknown gravity torque, friction torque, and non-zero external force. In our paper, we expect to reduce the amount of fractional power to simplify the selection constraints of controller parameters. The second one is how to establish the relationship between communication delay, controller gain, and fractional power parameters. In this paper, we expect to use Lyapunov–Krasovskii functional for stability analysis and further establish this relationship. The last problem is how to deal with the unknown gravity, friction, and external torques, and the radial basis function (RBF) neural network and adaptive method are utilized to implement the compensation of unknown dynamics and external forces.
Based on the line of analyzing the existing work and comparing it with our previous work, the fundamental contributions of this paper are summarized as follows:
In this paper, a novel simplified finite-time adaptive proportional damping injection control strategy is proposed for the teleoperation system with asymmetric varying communication delay. Compared with most of the existing proportional damping injection control methods (such as [
19,
20,
21,
22,
23,
24,
25,
35] and our previous work in [
26]), the proposed control structure achieves the finite-time tracking performance of the master–slave robot position, and the method presented above only achieves the asymptotically stability performance. Compared with the only existing finite-time proportional damping injection control strategy in [
34], the proposed controller is more concise and has fewer controller gain parameters, which is more convenient to use in the actual teleoperation system.
The RBF neural network and the adaptive control method are presented to realize the estimation and compensation of external forces. Compared with some existing control methods, such as [
18,
19,
20,
21,
24,
25,
26], the proposed method does not require us to measure and estimate the external forces. In addition, the unknown dynamics can also be compensated.
The remainder of this paper is organized as follows. In
Section 2, we describe the dynamic models and properties of the teleoperation system, the definition of the RBF neural network, and some related mathematical definitions and lemmas. In
Section 3, the proposed adaptive finite-time control method based on the P+d control scheme is proposed and the stability of the closed-loop system is also discussed. In
Section 4, simulation experiments are implemented to verify the stability and good position-tracking performance of the system. Finally, this work is concluded in
Section 5.
2. Problem Statement
In this section, the dynamic model and some important properties of the teleoperation system are described first, and then the objectives of the system control, the concepts and properties of the RBF neural network, and some concepts and lemmas of mathematics and control theory are also stated later.
2.1. Teleoperation System Dynamics
A typical dynamic model of the telerobotics system is composed of the master robot dynamic, slave robot dynamic, and a communication model. For the master and slave robots, if they are assumed to be the manipulators with n rotational degree of freedom (dof), the dynamic models in joint space can be defined as
where for
,
,
, and
are, respectively, defined as the position, velocity and acceleration vectors of the master/slave robot at time
.
is the inertia matrix of the robot,
is the coriolis and centrifugal matrix,
is the gravitational torque.
is friction and external interference torques. The torques exerted by the operator and the environmental is
. The values of
and
are related to the force
applied by the operator to the master robot, and the force
applied by the environment to the slave robot, which can be further described as
and
. The Jacobian matrices of the master and slave robots are defined as
and
, respectively. The control torques of the master and slave robots are defined as
and
.
The robot dynamics in Equation (1) are nonlinear and have time-varying parameters. For
are defined as the master and slave robots, respectively, and there are some important properties as follows [
36,
37]
Property 1. The inertia matrix is positively symmetrical. Moreover, the following inequalities are held if the maximum eigenvalue and minimum eigenvalue are satisfied with Property 2. For any nonzero vector , the following equation always holds Property 3. For all position and velocity vectors , there exists a positive positive scalar and the following inequality is always satisfied Property 4. The time derivative of is bounded if the velocity and acceleration are all bounded.
Property 5. The gravitational torque and friction torque are all bounded.
The variable delay signals of the forward and backward communication channels are defined as and . In the control structure of the bilateral teleoperation system, the position information of the master and the slave side and are transmitted to the other side through the communication channel, which will cause the transmission delay and can be described as and . In order to simplify the expression in the controller and stability derivation process, and are introduced here to replace the expression and , respectively.
In the design of a bilateral control scheme, it is necessary to make some assumptions about communication delay, operator force, and external force signals:
Assumption 1. The asymmetric time-varying communication delays and are bounded. There are two positive constants to satisfy the boundedness of communication delays as Assumption 2. The operator force and environment force are also bounded, and they have and .
Control Objectives The control objective of this study is to develop a finite-time controller to guarantee the finite-time position-tracking control performance of the time-delay teleoperation system. This means the tracking errors should converge to a very small domain of 0 within the fixed time.
2.2. RBF Neural Network
The radial basis function (RBF) neural network is a commonly used three-layer feedforward neural network. One of its most important properties is that it can approximate any nonlinear function [
38]. Therefore, it is used to deal with laws that are difficult to analyze in the system. In addition, it also has a good generalization ability and fast learning speed. In nonlinear function approximation, data classification, system modeling, and control system design, RBF neural networks are frequently utilized because of the aforementioned characteristics. In the adaptive control method, RBF neural network is employed to estimate the unknown dynamics of the system. The radial basis function is the key part of the RBF neural network, and the Gaussian function is usually selected as the radial basis kernel function. The concept of RBF neural network can be stated as follows [
39].
For a continuous function
,
is the input vector, then the function
can be redefined by RBF neural network as
where
is the weight matrix,
k is the number of network nodes.
can also be calculated based on the Gaussian basis function as
is the the Gaussian center vector of the i-th node.
c is the width of Gaussian function. w is the bounded approximate error.
Remark 1. It should be mentioned that the weight matrix of the network is formed and obtained by the Lyapunov stability criterion in the adaptive control approach with RBF neural network. The developed adaptive learning laws can guarantee the convergence and stability of the closed-loop system. A variety of neural network input values should be considered in determining the Gauss-centered vector of i-the node . With specifying an appropriate width of c, the input data values of the neural network should fall inside the domain of Gaussian functions.
Remark 2. With the definition of the Gaussian basis function, it can be seen that for all input vectors , the Gauss function is always bounded. This property will be used in the system stability analysis.
It is a different procedure for the RBF neural network to build the adaptive control structure and directly estimate the unknown function. In general, the RBF neural network is employed in adaptive control design to replace the uncertain element of the model and normalize the input data using the radial basis function. Based on the defined weight error vector, a Lyapunov function can be developed, and the learning rate of the neural network’s weight coefficient can also be solved while guaranteeing the closed-loop system’s stability. Therefore, the learning rate (also defined as the adaptive law) should be calculated during the stability analysis of the closed-loop system. From a certain point of view, the Lyapunov function constructed by a weight error can be regarded as its cost function.
2.3. Fundamentals of Mathematics and Lemma
The norm of a time function is shown as . The norm of a time function is defined as . The definition of and spaces can be given based on the definition of and norms. The and spaces are defined as the sets and . For any time vector , the function is defined as , where p is a positive constant. The derivative of can be calculated as .
Before designing the finite-time control structure, the definition of finite-time control for nonlinear systems is given here [
40]. Consider the nonlinear system
,
f characterizes a continuous mapping
on an open neighborhood
of the origin. The zero solution
of the system is finite-time stable if it is Lyapunov stable and finite-time convergent in a neighborhood
. The finite time convergence means that there exists a function
,
is an open neighborhood of the origin, such that
and for
,
. The zero solution is globally finite-time stable when
.
Lemma 1 ([
41]).
Considering a system as , , the system is finite-time stable if there exists a continuous positive differentiable function and the following inequality can be always establishedwhere , and , is an open neighborhood of origin. If the system is globally finite-time stable, and there exists a positive scalar v satisfying , which can make the system converge to a neighborhood in finite-time T as Several mathematical lemmas are also given below for the analysis of system stability.
Lemma 2 ([
34]).
For any real numbers A, B, and , the following two inequalities hold Lemma 3 ([
34]).
For any time vectors , , and a real value function , the following inequality holds where .
4. Simulation Results and Analysis
In this section, simulation experiments are designed and implemented to analyze and verify the effectiveness of the proposed control structure. The simulation is implemented in the Simulink environment of MATLAB 2021b. In the simulation, a pair of two manipulators with 2-dof revolute joints is employed. In addition, in order to better simulate the interaction process between the operator, teleoperation system, and environment, the operator dynamic is introduced at the master robot side and the environment contact model is also employed at the slave robot side. These models are assumed to be stiffness-damping models. The detailed mathematical descriptions of operator force
and environmental force
are shown as
where for
,
and
are the end position and velocity vectors of the master/slave robots in the Cartesian coordinate system.
can be obtained through the forward kinematic of the robot, and
can be calculated through Jacobian matrices with the joint velocities.
and
are the damping and stiffness coefficients of the operator force dynamic.
and
are the damping and stiffness coefficients of the environment force model, respectively.
is the external operator force vector. In this simulation, the values of
and
are set as 10 N/(m/s),
is 5 N/(m/s), and
is 2 N/(m/s).
and
are set as
. The principle diagram of the simulation is shown in
Figure 1.
The dynamic matrices and vectors
,
,
, and
are defined as follows [
37]
where
,
,
,
,
,
,
,
,
,
, and
. For simplicity, the parameters of the master robot are defined as [
24 kg,
kg,
m. The parameters of the slave robot are set as
kg,
kg,
m.
The time-varying communication delays are considered in the simulation experiments, which are shown in
Figure 2. The curves of the external force
applied by the operator in the x-direction and y-direction are shown in
Figure 3. It can be seen that from 2 to 7 s, the operator applied an external force in the negative x-direction and positive y-direction to change the end position of the master robot. Then, the external force injected by the operator disappears to zero. However, it should be noted that at this time, the operator and environment still have forces acting on the master and slave robots. This simulation factor is not available in some related work.
Theorems 1 and 2 are used to set controller gains in the simulation to guarantee the stability and finite-time control performance of closed-loop teleoperation systems. In the master and slave controllers, the controller gain parameters are selected as , , . The gains of adaptive learning rate are selected as , . The initial joint position of the master and slave robots are and , respectively. The initial joint velocities and are zeros.
We carried out three parts of simulation experiments to verify the stability of the closed-loop system, the influence of power on the control performance, and the advantages of finite-time control performance. The results of these simulations are shown in
Figure 4,
Figure 5,
Figure 6,
Figure 7,
Figure 8,
Figure 9,
Figure 10,
Figure 11,
Figure 12,
Figure 13,
Figure 14,
Figure 15,
Figure 16 and
Figure 17, respectively.
4.1. Verification of System Stability
Figure 4 and
Figure 5 show the position-tracking performances of the closed-loop system.
Figure 4 represents that the joints of master and slave robots have position deviation with the action of the extra force of the operator. When the extra injection forces of the operator are removed, the joint positions of the master and the slave robots quickly realize mutual tracking.
Figure 5 gives the results of the joint position difference between the master robot and the slave robot at the same time. These results vividly present the position tracking effect of the master and slave robots. Seven to eight seconds after the injection forces are removed, the master robot and slave robot quickly realize the position-tracking performance without static error. These results demonstrate that stable and efficient position-tracking control performance can be achieved with the proposed control method. In addition, the results also verify some important conclusions obtained from the above theorem proving.
Figure 6 shows the curves of operator force and environmental force. The upper subfigure is the operator force, and the lower one is the environmental force. It can be seen that in the teleoperation system, the operator exerts a non-zero force on the master robot, and the environment also exerts a force on the slave robot. In addition, the stability of the operator and environmental force results further shows that the closed-loop system is stable.
Figure 7 shows the control values of the master and slave controllers.
In order to verify the boundedness of adaptive learning parameters, the curves of adaptive learning parameters
,
,
, and
are also shown in
Figure 8 and
Figure 9 below.
The above simulation results verify the statement of Theorem 1 and the conclusions obtained from its proof expression. The whole closed-loop teleoperation system is stable, and the state variables for constructing the Lyapunov function are also bounded.
4.2. Effect of the Power Coefficient
In the second part of the simulation, the influences of the power parameter
of the master and slave controllers are shown and analyzed. We choose the different power values of
as
. In this simulation, the initial position status of the master and slave robots are all set to 0.
Figure 10 and
Figure 11 show the control results based on these power parameters.
From
Figure 10 and
Figure 11, it can be seen that the larger the power is, the slower the convergence rate has. When
, the control scheme is degraded to the traditional proportional damping injection controller. However, the faster convergence speed will cause the oscillation of the control torque, which further limits its application in the actual robot system. It also further shows that the finite-time control structure proposed in this paper has a faster convergence rate than the traditional proportional damping injection control structure.
4.3. Comparison of Control Performance
For this subsection, we chose three representative P+d control methods, applied them in the simulation, and compared the results of these methods with the proposed control scheme to confirm the effectiveness. There are three instances created based on these P+d control strategies. Case 1: in [
23], De Lima proposed a simplified P+d control method, and gravity compensation is also considered in the controller. In the simulation experiment, the gravity torque vector of the control law in case 1 is inaccurate, and we intend to demonstrate the benefits of the adaptive method in our method by evaluating the position-tracking performance with Case 1. Case 2: in [
12], Yang introduced the adaptive term to compensate for the gravity torque and proposed an adaptive P+d control strategy. Case 3: in [
22], this control strategy employs hybrid control terms, and a new control scheme is designed based on the P+d control approach. In Case 3, the operator and environment torques need to be measured and sent into the controllers. In the proposed approach, the RBF neural network is introduced to estimate the external force, and we expect to evaluate the performance of external force estimation by analyzing and contrasting the method of Case 3. Additionally, as these three comparison approaches are non-finite-time performance control structures, we expect to validate the efficacy of the proposed control method by contrasting the simulation results of these non-finite-time methods.
Figure 12,
Figure 13,
Figure 14,
Figure 15,
Figure 16 and
Figure 17 show the detailed simulation results and comparisons.
The joint position-tracking performances based on the Case 1 method are given in
Figure 12 and
Figure 13. These results show that the position-tracking performances of the master and slave robots can reach stability after the additional injection forces are removed. The time when the final tracking error converges into the stable region is around 10.5 s. From the curve of position-tracking errors in
Figure 13, it can be seen that the proposed control approach has a higher convergence accuracy and speed than the Case 1 method.
Figure 14 and
Figure 15 show the joint position tracking of the master and slave robots with the method in Case 2. It can be seen that the position-tracking performance in the method of Case 2 has steady-state errors in the first joint. In the control structure of Case 2, the authors only introduce the adaptive control method in the slave controller. However, the proposed finite-time control scheme still has advantages in the convergence speed and accuracy of position tracking.
Figure 16 and
Figure 17 show the joint position tracking of the master/slave robots with the method in Case 3. In Case 3, the author has proved that with the action of operator force and environmental force, when
,
is held. Therefore, theoretically, if the time is long enough, the convergence precision of Case 3 is similar to the finite-time control method we proposed. The simulation results illustrate this point. However, it is clear that the convergence speed of the finite-time control performance is faster, demonstrating the efficacy of the suggested control method.
The comparisons of these three P+d control methods show that the proposed finite-time control scheme has higher convergence speed and accuracy in position tracking error performances. From
Figure 15,
Figure 16 and
Figure 17, it can be seen that based on the proposed method, the position errors of the master–slave robot have converged to a small neighborhood of 0 in about 8 s, while the performances with Case 1 and Case 2 have converged to a relatively stable state in about 11 s and 9.5 s, respectively, and neither of them can converge to the small neighborhood of 0, which means that there are steady-state errors. The root mean square error (RMSE) index is introduced here to quantify the speed and accuracy of position-tracking performances of the proposed approach and these three methods. The RMSE is defined as follows:
where
represents the position tracking error of each joint of the robot,
n denotes the length of the error sequence. The smaller the RMSE, the faster the tracking and the higher the accuracy of the position performance. In order to avoid the impact of the extra force, only the RMSE values of the system after 8 seconds are analyzed and calculated. The RMSEs of the position tracking errors with these three control schemes are shown in
Table 1. It can be seen that the tracking errors with the proposed control method have smaller RMSE values, which means that the master and slave robots have better position-tracking performance.