1. Introduction
The dynamic positioning (DP) system can control a ship to keep it in the desired position or navigate on a pre-determined track by using thrusters actively and automatically to balance the environmental forces caused by wind, waves, current, etc. The DP technique has been widely applied in many marine engineering industries, such as offshore drilling and deep water pipe laying [
1].
Trajectory tracking is an essential task for DP ships. Typical trajectory tracking tasks for DP ships including oceanic exploration, area target searching, pipe laying, etc.; some of these tasks require a high precision in tracking performance. Therefore, the design of a high-level DP controller with high performance is crucial. However, as the ship is a nonlinear, strongly coupled, and constrained system, it brings severe challenges to the design of the controller. Moreover, the DP ship is frequently perturbed by unknown environmental disturbances, and anti-disturbance should be solved. Various control theories have been resorted to for trajectory tracking problems by plenty of researchers. A nonlinear trajectory tracking controller was designed by backstepping in [
2]; this method can avoid the linearization of the model of the ship. The dynamic surface control technique, which can prevent “computation explosion” in backstepping design, was applied in the controller [
3]. This controller successfully made the underactuated ship track a smooth curve trajectory. However, this technique requires high accuracy in the controlled model. Under unknown environmental disturbances and model uncertainties, the sliding mode control was adopted in the ship trajectory tracking controller due to its robustness [
4,
5]. The sliding mode control theory combined with the control allocation algorithm was adopted in an AUV to complete the task of trajectory tracking and station-keeping [
4,
5]. The simulation results and experimental results in [
4,
5] verified the tracking and station-keeping performance and the system robustness under the effects of unknown disturbances. However, the “chattering” effect in sliding mode control cannot be entirely eliminated. Observers were introduced to the control system to estimate unknown disturbances and model uncertainties in [
6,
7,
8,
9]. The estimated values were utilized to design the trajectory tracking controller such that the ship could overcome the influence of disturbances and uncertainties. Corresponding simulations showed that the trajectory tracking accuracy was improved. Besides the trajectory tracking of ships, the strategy of the controller combined with the observer is also applied in many other fields. In [
10], a fast terminal sliding mode control technique based on the disturbance observer is developed to stabilize the underactuated robotic system. Both simulation results and experimental validations on a cart-inverted pendulum system were provided to demonstrate the effectiveness of the proposed method. A lumped perturbation observer-based extended multiple sliding surface control method was presented for the SISO system with matched and unmatched uncertainties, and numerical simulation was performed to verify the efficacy of the proposed method compared with integral type sliding mode control and dynamic surface control [
11]. With the development of intelligent control theories, fuzzy control, adaptive control, neural network, etc., they were combined with other advanced control theories to achieve ship trajectory tracking [
12,
13,
14]. In [
15], an adaptive non-singular fast terminal sliding mode control with an integral surface for the tracking control of nonlinear systems with external disturbances was developed. The disturbances were compensated for by an appropriate parameter-tuning adaptation law, and the tracking task was accomplished by a new fast terminal sliding scheme with a self-tuning algorithm to guarantee control inputs were chattering-free. Integrating these different theories can overcome each other’s drawbacks and improve control performance, but the design of such controllers is relatively complicated. Input constraints are not considered in most of the above controllers, which may deteriorate control performance and even the stability of the system in practical applications. In fact, some techniques can solve the problem of control input constraints, such as the composite nonlinear feedback (CNF) method. A CNF controller was developed to achieve the reference tracking of uncertain time-delayed linear systems constrained by actuator saturation. The simulation results illustrated that the transient performance was improved [
16]. However, the CNF method has not been fully developed for nonlinear systems.
Compared with other control techniques, the distinct advantage of Model Predictive Control (MPC) is its capability to cope with system constraints explicitly and optimize control performance, so it is widely applied in various industries [
17]. However, stability cannot be guaranteed due to the optimal control problem with a finite horizon [
18]. According to recent research, there are two main ways to achieve stability. The first is constructing the appropriate terminal cost and constraint set for the optimal control problem [
19,
20,
21], but there has yet to be a unified constructing method. The second is choosing a sufficient large prediction horizon [
22]. This can avoid creating a terminal cost and constraint set, but the larger the prediction horizon, the greater the computing burden on MPC. Unlike the two ways above, a Lyapunov-based MPC (LMPC) scheme was proposed. In this scheme, a Lyapunov contraction constraint was established in the optimization problem so that LMPC could inherit the stability property of the designed virtual Lyapunov-based controller [
23,
24]. The LMPC scheme was employed in [
25,
26] to achieve trajectory tracking of AUVs. The recursive feasibility and stability were theoretically guaranteed by choosing appropriate controller parameters, but the suggested selection guidelines were conservative.
For the problem of unknown environmental disturbance affecting control performance, a nominal nonlinear MPC controller and a sliding mode controller which handled disturbances specifically were designed [
27]. The final control law was obtained by adding the control laws of the two controllers above. ESO was employed to estimate environmental disturbances online, and then a feedforward compensation was provided to the nominal MPC trajectory tracking control law [
28,
29]. However, the final control law in [
27,
28,
29] may violate the input constraints. In [
30], environmental disturbances were estimated through a disturbance observer, and the maximum estimation error of the observer was used to tighten the input constraints. The tightened constraints could ensure that the final control law would satisfy the input constraints. Tube-based MPC schemes were developed in [
31,
32], which could keep the trajectory under disturbances in a small tubular region centered on the nominal trajectory. However, many controller parameters need to be determined in the tube-based MPC scheme, placing a high computing burden on the processor.
In view of the research above, we focus on the trajectory tracking control of perturbed DP ships in this paper. Enlightened by the work in [
25], a novel trajectory tracking control scheme is developed for DP ships, in which environmental disturbances and input constraints are taken into consideration. The major features of the proposed control scheme are summarized below:
- (1).
By incorporating the ESO into the LMPC scheme, the anti-disturbance LMPC (ADLMPC) scheme is developed. Estimations of environmental disturbances by ESO are utilized to update the prediction model online rather than providing feedforward compensation on MPC control law. Not only can this improve the control performance, but it avoids the control law violating input constraints.
- (2).
A new-type Lyapunov contraction constraint involving a virtual control law designed using the backstepping technique and auxiliary dynamic system is added to the ADLMPC scheme. As the virtual control law meets the requirements of the control inputs, both the recursive feasibility of ADLMPC and its closed-loop stability are guaranteed naturally. Compared with the LMPC controller proposed in [
25,
26], the proposed ADLMPC controller parameters selection guidelines are more flexible due to the rigorous range of parameters for the LMPC controller that must be satisfied to guarantee recursive feasibility and closed-loop stability.
The rest of this paper is structured in the following sections.
Section 2 presents the mathematical model of the DP ship at low speed.
Section 3 introduces the control objective and the design of the ADLMPC scheme. Additionally, an analysis of recursive feasibility and closed-loop stability is provided in
Section 3.
Section 4 performs simulations and shows the analysis of simulation results.
Section 5 comes up with conclusions.
The following notations are used in this paper: the Euclidean norm is denoted by ; diag{·} indicates diagonal operation. and are the maximum eigenvalue and minimum eigenvalue of the matrix within the brace, respectively.
3. ADLMPC of Low-Speed Trajectory Tracking Based on ESO
An MPC controller is a natural choice for the objective above due to its capability to handle constraints. Considering its recursive feasibility, closed-loop stability, and anti-disturbance ability, a novel ADLMPC scheme of low-speed trajectory tracking is developed by integrating LMPC with ESO.
Figure 2 shows the architecture of the novel trajectory tracking control scheme.
In the ADLMPC scheme, ESO can estimate environmental disturbances d. The prediction model can provide predictive state over the prediction horizon and be updated online through estimated environmental disturbances , improving the prediction capability and control performance. The optimization module can determine the control input τ based on tracking error ζe and predictive state ζP. Moreover, the Lyapunov contraction constraint is added to the optimization module to guarantee closed-loop stability.
3.1. ESO Design
To facilitate the design of the ESO, Equation (3) can be rewritten as
The ESO can be developed for Equation (7) as
where
and
are estimations of
v and
d, respectively;
and
are observer gain matrices that can be designed. Let the speed estimation error and the disturbance estimation error be
and
, respectively. Considering Assumption 2, the estimation errors above can converge to a neighborhood of the origin by determining appropriate
β1 and
β2. Details of the proof can be seen in [
33].
3.2. ADLMPC Controller Design
Based on the ESO and the control objective in
Section 2, the optimization problem at time
t0 can be defined by
where
N denotes the prediction horizon and
Q and
R represent weighting matrices which are diagonal and positive-definite. In detail,
Q can be adjusted to specify the emphasis placed on the trajectory tracking error of particular states, while
R can be tuned to describe the energy cost of the control inputs. For example, if a high precision of trajectory tracking is needed, the values of elements in
Q can be set large. Additionally, if energy consumption is limited in some cases, the values of elements in
R can be set large.
τmpc is the control input produced by the ADLMPC controller;
τmax and
τmin are the maximum and minimum
τmpc;
and
are the maximum and minimum change rates of
τmpc. Equation (9e) means the constraints of ship speed states
u,
v, and
r, due to the DP ship, are tracking trajectory at low speed. Equation (9f) is the Lyapunov contraction constraint which exploits a virtual Lyapunov-based trajectory tracking control law
τvir. It should be noted that
τvir does not actually control the DP ship but only guarantees closed-loop stability.
V(·) is the corresponding Lyapunov function candidate for Equation (6).
Remark 2: Equation (9a–9f) has to be discretized first to solve the optimization problem numerically.
Then, we need to design τvir and V properly. Many techniques, such as sliding mode control, backstepping, and dynamic surface control, can be utilized to develop τvir. Considering the complexity of the controlled system and input constraints, we incorporate the auxiliary dynamic system into the backstepping technique to develop τvir.
Define the following variables:
where
and
are position error and speed error, respectively;
denotes reference position; and
is intermediate control.
By differentiating both sides of Equation (10), we can obtain
In order to stabilize Equation (10),
vd can be designed as follows:
where
is a specified control gain matrix that is positive-definite and diagonal.
From Equations (7) and (10), the time derivative of δ is
where
τc denotes the command control input without consideration of input constraints;
is the error between command control law and virtual control law which satisfies input constraints. To weaken the impact of Δ
τ on control performance, we introduce the following auxiliary dynamic system [
34]:
where
is a specified control gain matrix which is positive-definite and diagonal;
is the state of the auxiliary dynamic system.
Based on
ξ, the new speed error
considering input constraints is defined as
Considering Equations (14) and (15), the time derivative of
becomes
In order to stabilize Equation (17),
τc can be designed below
where
is a specified control gain matrix that is positive-definite and diagonal.
According to Equations (9c), (9d) and (18),
τvir can be presented as
where
h denotes the sampling period;
is the saturation function of the control input, and
is the saturation function of the change rate of the control input.
The Lyapunov function candidate
V for Equation (6) is designed below:
By differentiating both sides of Equation (20), the following equation can be obtained:
Based on Equations (12), (13), (15), (17), (18) and (21), the right-hand side of Equation (9f) can be modified to
where
ξvir is the state of the auxiliary dynamic system affected by
τvir.
According to Equations (7), (12), (15), (17) and (21), the left-hand side of Equation (9f) can be described as
where
ξmpc is the state of the auxiliary dynamic system affected by
τmpc.
Therefore, the expression of Equation (9e) is
3.3. Stability Analysis
In the last section, the Lyapunov contraction constraint (Equation (24)) is established, but the choice of control gain matrices K1, K2, and K3 is not discussed. To satisfy the recursive feasibility of ADLMPC and closed-loop stability, we analyze the selection guidelines of K1, K2, and K3 in this section.
Theorem 1. Provided that control gain matrices K1, K2, and K3 ensure
, , and , the command control input τc (Equation (18)) designed for the mathematical model of the DP ship (Equation (6)) can guarantee the uniform ultimate boundedness of all signals in the trajectory tracking control system under virtual control law τvir.
Proof . According to Young’s inequality and Equation (22), the following inequality can be obtained:
where
and
. □
To solve Equation (25), we can obtain
Equation (26) indicates that if
then , , and , . Therefore, V(t) is uniformly ultimately bounded. In accordance with Equation (20), we can judge the uniform ultimate boundedness of e, , and ξ. Further consideration of Assumption 1 together with Equations (10), (11), (13) and (16) concludes that the uniform ultimate boundedness of η, v, and vd are all satisfied, which completes the proof.
Then, we analyze the recursive feasibility of ADLMPC and closed-loop stability of the system controlled by
τmpc. Input constraints are considered in the design of virtual control law
τvir(
t) in ADLMPC. Therefore
τvir(
t) is always the feasible solution of Equation (9), which naturally satisfies the recursive feasibility. In terms of closed-loop stability, the following inequality can be obtained based on Equations (9f) and (25):
The solution of Equation (27) is similar to Equation (26), which means the uniform ultimate boundedness of all signals in the trajectory tracking control system are satisfied under the MPC control law τmpc.
4. Simulation Results
In this section, comparative simulations of the trajectory tracking of the model DP ship using Matlab are conducted to verify the efficacy and superiorities of the proposed ADLMPC scheme. The parameters for the model DP ship are taken from a supply vessel named Northern Clipper [
35].
The reference trajectory adopted in simulation case 1 is a circular trajectory defined by
The disturbance on the model ship in simulation case 1 is assumed to be . The initial state of the model ship is . The upper bound and lower bound of speed states u, v, and r are set as ±3 m/s, ±2 m/s, and ±1°/s, respectively. The maximum and minimum force and torque generated by thrusters are and , respectively. The minimum and maximum change rates of force and torque are and .
For the ADLMPC controller in simulation case 1, the prediction horizon N and sampling period h are set to be 25 s and 0.5 s, respectively; weighting matrices are designed as and ; control gain matrices are developed as , and ; observer gain matrices in ESO are chosen as and . The virtual controller developed through backstepping and an auxiliary dynamic system in ADLMPC is the comparative controller. The choices of control gain matrices are the same as those in the ADLMPC controller. Moreover, the ESO should be incorporated into the virtual controller to estimate and compensate for disturbances. The main reason for choosing the virtual controller as the comparative controller is that the control law generated by the virtual controller not only can make the ship track the reference trajectory, but also satisfies the control inputs constraints.
To solve the optimization control problem Equation (9) numerically, the software package called CasADi is selected. This package is developed to assist rapid and efficient implementation of different methods for numerical optimal control [
36]. Due to Equation (9) being a quadratic programming problem, this package solves the problem using the sequential quadratic programming (SQP) method, using the IPOPT solver.
Simulation results of case 1 are presented in
Figure 3,
Figure 4,
Figure 5,
Figure 6,
Figure 7,
Figure 8,
Figure 9 and
Figure 10. Specifically, the ship
trajectories controlled by the ADLMPC controller and comparative virtual
controller with ESO (VIR+ESO) are illustrated in
Figure 3 and
Figure 4. As we can see, both controllers show high qualities of control performance, which can accurately control the model ship perturbed by environmental disturbances to track reference trajectory. However,
Figure 3 illustrates that the model ship can track the reference trajectory more rapidly with the proposed ADLMPC controller. Meanwhile,
Figure 4 demonstrates more detailed information. The position tracking error
and the heading tracking error of both controllers are presented in
Figure 5, which indicates that tracking errors resulting from both controllers converge to a small neighborhood of the origin. The efficacy and closed-loop stability of both controllers are verified. In detail, by the effect of ADLMPC and the virtual controller, the position tracking error achieves convergence in about 100 s and 300 s, and the corresponding steady-state error is near 0 and about 0.5 m. Moreover, the mean square errors (MSE) for both controllers are listed in
Table 1. It can be seen from
Table 1 that compared with the VIR+ESO, the MSEs of position states (
x and
y) with the proposed ADLMPC controller appear to have different degrees of decline. The improvements in the MSEs of position states with the ADLMPC controller are 20.2% and 73.5%, respectively. This information shows that no matter the accuracy of tracking or the convergence time, the ADLMPC controller’s tracking performance is better than the virtual controller, which mainly results from the optimization module in ADLMPC. This module can provide optimal or sub-optimal control law by solving the optimization problem online, which means that thrusters can be utilized more efficiently.
Figure 6 and
Figure 7 describe the control inputs of both controllers, from which we can observe that control inputs do not violate the input constraints. Additionally, the torque generated by the ADLMPC controller in the first 100 s is more significant than that generated by the virtual controller. This means that the ADLMPC controller fully utilizes the thrusters to track the reference trajectory rapidly. The rates of change in control inputs produced by both controllers also satisfy the control input constraint in
Figure 8. However, the rates of change in control inputs generated by the virtual controller are much more severe, aggravating the wear of thrusters and reducing their lifetime. It can also be understood that the virtual controller has to frequently change the thrust to achieve a similar tracking performance as the ADLMPC controller.
Figure 9 illustrates the variation of speed states (
u,
v, and
r) over time. Although both controllers can control the model ship to track reference speed, the state of
r controlled by the virtual controller is out of the upper limit in 14–32 s. The main reason is that the speed state constraints are not considered when designing the virtual controller. This highlights the advantage of the ADLMPC controller in tackling system constraints explicitly. The estimation performance of the designed ESO is plotted in
Figure 10. It reveals that the designed ESO can accurately estimate the actual disturbances. Regarding the computing burden of the ADLMPC controller, the average calculation period of the optimization control problem Equation (9) is around 70 ms, which means that the ADLMPC has positive real-time performance.
To demonstrate the effects of the parameters for the ADLMPC controller on tracking performance, another simulation (simulation case 2) with a different set of parameters for the ADLMPC controller is performed. In this simulation case, the reference trajectory and environmental disturbances are the same as in case 1. However, the weighting matrices
Q and
R are changed to
and
. The remaining parameters are maintained, including the prediction horizon, control gain matrices of the virtual controller, and observer gain matrices. The initial states of the model ship and environmental disturbances are also the same as those in case 1. Simulation results are shown in
Figure 11,
Figure 12,
Figure 13,
Figure 14,
Figure 15 and
Figure 16, and ‘ADLMPC1’ and ‘ADLMPC2’ in these figures mean the ADLMPC controller with weighting matrices
Q1,
R1, and with
Q2,
R1, respectively.
The ship trajectories controlled by the ADLMPC1 controller and the ADLMPC2 controller are demonstrated in
Figure 11,
Figure 12,
Figure 13 and
Figure 14. The ADLMPC1 controller shows a high accuracy in tracking performance under the effects of environmental disturbances, whereas the quality of tracking by the ADLMPC2 controller deteriorates. Specifically,
Figure 13 and
Figure 14 illustrate that instead of maintaining the ship position and speed on the reference values, all these states fluctuate near the reference values. On the other hand,
Figure 15 and
Figure 16 show whether the magnitude of control inputs or rate of change in control inputs generated by the ADLMPC2 controller are smaller than those generated by the ADLMPC1 controller, especially in the first 100 s. The main reason for the phenomena above is that the values of elements in
Q1 are larger than those in
Q2, while the values of elements in
R1 are smaller than those in
R2. A decrease in Q and an increase in R mean that we pay more attention to the energy cost of thrusters rather than the accuracy in tracking. Therefore, the ADLMPC controller reduces the energy cost by degrading the tracking performance.
To investigate the advantage of anti-disturbance in the ADLMPC controller compared with the LMPC controller proposed in [
25,
26], we conduct the simulation (simulation case 3) under environmental disturbances. For effective comparison, the parameters for the LMPC controller and ADLMPC controller are set to the same values as those of the ADLMPC controller in case 1. The initial states of the model ship and environmental disturbances are also the same as those in case 1. Simulation results are shown in
Figure 17,
Figure 18 and
Figure 19. These figures demonstrate that the accuracy of trajectory tracking in the effects of environmental disturbances in the ADLMPC controller is much higher than that of the LMPC controller. It can be seen from
Table 2 that the MSEs of
x,
y, and
ψ with the LMPC controller are more than 8 times, 20 times, and 10 times larger than those with the ADLMPC controller. These simulation results can illustrate the effectiveness of anti-disturbance with the ADLMPC controller.