The
n-th order uncertain linear or nonlinear control plant is
where
u is the command input,
and
b are unknown parameters (constant or slowly time varying),
,
, ⋯,
are linear or nonlinear functions where the derivative orders are equal to or less than
, the state variable
which has the superscript
means the
-th-order derivative of the first state
x. The control target is to design a control system to generate the input command
u that can drive the system’s state vector
to follow a designated trajectory
, regardless of model uncertainties and measurement noises.
The control strategy proposed in this paper contains two layers: an adaptation layer and a robustness layer. In the adaptation layer, unknown parameters
and
b are estimated online based on the plant’s past input and output data, by using the BLS estimator. The estimations are denoted as
and
. Then, in the robustness layer, a sliding mode controller is designed assuming that there are bounded errors between the estimations and true values of the model parameters. It is important to remark that for the control plant that could be well modeled with bounded and slowly time-varying parameters, either of the adaptation and robustness layers would be sufficient to achieve a high control performance. However, the control strategy we proposed requires both layers to deal with more intractable control plants as we will discuss later in detail. Both the BLS estimator and the sliding mode controller have been proposed and discussed in our previous work [
15,
16,
17]. Here, we will briefly present how the BLS estimator and the sliding mode controller work and then describe the control strategy that could fully take advantage of them. At the end of this section, an illustrative example is conducted for demonstration purpose.
2.1. Adaptation Layer
To estimate the model parameters, the control plant is rewritten as the following form:
where
and
are calculated from the input signal
u and measurements;
contains all the model parameters in sequence or expressions of them. The most simple configuration is given by
where the full states from
x to
need to be observable. In most cases, only limited states (normally low-order states) could be directly measured, e.g.,
x, and sensor noises are contained. Therefore, Equation (
3) could be constructed by using some tricks, such as applying both sides of Equation (
1) through a low-pass filter to get rid of high-order states and measurement noises. This process will be demonstrated in
Section 3.3.
The
estimation,
predicted output,
estimation error and
prediction error are denoted as
,
,
and
, respectively. In addition, the following expressions are satisfied:
The typical least-squares method for parameter estimations is to minimize
where the function
returns the
-norm of a vector or matrix. This method considers all the past prediction errors and thus has a strong capability to estimate constant parameters with respect to disturbances and measurement, which will be well smoothed out during the estimation process [
18]. On the other hand, it shows poor performance in tracking time-varying parameters because past data are generated by past parameters. To overcome this deficiency, the following error criteria
are adopted instead of Equation (
8). In Equation (
9),
is the
weighting coefficient which puts more attention on the recent data and decreases the influence of the remote data;
is the time-varying
forgetting factor that needs to be tuned based on the estimation errors.
The optimum estimation
should satisfy:
where
estimation gain is introduced here, given by
The first-order derivatives of
and
could be deduced as follows:
The derivations of Equations (
13) and (
14) are based on the following formulas:
and if
then
According to Equation (
12),
, making it impossible to initiate the estimator in simulation. Therefore, Equation (
12) is modified as
where
is defined manually (should be a positive definite matrix with relatively high eigenvalues). Because
monotonically decreases and
Equation (
19) is approaching Equation (
12) as time increases; thus, the error caused by
is gradually eliminated. The first-order derivatives of
and
remain the same as Equations (
13) and (
14) with the adoption of Equation (
19).
The forgetting factor is tuned using
where
,
. Equations (
13), (
14) and (
21) together form the BLS estimator. Larger values of
and
indicate “faster” forgetting, which could increase the estimator’s capability in tracking time-varying parameters. Nevertheless, higher weights on the small range of recent data points would enlarge the negative effects caused by noise, disturbance, modeling errors and uncertainties, and thus create more oscillations in estimations. Therefore, the selections of
and
involve a trade-off between the capabilities of tracking time-varying parameters and rejecting noise and disturbance.
2.2. Robustness Layer
The estimated values of the plant’s parameters:
and
are obtained from
and then are treated as the nominal values of plant’s parameters. The errors between nominal and true values are assumed to be bounded:
where
and
are
estimation error boundaries. Additionally, the sign of
b is known and constant, and the signs of
b and
are identical during the estimation, i.e.,
This assumption is reasonable because the input gain
b is normally constant or varying within a small range.
Sliding mode controllers have been widely adopted for the tracking control of systems which have bounded model uncertainties like Equation (
22) [
19,
20]. The methodologies of the sliding mode controller are presented below. The
compact error combining tracking errors of full states is defined as
where
is a positive constant,
is the first state tracking error. For instance,
,
. Equation (
24) indicates that
e could be regarded as scalar obtained by applying the low-pass filter
on
E for
times, where
s is the Laplace variable. It can be proven [
18] that if there exists a small positive value
that satisfies
then we have
Thus, the tracking of the states
x,
, …,
is equivalent to stabilizing the single scalar
E. It can be obtained that
where
is regarded as the
sliding surface, which is the perfect condition where there are no tracking errors. A sliding mode controller is trying to ensure that the distance from
E to the sliding surface always tends to decrease, i.e.,
despite the presence of model uncertainties. Equation (
29) can be reformed as the following inequality:
where
is a positive constant. Equation (
30) can be also formed based on Barbalat’s lemma [
18], by choosing an energy-type Lyapunov function
and making
to guarantee the boundedness of
E.
The input command is designed as
where
is the
nominal command to achieve
, assuming that the nominal model parameters
and
are used in Equation (
27), and
is the
correction input that is aimed to reject the divergences caused by parameter uncertainties. The ideal form of
is
where
k is the
correction gain that needs to be tuned, and
However, the adoption of Equations (
30) and (
35) would create undesirable chatterings/oscillations around the sliding surface, which might excite unmodeled high-frequency dynamics of the control plant or even make the system unstable [
15,
18]. To avoid such a scenario, the boundary layer
is used to smooth out the command input. The compact error
E is driven back into the boundary layer
rather than the strict sliding surface
; therefore, the following relationships are established
which can be rewritten as
The correction input is modified as
where
Taking Equations (
33), (
34) and (
39) into Equation (
27) yields
Based on Equations (
22) and (
23), the following inequalities are obtained:
where
Two situations: (1). and (2). are discussed below.
(1).
Equations (
38) and (
41) lead to
Because
k can be set as
(2).
Equation (
41) yields a first-order low-pass filter on the compact error
E:
The filter’s input (right side of the above equation) can be interpreted as the perturbation due to parameter uncertainties. Because all the parameters are bounded, the compact error
E will be stabilized in a finite time. The filter’s cut-off frequency is
where
is the frequency limit we set to make the filter work properly. Higher value of cut-off frequency results in lower filter gain, which is beneficial to stabilize
E, while it also might excite unmodeled high-frequency dynamics. There, the frequency limit
is set as high as possible within the bandwidth of the plant model. Equation (
50) results in
Equations (
48) and (
51) together are used to tune
and
k online (initial values:
,
), which works as a first-order exponentially stable filter, leading to the global boundedness of
and
k. With
and
k being determined, the input command
u is obtained according to Equations (
33), (
34) and (
39). The establishment of the sliding mode controller is complete.
2.3. Control Strategy
The layers of adaption and robustness presented above make it is possible to design the control system based on a reformed control plant instead of the original control plant. The reformed control plant could be totally different and much more simpler compared to the original control plant, where their differences are compensated by the control system. In this way, the efforts to determine the control plant for a complicated system as precise as possible could be saved, and the design process of the control system could be significantly simplified.
In the classic sliding mode control where no estimator is implemented, the plant’s parameters need to be bounded in known ranges; thus, the control plant should be carefully modeled. The proposed control strategy does not require that, but on the other hand, it results in an issue that the estimation error boundaries
and
cannot be determined prior to deploying the control system. To address this issue,
and
can be set as relatively large values or updated online based on the estimations
and
, e.g., by using
This approach is conservative and can cause a high value of correction gain
k which would make the controller very aggressive if no boundary layer is adopted. Fortunately, large
and
also results in large boundary layer to increase the probability that the low-pass filter on
E works, and the filter’s cut-off frequency
will not change much according to Equation (
50). Therefore, the control system would still work properly. The proposed control strategy is demonstrated by the following illustrative example.
2.4. An Illustrative Example
A forced Van der Pol oscillator is considered in this illustrative example:
The true values of parameters are
which are unknown to us. In the following, we will implement control systems based on three different control plants. The first one is the
original plant given by Equation (
53); the second and third ones are
and
respectively, which are referred to as
reformed plant 1 and
reformed plant 2. The reformed plant 1 replaces the nonlinear item
in Equation (
53) with a linear item
, and the reformed plant 2 takes a further step to remove that nonlinear item.
The control systems adopt same parameters, which are
for the BLS estimator, and
for the sliding mode controller. The initial estimations of the original plant’s parameters are set as
which are close to the true values, assuming that we have a basic understanding of the original plant. However, there is no such information for the reformed plants; therefore, the initial estimations of the reformed plants’ parameters are simply set as
The designated displacement trajectory
for this example is a randomly generated band-limited white noise with cut-off frequency: 3 Hz, power spectral density:
, sampling frequency: 1024 Hz and time duration: 8 s. The designated velocity trajectory
is obtained by taking derivative operation on
.
The displacement and velocity tracking results of the three control plants are shown in
Figure 1. All three control plants could result in a good tracking performance without obvious differences among them.
To quantitatively compare three control plants, displacement and velocity relative tracking errors (abbreviated to dis. err. and vel. err.):
are calculated. The peak dis. and vel. err. of three control plants are listed in
Table 1. It can be inferred from
Table 1 that the peak errors of reformed plants are kept at a low level (less than 1%) and are close to those of the original plant. Additionally, the reformed plant 2 even achieves better displacement tracking performance than the original plant. Therefore, both reformed plants can be used for the design of control systems.
The true values and estimations of the original plant’s parameters are shown in
Figure 2. Although there are oscillations existing in the estimations, the variations of parameters are generally well tracked by the BLS estimator. For the reformed plants, the parameter estimations are shown in
Figure 3 and
Figure 4, which are varying in large ranges:
and
The model errors of the reformed plants are essentially compensated by the large oscillations of their parameters. It is important to remark that the parameter ranges of the reformed plant 2 is larger than those of the reformed plant 1. The reason behind this observation is that the reformed plant 2 is more simple than the reformed plant 1; thus, higher parameter variations of the reformed plant 2 are required to compensate the plant errors. The numerical example conducted in this section proves that the proposed control strategy is feasible.