1. Introduction
In practical applications, most systems are nonlinear and subject to unknown disturbances, both matched and mismatched [1]. Because of these unknown nonlinearities, designing controllers for such uncertain systems is difficult. Over the past few decades, most applications have simply applied traditional linear control methods to inaccurate nominal models, an approach that is increasingly shown to compromise accuracy and overall performance. Control strategies that do not depend on a model, such as PID [2], adaptive control [3,4], fuzzy logic systems [5,6], and neural networks [7], have achieved some success when model information is unknown. However, their parameter tuning is largely trial and error, and none of them is general-purpose: a controller and its parameters designed for one plant cannot simply be transferred to another.
Recently, sliding mode control (SMC) has been widely used in nonlinear system control [
8,
9,
10,
11]. Owing to its high-frequency switching action, SMC is insensitive to disturbances, so traditional SMC is used in many applications to offset the effects of lumped uncertainty. Traditional SMC requires the upper bound of the perturbation to be known in order to select an appropriate switching gain. In most scenarios, however, the form and size of the disturbance cannot be known in advance, so an excessively large switching gain is chosen to guarantee stability, which leads to severe output chattering. In response, many new sliding mode control strategies, such as terminal sliding mode control (TSMC) [
12,
13], super-twisting sliding mode control (STSMC) [
14,
15], and adaptive sliding mode control (ASMC) [
16,
17], have been widely studied and applied. Among them, ASMC incorporates adaptive techniques to adjust the sliding mode gain automatically according to the real-time state of the system, which removes the need to know the upper bound of the total disturbance. In [18], a robust ASMC is proposed for boost converter control where the load and input voltage are unknown. However, when the system contains many unknown uncertainties, such as unmodeled dynamics, parameter variations, and external disturbances, sliding mode control alone cannot fully compensate for the disturbances. Therefore, to improve the disturbance rejection and overall performance of the system, the unknown uncertainty must first be estimated and compensated, after which the system can be effectively controlled by a strategy such as SMC.
The key to controlling uncertain nonlinear systems is therefore how to estimate the total disturbance. Neural networks (NN) and model-based observers are currently two effective disturbance estimation techniques. The former identifies and estimates unknown disturbances through online learning and parameter adjustment [
19,
20,
21,
22], while the latter reconstructs the forms of states and disturbances from system errors and control inputs by constructing appropriate differential equations [
23]. In [
24], a recurrent neural network (RNN) controller, which was trained online using a dynamic back-propagation algorithm, was proposed to control an ultrasonic motor drive. In addition, a new RNN structure and a TSMC strategy with a new recurrent neural network were proposed in [
25,
26], improving control accuracy and robustness. In [
27], a novel Hermite neural network-based second-order sliding-mode (HNN-SOSM) controller is proposed for the synchronous reluctance motor drive system. A self-constructing fuzzy neural network and self-evolving Chebyshev fuzzy neural network are proposed for active power filter in [
28,
29]. However, although NN has strong learning and estimation abilities and can handle unknown and uncertain problems, its computational complexity is high and its parameter learning strategy is difficult to design, which hinders its wide application.
On the other hand, since the observer was first proposed by Luenberger [
30], there has been a growing body of work on the estimation of states and disturbances [
31,
32,
33]. A high-gain extended state observer was used to estimate the system state and disturbance in [
34] for electro-hydraulic systems. However, high-gain observers are sensitive to noise, and a high gain can easily cause the system to oscillate or diverge. Recently, active disturbance rejection control (ADRC) has been widely studied and used because of its inherent disturbance rejection and its independence from an accurate model [
23,
35]. A complete ADRC scheme has three parts: a tracking differentiator, a nonlinear state error feedback control law, and an extended state observer (ESO). The ESO is the core disturbance rejection module and comes in nonlinear and linear forms, called nonlinear ESO (NESO) and linear ESO (LESO), respectively. In [
36], an ADRC-based controller was proposed for a magnetic rodless pneumatic cylinder, where a NESO was used to estimate the nonlinear dynamics, and the self-stabilizing region theory was used to prove the NESO’s convergence. In [
37], a convergence theory with explicit error estimates was provided, and the convergence of an NESO consisting of linear and fractional power functions was demonstrated. Although the NESO has been extensively studied and effectively applied, its complex structure and difficult parameter tuning hinder engineering practice and wider application. The structure and design of the LESO were first proposed by Gao in [38]. Compared with the NESO, the LESO has a simple structure, and the bandwidth parameterization method greatly simplifies its parameter tuning. Since then, LESO-based ADRC strategies have been widely used in industrial practice, such as air–fuel ratio control [39] and nonlinear servomechanisms [40]. In [41], the parameter tuning problem of the LESO was studied, showing that the widely used bandwidth tuning method of linear ADRC (LADRC) is equivalent to tuning the two time constants of the setpoint filter and the robust filter in internal model control (IMC). However, both the bandwidth tuning method and the IMC-based tuning method only partly simplify the tuning process by reducing the tuning dimension; they still require careful manual adjustment of parameters such as the observation bandwidth or the time constants. Furthermore, the relationship between the disturbance estimation error and the observation bandwidth is given explicitly in [42]. Therefore, to address the difficulty of parameter design, an interval type-2 fuzzy neural network (IT2FNN) is designed in this paper, based on this error relation and the bandwidth tuning method, to learn the observer bandwidth using the gradient descent (GD) method.
Therefore, in this paper, an LESO is used to approximate the total disturbance of a class of uncertain nonlinear systems and to compensate for it actively. To handle the inevitable observation error, ASMC is used to guarantee the stability of the uncertain system; because it works together with the observer, an excessive control gain is avoided and chattering is weakened. To address LESO parameter design, an IT2FNN trained by the GD method is proposed, for the first time, to estimate the observer bandwidth within the bandwidth parameterization method and to minimize the estimation error. The proposed controller combines LESO, IT2FNN, and ASMC. Compared with existing state-of-the-art research, the main contributions of the proposed strategy include:
- (1) The proposed control strategy is designed for a class of uncertain systems and does not depend on an accurate mathematical model; only the model order needs to be known to design an efficient controller. The simulation results show that the proposed controller has good versatility and practicability.
- (2) Compared with existing work, the improved LESO does not require manual selection of the observation coefficients. The IT2FNN, trained by the GD method, automatically learns the observer bandwidth, and the observation coefficients then follow from the bandwidth method so as to minimize the estimation error. Compared with the NESO, the improved LESO loses some estimation accuracy but greatly simplifies observer design and parameter tuning, which favors practical application and extension.
- (3) An IT2FNN trained by the GD method is proposed for the first time to estimate the observer bandwidth of the LESO. Through the designed learning process, the network adaptively learns parameter values that minimize the estimation error. Compared with reinforcement learning methods, the proposed strategy is more practical and has low computational complexity.
- (4) Combining ASMC with the improved LESO yields a coordinated two-part control process. The LESO acts first, actively compensating for the unknown uncertainty of the system, while the ASMC, as the main controller, compensates for the residual disturbance more slowly. This division of roles makes the control process clearer and easier to interpret. The proposed controller reduces output chattering while reducing the error and maintaining good overall performance.
This paper is organized as follows. In Section 2, a general model for a class of uncertain nonlinear systems is introduced under the ADRC framework. In Section 3, a new ASMC with an improved LESO is proposed. Simulation verifications are given in Section 4. Section 5 draws conclusions.
3. Proposed Control System
The block diagram of the proposed control strategy for a class of uncertain nonlinear systems is given in Figure 1. As the figure shows, the controller has three components: IT2FNN, LESO, and ASMC. The IT2FNN tunes the LESO by learning to estimate the observer bandwidth online, which makes the observer parameters self-tuning. The LESO estimates the total disturbance of the nonlinear system and provides feedforward compensation of the uncertainty. The ASMC offsets the remaining disturbance estimation error. The control goal is to design a controller such that the system state tracks the reference signal asymptotically.
3.1. Design of Linear Extended State Observer
The LESO is obtained by simplifying the NESO. The nonlinear feedback function in the NESO is complicated to construct, and its parameters are difficult to tune. Although linear feedback in the LESO performs slightly worse in theory, sacrificing some estimation performance is worthwhile in exchange for a simpler and more practical design. This section therefore designs the LESO for the uncertain nonlinear system after extension; the design process is as follows.
where the observer states are the estimates of the system states and of the extended total disturbance, and the remaining coefficients are the observer gains.
Remark 2. Because the unmodeled dynamics described in Remark 1 are regarded as part of the total disturbance, they do not appear in the observer equation, Equation (5). It is worth mentioning that if part of the system dynamics is accurately known, it can be added directly to the observer equation, which further improves the estimation accuracy of the observer.
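For orientation, an LESO for an $n$th-order plant extended by one state is commonly written in the form below. The symbols ($\hat{x}_i$ for the estimates, $\beta_i$ for the observer gains, $b_0$ for the nominal input gain, $y = x_1$ for the measured output) are notation assumed here and may differ from that of Equation (5).

```latex
\begin{aligned}
\dot{\hat{x}}_1 &= \hat{x}_2 + \beta_1\,(y - \hat{x}_1),\\
&\;\;\vdots\\
\dot{\hat{x}}_n &= \hat{x}_{n+1} + \beta_n\,(y - \hat{x}_1) + b_0 u,\\
\dot{\hat{x}}_{n+1} &= \beta_{n+1}\,(y - \hat{x}_1).
\end{aligned}
```

In this form $\hat{x}_{n+1}$ estimates the total disturbance, and any accurately known part of the dynamics would be added to the $n$th equation.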
The observation error is defined as
Subtracting the observer equation from the system equation gives the observation error equation as follows
The observation error equation can be rewritten in vector form as follows
where
It can be seen that, provided the roots of the characteristic polynomial of the error-system matrix all lie in the open left half-plane and the forcing term of the error equation is bounded, the LESO is bounded-input bounded-output (BIBO) stable.
Therefore, the observer gains of the LESO can be obtained by pole placement. Gao [38] proposed a bandwidth parameterization method that selects the observer gains by placing all poles of the characteristic polynomial at $-\omega_o$, where $\omega_o$ is the observation bandwidth. The specific gain values are then obtained by solving the following equation:
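As an illustration of bandwidth parameterization, the sketch below computes the gains that place all $n+1$ observer poles at the negative of the bandwidth and runs a forward-Euler LESO on a toy double integrator with a constant disturbance. The function names, the toy plant, the fixed bandwidth of 20, and all numeric values are assumptions made for this example; it does not reproduce the paper's simulation or its learned bandwidth.

```python
from math import comb


def leso_gains(n, omega_o):
    """Gains beta_1..beta_{n+1} that place all n+1 observer poles at -omega_o."""
    # Matching s^(n+1) + beta_1 s^n + ... + beta_{n+1} with (s + omega_o)^(n+1)
    # gives beta_i = C(n+1, i) * omega_o^i.
    return [comb(n + 1, i) * omega_o ** i for i in range(1, n + 2)]


def leso_step(z, y, u, b0, betas, dt):
    """One forward-Euler step of a LESO. z holds n state estimates followed by
    the disturbance estimate, y is the measured output, u the control input."""
    n = len(z) - 1
    e = y - z[0]
    dz = [z[i + 1] + betas[i] * e for i in range(n)]
    dz[n - 1] += b0 * u          # the input enters the n-th equation
    dz.append(betas[n] * e)      # extended state: disturbance estimate
    return [zi + dt * dzi for zi, dzi in zip(z, dz)]


def demo(d=2.0, omega_o=20.0, dt=1e-3, steps=2000):
    """Estimate a constant disturbance d acting on a double integrator."""
    betas = leso_gains(2, omega_o)
    x1, x2 = 0.5, 0.0            # true plant state (hypothetical initial values)
    z = [0.0, 0.0, 0.0]
    for _ in range(steps):
        u = 0.0                  # open loop: only the observer is exercised
        z = leso_step(z, x1, u, 1.0, betas, dt)
        x1, x2 = x1 + dt * x2, x2 + dt * (d + 1.0 * u)
    return z[2]                  # estimate of the total disturbance
```

Because every gain is a function of the single bandwidth parameter, tuning the observer reduces to choosing that one value, which is the quantity the IT2FNN is asked to learn.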
3.2. Structure of Interval Type-2 Fuzzy Neural Network
Although introducing the observation bandwidth reduces LESO tuning to a single parameter, choosing that bandwidth remains difficult, and in many applications a reasonable value cannot be set in advance. A larger bandwidth gives faster observation, but an excessively large one makes the observer too sensitive to noise and can cause divergence, while an excessively small one causes serious phase lag. It is therefore desirable to use adaptive techniques, in particular a neural network with self-learning ability, to obtain the observation bandwidth of an uncertain nonlinear system in real time. This section first introduces the structure and characteristics of the IT2FNN adopted in this paper; the learning strategy is given in the next section.
As shown in Figure 2, the IT2FNN has a five-layer, multiple-input single-output (MISO) structure, which can be divided into an antecedent part and a consequent part. The computation in each layer is given as follows. For a more detailed description, please refer to [37].
(1) Antecedent part: The antecedent part contains the input layer and the membership layer. Its role is to receive the input signals and pass them through type-2 fuzzy membership functions, a nonlinear mapping that improves the distribution of the inputs and thereby benefits information extraction and learning in the network. This part has few parameters, mainly the base widths and center vectors of the type-2 fuzzy membership functions, and these are easily selected a priori from the statistical features of the input signals. Therefore, rather than learning them by gradient optimization, the parameters of the antecedent part are set directly from expert experience. This reduces the computational load of the network and ensures that the antecedent part extracts effective and well-differentiated input information.
Layer 1: The input and output expressions of the input layer are as follows
where
and
are the input and output of
node, respectively.
Layer 2: The input and output expressions of the membership layer are as follows
where
is a center vector;
and
are the upper and lower base widths.
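A minimal sketch of one membership node is given below. It assumes a Gaussian primary membership function with a fixed center and an uncertain width, which is one common interval type-2 construction; the exact expression used in the membership layer may differ, and the function name and scaling are hypothetical.

```python
import math


def it2_gaussian(x, center, width_upper, width_lower):
    """Return (upper, lower) membership grades of an interval type-2 Gaussian
    set with a fixed center and an uncertain width.

    Assumes width_upper >= width_lower > 0, so that upper >= lower everywhere.
    """
    upper = math.exp(-((x - center) / width_upper) ** 2)
    lower = math.exp(-((x - center) / width_lower) ** 2)
    return upper, lower
```

The gap between the two grades is the footprint of uncertainty: it vanishes at the center and widens as the input moves away from it.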
(2) Consequent part: The consequent part includes the rule layer, the type-reduction layer, and the output layer. It performs inference and calculation on the information extracted by the antecedent part and outputs the result.
Layer 3: The input and output expressions of the rule layer are as follows
where
and
are the upper bound and lower bound.
Layer 4: The input and output expressions of the type reduction layer are as follows
where
and
are the upper and lower output, respectively.
Layer 5: The input and output expressions of the output layer are as follows
where
is a weighting factor;
is the final output of the IT2FNN.
The IT2FNN is used to approximate the observation bandwidth. The main idea is to use the system error and its derivative as the inputs of the IT2FNN and to adjust the network parameters by a GD-based optimization strategy so that the network output estimates the observation bandwidth. The detailed derivation of the parameter learning strategy is presented in the next section.
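The five layers can be sketched end to end as follows. Several choices here are assumptions rather than the paper's definitions: Gaussian memberships with uncertain width, one rule per fuzzy-set index formed by a product over the inputs, type reduction as two normalized weighted sums, and a final blend with a weighting factor `q`. All names are hypothetical.

```python
import math


def it2fnn_forward(inputs, centers, widths_upper, widths_lower,
                   w_upper, w_lower, q=0.5):
    """Forward pass of a small interval type-2 fuzzy neural network.

    inputs: list of n input values.
    centers, widths_upper, widths_lower: [n][m] tables, one row per input and
        one column per fuzzy set.
    w_upper, w_lower: m consequent weights each.
    q: weighting factor that blends the upper and lower outputs.
    """
    m = len(w_upper)
    # Layers 1-2: pass inputs through interval type-2 Gaussian memberships.
    # Layer 3: rule j multiplies the j-th membership grade of every input.
    f_up, f_lo = [1.0] * m, [1.0] * m
    for i, x in enumerate(inputs):
        for j in range(m):
            d2 = (x - centers[i][j]) ** 2
            f_up[j] *= math.exp(-d2 / widths_upper[i][j] ** 2)
            f_lo[j] *= math.exp(-d2 / widths_lower[i][j] ** 2)
    # Layer 4: type reduction as two normalized weighted sums.
    y_up = sum(w * f for w, f in zip(w_upper, f_up)) / sum(f_up)
    y_lo = sum(w * f for w, f in zip(w_lower, f_lo)) / sum(f_lo)
    # Layer 5: blend the two bounds with the weighting factor q.
    return q * y_up + (1.0 - q) * y_lo
```

With the tracking error and its derivative as `inputs`, the returned scalar would play the role of the bandwidth estimate. In practice the inputs should be scaled so that the firing strengths do not underflow to zero, which would make the normalization in layer 4 ill-defined.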
3.3. Parameter Learning of IT2FNN
First, define the loss function
of IT2FNN to estimate bandwidth as
Then, define
as the gradient of the loss function with respect to the network output of the IT2FNN, which can be calculated as
From [42], the relationship between the observation error and the bandwidth is as follows
where
and
are positive constants computable by order, and
is also a positive constant, which satisfies
.
Therefore, the higher-order small terms can be neglected, and the relationship between the bandwidth and the estimation error simplifies to
where
is a large positive constant.
Then, the following can be deduced
Substituting Equation (28) into Equation (25) gives
Then, the update laws of the weights can be calculated by back-propagation using the gradient method and the chain rule as follows.
The update law of the weight
is calculated as
The update law of the weight
is calculated as
The update law of the weight
is calculated as
where
,
, and
are the learning rate parameters of the weights
,
, and
, respectively.
Finally, these weights are updated by the following equations.
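The general pattern of such an update can be sketched as follows. The example assumes that one branch of the network output has the form q times a normalized weighted sum of rule firing strengths, and that `delta` denotes the gradient of the loss with respect to the network output; both the output form and the function names are assumptions for illustration, not the paper's update equations.

```python
def consequent_grads(delta, firing, q):
    """Chain rule for y = q * sum(w_j * f_j) / sum(f_j).

    dE/dw_j = (dE/dy) * (dy/dw_j) = delta * q * f_j / sum(f).
    """
    total = sum(firing)
    return [delta * q * f / total for f in firing]


def gd_update(weights, grads, learning_rate):
    """One gradient-descent step: w <- w - eta * dE/dw."""
    return [w - learning_rate * g for w, g in zip(weights, grads)]
```

For example, `gd_update(w, consequent_grads(delta, firing, 0.5), 0.1)` moves each weight against its gradient in proportion to how strongly its rule fired.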
3.4. Controller Design and Stability Analysis
In this section, an ASMC based on the improved LESO is designed for the uncertain nonlinear system represented by Equation (1).
First, define the systematic error vector as
The sliding mode variable is designed as
where
is chosen so that the polynomial is Hurwitz, that is, all of its zeros lie in the open left half of the complex plane.
The derivative of the sliding mode variable is
Setting the derivative of the sliding mode variable to zero, the equivalent control law is obtained as
The total disturbance is unknown, so its LESO estimate is used instead, and the observation error is compensated by adding an adaptive switching term. The actual control law is then obtained as follows
Theorem 1. Consider a class of single-input single-output nonlinear uncertain systems represented by Equation (1) with partially unknown parameters. If Assumptions 1 and 2 hold and the following conditions are met, the system is guaranteed to be asymptotically stable.
- (1)
The proposed controller is designed as Equation (43).
- (2)
The parameter adaptation laws of the IT2FNN are designed as Equations (36)–(38).
- (3)
The adaptive law of ASMC is shown as
where
is the learning rate, which is a positive constant.
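For a second-order plant, one evaluation of such a controller might look like the sketch below. It assumes the error is defined as state minus reference, a sliding variable s = c*e + de/dt, a plant of the form x2' = f + b0*u, and an adaptive law in which the switching gain grows at a rate proportional to |s|. These forms are common in adaptive sliding mode control but are assumptions here; they may differ from Equation (43) and from the adaptive law of the theorem.

```python
def sign(v):
    """Signum function returning -1, 0, or 1."""
    return (v > 0) - (v < 0)


def asmc_leso_step(x, ref, z3, k_hat, c, b0, gamma, dt):
    """One control update for a second-order plant.

    x = (x1, x2): measured state; ref = (r, r_dot, r_ddot): reference and its
    derivatives; z3: LESO estimate of the total disturbance;
    k_hat: current adaptive switching gain.
    """
    e, e_dot = x[0] - ref[0], x[1] - ref[1]
    s = c * e + e_dot                         # sliding variable, c > 0
    # Equivalent control with the disturbance replaced by its estimate,
    # plus an adaptive switching term that covers the estimation error.
    u = (ref[2] - c * e_dot - z3 - k_hat * sign(s)) / b0
    k_hat_next = k_hat + dt * gamma * abs(s)  # assumed law: dK/dt = gamma * |s|
    return u, s, k_hat_next
```

Substituting `u` into the assumed plant gives a sliding-variable derivative of (f - z3) - k_hat * sign(s), so the switching gain only has to dominate the observer's residual error rather than the full disturbance.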
Proof. The Lyapunov function candidate is designed as
Then, define the estimation errors of the adaptive switching term as
The derivative of the Lyapunov function candidate is
Assuming
has upper bound as
, then we can get
Therefore, the conditions for asymptotic stability of the system are defined as
If the condition of Equation (49) is satisfied, then
. Integrating
with respect to time, we can find
Since is bounded and is nonincreasing, it is concluded that is bounded. According to Barbalat's lemma, one can deduce that
which means that the tracking error and the sliding variable converge to zero asymptotically. □
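The structure of this kind of argument can be illustrated on a reduced case. The sketch below assumes, beyond what the text states, that after substituting the control law the sliding dynamics are $\dot s = \tilde f - \hat K\,\mathrm{sgn}(s)$, where $\tilde f$ is the LESO estimation error with $|\tilde f| \le K^*$, and that the adaptive law is $\dot{\hat K} = \gamma |s|$. The symbols and the Lyapunov function are illustrative and need not match those of the proof above.

```latex
\begin{aligned}
V &= \tfrac{1}{2}s^2 + \tfrac{1}{2\gamma}\,(\hat K - K^*)^2,\\
\dot V &= s\dot s + \tfrac{1}{\gamma}(\hat K - K^*)\,\dot{\hat K}
        = s\tilde f - \hat K |s| + (\hat K - K^*)\,|s|\\
       &= s\tilde f - K^*|s| \;\le\; -\,(K^* - |\tilde f|)\,|s| \;\le\; 0 .
\end{aligned}
```

Under the stricter assumption $|\tilde f| \le K^* - \epsilon$ with $\epsilon > 0$, integrating the last inequality bounds $\int_0^t |s|\,d\tau$ by $V(0)/\epsilon$, which is the kind of bounded integral to which Barbalat's lemma is applied.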
4. Algorithm Verification
To verify the practicability and generality of the proposed control strategy, simulations are carried out for an active power filter system and an inverted pendulum system. The simulations use MATLAB/Simulink R2019b on a 64-bit computer with an i7-6500U CPU (2.5 GHz).
- A. Example 1: Single-Phase Active Power Filter
This part takes the single-phase parallel (shunt) APF as the control object and applies the proposed ASMC-LESO strategy to its first-order dynamic model, obtained by the averaging method, to realize current control. The circuit model of the APF is given in Figure 3.
With the development of power electronics, a large number of electronic devices, such as air conditioners and microwave ovens, have entered the home. These nonlinear loads inject substantial harmonic pollution into the power system: the grid current is distorted, which degrades power quality and causes safety problems. The APF is an effective means of harmonic mitigation. Its working principle is as follows: first, the harmonic components introduced on the load side are extracted by a fast single-phase harmonic detection algorithm; then, an IGBT-based inverter generates a compensation current equal in magnitude and opposite in sign to the harmonic components, thereby compensating for them actively. The control task is therefore to design a controller that, given the calculated reference current, outputs an appropriate duty cycle to switch the IGBTs so that the compensation current tracks the reference with high precision. In addition, the IGBT switching signal is generated by triangular carrier modulation, and the switching frequency of IGBT is .
According to the derivation of the circuit model, the first-order model of APF can be obtained as
where
is the external disturbance. For the detailed modeling process, please refer to [
43].
Therefore, according to the results in Section 3, the LESO and control law for the APF system are as follows
where
is the observed value of system state
,
is the extended total disturbance,
is the observer gain,
,
is the reference current, and
where
.
The parameters used in the system simulation are given in detail in Table 1. In the simulation, the APF main circuit is connected for control at 0 s, and a nonlinear load is added at 0.6 s. Before the APF is connected, the power supply current contains severe harmonic distortion caused by the nonlinear load on the load side.
Figure 4 shows the waveforms of the load current, the compensation current, and the power supply current after the APF is connected. The first curve shows that the load-side current is a periodic, severely distorted non-sinusoidal waveform. The third curve shows that the power supply current is driven from its initial distorted shape to a nearly ideal sinusoid, so the harmonic compensation task is completed in a short time. This is because, once the APF is connected, it generates a compensation current opposite to the load harmonics. As shown by the second curve in Figure 4, the compensation current reaches steady state after a short period of adjustment. In more detail, the tracking of the reference current by the compensation current and the corresponding error are given in Figure 5. The compensation current tracks the reference current quickly and without overshoot, and the error has essentially converged after 0.02 s.
To compare the performance of the APF quantitatively, Figure 6 and Figure 7 show the power supply current spectrum when the APF has just been connected (0 s) and after it has entered steady state (0.2 s), respectively. When the controller is first connected at 0 s, the power supply current contains many low-order and some high-order harmonics, and the total harmonic distortion (THD) reaches 26.95%, far above the limit of 5% required by international standards. With the proposed control method, the system has entered steady state by 0.2 s; Figure 6 and Figure 7 show that the harmonic components are greatly reduced and the THD is only 2.10%, which means that good harmonic suppression is achieved.
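For readers who want to reproduce a THD figure from a sampled current waveform, the sketch below evaluates the usual definition, the root-sum-square of the harmonic amplitudes divided by the fundamental amplitude, with a direct discrete Fourier sum. It assumes the record spans exactly one fundamental period with uniform sampling; the function names and the harmonic cutoff of 40 are choices made for this example and are not taken from the paper, whose spectra come from its simulation environment.

```python
import math


def harmonic_amplitude(samples, k):
    """Amplitude of the component that completes k cycles over the record."""
    n = len(samples)
    re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(samples))
    im = sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(samples))
    return 2.0 * math.hypot(re, im) / n


def thd(samples, max_harmonic=40):
    """THD = sqrt(sum over h >= 2 of A_h^2) / A_1.

    The record must span exactly one fundamental period, and max_harmonic
    must stay below half the number of samples.
    """
    a1 = harmonic_amplitude(samples, 1)
    rest = sum(harmonic_amplitude(samples, h) ** 2
               for h in range(2, max_harmonic + 1))
    return math.sqrt(rest) / a1
```

As a check of the definition, a unit fundamental with a 3% third harmonic and a 4% fifth harmonic has a THD of 5%, since the square root of 0.03² + 0.04² is 0.05.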
In addition, the IT2FNN estimate of the observer bandwidth is given in Figure 8. After a short period of adaptive learning and correction, the output of the neural network converges to about 3800, realizing real-time estimation of the observer bandwidth. On this basis, the state estimate of the LESO is given in Figure 9: once the bandwidth has converged, the LESO tracks the system state accurately, and the red and blue curves almost completely coincide. The estimate of the total disturbance is shown in the lower half of Figure 9; it is a periodic, sine-like curve ranging from −6000 to 0, which is basically consistent with the nominal model. In general, the total disturbance estimates are also relatively accurate. The adaptation of the switching gain is shown in Figure 10. The gain converges to about 0.24 in a very short time; this small value does not cause serious chattering and indirectly shows that the LESO estimates the total disturbance well.
Remark 3. The total disturbance includes the system dynamics and the external disturbance.
It can be seen from Equation (50) that the APF system dynamics are theoretically an ideal sinusoid. However, because the resistance and inductance values are affected by the environment, and the power supply voltage and DC-side voltage also fluctuate, the estimated total disturbance, which additionally contains external disturbances, fluctuates more than that of the nominal model. The total disturbance curve estimated by the system is shown in Figure 9, which shows that the total disturbance is indeed an irregular nonlinear function.
Finally, to show the superiority of the proposed controller, it is compared with the ASMC method. Figure 11 shows the current tracking and error curves for the two methods. Under ASMC alone, there is obvious overshoot and loss of control in the early stage, because there is no LESO to observe and compensate for the total disturbance of the uncertain system. Moreover, even in steady state, although the adaptive gain keeps increasing to offset the uncertainty, a large periodic steady-state error remains. This shows that a high-gain switching term relying solely on adaptive sliding mode control cannot completely offset the uncertainty caused by the unknown dynamics; some disturbance always remains uncompensated. The proposed ASMC-LESO method is therefore more robust and practical than ASMC. In addition, because the LESO actively compensates for the unknown disturbance in advance, the adaptive gain is in theory relatively small, which weakens chattering. This is verified in Figure 12, which compares the control output under the two methods. The variation amplitude and frequency of the control output of the proposed method are clearly smaller, which means that the chattering is smaller. More specifically, the variances of the control outputs under the ASMC-LESO and ASMC methods are 0.1539 and 1.6618, respectively, which differ by about an order of magnitude.
Further, the THD of ASMC-LESO and ASMC is compared in Table 2. At the start of control, the THD under the ASMC-LESO and ASMC methods is 26.95% and 67.01%, respectively; the large difference arises because ASMC suffers from slow response, overshoot, and loss of control in the initial stage. In steady state, the THD of ASMC-LESO is 2.10%, which is 1.23 percentage points lower than the 3.33% of ASMC. When a nonlinear load is added on the user side at 0.6 s, the THD of the two methods rises abruptly to 8.43% and 9.10%, respectively. After a short adjustment, the system enters a new steady state, in which the THD of the two methods is 1.26% and 2.91%, respectively. Overall, in every operating state, the proposed ASMC-LESO method has a smaller THD and better harmonic compensation capability than the comparison method.
- B. Example 2: Single-Stage Inverted Pendulum System
The second-order dynamic model of the single-stage inverted pendulum is
where
is the acceleration of gravity, is the mass of the car, is the mass of the pendulum, is half of the pendulum length, and are the swing angle and swing speed, the initial state of the system is , and the expected trajectory is .
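One widely used textbook form of the cart-mounted inverted pendulum dynamics, expressed with the same physical quantities (gravity, cart mass, pendulum mass, half pendulum length), is sketched below. Whether the paper uses exactly this form is an assumption, and the default parameter values are placeholders, not the values used in the simulation.

```python
import math


def pendulum_accel(theta, omega, u, g=9.8, m_cart=1.0, m_pend=0.1, half_len=0.5):
    """Angular acceleration of a cart-mounted inverted pendulum.

    theta: swing angle (rad); omega: swing speed (rad/s); u: control force.
    Default parameter values are illustrative placeholders.
    """
    total = m_cart + m_pend
    denom = half_len * (4.0 / 3.0 - m_pend * math.cos(theta) ** 2 / total)
    # Drift term f(x): gravity and centripetal coupling.
    f = (g * math.sin(theta)
         - m_pend * half_len * omega ** 2 * math.cos(theta) * math.sin(theta) / total) / denom
    # Input gain b(x): how strongly the control force accelerates the angle.
    b = (math.cos(theta) / total) / denom
    return f + b * u
```

In an ADRC setting, the drift term and the mismatch between the true input gain and the nominal gain used by the observer would both be lumped into the total disturbance that the LESO estimates.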
Therefore, according to the results in
Section 3, it can be deduced that the LESO and control law for the inverted pendulum system are as follows
where
is the reference angle.
The angle tracking curve and tracking error curve are given in Figure 13. Within 1 s, the swing angle of the inverted pendulum accurately tracks the reference angle, and the error converges to 0. Moreover, Figure 14 shows the observer bandwidth output by the IT2FNN, which converges to 99.9 within 1 s.
The LESO estimates of the two system states and the total disturbance are compared with the true values in Figure 15. The red curves are the true values, and the blue curves are the observed values. The two almost completely overlap after 0.1 s, which means that the LESO estimates the system states accurately. Further,
Figure 16 shows the observation error curves of the three states. It can be seen, more intuitively, that the observation errors of the three states are very small, and the observation error magnitudes of
,
, and
reach
,
, and
, respectively. Finally, the adaptive curve for the gain of the switching term is given in
Figure 17, which also converges to around 5.5 within 1 s.