3.2. Softness Factor Adaptive Adjusting Model Based on Online Obtaining
The softness factor is a key parameter for DMC MPC; it plays plays an important role in balancing the degree of robustness and rapidity for the DMC MPC tracking control system. If softness factor
is chosen as a larger value, the system will have slower response speed and stronger robustness; by contrast, if softness factor
is chosen as a smaller value, the system will have faster response speed and worse robustness [
36]. Thus, both response speed and robustness must be taken into account for softness factor
setting.
Considering the trajectory characteristic and tracking control condition for automatic train operation, the softness factor adaptive adjusting model based on online obtaining is established as follows,
where
represents the final calculated real-time softness factor;
represents the real-time softness factor calculated based on the trajectory characteristic of the present position;
represents the real-time softness factor calculated based on tracking control condition of the present system output
;
and
represent the fusion weights of
and
, respectively; and
.
The whole tracking control curve must be divided into several different types of subintervals by position according to the trajectory characteristic and line conditions. The specific types of subintervals are described as follows.
Type 1: The vibrating area nearby inflection point of tracking control curve.
In this area, there is the strong velocity fluctuation in the velocity trajectory. Thus, aiming at improving the system robustness as much as possible, softness factor should be an appropriate larger value at the cost of reduce acceptable system rapidity.
Type 2: The smooth area of tracking control curve.
In this area, there is no obvious velocity fluctuation in the velocity trajectory. Thus, aiming at improving the system rapidity as much as possible, softness factor should be chosen as an appropriate smaller value at the cost of reduce acceptable system robustness.
Type 3: The connected area in the middle of smooth area and the vibrating area of the tracking control curve.
In this area, the system rapidity and rapidity are taken into account for softness factor setting. Thus, softness factor should be choose a appropriate intermediate value.
In addition, although in the same type of subintervals, the softening factor
almost varies because of the different intensity degrees of velocity fluctuation. The specific calculation formula for softness factor
based on trajectory characteristic of the present position is described as follows,
where
represents the subinterval index
;
represents the number of subintervals;
represents the starting position of the
-th subinterval;
represents the terminal position of tracking control curve, it is a target parking position;
represents the reference value of soften factor in the
-th subinterval;
,
;
and
represent the connected length, in the connected area; the softness factor
is reduced or increased linearly and smoothly, so as to avoid the instability of tracking control system.
Aiming at solving this control problem with fuzzy characteristic, an fuzzy adaptive adjusting method for online obtaining softness factor
is applied. First of all, the satisfaction degree of control is defined, so as to the automatic train operation tracking control problem can be transformed into an optimization decision-making problem by fuzzy reasoning; then, the corresponding real-time parameters of the controller are adjusted online to meet the requirements of the system control quality, so as to achieve the purpose of system optimization control. The specific calculation formula for fuzzy satisfaction degree
of system output
is as follows,
where
and
represent the blur width, which can indicate the requirement of designer, if
, the requirements for the control system are strict, and the automatic train operation tracking control is not so, and this represents a combination of the practical situation;
and
represent the maximum and minimum value of design expectation, respectively, if
, it will be shown as trigonometric membership function; otherwise, it will be shown as trapezoid membership function. The corresponding diagram for fuzzy satisfaction degree calculation
of system output
is shown in
Figure 3.
The error between the output value and the reference target value of system (i.e., fuzzy satisfaction degree
of system output
) should also be considered. If the fuzzy satisfaction degree
is larger, it can indicate that the error between the output value and the reference target value of system is smaller, at this time, there is a small overshoot of the system and softness factor
so that an appropriate lager value needs to chosen to increase the system rapidity; by contrast, there is an obvious overshoot of the system and softness factor
so that an appropriate small value needs to be chosen to reduce the system rapidity to ensure system robustness [
36]. According to the influence of softness factor
for the system dynamic response, the specific exponential calculation formula for softness factor
by fuzzy satisfaction degree
of system output
is as follows,
where
represents the maximum value of softness factor
;
b represents the gain coefficient; it determines the shape of the softening factor
function curve.
The corresponding diagram for softness factor
of the fuzzy satisfaction degree calculation
, and softness factor
of system output
are shown in
Figure 4 and
Figure 5.
3.3. Improved Whale Optimization Algorithm for Softness Factor Adaptive Adjusting Parameters Optimization
Optimization algorithms are used to obtain a set of adjustable parameters for the satisfactory tracking control effect in actual automatic train operation scenarios. The specific softness factor adaptive adjusting parameters optimization model is as follows,
where
x represents the target vector;
represents feasible solution space of
x;
represents the
ig-th equality or inequality constraint for automatic train operation tracking control problem,
represents the number of equalities and inequality constraints; the five adjustable parameters (
,
,
,
,
b) are decision variables.
Objective decomposition is an effective method to solve the multi-objective optimization problems. The Tchebycheff decomposition method is selected in this paper among many objective decomposition methods [
37]. The specific calculation formula for the aggregate function value of the Tchebycheff decomposition method is as follows,
where
represents the reference point,
,
, which is the optimal solution of each objective function at present;
is the weight of the
i th objective,
.
Whale optimization algorithm with strong global optimization ability is chosen in this paper. Whale optimization algorithm (WOA) is a new metaheuristic optimization algorithm learned from whale predatory behavior. There are two operators (position update and prey searching) in the computation process of the whale optimization algorithm [
38]. The specific calculation formula for the position update of the basic whale optimization algorithm is as follows,
where
represents the optimal position vector obtained by the current optimization;
represents the distance between humpback whales and their prey;
p represents the behavioral selection probability of humpback whales,
;
represents the probability of surrounding prey of humpback whales,
; the probability of spiral hunting is
;
B represents a constant, which is used to define the shape of spiral;
l represents the random number in
;
t is the current iteration number;
is the maximum number of iterations;
a represents convergence factor;
A and
C represent the correlation coefficients respectively;
and
are random numbers,
,
.
The specific calculation formulas for convergence factor
a, correlation coefficients
A, and
C is as follows,
After the position updated, prey searching is implemented by means of random individual positions. The specific calculation formula for prey searching of the basic whale optimization algorithm is as follows,
where
is the position vector of randomly selected whales. If
, a search leader individual is randomly selected, and the position of other whales is updated based on the whale position of the leader individual, so as to guide the whales to leave the prey and find a more suitable prey to enhance the global search ability of the algorithm.
The relatively fixed method of linear decline of convergence factor
a will reduce the population diversity maintenance ability, so that the algorithm can easy to fall into local convergence in the late iteration. Aiming at solving this problem, the strategy of cosine decline combined with chaotic random method for convergence factor nonlinear decline is proposed in this paper. The specific calculation formula is as follows,
where
represents the behavioral selection probability of the convergence factor
a,
;
represents the probability of cosine decline of the convergence factor
a,
; and the probability of chaotic random is
.
Compared with the linear decline strategy, the decline rate of the convergence factor is significantly different in the whole iteration cycle caused by the nonlinear decline strategy with a certain degree of chaos uncertainty for convergence factor
a, and it is helpful for maintaining the population diversity, thus the algorithm global convergence performance will be improved [
39].
According to the Tchebycheff decomposition method, the aggregate function value is the fitness index for the multi-objective optimization algorithm. After the computation process of each iteration, the newly generated non-dominated solutions of the current population are put into the elite archive. The archive must kept within a certain size by some elite individuals with small differences from other elite individuals, so as to avoid computational burden of the algorithm. According to the updating rules of the whale optimization algorithm, the reference point plays an important role in guiding the direction of global convergence, and a certain degree of local convergence due to this fixed foraging behavior. Meanwhile, the selector, crossover, and mutation of the genetic algorithm can generate a large number of new solutions with great differences for the whale optimization algorithm based on evolutionary processes, so as to further improve the global convergence performance due to more powerful population diversity maintenance ability.
The specific steps of improved whale optimization algorithm proposed in this paper are as follows.
Step 1: Initialization.
Initialize the whale population (the size is ), and the Tchebycheff aggregation function values of each whale individual are calculated.
Step 2: Iterative computations.
the is obtained;
, the reference point , and , , m represents the number of objectives, a uniformly distributed weight vector set is generated, and .
If the current iteration number is greater than 1, the weight
for solution
are need to be recalculated. According to the literature [
40], in the
t-th iteration, the specific calculation formula for weight
of the
k-th optimization index of the
i-th individual (solution)
in the population is as follows,
where
,
.
For any solution target of Pareto front, its weight vector is . Because the Pareto front is not easily available, it is replaced by the nearest solution target in ;
The strategy of cosine decline combined with chaotic random method is used to calculate convergence factor a;
The updating rules of the whale optimization algorithm are used to update each individual whale.
Step 3: The archive and genetic evolution mechanism.
The Pareto front of the current whale population is obtained, and it is used to expand the .
Some elite individuals with small differences from other elite individuals are deleted, until the size of is not exceed allowed limit archive size .
Three operators (selection, crossover, and mutation) of the genetic algorithm are applied in the whole whale population, so as to further improve the population diversity maintenance ability.
Step 4: Termination judging.
The hypervolume indicator is chosen as termination judging indicator. Using the hypervolume indicator of the dominated portion of the objective space as a measure for the quality of Pareto set approximations is appropriate and effective. The specific formula for hypervolume indicator for objective vector set
A with reference point (0,0,...,0) is as follows.
where
A is any objective vector set in objective space
; if there is an objective vector
a, Pareto superior
z and
a belongs to
A,
; otherwise,
[
41].
Thus, the hypervolume indicator indicates the dominated portion in objective space
. Generally, Hypervolume as Klee’s Measure Problem (HKMP) is an effective calculation method for hypervolume indicator [
42]. As only 3 objects
must be taken into account, the calculation method by using equivalent volume model for the volume of irregular objects can be used. The specific formulas for the hypervolume indicator for 3 objects by using equivalent volume model is as follows,
where
,
, and
are the split numbers for each normalization objective domain (0,1);
represents the central point of the
th cube of normalization objective space.
The schematic diagram of the hypervolume indicator for 3 objects by using equivalent volume model is shown in
Figure 6.
If the hypervolume indicator is reached beforehand, an unchanged number of hypervolume indicators will be resetted; otherwise, make .
If the maximum unchanged number of the hypervolume indicator is reached, the calculation will be terminated; otherwise, return Step 2.
The flowchart of improved whale optimization algorithm proposed in this paper is shown in
Figure 7.
Aiming at improving the global searching ability, the evolution law is must be considered in the computation process. The excellent individuals should have a small mutation probability, so that they can accumulate optimization results effectively, and the poor individuals should choose a large mutation probability, which can be fully eliminated, so as to enhance the capacity of exploration [
43]. The mutation probability calculation formula based on sigmoid function
is as follows,
where
is the mutation probability value;
is the maximum mutation probability;
is the shape factor for the sigmoid function of mutation probability;
is the demarcation point of the whale population;
is the normalized value of fitness function value
of whale individual
x in the whale population.
This mutation probability calculation method has certain fairness, and the whale individuals have appropriate mutation probability according to fitness function value, so as to prevent the population controlled by advantage individuals and persist evolution opportunity for disadvantaged individuals.
A multimodal crossover method is conducive to finding a more satisfactory optimal solution for the complex optimization issue [
44,
45]. The multimodal crossover combining popular blended crossover and unimodal normal distribution crossover is applied in this paper. The specific calculation formula for multimodal crossover is as follows,
where
is the solution after multimodal crossover operation;
and
are two parent solutions for blended crossover;
is the behavioral selection probability about multimodal crossover operation;
and
are the crossover probabilities for blended crossover and unimodal normal distribution crossover, respectively;
is the midpoint for
parent solutions for unimodal normal distribution crossover;
d is the differential vector;
is the
th orthogonal basis;
and
are the random numbers obey normal distribution
and
.