1. Introduction
Under the double pressure of energy crisis and environmental pollution, countries all over the world have developed a series of programs to alleviate this problem, and vigorously developing new energy vehicles is an important measure to alleviate environmental pressure and oil resource exhaustion [
1]. New energy vehicles include a variety of models such as battery electric vehicle (BEV), hybrid electric vehicle (HEV) and fuel cell vehicle (FCV). Although BEV can help alleviate environmental and energy problems, problems with battery technology such as long charging time, battery range and short battery life cause it to have great limitations [
2,
3]. HEV is a kind of new energy vehicle that reduces emissions, but it still needs to consume petroleum resources [
4,
5]. FCV has the advantages of long driving range, fast energy replenishment, zero pollution and wide raw material sources, and it is an important direction of future automotive development [
6,
7,
8].
However, due to the relatively weak output characteristics of fuel cells, an additional energy storage device is required to solve the problems of slow start, slow dynamic response of fuel cells and braking energy recovery in actual application [
9]. Due to different features of the power sources in the fuel cell composite energy source system, a reasonable and effective energy management strategy (EMS) is crucial in coordinating the distribution of power flow and improving fuel economy and system durability.
There have been many papers on EMS in the literature with example literature surveys being that of [
10,
11]. These papers can be separated into two categories, which are namely the rule-based EMS and optimization-based EMS [
12]. The rule-based EMS has advantages such as simple logic and strong adaptability to working conditions. According to the different forms of rules, it can be divided into deterministic rules and fuzzy rules. The EMS of deterministic rules controls the main energy sources (such as engines and fuel cell) in the best working conditions or high efficiency range according to experience. Li Q. et al., conducted regular analysis on the operating characteristics of the fuel cell system and the optimal working area of the battery so that the system could work in the high-efficiency area and achieve the best economy [
13]. The state machine control proposed by Mokrani Z. et al., [
14] and the operation mode control proposed by Garcia P. et al., [
15] were adopted as the rule-based strategies and are effective for extending the lifetime of fuel cells. The fuzzy rule-based EMS is proposed based on fuzzy controller [
16]. Zhou D. et al., proposed an EMS for online driving conditions by integrating three offline optimized fuzzy logic controller parameters [
17]. Shen D. et al., proposed the fuzzy control method based on robust model prediction to design the nonlinear control law to achieve the optimization goal under the uncertainty of power demand [
18].
Optimization-based EMS adopts an active optimization algorithm that can adaptively change the rules or criteria based on the input and outputs and/or the history of these parameters [
19]. Optimization-based strategies can be divided into real-time optimization strategies and global optimization strategies [
20,
21]. Real-time optimization strategies such as model predictive control (MPC) [
22] and the equivalent consumption minimum strategy (ECMS) [
23] have the advantage of high real-time performance, but only local optimum can be achieved. Tao J. et al., proposed an algorithmic framework combining a Q-learning and genetic algorithm for the power split between the fuel cell and supercapacitor of a vehicle, and simulation results show that the SOC of the supercapacitor can be sustained within the desired safe range, while reducing hydrogen consumption [
24]. Papers [
25,
26] are based on the Deep Reinforcement Learning optimizer to improve the driving conditions adaptability of the EMS. Song K. et al., established a novel fuel cell degradation model, which can obtain the efficiency under different states-of-health of the fuel cell. The EMS is adjusted based on the efficiency of the fuel cell to balance the degradation [
27]. Global optimization strategies focus on Dynamic Programming (DP), Pontryagin’s Minimal Principle (PMP) and heuristic algorithms [
28,
29,
30]. Because the global optimization strategy needs to predict the driving condition information and have a large amount of calculation, it is difficult to apply it to real time optimization. However, it can be used as the evaluation standard of other control strategies. Munoz P. et al., adopted DP as the optimization benchmark of the proposed energy management control method based on neural networks for fuel cell vehicles [
31]. Gim J. et al., extracted the allowable current of the fuel cell through the use of DP, then the modulation ratio of fuel cell system is solved by Particle Swarm Optimization algorithm based on the allowable current [
32]. Deng K. et al., introduced the online adaptation mechanism of the PMP’s co-state into the MPC structure, which shows promising fuel economy and battery charge sustaining [
33]. Global Optimization-based EMS highly coupled with the component sizing of fuel cell vehicles. To ensure the comprehensive performance of the vehicle, the optimization of component sizing of powertrain and energy management strategy should be considered simultaneously. The component sizing method based on the optimization algorithm is to search the multi-dimensional component space through various optimization algorithms to find the optimal component combination that minimizes the objective function [
34,
35].
Based on the above literature, the main research gaps are as follows: rule-based EMS are usually determined according to application-specific scenarios. When the system dynamics change, the same set of rules may not apply. Global Optimization-based EMS operate under known working conditions, which makes it difficult to be directly applied in real time control, but it can provide reference standards for the control sequences obtained by other EMS. DP is a common method used in global optimization. However, the DP-based EMS of fuel cell hybrid vehicles generally only uses a single state variable, and the idealized internal resistance model is used in the modeling of the battery. Therefore, the calculation accuracy can be further improved.
To solve the above problems, the main contributions of this paper are as follows: an energy management strategy based on multi-dimensional dynamic programming (MDDP) is proposed. In this strategy, the more accurate Thevenin model, which can reflect the polarization characteristics of the battery in higher accuracy [
36], is used for battery modeling, and battery state of charge (BSOC) and polarization voltage are used as state variables. In the reverse solving process of MDDP, dimension reduction is carried out to avoid a dimension disaster problem and improve computational efficiency. Finally, aiming at improving the fuel economy and durability, the energy management strategy is applied to the component sizing of a commercial vehicle.
This paper is organized as follows.
Section 2 describes the topology of a fuel cell hybrid powertrain and presents the model regarding fuel economy and system durability.
Section 3 introduces the MDDP-based EMS method.
Section 4 applies the proposed method to a component sizing problem.
Section 5 presents the simulation results and discussion. The conclusion is given in
Section 6.
4. Application of MDDP in Component Sizing
In this section, the MDDP algorithm is applied to component sizing. A component sizing solution process considering multiple objectives was designed, and the optimal component sizing space was obtained according to the solution results of analytical MDDP. The application object is a fuel cell commercial vehicle, the basic parameters of which are shown in
Table 5 and
Table 6.
In order to ensure that the power battery pack has a stable voltage and sufficient capacity to maintain the stable output of the fuel cell, the series and parallel number of the individual battery is determined. According to the rated voltage of the motor, the number of batteries in series is 145. The number of batteries in parallel depend on the component matching result.
In this study, simulations were conducted using MATLAB on the basis of the aforementioned energy management strategy under the C-WTCV working conditions.
Figure 8 shows the C-WLVC working conditions. The power demand of a simulated vehicle driving under C-WTVC working conditions is shown in
Figure 9.
Multi-objective optimization problem is an optimization problem in which two or more objective functions can obtain the optimal solution simultaneously. The multi-objective component sizing optimization problem of fuel cell composite energy source can be defined as
where
is multi-objective optimization function,
is the economic index of fuel cell vehicles which is expressed by equivalent hydrogen consumption (
).
is the durability index of fuel cell composite energy source system which is expressed by the capacity degradation of the fuel cell (
) and the ampere-hour throughput (
Aheff) of the battery.
is the optimized vector, including the maximum power of fuel cell (
) and the number of batteries in parallel (
np).
U is the optimization space. The optimization process framework based on MDDP is shown in
Figure 10.
According to the optimization space, 184 combinations can be matched by and np. The EMS based on MDDP for each combination is simulated. The minimum equivalent hydrogen consumption for each combination is obtained. The capacity degradation of the fuel cell () and the ampere-hour throughput (Aheff) is calculated based on the optimal state and the trajectory of control variables.
5. Results and Discussion
In the following paragraphs, simulations for each parameter vector are carried out with MDDP. Fuel economy and system durability for 184 combinations of the component are analyzed and compared.
5.1. Fuel Economy
Figure 11 illustrates the relationship of
and
np to the fuel cell direct hydrogen consumption under the C-WTCV working conditions. The simulation results show that the actual hydrogen consumption decreases with the increase of
under the same
np. When
> 250 kW, the decrease in hydrogen consumption is gentle.
Figure 12 shows the output power variation curve of fuel cell with different
(180–280 kW, 10 kW interval) when
np is 100. It shown the output of fuel cell needs to switch between three modes: high-power demand mode, normal demand mode and steady output mode, when equipped with low-power fuel cell stack. When the system is equipped with a high-power fuel cell stack, the fuel cell operating in normal power demand and smooth output mode is sufficient to provide the required power for driving, and the excess power is used to charge the battery. Because the battery does not maintain a strict constant charge, the direct hydrogen consumption of fuel cell cannot reflect the economic level of fuel cell vehicles, and the equivalent hydrogen consumption of the system is more suitable to evaluate the economy of the vehicle.
Figure 13 illustrates the relationship of
Pfcmax and
np to the fuel cell equivalent hydrogen consumption. As shown in
Figure 13a, with the increase of
Pfcmax, the equivalent hydrogen consumption gradually decreases, and with the increase of the parallel number, the equivalent hydrogen consumption curve will move downward.
Figure 13b shows that the sensitivity of equivalent hydrogen consumption to the
np decreases when
> 300 kW.
5.2. System Durability
Figure 14 shows the relationship of
Pfcmax and
np to the fuel cell capacity degradation rate. It shows the fuel cell capacity degradation rate varies greatly with the number of batteries in parallel (
np) when
Pfcmax is low. When
Pfcmax > 300 kW, the fuel cell capacity degradation rate is maintained at a relatively low level. This phenomenon is because the output of high-power fuel cell is relatively stable, while the output power of low-power fuel cell fluctuates greatly, as illustrated in
Figure 12. The result of the battery lifetime index in
Figure 15 shows that
Aheff increases linearly with
Pfcmax and
np. At a certain required power, with the increase of
Pfcmax, the frequency of high-current charging of the fuel cell will increase, as illustrated in
Figure 16 (
Pfcmax = 270–370 kW, 20 kW interval,
np = 80), which leads to the increase of the ampere-hour throughput of the battery, thus reducing the equivalent cycle life of the battery.
5.3. Discussion
According to the fuel economic analysis, the equivalent hydrogen consumption of the fuel cell when > 300 kW is at a low level and decreases with the increase of np. The system durability simulation results show that increasing np does not improve the durability of the fuel cell when > 300 kW, and the battery Aheff increases linearly with in each np.
According to the simulation of different combinations, the combination with the optimal comprehensive performance is selected as the component sizing result. When > 300, increasing has little effect on the fuel economy and durability of the fuel cell, but will increase Aheff. Therefore, is set to 300 kW, and to ensure that the battery has sufficient capacity to maintain the stable output of the fuel cell, np is set to 100. In this combination, the three objective values of a C-WTVC cycle are = 1864.9 g, Aheff = 45.581 Ah and = 0.002882410%, respectively.
To prove the necessity of the proposed EMS,
Table 7 shows the comparison of the MDDP-based EMS over the DP-based EMS under the selected combination.
The comparison result shows that, compared with the DP-based EMS, the MDDP-based EMS reached a higher global optimization accuracy in which the improvement of fuel economy is most obvious, with a reduction of 3.10%.