In this section, an example analysis will be introduced to prove the effectiveness of the combined model through comparison with other models. The example analysis is divided into four sections: experimental data and evaluation indicators, experimental data preprocessing, wind power prediction experiment, and prediction error analysis.
5.2. Experimental Data Pre-Processing
Considering the correlation of wind power data, in practical application, short-term wind power prediction mostly adopts iterative prediction, that is, the first several wind power values are used to predict the next wind power value. In this paper, the autocorrelation coefficient p of the original wind power sequences is calculated to determine the number of input variables. The results of the autocorrelation coefficient calculation for the wind power sequences are shown in
Table 4. The autocorrelation function diagram is shown in
Figure 2.
From
Table 4, it can be seen that the autocorrelation coefficient p of the first 9 data is 0.9 and above, which has a strong correlation. Therefore, in this paper, the first 8 data are selected as the input quantity, the 9th data is the predicted value, and so on to iteratively predict the power data of all test sets.
First, the original wind power data sequences are preprocessed using the CEEMDAN algorithm and CEEMD algorithm, and the results show that both CEEMDAN algorithm and CEEMD algorithm decompose the data sequences into nine IMF components and one RES component, and since the difference between the two decomposition plots is not large, this paper only places each sequence component after CEEMDAN decomposition, as shown in
Figure 3 below.
Second, the sequences of empirical modal decomposition algorithms decompose more sequence components, and both CEEMD and CEEMDAN in this paper decompose the original wind power sequence into 10 sequence components, which not only ignores the connectivity between the sequences, but also increases the workload of the subsequent prediction model. Calculating the entropy values of these 10 components and combining the components with similar entropy values into a new component can significantly reduce the workload and working time of subsequent prediction with almost no reduction in prediction accuracy. The results of calculating the entropy values of these 10 components are shown in
Figure 4.
From
Figure 4, we can see that the entropy values of the wind power sequences are basically arranged from large to small after CEEMD and CEEMDAN decomposition. The entropy values of component 1 and component 2 after CEEMDAN decomposition are much larger than the other components, indicating that their randomness is also much larger than the other components, so these two components are kept unchanged. The entropy values of component 3 to component 6 are not very different, so they are combined into a new sequence; the entropy values of components 7 to 10 are also very different, so they are also combined. After entropy calculation and reconstruction, the original sequence of 10 components is reconstructed into a new sequence of 4 components.
5.3. Wind Power Prediction Experiment
Since the accuracy of multi-step iterative prediction decreases with increasing step length [
49,
50], it is particularly important to obtain accurate first-step prediction values. To verify the prediction accuracy of the combined prediction model proposed in this paper, KELM, IBAS-KELM, FPA-KELM (KELM improved by the flower pollination algorithm), PSO-KELM (KELM improved by the particle swarm algorithm), CEEMDAN-SE-IBAS-KELM, and the CEEMDAN-SE-KELM combined with error correction proposed in this paper were constructed -IBAS-KELM combined with error correction. The combined prediction models proposed in this paper are also used to achieve wind power prediction 1 step ahead (10 min), 2 steps ahead (20 min), and 3 steps ahead (30 min).
The maximum number of iterations of the IBAS algorithm is set to 100, and the population is 1 while the maximum number of iterations of other optimization algorithms is set to 100 and the population is set to 30 and the parameters of both CEEMD and CEEMDAN are chosen to be 0.2 times standard deviation with 100 times white noise added.
The results of the CEEMDAN-SE-IBAS-KELM prediction model for the wind power of 144 sample points of Sotavento wind farms on March 17, 2021 with 1-step ahead, 2-step ahead and 3-step ahead are shown in
Figure 5. The errors of 1-step, 2-step and 3-step ahead with the true values and the results of the error prediction values are shown in
Figure 6. The results of the 1, 2, and 3-step prediction values of the CEEMDAN-SE-IBAS-KELM prediction model with error correction, are shown in
Figure 7. The error evaluation index of each prediction model, as shown in
Table 5.
As can be seen from
Table 5, the KELM model with the introduction of the kernel function decreased MAE by 16.87%, RMSE by 12.85%, and MAPE by 12.19% compared with the ELM model, which improved the model accuracy and also made the model more stable, indicating that the introduction of the kernel function significantly improved the prediction performance of the model; by comparing the three error evaluation indexes of the IBAS-KELM, CEEMD-IBAS-KELM and CEEMDAN-IBAS-KELM models, it can be seen that the prediction model without decomposition has the largest error, and the error of the decomposed prediction model is significantly reduced. It can be concluded that the prediction accuracy of prediction models based on modal decomposition has been greatly improved, indicating that the signal decomposition technology can effectively reduce the volatility of the original data. Compared with the prediction model without modal decomposition, the prediction models based on modal Compared with BAS-KELM, FPA-KELM and PSO-KELM models, the MAE decreased by 19.35%, 24.57% and 39.45%, respectively; the RMSE decreased by 19.13%, 24.32% and 39.29%; MAPE decreased by 21.74%, 25.00%, 40.00%, respectively, showing not only the effectiveness of the algorithm improvement but also the more effective merit-seeking ability of the IBAS algorithm. In the CEEMDAN-IBAS-KELM model compared with the CEEMD-IBAS-KELM model, MAE decreased by 37.14%, RMSE decreased by 37.41% and MAPE decreased by 36.36%, indicating the better decomposition ability of CEEMDAN. The one-step prediction is the result of the CEEMDAN-SE-IBAS-KELM model prediction, and it can be seen that MAE, RMSE, and MAPE are almost unchanged after SE reconstruction, but make the subsequent prediction model operation much lower. The one-step prediction after error correction decreases by 30.43%, RMSE decreases by 29.67%, and MAPE decreases by 28.57% compared with the uncorrected MAE, which proves that the error compensation correction can significantly improve the prediction accuracy.
From
Figure 6, it can be seen that the prediction accuracy decreases with the increase of the prediction step, and the trend of the error between the predicted and true values in one, two, and three steps is approximately the same, which indicates that the prediction model causes regular errors due to its own characteristics, and it is proved that the prediction error correction proposed in this paper is scientific. And the predicted error of IBAS-KELM is approximately the same as the real error trend, which proves the practicality of using the prediction error value to correct the power prediction value.
From
Table 5, it can be seen that the two-step prediction is 0.064 larger than the one-step prediction in terms of MAE, 0.078 larger in terms of RMSE, and 0.007 larger in terms of MAPE; the three-step prediction is 0.099 larger than the two-step prediction in terms of MAE, 0.119 larger in terms of RMSE, and 0.011 larger in terms of MAPE. It is proved that the prediction accuracy decreases faster and faster as the prediction step increases, which is caused by the fact that the later prediction values are affected by the prediction errors in the previous steps, and the errors will keep accumulating as the prediction step increases. The MAE of the two-step prediction with error correction decreases by 41.35%, RMSE decreases by 39.05%, and MAPE decreases by 50.00% compared with the uncorrected two-step prediction; the MAE of the three-step prediction with error correction decreases by 55.60%, RMSE decreases 50.00%, and MAPE decreases 54.17% compared with the uncorrected three-step prediction. It not only further proves the effectiveness of error correction, but also shows that as the step length increases, the error correction produces more and more influence, and the correction effect increases with the step length, which proves the superiority of error correction.
5.4. Prediction Error Analysis
Due to the randomness and volatility of wind power as well as the constraints of technology development level, the prediction error of wind power is unavoidable. With the advancement of big data analysis technology, it becomes an opportunity and challenge for wind power prediction research to effectively mine the hidden uncertainty information and laws in model prediction errors. By probabilistically fitting the wind power prediction error distribution and quantifying the possible fluctuation range of the prediction error in the probabilistic form, it is beneficial to make the expected optimal decision in the decision problem considering wind power uncertainty, thus making up for the lack of uncertainty information in deterministic prediction and providing more comprehensive information for wind farms.
In order to more intuitively demonstrate the fluctuation range of different prediction model errors, this paper will analyze the relative error of model predictions as a percentage of rated power (i.e., the prediction error minimum value), the prediction error minimum value (hereafter referred to as error) is defined as follows.
where
PE is the standardized value of prediction error;
Pact(
t) is the real value of wind power;
Ppre(
t) is the predicted value of wind power;
Pcap is the rated power of wind farm, i.e., 17 MW.
Three representative prediction models, KELM, IBAS-KELM, and CEEMDAN-SE-IBAS-KELM, were selected, and the frequency density histograms of the errors of these three prediction models were plotted, while the frequency density histograms were curve-fitted with normal distribution curves, and the mean and variance of the fitted curves were calculated, and in order to observe the distribution of the errors of these three models, the fitted curves of these three prediction model errors are put into one graph for comparative analysis. In this paper, the errors are divided into 12 intervals for frequency density calculation, where the horizontal coordinate is the error of the model and the vertical coordinate is the frequency density value, and the higher the frequency density value is, the more the number of errors in the interval. The frequency density histogram and fitted curve of each prediction model error are shown in
Figure 8.
From
Figure 8a,b, we can see that the mean and error of the normal distribution curve fitted by the IBAS-KELM model are reduced compared with the KELM model, where the mean error is reduced by 83.53% and the variance is reduced by 32.99%, and the KELM model error is distributed in the interval [−5%, 5%] and the maximum frequency density value is about 25, while the IBAS-KELM model error is distributed in the interval [−3%, 3%], and the maximum frequency density value near the zero value is about 42, indicating that the KELM model can significantly reduce the overall error after the optimization of IBAS algorithm. From (b) and (c), it can be seen that the variance of the fitted curve of the combined CEEMDAN-SE-IBAS-KELM prediction model is reduced by 61.36% compared with that of the IBAS-KELM model, and most of the errors are concentrated in the interval [−1%, 1%], and the maximum frequency density value near the zero value is about 95, which proves that the wind power data are processed by the CEEMDAN-SE algorithm and the overall prediction results are significantly reduced. SE algorithm, the overall error of the prediction results becomes smaller and more stable. From (d), it can be seen that compared with the KELM model, the mean value of the IBAS-KELM model is closer to zero, but the overall error distribution is still relatively scattered. The prediction results of the CEEMDAN-SE-IBAS-KELM combined model constructed in this paper are not only small in mean value, but the overall error is also concentrated around the zero value with almost no large deviation.
In order to show the prediction ability of different combination models more intuitively, the percentage of the number of prediction sample points in the error intervals of [−0.1%, 0.1%], [−0.3%, 0.3%], [−0.5%, 0.5%] and [−1%, 1%] of the total number of 144 prediction sample points for the combination models constructed in this paper and other comparison combination models are counted. The results of the error interval statistics for each prediction model are shown in
Figure 9 below.
In
Figure 9, the statistical results of the percentage of the number of predicted values between the CEEMDAN-SE-IBAS-KELM model and other comparative models in different error intervals are shown, and it can be seen that in all error intervals, the percentage of predicted values of the model constructed in this paper is more than that of other models. In the error interval [−0.1%, 0.1%], the proportion of the model constructed in this paper is close to 20%, which is 8.33% higher than that of the second place, indicating that the prediction accuracy of the model constructed in this paper is high, and the number of prediction values distributed in the low error interval is the largest. In the error interval [−0.3%, 0.3%], the model constructed in this paper is the only model with more than 45% of all prediction models. In the error interval [−0.5%, 0.5%], the percentages of different prediction models were 67.36%, 52.08%, 38.18%, 31.25%, 31.25%, 23.61%, 20.83%, and 11.81%, respectively, and the constructed model was 15.28%, 29.18%, 36.11%, 36.11%, 43.75%, 46.53%, and 55.55% higher than the other prediction models, respectively. In the interval [−1%, 1%], the number of predicted values of the BAS-KLL model and IBAS-KEL model accounted for more than 50%, and the prediction model using the data decomposition algorithm accounted for more than 70%, and the model constructed in this paper accounted for 93.75%, which was 13.89% higher than that of CEEMD-IBAS-KELM model. In summary, in the short-term wind power prediction, most of the prediction points of the model constructed in this paper are distributed in a low error interval, which shows that the CEEMDAN-SE-IBAS-KELM combination model constructed in this paper has excellent prediction ability.
In order to more intuitively demonstrate the prediction accuracy of the model after error correction, the one-step iterative prediction results and the one-step iterative prediction results with error correction are selected, and the frequency density histograms and fitted curves of the errors of these two models are plotted, as shown in
Figure 10.
From
Figure 10a,b, it can be seen that the mean difference between the single-step iterative error with error correction and the normal distribution curve fitted by the single-step iterative error without error correction is small but the variance is reduced by 35.29%, and the error of the model without error correction is basically distributed in the interval [−1%, 1%] and the maximum frequency density value near the zero value is about 95, while the error of the model with error correction The errors of the model with error correction are basically distributed in the interval [−0.5%, 0.5%], and the maximum frequency density value near the zero value is about 130, indicating that the combined CEEMDAN-SE-IBAS-KELM model with error correction has higher prediction accuracy. From
Figure 10c, it can be seen that the overall distribution of the fitted curves of the combined error correction model is more concentrated, indicating that the error of the model becomes smaller after the error correction, and the overall aggregation is near the zero value.
In order to more intuitively represent the predictive ability of the over-the-top one-two-three step model with error correction and the over-the-top one-two-three step model without error correction, the number of predictions with different model error intervals in [−0.1%, 0.1%], [−0.3%, 0.3%], [−0.5%, 0.5%], and [−1%, 1%] as a percentage of the total number of 144 predictions are counted in this paper, and the specific results of the error interval evaluation for each prediction model are shown in
Table 6 below.
It can be analyzed from
Table 6 that compared with the combined CEEMDAN-SE-IBAS-KELM model without the error correction, the percentage of the number of predicted values in different error intervals for the combined CEEMDAN-SE-IBAS-KELM model with the error correction for the one, two, and three steps ahead is significantly improved. In the error interval [−0.1%, 0.1%], the percentage of the prediction results of the combined error-corrected model with one, two, and three steps ahead of the combined model increased by 1.39%, 10.42%, and 9.03%, respectively; in the error interval [−0.3%, 0.3%], the percentage of the prediction results of the combined model with one, two, three steps ahead of the combined model increased by 16%, respectively, compared with that of the uncombined model with one, two, three steps ahead of the combined model. In the error interval [−0.5%, 0.5%], the prediction results of the model with error correction increase by 17.36%, 25%, 29.17%, and 35.42%, respectively, compared with the prediction results of the model without the combination of one, two, and three steps ahead; in the error interval [−1%, 1%], the prediction results of the model with error correction increase by 17.36%, 25%, 29.17%, and 35.42%, respectively, compared with the prediction results of the model without the combination of one, two, and three steps ahead. In the error interval [−1%, 1%], the prediction results of the model with error correction increase by 4.86%, 23.61%, and 40.97%, respectively, compared with the prediction results of the model without the combination of the one, two, and three steps ahead. In summary, the combined CEEMDAN-SE-IBAS-KELM model with error correction has higher prediction accuracy and is more suitable for multi-step iterative wind power prediction.