Next Article in Journal
Short-Term Load Forecasting Using Adaptive Annealing Learning Algorithm Based Reinforcement Neural Network
Next Article in Special Issue
Scenario Analysis of Carbon Emissions of China’s Electric Power Industry Up to 2030
Previous Article in Journal
Financing Innovations for the Renewable Energy Transition in Europe
Previous Article in Special Issue
Analysis of Power Network for Line Reactance Variation to Improve Total Transmission Capacity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hybrid Short Term Wind Speed Forecasting Using Variational Mode Decomposition and a Weighted Regularized Extreme Learning Machine

School of Electrical Engineering, Northeast Electric Power University, Jilin 132012, China
*
Author to whom correspondence should be addressed.
Energies 2016, 9(12), 989; https://doi.org/10.3390/en9120989
Submission received: 3 August 2016 / Revised: 20 November 2016 / Accepted: 22 November 2016 / Published: 25 November 2016
(This article belongs to the Special Issue Electric Power Systems Research 2017)

Abstract

:
Accurate wind speed forecasting is a fundamental element of wind power prediction. Thus, a new hybrid wind speed forecasting model, using variational mode decomposition (VMD), the partial autocorrelation function (PACF), and weighted regularized extreme learning machine (WRELM), is proposed to improve the accuracy of wind speed forecasting. First, the historic wind speed time series is decomposed into several intrinsic mode functions (IMFs). Second, the partial correlation of each IMF sequence is analyzed using PACF to select the optimal subfeature set for particular predictors of each IMF. Then, the predictors of each IMF are constructed in order to enhance its strength using WRELM. Finally, wind speed is obtained by adding up all the predictors. The experiment, using real wind speed data, verified the effectiveness and advancement of the new approach.

1. Introduction

Wind energy has been gaining more and more attention from all over the world as a renewable and clean energy source due to an increasing consumption of fossil energy and the pressure of environmental protection [1,2,3,4]. However, wind power is severely intermittent, volatile, and stochastic. It is not conducive to the safe and stable operation of the traditional electrical grid, power system generation planning, and economic electric power dispatch. Therefore, large-scale grid-connected operations should be based on accurate wind speed forecasting results in order to reduce wind power fluctuations in power systems. Accurate wind speed forecasting can improve the accuracy of wind power prediction; therefore, research on wind speed forecasting has significance and application value [5,6].
Existing wind speed forecasting methods can be divided into ultra-short-term, short-term, medium-term, and long-term forecasts from the forecast time scale [7]. Short-term wind speed forecasting is an important basis for the economic electric power dispatch of wind power grids. It helps to improve power quality and to maintain the reliability and stability of power grids. The short-term wind speed forecasting model can be divided into three types: statistical, physical, and intelligent models [8,9,10,11,12,13].
The statistical model is built through analysis of the correlation between the wind speed data of each time point in the wind speed series. Linear mathematical models, such as the auto regressive (AR) method [14] and auto regressive integrated moving average (ARIMA) [15], are established using the functional relationship between historical wind speed data and output wind speeds. Statistical models have simple principles and high efficiencies. However, the prediction accuracy of low-order statistical models is not high, while high-order model parameters are extremely difficult to determine.
A physical model relies on information from numerical weather reports (NWPs). This model is constructed on numerous meteorological and geographic properties, including air pressure, temperature, humidity, surface roughness, and contour [16,17]. The input dimension of the physical model is extremely high and the operation is complex due to the large number of factors. These are not conducive to the improvement of prediction accuracies. Moreover, the influence of numerical weather report (NWP) error on wind speed prediction is difficult to quantitatively estimate.
An intelligent model can build the nonlinear intelligent prediction model, which can compensate for the deficiencies in the linear prediction model. The forecasting result is close to that of the real wind speed. The neural networks (NNs) in the existing intelligent model have excellent nonlinear learning and generalization abilities; however, the setting of neural network (NN) parameters is complex. The training time of the intelligent predictor is long and requires a large number of training samples. The limitation of NNs is that they are not conducive to the improvement of prediction accuracy [13,18]. An extreme learning machine (ELM) [19,20,21,22] is proposed, based on the single hidden layer feedforward network (SLFN) [19,20,21,22]. ELM has the advantages of a simple structure, high learning efficiency, and strong generalization ability. ELM has been proven capable of obtaining accurate prediction results using a small training set. However, the modeling process of ELM is considered a structural risk. Therefore, the prediction model based on traditional ELM is not the best model. Moreover, the prediction accuracy of the ELM model is vulnerable to outlier interference [23,24,25]. Weighted regularization ELM (WRELM) considers structure and empirical risks in the building of an optimal predictive model, and integrates weight in order to avoid interference from outliers in the training process [26,27]. WRELM can adjust the correlation weight of ELM automatically, according to the training errors during the training process. According to the characteristics of the training samples, an optimal forecasting model with fewer training samples and a high forecasting accuracy, based on WRELM, can be automatically built [26]. Thus, WRELM is more suitable for short-term wind speed forecasting.
The fluctuations and the randomness of wind speeds are the main factors that influence the accuracy of wind speed forecasting. The signal processing methods used to decompose the signal reduce the fluctuations and the randomness of the data in the wind speed sequence, and obtain a more regular sub-series [28,29,30,31,32]. Subsequently, a hybrid wind speed forecasting model is constructed using the sub-predictors of the sub-series. Thus, the fluctuations and randomness of the original signals are reduced and the classification of wind speed forecasting is improved. Existing hybrid methods often use wavelet transforms (WTs) and empirical mode decomposition (EMD) in order to decompose the original wind speed sequence [28,29,30,31]. WTs have very good decomposition ability for non-singular signals; however, obtaining satisfactory results according to specific circumstances with appropriate basis functions and decomposition scales is difficult [28]. As an adaptive decomposition method, intrinsic mode functions (IMFs), decomposed using EMD, could maintain the regularity of a wind speed signal in different frequency domains. Although constructing the hybrid wind speed forecasting model, based on EMD, is convenient, some defects, such as mode mixing and false modes, still exist. These defects lead to the uncertainty in the center frequency and bandwidth of the signal frequency band of the IMF component and affect the forecasting accuracy of the hybrid model. Variational mode decomposition (VMD) can adaptively decompose the signal in the frequency domain and transform the original signals into several IMFs with strong continuity and correlation [33,34,35,36,37]. Unlike EMD, VMD reduces the nonlinearity and fluctuations of the wind speed time series, and avoids the influence of mode mixing defects on wind speed predictions.
The results of the wind speed time series decomposition differ in different periods when the wind speed time series is pretreated with the signal processing method. Therefore, feature selection, according to the correlation of the elements, should be performed in each sub-sequence in order to construct the optimal prediction model for each. The partial autocorrelation function (PACF) is often used to measure the degree of a time series and to remove autocorrelation interference [35,38]. The IMF sequence generated by the VMD treatment has a certain correlation and stability; therefore, the optimal input vector of the predictor for each IMF can be separately determined by PACF.
A new hybrid method of short-term wind speed prediction, using VMD, PACF, and WRELM, is proposed. First, VMD is used to decompose the wind speed sequence and to obtain relatively stable IMFs. This step reduces the influence of the random and fluctuating wind speed series on the prediction accuracy of wind speed forecasting. Second, PACF is used to determine the maximum input order for each IMF sequence in order to obtain the optimal feature vector of each predictor based on the WRELM for different IMFs. Subsequently, the parameters of the forecast model are automatically adjusted, and the WRELM-based optimal forecasting model is obtained in the training process. Finally, each forecasting model is used to predict the corresponding IMF sequence, and the prediction results of the WRELM model are added to obtain the final wind speed forecasting result. The experiment is performed using real wind speed data from the Measurement and Instrumentation Data Center (MIDC) of the National Renewable Energy Laboratory (NREL) in America in order to verify the effectiveness of the new method.

2. Structure and Methodology of the New Hybrid Model

The new method includes VMD wind speed decomposition, PACF feature selection, and modular WRELM prediction. Figure 1 shows the forecasting process of the new approach.
As shown in Figure 1, the hybrid wind speed forecasting method mainly consists of two stages. The main contents of the two stages are described as follows.
Stage I: Data Processing
As shown in Figure 1, the new method first uses the VMD decomposition method to analyze the wind speed forecast time series in a specific time window. This method also obtains relatively stable IMF sub-sequences. The optimal variational mode layers are determined by the variation of the center frequency of the IMF subsequence. Finally, the optimal IMF sub-sequence is obtained by using the optimal decomposition layer number k.
Stage II: Hybrid Forecasting Model Construction
For every IMF sub-sequence, PACF is used to calculate the partial autocorrelation after the removal of the autocorrelation. Then, the input variable for the corresponding prediction model is selected, based on the partial autocorrelation measured by the PACF value. Subsequently, the WRELM prediction network is constructed for each IMF subsequence layer, with the output variable of wind speed in the forecasting point. A cross-validation method is used to determine the network parameters of WRELM, including the activation function and node number of the hidden layer. The final prediction results can be obtained by combining the predicted results of WRELMs corresponding to different IMF sequences.

2.1. Variational Mode Decomposition (VMD)

VMD decomposes the wind speed time series in the variational framework, which is different from EMD, which uses cyclic screening. The decomposition process of VMD is the construction and solution of the variational problem. Assuming that each mode has a limited bandwidth, which is compacted around a center frequency, the variational problem is decomposing the wind speed series, f, into { u k ( t ) }, where k is number of the modes.
Wind speed time series f is assumed decomposed into k modes, and each mode has a finite bandwidth and a center frequency. The objective function seeks the k modes of wind speed decomposed by VMD, of which the sum value of the estimated bandwidth is the minimum. In addition, the sum of the IMFs must be equal to the input signal, f, which is the constraint condition. The estimation procedure of the IMF bandwidth is as follows:
(a)
Hilbert transform is used to decompose the wind speed time series, f, and the analytic signal of each mode function, u k ( t ) , is calculated in order to obtain the corresponding unilateral spectrum.
(b)
Estimated pulsation is obtained by mixing e j ω k t . The frequency spectrum of each mode is modulated to the corresponding baseband.
(c)
The L2 norm of the demodulation signal gradient is calculated and the bandwidth of each mode is estimated.
Subsequently, the variational problem is described as follows:
{ min { u k } , { ω k } { k t [ ( δ ( t ) + j π t ) * u k ( t ) ] e j ω k t 2 2 } s . t . k u k = f
where { u k } = { u 1 , u 2 , , u k } , k = k 1 k , { ω k } = { ω 1 , ω 2 , , ω k } , and ω k are the center frequency of the kth mode.
To obtain the optimal solution of the constrained variable problem, the issue is addressed as an unconstrained variable problem by introducing a quadratic penalty factor, α , and the Lagrange multiplication operator, λ ( t ) [33,34]. The augmented Lagrange equation is defined as:
L ( { u k } , { ω k } , λ ) = α k t [ ( δ ( t ) + j π t ) * u k ( t ) ] e j ω k t 2 2 + f ( t ) k u k ( t ) 2 2 + λ ( t ) , f ( t ) k u k ( t )
Then, the alternate direction method of multipliers (ADMM) is used to find the iterative sub-optimizations for the augmented Lagrange equation of Equation (2). The solutions for Equation (2) are expressed as:
u ^ k n + 1 ( ω ) = f ^ ( ω ) i k u ^ i ( ω ) + λ ^ ( ω ) 2 1 + 2 α ( ω ω k ) 2
ω k n + 1 = 0 ω | u ^ k ( ω ) | 2   d ω 0 | u ^ k ( ω ) | 2   d ω
where f ^ ( ω ) , λ ^ ( ω ) , u ^ k n + 1 ( ω ) , and u ^ i ( ω ) are the Fourier transform results of f ( t ) , λ ( t ) , u k n + 1 ( t ) , and u i ( t ) , respectively, and n is the number of iterations.

2.2. Partial Autocorrelation Function (PACF)

The data correlation of the mode decomposed by the VMD in each period is different because of the different wind speed fluctuations in each period. Thus, in the design of the prediction model corresponding to various IMF sequences, the correlation between the data in each IMF needs to be analyzed based on the current decomposition results, and the optimal feature vector of each predictor can be selected by the correlation. PACF is used to evaluate the correlation and the result.
For IMF sub-sequence 1 s L , let x t be the forecasting variable of the predictor. If the partial autocorrelation at lag s is outside of the 95% confidence interval, which is calculated as [ 1.96 / L ,   1.96 / L ] , x t s is one of the selected features for the predictor [35,38]. If all the PACF coefficients are within the 95% confidence interval, x t 1 will be selected as the only input variable of the predictor. The PACF process is as follows [35,38]; the covariance at lag s (if s = 0, it is the variance) is described as:
γ ^ s = 1 L t = 1 N s ( x t x ¯ ) ( x t + s x ¯ ) ,   s = 1 , 2 , , M
where x ¯ is the mean value of IMF and M is the maximum lag. Then, the autocorrelation function (ACF) at lag s is:
ρ ^ s = γ ^ s γ ^ 0
According to Equations (5) and (6), PACF at lag s can be calculated as:
α ^ 1 , 1 = ρ ^ 1 α ^ s + 1 , s + 1 = ρ ^ s + 1 j = 1 s ρ ^ s + 1 j α ^ s j 1 j = 1 s ρ ^ j α ^ s j α ^ s + 1 , j = α ^ s j α ^ s + 1 , s + 1 α ^ s , s j + 1 ( j = 1 , 2 , , s ) }
where 1 s M .

2.3. Weighted Regularization Extreme Learning Machine (WRELM)

ELM is proposed based on SLFNs. Compared with other intelligent methods, it has the advantages of having a simple structure, fast calculation speeds, a high forecasting accuracy, and fewer training sample requirements [15,16,17,18]. According to the wind speed forcasting process, the ELM for regression is constructed in this paper using multiple inputs and a single output.
Assuming a data set with N training samples, { ( x i , y i ) } i = 1 N , where the input vector is x i n and the output is y i , the ELM predictor between x i and y i with L hidden nodes can be described as [24]:
j = 1 L β j g ( w j x i + b j ) = y i , i = 1 , 2 , , N
where g ( x ) is the activation function, w j is the randomly selected input weight vector between the ith hidden neuron and the input neuron, b j is the randomly selected bias of the ith hidden node, and β j is the weight that connects the ith hidden neuron and the output neuron. Equations can be described as a single linear system:
H β = y
where H ( w 1 , , w L , x 1 , , x N , b 1 , , b L ) = [ g ( w 1 x 1 + b 1 ) g ( w L x 1 + b L ) g ( w 1 x N + b 1 ) g ( w L x N + b L ) ] N × L , β = [ β 1 , β 2 , , β L ] T , and y = [ y 1 , y 2 , , y N ] T . The structure of the traditional ELM is shown in Figure 2.
The goal of the ELM is to find the optimal least squares solution, β , by minimizing the associated loss function:
min y H β 2 2
In the process of actual regression application, the training sample is more than the testing sample. Thus, the optimal solution can be written as:
β ^ = ( H T H ) 1 H T y
Though ELM has many advantages, the regression accuracy of ELM still suffers from outliers in the training set. Otherwise, the goal of minimizing the training error might cause an overfitting limitation, which could affect the forcasting accuracy of ELM. To overcome the limitation of ELM, WRELM is employed in order to construct the wind speed predictor. WRELM [24] is designed using a regularized extreme learning machine (RELM) [27]. In RELM, a regularization parameter is used to balance the training error and the norm of output weight in order to avoid the overfitting limitation. WRELM further improves RELM by reducing the influence of outliers. The mathematical model of WRELM can be written as [24]:
min β   C   We 2 2 + β 2 2 s u b j e c t   t o   y H β = e
where W = d i a g { w 1 , w 2 , , w N } is the extension form of e 2 2 , which is obtained from the error variable weight, ei, using weighing factor wi, e = [ e 1 , e 2 , , e N ] T W = d i a g { w 1 , w 2 , , w N } ; C is a regularization parameter; and e is a training error vector with N variables, where e = [ e 1 , e 2 , , e N ] T .
The optimal solution of β for WRELM is as follows:
β ^ = ( H T W 2 H + I C ) 1 H T W 2 y
where I is a unit matrix.
Existing research certifies that WRELM has better regression accuracy than ELM or RELM. Moreover, WRELM effectively avoids the influence of overfitting and outliers on the forcasting accuracy [24]. The details of the derivation procedure and the parameter setting of WRELM can be found in Reference [24].

3. Case Study

In this section, the effectiveness of the proposed hybrid approach on historical wind speed observations in order to predict the short-term wind speed is verified using real wind speed data. The observations are obtained from NREL. All the experiments are performed in Matlab 8.5 (2015a) running on an Intel® Core™ i7-6700 processor operating at 3.40 GHz. The WRELM and VMD toolboxes are from Zhang [24] and Dragomiretskiy [33], respectively.

3.1. Data Sets and Evaluation Criteria

The change of wind speed data is uncontrollable because wind speed data is influenced by meteorological factors. The correlation between the wind speed data in the forecast period and the historic wind speed data with a long time interval is low. Therefore, the data quantity for model verification is set as in Reference [32]. The mean half-hour wind speed data for two days (i.e., 96 training samples) are used to train the forecasting model under a new approach. The model will then be constructed between the historical wind speed data and the current wind speed [32]. Considering the fluctuation of wind speeds and weather differences between day and night, the correlation between the 12-hour historical data and the forecasting data is analyzed using PACF in order to construct the optimal input vector of each predictor. Hence, length L of the wind speed time series for correlation analysis is 120, which means that a wind speed time series with 120 samples is decomposed by VMD. The maximum lag, M, is 24, and the number of training samples, N, for WRELM is 96.
In practice, two-thirds of the entire data set is used to train the model and one-third is used to test the forecasting accuracy. The testing set for the new approach contains 48 samples (i.e., one day) because the training set contains 96 samples. The data used for training and testing the effectiveness of the new approach for forecasting the wind speed on 20 February 2004 is shown in Figure 3.
To assess the forecasting accuracy of the new approach and the comparative methods, the mean absolute percentage error (MAPE) and root mean square error (RMSE) are used as evaluation criteria.
MAPE = 1 K i = 1 K | x i x ^ i x i | × 100 %
RMSE = 1 K i = 1 K ( x i x ^ i ) 2
where K is the sample number of the testing set, x i is the real value of wind speed at i, and x ^ i is the corresponding forecasting value.

3.2. Decomposition of the Wind Speed Series Based on VMD

Before the wind speed sequence is decomposed by VMD, the decomposition layers, k, need to be set first. If k is too small, the signal is not fully decomposed. Accurately characterizing the signal into different frequency ranges is difficult. If the signal is over-decomposed, the mode in the high frequency is excessive. Reducing the difference between each mode will increase the complexity and reduce the accuracy of the hybrid model.
The optimal mode number can be selected by the difference between the center frequencies ( Δ f ) of IMF k and IMF (k − 1), as the center frequency is closely related to the decomposition results of VMD [35]. Table 1 shows the center frequency with different modes, k, of the decomposing results of the wind speed decomposition series in Figure 3.
As shown in Table 1, the value of x t s is significantly reduced when the value of k is from one to seven, while the value of Δ f tends to be stable. Thus, we consider seven as the optimal value of k.
To further validate the effectiveness of the method for k optimization, wind data from 20 days, randomly selected from 2004, are used in order to verify the forecasting accuracy of the new hybrid wind speed forecasting method with different k values, from one to 10. The optimal number of decomposition layers is evaluated by the prediction error (MAPE and RMSE) of one-step forecasting using the new approach with different decomposition layers. In those 20 days, the optimal decomposition layer with a minimum error of 17 days is seven. The prediction error of seven decomposition layers of other days, with different optimal decomposition layers, is very close to that of the minimum error of the predicted result with the optimal layer number. The mean errors of the different decomposition layers of the one-step forecasting are shown in Table 2.
As shown in Table 2, when the decomposing layer of VMD is seven (k = 7), the mean MAPE and RMSE are the minimum values, 11.0996 and 0.2810, respectively. To obtain optimal prediction results in general, the optimal decomposition layer with the minimum mean error of seven is selected in the VMD decomposition, according to the mean error of the statistical experiment.
On the basis of determining the decomposition level of VMD, a comparative experiment on decomposing a normalized wind speed series is proposed, based on VMD and EMD. The other parameters of VMD are set according to References [34,35,36,37], and the parameters of EMD are set according to Reference [30]. The experimental results of series decomposition are shown in Figure 4 and Figure 5.
As shown in Figure 4 and Figure 5, the original wind speed signal can be decomposed by VMD and EMD for seven and five IMFs, respectively. The results of the EMD method also include a residue component.
The IMF sequence generated by VMD in the low-frequency area with a large amplitude had fewer fluctuations. The change trends of the data with a large amplitude, such as IMF1, IMF2, and IMF3 of VMD, were stable. For these types of data, any predictor can achieve a better prediction effect. The predictors corresponding to the IMFs with large amplitudes will have fewer forecasting errors. The IMFs in the high-frequency area with a small amplitude changed drastically. The predictors corresponding to IMFs with small amplitudes will have larger errors. However, the error of the high-frequency predictor has a limited effect on the prediction accuracy of the hybrid model because of the small magnitude of the predicted object. The characteristics of the VMD decomposition results are conducive to the improvement of the prediction accuracy of wind speed forecasting because the final prediction result, based on VMD, is the sum of the prediction results of each IMF.
However, IMFs decomposed by EMD have different characteristics. IMFs with a large amplitude change drastically. The forecasting errors of IMFs with large amplitudes are also sizable because of the intense fluctuations in the EMD decomposition results. The violent fluctuation trends of IMFs with large amplitudes will lead to a larger error in the final prediction results of the EMD-based predictor. Furthermore, the results of the EMD are not conducive to the improvement of the prediction accuracy. IMFs decomposed by VMD are significantly more suitable for the hybrid wind speed forecasting model construction than those decomposed by EMD.

3.3. Optimal Feature Vector Construction Based on PACF

The change trends and the correlation between data elements are different in IMFs decomposed by VMD at different periods. To obtain the optimal predictors, the input variables of the predictors should be selected according to the decomposition results of VMD. In this paper, the length of the wind series, L, is 120, and its confidence interval of 95% is [−0.179, +0.179]. The partial autocorrelation of the original wind speed series and the IMFs decomposed by VMD are shown in Figure 6. According to the partial autocorrelograms, the variable with lag s, denoted as x t s with the partial autocorrelation value out of the confidence interval, will be selected to construct the input vector of the predictor corresponding to the specific IMF.
Figure 6 shows the characteristics of the partial autocorrelation of the original wind speed and the IMFs. Only two variables can be selected by PACF because of the large fluctuation in the original wind speed. However, IMFs decomposed by VMD have a certain stability and regularity; thus, more related variables can be selected by PACF. Compared to the original wind speed series, the IMFs of VMD can easily establish a prediction model. The compositions of the input vectors of different predictors determined by PACF are shown in Table 3.

3.4. Forecasting Model for Wind Speed Based on WRELM

After determining the input vector, WRELMs are used to predict the IMFs decomposed by VMD. Using MAPE as the index, and randomly selecting 10 days as the test set, the activation function and the number of hidden layer neurons of WRELM are determined using the cross-validation method [25,28]. The forecasting accuracies with different characteristics of WRELM are shown in Table 4. As Table 4 shows, the WRELM with the “sine” activation function and 20 hidden layer neurons has the highest forecasting accuracy. Thus, the characteristics of WRELM will be selected by the result.

3.5. Forecasting Results of the New Hybrid Model

To comprehensively evaluate the prediction performance of the new hybrid model, four days in different quarters of 2004 are used to predict wind speed. The one-step and multi-step prediction (from two steps ahead to three steps ahead) results are shown in Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10.

3.6. Comparison Experiments and Discussion

Figure 7, Figure 8, Figure 9 and Figure 10 show that, although the wind speed curves on different dates have different fluctuations and ranges of speed, the new model could accurately predict wind speed. One-step-ahead prediction results are very close to the actual wind speed curve. The forecasting accuracy of two-step-ahead and three-step-ahead predictions declined to different degrees. However, the results of the multi-step-ahead forecasting of the new approach are still close to those of the real wind speed. The experiments illustrate the stability and adaptability of the new approach.
The experiment with five types of hybrid wind speed forecasting methods is proposed in this section in order to show the advancements of the new hybrid approach. The wind speed data from 20 February 2004, 20 May 2004, 20 August 2004, and 20 November 2004 are used to verify the accuracy of the different approaches. The one-step-ahead and multi-step-ahead forecasting results of the different approaches are shown in Figure 11, Figure 12, Figure 13 and Figure 14. The mean forecasting errors with five types of wind speed approaches, in four days, are shown in Table 5.
The methods used for the experiments include VMD + ELM, VMD + WRELM, EMD + PACF + WRELM, VMD + PACF + ELM, and VMD + PACF + WRELM. The predictors without the feature selection step use the variables from lag 1 to lag 24 as the input vectors. The ELM characteristics of each method are optimized by cross-validation [21,22].
The contrast methods used to verify the advancement of the new approach do not consider methods with different intelligent algorithms because existing research has illustrated the advanced nature of ELM relative to traditional intelligent algorithms, such as Back Propagation Neural Networks (BPNN) and Support Vector Machine (SVM) [23,24]. The experiments mainly show the advancement of PACF and WRELM used in the new approach.
As shown in Figure 11, Figure 12, Figure 13 and Figure 14, the new approach has the highest accuracy out of the five approaches. The forecasting curve of the new approach is closest to that of the true wind speed curve. Detailed analyses are performed based on Table 5.
Compared to VMD + ELM without PACF, VMD + PACF + ELM reduced MAPE from 11.8011, 16.4512, and 18.4901 to 11.3644, 15.8224, and 17.8260, respectively. Compared to the VMD + WRELM method, VMD + PACF + WRELM reduced MAPE from 11.4518, 15.3523, and 17.6016 to 11.0912, 14.6710, and 16.7378, respectively. The accuracy evaluated by the RMSE was also higher using the methods that included PACF. These verified the benefits of PACF relative to the improvement of the forecasting accuracy of the entire model.
Meanwhile, according to Table 5, the hybrid method with WRELM has a higher forecasting accuracy than methods with ELM in each type of multi-step forecasting. The method with VMD decomposition has a much higher forecasting accuracy than the EMD-based method. The experimental results verified the advancements of WRELM and VMD.
To further illustrate the effectiveness and advancement of the new approach, it was used to predict wind speeds in a longer time interval (five days). In this experiment, the new approach is compared with the Autoregressive Integrated Moving Average (ARIMA) and Support Vector Regression (SVR) based methods. SVR is used instead of WRELM to construct a contrast method with VMD, PACF, and SVR. The kernel function charaters of SVR are set according to References [10,39]. The optimal penalty factor and kernel function variance of SVR are determined using the cross-validation method. The characteristics of ARIMA are set according to References [5,12]. The experimental results of the long interval are shown in Figure 15 and Table 6.
Figure 15 shows that the new approach has the best prediction accuracy in the experiment. The prediction results of the new approach accurately track the change trends of the true wind speed, especially in the time interval of 100 to 150 sample points. When the real wind speed suddenly changed from a high speed to a low speed, the new approach predicted the change trend much more accurately than the other methods. Table 6 shows that the MAPE and RMSE of the new approach are much lower than those of the other methods. The experimental results fully verified the advancement of the new approach.

4. Conclusions

To improve the short-term wind speed forecasting accuracy, a new hybrid wind speed forecasting approach, based on VMD, PACF, and WRELM, is proposed. This new approach has the following advantages:
(1)
The wind speed series is decomposed by VMD with an optimal decomposing layer selected through center frequency analysis in order to reduce the influences of randomness and volatility. The results of VMD are more precise than those of EMD, and show that it is suitable for hybrid forecasting model construction with fewer amplitude fluctuations in the IMFs of a low frequency range.
(2)
PACF, which is used for the correlation analysis of IMFs, determines the optimal input vector of each predictor. It effectively reduces the feature dimension and complexity of the model and leads to the improvement of prediction efficiency and accuracy.
(3)
WRELM is used to build the predictors of the hybrid model. The weights of WRELM are adjusted automatically according to different training sets. WRELM improves the prediction accuracy by avoiding the negative effects of the outlier samples in the data set.
The experimental results show that the new method is effective and advanced.

Acknowledgments

This work is supported by the National Nature Science Foundation of China (No. 51307020), the Science and Technology Development Project of Jilin Province (No. 20160411003XH) and the Science and Technology Foundation of Department of Education of Jilin Province (2016, No. 90).

Author Contributions

Nantian Huang put forward to the main idea and designed the entire structure of this paper. Chong Yuan did the experiments and prepared the manuscript. Guowei Cai guided the experiments and paper writing. Enkai Xing completed data preprocessing.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Men, Z.X.; Yee, E.; Lien, F.S.; Wen, D.Y.; Chen, Y.S. Short-term wind speed and power forecasting using an ensemble of mixture density neural networks. Renew. Energy 2016, 87, 203–211. [Google Scholar] [CrossRef]
  2. Potter, C.W.; Lew, D.; McCaa, J.; Cheng, S.; Eichelberger, S.; Grimit, E. Creating the dataset for the western wind and solar integration study (U.S.A.). Wind Eng. 2008, 32, 325–338. [Google Scholar] [CrossRef]
  3. Heng, J.N.; Wang, C.; Zhao, X.J.; Xiao, L.Y. Research and application based on adaptive boosting strategy and modified CGFPA algorithm: A case study for wind speed forecasting. Sustainability 2016, 8, 235. [Google Scholar] [CrossRef]
  4. Hu, Q.; Zhang, S.; Yu, M.; Xie, Z. Short-term wind speed or power forecasting with heteroscedastic support vector regression. IEEE Trans. Sustain. Energy 2016, 7, 241–249. [Google Scholar] [CrossRef]
  5. Cadenas, E.; Rivera, W.; Campos-Amezcua, R.; Heard, C. Wind speed prediction using a univariate ARIMA model and a multivariate NARX model. Energies 2016, 9, 109. [Google Scholar] [CrossRef]
  6. Noorollahi, Y.; Jokar, M.A.; Kalhor, A. Using artificial neural networks for temporal and spatial wind speed forecasting in Iran. Energy Convers. Manag. 2016, 115, 17–25. [Google Scholar] [CrossRef]
  7. Doucoure, B.; Agbossou, K.; Cardenas, A. Time series prediction using artificial wavelet neural network and multi-resolution analysis: Application to wind speed data. Renew. Energy 2016, 92, 202–211. [Google Scholar] [CrossRef]
  8. Zhao, J.; Guo, Z.H.; Su, Z.Y.; Zhao, Z.Y.; Xiao, X.; Liu, F. An improved multi-step forecasting model based on WRF ensembles and creative fuzzy systems for wind speed. Appl. Energy 2016, 162, 808–826. [Google Scholar] [CrossRef]
  9. Sfetsos, A. A novel approach for the forecasting of mean hourly wind speed time series. Renew. Energy 2002, 27, 163–174. [Google Scholar] [CrossRef]
  10. Santamaría-Bonfil, G.; Reyes-Ballesteros, A.; Gershenson, C. Wind speed forecasting for wind farms: A method based on support vector regression. Renew. Energy 2016, 85, 790–809. [Google Scholar] [CrossRef]
  11. Wang, J.Z.; Zhang, F.Y.; Liu, F.; Ma, J.J. Hybrid forecasting model-based data mining and genetic algorithm-adaptive particle swarm optimisation: A case study of wind speed time series. IET Renew. Power Gener. 2016, 10, 287–298. [Google Scholar] [CrossRef]
  12. Lydia, M.; Kumar, S.S.; Selvakumar, A.I.; Kumar, G.E.P. Linear and non-linear autoregressive models for short-term wind speed forecasting. Energy Convers. Manag. 2016, 112, 115–124. [Google Scholar] [CrossRef]
  13. Hu, Q.H.; Zhang, R.J.; Zhou, Y.C. Transfer learning for short-term wind speed prediction with deep neural networks. Renew. Energy 2016, 85, 83–95. [Google Scholar] [CrossRef]
  14. Hill, D.C.; McMillan, D.; Bell, K.R.W.; Infield, D. Application of auto-regressive models to U.K. wind speed data for power system impact studies. IEEE Trans. Sustain. Energy 2012, 3, 134–141. [Google Scholar] [CrossRef]
  15. Liu, H.; Erdem, E.; Shi, J. Comprehensive evaluation of ARMA–GARCH(-M) approaches for modeling the mean and volatility of wind speed. Appl. Energy 2011, 88, 724–732. [Google Scholar] [CrossRef]
  16. Cassola, F.; Burlando, M. Wind speed and wind energy forecast through kalman filtering of numerical weather prediction model output. Appl. Energy 2012, 99, 154–166. [Google Scholar] [CrossRef]
  17. Barbounis, T.G.; Theocharis, J.B. A locally recurrent fuzzy neural network with application to the wind speed prediction using spatial correlation. Neurocomputing 2007, 70, 1525–1542. [Google Scholar] [CrossRef]
  18. Shrivastava, N.A.; Lohia, K.; Panigrahi, B.K. A multiobjective framework for wind speed prediction interval forecasts. Renew. Energy 2016, 87, 903–910. [Google Scholar] [CrossRef]
  19. Huang, G.-B. What are extreme learning machines? Filling the gap between Frank Rosenblatt’s dream and John von Neumann’s puzzle. Cogn. Comput. 2015, 7, 263–278. [Google Scholar] [CrossRef]
  20. Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501. [Google Scholar] [CrossRef]
  21. Wang, J.Z.; Hu, J.M.; Ma, K.L.; Zhang, Y.X. A self-adaptive hybrid approach for wind speed forecasting. Renew. Energy 2015, 78, 374–385. [Google Scholar] [CrossRef]
  22. Liu, H.; Tian, H.Q.; Li, Y.F. Four wind speed multi-step forecasting models using extreme learning machines and signal decomposing algorithms. Energy Convers. Manag. 2015, 100, 16–22. [Google Scholar] [CrossRef]
  23. Zong, W.W.; Huang, G.B.; Chen, Y.Q. Weighted extreme learning machine for imbalance learning. Neurocomputing 2013, 101, 229–242. [Google Scholar] [CrossRef]
  24. Zhang, K.; Luo, M. Outlier-robust extreme learning machine for regression problems. Neurocomputing 2015, 151, 1519–1527. [Google Scholar] [CrossRef]
  25. Sun, W.; Liu, M.H. Wind speed forecasting using FEEMD echo state networks with RELM in Hebei, China. Energy Convers. Manag. 2016, 114, 197–208. [Google Scholar] [CrossRef]
  26. Deng, W.; Zheng, Q.; Chen, L. Regularized extreme learning machine. In Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining, Nashville, TN, USA, 30 March–2 April 2009; pp. 389–395.
  27. Martínez-Martínez, J.M.; Escandell-Montero, P.; Soria-Olivas, E.; Martín-Guerrero, J.D.; Magdalena-Benedito, R.; Gómez-Sanchis, J. Regularized extreme learning machine for regression problems. Neurocomputing 2011, 74, 3716–3721. [Google Scholar] [CrossRef]
  28. Meng, A.B.; Ge, J.F.; Yin, H.; Chen, S.Z. Wind speed forecasting based on wavelet packet decomposition and artificial neural networks trained by crisscross optimization algorithm. Energy Convers. Manag. 2016, 114, 75–88. [Google Scholar] [CrossRef]
  29. Fei, S.W. A hybrid model of EMD and multiple-kernel RVR algorithm for wind speed prediction. Int. J. Electr. Power Energy Syst. 2016, 78, 910–915. [Google Scholar] [CrossRef]
  30. Wang, S.X.; Zhang, N.; Wu, L.; Wang, Y.M. Wind speed forecasting based on the hybrid ensemble empirical mode decomposition and GA-BP neural network method. Renew. Energy 2016, 94, 629–636. [Google Scholar] [CrossRef]
  31. Hu, J.M.; Wang, J.Z. Short-term wind speed prediction using empirical wavelet transform and Gaussian process regression. Energy 2015, 93, 1456–1466. [Google Scholar] [CrossRef]
  32. Hu, J.M.; Wang, J.Z.; Ma, K.L. A hybrid technique for short-term wind speed prediction. Energy 2015, 81, 563–574. [Google Scholar] [CrossRef]
  33. Dragomiretskiy, K.; Zosso, D. Variational mode decomposition. IEEE Trans. Signal Process. 2014, 62, 531–544. [Google Scholar] [CrossRef]
  34. Lahmiri, S. Intraday stock price forecasting based on variational mode decomposition. J. Comput. Sci. 2016, 12, 23–27. [Google Scholar] [CrossRef]
  35. Sun, G.Q.; Chen, T.; Wei, Z.N.; Sun, Y.H.; Zang, H.X.; Chen, S. A carbon price forecasting model based on variational mode decomposition and spiking neural networks. Energies 2016, 9, 54. [Google Scholar] [CrossRef]
  36. Wang, Y.X.; Markert, R. Filter bank property of variational mode decomposition and its applications. Signal Process. 2016, 120, 509–521. [Google Scholar] [CrossRef]
  37. Zhang, Y.C.; Liu, K.P.; Qin, L.; An, X.L. Deterministic and probabilistic interval prediction for short-term wind power generation based on variational mode decomposition and machine learning methods. Energy Convers. Manag. 2016, 112, 208–219. [Google Scholar] [CrossRef]
  38. Guo, Z.H.; Zhao, W.G.; Lu, H.Y.; Wang, J.Z. Multi-step forecasting for wind speed using a modified EMD-based artificial neural network model. Renew. Energy 2012, 37, 241–249. [Google Scholar] [CrossRef]
  39. Hu, Q.H.; Zhang, S.G.; Xie, Z.X.; Mi, J.S.; Wan, J. Noise model based v-support vector regression with its application to short-term wind speed forecasting. Neural Netw. 2014, 57, 1–11. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Structure of the new hybrid wind speed forecasting model.
Figure 1. Structure of the new hybrid wind speed forecasting model.
Energies 09 00989 g001
Figure 2. Structure of the extreme learning machine (ELM).
Figure 2. Structure of the extreme learning machine (ELM).
Energies 09 00989 g002
Figure 3. Wind speed series used to verify the effectiveness of the new approach (from 18 February 2004 to 20 February 2004).
Figure 3. Wind speed series used to verify the effectiveness of the new approach (from 18 February 2004 to 20 February 2004).
Energies 09 00989 g003
Figure 4. Decomposition results of variational mode decomposition (VMD).
Figure 4. Decomposition results of variational mode decomposition (VMD).
Energies 09 00989 g004
Figure 5. Decomposition results of empirical mode decomposition (EMD).
Figure 5. Decomposition results of empirical mode decomposition (EMD).
Energies 09 00989 g005
Figure 6. PACFs of the original data and the IMFs extracted from VMD.
Figure 6. PACFs of the original data and the IMFs extracted from VMD.
Energies 09 00989 g006
Figure 7. Forecasting results on 20 February 2004.
Figure 7. Forecasting results on 20 February 2004.
Energies 09 00989 g007
Figure 8. Forecasting results on 20 May 2004.
Figure 8. Forecasting results on 20 May 2004.
Energies 09 00989 g008
Figure 9. Forecasting results on 20 August 2004.
Figure 9. Forecasting results on 20 August 2004.
Energies 09 00989 g009
Figure 10. Forecasting results on 20 November 2004.
Figure 10. Forecasting results on 20 November 2004.
Energies 09 00989 g010
Figure 11. Forecasting results on 20 February 2004, using different methods.
Figure 11. Forecasting results on 20 February 2004, using different methods.
Energies 09 00989 g011
Figure 12. Forecasting results on 20 May 2004, using different methods.
Figure 12. Forecasting results on 20 May 2004, using different methods.
Energies 09 00989 g012
Figure 13. Forecasting results on 20 August 2004, using different methods.
Figure 13. Forecasting results on 20 August 2004, using different methods.
Energies 09 00989 g013
Figure 14. Forecasting results on 20 November 2004, using different methods.
Figure 14. Forecasting results on 20 November 2004, using different methods.
Energies 09 00989 g014
Figure 15. Forecasting results from 20–24 November 2004, using different methods.
Figure 15. Forecasting results from 20–24 November 2004, using different methods.
Energies 09 00989 g015
Table 1. Center frequency corresponding to different mode number, k.
Table 1. Center frequency corresponding to different mode number, k.
kNormalized Center FrequencyΔf
IMF1IMF2IMF3IMF4IMF5IMF6IMF7IMF8IMF9IMF10
10.0011----------
20.00110.2249--------0.2238
30.00110.05360.2254-------0.1718
40.00110.05350.22540.3867------0.1613
50.00110.05330.22450.28670.3902-----0.1035
60.00110.01850.06820.22480.28730.4412----0.1584
70.00110.01850.06810.22480.28690.38450.4443---0.0598
80.00110.01810.06400.14200.22600.28780.38500.4446--0.0596
90.00110.01810.06390.14180.22590.28640.32500.38680.4449-0.0581
100.00110.01810.05480.09070.15980.22650.28690.32600.38710.44500.0579
Table 2. Mean error of one-step-ahead forecasting using the new method with different k values.
Table 2. Mean error of one-step-ahead forecasting using the new method with different k values.
kMean Error
MAPE (%)RMSE (m/s)
120.76290.4742
215.88090.3963
314.73690.3545
413.06980.3154
511.93130.2968
611.32990.2851
711.09960.2810
811.14320.2832
911.64130.2913
1011.93110.3031
Table 3. Input variables determined by PACF.
Table 3. Input variables determined by PACF.
Time SeriesInput Variables
Original x t 1 , x t 4
IMF1 x t 1 , x t 2 , x t 3 , x t 4 , x t 9 , x t 10 , x t 14 , x t 18 , x t 22 , x t 23
IMF2 x t 1 , x t 2 , x t 3 , x t 4 , x t 5 , x t 14 , x t 17 , x t 22 , x t 23
IMF3 x t 1 , x t 2 , x t 3 , x t 4 , x t 5 , x t 7 , x t 14 , x t 17 , x t 22 , x t 23 , x t 24
IMF4 x t 2 , x t 3 , x t 4 , x t 5 , x t 6 , x t 15 , x t 17 , x t 24
IMF5 x t 1 , x t 2 , x t 4 , x t 5 , x t 6 , x t 11 , x t 24
IMF6 x t 1 , x t 2 , x t 6 , x t 19 , x t 20
IMF7 x t 1 , x t 2 , x t 5 , x t 6 , x t 10 , x t 17 , x t 17 , x t 20 , x t 23
Table 4. Forecasting accuracy with different characters of WRELM.
Table 4. Forecasting accuracy with different characters of WRELM.
Activation FunctionMAPE(%) of Different Hidden Layer Numbers
51015202530
Sine11.728010.758510.700910.695510.711010.7631
Sigmoid12.036010.761110.735710.735610.739810.7623
Triangular basis12.498210.898710.727610.713510.713010.7922
Hard-limit13.003910.766710.772710.818810.828910.7881
Radial basis11.758310.896110.921710.868310.795710.8254
Table 5. Mean forecasting error of different methods.
Table 5. Mean forecasting error of different methods.
MethodsMAPE (%)RMSE (m/s)
One-StepTwo-StepThree-StepOne-StepTwo-StepThree-Step
VMD + ELM11.801116.451218.49010.28530.48450.5854
VMD + WRELM11.451815.352317.60160.26340.47690.5536
EMD + PACF + WRELM16.635018.726220.73170.55400.88351.2644
VMD + PACF + ELM11.364415.822417.82600.27580.46910.5491
VMD + PACF + WRELM11.091214.671016.73780.25900.43780.5382
Table 6. The mean forecasting errors of different methods.
Table 6. The mean forecasting errors of different methods.
MethodsMAPE (%)RMSE (m/s)
One-StepTwo-StepThree-StepOne-StepTwo-StepThree-Step
ARIMA20.397426.664030.61910.27730.54050.6149
VMD + PACF + SVR14.976118.414320.14380.26620.46650.6093
VMD + PACF + WRELM13.666516.663518.93720.26520.45120.5943

Share and Cite

MDPI and ACS Style

Huang, N.; Yuan, C.; Cai, G.; Xing, E. Hybrid Short Term Wind Speed Forecasting Using Variational Mode Decomposition and a Weighted Regularized Extreme Learning Machine. Energies 2016, 9, 989. https://doi.org/10.3390/en9120989

AMA Style

Huang N, Yuan C, Cai G, Xing E. Hybrid Short Term Wind Speed Forecasting Using Variational Mode Decomposition and a Weighted Regularized Extreme Learning Machine. Energies. 2016; 9(12):989. https://doi.org/10.3390/en9120989

Chicago/Turabian Style

Huang, Nantian, Chong Yuan, Guowei Cai, and Enkai Xing. 2016. "Hybrid Short Term Wind Speed Forecasting Using Variational Mode Decomposition and a Weighted Regularized Extreme Learning Machine" Energies 9, no. 12: 989. https://doi.org/10.3390/en9120989

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop