Skip to Content
EntropyEntropy
  • Article
  • Open Access

12 July 2023

A New Forecasting Approach for Oil Price Using the Recursive Decomposition–Reconstruction–Ensemble Method with Complexity Traits

,
and
1
School of Economics and Management, Beijing Jiaotong University, Beijing 100044, China
2
Beijing Laboratory of National Economic Security Early-Warning Engineering, Beijing Jiaotong University, Beijing 100044, China
3
National Academy of Economic Security, Beijing Jiaotong University, Beijing 100044, China
4
Department of Mathematics, Beijing Institute of Petrochemical Technology, Beijing 100044, China
This article belongs to the Special Issue Complex Systems Approach to Social Dynamics

Abstract

The subject of oil price forecasting has obtained an incredible amount of interest from academics and policymakers in recent years due to the widespread impact that it has on various economic fields and markets. Thus, a novel method based on decomposition–reconstruction–ensemble for crude oil price forecasting is proposed. Based on the Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) technique, in this paper we construct a recursive CEEMDAN decomposition–reconstruction–ensemble model considering the complexity traits of crude oil data. In this model, the steps of mode reconstruction, component prediction, and ensemble prediction are driven by complexity traits. For illustration and verification purposes, the West Texas Intermediate (WTI) and Brent crude oil spot prices are used as the sample data. The empirical result demonstrates that the proposed model has better prediction performance than the benchmark models. Thus, the proposed recursive CEEMDAN decomposition–reconstruction–ensemble model can be an effective tool to forecast oil price in the future.

1. Introduction

Crude oil, which is the world’s most important chemical raw material and strategic resource, ensures the normal operation of the national economy and people’s livelihoods, and it is a critical support for the development of the entire modern industrial society. Crude oil plays an important role in the global economy, political situation, and military strength of various countries as a basic energy source. As a result, changes in crude oil prices have sparked widespread concern worldwide. Because of the interactive impact of various factors such as the global economy, exchange rate changes, speculative behavior, and geopolitics, the oil price always exhibits non-linearity, non-stationarity, and high complexity, which poses significant challenges to crude oil price forecasting.
In the literature, various linear and nonlinear models have been used separately or in combination to make forecast (see, e.g., Buyuksahin & Ertekin [1]). Linear methods assume that a given time series is regular with no sudden movements. It becomes challenging because sudden movements with variation and extreme values are normal in many real-world time series such as financial data and renewable energy data (see, e.g., Xu et al. [2]). Numerous nonlinear time series prediction methods (see, e.g., Kantz & Schreiber [3]) have been proposed in the literature to capture these nonlinearities. Conventional linear methods can better approximate time series with no high volatility and multicollinearity. Zhang et al. [4] and Elman [5] show that nonlinear methods have the advantages when modeling a complex structure in time series with high accuracy. No universal model is suitable for all circumstances because each type of method outperforms others in different domains. Individually capturing general patterns in the time series data using only one linear or nonlinear model appears to be difficult (see, e.g., Khashei & Bijari [6]). To overcome this limitation, Taskaya & Casey [7] proposed hybrid techniques with both linear and nonlinear models. The hybrid methodology is a synthesis of various prediction methods. It is usually a combination of traditional econometric models and AI algorithms (see, e.g., Wang et al. [8]) or a combination of different econometric models or AI algorithms.
In addition to the hybrid methodology, the ensemble learning algorithm is an important paradigm to overcome the limitations of single methods. Both hybrid methodology and the ensemble method consider the shortcomings of single models. With the divide-and-conquer strategy (see, e.g., Yu et al. [9] and Dong et al. [10]), the decomposition–ensemble learning methods are an important branch of ensemble learning paradigms. Because it will take a lot of time to make individual prediction from all decomposed components, the number of decomposed components is necessarily reduced. Yu et al. [11] first proposed a decomposition–ensemble model with a reconstruction step that considered some data characteristics. Recently, Yu & Ma [12] introduced a memory-trait-driven reconstruction method into the decomposition and ensemble framework. Inspired by their work, a new model based on decomposition–ensemble learning with a reconstruction step that considers the data complexity traits is used to explore the price predictions of crude oil. In this model, all steps of mode reconstruction, component prediction, and ensemble prediction are driven by complexity traits. First, a decomposition–ensemble approach is used to decompose the oil price time series. Second, the complexity of these decomposed components are separately computed. Then, each component can be identified based on its complexity ranking from high to low. Different components are predicted through appropriated models. Finally, the forecasting for different components can be aggregated to produce the final prediction output. The contributions of the article are as follows:
  • A novel decomposition–reconstruction–ensemble method is proposed with clustering capability to capture the inner complexity traits. The performance of the proposed recursive CEEMDAN for different complexity traits of data is tested and validated using popular single models and several decomposition–reconstruction–ensemble models.
  • The proposed recursive CEEMDAN technique is used to improve the performance of the CEEMDAN decomposition method by recursively decomposing the rapidly fluctuating components into less volatile sub-components.
  • In the proposed recursive CEEMDAN decomposition–reconstruction–ensemble forecasting methodology, the reconstruction method, prediction method, and ensemble method are determined by the complexity traits of the crude oil data themselves.
The remainder of this paper is organized as follows. Section 2 considers a comparison to the related works. Research data and the decomposition–reconstruction–ensemble method are discussed in Section 3. Section 4 presents the error measures to validate the prediction models. Some main findings are illustrated by comparing the results of the proposed model to the benchmark models. The prediction performance of the proposed model is further discussed in Section 5. Section 6 summarizes this paper and provides the improvement direction of future research.

3. Methodology

3.1. Recursively Decomposition Method

In this paper, we propose a recursive CEEMDAN-based technique for time series forecasting, which attempts to extract more stable sub-components from rapidly changing components to improve the prediction accuracy. The architecture of the proposed method is given in Figure 1.
Figure 1. General framework of the recursive CEEMDAN decomposition–reconstruction–ensemble methodology.
The proposed method recursively calls the CEEMDAN decomposition technique (see, e.g., Torres et al. [35]) for each component until it satisfies one of the following two conditions:
  • The component becomes less complex than the given series.
  • The correlation between the component and the given series exceeds a specified threshold.
The first condition takes into account the sample entropy values of each component. According to the methods proposed by Richman & Moorman [36], the sample entropy value is greater for more complicated components. Therefore, the more complicated components are decomposed again into their own sub-components via CEEMDAN in the algorithm.
The second condition employs Pearson correlation (see, e.g., Hauke & Kossowski [37]) to determine the similarity between the specified component and the series. High correlation is a termination criterion for this recursive method. Recursive decomposition is halted if a sub-component is substantially connected with its higher component regardless of its fluctuation rate.
The pseudo-code of the proposed method is indicated in Algorithm 1. After the given data have been decomposed into components via CEEMDAN at line 1, all components are called in the loop at line 2 where each component is analyzed. At line 3, a sample entropy value of each component and the input data, which is the first condition of the proposed algorithm, is calculated using the sample entropy (SE) method (see, e.g., Richman & Moorman [36]). At line 4, the correlation results are obtained. At line 5, the second condition of the proposed algorithm which measures the correlation between upper component and the sub-component is calculated. If the sub-component has a higher sample entropy value than its upper component and their correlation is low, the given component must be re-decomposed by CEEMDAN. In this case, the Recursive Method function is called again for this sub-component at line 7. Otherwise, no more decomposition is required for the sub-component. At line 8, the ultimate decomposition result is formed.
Algorithm 1: Recursive CEEMDAN-based decomposition method
Input: the original time series data
Output: the decomposed time series
1: Decompose the input time series data with the method CEEMDAN (different IMFs are obtained).
2: for each component imf in IMFs do
3: Compute each component’s and the input data’s sample entropy i m f S E , i n p u t d a t a S E .
4: Derive the correlation between the selected component imf and the input data i m f c o r r .
5: if  i m f S E < i n p u t d a t a S E and i m f c o r r <0.9
then
6: Output the selected component imf.
else
7: Go to step 1 and decompose the selected imf.
8: return The decomposed imfs and their sub-components.
Then, based on the recursive CEEMDAN algorithm, different decomposed components of the original data and their sub-components are obtained. The decomposed components are identified as low-complexity components when they have smaller complexity traits than the original time series after the first decomposition. The decomposed components with larger complexity traits than the original time series will be recognized as high-complexity components when they are recursively decomposed only once. Then, other decomposed components are recognized as medium-complexity components, which implies that these components have larger complexity traits than the original time series and they will be recursively decomposed two or more times.

3.2. Performance Evaluation Criteria

To verify the validity of a forecast, the model outcomes are assessed. Numerous experiments are conducted to evaluate the forecasting performance of the proposed hybrid model and the reference models. In this paper, we use three popular accuracy measures with the following corresponding definitions:
MAE = 1 N t = 1 N d t O t ,
RMSE = 1 N t = 1 N ( d t O t ) 2 ,
MAPE = 1 N t = 1 N d t O t d t ,
where d t and O t are the real and predicted values at time t ( t = 1 , 2 , , N ); N is the number of samples in the testing data set; and d t ^ and O t ^ are the average values of the actual value and predicted value, respectively.
In addition, a Diebold–Mariano (DM) test (see, e.g., Yu et al. [38]) is chosen to prove the superiority of the proposed model. Furthermore, popular single models and several decomposition–reconstruction–ensemble models are built as benchmark models to test the effectiveness of the proposed model. In detail, ES is constructed as the single benchmark model for the traditional econometric model. For AI models, SVR, ELM, and ANN are developed as single benchmark models. As a benchmark model for decomposition–reconstruction–ensemble models, four similar decomposition–reconstruction–ensemble frameworks with different basic prediction models are built.

4. Empirical Result

4.1. Research Data

In this paper, the weekly WTI and Brent crude oil spot price from the US Energy Information Administration (EIA) (http://www.eia.doe.gov/ (accessed on 11 August 2022)) were selected as sample data. The sampling period was from 1 January 2010 to 31 December 2021, and there are 627 observations in total. The training set accounts for 70% of the total sample size, which includes 418 observations, and the test set accounts for 30% of the total sample size, which includes 209 observations. The test data set is used to evaluate how well the proposed model performed compared to the benchmark models.
Table 1 displays these initial crude oil price series with their statistical measurements, which include the minima, maxima, means, and standard deviations. We find that the rejection of the null hypothesis of Gaussian distribution results from the Anderson and Darling test, which is confirmed by the time series data with nonzero skewness and positive excess kurtosis. Overall, the chosen observations are not stationary, and the model construction should consider necessary data preprocessing.
Table 1. Descriptive statistics of crude oil spot price data.

4.2. Experimental Result Analysis

First, the original time series of WTI and Brent crude oil prices are decomposed by CEEMDAN, as shown in Figure 2 and Figure 3. In particular, the price series of WTI and Brent crude oil are decomposed into 8 IMF components and one residual term. Each of the intrinsic mode functions can be categorized into high and low frequencies, with each component showcasing unique characteristics. The decomposition analysis reveals that the residue component exhibits noteworthy long-term trends, while sub-components 1 to 8 are stationary or nearly stationary, as illustrated in Figure 2 and Figure 3. However, the effectiveness of the decomposition process in improving crude oil price forecasting performance remains an open topic for further discussion in subsequent sections.
Figure 2. CEEMDAN decomposition results of the WTI crude oil prices (dollars per barrel).
Figure 3. CEEMDAN decomposition results of the Brent crude oil prices (dollars per barrel).
In the second step, component reconstruction is performed to reduce the computational time complexity. According to Table 2 and Table 3, different decomposed modes have different degrees of complexity, and the complexity traits of each decomposed mode show a downward trend with an increasing time scale. Subsequently, based on the recursive CEEMDAN algorithm, all components are recognized as high-complexity components, medium-complexity components, and low-complexity components. More concretely, IMFs and residual components are identified as low-complexity when they have smaller complexity traits than the original time series after the first decomposition. The IMFs with larger complexity traits than the original time series will be recognized as high-complexity components when they are recursively decomposed with only one step. Then, other IMFs are recognized as medium-complexity components. These components have larger complexity traits than the original time series, and they will be recursively decomposed with two or more steps. Table 2 and Table 3 report the test results of the complexity traits for each decomposed component of WTI and Brent crude oil prices, respectively.
Table 2. Complexity test results of the IMF components for WTI crude oil.
Table 3. Complexity test results of the IMF components for Brent crude oil.
Next, it is necessary to select a suitable method to predict different components. According to the complexity test results, nine components are reduced into six components after the reconstruction. In addition, the complexity traits of the decomposition components will change when the components change. Based on the reconstruction method, three kinds of components with different degrees of complexity, namely the high-complexity component, medium-complexity component, and low-complexity component, can be obtained. Then, the selection of suitable predictive methods driven by complexity traits is achieved through the trial-and-error approach. Table 4, Table 5, Table 6, Table 7, Table 8 and Table 9 presents the selection results for predicting different decomposed components of WTI and Brent crude oil prices with complexity traits.
Table 4. Prediction performance comparison of a high-complexity component under different prediction methods for WTI crude oil price data.
Table 5. Prediction performance comparison of a medium-complexity component under different prediction methods for WTI crude oil price data.
Table 6. Prediction performance comparison of a low-complexity component under different prediction methods for WTI crude oil price data.
Table 7. Prediction performance comparison of a high-complexity component under different prediction methods for Brent crude oil price data.
Table 8. Prediction performance comparison of a medium-complexity component under different prediction methods for Brent crude oil price data.
Table 9. Prediction performance comparison of a low-complexity component under different prediction methods for Brent crude oil price data.
Table 4, Table 5 and Table 6 show the performance value of different combination models such as X-SVR-SVR, SVR-X-SVR, and SVR-SVR-X. For example, in the X-SVR-SVR model (see Table 4), the second and third SVR methods indicate that the medium-complexity component and low-complexity component use the SVR model, while X will try four different methods (i.e., ES, SVR, ELM, ANN) to find a suitable model for the high-complexity component. To facilitate computational convenience, the ADD is temporarily employed as an ensemble method for investigating the correlation between the memorable component and the prediction method. Based on the aforementioned explanations, Table 4 presents the experimental findings regarding the high-complexity components.
For the parameter of the ES, a simple first-order ES with a smoothing constant is chosen. The smoothing constant is determined using the principle of the minimum root mean square error. For the parameters of the SVR model, the Gaussian RBF kernel function is adopted, and the grid search method is used to set the regularization and kernel parameters. For the ELM and ANN models, the number of nodes in the hidden layer is set to 30.
Table 4, Table 5 and Table 6 illustrate that an ANN is suitable for high-complexity component forecasting, while SVR is suitable for both medium-complexity and low-complexity component forecasting. The ANN-SVR-SVR has better prediction accuracy than other model combinations for WTI crude oil price forecasting. Table 7, Table 8 and Table 9 show the experimental results of the high-complexity component, medium-complexity component, and low-complexity component, respectively. Similarly, the SVR-SVR-SVR has better prediction performance than other model combinations for Brent crude oil price forecasting according to Table 7, Table 8 and Table 9.

5. Discussion

5.1. Prediction Performance Comparison

In this part, the proposed model, four single models (i.e., ES, SVR, ELM, ANN), and four decomposition–reconstruction–ensemble models (i.e., D-R-ES, D-R-SVR, D-R-ELM, D-R-ANN), which are considered benchmark models, are performed to predict the testing dataset of WTI and Brent crude oil prices. Here, “D” denotes the chosen decomposition method, and “R” denotes the proposed reconstruction rule of the component. The results are shown in Table 10, Table 11, Table 12 and Table 13. According to these results, the proposed model almost outperforms all of the considered benchmark models. The final form of the proposed model is simply the decomposition–reconstruction–ensemble model with the form of “D-R-SVR” for Brent crude oil price forecasting. Thus, the model with the form of “D-R-SVR” is not considered a target model in Table 13.
Table 10. Performance comparison of WTI crude oil spot price data.
Table 11. Performance comparison of Brent crude oil spot price data.
Table 12. Performance comparison of WTI crude oil spot price data with DM test.
Table 13. Performance comparison of Brent crude oil spot price data with DM test.
Furthermore, the decomposition–reconstruction–ensemble models make predictions better than the single models according to Table 10 and Table 11. In particular, for WTI crude oil price forecasting, the decomposition–reconstruction–ensemble models have average MAE, RMSE, and MAPE values of 1.0867, 1.6535, and 0.0345, respectively, while the single models have average MAE, RMSE, and MAPE values of 1.2334, 1.8382, and 0.0408, respectively. For the Brent crude oil price forecasting, the prediction accuracy values for the decomposition–reconstruction–ensemble models are 1.1463, 1.5782, and 0.0233, while those for the single models are 1.3429, 1.9266, and 0.0290. The main reason is that the decomposition–reconstruction–ensemble can minimize the complexity of crude oil data, which boosts its prediction performance compared to benchmark single models.
Comparing with the eight benchmark models, i.e., ES, SVR, ELM, ANN, D-R-ES, D-R-SVR, D-R-ELM, and D-R-ANN, the proposed model shows superior performance in crude oil price forecasting. In Table 10, the proposed model improves the prediction accuracy by 59.89%, 63.33%, and 61.82% on average compared to the benchmark single models and by 52.42%, 55.06%, and 53.10% on average compared to the benchmark decomposition–reconstruction–ensemble models. Then, Table 11 shows that the proposed model improves the accuracy of the Brent crude oil price forecasting by 62.01%, 65.88%, and 65.66% on average compared to the benchmark single models and by 52.96%, 51.17% and 51.47% on average compared with the benchmark decomposition–reconstruction–ensemble models. Therefore, the proposed recursive CEEMDAN decomposition–reconstruction–ensemble prediction method can effectively improve the prediction performance of WTI and Brent crude oil prices.
In addition, the DM test is used to compare the prediction performance of different models in the benchmark models in Table 12 and Table 13 to statistically prove the superiority of the proposed model for WTI and Brent crude oil price forecasting. These conclusions are statistically proven by data from the DM test, as indicated by the p-values (in brackets). First, at a significance level of 5%, the proposed model outperforms all benchmark models, which suggests that the proposed recursive CEEMDAN decomposition–reconstruction–ensemble prediction model is better than the listed benchmark models for WTI and Brent crude oil price forecasting. Second, when the decomposition–reconstruction–ensemble models in the benchmark models are tested as the target models in Table 12 and Table 13, only the D-R-SVR can be proven to be better than all single models with the significance level of 5%. Third, focusing on different decomposition–reconstruction–ensemble models in the benchmark models, although the D-R-SVR can be statistically demonstrated to be better than their D-R-based counterparts at the confidence level of 5%, it is essential to choose the appropriate prediction model for the reconstructed components with different degrees of complexity.

5.2. Further Discussion

In this section, we perform the EEMD decomposition method and two different reconstruction rules to compare the prediction performance of the proposed model. The two rules are mode reconstruction based on the threshold setting of SE (see, e.g., Zhang et al. [39]) and fine-to-coarse (FTC) (see, e.g., Yu et al. [38] and Zhang et al. [39]). Different models are performed as the benchmark models, which are denoted in the form of R-D-R-SA, where “R-D” indicates different recursive decomposition methods to be compared, “R” indicates different reconstruction rules, and “SA” represents the selected predictive methods driven by the complexity traits and simple addition for the final ensemble. Table 14 and Table 15 and Figure 4 and Figure 5 show the results of different models. Similarly, the DM test is performed to evaluate the accuracy of different prediction models, and the corresponding results are presented in Table 16. According to Table 14, Table 15 and Table 16 and Figure 4 and Figure 5, the main findings are as follows.
Table 14. Performance comparison of WTI crude oil spot price data with EEMD decomposition method and different reconstruction rules.
Table 15. Performance comparison of Brent crude oil spot price data with EEMD decomposition method and different reconstruction rules.
Figure 4. Performance comparison of different models for WTI crude oil price forecasting.
Figure 5. Performance comparison of different models for Brent crude oil price forecasting.
Table 16. DM test results across different models for WTI and Brent crude oil price forecasting.
First, as Table 14 and Table 15 show, no model can outperform other models under all indicators. Compared with the EEMD decomposition-based models, the proposed model for WTI crude oil price forecasting improves the prediction accuracy by 10.10%, 13.28%, and 11.35% on average, and the proposed model for Brent crude oil price forecasting improves the prediction accuracy by 17.27%, 21.0%, and 16.50% on average. One possible reason is that CEEMDAN minimizes the complexity of WTI and Brent crude oil price data. Thus, it can effectively filter out the meaningful components and significantly enhance the forecast accuracy.
Second, the proposed model is better than the benchmark models based on other reconstruction rules. In concrete, compared with the benchmark models with different reconstruction rules, the proposed model for WTI crude oil price forecasting improves the prediction accuracy by 3.51%, 6.75%, and 4.59% on average, and the proposed model for Brent crude oil price forecasting improves the prediction accuracy by 7.78%, 9.40%, and 7.05% on average. Table 16 also shows that the DM test at the 10% level of significance confirms the superiority of the suggested model. Thus, the WTI and Brent crude oil data can be better predicted using the proposed reconstruction approach based on the complexity trait.
Third, the proposed model has lower M A E , R M S E , and M A P E than other models based on the EEMD decomposition models and reconstruction rules from Figure 4 and Figure 5. For example, compared with different reconstruction methods in the benchmark models, the proposed model for WTI crude oil price forecasting improves the prediction accuracy by 10.89%, 16.03%, and 12.75% on average, and the proposed model for Brent crude oil price forecasting improves the prediction accuracy by 20.04%, 24.32%, and 18.84% on average. Thus, the proposed model improves the prediction performance in WTI and Brent crude oil price forecasting. Meanwhile, as shown in Table 16, when the proposed model is used as the target model, all p-values of the DM test fall below the threshold of 10%, so the proposed model has a significantly higher level of accuracy in its predictions than the benchmark models.

6. Conclusions and Future Directions

This paper proposes a new complexity-traits-driven recursively CEEMDAN decomposition–reconstruction–ensemble method for WTI and Brent crude oil price forecasting. All steps of component reconstruction for decomposed components, component prediction, and ensemble prediction are driven by the complexity traits, and the proposed method proves to be more effective than the benchmark models.
In the empirical analysis, the proposed recursive CEEMDAN decomposition– reconstruction–ensemble learning paradigm is significantly better than the most popular single models, different decomposition–reconstruction–ensemble models, and ensemble models based on the EEMD decomposition methods or different reconstruction rules. Based on the empirical experiments, four insightful conclusions can be summarized.
First, the prediction accuracy of WTI and Brent crude oil price data demonstrates that the proposed model outperforms all benchmark models. Specifically, compared with different benchmark models, the proposed model for WTI crude oil price forecasting improves the prediction accuracy by 56.16%, 59.19%, and 57.46% on average, and the proposed model for Brent crude oil price forecasting improves the prediction accuracy by 57.48%, 58.53%, and 58.56% on average. Therefore, the proposed model can be a useful tool to forecast WTI and Brent crude oil prices in the near future.
Second, CEEMDAN can achieve better prediction performance than the EEMD decom-position-based method. For example, compared with the EEMD decomposition-based models, on average, the proposed model improves the prediction accuracy by 10.10%, 13.28%, and 11.35% for WTI crude oil price forecasting and by 17.27%, 21.0%, and 16.50% for Brent crude oil price forecasting.
Third, the prediction performance of crude oil price data can be further improved by selecting appropriate prediction models for the reconstructed components with different degrees of complexity. For example, compared with the benchmark decomposition–reconstruction–ensemble models (i.e., D-R-KRR, D-R-ELM, D-R-SVR, and D-R-ANN), on average, the proposed model improves the prediction accuracy by 52.42%, 55.06%, and 53.10% for WTI crude oil price forecasting and by 52.96%, 51.17%, and 51.47% for Brent crude oil price forecasting. Therefore, it is essential to choose the appropriate prediction models according to the complexity traits.
Finally, compared with the existing reconstruction rules, the recursively decomposition-reconstruction method based on the complexity traits can reduce the modeling complexity well, which shows its usefulness and efficacy in WTI and Brent crude oil price forecasting. For example, on average, the proposed model improves the prediction accuracy by 10.89%, 16.03%, and 12.75% for WTI crude oil price forecasting and by 20.04%, 24.32%, and 18.84% for Brent crude oil price forecasting. Thus, mode reconstruction driven by complexity traits is effective.
In addition to the sample entropy used by our recursive CEEMDAN method, other time series features such as the frequency change rate and autocorrelation can be used. Future research extensions will focus on the following: (1) verifying more advanced decomposition methods under the proposed framework in this paper and (2) exploring more results in other research areas such as the stock market, power market, and other emerging markets using the proposed complexity-trait-driven reconstruction-ensemble learning paradigm.

Author Contributions

Methodology, F.W.; Software, F.W. and R.W.; Formal analysis, F.W.; Data curation, R.W.; Writing—original draft, F.W.; Writing—review & editing, R.W.; Supervision, M.L.; Funding acquisition, F.W. and M.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research is funded by the Fundamental Research Funds for the Central Universities (2021RCW116) and the R&D Program of Beijing Municipal Education Commission (grant No. KJZD20191000401). This research is also supported by Beijing Laboratory of National Economic Security Early-warning Engineering, Beijing Jiaotong University.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

http://www.eia.doe.gov/ (accessed on 11 August 2022).

Acknowledgments

We would like to thank the editors and anonymous reviewers for their valuable comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Buyuksahin, U.C.; Ertekin, S. Improving forecasting accuracy of time series data using a new ARIMA-ANN hybrid method and empirical mode decomposition. Neurocomputing 2019, 361, 151–163. [Google Scholar] [CrossRef]
  2. Xu, W.; Peng, H.; Zeng, X.; Zhou, F.; Tian, X.; Peng, X. Deep belief network-based AR model for nonlinear time series forecasting. Appl. Soft Comput. 2019, 77, 605–621. [Google Scholar] [CrossRef]
  3. Kantz, H.; Schreiber, T. Nonlinear Time Series Analysis: Contents; Cambridge University Press: Cambridge, UK, 1997. [Google Scholar]
  4. Zhang, G.; Patuwo, B.E.; Hu, M.Y. Forecasting with artificial neural networks: The state of the art. Int. J. Forecast. 1998, 14, 35–62. [Google Scholar] [CrossRef]
  5. Elman, J.L. Finding structure in time. Cogn. Sci. 1990, 14, 179–211. [Google Scholar] [CrossRef]
  6. Khashei, M.; Bijari, M. A novel hybridization of artificial neural networks and ARIMA models for time series forecasting. Appl. Soft Comput. 2011, 11, 2664–2675. [Google Scholar] [CrossRef]
  7. Taskaya, T.; Casey, M.C. A comparative study of autoregressive neural network hybrids. Neural Netw. 2005, 18, 781–789. [Google Scholar] [CrossRef]
  8. Wang, J.J.; Wang, J.Z.; Zhang, Z.G.; Guo, S.P. Stock index forecasting based on a hybrid model. Omega 2012, 40, 758–766. [Google Scholar] [CrossRef]
  9. Yu, L.A.; Wang, S.Y.; Lai, K.K. Forecasting crude oil price with an EMD-based neural network ensemble learning paradigm. Energy Econ. 2008, 30, 2623–2635. [Google Scholar] [CrossRef]
  10. Dong, J.; Dai, W.; Tang, L.; Yu, L. Why do EMD-based methods improve prediction? A multiscale complexity perspective. J. Forecast. 2019, 38, 714–731. [Google Scholar] [CrossRef]
  11. Yu, L.; Wang, Z.; Tang, L. A decomposition–ensemble model with datacharacteristic-driven reconstruction for crude oil price forecasting. Appl. Energy 2015, 156, 251–267. [Google Scholar] [CrossRef]
  12. Yu, L.; Ma, M. A memory-trait-driven decomposition–reconstruction–ensemble? learning paradigm for oil price forecasting. Appl. Soft Comput. 2021, 111, 107699. [Google Scholar] [CrossRef]
  13. Kourentzes, N.; Barrow, D.; Petropoulos, F. Another look at forecast selection and combination: Evidence from forecast pooling. Int. J. Prod. Econ. 2019, 209, 226–235. [Google Scholar] [CrossRef]
  14. Guo, J. Oil price forecast using deep learning and ARIMA. In Proceedings of the 2019 International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI), Taiyuan, China, 8–10 November 2019; pp. 241–247. [Google Scholar]
  15. Zhang, J.L.; Zhang, Y.J.; Zhang, L. A novel hybrid method for crude oil price forecasting. Energy Econ. 2015, 49, 649–659. [Google Scholar] [CrossRef]
  16. Isah, N.; Bon, A.T. Application of Markov model in crude oil price forecasting. Traektoria Nauk. 2017, 3, 1007–1012. [Google Scholar] [CrossRef]
  17. Mirmirani, S.; Li, H. A Comparison of VAR and Neural Networks with Genetic Algorithm in Forecasting Price of oil Applications of Artificial Intelligence in Finance and Economics; Emerald Group Publishing Limited: Bingley, UK, 2004; pp. 203–223. [Google Scholar]
  18. Zolfaghari, M.; Gholami, S. A hybrid approach of adaptive wavelet transform, long short-term memory and ARIMA-GARCH family models for the stock index prediction. Expert Syst. Appl. 2021, 182, 115149. [Google Scholar] [CrossRef]
  19. Zhu, D.M.; Ching, W.K.; Elliott, R.J.; Siu, T.K.; Zhang, L.M. Hidden Markov models with threshold effects and their applications to oil price forecasting. J. Ind. Manag. Optim. 2017, 13, 757–773. [Google Scholar] [CrossRef]
  20. Drachal, K. Forecasting crude oil real prices with averaging time-varying VAR models. Resour. Policy 2021, 74, 102244. [Google Scholar] [CrossRef]
  21. Yu, L.; Zhang, X.; Wang, S. Assessing potentiality of support vector machine method in crude oil price forecasting. EURASIA J. Math. Sci. Technol. Educ. 2017, 13, 7893–7904. [Google Scholar] [CrossRef] [PubMed]
  22. Wang, J.; George, A.; Hyndman, R.J.; Wang, S. Crude oil price forecasting based on internet concern using an extreme learning machine. Int. J. Forecast. 2018, 34, 665–677. [Google Scholar] [CrossRef]
  23. Jammazi, R.; Aloui, C. Crude oil price forecasting: Experimental evidence from wavelet decomposition and neural network modeling. Energy Econ. 2012, 34, 828–841. [Google Scholar] [CrossRef]
  24. Urolagin, S.; Sharma, N.; Datta, T.K. A combined architecture of multivariate LSTM with Mahalanobis and Z-Score transformations for oil price forecasting. Energy 2021, 231, 120963–120975. [Google Scholar] [CrossRef]
  25. Li, J.; Zhu, S.; Wu, Q. Monthly crude oil spot price forecasting using variational mode decomposition. Energy Econ. 2019, 83, 240–253. [Google Scholar] [CrossRef]
  26. Wang, J.; Wang, J. Forecasting energy market indices with recurrent neural networks: Case study of crude oil price fluctuations. Energy 2016, 102, 365–374. [Google Scholar] [CrossRef]
  27. Yu, L.A.; Dai, W.; Tang, L. A novel decomposition ensemble model with extended extreme learning machine for crude oil price forecasting. Eng. Appl. Artif. Intell. 2016, 47, 110–121. [Google Scholar] [CrossRef]
  28. Cheng, F.Z.; Li, T.; Wei, Y.M.; Fan, T.J. The VEC-NAR model for short-term forecasting of oil prices. Energy Econ. 2019, 78, 656–667. [Google Scholar] [CrossRef]
  29. He, M.X.; Zhang, Y.J.; Wen, Y.D. Forecasting crude oil prices: A scaled PCA approach. Energy Econ. 2021, 97, 656–667. [Google Scholar] [CrossRef]
  30. Wang, D.H.; Fang, T.H. Forecasting crude oil prices with a WT-FNN model. Energies 2022, 15, 1955. [Google Scholar] [CrossRef]
  31. Wang, L.; Wu, J.B.; Cao, Y.; Hong, Y.R. Forecasting renewable energy stock volatility using short and long-term Markov switching GARCH-MIDAS models: Either, neither or both? Energy Econ. 2022, 111, 106056. [Google Scholar] [CrossRef]
  32. Li, X.; Shang, W.; Wang, S. Text-based crude oil price forecasting: A deep learning approach. Int. J. Forecast. 2019, 35, 1548–1560. [Google Scholar] [CrossRef]
  33. Ding, Y.S. A novel decompose-ensemble methodology with AIC-ANN approach for crude oil forecasting. Energy 2018, 154, 328–336. [Google Scholar] [CrossRef]
  34. Zheng, J.L.; Tian, Y.; Luo, J.; Hong, T. A novel hybrid method based on kernel-free support vector regression for stock indices and price forecasting. J. Oper. Res. Soc. 2022, 74, 690–702. [Google Scholar] [CrossRef]
  35. Torres, M.E.; Colominas, M.A.; Schlotthauer, G.; Flandrin, P. A complete ensemble empirical mode decomposition with adaptive noise. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 22–27 May 2011; pp. 4144–4147. [Google Scholar]
  36. Richman, J.S.; Moorman, J.R. Physiological time-series analysis using approximate entropy and sample entropy. Am. J. Physiol. Heart Circ. Physiol. 2000, 278, H2039–H2049. [Google Scholar] [CrossRef] [PubMed]
  37. Hauke, J.; Kossowski, T. Comparison of Values of Pearson’s and Spearman’s Correlation Coefficients on the Same Sets of Data. Quaest. Geogr. 2011, 30, 87–93. [Google Scholar] [CrossRef]
  38. Yu, L.A.; Liang, S.; Chen, R.; Lai, K.K. Predicting monthly biofuel production using a hybrid ensemble forecasting methodology. Int. J. Forecast. 2019, 38, 3–20. [Google Scholar] [CrossRef]
  39. Zhang, X.; Lai, K.K.; Wang, S.Y. A new approach for crude oil price analysis based on Empirical Mode Decomposition. Energy Econ. 2008, 30, 905–918. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.