A Hybrid Forecast Model of EEMD-CNN-ILSTM for Crude Oil Futures Price
Abstract
:1. Introduction
- (1)
- This paper applies the signal decomposition method to time series prediction. The IMF components are integrated into low-frequency, medium-frequency, and high-frequency according to the Zero-Crossing Rate (ZCR). Thus, the number of components input into the model is fixed, which ensures the high availability of the model and solves the problem of long training time for too many input components.
- (2)
- By studying the structure and principle of RNN and LSTM models, this paper proposes ILSTM. ILSTM adds the calculation of the cell state at the previous time to the forget gate and input gate of LSTM and improves the output gate by adding important hidden state information on the basis of the original output so that the model can learn more fully from historical data.
- (3)
- This paper proposes a hybrid model of crude oil futures price prediction based on EEMD-CNN-ILSTM. The introduction of EEMD helps CNN to extract the features of different frequency signals better. Through comparative experiments, it is verified that the hybrid model based on EEMD-CNN-ILSTM for crude oil futures price prediction has the highest prediction accuracy and is superior to the other seven prediction models.
2. Related Work
3. Models
3.1. EEMD
3.2. CNN
3.3. ILSTM
3.4. EEMD-CNN-ILSTM
4. Experiment
4.1. Experimental Environment
4.2. Data Collection and Analysis
4.3. Data Preprocessing and Scaling
4.4. Constructing Time Series Data
4.5. Parameter Tuning
5. Model Comparison
6. Discussion
- (1)
- Decomposition of original data by EEMD. After EEMD decomposition, the low, middle, and high-frequency components of IMF and the residual sequence are input into CNN to extract hidden features, thus making up for the shortcoming of LSTM and ILSTM in extracting features.
- (2)
- ILSTM adds the calculation of in the input gate and the forget gate to ensure the complete learning of the historical state and introduces the crucial hidden state information in the output gate so that the model can learn more fully from historical data. However, ILSTM has the limitation of longer training time than LSTM.
7. Conclusions
- (1)
- Compared with LSTM, ILSTM is an improvement of LSTM. ILSTM adds learning of cell state at the previous time in the input gate and forget gate and in the output gate to further extract important hidden state information, further eases the problems of “gradient disappearance” and “gradient explosion” of RNN and improve the forecasting accuracy.
- (2)
- Fix the number of components after EEMD decomposition. After EEMD decomposition and reconstruction, the low, medium, and high-frequency components of IMF and the residual sequence are obtained, which ensures a fixed number of inputs, ensuring the increased availability of the model and reducing the overall prediction time.
- (3)
- A novel hybrid model based on EEMD-CNN-ILSTM for crude oil futures price prediction is proposed, which covers the shortage of a single forecasting model. By comparison, the overall evaluation results of the EEMD-CNN-ILSTM hybrid model for crude oil futures price are optimal.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Kamyk, J.; Kot-Niewiadomska, A.; Galos, K. The criticality of crude oil for energy security: A case of Poland. Energy 2021, 220, 119707. [Google Scholar] [CrossRef]
- Rehman, O.U.; Ali, Y. Optimality study of China’s crude oil imports through China pakistan economic corridor using fuzzy topsis and cost-benefit analysis. Transp. Res. Part E Logist. Transp. Rev. 2021, 148, 102246. [Google Scholar] [CrossRef]
- Mo, B.; Nie, H.; Jiang, Y. Dynamic linkages among the gold market, us dollar and crude oil market. Phys. A 2018, 491, 984–994. [Google Scholar] [CrossRef]
- Wang, K.H.; Su, C.W. Does high crude oil dependence influence Chinese military expenditure decision-making? Energy Strategy Rev. 2021, 35, 100653. [Google Scholar] [CrossRef]
- Liao, Y.; Hong, W.; Wang, W.; Wang, Y.; Chen, S. An overview of RNN-based mandarin speech recognition approaches. J. Chin. Inst. Eng. 1999, 22, 535–547. [Google Scholar] [CrossRef]
- Naz, S.; Umar, A.I.; Ahmad, R.; Ahmed, S.B.; Shirazi, S.H.; Razzak, M.I. Urdu Nasta’liq text recognition system based on multi-dimensional recurrent neural network and statistical features. Neural Comput. Appl. 2017, 28, 219–231. [Google Scholar] [CrossRef]
- Cho, K.B.; Van Merriënboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. arXiv 2014, arXiv:1406.1078. [Google Scholar]
- Rius, A.; Ruisanchez, I.; Callao, M.; Rius, F. Reliability of analytical systems: Use of control charts, time series models and recurrent neural networks (RNN). Chemom. Intell. Lab. Syst. 1998, 40, 1–18. [Google Scholar] [CrossRef]
- Elbaz, K.; Zhou, A.; Shen, S.L. Deep reinforcement learning approach to optimize the driving performance of shield tunnelling machines. Tunn. Undergr. Space Technol. 2023, 136, 105104. [Google Scholar] [CrossRef]
- Yu, L.; Wang, S.; Lai, K.K. Forecasting crude oil price with an EMD-based neural network ensemble learning paradigm. Energy Econ. 2008, 30, 2623–2635. [Google Scholar] [CrossRef]
- Chen, Y.; Liu, B.; Wang, T. Analysing and forecasting China containerized freight index with a hybrid decomposition-ensemble method based on EMD, grey wave and ARMA. Grey Syst. Theory Appl. 2021, 11, 358–371. [Google Scholar] [CrossRef]
- Shu, W.; Gao, Q. Forecasting stock price based on frequency components by EMD and neural networks. IEEE Access 2020, 8, 206388–206395. [Google Scholar] [CrossRef]
- Wu, Y.X.; Wu, Q.B.; Zhu, J.Q. Improved EEMD-based crude oil price forecasting using LSTM networks. Phys. A 2019, 516, 114–124. [Google Scholar] [CrossRef]
- Tang, L.; Wu, Y.; Yu, L. A randomized-algorithm-based decomposition-ensemble learning methodology for energy price forecasting. Energy 2018, 157, 526–538. [Google Scholar] [CrossRef]
- Tang, L.; Lv, H.; Yu, L. An EEMD-based multi-scale fuzzy entropy approach for complexity analysis in clean energy markets. Appl. Soft Comput. 2017, 56, 124–133. [Google Scholar] [CrossRef]
- Xiang, Y.; Zhuang, X.H. Application of ARIMA model in short-term prediction of international crude oil price. Adv. Mater. Res. 2013, 798, 979–982. [Google Scholar] [CrossRef]
- Hou, A.; Suardi, S. A nonparametric GARCH model of crude oil price return volatility. Energy Econ. 2011, 34, 618–626. [Google Scholar] [CrossRef]
- Ramyar, S.; Kianfar, F. Forecasting crude oil prices: A comparison between artificial neural networks and vector autoregressive models. Comput. Econ. 2017, 53, 743–761. [Google Scholar] [CrossRef]
- Murat, A.; Tokat, E. Forecasting oil price movements with crack spread futures. Energy Econ. 2008, 31, 85–90. [Google Scholar] [CrossRef]
- Lanza, A.; Manera, M.; Giovannini, M. Modeling and forecasting cointegrated relationships among heavy oil and product prices. Energy Econ. 2005, 27, 831–848. [Google Scholar] [CrossRef]
- Zhao, Y.; Li, J.; Yu, L. A deep learning ensemble approach for crude oil price forecasting. Energy Econ. 2017, 66, 9–16. [Google Scholar] [CrossRef]
- Yin, Q.; Zhang, R.; Shao, X.L. CNN and RNN mixed model for image classification. Matec Web Conf. 2019, 277, 02001. [Google Scholar] [CrossRef]
- Manowska, A.; Bluszcz, A. Forecasting crude oil consumption in Poland based on LSTM recurrent neural network. Energies 2022, 15, 4885. [Google Scholar] [CrossRef]
- Liu, B.; Chen, J.; Wang, H.; Wang, Q. Renewable energy and material supply risks: A predictive analysis based on an LSTM model. Front. Energy Res. 2020, 8, 163. [Google Scholar] [CrossRef]
- Yang, Y.; Yang, Y. Hybrid method for short-term time series forecasting based on EEMD. IEEE Access 2020, 8, 61915–61928. [Google Scholar] [CrossRef]
- Sun, J.; Di, L.; Sun, Z.; Shen, Y.; Lai, Z. County-level soybean yield prediction using deep CNN-LSTM model. Sensors 2019, 19, 4363. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Zhang, J.; Li, S. Air quality index forecast in Beijing based on CNN-LSTM multi-model. Chemosphere 2022, 308, 136180. [Google Scholar] [CrossRef]
- Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.C.; Tung, C.C.; Liu, H.H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. J. R. Soc. Interface 1998, 454, 903–995. [Google Scholar] [CrossRef]
- Wu, Z.; Huang, N.E. Ensemble empirical mode decomposition: A noise-assisted data analysis method. Adv. Adapt. Data Anal. 2009, 1, 1–41. [Google Scholar] [CrossRef]
- Ma, Z.; Wen, G.; Jiang, C. EEMD independent extraction for mixing features of rotating machinery reconstructed in phase space. Sensors 2015, 15, 8550–8569. [Google Scholar] [CrossRef] [Green Version]
- Guo, Y.; Cao, X.; Liu, B.; Peng, K. El Niño index prediction using deep learning with ensemble empirical mode decomposition. Symmetry 2020, 12, 893. [Google Scholar] [CrossRef]
- Joo, S.; Choi, J.; Kim, N.; Lee, M.C. Zero-crossing rate method as an efficient tool for combustion instability diagnosis. Exp. Therm. Fluid Sci. 2021, 123, 110340. [Google Scholar] [CrossRef]
- Xu, Y.; Xiang, Y.F.; Ma, T.X. Short-term Power Load Forecasting Method Based on EMD-CNN-LSTM Hybrid Model. J. North China Electr. Power Univ. 2022, 49, 2. [Google Scholar]
- Li, S. Prediction of Crude Oil Futures Price Based on EEMD-ARIMA-LSTM Combination Model. Master’s Thesis, Shandong University, Jinan, China, 22 May 2021. [Google Scholar]
- Adler, J.; Parmryd, I. Quantifying colocalization by correlation: The Pearson correlation coefficient is superior to the Mander’s overlap coefficient. Cytom. Part A 2010, 77A, 733–742. [Google Scholar] [CrossRef] [PubMed]
- Shen, S.L.; Elbaz, K.; Shaban, W.M.; Zhou, A. Real-time prediction of shield moving trajectory during tunnelling. Acta Geotech. 2022, 17, 1533–1549. [Google Scholar] [CrossRef]
Component | IMF1 | IMF2 | IMF3 | IMF4 | IMF5 | IMF6 | IMF7 | IMF8 |
---|---|---|---|---|---|---|---|---|
ZCR | 57.3% | 22.6% | 9.9% | 3.8% | 1.1% | 0.2% | 0% | 0% |
Module | Experiment | Component | Evaluation |
---|---|---|---|
EEMD-CNN-ILSTM | Experiment 1 | high-frequency medium-frequency low-frequency | = 0.9742 RMSE = 18.312 MAE = 13.848 |
Experiment 2 | high-frequency low-frequency residual | = 0.9789 RMSE = 16.116 MAE = 11.675 | |
Experiment 3 | high-frequency medium-frequency low-frequency residual | = 0.9858 RMSE = 11.504 MAE = 8.397 |
Data | WTI | USDX | DJIA | S&P 500 | NASDAQ | Russell 2000 |
---|---|---|---|---|---|---|
R | 0.932 | 0.558 | 0.405 | 0.356 | 0.182 | 0.317 |
Date | Open | High | Low | Settle | Close | USDX | DJIA | WTI |
---|---|---|---|---|---|---|---|---|
14/11/2022 | 680.5 | 684.6 | 661.6 | 677.1 | 665.1 | 13,096 | 33,615 | 85.235 |
15/11/2022 | 663 | 670.3 | 644.4 | 659.6 | 649.7 | 13,055 | 33,568.7 | 86.865 |
16/11/2022 | 644.7 | 648.7 | 637.2 | 641.9 | 640.2 | 13,044 | 33,600.4 | 85.334 |
17/11/2022 | 643.6 | 645.2 | 624.8 | 634.8 | 628.9 | 13,041 | 33,669.6 | 85.035 |
Model | Layer | Parameters |
---|---|---|
SVR | SVR | kernel = ‘rbf’, C = 1.0, epsilon = 0.1 |
MLP | Dense | activation = ‘sigmoid’, units = 64 batch size = 32, epochs = 50 |
LSTM | LSTM | activation = tanh, units = 64 batch size = 32, epochs = 50 |
ILSTM | ILSTM | activation = ‘tanh’, units = 64 batch size = 32, epochs = 50 |
CNN-LSTM | Conv1D | filters = 64, padding = ‘valid’, kernel_size = 2, activation = sigmoid |
MaxPooling1D | padding = ‘valid’, pool_size = 1 | |
LSTM | activation = ‘tanh’, units = 64 batch size = 32, epochs = 50 | |
CNN-ILSTM | Conv1D | filters = 64, padding = ‘valid’, kernel_size = 2, Activation = sigmoid |
MaxPooling1D | padding = ‘valid’, pool_size = 1 | |
ILSTM | activation = ‘tanh’, units = 64 batch size = 32, epochs = 50 |
Model | MAE | RMSE | |
---|---|---|---|
SVR | 37.872 | 46.741 | 0.8112 |
MLP | 26.902 | 31.747 | 0.9129 |
LSTM | 21.649 | 26.272 | 0.9403 |
ILSTM | 18.081 | 22.201 | 0.9574 |
CNN-LSTM | 16.655 | 20.822 | 0.9625 |
CNN-ILSTM | 16.032 | 20.175 | 0.9648 |
EEMD-CNN-LSTM | 10.260 | 12.817 | 0.9858 |
EEMD-CNN-ILSTM | 8.397 | 11.504 | 0.9886 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, J.; Zhang, T.; Lu, T.; Xue, Z. A Hybrid Forecast Model of EEMD-CNN-ILSTM for Crude Oil Futures Price. Electronics 2023, 12, 2521. https://doi.org/10.3390/electronics12112521
Wang J, Zhang T, Lu T, Xue Z. A Hybrid Forecast Model of EEMD-CNN-ILSTM for Crude Oil Futures Price. Electronics. 2023; 12(11):2521. https://doi.org/10.3390/electronics12112521
Chicago/Turabian StyleWang, Jingyang, Tianhu Zhang, Tong Lu, and Zhihong Xue. 2023. "A Hybrid Forecast Model of EEMD-CNN-ILSTM for Crude Oil Futures Price" Electronics 12, no. 11: 2521. https://doi.org/10.3390/electronics12112521
APA StyleWang, J., Zhang, T., Lu, T., & Xue, Z. (2023). A Hybrid Forecast Model of EEMD-CNN-ILSTM for Crude Oil Futures Price. Electronics, 12(11), 2521. https://doi.org/10.3390/electronics12112521