Optimizing Multivariate Time Series Forecasting with Data Augmentation
Abstract
:1. Introduction
- Training GANs to generate synthetic time series data that closely mimic real-world datasets;
- Utilizing a Bidirectional Wasserstein Generative Adversarial Network (Bi-WGAN), an advanced model that integrates the features of Bidirectional LSTM with Wasserstein GANs;
- Leveraging the generated data to train deep learning models, including LSTM networks, for time series prediction;
- Comparing the performance of the forecasting models trained on a combination of real and synthetic data.
- Combining Bi-directional LSTM (Bi-LSTM) networks with the Wasserstein Generative Adversarial Network (WGAN) to enhance training and prevent mode collapse in real data distribution mapping;
- Conducting a comparative study on the use of WLoss and Bi-LSTM functions separately and in combination, along with a comparison of prediction errors in the predictive model.
2. Literature Review
2.1. Effectiveness of Deep Neural Networks in Time Series Prediction
2.2. Benefits of Bidirectional LSTM for Time Series Prediction
2.3. Data Augmentation with Generative Adversarial Networks (GANs)
2.4. Contribution of the Present Research
3. Methodology
- Input gate: The input gate controls the flow of new information into the cell state. It determines which parts of the current input are relevant and should be incorporated into the state representation.
- Forget gate: The forget gate controls the flow of old information out of the cell state. It decides which parts of the previous state should be retained or discarded.
- Cell candidate gate: The cell candidate gate proposes the new information that should be added to the cell state. It generates a candidate state vector that potentially contains new information from the current input.
- Output gate: The output gate controls the output of the LSTM cell. It determines which parts of the cell state are relevant and should be passed on as the next hidden state.
4. Implementation
- Data Collection (Step 1):
- Data Preprocessing (Step 2):
- Data Augmentation (Step 3):
- Improving the LSTM Model (Step 4):
- Model Tuning (Step 5):
- Comparison (Step 6):
- Real-World Testing (Step 7):
4.1. Data Collection
4.2. Data Processing
5. Results and Discussion
6. Discussion
7. Conclusions
Author Contributions
Funding
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Agga, Ali, Ahmed Abbou, Moussa Labbadi, and Yassine El Houm. 2021. Short-term self consumption PV plant power production forecasts based on hybrid CNN-LSTM, ConvLSTM models. Renewable Energy 177: 101–12. [Google Scholar] [CrossRef]
- Aldweesh, Arwa, Abdelouahid Derhab, and Ahmed Z. Emam. 2020. Deep learning approaches for anomaly-based intrusion detection systems: A survey, taxonomy, and open issues. Knowledge-Based Systems 189: 105124. [Google Scholar] [CrossRef]
- Arjovsky, Martín, Soumith Chintala, and Léon Bottou. 2017. Wasserstein GAN. arXiv arXiv:1701.07875. [Google Scholar]
- Bandara, Kasun, Christoph Bergmeir, and Hansika Hewamalage. 2020. LSTM-MSNet: Leveraging forecasts on sets of related time series with multiple seasonal patterns. IEEE Transactions on Neural Networks and Learning Systems 32: 1586–99. [Google Scholar] [CrossRef]
- Bi, Jing, Zexian Chen, Haitao Yuan, and Jia Zhang. 2024. Accurate water quality prediction with attention-based bidirectional LSTM and encoder–decoder. Expert Systems with Applications 238: 121807. [Google Scholar] [CrossRef]
- Brophy, Eoin, Zhengwei Wang, Qi She, and Tomas Ward. 2021. Generative adversarial networks in time series: A survey and taxonomy. arXiv arXiv:2107.11098. [Google Scholar]
- Chandra, Rohitash. 2015. Competition and collaboration in cooperative coevolution of Elman recurrent neural networks for time-series prediction. IEEE Transactions on Neural Networks and Learning Systems 26: 3123–36. [Google Scholar] [CrossRef]
- Chang, Ting-Jen, Tian-Shyug Lee, Chih-Te Yang, and Chi-Jie Lu. 2023. A ternary-frequency cryptocurrency price prediction scheme by ensemble of clustering and reconstructing intrinsic mode functions based on CEEMDAN. Expert Systems with Applications 233: 121008. [Google Scholar] [CrossRef]
- Deng, Grace, Cuize Han, Tommaso Dreossi, Clarence Lee, and David S. Matteson. 2021. IB-GAN: A Unified Approach for Multivariate Time Series Classification under Class Imbalance. arXiv arXiv:2110.07460. [Google Scholar]
- Fang, Zhen, Xu Ma, Huifeng Pan, Guangbing Yang, and Gonzalo R. Arce. 2023. Movement forecasting of financial time series based on adaptive LSTM-BN network. Expert Systems with Applications 213: 119207. [Google Scholar] [CrossRef]
- Frid-Adar, Maayan, Idit Diamant, Eyal Klang, Michal Amitai, Jacob Goldberger, and Hayit Greenspan. 2018. GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification. Neurocomputing 321: 321–31. [Google Scholar] [CrossRef]
- Gautam, Yogesh. 2022. Transfer Learning for COVID-19 cases and deaths forecast using LSTM network. ISA Transactions 124: 41–56. [Google Scholar] [CrossRef] [PubMed]
- Goodfellow, Ian, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. Advances in Neural Information Processing Systems 2: 2672–80. [Google Scholar]
- Goodfellow, Ian, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2020. Generative adversarial networks. Communications of the ACM 63: 139–44. [Google Scholar] [CrossRef]
- Graves, Alex, and Jürgen Schmidhuber. 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks 18: 602–10. [Google Scholar] [CrossRef]
- Gupta, Mehak, and Rahmatollah Beheshti. 2020. Time-series Imputation and Prediction with Bi-Directional Generative Adversarial Networks. arXiv arXiv:2009.08900. [Google Scholar]
- Karras, Tero, Timo Aila, Samuli Laine, and Jaakko Lehtinen. 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv arXiv:1710.10196. [Google Scholar]
- Koo, Eunho, and Geonwoo Kim. 2024. Centralized decomposition approach in LSTM for Bitcoin price prediction. Expert Systems with Applications 237: 121401. [Google Scholar] [CrossRef]
- Lee, Chang-Ki, Yu-Jeong Cheon, and Wook-Yeon Hwang. 2021. Studies on the GAN-based anomaly detection methods for the time series data. IEEE Access 9: 73201–15. [Google Scholar] [CrossRef]
- Lei, Tianyang, Chang Gong, Gang Chen, Mengxin Ou, Kewei Yang, and Jichao Li. 2023. A novel unsupervised framework for time series data anomaly detection via spectrum decomposition. Knowledge-Based Systems 280: 111002. [Google Scholar] [CrossRef]
- Li, Zewen, Fan Liu, Wenjie Yang, Shouheng Peng, and Jun Zhou. 2021. A survey of convolutional neural networks: Analysis, applications, and prospects. IEEE Transactions on Neural Networks and Learning Systems 33: 6999–7019. [Google Scholar] [CrossRef] [PubMed]
- Liu, Penghui, Jing Liu, and Kai Wu. 2020. CNN-FCM: System modeling promotes stability of deep learning in time series prediction. Knowledge-Based Systems 203: 106081. [Google Scholar] [CrossRef]
- Liu, Xiaolei, and Zi Lin. 2021. Impact of Covid-19 pandemic on electricity demand in the UK based on multivariate time series forecasting with Bidirectional Long Short Term Memory. Energy 227: 120455. [Google Scholar] [CrossRef] [PubMed]
- Liu, Yangdong, Yizhe Wang, Xiaoguang Yang, and Linan Zhang. 2017. Short-term travel time prediction by deep learning: A comparison of different LSTM-DNN models. Paper presented at the 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Yokohama, Japan, October 16–19. [Google Scholar]
- Lu, Haodong, Miao Du, Kai Qian, Xiaoming He, and Kun Wang. 2021a. GAN-based data augmentation strategy for sensor anomaly detection in industrial robots. IEEE Sensors Journal 22: 17464–74. [Google Scholar] [CrossRef]
- Lu, Wenjie, Jiazheng Li, Jingyang Wang, and Lele Qin. 2021b. A CNN-BiLSTM-AM method for stock price prediction. Neural Computing and Applications 33: 4741–53. [Google Scholar] [CrossRef]
- Luo, Junling, Zhongliang Zhang, Yao Fu, and Feng Rao. 2021. Time series prediction of COVID-19 transmission in America using LSTM and XGBoost algorithms. Results in Physics 27: 104462. [Google Scholar] [CrossRef]
- Ma, Changxi, Guowen Dai, and Jibiao Zhou. 2021. Short-term traffic flow prediction for urban road sections based on time series analysis and LSTM_BILSTM method. IEEE Transactions on Intelligent Transportation Systems 23: 5615–24. [Google Scholar] [CrossRef]
- Moghar, Adil, and Mhamed Hamiche. 2020. Stock market prediction using LSTM recurrent neural network. Procedia Computer Science 170: 1168–73. [Google Scholar] [CrossRef]
- Nazareth, Noella, and Yeruva Venkata Ramana Reddy. 2023. Financial applications of machine learning: A literature review. Expert Systems with Applications 219: 119640. [Google Scholar] [CrossRef]
- Nguyen, H. Du, Kim Phuc Tran, Sébastien Thomassey, and Moez Hamad. 2021. Forecasting and Anomaly Detection approaches using LSTM and LSTM Autoencoder techniques with the applications in supply chain management. International Journal of Information Management 57: 102282. [Google Scholar] [CrossRef]
- Niu, Zijian, Ke Yu, and Xiaofei Wu. 2020. LSTM-based VAE-GAN for time-series anomaly detection. Sensors 20: 3738. [Google Scholar] [CrossRef] [PubMed]
- Patel, Mohil Maheshkumar, Sudeep Tanwar, Rajesh Gupta, and Neeraj Kumar. 2020. A deep learning-based cryptocurrency price prediction scheme for financial institutions. Journal of Information Security and Applications 55: 102583. [Google Scholar] [CrossRef]
- Pfenninger, Moritz, Samuel Rikli, and Daniel Nico Bigler. 2021. Wasserstein GAN: Deep Generation Applied on Financial Time Series. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3885659 (accessed on 8 September 2024).
- Quilodrán-Casas, César, Vinicius L. S. Silva, Rossella Arcucci, Claire E. Heaney, YiKe Guo, and Christopher C. Pain. 2022. Digital twins based on bidirectional LSTM and GAN for modelling the COVID-19 pandemic. Neurocomputing 470: 11–28. [Google Scholar] [CrossRef] [PubMed]
- Siami-Namini, Sima, Neda Tavakoli, and Akbar Siami Namin. 2019. The performance of LSTM and BiLSTM in forecasting time series. Paper presented at the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, December 9–12. [Google Scholar]
- Silva, Vinicius L. S., Claire E. Heaney, Yaqi Li, and Christopher C. Pain. 2021. Data Assimilation Predictive GAN (DA-PredGAN) Applied to a Spatio-Temporal Compartmental Model in Epidemiology. Journal of Scientific Computing 94: 25. [Google Scholar] [CrossRef]
- Somu, Nivethitha, M. R. Gauthama Raman, and Krithi Ramamritham. 2021. A deep learning framework for building energy consumption forecast. Renewable and Sustainable Energy Reviews 137: 110591. [Google Scholar] [CrossRef]
- Sundaram, Shobhita, and Neha Hulkund. 2021. GAN-based Data Augmentation for Chest X-ray Classification. arXiv arXiv:2107.02970. [Google Scholar]
- Tran, Dat Thanh, Alexandros Iosifidis, Juho Kanniainen, and Moncef Gabbouj. 2018. Temporal attention-augmented bilinear network for financial time-series data analysis. IEEE Transactions on Neural Networks and Learning Systems 30: 1407–18. [Google Scholar] [CrossRef]
- Vidal, Andrés, and Werner Kristjanpoller. 2020. Gold volatility prediction using a CNN-LSTM approach. Expert Systems with Applications 157: 113481. [Google Scholar] [CrossRef]
- Wang, Fei, Zhiming Xuan, Zhao Zhen, Kangping Li, Tieqiang Wang, and Min Shi. 2020a. A day-ahead PV power forecasting method based on LSTM-RNN model and time correlation modification under partial daily pattern prediction framework. Energy Conversion and Management 212: 112766. [Google Scholar] [CrossRef]
- Wang, Heshan, Yiping Zhang, Jing Liang, and Lili Liu. 2023. DAFA-BiLSTM: Deep autoregression feature augmented bidirectional LSTM network for time series prediction. Neural Networks 157: 240–56. [Google Scholar] [CrossRef]
- Wang, Jian Qi, Yu Du, and Jing Wang. 2020b. LSTM based long-term energy consumption prediction with periodicity. Energy 197: 117197. [Google Scholar] [CrossRef]
- Wang, Ting-Chun, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, and Bryan Catanzaro. 2018. High-resolution image synthesis and semantic manipulation with conditional gans. Paper presented at the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, June 18–23. [Google Scholar]
- Wiese, Magnus, Robert Knobloch, Ralf Korn, and Peter Kretschmer. 2020. Quant GANs: Deep generation of financial time series. Quantitative Finance 20: 1419–40. [Google Scholar] [CrossRef]
- Wu, Don Chi Wai, Lei Ji, Kaijian He, and Kwok Fai Geoffrey Tso. 2021. Forecasting tourist daily arrivals with a hybrid Sarima–Lstm approach. Journal of Hospitality & Tourism Research 45: 52–67. [Google Scholar]
- Xayasouk, Thanongsak, HwaMin Lee, and Giyeol Lee. 2020. Air pollution prediction using long short-term memory (LSTM) and deep autoencoder (DAE) models. Sustainability 12: 2570. [Google Scholar] [CrossRef]
- Xu, Hongfeng, Donglin Cao, and Shaozi Li. 2022. A self-regulated generative adversarial network for stock price movement prediction based on the historical price and tweets. Knowledge-Based Systems 247: 108712. [Google Scholar] [CrossRef]
- Yadav, Anita, C. K. Jha, and Aditi Sharan. 2020. Optimizing LSTM for time series prediction in Indian stock market. Procedia Computer Science 167: 2091–100. [Google Scholar] [CrossRef]
- Yuan, Lixiang, Siyang Yu, Zhibang Yang, Mingxing Duan, and Kenli Li. 2023. A data balancing approach based on generative adversarial network. Future Generation Computer Systems 141: 768–76. [Google Scholar] [CrossRef]
- Zhang, Jianguang, Xuyang Zhang, Jianfeng Yang, Zhaoxu Wang, Yufan Zhang, Qian Ai, Zhaoyu Li, Ziru Sun, and Shuangrui Yin. 2020. Deep lstm and gan based short-term load forecasting method at the zone level. Paper presented at the 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), Fukuoka, Japan, February 19–21. [Google Scholar]
- Zhang, Luo, Peng Liu, Lei Zhao, Guizhou Wang, Wangfeng Zhang, and Jianbo Liu. 2021. Air quality predictions with a semi-supervised bidirectional LSTM neural network. Atmospheric Pollution Research 12: 328–39. [Google Scholar] [CrossRef]
- Zou, Yingchao, Lean Yu, and Kaijian He. 2023. Forecasting crude oil risk: A multiscale bidirectional generative adversarial network based approach. Expert Systems with Applications 212: 118743. [Google Scholar] [CrossRef]
Method | Acronym | Advantages | Disadvantages |
---|---|---|---|
Long Short-Term Memory | LSTM | Captures temporal dependencies | Requires large datasets |
Bidirectional LSTM | Bi-LSTM | Considers both past and future data | More complex architecture |
Kernel Convolutional Neural Network | kCNN-LSTM | Effective for feature extraction from spatio-temporal data | May require extensive computational resources |
Variational Autoencoder GAN | VAE-GAN | Effective for anomaly detection | Sensitive to hyperparameters |
Wasserstein GAN | WGAN | Addresses mode collapse issues | More complex to implement than standard GANs |
Balanced GAN | B-GAN | Useful for generating balanced datasets | May not generalize well to unseen data |
Generative Adversarial Network | GAN | Good at generating realistic data | Requires careful training and tuning |
Generator | |
---|---|
Hyperparameter | value |
Layer type | conv, FC |
Layer num | 3 |
Dropout | 0 |
Epoch | 100 |
Learning rate | 0.0005 |
Optimizer | RMSProp |
Critic | |
---|---|
Hyperparameter | value |
Layer type | LSTM, FC |
Layer num | 4 |
Dropout | 0 |
Epoch | 100 |
Learning rate | 0.0005 |
Optimizer | RMSProp |
Critic learn frequency | 5 |
Series | Global stock index (S&P) Price of Bitcoin and Ethereum cryptocurrencies Price of aluminum Global price of gold Price of gold in IRR Price of coins Iranian stock index |
Sequence length | 20 |
Period | One day |
Number of records | 740 |
Test | 148 |
Train | 592 |
10% Generated | 20% Generated | ||
---|---|---|---|
LSTM | WGAN | 0.01912 | 0.0155 |
GAN | 0.01834 | 0.02049 | |
no-GAN | 0.02211 | 0.02438 | |
Bi-LSTM | WGAN | 0.01374 | 0.01019 |
GAN | 0.01926 | 0.01007 | |
no-GAN | 0.01801 | 0.01184 |
Methods | Pros | Cons |
---|---|---|
ARMA | Simple to implement; effective for stationary time series; good for short-term forecasting | Limited to linear patterns; not suitable for non-stationary data; struggles with complex dependencies |
ARIMA | Handles non-stationary data with differencing; well suited for data with seasonality or trends | Computationally expensive; requires manual parameter tuning; assumes linearity |
RNN | Capable of handling sequential data; captures temporal dependencies | Prone to vanishing/exploding gradient issues; struggles with long-term dependencies |
LSTM | Good at capturing long-term dependencies; reduces vanishing gradient problem; suitable for time series forecasting | High computational cost; longer training time; requires a large amount of data |
Bi-LSTM | Better at capturing context from both past and future sequences; enhanced accuracy for sequential data | Increased computational complexity; higher resource demand; slower training times |
GAN | Excellent at generating synthetic data; captures complex patterns in data | Difficult to train; risk of mode collapse; requires careful tuning of hyperparameters |
Bi-WGAN | Improved stability over traditional GAN; can generate high-quality synthetic data; good for imbalanced datasets | High computational cost; complex architecture; requires a significant amount of data and tuning |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Aria, S.S.; Iranmanesh, S.H.; Hassani, H. Optimizing Multivariate Time Series Forecasting with Data Augmentation. J. Risk Financial Manag. 2024, 17, 485. https://doi.org/10.3390/jrfm17110485
Aria SS, Iranmanesh SH, Hassani H. Optimizing Multivariate Time Series Forecasting with Data Augmentation. Journal of Risk and Financial Management. 2024; 17(11):485. https://doi.org/10.3390/jrfm17110485
Chicago/Turabian StyleAria, Seyed Sina, Seyed Hossein Iranmanesh, and Hossein Hassani. 2024. "Optimizing Multivariate Time Series Forecasting with Data Augmentation" Journal of Risk and Financial Management 17, no. 11: 485. https://doi.org/10.3390/jrfm17110485