1. Introduction
By accurately forecasting natural gas futures prices, it is possible not only to provide decision-making support for energy market participants [
1,
2] but also to enhance the market competitiveness of natural gas as a clean energy source, thus promoting the sustainable optimization of the energy structure and the development of a low-carbon economy [
3]. However, the prediction of natural gas futures prices presents various challenges. To effectively tackle challenges such as non-stationarity and noise in data, scholars have introduced the concept of “decomposition–integration” [
4]. This approach decomposes complex price time series into multiple simpler and more manageable subsequences [
5]. Each subsequence is then forecast independently [
6], and their predictions are integrated to obtain the final forecast value [
7]. Specifically, techniques like the completely adaptive ensemble empirical modal decomposition of noise [
8], the improved ensemble empirical modal decomposition (IEEMD) of noise [
9], and variational modal decomposition (VMD) [
10], among others, can decompose natural gas futures price time series into modal components with different frequency characteristics. The advantages of the decomposition–integration approach include the following: decomposing complex problems to enhance the prediction performance and reduce error accumulation [
11]; handling complex data issues such as nonlinearity, randomness, and high noise levels [
12]; improving the computational efficiency by processing in frequency bands to enhance the interpretability [
13]; and enhancing models’ accuracy and robustness [
14].
When constructing forecasting models [
15], various factors influencing natural gas futures prices are thoroughly considered [
16,
17], including but not limited to historical price data [
18], global economic indicators [
19], policy changes [
20], unforeseen events, and energy consumption trends. Related indicators such as volatility [
21] and market sentiment indicators [
22,
23] are also incorporated. These factors are appropriately processed to serve as input features for the model [
24], thereby enhancing its predictive ability and robustness [
25]. The advantages of incorporating influencing factors include gaining a deeper understanding of the data relationships [
26]; supporting multivariate analysis, applicable to multi-input single-output model forms [
27]; providing rich information, where how each influencing factor addresses different forecasting needs [
28,
29]; and integrating multiple methodologies [
30,
31].
Hybrid models combine advanced algorithms and tools such as long short-term memory (LSTM) [
32], autoencoders (AEs), autoregressive integrated moving average (ARIMA) [
33,
34], gated recurrent units (GRUs) [
35], convolutional neural networks (CNNs) [
36], generalized autoregressive conditional heteroskedasticity (GARCH) [
37], multilayer perceptron (MLP), random forest (RF), and others for final prediction purposes [
38,
39]. The advantages of hybrid models include enhancing models’ robustness and stability against noise [
40], offering flexible combinations with multiple choices [
41], conducting multidimensional analyses to overcome the limitations of single models, and being widely applicable to address complex issues [
42].
To address frequent unexpected events in the energy market, such as global pandemics, geopolitical conflicts [
43], and natural disasters, models incorporate dynamic adjustment mechanisms and anomaly detection algorithms to capture the real-time impacts of these events on prices and make corresponding adjustments, thereby enhancing the model’s predictive capabilities under extreme conditions [
44]. To comprehensively reflect the dynamic changes in natural gas futures prices, scholars typically categorize the input features based on their varying degrees of influence on the prices into input feature sets. These include, but are not limited to, historical prices [
45], trading volumes, open interest [
46], macroeconomic indicators [
47], seasonal factors, and market sentiment indices extracted through natural language processing [
48]. During model training and testing [
49], factors such as the prediction periods, time steps [
50], and sequence length segmentation ratios also influence the prediction outcomes. Adjusting these periods or steps can aid in the study of the impact of different prediction windows on model performance to find optimal prediction strategies [
51].
To objectively assess model performance, new models are typically compared with traditional models in terms of predictive performance [
52]. A comparative analysis using evaluation metrics highlights the significant advantages of the new model in terms of prediction accuracy, stability, and generalization capabilities [
53]. Comparing different combinations of models and experimental results helps to validate the accuracy of the proposed model, identifies deficiencies in the control group models [
54], optimizes innovative settings [
55], and promotes interdisciplinary integration.
Table 1 compares the different methods used in the studies cited in this paper for the forecasting of futures prices; these are divided into decomposition and modeling categories, with the decomposition category mainly used to split the price series and enrich the data features; the modeling category is mainly applicable when learning the patterns of price changes and is used for forecasting.
The VMD-GRU/AE/MLP-RF hybrid model proposed in this study is significantly innovative in time series forecasting. The research framework is shown in
Figure 1. The model adopts the “decomposition–selection–forecasting–integration” framework, which not only includes the traditional decomposition and forecasting steps but also introduces a grouping strategy based on the correlation coefficients and model integration to systematically deal with the input variables. Through VMD decomposition, the time series is split into multiple subseries with different frequencies to reduce the complexity. The correlation coefficient grouping strategy quickly filters important variables in the preprocessing stage, reduces the input dimensions, and is more efficient in computation compared to the attention mechanism and SHAP values, making it suitable for large-scale data. In terms of model combination, the pairing of the GRU, AE, and MLP gives full play to the advantages of each model: the GRU efficiently captures the long-term dependence of the time series and has a concise structure; the AE reduces the dimensionality and extracts the key features to remove noise; and the MLP performs nonlinear combination to enhance the expressive ability. This combination ensures a balance between computational efficiency and performance, satisfying the requirement for low computational complexity and small data volumes, being suitable for long-series prediction. Finally, the prediction results are integrated via random forest to further improve the model’s robustness and accuracy. The experiments verify the effectiveness of the model and prove that it has significant advantages in terms of accuracy and robustness [
56].
This article is structured as follows.
Section 2 introduces the theoretical framework and the proposed models used in the study.
Section 3 covers the data preprocessing, introduces the evaluation metrics, and discusses the grouping of the input variables.
Section 4 presents the experimental correlations and compares the results.
Section 5 concludes the study.
2. Methodology
2.1. Variational Mode Decomposition
Variational mode decomposition is a signal processing method. It decomposes complex signals into multiple intrinsic mode functions (IMFs), each with specific frequency characteristics and stability. The formula is given by (
1), where
represents the instantaneous amplitude and
represents the instantaneous phase.
2.2. Pearson Correlation Coefficient
The Pearson correlation coefficient is a widely used statistical measure in statistics, assessing the linear correlation between two variables
X and
Y. It is represented by (
2). The coefficient ranges within [−1, 1], where a value closer to 1 indicates a strong positive correlation between the variables, a value closer to 0 indicates a weak correlation, and a value closer to −1 indicates a strong negative correlation.
2.3. Gated Recurrent Unit
The gated recurrent unit (GRU) is a popular type of recurrent neural network architecture. It retains much of LSTM’s effectiveness while reducing the model complexity and computational burden. The GRU consists of two gate units, the update gate and the reset gate, defined by (
3) and (
4), respectively. Here,
W and
b denote the weights and biases of the gates,
represents the sigmoid activation function,
is the previous hidden state,
is the current input, and
denotes vector concatenation.
2.4. Autoencoder
An autoencoder (AE) is a type of neural network model; it is used primarily for unsupervised or semi-supervised learning tasks such as data dimensionality reduction, compression, and feature extraction. The AE consists of two main components: the encoder and the decoder. The encoder maps the input data to a lower-dimensional encoded space, as shown in (
5), while the decoder attempts to reconstruct the original data from the encoded space, as shown in (
6). Here,
f and
g represent nonlinear transformation functions in the encoder and decoder, respectively.
2.5. Multilayer Perceptron
A multilayer perceptron (MLP) is a type of feedforward artificial neural network model that consists of at least three layers of nodes: an input layer, one or more hidden layers, and an output layer. In an MLP, input signals pass through the input layer, are weighted and transformed nonlinearly by neurons in the hidden layers, and finally reach the output layer to produce prediction results. The equations for the MLP are shown in (
7) and (
8), where
u denotes the weighted sum and
y represents the result.
2.6. Random Forest
Random forest (RF) is a type of machine learning model based on the ensemble learning principle. It enhances the prediction accuracy and stability by constructing multiple decision trees and combining their predictions. The formula is shown in (
9), where
represents individuals and
represents the weights corresponding to predictions.
2.7. VMD-GRU/AE/MLP-RF Hybrid Model
This study constructs a hybrid model named VMD-GRU/AE/MLP-RF.
Figure 2 illustrates the framework of the VMD-GRU/AE/MLP-RF hybrid model, with the following procedural steps.
Step 1. Data Collection. Collect raw data including daily natural gas futures: open, close, high, low prices, trading volume, amplitude, production, import volume, export volume, and storage volume. Output variable is the median price between the open and close prices.
Step 2. Decomposition. Use the VMD model to decompose price data sequences, determining the optimal number of modes. This produces 5 intrinsic mode functions (IMFs) and 1 residual (Res), with the 5 IMFs serving as new input variables.
Step 3. Feature Selection. Employ Pearson correlation coefficients to filter variables. Sort input variables by their correlation coefficients with the price in descending order.
Step 4. Grouping. Divide variables into groups based on correlation coefficient intervals. Strong correlation group: open, close, high, low prices, IMF5. Moderate correlation group: import volume, IMF4, IMF3. Weak correlation group: IMF2, IMF1, amplitude. Negative correlation group: trading volume, production, export volume, storage volume
Step 5. Prediction. Predict results for different correlation groups using different models. Use GRU, AE, MLP to predict strong, moderate, and weak correlation groups based on evaluation metrics. Generate 3 sets of prediction sub-results.
Step 6. Integration. Integrate sub-results using RF to obtain the final prediction result. Train decision trees with bootstrap sampling from the three sets of sub-results. Weight integration of sub-results is performed with weights of
a,
b,
c as defined in (
10), aimed at reducing randomness and incorporating the data characteristics of the sub-results.
5. Conclusions and Discussion
This paper proposes a VMD-GRU-AE-MLP-RF hybrid model for the forecasting of natural gas futures. The approach involves decomposing the price original series using VMD and utilizing the decomposed sub-series along with influencing factors as the input variables. By conducting a correlation analysis, the input variables are categorized into groups based on their degrees of correlation. Different prediction models—the GRU, AE, and MLP—are then applied to forecast each group according to their respective characteristics. Finally, an RF model integrates the predictions from the three groups of sub-results to generate the final forecast, effectively enhancing the prediction accuracy and reducing forecasting errors.
The innovation of this study lies in enriching the input variable information by incorporating sub-series with different frequencies and influencing factors. Furthermore, the adaptation of different prediction models to different input dimensions enhances the model flexibility. The use of the RF model to integrate diverse prediction outputs is another innovative aspect, contributing to improved forecasting performance. Additionally, we paper conducted numerous experiments to empirically analyze and validate the proposed model, and, according to the evaluation indices, all of them achieve good prediction results. For example, in the case of the RMSE, the best result in the model selection experiment was 0.1695; the best result in the stability test was 0.6328; the best result in the correlation experiment was 0.1695; the best result in the ablation experiment was 0.7792; the best result in the time span experiment was 0.6503; and the best result in the all-variable experiment was 0.6828. These experimental results effectively prove the feasibility and validity of the model.
In contrast to previous research methods, the model proposed in this study no longer relies solely on a single advantage, such as enriching data features only through serial decomposition methods, adding only factors that affect prices to aid in forecasting, or relying only on advanced models to improve the forecasting accuracy. Instead, this model employs a variety of advanced algorithms to optimize every aspect of the analytical framework.
In terms of findings, the model screening test employs eight different models to predict three groups of input variables with different degrees of correlation and selects the model that best matches the input variables through the prediction error. This method is more flexible and adaptable than the use of a single model to predict the correlation groups of the input variables. Robustness experiments were also used to comprehensively examine both in-sample and out-of-sample data, and the results showed that this multilink model prediction structure was able to output more stable results, further proving its superiority. The correlation and ablation experiments deeply analyzed the prediction results of different models for the corresponding correlation groups, and the results showed that the high correlation group had the best prediction effect due to its close correlation with the price; the prediction results for the medium correlation group mainly presented the long-term trends of the data; and the prediction results of the low correlation group reflected the short-term trends of the data. By presenting the different information contained in the prediction results in groups, the present study provides a superior perspective for an understanding of the internal logic of the prediction results. In addition, the step size experiment explored the effect of the length of the sliding time window on the prediction results, and it was found that the longer the sliding time window, the richer the historical information included, but too long a window will instead make it difficult for the model to recognize important information. This type of experiment is less common in previous studies, and it provides a new perspective for the design of the prediction model. Finally, a negative variable experiment was conducted by adding negative correlation group data to the input variables, and the results showed that deleting the negative correlation group data could significantly improve the prediction accuracy. This indicates that eliminating inappropriate input variables can optimize the prediction effect. Most of the previous research has “added” the input variables through subsequence decomposition or adding related factors, while this study further optimized the quality of the input variables through “subtracting” the input variables, which provides a direction for the improvement of prediction models.
The hybrid VMD-GRU-AE-MLP-RF model proposed in this paper shows high accuracy and flexibility in natural gas futures forecasting, but there are some limitations. First, the number of subsequences generated in the price decomposition part affects the input variable grouping. Second, changes in the weights assigned to the final forecasts of each correlation group of the input variables can also affect the final results, but the current model does not yet provide a mechanism to dynamically adjust the weights.
The future research directions include the following: firstly, relevant algorithmic mechanisms can be introduced to adapt to different data sets by assigning weights according to the prediction error; secondly, the data characteristics can be enriched by introducing more influencing factors.
Natural gas futures prediction modeling not only aids in market decision-making but also provides support for sustainable development goals. By accurately predicting the natural gas price trend, the model can provide data support for policymakers to optimize the energy structure and reduce the dependence on traditional fossil fuels. In addition, the optimization directions of the model, such as dynamic weight adjustment and multimodal data fusion, are in line with the requirements of sustainable development in terms of flexibility and adaptability. Future research could further explore the application of the model in energy transition and ecosystem protection and promote the deep integration of natural gas futures forecasting with sustainable development goals.