Article

Time Series Prediction of Gas Emission in Coal Mining Face Based on Optimized Variational Mode Decomposition and SSA-LSTM

College of Safety Science and Engineering, Xi’an University of Science and Technology, Xi’an 710054, China
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(19), 6454; https://doi.org/10.3390/s24196454
Submission received: 13 September 2024 / Revised: 29 September 2024 / Accepted: 2 October 2024 / Published: 6 October 2024

Abstract

Accurate prediction of gas emissions provides important guidance for the prevention and control of gas disasters. To further improve the prediction accuracy of gas emissions in the mining face, a GA-VMD-SSA-LSTM gas emission prediction model (GVSL), based on genetic algorithm (GA)-optimized variational mode decomposition (VMD) and sparrow search algorithm (SSA)-optimized long short-term memory (LSTM), is proposed using the absolute gas emission monitoring data of the 1417 working face in a coal mine in Shaanxi Province. Firstly, a VMD evaluation standard that measures the decomposition loss is proposed. Under this standard, the GA is used to find the optimal parameters of the VMD. Then, the SSA is used to optimize the key parameters of the LSTM to establish the GVSL prediction model. The model predicts each component and superimposes the component predictions to obtain the final gas emission result. The evaluation indexes of the GVSL, VMD-LSTM, SSA-LSTM, and Gaussian process regression (GPR) models are compared horizontally and vertically under three scenarios with prediction sets of 121, 94, and 57 groups. The GVSL model has the best prediction effect, with R² values of 0.95, 0.96, and 0.99, which confirms the effectiveness of the proposed GVSL model for the time series prediction of gas emission in the mining face.

1. Introduction

The accurate prediction of gas emissions has important guiding significance for preventing and controlling gas disasters [1]. Improving the accuracy of gas emission prediction is one of the important research branches of gas disaster prevention [2,3].
The static models established by traditional prediction methods, such as the mine statistics method, the fractional source prediction method, and neural network prediction [4], fail to consider that gas emission is a dynamic nonlinear system [5]. As research has deepened, these models can no longer meet the required prediction accuracy. To improve accuracy, many scholars introduced the influencing factors of gas emissions (coal seam gas content, coal thickness, layer spacing, etc.) and established multi-index gas emission prediction models [6,7,8,9]. Wang [9] analyzed the factors affecting gas emissions in the working face and then predicted the gas emissions with a PCA-PSO-ELM model; the results show that the model predicts better than the random forest and extreme learning machine models. Although this approach improves the prediction accuracy, it has two disadvantages. First, most mines cannot provide detailed data such as coal thickness and adjacent layer thickness. Second, most prediction models are unable to forecast over a long duration and on a large scale [10,11,12]. Therefore, many scholars have studied time series prediction models of gas emissions based on the gas emission data themselves [13,14,15].
At present, time series prediction models of gas emissions mostly adopt a combined prediction method based on signal decomposition, such as wavelet decomposition, empirical mode decomposition, and variational mode decomposition [16,17,18,19]. Among them, variational mode decomposition is more robust to noise than the other signal decomposition methods [20]. Some scholars have verified its superiority in the fields of power load and wind speed forecasting [21,22,23]. However, the following problems remain:
(1) The effect of variational mode decomposition mainly depends on the setting of decomposition number k and quadratic penalty factor α, but its value is often set by experience and lacks selection criteria, so it is difficult to guarantee the decomposition effect [24,25].
(2) The prediction model plays an important role in the prediction of gas emissions. Because it abstracts and extracts features from input signals layer by layer and can uncover deeper latent patterns, the deep learning model has gradually been applied to time series prediction [26]. As a deep learning model, the LSTM introduces the concept of a time sequence into the network structure, which makes it well suited to time series prediction, and it has been applied successfully to power load prediction and photovoltaic power prediction [27,28]. However, there are few studies in the field of gas emission time series prediction.
For the above problems, a GA-VMD-SSA-LSTM (GVSL) gas emission prediction model based on genetic algorithm (GA)-optimized variational mode decomposition (VMD) and sparrow search algorithm (SSA)-optimized long short-term memory (LSTM) is proposed.

2. Materials and Methods

2.1. Variational Mode Decomposition (VMD)

Unlike recursive mode decomposition methods such as EMD, LMD, and EEMD, VMD is a non-recursive and adaptive signal decomposition method proposed in 2014 [29]. Thanks to its variational formulation, it effectively avoids the endpoint effects and mode aliasing of the recursive methods [30].
Based on the concepts of Wiener filtering, the Hilbert transform and analytic signal, and frequency mixing and heterodyne demodulation, the VMD is constructed in the following steps [31]:
(1) In order to evaluate the modal bandwidth, the Hilbert transform is introduced to transform the problem into a constrained variational problem. The equation is shown in (1):
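$$\min_{\{u_k\},\{\omega_k\}} \left\{ \sum_{k} \left\| \partial_t \left[ \left( \delta(t) + \frac{j}{\pi t} \right) * u_k(t) \right] e^{-j\omega_k t} \right\|_2^2 \right\} \quad \text{s.t.} \quad \sum_{k} u_k = f(t) \tag{1}$$
In the formula, $\{u_k\}$ is the set of mode decomposition components, $\{\omega_k\}$ is the set of center frequencies corresponding to the decomposition components, k is the number of VMD decompositions, $\delta(t)$ is the Dirac delta function, $*$ denotes convolution, and $f(t)$ is the input signal to be decomposed.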
(2) By introducing the Lagrange multiplier and quadratic penalty factor, the constraint form is transformed into an unconstrained form. The equation is shown in (2):
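$$L(\{u_k\},\{\omega_k\},\lambda) := \alpha \sum_{k} \left\| \partial_t \left[ \left( \delta(t) + \frac{j}{\pi t} \right) * u_k(t) \right] e^{-j\omega_k t} \right\|_2^2 + \left\| f(t) - \sum_{k} u_k(t) \right\|_2^2 + \left\langle \lambda(t),\, f(t) - \sum_{k} u_k(t) \right\rangle \tag{2}$$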
where L is the augmented Lagrangian, λ is the Lagrangian multiplier, and α is the quadratic penalty factor.
(3) The alternating direction method of multipliers (ADMM) is introduced to find the saddle point of the augmented Lagrangian and solve the original minimization problem. The optimal solutions of uk and ωk in Equation (1) are obtained, and the calculation process is as follows, as shown in Equations (3) and (4).
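$$\hat{u}_k^{n+1}(\omega) = \frac{\hat{f}(\omega) - \sum_{i \neq k} \hat{u}_i(\omega) + \frac{\hat{\lambda}(\omega)}{2}}{1 + 2\alpha(\omega - \omega_k)^2} \tag{3}$$
$$\omega_k^{n+1} = \frac{\int_0^{\infty} \omega \left| \hat{u}_k(\omega) \right|^2 d\omega}{\int_0^{\infty} \left| \hat{u}_k(\omega) \right|^2 d\omega} \tag{4}$$
In the equations, $\hat{u}_k^{n+1}(\omega)$ is the Wiener filter of the current residual $\hat{f}(\omega) - \sum_{i \neq k} \hat{u}_i(\omega) + \hat{\lambda}(\omega)/2$, and $\omega_k^{n+1}$ is the center of gravity of the current modal power spectrum.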
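The two update rules can be illustrated with a short numerical sketch. The function below is not the authors' implementation; it assumes the spectra are sampled on a uniform grid of non-negative frequencies, replaces the integrals of Equation (4) with sums, and uses hypothetical names (`update_mode`, `freqs`, `u_hat`, `lam_hat`) chosen only for this example.

```python
import numpy as np

def update_mode(f_hat, u_hat, lam_hat, omega, k, alpha, freqs):
    """One update of mode k following Equations (3) and (4).

    f_hat, lam_hat: spectra of shape (N,); u_hat: mode spectra of shape (K, N);
    omega: (K,) center frequencies; freqs: (N,) uniform non-negative grid.
    """
    # Residual: signal minus all other modes, plus half the multiplier.
    others = u_hat.sum(axis=0) - u_hat[k]
    residual = f_hat - others + lam_hat / 2.0
    # Equation (3): Wiener filter centered on the current omega_k.
    u_k = residual / (1.0 + 2.0 * alpha * (freqs - omega[k]) ** 2)
    # Equation (4): center of gravity of the mode's power spectrum
    # (sums stand in for the integrals; the grid spacing cancels).
    power = np.abs(u_k) ** 2
    omega_k = np.sum(freqs * power) / np.sum(power)
    return u_k, omega_k
```

In a full ADMM loop this update would be applied to every mode in turn, followed by an update of the multiplier, until the modes stop changing; those outer steps are omitted here.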

2.2. Variational Mode Decomposition Based on Genetic Algorithm Optimization

Among the VMD decomposition parameters, k determines the number of mode decompositions, and α affects the fidelity and effect of the modal components. In the field of time series prediction, the selection of parameter values is mainly based on an empirical setting and spectrum analysis setting. The former lacks a theoretical basis and makes it difficult to ensure decomposition quality, while the latter is limited by the absolute gas emission data collected in this paper, which cannot be analyzed for its spectrum and, thus, makes it difficult to determine the parameter values. Therefore, the GA algorithm was introduced to optimize the parameters of k and α in VMD to ensure the decomposition effect of absolute gas emission data in VMD.
In an ideal situation, the data reconstructed by the VMD decomposition component is the same as the original data, but there is often a decomposition loss in the actual decomposition. In order to evaluate this part of the loss, the root mean square error (RMSE) is introduced, as shown in Equation (5).
$$RMSE = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left( y_i - y'_i \right)^2} \tag{5}$$
In the formula, n is the number of samples collected for the absolute gas emission, $y_i$ is the measured value of absolute gas emission, and $y'_i$ is the value of gas emission reconstructed from the VMD decomposition components.
As a method to measure the complexity of nonlinear and non-stationary signals, the sample entropy has the advantages of no need for self-matching and small error [32]. The entropy value represents the complexity of time series data. Therefore, the sample entropy is introduced to evaluate the VMD decomposition effect. The smaller the sample entropy value, the more obvious the periodicity, the less the noise interference, the lower the complexity of the time series data, and the more conducive it is to the training and learning of the gas emission prediction model.
$$SampEn(m, r) = \lim_{N \to \infty} \left\{ -\ln \left[ \frac{A^m(r)}{B^m(r)} \right] \right\} \tag{6}$$
In the formula, m is the window length of the sequence when calculating the sample entropy, r is the similarity tolerance threshold, $A^m(r)$ is the probability of two sequences matching m + 1 points under the similarity tolerance threshold r, and $B^m(r)$ is the probability of two sequences matching m points.
The RMSE and sample entropy are fused to construct the fitness function, which is expressed as Equation (7). It can not only reflect the sequence loss information after decomposition but also contain the sequence decomposition effect.
$$fitness = RMSE \cdot SampEn(m, r) \tag{7}$$
At this point, the VMD parameter selection problem is transformed into the following constrained optimization problem, and the expression is as follows (8):
$$\min_{\alpha,\,k} \; RMSE \cdot SampEn, \quad \text{s.t.} \quad \alpha \in [200, 2000], \quad k \in [3, 10] \tag{8}$$
Note: the optimization ranges of k and α follow [33].
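As a rough illustration of Equations (5) to (7), the following sketch computes a finite-sample estimate of the sample entropy and multiplies it by the reconstruction RMSE. Several choices here are assumptions rather than details given in the text: the defaults m = 2 and r = 0.2 times the standard deviation are common conventions, and the entropy is evaluated on the reconstructed series, since the source does not state which series is used. The function names are hypothetical.

```python
import numpy as np

def sample_entropy(x, m=2, r_factor=0.2):
    """Finite-sample estimate of SampEn(m, r) with r = r_factor * std(x)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    r = r_factor * np.std(x)

    def count_matches(length):
        # The same n - m templates are used for both lengths;
        # self-matches are excluded by comparing only i < j.
        templates = np.array([x[i:i + length] for i in range(n - m)])
        count = 0
        for i in range(len(templates) - 1):
            dist = np.max(np.abs(templates[i + 1:] - templates[i]), axis=1)
            count += int(np.sum(dist <= r))
        return count

    b = count_matches(m)        # pairs matching for m points
    a = count_matches(m + 1)    # pairs matching for m + 1 points
    if a == 0 or b == 0:
        return np.inf           # entropy undefined for this sample
    return -np.log(a / b)

def decomposition_fitness(y, y_rec, m=2, r_factor=0.2):
    """Equation (7): RMSE of the reconstruction times its sample entropy."""
    y = np.asarray(y, dtype=float)
    y_rec = np.asarray(y_rec, dtype=float)
    rmse = np.sqrt(np.mean((y - y_rec) ** 2))
    return rmse * sample_entropy(y_rec, m, r_factor)
```

A regular signal such as a sine wave yields a much smaller entropy than white noise under this estimate, which matches the interpretation given above that lower values indicate lower complexity.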

2.3. GA Optimizing VMD

The GA is used to solve the above constraint optimization problem. The steps are as follows:
(1) Input the absolute gas emission time sequence data to be decomposed and set the GA maximum iteration times, population size, crossover probability, and other parameters.
(2) Define the optimization dimension and define its optimization scope.
(3) Initialize the population and generate the initial population. Under the current population, VMD decomposition is performed on the absolute gas emission time series data, and the RMSE and sample entropy of the reconstructed data and the measured data are calculated. The initial best fitness value is calculated, and the initial best chromosome is recorded according to Formula (7).
(4) Iterative optimization is performed according to the maximum number of iterations, and the selection, crossover, and mutation operations are performed to calculate the fitness values of various populations and their respective chromosomes in each iteration.
(5) According to Formula (8), the optimal chromosome is selected and decoded to obtain the optimal values of α and k.
(6) The value of k in the VMD decomposition parameter is a positive integer. The round(k) rounding process is carried out to obtain the final VMD parameter optimization value.
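The steps above can be sketched as a small real-coded GA. This is only an illustrative skeleton: the operators (binary tournament selection, arithmetic crossover, uniform resampling mutation) are assumptions, since the source does not specify them, and `objective` is a hypothetical callable that, in the setting of this paper, would run VMD for a candidate (α, k) and return the fitness of Equation (7). The default population size, iteration count, and probabilities mirror the settings reported in Section 3.2.

```python
import numpy as np

def ga_minimize(objective, bounds, pop_size=10, n_iter=10,
                p_cross=0.8, p_mut=0.1, seed=0):
    """Minimize objective(individual) inside box bounds with a simple GA."""
    rng = np.random.default_rng(seed)
    low, high = np.array(bounds, dtype=float).T
    pop = rng.uniform(low, high, size=(pop_size, len(low)))
    fit = np.array([objective(ind) for ind in pop])
    best, best_fit = pop[fit.argmin()].copy(), fit.min()
    for _ in range(n_iter):
        # Selection: binary tournament between random pairs.
        idx = rng.integers(0, pop_size, size=(pop_size, 2))
        winners = np.where(fit[idx[:, 0]] < fit[idx[:, 1]], idx[:, 0], idx[:, 1])
        children = pop[winners].copy()
        # Crossover: arithmetic blend of consecutive pairs (stays in bounds).
        for i in range(0, pop_size - 1, 2):
            if rng.random() < p_cross:
                w = rng.random()
                a, b = children[i].copy(), children[i + 1].copy()
                children[i] = w * a + (1 - w) * b
                children[i + 1] = w * b + (1 - w) * a
        # Mutation: resample selected genes uniformly within their bounds.
        mask = rng.random(children.shape) < p_mut
        children[mask] = rng.uniform(low, high, size=children.shape)[mask]
        pop = children
        fit = np.array([objective(ind) for ind in pop])
        if fit.min() < best_fit:
            best, best_fit = pop[fit.argmin()].copy(), fit.min()
    return best, best_fit
```

With `bounds=[(200, 2000), (3, 10)]`, the returned vector would be decoded as `alpha = best[0]` and `k = int(round(best[1]))`, which corresponds to the rounding in step (6).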

2.4. Sparrow Search Algorithm-Optimized Long Short-Term Memory (LSTM) Network

LSTM is a type of deep learning model. Through its “gate” structure, it solves the gradient explosion and vanishing gradient problems of RNN training [34], can effectively learn long-term dependence, and is widely used in the prediction and classification of time series data. Figure 1 shows the unit structure of LSTM.
The unit structure protects and controls information through the forget gate, input gate, and output gate. The LSTM steps are as follows:
(1) Information discard: The cell output at time t − 1 and cell input at time t are read, and Equation (9) is used to complete the information discarded at time t.
$$f_t = \sigma \left( W_f \cdot [h_{t-1}, x_t] + b_f \right) \tag{9}$$
(2) Information update: Equation (10) determines which information needs to be updated by the sigmoid layer, Equation (11) determines how much new information is added to time t by the tanh layer, and the last two parts are combined by Equation (12) to complete the new cell information update.
$$i_t = \sigma \left( W_i \cdot [h_{t-1}, x_t] + b_i \right) \tag{10}$$
$$\tilde{c}_t = \tanh \left( W_c \cdot [h_{t-1}, x_t] + b_c \right) \tag{11}$$
$$C_t = f_t \cdot C_{t-1} + i_t \cdot \tilde{c}_t \tag{12}$$
(3) Information output: Equation (13) is used to determine which part of the cell information needs to be output by the sigmoid layer, the cell state is processed by the tanh layer, and the final information output is completed in conjunction with Equation (14).
$$O_t = \sigma \left( W_O \cdot [h_{t-1}, x_t] + b_O \right) \tag{13}$$
$$h_t = O_t \cdot \tanh(C_t) \tag{14}$$
In the formula, $h_{t-1}$ is the output of the previous moment, $x_t$ is the input of the current moment, $\sigma$ is the sigmoid activation function, tanh is the hyperbolic tangent activation function, $W_f$, $W_i$, $W_c$, and $W_O$ are the weight values of the different ‘gates’, and $b_f$, $b_i$, $b_c$, and $b_O$ are the bias values of the different ‘gates’.
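A single forward step through Equations (9) to (14) can be written directly in NumPy. This is a minimal sketch for illustration, not the network used in the study; the dictionary layout of the weights and the function names are assumptions made for this example, and the products in Equations (12) and (14) are taken elementwise.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One LSTM cell step; W and b are dicts keyed by 'f', 'i', 'c', 'o'."""
    z = np.concatenate([h_prev, x_t])        # [h_{t-1}, x_t]
    f_t = sigmoid(W["f"] @ z + b["f"])       # Equation (9): forget gate
    i_t = sigmoid(W["i"] @ z + b["i"])       # Equation (10): input gate
    c_tilde = np.tanh(W["c"] @ z + b["c"])   # Equation (11): candidate state
    c_t = f_t * c_prev + i_t * c_tilde       # Equation (12): new cell state
    o_t = sigmoid(W["o"] @ z + b["o"])       # Equation (13): output gate
    h_t = o_t * np.tanh(c_t)                 # Equation (14): new output
    return h_t, c_t
```

With all weights and biases set to zero, every gate evaluates to 0.5 and the candidate state to 0, so the cell state is simply halved at each step; this makes a convenient sanity check of the wiring.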

2.5. SSA Optimizing LSTM

The SSA is a new swarm intelligence optimization algorithm proposed by Xue [35], which is superior to GWO, BA, and other swarm intelligence optimization algorithms in convergence speed, robustness, and stability [36]. In order to avoid over-fitting or under-fitting of the LSTM model caused by human experience, the SSA is introduced to optimize hyperparameters such as MaxEpochs and InitialLearnRate in LSTM so as to establish the optimal gas emission prediction model. The SSA optimization LSTM steps are as follows:
(1) Input the VMD decomposition time sequence data and set the maximum number of iterations of the SSA, population number, security warning value, and other parameters.
(2) Define the optimization dimension and define its optimization scope.
(3) The population is initialized, and the fitness value corresponding to each sparrow is calculated according to Equation (15) and sorted. The initial global optimal fitness value is determined according to the sorting result, and the initial global optimal position is recorded.
$$fitness = MSE = \frac{1}{n} \sum_{i=1}^{n} \left( IMF_i - IMF'_i \right)^2 \tag{15}$$
In the formula, n is the number of IMF samples, $IMF_i$ is the modal data of the VMD decomposition, and $IMF'_i$ is the prediction data of the LSTM model.
(4) Iterative optimization is performed according to the maximum number of iterations, and the fitness values corresponding to each sparrow under each iteration are calculated and their positions recorded.
(5) Optimize the best fitness value and the best position according to Equation (16), and the best position obtained is the final optimization value of each parameter.
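$$\min \; MSE, \quad \text{s.t.} \quad \begin{cases} \text{numHiddenUnits} \in [2, 200] \\ \text{InitialLearnRate} \in [0.0001, 1] \\ \text{L2Regularization} \in [0.00001, 1] \\ \text{MaxEpochs} \in [2, 300] \end{cases} \tag{16}$$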
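The search space of Equation (16) can be made concrete with a small helper that maps a candidate sparrow position to LSTM hyperparameters. This sketch does not implement the SSA itself; it only shows how a position vector might be kept inside the bounds. Rounding `numHiddenUnits` and `MaxEpochs` to integers is an assumption, as is the function name `decode_position`.

```python
import numpy as np

# Bounds taken from Equation (16), in the order used for the position vector.
BOUNDS = {
    "numHiddenUnits": (2, 200),
    "InitialLearnRate": (1e-4, 1.0),
    "L2Regularization": (1e-5, 1.0),
    "MaxEpochs": (2, 300),
}
INTEGER_PARAMS = {"numHiddenUnits", "MaxEpochs"}  # assumed to be integer-valued

def decode_position(position):
    """Clip a 4-element position to the bounds and round integer parameters."""
    params = {}
    for value, (name, (low, high)) in zip(position, BOUNDS.items()):
        clipped = float(np.clip(value, low, high))
        params[name] = int(round(clipped)) if name in INTEGER_PARAMS else clipped
    return params
```

The fitness of a position would then be the MSE of Equation (15) obtained by training an LSTM with the decoded parameters, a step that depends on the deep learning framework and is left out here.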

2.6. Construction of GA-VMD-SSA-LSTM Prediction Model

Based on the above analysis, the process of the prediction model in Figure 2 is constructed. The specific steps are as follows:
(1) Pre-processing of the absolute gas emission time series data. Detect missing values and outliers to ensure data integrity.
(2) VMD data decomposition. Firstly, the new fitness was constructed as the evaluation standard, and then the GA optimization algorithm was used to optimize the VMD parameters k and α to obtain the optimal VMD parameter settings. In this way, the timing data are decomposed into IMF1, IMF2, …, IMFk.
(3) SSA-LSTM model prediction. Each IMF component is divided into a training set and a prediction set. The training set data are used to build the SSA-LSTM model, and the prediction set data are used to test the SSA-LSTM model and output the predicted values.
(4) According to the principle of equal weight superposition, the predicted values of each model were superimposed to obtain the final prediction results of the gas emission.
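(5) Model effect evaluation. The mean absolute error (MAE), mean absolute percentage error (MAPE), root mean square error (RMSE), and coefficient of determination (R²) were used to evaluate the prediction model, as given in Equations (17)–(20):
$$MAE = \frac{1}{n} \sum_{i=1}^{n} \left| \alpha_i - \hat{\alpha}_i \right| \tag{17}$$
$$MAPE = \frac{1}{n} \sum_{i=1}^{n} \left| \frac{\alpha_i - \hat{\alpha}_i}{\alpha_i} \right| \tag{18}$$
$$RMSE = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left( \alpha_i - \hat{\alpha}_i \right)^2} \tag{19}$$
$$R^2 = 1 - \frac{\sum_{i=1}^{n} \left( \alpha_i - \hat{\alpha}_i \right)^2}{\sum_{i=1}^{n} \left( \alpha_i - \bar{\alpha} \right)^2} \tag{20}$$
In the equations, $\alpha_i$ is the measured gas emission, $\hat{\alpha}_i$ is the predicted gas emission, $\bar{\alpha}$ is the mean of the measured values, and n is the number of samples collected.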
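The four evaluation indexes can be computed in a few lines. The sketch below follows Equations (17) to (20) as written, so MAPE is returned as a fraction rather than a percentage; the function name and the sample values in the usage note are hypothetical.

```python
import numpy as np

def evaluate(actual, predicted):
    """Return MAE, MAPE, RMSE, and R2 for measured and predicted series."""
    a = np.asarray(actual, dtype=float)
    p = np.asarray(predicted, dtype=float)
    err = a - p
    return {
        "MAE": np.mean(np.abs(err)),                                   # Eq. (17)
        "MAPE": np.mean(np.abs(err / a)),                              # Eq. (18)
        "RMSE": np.sqrt(np.mean(err ** 2)),                            # Eq. (19)
        "R2": 1.0 - np.sum(err ** 2) / np.sum((a - a.mean()) ** 2),    # Eq. (20)
    }
```

For example, `evaluate([2, 4, 6, 8], [3, 5, 7, 9])` gives an MAE and RMSE of 1 and an R² of 0.8, because the squared errors sum to 4 while the measured values have a total squared deviation of 20 around their mean.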

3. Results and Discussion

The 1417 fully mechanized mining face of a coal mine in Shaanxi is taken as the research object. The main coal seam of the 1417 working face is the 4-2 coal seam, with a thickness of 4.0~19.0 m and an average thickness of 10.0 m. The working face adopts the gas control measures of “pre-mining strata hole pre-extraction + roof directional long drilling + upper corner buried pipe extraction + air exhaust”.
The absolute gas emission data of the 1417 working face, from 26 January 2022 to 30 April 2022, were collected, as shown in Table 1.
Outliers and missing values should be detected before the time series prediction model is constructed. Data points lying more than 1.5 IQR beyond the quartiles (IQR being the interquartile range) were taken as outliers, and the box plot is drawn in Figure 3.
It can be seen from Figure 3 that there are seven outliers in the collected data, and the specific values are shown in Table 2.
For missing data, the R statement shuju[!complete.cases(shuju), ] was used to list incomplete rows, and no missing values were found.
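The same two checks can be sketched in Python. This is an illustration of the box-plot rule rather than the authors' script; the function name and the sample values in the usage note are hypothetical, and quartile conventions differ slightly between tools, so fence values may not match a given plotting package exactly.

```python
import numpy as np

def iqr_outliers(values, whisker=1.5):
    """Flag points below Q1 - whisker*IQR or above Q3 + whisker*IQR."""
    x = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(x, [25, 75])
    iqr = q3 - q1
    return (x < q1 - whisker * iqr) | (x > q3 + whisker * iqr)

def has_missing(values):
    """True if any entry is NaN (the analogue of an incomplete case)."""
    return bool(np.isnan(np.asarray(values, dtype=float)).any())
```

For a made-up series such as `[10, 11, 12, 11, 10, 12, 11, 30]`, only the final value is flagged, and `has_missing` returns `False`.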

3.1. Data Interpolation

The outliers in Table 2 were deleted and filled by interpolation. To select the best filling method, a run of emission data free of outliers (99 groups of data from 21 February 2022 to 25 March 2022) was extracted from the original data, and values were removed at random [37]. EM interpolation, mean interpolation, linear interpolation, and random forest interpolation were then applied, and their mean square errors were compared to select the best method. The mean square error of each interpolation method is shown in Table 3.
As can be seen in Table 3, linear interpolation has the highest interpolation accuracy at all six missing rates, so linear interpolation is selected. The linearly interpolated data are shown in Table 4.
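Linear interpolation of the removed points can be sketched with `numpy.interp`. The helper below assumes equally spaced samples indexed 0, 1, 2, ... and a boolean mask marking the points to be replaced; both the function name and the example values are hypothetical.

```python
import numpy as np

def fill_linear(values, mask):
    """Replace masked entries by linear interpolation between kept neighbors."""
    x = np.asarray(values, dtype=float)
    idx = np.arange(len(x))
    keep = ~np.asarray(mask, dtype=bool)
    filled = x.copy()
    # Interpolate at the masked positions from the positions that are kept.
    filled[~keep] = np.interp(idx[~keep], idx[keep], x[keep])
    return filled
```

For instance, `fill_linear([1, 2, 99, 4, 5], [False, False, True, False, False])` replaces the third value with 3, the midpoint of its two neighbors.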

3.2. VMD Decomposition of Gas Emission Data

After linear interpolation, the time series of the absolute gas emission at the 1417 working face from 26 January 2022 to 30 April 2022 comprised 283 sets of sample data. The time series plot is shown in Figure 4.
Firstly, the GA was used to optimize the VMD parameters. The GA-related parameter settings are as follows: maximum number of iterations, 10; population size, 10; crossover probability, 0.8; mutation probability, 0.1; k optimization range, [3, 10]; α optimization range, [200, 2000]. The iterative optimization curve is shown in Figure 5.
As can be seen in Figure 5, at the tenth iteration, the minimum fitness value of 0.0096 is obtained, k is 10, and α is 483.70. The VMD decomposition results are shown in Figure 6.
As can be seen in Figure 6, 10 IMF decomposition components of different frequencies were obtained after the VMD decomposition of the absolute gas emission time series data. The IMF1 component, which characterizes the trend change for the absolute gas emission, and the IMF2~IMF10 components with certain periodic characteristics are obtained by decomposition, which reduces the complexity of the original data. In order to evaluate the optimal VMD decomposition effect after GA optimization (k = 10, α = 483.70), the decomposition results of k = 3, 5, 8 (α = 2000) were compared with those of k = 10, and the comparison results are shown in Figure 7 and Table 5.
According to Figure 7, when k = 10, the coincidence between the reconstructed value curve of VMD and the actual value curve of the absolute gas emission is the best. At the sudden change point of gas emission, it is also the closest to the original data. While reducing the complexity of the data, the fluctuation information of the original data is retained. The RMSE values of k = 3, 5, 8, and 10 calculated from Table 5 are 0.87, 0.66, 0.46, and 0.13, respectively, and the data decomposition loss of k = 10 is the lowest. The VMD decomposition effect optimized by the GA is superior to the VMD decomposition effect set by the empirical value in decomposition loss and mutation data retention.

3.3. Prediction of Gas Emission

In order to verify the effectiveness of the VMD decomposition algorithm, the decomposed data were divided into training sets and prediction sets in a 4:1 ratio.
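A 4:1 split of the 283 samples can be sketched as follows. The split is assumed to be chronological, with the earliest samples used for training, which is the usual choice for time series but is not stated explicitly in the text; the function name is hypothetical.

```python
def chronological_split(series, train_ratio=0.8):
    """Split a sequence into leading training and trailing prediction parts."""
    n_train = int(len(series) * train_ratio)
    return series[:n_train], series[n_train:]
```

Applied to 283 samples, this gives 226 training samples and 57 prediction samples, which agrees with the set sizes reported for scenario 3 in Section 3.4.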
The SSA optimizes the key LSTM parameters on the training set data, and the optimized values are shown in Table 6.
The optimal LSTM model parameters were determined by the SSA optimization values in Table 6, thus completing the construction of the GVSL prediction model.
In order to verify the prediction effect of the model, the GVSL prediction model is used to predict the 57 groups of absolute gas emission data in the prediction set. The prediction results are shown in Figure 8, and the prediction errors are shown in Table 7.
It can be seen in Figure 8 and Table 7 that, for each component, the predicted value curve of the GVSL model fits the actual value curve closely. The mean absolute error of each component model ranged from 0.0047 to 0.0460 m³/min and remained at a low level. The GVSL model predicts each component well and successfully captures the changing trend of each VMD decomposition component.
The prediction results of each component of the GVSL model were superimposed to obtain the final prediction results of absolute gas emission, as shown in Figure 9.
It can be seen from Figure 9 that the curve reconstructed from the GVSL component predictions closely follows the actual value curve of absolute gas emission. The absolute error ranges from 0.0014 to 0.4895 m³/min, with a mean of 0.1156 m³/min. The relative error ranges from 0.01% to 2.46%, with a mean of 0.73%. The model predicts the trend of absolute gas emission well over the 57 sets of the prediction set.
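The equal-weight superposition and the pointwise errors discussed above can be sketched as follows. The function names and the numbers in the usage note are hypothetical; the component forecasts are assumed to be stored as rows of equal length.

```python
import numpy as np

def superimpose(component_predictions):
    """Equal-weight superposition: sum the component forecasts pointwise."""
    return np.sum(np.asarray(component_predictions, dtype=float), axis=0)

def pointwise_errors(actual, predicted):
    """Absolute error and relative error (as a fraction) at each time step."""
    actual = np.asarray(actual, dtype=float)
    abs_err = np.abs(actual - np.asarray(predicted, dtype=float))
    return abs_err, abs_err / np.abs(actual)
```

As a small made-up example, three component forecasts `[[10, 12], [0.5, -0.5], [0.1, 0.1]]` sum to `[10.6, 11.6]`; against measured values `[10, 12]` the absolute errors are 0.6 and 0.4.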

3.4. Comparative Analysis of Prediction Models

In the three scenarios (scenario 1: the sample size of the training set was 162, and the sample size of the prediction set was 121; scenario 2: the sample size of the training set was 189 groups, and that of the prediction set was 94 groups; scenario 3: the training set sample size was 226, and the prediction set sample size was 57), we compared and analyzed the prediction effects of the GVSL, VMD-LSTM, SSA-LSTM, and GPR models. The results are shown in Figure 10 and Table 8.
It can be seen in Figure 10 and Table 8 that the GVSL model has the best prediction effect, especially at the sudden change points of gas emission, where it is clearly superior to the other models. This verifies the feasibility of the model for predicting the absolute gas emission of the working face. A horizontal comparison of the MAE, MAPE, RMSE, and R² values of the GVSL, VMD-LSTM, SSA-LSTM, and GPR models shows that the GVSL model predicts better than the other three models.
The MAE, MAPE, RMSE, and R² values of the GVSL model in scenarios 1, 2, and 3 were compared vertically; the R² values were 0.95, 0.96, and 0.99, respectively. The R² in scenario 3 is the closest to 1; that is, the larger the ratio of the training set to the prediction set, the better the GVSL model performs.

4. Conclusions

(1) The interpolation accuracy of random forest interpolation, mean interpolation, EM interpolation, and linear interpolation is compared and analyzed under six types of missing rates. It is determined that linear interpolation is used to interpolate the outliers to ensure the integrity of the data structure.
(2) The optimal k and α values of VMD under the new fitness function are 10 and 483.70. A comparison of the VMD decomposition results for k = 3, 5, 8, and 10 shows that the decomposition loss for k = 10 is 0.13, lower than that of the comparison settings, so the VMD decomposition after GA optimization performs best.
(3) The prediction effects of the GVSL, VMD-LSTM, SSA-LSTM, and GPR models were compared and analyzed under three scenarios. The results show that the prediction accuracy of the GVSL model is the highest, which proves that the model can be effectively applied to the prediction of gas emission in the coal face.

Author Contributions

Conceptualization, Z.Y. and J.Z. (Jingzhao Zhang); methodology, Y.C.; software, Y.C.; validation, Y.C., Y.H. and C.Z.; formal analysis, Y.C.; investigation, J.G.; resources, J.Z. (Jingzhao Zhang); data curation, Y.C.; writing—original draft preparation, Y.C.; writing—review and editing, Z.Y.; visualization, Y.H.; supervision, J.Z. (Jinlong Zhang); project administration, Z.Y.; funding acquisition, F.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Research and Development Program of Shaanxi Province, grant numbers 2020GY-139 and 2022GY-150.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author due to privacy.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Fu, H.; Yu, X.; Lu, W. Based on Ant Colony and Particle Swarm Hybrid Algorithm and LS-SVM Gas Emission Prediction. J. Transduct. Technol. 2016, 29, 373–377.
2. Huang, W.; Tong, M.; Ren, Z. Nonlinear Combination Prediction Method of Gas Emission Based on SVM. J. China Univ. Min. Technol. 2009, 38, 234–239.
3. Fu, H.; Xie, S.; Xu, Y.; Chen, Z. Research on dynamic prediction model of coal mine gas emission based on ACC-ENN algorithm. J. China Coal Soc. 2014, 39, 1296–1301.
4. Zhang, L.; Qin, Y.; Jiang, W.; Jing, H.; Zhao, G. Research status and prospects of mine gas emission prediction methods in my country. Saf. Coal Mines 2007, 38, 58–60.
5. Fan, B.; Bai, C.; Li, J. Prediction of gas emission from coal mining face based on LMD-SVM. J. Min. Saf. Eng. 2013, 30, 946–952.
6. Fu, H.; Xie, S.; Xu, Y.; Chen, Z. Research on prediction model of mine gas emission based on MPSO-WLS-SVM. Chin. Saf. Sci. J. 2013, 23, 56–61.
7. Dong, X.; Jia, J.; Bai, Y.; Fan, C. Prediction of gas emission from coal mining face based on SVM coupled genetic algorithm. J. Saf. Environ. 2016, 16, 114–118.
8. Feng, S.; Shao, L.; Lu, W.; Meng, T.; Gao, Z. Application of PCA-PSO-LSSVM Model in Gas Emission Prediction. J. Liaoning Technol. Univ. (Nat. Sci.) 2019, 38, 124–129.
9. Wang, Y. Gas emission prediction based on PCA-PSO-ELM. J. Hunan Univ. Sci. Technol. (Nat. Sci. Ed.) 2020, 35, 1–9.
10. Zhang, Q.; Jia, B.; Dong, X.; Li, Z. Prediction of gas emission in mining face by PCA-GA-SVM. J. Liaoning Technol. Univ. (Nat. Sci.) 2015, 34, 572–577.
11. Xiang, P.; Xie, X.; Shuang, H.; Liu, C.; Wang, H.; Xu, J. Research on Gas Emission Prediction Based on KPCA-CMGANN Algorithm. China Saf. Sci. J. 2020, 30, 39–47.
12. Wang, L.; Liu, Y.; Liu, Z.; Qi, J. Research on Gas Emission Prediction Model Based on IABC-LSSVM. Sens. Microsyst. 2022, 41, 34–38.
13. Huang, W.; Shi, S. Gas gushing time series prediction based on improved Lyapunov index. J. China Coal Soc. 2009, 34, 1665–1668.
14. Dan, Y.; Hou, F.; Fu, H.; Ma, J. Prediction of Gas Emission in Chaotic Time Series Based on Improved Extreme Learning Machine. China Saf. Sci. J. 2012, 22, 58–63.
15. Liu, J.; An, F.; Lin, D.; Guo, Z.; Zhang, L. Natural mode SVM modeling and prediction of gas emission from coal mining face. Syst. Eng. Theory Pract. 2013, 33, 505–511.
16. Lu, G.; Li, X.; Zu, B.; Dong, J. Research on time-varying sequence prediction of gas emission based on EMD-MFOA-ELM. J. Saf. Sci. Technol. 2017, 13, 109–114.
17. Dai, W.; Fu, H.; Ji, C. VMD-DE-RVM interval prediction method for gas emission in mining face. China Saf. Sci. J. 2018, 28, 109–115.
18. Xiao, P.; Xie, H.; Shuang, H.; Liu, C.; Xu, J.; Hong, J. Application of Wavelet-Extreme Learning Machine in Time-varying Sequence Prediction of Gas Emission. J. Xi’an Univ. Sci. Technol. 2020, 40, 839–845.
19. Zhan, G.; Wang, Y.; Fu, H.; Wang, S. Prediction of Gas Emission Based on Variational Mode Decomposition and Deep Integration Combination Model. Control Eng. China 2022, 29, 1–12.
20. Dragomiretskiy, K.; Zosso, D. Variational Mode Decomposition. IEEE Trans. Signal Process. 2014, 62, 531–544.
21. Zhang, Y.; Han, P.; Wang, D.; Wang, S. Short-term wind speed prediction of wind farms based on variational mode decomposition and LSSVM. Acta Energiae Solaris Sin. 2018, 39, 194–202.
22. Zhang, S.; Su, X.; Chen, R.; Liu, W.; Zuo, Y.; Zhang, Q. Short-term power load forecasting based on variational modal decomposition and FABP. Chin. J. Sci. Instrum. 2018, 39, 67–73.
23. Balakrishnan, R.; Geetha, V.; Kumar, M.R.; Leung, M.-F.; Lucchi, E. Reduction in Residential Electricity Bill and Carbon Dioxide Emission through Renewable Energy Integration Using an Adaptive Feed-Forward Neural Network System and MPPT Technique. Sustainability 2023, 15, 14088.
24. Ma, H.; Tong, Q.; Zhang, Y. Application of Variational Mode Decomposition of Optimization Parameters in Fault Diagnosis of Rolling Bearings. China Mech. Eng. 2018, 29, 390–397.
25. Zhang, S.; Li, J.; Jiang, A.; Huang, J.; Liu, H.; Ai, H. Novel two-stage short-term power load forecasting based on FPA-VMD and BiLSTM neural network. Power Syst. Technol. 2022, 46, 3269–3279.
26. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444.
27. Liu, Y.; Zhao, Q. CNN-LSTM ultra-short-term power load forecasting based on cluster empirical mode decomposition. Power Syst. Technol. 2021, 45, 4444–4451.
28. Meng, A.; Xu, X.; Chen, J.; Wang, C.; Zhou, T.; Yin, H. Ultra-short-term photovoltaic power prediction based on reinforcement learning and combined deep learning model. Power Syst. Technol. 2021, 45, 4721–4728.
29. Hao, S.; He, T.; Ma, X.; Zhang, X.; Wu, Y.; Wang, H. KDBiDet: A Bi-Branch Collaborative Training Algorithm Based on Knowledge Distillation for Photovoltaic Hot-Spot Detection Systems. IEEE Trans. Instrum. Meas. 2024, 73, 3504615.
30. Liu, J.; Quan, H.; Yu, X.; He, K.; Li, Z. Fault diagnosis of rolling bearing based on parameter optimization VMD and sample entropy. Acta Autom. Sin. 2022, 48, 808–819.
31. Chen, C.; Li, X.; Yang, L.; Qu, H.; Wang, Y.; He, C. Application of Variational Mode Decomposition in Power System Harmonic Detection. Power Syst. Prot. Control 2018, 46, 63–70.
32. Yang, D.; Feng, F.; Zhao, Y.; Jiang, P.; Ding, C. VMD sample entropy feature extraction method and its application in planetary gearbox fault diagnosis. J. Vib. Shock 2018, 37, 198–205.
33. Zhao, X.; Zhang, S.; Li, Z.; Li, F.; Hu, Y. Fault feature signal extraction method based on VMD. J. Vib. Meas. Diagn. 2018, 38, 11–19, 202.
34. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780.
35. Li, Y.; Wang, S.; Chen, Q.; Wang, X. Comparative Research on Several New Swarm Intelligence Optimization Algorithms. Comput. Eng. Appl. 2020, 56, 1–12.
36. Zhao, H.; Shen, X.; Lv, L.; Lan, P.; Liu, J.; Liu, D. Load data restoration based on GAN and its application in short-term load forecasting of EV. Autom. Electr. Power Syst. 2021, 45, 143–151.
37. Ma, Z.; Li, Y.; Liu, Z.; Gu, C. Fault Feature Extraction of Rolling Bearing Based on Variational Mode Decomposition and Teager Energy Operator. J. Vib. Shock 2016, 35, 134–139.
Figure 1. LSTM cell structure.
Figure 2. Flow diagram of gas emission prediction model.
Figure 3. Outlier discriminant boxplot.
Figure 4. The 1417 mining working face timing diagram.
Figure 5. Iterative optimization curve of GA to optimize VMD.
Figure 6. Gas emission of the No. 1417 mining working face by VMD: (a) IMF1~IMF5; (b) IMF6~IMF10.
Figure 7. Reconstruction curve by VMD.
Figure 8. The prediction results of each decomposition component by GVSL.
Figure 9. Reconstructed predictions by GVSL model.
Figure 10. Comparison of different prediction models: (a) scenario 1; (b) scenario 2; (c) scenario 3.
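Table 1. Gas emission data.

| Serial Number | Time | Class | Gas Ventilation Volume/(m³·min⁻¹) | Gas Drainage Volume/(m³·min⁻¹) | Absolute Gas Emission Quantity/(m³·min⁻¹) |
|---|---|---|---|---|---|
| 1 | 1/26 | 16 | 5.25 | 10.91 | 16.16 |
| 2 | 1/27 | 0 | 5.51 | 12.17 | 17.68 |
| 3 | 1/27 | 8 | 4.99 | 12.70 | 17.69 |
| 4 | 1/27 | 16 | 5.51 | 12.10 | 17.61 |
| 5 | 1/28 | 0 | 6.30 | 14.59 | 20.89 |
| … | … | … | … | … | … |
| 280 | 4/29 | 8 | 3.15 | 11.44 | 14.59 |
| 281 | 4/29 | 16 | 3.15 | 10.93 | 14.08 |
| 282 | 4/30 | 0 | 3.38 | 11.35 | 14.73 |
| 284 | 4/30 | 8 | 2.93 | 10.99 | 13.92 |
| 283 | 4/30 | 16 | 4.05 | 11.81 | 15.86 |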
Table 2. Outliers of gas emission data.

| Serial Number | Time | Class | Absolute Gas Emission Quantity/(m³·min⁻¹) |
|---|---|---|---|
| 23 | 2/3 | 0 | 23.22 |
| 61 | 2/15 | 16 | 26.66 |
| 62 | 2/16 | 0 | 27.14 |
| 64 | 2/16 | 16 | 31.95 |
| 65 | 2/17 | 0 | 31.85 |
| 72 | 2/19 | 8 | 27.59 |
| 75 | 2/20 | 8 | 30.04 |
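Figure 3 and Table 2 refer to a boxplot-based outlier check. The exact whisker rule is not stated in this part of the article, so the following is only a minimal sketch that assumes the common Tukey convention (points beyond 1.5 times the interquartile range from the quartiles are flagged). The function name `boxplot_outliers`, the `whisker` parameter, and the sample series are hypothetical and are not taken from the paper.

```python
import statistics


def boxplot_outliers(values, whisker=1.5):
    """Return (index, value) pairs lying outside the assumed boxplot fences."""
    # Quartiles via the standard library; "inclusive" treats the data
    # as the whole population rather than a sample of a larger one.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower = q1 - whisker * iqr  # assumed lower fence (Tukey rule)
    upper = q3 + whisker * iqr  # assumed upper fence (Tukey rule)
    return [(i, v) for i, v in enumerate(values) if v < lower or v > upper]


# Hypothetical series in m^3/min; only 31.95 echoes a value listed in Table 2.
sample = [14.0, 15.0, 16.0, 17.0, 18.0, 31.95]
flagged = boxplot_outliers(sample)
```

With a different whisker factor or quartile convention the flagged set may change, so the records in Table 2 should be taken from the article rather than from this sketch.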
Table 3. Imputation error comparison for random missing.

Mean square error of different interpolation methods

| Absence Rate/% | EM Algorithm Imputation | Mean Imputation | Linear Interpolation | Random Forest Imputation |
|---|---|---|---|---|
| 5 | 2.30 | 3.04 | 0.11 | 3.47 |
| 10 | 1.28 | 1.78 | 0.13 | 1.81 |
| 15 | 1.13 | 1.42 | 0.16 | 1.48 |
| 20 | 1.53 | 1.66 | 0.39 | 1.84 |
| 25 | 1.48 | 1.81 | 0.41 | 2.00 |
| 30 | 1.62 | 2.20 | 0.39 | 2.14 |
Table 4. Linear interpolation fill data.

| Serial Number | Time | Class | Absolute Gas Emission Quantity/(m³·min⁻¹) |
|---|---|---|---|
| 23 | 2/3 | 0 | 21.41 |
| 61 | 2/15 | 16 | 23.99 |
| 62 | 2/16 | 0 | 23.50 |
| 64 | 2/16 | 16 | 22.46 |
| 65 | 2/17 | 0 | 21.92 |
| 72 | 2/19 | 8 | 24.00 |
| 75 | 2/20 | 8 | 23.73 |
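Table 5. Reconstruction data by VMD.

| Serial Number | Actual Value/(m³·min⁻¹) | VMD Reconstruction Value/(m³·min⁻¹), k = 3 | VMD Reconstruction Value/(m³·min⁻¹), k = 5 | VMD Reconstruction Value/(m³·min⁻¹), k = 8 | VMD Reconstruction Value/(m³·min⁻¹), k = 10 |
|---|---|---|---|---|---|
| 1 | 16.16 | 16.16 | 14.13 | 15.18 | 16.10 |
| 2 | 17.68 | 17.68 | 17.27 | 17.26 | 17.52 |
| 3 | 17.69 | 17.69 | 15.60 | 17.61 | 17.56 |
| 4 | 17.61 | 17.61 | 18.91 | 18.48 | 17.87 |
| 5 | 20.89 | 20.89 | 20.78 | 20.64 | 20.87 |
| … | … | … | … | … | … |
| 280 | 14.59 | 14.69 | 14.92 | 14.79 | 14.66 |
| 281 | 14.08 | 15.23 | 14.85 | 14.77 | 14.17 |
| 282 | 14.73 | 15.29 | 15.45 | 15.28 | 14.87 |
| 284 | 13.92 | 14.64 | 14.72 | 14.32 | 14.07 |
| 283 | 15.86 | 15.37 | 15.53 | 16.05 | 16.04 |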
Table 6. LSTM parameter values for SSA optimization.

| Decomposed Component | NumHiddenUnits, MaxEpochs (digits run together in the source) | InitialLearnRate | L2Regularization |
|---|---|---|---|
| IMF1 | 161255 | 0.0319 | 0.0284 |
| IMF2 | 9854 | 0.0631 | 0.0604 |
| IMF3 | 20081 | 0.0081 | 0.0001 |
| IMF4 | 2116 | 0.7578 | 0.8235 |
| IMF5 | 3630 | 0.1323 | 0.1069 |
| IMF6 | 11525 | 0.0118 | 0.0001 |
| IMF7 | 3073 | 0.0001 | 0.0001 |
| IMF8 | 1225 | 0.0001 | 0.0001 |
| IMF9 | 69 | 0.0011 | 0.0010 |
| IMF10 | 20060 | 0.0116 | 0.0001 |
Table 7. Prediction error of each GVSL model.

| GVSL Forecasting Model | Absolute Error/(m³·min⁻¹), Minimum Value | Absolute Error/(m³·min⁻¹), Maximum Value | Absolute Error/(m³·min⁻¹), Mean Value |
|---|---|---|---|
| IMF1 | 0.0009 | 0.0266 | 0.0152 |
| IMF2 | 0.0178 | 0.0808 | 0.0460 |
| IMF3 | 0.0003 | 0.0426 | 0.0146 |
| IMF4 | 0.0002 | 0.0648 | 0.0140 |
| IMF5 | 0 | 0.0999 | 0.0348 |
| IMF6 | 0 | 0.0583 | 0.0218 |
| IMF7 | 0.0005 | 0.0307 | 0.0109 |
| IMF8 | 0.0006 | 0.0399 | 0.0144 |
| IMF9 | 0.0002 | 0.0254 | 0.0083 |
| IMF10 | 0.0010 | 0.0189 | 0.0047 |

Note: Values are reported to four decimal places because many of them would round to zero at two decimal places.
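Table 8. Model evaluation index comparison.

| Scenario | Evaluating Indicator | GVSL | VMD-LSTM | SSA-LSTM | GPR |
|---|---|---|---|---|---|
| Scenario one | MAE | 0.27 | 0.60 | 0.65 | 0.77 |
| Scenario one | MAPE/% | 1.72 | 3.82 | 4.27 | 5.14 |
| Scenario one | RMSE | 0.31 | 0.72 | 0.83 | 0.88 |
| Scenario one | R² | 0.95 | 0.74 | 0.67 | 0.68 |
| Scenario two | MAE | 0.18 | 0.52 | 0.73 | 0.68 |
| Scenario two | MAPE/% | 1.16 | 3.50 | 4.76 | 4.56 |
| Scenario two | RMSE | 0.22 | 0.61 | 0.97 | 0.77 |
| Scenario two | R² | 0.96 | 0.74 | 0.34 | 0.65 |
| Scenario three | MAE | 0.11 | 0.30 | 0.37 | 0.39 |
| Scenario three | MAPE/% | 0.71 | 1.91 | 2.33 | 2.62 |
| Scenario three | RMSE | 0.14 | 0.41 | 0.53 | 0.44 |
| Scenario three | R² | 0.99 | 0.88 | 0.80 | 0.87 |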
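The four indicators in Table 8 (MAE, MAPE, RMSE and R²) have standard definitions, and the sketch below shows one way they could be computed for a set of observed and predicted values. The function name `evaluate` and the predicted values in the example are hypothetical; the observed values are the first five absolute gas emission quantities from Table 1. It is an illustration under these assumptions and does not reproduce the figures reported in Table 8.

```python
import math


def evaluate(y_true, y_pred):
    """Return MAE, MAPE (in %), RMSE and R^2 for two equal-length sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    # MAPE divides by the observed value, so every observation must be non-zero.
    mape = 100.0 * sum(abs(e / t) for e, t in zip(errors, y_true)) / n
    ss_res = sum(e * e for e in errors)
    rmse = math.sqrt(ss_res / n)
    mean_true = sum(y_true) / n
    # R^2 is undefined when all observations are identical (ss_tot == 0).
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot
    return {"MAE": mae, "MAPE": mape, "RMSE": rmse, "R2": r2}


observed = [16.16, 17.68, 17.69, 17.61, 20.89]   # from Table 1
predicted = [16.40, 17.50, 17.90, 17.40, 20.60]  # hypothetical predictions
scores = evaluate(observed, predicted)
```

Lower MAE, MAPE and RMSE and an R² closer to 1 indicate a closer fit, which is the sense in which the models in Table 8 are compared.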
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, J.; Cui, Y.; Yan, Z.; Huang, Y.; Zhang, C.; Zhang, J.; Guo, J.; Zhao, F. Time Series Prediction of Gas Emission in Coal Mining Face Based on Optimized Variational Mode Decomposition and SSA-LSTM. Sensors 2024, 24, 6454. https://doi.org/10.3390/s24196454
