1. Introduction
Spatial interpolation is a computational technique used to predict attribute values at any location in a region. It involves filling the gaps between sampled points and converting discrete sample data into a continuous surface [
1,
2,
3]. This method is based on Tobler’s [
4] first law of geography and the principle of distance decay. Currently, spatial interpolation is widely used in various fields such as spatial analysis, 3D geological modeling, management of geographic data, smart city geological management, and spatial assessment [
5,
6,
7,
8,
9,
10]. Recently, as China’s coal mining industry moves towards intelligent mining practices [
11,
12,
13], constructing high-precision 3D geological models has become a critical technical necessity. The quality and reliability of these models directly hinge upon the selection of a scientifically sound and appropriate spatial interpolation method [
14,
15].
Spatial interpolation methods are broadly classified into three categories: deterministic interpolation, geostatistical interpolation, and machine learning interpolation. Deterministic interpolation relies on the similarity of sample points or surface smoothness to develop a predictive model. Common deterministic methods include inverse distance weighting (IDW) and radial basis function (RBF) [
16]. Nistor et al. [
17] employed IDW for 3D geological modeling of groundwater levels. Liu et al. [
18,
19] conducted a systematic analysis of IDW parameters and introduced the concept of “Relative distance” for 3D geological modeling. Addressing the issue of RBF stability, Iske [
20] proposed a method for 3D geological modeling utilizing scattered data. Cuomo et al. [
21] explored a method for reconstructing implicit curves and surfaces through RBF interpolation. Deterministic interpolation algorithms offer advantages in terms of simplicity, flexibility, and speed. However, they require a high density of spatially distributed sample points. Therefore, interpolation precision reduces when sample points are sparsely distributed.
Geostatistical interpolation analyzes natural characteristics by referencing spatial or spatiotemporal data. Central to this field are kriging [
22] and its variations, including ordinary kriging, simple kriging, probabilistic kriging, and directed kriging, along with their optimized algorithms [
23,
24]. These methods utilize the semivariogram to characterize spatial autocorrelation and achieve optimal, linear, and unbiased estimations. Adhikary et al. [
25] developed a spatially interpolated precipitation prediction employing collaborative kriging. Cheng et al. [
26] generated a multi-attribute 3D geological model of a high-geostress tunnel utilizing indicator kriging. An et al. [
27] evaluated the effectiveness of discrete smooth interpolation (DSI), kriging, and function interpolation for transparent surfaces by analyzing the resultant errors. Che et al. [
28] built a 3D fault model utilizing weighted kriging. However, kriging methods necessitate stable, consistently distributed data, and when significant spatial variability exists between sampling points, the resulting interpolation is excessively smooth, failing to capture localized fluctuations. With advances in geostatistics, Bayesian theory and its derived methods are widely applied in fields such as engineering geology, spatiotemporal interpolation, and public health. Yang HQ et al. [
10,
29,
30,
31] precisely identified the soil–rock boundary using Bayesian evidence theory. They inverted issues such as rainfall-induced slope instability based on Bayesian theory. In the field of spatiotemporal interpolation, Bayesian Maximum Entropy (BME) interpolation [
32,
33] has become prominent in the field. In contrast to kriging, BME is knowledge-driven, capable of integrating diverse data sources with varying levels of precision under a rigorous and comprehensive theoretical framework. This enhances predictive precision and offers adaptability [
34,
35]. Reyes et al. [
36] applied BME to predict PM2.5 concentration distributions, achieving a 21.89% error reduction compared to ordinary kriging.
Machine learning also has certain applications in the field of spatial interpolation. Wang et al. [
37] introduced a leapfrog algorithm to optimize the variation function of kriging experimentation, proposing the Shuffled Frog Leaping Algorithm (SFLA)–kriging algorithm, integrating borehole data from an open-pit mine. Zhang et al. [
38] employed a Particle Swarm Optimization (PSO) algorithm for variogram parameter estimation, while Ding et al. [
39] developed a PSO–kriging 3D geological modeling method based on a PSO-optimized kriging algorithm. Liu and Zhang [
40] advanced a k-nearest neighbor (KNN)–kriging hybrid method for 3D geological modeling. In contrast to geostatistical interpolation, machine learning operates without reliance on data trends or assumptions; however, overfitting remains a central challenge affecting the predictive precision of machine learning. Currently, its primary application in 3D geological modeling is reflected by optimizing the parameters of geostatistical interpolation algorithms.
For high-precision 3D geological modeling of a coal face, the data sources for terrain elevation, strata, and coal seam thickness vary. Terrain elevation data sources are typically dense and widespread, whereas strata thickness data sources are sparse, exhibit inconsistent variations between strata, and demonstrate significant shifts in spatial data characteristics. Coal seam thickness data sources comprise not only “hard” data, such as those derived from drilling and tunnel realities, but also “soft” data, such as coal thickness estimations from channel wave exploration or other geophysical prospecting methods, which contain uncertainties. Effectively integrating these “soft” and “hard” datasets for interpolation presents a challenge. Currently, research on selecting scientifically robust interpolation methods for high-precision 3D geological modeling of a workface, especially specific data sources, is limited. Similarly, studies focusing on 3D geological modeling of coal faces, addressing issues of missing strata and the integrated use of “soft” and “hard” data, are scarce. Focusing on the 25214 workface in the Hongliulin coal mine, this study systematically evaluates the interpolation data sources for terrain elevation, strata, and coal seam thickness. The precision of nine interpolation methods, including kriging, IDW, RBF, and BME, was evaluated through cross-validation. By combining stratigraphic sequence analysis with the optimal interpolation method, a high-precision 3D geological model of the 25214 workface was constructed. The objective of this research is to develop a high-precision three-dimensional geological model of the coal face, reflecting the actual geological characteristics, and to offer scientific support for intelligent coal mining operations.
2. Principles and Algorithms
2.1. Deterministic Interpolation Method
Inverse distance weighting (IDW) [
41] interpolates values based on spatial proximity. It operates on the principle that closer distances exert greater influence on the interpolated result. The interpolation formula is:
where
represents the predicted value for the insertion point,
denotes the known value of point
,
expresses the Euclidean distance between the insertion point
and the known point
,
indicates a power parameter of distance, usually 2, and the degree of attenuation of the control distance, and
N refers to the number of known points involved in the interpolation.
RBF (radial basis function) interpolation is an interpolation method based on radial basis function, which produces smooth surface by fitting the known point position and attribute value. The basic formula for RBF is:
where
represents the predicted value of
to be interpolated,
denotes the weight coefficient, determined by the location and value of known points,
expresses the radial basis function, the common radial basis functions include thin plate spline, regular spline, higher-order spline et al., and
refers to the distance between the interpolation point
and the known points
.
Common radial basis functions are:
(1) Thin plate spline function (TPS):
(2) Regular spline function (RS):
(3) Higher-order spline function (HS):
where
represents the Euclidean distance between a point to be inserted and a known point,
denotes the natural logarithm;
expresses the parameter that controls the order of polynomials, and
indicates a non-negative integer that determines the order of functions.
2.2. Kriging Interpolation
Kriging interpolation [
22] leverages spatial autocorrelation statistics. Its core principle is minimizing the variance of estimation error. The fundamental kriging algorithm, often termed ordinary kriging (OK) interpolation, assumes a spatially constant but unknown mean (i.e., stationary data). The interpolation formula is:
where
represents the predicted value of the point
to be inserted,
denotes the weight coefficient determined by minimizing the variance of the prediction error, and
expresses the value of the known point
.
The weight
coefficients of ordinary kriging satisfy the following constraints:
The weights are calculated utilizing a semivariance function
that describes the variation in similarity between known points as distance
increases:
where
represents the distance between two points, and
denotes the logarithm of points whose distance is
.
The semivariogram model is fitted by the semivariogram theory . The commonly utilized semivariogram models are:
- (1)
- (2)
- (3)
- (4)
represents a semi-variogram model, denotes the nugget effect, expresses the base value, depicts the variation.
2.3. Bayesian Maximum Entropy (BME)
The Bayesian Maximum Entropy (BME) interpolation method is a geostatistical approach that integrates Bayesian statistical theory with the principle of maximum entropy. This method effectively combines high-precision measurement data (hard data) with estimation data containing uncertainties (soft data), accounting for data errors and uncertainties throughout the spatial interpolation process to achieve more accurate variable estimates.
According to data precision, the BME method classifies interpolation data into two categories. Hard data is high-precision measurement data with negligible error, such as coal seam thickness measurements from boreholes and roadways; soft data is estimated data with inherent uncertainty, such as coal seam thickness inferred from channel wave exploration inversion. Soft data can be further classified into interval-type, probability-type, and function-type data.
Within the theoretical framework of BME (
Figure 1), all data and information are collectively referred to as the knowledge base (K). Based on the nature of the information, the knowledge base can be further divided into:
General knowledge (G) describes the global statistical characteristics of the variable, such as the mean, covariance, etc.; specific knowledge (S) refers to site-specific observational information within the study area, including both hard and soft data.
The BME method involves three computational stages: prior, intermediate, and posterior. The spatial variable to be estimated is treated as a random variable x, which is assumed to follow a certain probability distribution at each location within the study area. The goal of spatial interpolation is to estimate the random variable x at any unsampled location. To obtain the final estimate, xmap is introduced to represent the optimal estimate of the random variable x at a given location.
In this study, xmap represents the coal seam thickness. In other contexts, it can represent elevation, stratigraphic thickness, or other spatial variables. Essentially, BME is applicable to the interpolation of any spatial variable, provided that spatial correlation exists. This assumption is widely applicable in geosciences. For example, elevation data typically exhibit smooth and continuous spatial variations, whereas coal seam thickness data may present zero-thickness or abrupt changes. However, within a certain region, coal seam thickness still exhibits spatial correlation. Therefore, the BME method is suitable for interpolating various geoscientific variables with spatial correlation, including but not limited to elevation, stratigraphic thickness, and coal seam thickness.
Prior stage: Based on general knowledge
G, the prior probability density function of the random variable
x is established as follows:
where
is a random variable,
is a prior probability density function of
,
is a known function of the relationship between
,
is a constant regularization constraint, and
is a Lagrange multiplier.
Intermediate stage: All identified observational data (hard and soft data) are integrated to construct the specific knowledge base S, which is incorporated into the BME model.
Posterior Stage: By integrating specific knowledge
S (i.e., hard data and soft data), the posterior probability density function of the random variable is derived using Bayesian inference. In this stage, the final estimate is typically obtained as the expectation of the posterior probability density function.
where
is a posteriori probability density distribution function,
is a joint probability density distribution function for “hard data” and “soft data”, and finally,
is obtained based on a posteriori probability density function and utilized as an estimate of the variable:
To quantitatively evaluate the performance of the aforementioned nine interpolation methods, mean absolute error (MAE) and root mean square error (RMSE) were calculated as evaluation indexes utilizing cross-validation. Considering that the present dataset is not extensive, leave-one-out cross-validation was employed. Each data point was iteratively excluded, and the mean MAE and RMSE across all data points were finally computed.
where
represents the number of sampling points,
denotes the true value of sampling point
, and
expresses the predicted value in
.
3. Study Area and Data Source Analysis
3.1. Study Area
This study focuses on the 25214 workface at the Hongliulin coal mine, north of Yulin City, Shaanxi Province. The region presents a complex geomorphology, represented by rolling sand dunes to the west and Loess Hills and gullies in the northeast. The overall topography slopes downwards from northwest to southeast (
Figure 2).
Borehole data, collected over a 300 m near the 25214 workface (
Figure 3), indicates the presence of Quaternary (Q
4), Neogene Baode Formation (N
2b), and Jurassic Yan’an Formation (J
2y) strata. The Quaternary system consists primarily of Holocene eolian sand deposits and Lishi Formation loess. The red soil of the Baode Formation constitutes the principal aquiclude in the workface roof. Near the 25214 workface, erosion has removed J
2y5 and J
2y4 strata, leaving only J
2y3, J
2y2, and J
2y1. The primary coal seams cap each of these sections: 3
−1 Coal sits atop J
2y3, 4
−2 Coal above J
2y2, and 5
−2 Coal crowns J
2y1. The main mining face 25214 is extracting from the 5
−2 coal, therefore no interpolation has been conducted for the 3
−1 and 4
−2 coal. This study primarily evaluates the topography Q
4, N
2b, J
2y3, J
2y2, 5
−2 Coal, and J
2y1.
3.2. Data Source and Feature Analysis
An analysis was conducted of the spatial interpolation data sources—terrain elevation, stratum, and coal seam thickness—within 300 m near the 25214 workface.
- (1)
Terrain data
The data sources for terrain interpolation consist of elevation measurement points obtained from aerial photogrammetry using an unmanned aerial vehicle (UAV) equipped with a high-resolution camera, and from Real-Time Kinematic (RTK) positioning surveys. Both datasets cover the entire mining area; however, this study extracted a total of 1700 elevation points within the 25214 workface and its 300 m extended boundary (
Figure 4). Within this range, the UAV photogrammetric data points are densely distributed. These points were obtained through the deployment of ground control points and aerial triangulation densification processing, ensuring high positional accuracy. In contrast, the RTK survey data points are more sparsely distributed, as they were collected using conventional ground surveying methods. Although the two datasets differ in acquisition techniques and individual point accuracies, both are considered high-precision measurements. Therefore, they are treated as “hard data” in the interpolation process, providing a reliable basis for terrain modeling.
- (2)
Stratigraphic data
Formation thickness data for the study area were obtained from 27 boreholes, the spatial distribution of which is shown in
Figure 5. These boreholes provide thickness measurements for five primary stratigraphic units: the Quaternary (Q
4), Neogene Baode Formation (N
2b), and Jurassic Yan’an Formation members (J
2y3, J
2y2, J
2y1). Due to variations in borehole depths, data for the J
2y2 and J
2y1 units were available from 20 boreholes, whereas the remaining 7 boreholes did not reach these deeper strata.
The thickness ranges of the five stratigraphic units are summarized in
Table 1 as follows: Q
4 (0–90.29 m), N
2b (0–92 m), J
2y3 (11.3–48.15 m), J
2y2 (51.06–81.4 m), and J
2y1 (6.85–30.15 m). The Q
4 and N
2b formations exhibit substantial thickness variability across the workface, reflecting notable stratigraphic undulation and local erosion. In contrast, the J
2y3, J
2y2, and J
2y1 units generally show more uniform thicknesses, though localized variations are still present.
Furthermore, local absences of certain strata, particularly Q4 and N2b, were observed in some boreholes, indicating erosional truncation or non-deposition in specific areas. These spatial discontinuities further highlight the heterogeneity in the overlying strata. Collectively, these stratigraphic data provide the fundamental basis for subsequent geological modeling and interpolation analyses.
- (3)
Coal seam data
The interpolation data for coal seam thickness in the 25214 workface were obtained from multiple sources, including borehole measurements, roadway exposure measurements during tunnel excavation (
Figure 6), and channel wave exploration inversion results. During tunnel excavation, coal seam thickness was measured along the roadway with an average spacing of approximately 50 m between measurement points. These measurements were used to assist in roadway positioning and adjust the excavation direction. Notably, seam thinning was observed approximately 500 m from the retreat support roadway, where a sandstone scour zone had eroded the seam. To further delineate this thinning anomaly, channel wave exploration was conducted within the range of 310–690 m from the retreat roadway, and the seam thickness was inferred by inverting the channel wave group velocity.
The layout for the channel wave exploration at 25214 workface is depicted in
Figure 7. Twenty shot points and forty-two receiver points were planned for each of the two survey lines. Shot points were spaced 20 m apart, and receiver points 10 m apart. In practice, shots S21 and S25 were not fired, resulting in data acquisition from 38 shots and 1596 data points (
Table 2).
The channel wave exploration results indicate that the coal seam thinning zone correlates with a high-velocity anomaly.
Figure 8 presents the seismic record for shot S11 in the 25214 workface. The magenta line represents an auxiliary line at 925 m/s, while the green line demonstrates the actual arrival time of each channel wave. From this figure, the channel wave velocity is approximately 925 m/s for receivers 1 through 29. However, arrivals at receivers 30 through 42 exhibit an earlier arrival time, indicating a velocity increase and, therefore, coal seam thinning. Picking different channel waves produces the velocity distribution across the workface (
Figure 9). While the channel wave velocity is generally stable, the data define an anomalous high-velocity range, referred to as SYC1.
High-velocity anomalies typically indicate local coal seam thinning or the presence of a sandstone scoured zone, as channel waves travel faster in sandstone than in coal. In the SYC1 region, on-site observations confirmed that the coal seam thinning was caused by a localized sandstone scour body. This scour body replaced part of the coal seam with harder sandstone, resulting in a velocity increase in the channel wave propagation.
Notwithstanding their dense and uniform distribution across the workface, the inverted coal thickness data from channel waves are considered “soft data”, indicating lower precision and uncertainty. In subsequent interpolation, a comparison was made between the measured data from the tunnel and the inversion coal thickness data of the nearby trough waves. It was found that the error range was within ±20% of the measured data. Therefore, in the BME interpolation, the trough wave exploration inversion coal thickness data was used as interval-type [lb, ub] soft data for the calculations.
4. Analysis and Visualization of Interpolation Results
4.1. Terrain Modeling
To develop an accurate model for estimating the spatial distribution of topographic elevation data in the study area, spherical kriging, exponential kriging, Gaussian kriging, linear kriging, IDW (
p = 2), TPS-RBF, RS-RBF, and HS-RBF, were employed to create topographic contour maps (
Figure 10). In the interpolation, the neighbor points are selected as 15. Visually, the eight contour maps exhibited minimal differences and effectively represented the undulating nature of the terrain. To quantitatively assess each interpolation algorithm, cross-validation was performed utilizing the mean absolute error (MAE) and root mean square error (RMSE) as evaluation criteria (
Table 3). In terms of computation time and memory consumption, there is little difference among the eight interpolation methods. The kriging methods take 51.87–63.10 s, and the IDW and RBF methods take 42.05–49.36 s, which is related to their weighting methods. Kriging-type methods require a relatively long time to fit the variogram, but overall the time is acceptable, with a memory consumption of between 124–212 KB, which is negligible on modern computer hardware.
The results indicate a high degree of precision for both geostatistical and deterministic interpolation methods, fulfilling the requirement for high-precision modeling. The linear kriging interpolation method demonstrated the highest precision, with MAE = 1.01 m and RMSE = 1.20 m. The linear variogram proved most suitable for fitting the variogram of the topographic elevation points. This suggests that the linear kriging method effectively captures the spatial distribution characteristics of topographic elevation points and represents the optimal interpolation method for this topographic model. A 3D terrain model was generated based on linear kriging interpolation (
Figure 11), visualizing the relief characteristics of the surface elevation in the study area. The 3D terrain model and contour map were translated along the
Z-axis, and the 3D terrain model was stretched by a factor of two along the
Z-axis to enhance visual clarity. The details and overall trends of the terrain in the study area were clearly reflected. After selecting the linear kriging method, the sensitivity experiment of its neighbor points parameters is carried out.
Figure 12 represents the MAE and RMSE change diagrams of the interpolation results under the condition of different number of neighbor points. It can be seen that when the number of interpolation points is 0–9, the MAE and RMSE values of the model show a rapid downward trend, which is due to the elevation of the terrain interpolation data; the value is large, the terrain points are numerous, the neighbor range of 0–9 points cannot well capture the local differences of the terrain; the neighbor range tends to be stable at 10–25 points of MAE and RMSE, proving that the neighbor range of 10–25 points is able to capture the local variation and undulation characteristics of the terrain. The parameter sensitivity experiments show that the model is stable when the neighborhood points are more than 10.
4.2. Formation Modeling
For the strata (Q
4, N
2b, J
2y3, J
2y2, and J
2y1), the optimal interpolation method was selected (
Table 4) using the same eight interpolation methods as for terrain interpolation and evaluated through cross-validation (MAE and RMSE). Taking the Q
4 stratum as an example, its best result was spherical kriging (MAE = 2.13 m, RMSE = 2.83 m). From the eight interpolation results in
Figure 13, the spatial distribution of the spherical kriging image is also more reasonable, without abnormal breakpoints or irregular polylines. The same indicators and criteria were used to determine the optimal interpolation methods for the other four strata, N
2b was IDW (
p = 2), J
2y3 was RS-RBF, J
2y2 was TPS-RBF, and J
2y1 was HS-RBF.
The data source of stratum interpolation is borehole data, which is less distributed.
Figure 14 shows that the change of MAE and RMSE is very small when the number of neighbor points is 3–10, which is between 0.05 and 0.1 m. It proves that the number of neighbor points has little effect on the interpolation of strata and the model is stable. Because the number of formation interpolation boreholes is small, the calculation time and memory consumption are very small, so the statistical analysis is not carried out separately.
For certain formations (such as Q
4 and N
2b), a virtual thickness assignment method addressed missing borehole data. Specifically, the missing zone is determined by the contour map generated by the optimal interpolation method, and the thickness of the missing zone is assigned to a negative value, then the interpolated surface is regenerated, while the missing line (0 line) is drawn automatically. During 3D layer modeling, grid intersection eliminated these areas of missing stratum data, allowing for accurate definition of these zones (
Figure 15). Employing the optimal interpolation method in conjunction with this stratum deletion method produced the layer models for the Q
4, N
2b, and J
2y3 strata (
Figure 16), verifying the effectiveness and scientific basis of the missing data handling procedure.
4.3. Coal Modeling
Considering that the interpolation results inform the three-dimensional geological model and recognizing that the channel wave exploration does not consist of the entire workface, the spatial distribution of 5−2 coal thickness was analyzed across the entire 25214 workface, as well as in the channel wave exploration area, to ensure accurate representation.
Assessment of coal thickness distribution across the entire workface employed eight interpolation methods. Low MAE and RMSE values indicate the reliability of these methods (
Table 5). Among these methods, spherical, exponential, and linear kriging demonstrated comparably high precision. Visual inspection of the interpolated distributions (
Figure 17) indicates a more uniform color distribution and aesthetically preferable representation from spherical kriging in the 25214 workface. Therefore, spherical kriging was selected as the optimal interpolation method for 5
−2 coal thickness in this workface.
In addition to the eight methods described above, Bayesian Maximum Entropy (BME) interpolation was employed to estimate the spatial distribution of coal thickness specifically in the channel wave exploration area. BME interpolation was implemented utilizing Seksgui V1.0.9, an open-source MATLAB R2020a-based software package available at Spatiotemporal Epistematics Knowledge Synthesis Graphical User Interface. Construction of “soft data” is central to BME interpolation. In this study, interval “soft data” were constructed based on the error analysis of channel wave exploration. Specifically, considering a 20% error in the retrieved coal thickness near roadway exposures, interval “soft data” [lb, up] were generated. For instance, a measured coal thickness of 4.6 m yields corresponding “soft data” of [3.68 m, 5.52 m], thus offering comprehensive input for the BME interpolation.
Eight interpolation methods, including kriging, IDW, and RBF, cannot accommodate data uncertainty during the interpolation process. Therefore, when interpolating coal thickness, data from channel wave inversion, borehole data, and mine roadway data were incorporated as input sources. The MAE and RMSE values for the nine interpolation methods are lower (
Table 6) accordingly with the uniform distribution and gridded output of the geophysically inverted coal thickness data. The BME method has the highest accuracy but consumes more time and memory than the other eight algorithms, which is determined by the computational complexity of the BME algorithm itself. The time consumption is 83.57 s and the memory consumption is 387 KB.
Figure 18 illustrates that BME interpolation offers greater precision in areas where the coal seam thins and in the vicinity of the maximum coal thickness of 7.04 m, effectively capturing localized differences in the coal seam. This enhanced performance is attributed to the inability of the other eight methods to utilize coal thickness data uncertainty; thus, these data are treated as “Hard Data” during interpolation, leading to greater discrepancies between interpolated and observed values. BME interpolation, in contrast, incorporates both “Hard Data” and “Soft Data”, utilizing uncertainty intervals and analyzing interpolation weights at various locations to achieve an optimal solution. Accordingly, BME interpolation was chosen for accurate coal thickness predictions in the channel wave exploration area, while spherical kriging was applied to predict coal thickness in other sections of the 25214 workface.
According to the results of BME parameter sensitivity experiments (
Figure 19), the accuracy of the model (MAE and RMSE) tends to be stable when the number of neighbor points exceeds 5, and it performs best at 10 neighbor points. This result shows that there is a sensitivity between the interpolation accuracy of the BME method and the number of neighbor points, and in this experiment, 10 neighbor points were selected as the best balance points.
The BME method considers both soft data and hard data in the process of interpolation, and uses these data to model uncertainty. Due to the large number of points in the in-seam wave inversion data, selecting 10 neighbor points can effectively capture the spatial variation of the data and provide enough local information for accurate interpolation. After this point, adding more neighbor points has little effect on the accuracy improvement, indicating that the BME method has a limited demand for the number of neighbor points, and 10 neighbor points are sufficient to ensure the stability and accuracy of the model.
In the actual recovery process of the trough wave exploration area, three measurement lines were conducted at distances of 2409 m, 2515 m, and 2632 m from the starting recovery direction to measure the coal thickness, totaling 39 actual coal thickness measurement points to validate the interpolation results (
Figure 20).
Table 7 provides a quantitative error analysis comparing BME with other interpolation methods (such as spherical, exponential, Gaussian, linear, IDW, TPS, RS, HS) against the actual coal seam thickness measurement points. Among the minimum errors, Gaussian kriging and linear kriging performed the best with 0.00 m, followed by BME, exponential kriging, TPS-RBF, RS-RBF, and HS-RBF methods with 0.01 m. In terms of maximum error, the BME method performed the best with 0.57 m, while the other methods ranged between 0.81 m and 1.86 m. For average error, the BME method also performed the best with 0.20 m, while the other methods slightly lagged behind BME, ranging from 0.25 m to 0.35 m. These results indicate that BME has high accuracy in predicting coal seam thickness and can effectively integrate soft and hard data information for predictions
The BME and spherical kriging interpolation results were combined to create a 3D geological model of the 5
−2 coal seam (
Figure 21). The yellow area represents the BME interpolation domain, while the gray surface depicts the spherical kriging results. The BME results lie beneath the gray surface, offering a more accurate representation in the coal seam thinning zone.
5. 3D Geological Modeling of 25214 Workface
Three-dimensional geological modeling hinges on the accurate mapping of stratum boundaries. Following the experimental results detailed in Chapter 3, a sequence analysis of the 25214 workface terrain, strata, and coal seam was conducted, resulting in a high-precision 3D geological model. Leveraging optimal interpolation, the layer model was developed according to the sequence presented in
Figure 22. The layer was discretized into a 10 × 10 m mesh, triangulated utilizing the shortest distance method. This approach facilitated a smooth and accurate representation of missing strata areas. In
Figure 23, the triangular mesh between the upper and lower boundaries of the 5
−2 coal seam appears denser and more constricted. This is attributed to the grid splicing employed for the 5
−2 coal seam, where the higher grid density in the trough wave exploration zone created a mismatch of grid nodes between upper and lower levels. This mismatch resulted in a denser triangular network upon connection of the upper and lower levels. While in practical application, separate fine models for coal seams and strata may be generated, or the stratum grid resolution could match the coal seam’s, the focus of this study is on applying high-precision interpolation methods to 3D geological modeling, and therefore this aspect is not further optimized here.
6. Discussion
In this study, nine interpolation methods were applied to the spatial interpolation of surface elevation, topography, and coal seam thickness data, with cross-validation to select the best method. A novel approach for redrawing zero lines based on negative virtual thickness values was introduced, addressing the challenge of defining missing stratigraphic regions. For coal seam thickness, a BME interpolation method was proposed, integrating “soft data” and “hard data” from channel wave exploration, boreholes, and mine roadway data, enabling more accurate identification of abnormal coal thickness variations, which were effectively visualized in the 3D model.
The advantage of this study lies in the selection of diverse interpolation methods, including kriging, IDW, RBF, and BME. Compared to other studies, such as Ding et al. [
39] and Che et al. [
28], which focused on kriging methods for coal seam interpolation, the methods in this paper demonstrate higher accuracy. While these studies focused on entire mine coal seams, the present study is limited to the mining face, allowing for more detailed results. Additionally, the integration of soft and hard data through BME offers improved results compared to traditional methods like kriging and Bayesian kriging (BK), particularly in handling data gaps and inconsistencies [
42].
However, this study also has limitations. The in-seam exploration area does not cover the entire workface, and the method performs better with evenly distributed and large volumes of data. BME has also been shown to perform well with only soft data [
43]. Future research should explore enhanced soft data construction methods, similar to probabilistic soft datasets, and apply machine learning to improve interpolation accuracy, especially as data sources increase. At present, small datasets in mining environments may cause machine learning models to overfit, but this approach remains worth exploring [
44,
45].
In geologically complex regions, such as fault zones and fractured coal seams, traditional interpolation methods (e.g., kriging, IDW, RBF) often fail to capture local characteristics due to spatial heterogeneity and geological discontinuities. The BME method mitigates this by integrating soft and hard data, allowing for more accurate interval estimates for missing data and reducing errors caused by geological discontinuities. However, BME may face challenges in highly discontinuous environments, particularly in fault zones, where interpolation accuracy may decline. Future improvements could involve increasing auxiliary data (e.g., geophysical exploration) and optimizing search radii to limit data influence from across faults, ensuring more accurate results in fault and fracture zones.
In three-dimensional geological modeling, real-time updates are crucial for mining accuracy, particularly for coal seam models. While terrain and strata data are updated less frequently, coal seam models must be continuously updated with real-time data from mining operations. Despite its high accuracy, BME’s computational complexity can cause delays in real-time updates. However, coal seam changes are generally stable, except in areas with abnormal geological structures (such as fault zones). In stable conditions, the model updated during mining progress can provide valuable guidance for subsequent mining cuts, especially as mining typically advances 8–12 cuts per day.
To mitigate these delays, future research could focus on accelerating the BME calculation process via parallel computing or GPU acceleration. Additionally, machine learning could be applied to predict coal thickness trends, enabling faster and more precise real-time updates.
Although this article studies the mining operations that are currently being extracted, the proposed interpolation methods are also highly applicable to closed coal mines, such as those in Europe. In such areas, geological data remain critical for monitoring geological hazards (e.g., subsidence, surface water leakage) and environmental concerns (e.g., groundwater contamination, wastewater treatment, ecological restoration). Three-dimensional geological models, reconstructed from historical data using BME, can aid in assessing disaster risks and guiding environmental remediation efforts. For example, accurate coal seam data can support groundwater flow and subsidence monitoring, providing essential data for long-term environmental management after mine closure. Furthermore, integrating soft and hard data through BME can help evaluate environmental risks and support mine reclamation and pollution remediation.
Thus, while real-time updates may not be as crucial in closed mines, updating three-dimensional geological models based on historical data still provides valuable insights for disaster monitoring, environmental protection, and reclamation efforts. In particular, regions like Europe, with closed coal mines, will benefit from these advanced modeling techniques in post-mining environmental restoration and management.
7. Conclusions
Based on the statistical characteristics and spatial distribution features of the topographic elevation, strata, and coal seam thickness data within a 300 m range near the 25214 workface of the Hongliulin coal mine, the accuracies of nine interpolation methods were evaluated through cross-validation. Among these, the topographic and strata data were subjected to eight interpolation methods, while the coal seam thickness interpolation included an additional BME method. A method for accurately delineating areas with missing strata was proposed, and a fine three-dimensional geological model of the 25214 workface was established using the optimal interpolation methods for each layer.
Interpolation Results for Terrain Elevation Points: For the terrain elevation data, which is evenly distributed and dense, the performance of eight interpolation methods, including kriging, IDW, and RBF, was generally satisfactory. Among these methods, linear kriging exhibited the highest accuracy (MAE = 1.01 m, RMSE = 1.20 m), capturing the local variations in terrain elevation most effectively.
Interpolation Results for Stratigraphic Thickness Data: For stratigraphic thickness data with sparse sampling points, the optimal interpolation methods for five strata are as follows: For Q4 stratum, spherical kriging (MAE = 2.13 m, RMSE = 2.83 m); for N2b stratum, IDW (p = 2) (MAE = 2.08, RMSE = 2.44); for J2y3 stratum, RS-RBF (MAE = 0.89 m, RMSE = 1.05 m); for J2y2 stratum, TPS-RBF (MAE = 1.96 m, RMSE = 2.25 m); and for J2y1 stratum, HS-RBF (MAE = 2.36 m, RMSE = 2.71 m).
Method for Handling Missing Stratigraphic Areas: A novel method for addressing missing stratigraphic regions was proposed. By virtually redrawing the zero line of stratum thickness based on negative virtual thickness values, a 3D layer model was constructed. Grid intersection techniques were then employed to eliminate gaps within the model, enabling the accurate definition of missing stratigraphic regions in the 3D geological model.
Precise Prediction Interpolation Method for Coal Seam Thickness: A precise interpolation method for predicting coal seam thickness based on Bayesian Maximum Entropy (BME) was proposed. This method integrates hard data from boreholes and mine roadway measurements with soft data from channel wave exploration. The interpolation accuracy after integration (MAE = 0.64 m, RMSE = 0.66 m) is significantly superior to that of the other eight methods. Compared to the real 5−2 coal thickness measurement points, the BME method shows an error range between 0.01 and 0.57 m, with an average error of 0.20 m, demonstrating the highest accuracy.
Construction of a High-Precision 3D Geological Model: By employing the optimal interpolation methods identified for terrain, individual strata, and coal seams, a high-precision 3D geological model of the coal mining face was constructed using TIN grid modeling. This model provides reliable 3D geological support for intelligent coal mining operations.