A 3D Geological Modeling Method Using the Transformer Model: A Solution for Sparse Borehole Data

Hang, Zhenquan; Xue, Tao; Chen, Jianping; Shi, Yujin; Yin, Zehang; Cui, Zijia; Zhou, Guanyun

doi:10.3390/min15030301

Open AccessArticle

A 3D Geological Modeling Method Using the Transformer Model: A Solution for Sparse Borehole Data

by

Zhenquan Hang

¹,

Tao Xue

^2,3,*,

Jianping Chen

¹,

Yujin Shi

⁴,

Zehang Yin

⁵,

Zijia Cui

¹ and

Guanyun Zhou

¹

School of Earth Science and Resources, China University of Geosciences (Beijing), Beijing 100083, China

²

School of Information Engineering, China University of Geosciences (Beijing), Beijing 100083, China

³

Frontiers Science Center for Deep-Time Digital Earth, China University of Geosciences (Beijing), Beijing 100083, China

⁴

Shanghai Geological Engineering Exploration, Shanghai 200436, China

⁵

Faculty of Science, The University of Sydney, Sydney, NSW 2006, Australia

^*

Author to whom correspondence should be addressed.

Minerals 2025, 15(3), 301; https://doi.org/10.3390/min15030301

Submission received: 27 January 2025 / Revised: 27 February 2025 / Accepted: 13 March 2025 / Published: 15 March 2025

(This article belongs to the Special Issue Application of Big Data Mining, Machine Learning and Artificial Intelligence in Geoscience, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

:

Three-dimensional (3D) geological models are essential for geological analysis and mineral resource estimation. Although conventional on-site survey methods, such as boreholes, provide local engineering geological information for 3D geological modeling, accurately predicting strata in areas with sparse borehole data remains a challenge. This study proposes a 3D geological modeling method using the Transformer model under the conditions of sparse borehole data. First, a K-dimensional tree was used to identify boreholes adjacent to the target point, and a borehole context sequence was constructed using stratigraphic information from neighboring boreholes. Subsequently, the relationship between the target point and its adjacent borehole sequence was calculated using the multi-head attention mechanism of the Transformer model. Finally, trained Transformer encoders were used to predict the stratigraphic category of the target point, and the normalized information entropy was used to quantify uncertainty during the modeling process. Experimental results showed that the accuracy of the method was 0.86, outperforming the accuracy and uncertainty of a recurrent neural network. The root mean square error is smaller than the inverse distance weight and Kriging. Compared to other methods, the proposed method can more accurately describe the geometric shape and distribution of geological bodies and reveal the sedimentary laws of the study area.

Keywords:

3D geological modeling; transformer model; sparse data; uncertainty quantification

1. Introduction

In engineering geology, 3D geological models constructed using 3D geological modeling technologies are very helpful for geological analysis and the quantitative estimation of mineral resources [1,2]. Boreholes provide accurate, direct, and detailed information on stratigraphic distribution, which is commonly used in 3D geological modeling [3,4]. However, drilling techniques are typically limited by budget and terrain constraints [5,6]. Although various 3D geological modeling methods have been developed over the past three decades, performing 3D geological modeling with sparse borehole data remains a major challenge [7,8].

The existing 3D geological modeling methods can be divided into deterministic modeling methods and stochastic modeling methods (Table 1). On the one hand, deterministic modeling methods can be divided into explicit modeling and implicit modeling. Explicit modeling allows geologists to directly contribute and fully utilize geological knowledge. However, explicit modeling requires extensive interactive modifications and is labor intensive [9,10]. Implicit modeling can automate the modeling process [11]. Implicit modeling may not optimally use available data or include sufficient geological constraints to produce plausible models with consistent geological relevance [12]. For cases with sparse data, both explicit and implicit modeling rely on interpolation techniques as solutions. Common interpolation methods include discrete smooth interpolation [13], inverse distance weighting (IDW) [14], spline interpolation [15], and Kriging [16,17]. The spline and discrete smooth interpolation methods assume data smoothness, are subjective in parameter selection, and share the limitation of being isotropic interpolation techniques. In IDW, a distance attenuation parameter is uniformly applied across the study area without considering the spatial variability of the data [18]. As a typical geostatistical interpolation method, Kriging not only captures the spatial distance relationships between sample and interpolation points but also reflects the spatial variability of the data by creating a variogram. However, this method has high requirements for the distribution of data assumptions and depends on expert-driven parameter selection, which is subjective and limited [19,20]. In practical applications, borehole data are typically densely sampled along the borehole line and do not necessarily follow spherical, cubic, or exponential distribution models [21,22]. On the other hand, the stochastic modeling method can generate multiple numerical geographic domain models representing the inherent uncertainties of these geographic domains, which can better describe the spatial random distribution and uncertainty characteristics of geological blocks and geological region boundaries [23]. Stochastic modeling methods mainly include coupled Markov chain (CMC), stochastic Markov random field (MRF), Gaussian simulation, and multi-point statistics (MPS). With CMC and MRF, it is difficult to determine the probability of the transition matrix, which has a certain subjectivity [24,25,26]. Gaussian simulation has a limited ability to represent complex geological features. MPS uses training images to simulate correlations between multiple points, making it suitable for describing complex geological phenomena. However, MPS faces challenges when it is difficult to obtain complete and reliable 3D training images [27,28,29].

Three-dimensional geological modeling based on deep learning has recently emerged as an area of active research. Studies on deep neural networks [30,31], convolutional neural networks (CNNs) [32,33], recurrent neural networks (RNNs) [19], graph neural networks (GNNs) [34,35], and generative adversarial networks (GANs) [8] have demonstrated the effectiveness of various deep learning algorithms in 3D geological modeling. In neural networks, it is not necessary to calculate the experimental variogram, and nonlinear features can be captured globally by learning the basic interrelationships within the data set. These advantages make deep learning a practical alternative for dealing with sparse data issues [36,37]. Many studies have explored deep learning methods to deal with data sparsity. Guo et al. [31] proposed a semi-supervised learning method based on pseudo-labels to generate pseudo-labels for unlabeled data, thereby supplementing high-confidence predictions for further training. Lyu et al. [8] developed a multi-scale GAN method for developing 3D underground geological models from limited borehole data and 3D training images that represent prior geological knowledge. The Transformer model, developed by Google researchers in 2017 [38], has been recognized as the fourth largest category of deep learning models after multilayer perceptron, CNNs, and RNNs [39]. Initially designed for machine translation, it is the foundation for various models such as BERT and generative pre-trained transformer (GPT), showing excellent performance in image classification and natural language processing [40,41]. The Transformer model uses a unique self-attention mechanism to simulate long-term dependencies and global relationships in the data, dynamically weighing the importance of different parts of the input, which is particularly beneficial for tasks that need to understand the global context [38,39]. Previous studies have demonstrated the effectiveness of the Transformer model in solving sparse data problems. Based on this model, Feng et al. [42] effectively extracted internal correlations between multiple traffic parameters with sparse data, while Tiwari et al. [43] used the Transformer model to quickly estimate diffusion tensor parameters from sparse measurements.

This study proposes a method for 3D geological modeling using the Transformer model under the conditions of sparse borehole data. First, the borehole data were preprocessed, and the K-dimensional tree (KD-tree) was used to identify boreholes adjacent to the target point. The borehole context sequence was then constructed using stratigraphic information from adjacent boreholes. Subsequently, feature relationships between the target point and its adjacent borehole sequence were calculated using the unique multi-head attention mechanism of the Transformer model. Finally, Transformer encoders were used to predict the stratigraphic category at the target point, and normalized information entropy was used to quantitatively evaluate the uncertainty during the modeling process. The remainder of this manuscript is organized as follows: Section 2 describes details of the proposed method; Section 3 introduces the data preparation and experimental results; Section 4 discusses the results; Section 5 presents the conclusions.

2. Materials and Methods

Compared to image data, borehole data exhibit clustering characteristics, with local concentration and overall dispersion. In this study, a method of 3D geological modeling using the Transformer model under conditions of sparse borehole data is proposed. The algorithm flowchart is shown in Figure 1. This method predicts the stratigraphic classification of the target point by performing data preprocessing, building KD-tree, constructing borehole context sequences, and training the Transformer model. Among them, KD-tree is used to efficiently organize borehole data, borehole context sequences are used to provide features of adjacent boreholes, and the Transformer model is used to identify complex relationships between borehole context sequences.

2.1. Data Preprocessing

The borehole data include the borehole identifier (ID), location coordinates (X and Y), stratum category, and the start and end elevations of each stratum. The preprocessing of borehole data was conducted as follows:

(1): Data cleaning: This step checked for errors in each borehole’s data. For example, if the thickness of a stratum was zero, the stratum was removed. Similarly, if the start elevation of a stratum was less than the end elevation of the stratum, it was considered an error, and the stratum was removed.
(2): Data normalization: Considering the large difference in the number of bits between the X and Y coordinates and the elevation, the coordinates were normalized to the range [0, 1]. The normalization formula is shown in Equation (1):

I_{0} = \frac{I_{i} - I_{m i n}}{I_{m a x} - I_{m i n}}

(1)

where I₀ is the normalized value, I_i is the original value, and I_max and I_min are the maximum and minimum values of the coordinates, respectively.

(3): Data encoding: As the stratum name was a character string, a mapping between the string and the integer was created, and the string was encoded as an integer label for processing by the classifier. To prevent data leakage, the preprocessed borehole data were divided into three parts according to the borehole ID, in a specific proportion, to construct the training set, validation set, and test set.

2.2. Construction of KD-Tree

To facilitate the search for boreholes adjacent to the target point, a KD-tree was constructed based on the X and Y coordinates of each borehole. The KD-tree is a data structure widely used in computer science and computational geometry. It divides the data space based on the number of dimensions of each data point, enabling efficient organization and search of points in the multidimensional data space [44,45,46]. The KD-tree constructed in this study had two dimensions, these being the X and Y coordinates of the boreholes. During the construction, each node represented a data point, and the data set was recursively divided into subsets according to the dimensions. Equation (2) represents the dimension selection when the dimension k of the KD-tree is 2.

s p l i t_d i m (d) = \{\begin{matrix} x, i f d m o d 2 = 0 \\ y, i f d m o d 2 = 1 \end{matrix}

(2)

where d is the depth of KD-tree, split_dim(d) represents the dimension of segmentation, and d mod 2 represents the remainder of d divided by 2. When split_dim(d) = x, it represents that the left and right subtrees of the depth are divided according to the X coordinate of the borehole. When split_dim(d) = y, it represents that the left and right subtrees of the depth are divided according to the Y coordinate of the borehole.

The data on dimension split_dim(d) are arranged from small to large to obtain the data set {

A_{1}

,

A_{2}

,

A_{3}

,…,

A_{n}

}, where

A_{1}

<

A_{2}

<

A_{3}

<…<

A_{n}

and n is the number of data points on dimension split_dim(d). According to Equation (3), the median of the dimension split_dim(d) can be calculated.

m = \{\begin{matrix} A_{\frac{n + 1}{2}}, n = 2 l + 1 \\ \frac{A_{\frac{n}{2}} + A_{\frac{n}{2} + 1}}{2}, n = 2 l \end{matrix}

(3)

where m represents the median in the current dimension, n = 2l + 1 represents n being odd, and n = 2l represents n being even.

For each partition, the median of the current dimension was selected as the partition point, dividing the data set into two parts: the left subtree contained all coordinates smaller than the point, while the right subtree contained all coordinates greater than the point. Equation (4) is a formula for dividing subtrees.

\{\begin{matrix} S_{l e f t} = {s p l i t_d i m (d) < m} \\ S_{r i g h t} = {s p l i t_d i m (d) > m} \end{matrix}

(4)

where

S_{l e f t}

is a left subtree divided by m and

S_{r i g h t}

is a right subtree divided by m.

For each subset, the next dimension was recursively selected, and the division continued based on the median until each leaf node contained only one data point. Through this recursive space division, the KD-tree could effectively organize data, enabling nearest neighbor queries to be completed in logarithmic time.

2.3. Construction of Borehole Context Sequence

To fully exploit the stratigraphic relationship between adjacent boreholes, this study constructed multiple context sequences for each borehole. A given elevation in a borehole was treated as the target point, and the KD-tree was used to query the nearest neighboring borehole ID to the target point. Each nearest neighbor borehole ID was then traversed to extract the corresponding stratigraphic records, and features (including stratum ID, start and end elevations of the stratum, and X and Y coordinates) were added to the sequence list. The spatial information of the target point was subsequently added to the end of the sequence, and the stratum category of the target point was used as the prediction target for the Transformer model. For example, when the number of adjacent boreholes was equal to three (K = 3), the 1st stratum (Figure 2a), the 2nd stratum (Figure 2b), the 3rd stratum (Figure 2c), and the 4th stratum (Figure 2d) of the training borehole T were treated as the target points to construct the borehole context sequence. The borehole context sequence for borehole T, with the 1st stratum as the target point, is shown in Figure 3. This sequence contains the stratigraphic features of adjacent boreholes A, B, and C, as well as the spatial characteristics of the target point. After constructing all the borehole context sequences, the maximum length of each sequence was calculated to determine the filling length, and each sequence was then filled to the maximum length to ensure uniformity across all sequences, facilitating batch training.

2.4. The Training of the Transformer Model

The original Transformer model [38] design includes both encoders and decoders. In common generation tasks, such as machine translation, the input is a sentence, and the output is a translated sentence. In this case, the decoder is used to gradually generate the output sequence, relying on the context information produced by the encoder. In our study, the stratum classification task was a classification problem. The task was to classify the stratum based on the adjacent borehole context information of the target point, rather than generating a new sequence. Therefore, the model only needed to use the Transformer encoders (as shown in Figure 4) to encode the input data, extract useful features, and output the probability of the target point belonging to each category.

To extract the characteristic relationships between the target point and its adjacent borehole stratigraphic sequence, it was necessary to calculate the self-attention mechanism. The calculation of self-attention can be divided into the following steps. First, each point in the input sequence was mapped to three different representations with dimensions through the input embedding layer, which were called query (Q), key (K), and value (V). Subsequently, the Q vector was matched with all K vectors using dot product multiplication. The resulting matrix was scaled and passed through the softmax function to obtain the attention scores between different points in the sequence. Finally, the attention score was multiplied by the V vector to generate a new representation of the input sequence. Essentially, the self-attention mechanism modified the representation of each point by considering the weighted contribution of all other points in the sequence. This allowed distant points in the sequence to focus on each other’s values and share important information that may otherwise be overlooked. The self-attention mechanism is represented by Equation (5) [38]:

Attention (Q, K, V) = soft m a x (\frac{Q K^{T}}{\sqrt{d_{k}}}) V

(5)

where Q, K, and V are the three matrices used for calculating self-attention, and d_k is the dimension of the K matrix.

The multi-head attention mechanism allowed the model to focus on the self-attention mechanism of different subspaces. Under the condition that the parameter quantity was generally unchanged, multi-head attention divided the three parameters, Q, K, and V, into multiple groups. Each group was mapped to different subspaces in the high-dimensional space to calculate the self-attention weights, enabling the model to focus on different parts of the input. After performing multiple parallel calculations, the self-attention information in all subspaces was merged. As self-attention was distributed differently across subspaces, multi-head attention was seeking for associations from different perspectives of the input data, making it possible to encode multiple relationships and subtle differences. The multi-head attention formula [38] is shown in Equation (6):

\{\begin{matrix} M u l t i H e a d (Q, K, V) = C o n c a t ({head}_{1}, {head}_{2}, \dots, {head}_{h}) W^{o} \\ {head}_{i} = A t t e n t i o n (Q W_{i}^{Q}, K W_{i}^{K}, V W_{i}^{V}) \end{matrix}

(6)

Taking one of the heads in the multi-head self-attention mechanism as an example, the process of calculating self-attention is shown in Figure 5. The Transformer model iterates through each position in the borehole context sequence, computing self-attention from the 1st, 2nd, 3rd… to the N-th position. As there may be long-distance dependencies between different stratigraphic categories, the multi-head self-attention mechanism can help the model identify complex associations across multiple strata.

As the self-attention mechanism is independent of the order in the input sequence, position encoding was added to track the location of different points. Position encoding was typically introduced using sine and cosine functions, as shown in Equation (7) [38]:

\{\begin{matrix} P E_{(pos, 2 i)} = s i n (p o s / 10000^{2 i / d}) \\ P E_{(pos 2 i + 1)} = c o s (p o s / 10000^{2 i / d}) \end{matrix}

(7)

where pos denotes the position in the sequence, d represents the dimension of the position encoding, 2i denotes the even dimension, and 2i + 1 denotes the odd dimension.

In the actual training process, batch training can be used to improve model efficiency, with each batch processing a sequence of multiple target points. The hyperparameters of the model, such as embedding dimension, the number of heads of multi-head attention, the number of stacked layers of the encoder, and the learning rate, were adjusted by the grid search method. The standard cross-entropy loss function was used as the loss function. The use of the Adam optimizer can well adapt to the training of the Transformer model. After training, model performance was evaluated on the test set using conventional classification evaluation indicators including confusion matrix, receiver operating characteristic (ROC) curve, accuracy, precision, recall, F1 score, and Kappa coefficient.

accuracy = \frac{T P + T N}{T P + T N + F P + F N}

(8)

precision = \frac{T P}{T P + F P}

(9)

r ecall = \frac{T P}{T P + F N}

(10)

F 1 score = 2 \times \frac{p recision \times r ecall}{p recision + r ecall}

(11)

K a p p a = \frac{P_{0} - P_{c}}{1 - P_{c}}

(12)

p_{0} = \frac{\sum_{i = 1}^{n} x_{i i}}{N}, p_{c} = \frac{\sum_{i = 1}^{n} x_{i +} x_{+ i}}{N^{2}}

(13)

where True Positive (TP) represents the number of samples that the model correctly predicts as positive; True Negative (TN) represents the number of samples that the model correctly predicts as negative; False Positive (FP) represents the number of negative samples that the model incorrectly predicts as positive; and False Negative (FN) represents the number of positive samples that the model incorrectly predicts as negative.

x_{i i}

denotes the elements on the diagonal of the confusion matrix,

x_{i +}

denotes the sum of all elements in row i,

x_{+ i}

denotes the sum of all elements in column i, and N denotes the sum of all elements.

p_{0}

represents the proportion of observation accuracy or the consistency unit;

p_{c}

denotes the proportion of coincident or expected coincident units.

2.5. Model Prediction and Uncertainty Analysis

The prediction process involved dividing the study area into a large number of grids, finding K adjacent boreholes for each grid, and constructing borehole context sequences based on the stratigraphic information of adjacent boreholes. The borehole context sequence for all grids in the study area was then input into the trained Transformer model to predict the stratigraphic category of each grid (as shown in Figure 6).

The softmax layer outputs the probability that each grid unit belongs to a specific stratum category. In this study, the normalized information entropy of each grid unit was calculated by combining the probability distribution to quantify the uncertainty in the prediction modeling process. The calculation of the normalized information entropy is shown in Equation (14):

H (X) = - \frac{\sum_{x \in S} p (x) l n (p (x))}{S_{m a x}}

(14)

where S is the possible stratum category of each target point, S_max is equal to ln (n), and n is the number of possible stratigraphic categories. The information entropy of each data point was obtained by calculating the probability p(x) of each target point across all stratum categories. The magnitude of the information entropy reflected the complexity of a certain position in the geological model. The closer the information entropy was to 0, the higher the certainty that the data point belonged to a specific stratum category. On the contrary, the closer the information entropy was to 1, the higher the uncertainty.

3. Experiments and Results

3.1. Experiments

The test data consist of 37 boreholes located in Huangpu District, Shanghai, with a depth range of 85.3–134.6 m, an average depth of 103.4 m, a minimum thickness of 0.5 m, and a maximum thickness of 45.8 m. These boreholes are distributed over an area of 5000 × 6200 m (as shown in Figure 7). The engineering geological layer refers to the stratum divided according to geological age, geological origin, material composition, and physical and mechanical properties of rock and soil. The study area was divided into 15 engineering geological layers, with the geological age, sequence number, and lithology of each stratum shown in Table 2. The strata in this area are mainly composed of Quaternary loose sediments, which are located in human activity sites. The stratum is characterized by the frequent formation of lenticles and pinch-outs and is not significantly affected by faults, joints, and other fault structures [3,4]. The borehole data are public data from the website https://data.sigs.cn/sigs-service-platform/#/home (accessed on 5 January 2024).

The experimental environment for this study included PyTorch 2.1.0 and Python 3.10. The device used is an Intel Xeon Platinum 8358P 2.60 GHz CPU and an NVIDIA-A800 GPU. The grid search method was used for parameter optimization, and the optimal values for each parameter are shown in Table 3.

3.2. Results

Table 4 shows that all accuracy indexes for the test set exceeded 0.85. The confusion matrix for the test set classification is shown in Figure 8. The confusion matrix displays the predicted and actual values for each stratum category, reflecting the reliability of the model’s classification results. As seen in Figure 8, the classification accuracy for most layers was very high, with only a few layers being misclassified as adjacent layers. The ROC curve shows the performance of the classifier under different thresholds. The closer the curve is to the upper left corner, the better the prediction performance of the model. The area under the ROC curve (AUC) serves as a comprehensive measure of all potential classification thresholds. AUC values greater than 90% are considered excellent, in the range of 75%–90% they are considered good, in the range of 50%–75% they are poor, and values under 50% represent unacceptable performance [50]. The experimental results (Figure 9) show that the classification performance on the test set is generally above good, with most of the results categorized as excellent.

In this study, the grid size used for 3D geological modeling is 100 m × 100 m × 0.5 m, and the study area is divided into 50 × 62 × 271 grids. Figure 10a shows the results of the 3D geological model, illustrating the geometry and distribution of geological bodies. Figure 10b shows the fence diagrams derived from the model, which presents the stratigraphic coverage relationship inside the model. Figure 11 shows the distribution of some strata on the plane. It can be seen that some strata such as brown–yellow silty clay and grass yellow–gray sandy silt, gray sandy clay are only distributed in some areas, which is consistent with the sedimentary law of the delta area. In addition, Figure 12 shows the proportion of each stratum in the model. Figure 13 shows the comparison between the test set boreholes and the predicted boreholes. The predicted results align with the original boreholes, including the thickness of each stratum and the sedimentary sequence in the original boreholes.

4. Discussion

4.1. Comparison of the Transformer Model and Other Methods

To verify the reliability of the proposed method, this study used IDW, Kriging, and RNN to construct a 3D geological model. Firstly, by comparing the profile (Figure 14c) obtained by the Transformer model with the profiles (Figure 14a,b) obtained by traditional modeling methods (IDW and Kriging), it can be seen that the overall distribution of strata in these profiles is consistent. In detail, the transformer profile is closer to adjacent boreholes in some strata such as gray silty clay and grass yellow–gray sandy silt. In addition, according to Table 5, the root mean square error (RMSE) of the Transformer is smaller than the RMSE of the IDW and Kriging methods. Then, by comparing the profile (Figure 14c) obtained by the Transformer model with the profile (Figure 14d) generated by the RNN, it can be seen that the profile obtained by the Transformer model is consistent with the stratigraphic information from the adjacent boreholes, accurately revealing the stratigraphic coverage relationship and the pinch-out phenomenon. In contrast, the model obtained by the RNN fails to capture the pinch-out phenomenon observed in the nearby boreholes. In addition, the Transformer model outperforms the RNN in terms of accuracy, precision, recall, F1 score, and Kappa coefficient (Figure 15). A possible explanation for this is that during the calculation of the RNN, information with very long interval distances will be lost, preventing the establishment of long-term dependencies in the context [51,52].

4.2. Analysis of Model Uncertainty

For the 3D geological model, the stratum boundary information provided by the borehole data is accurate, while the stratum boundary outside the borehole data area is predicted [31]. In this study, normalized information entropy was introduced to quantify the uncertainty during the modeling process. The visualized 3D normalized information entropy model (Figure 16) shows the uncertainty at different positions within the model and profile. Figure 16 reveals that the uncertainty of the region at the boundaries of the strata is high, as the classifier is susceptible to interference in these areas. In addition, the RNN also shows high uncertainty in the interior of some strata, especially in the strata above −60 m, indicating that the RNN model performs less effectively than the Transformer model. The possible reason is that the thickness of the strata above −60 m is thinner than that below −60 m, and RNN does not capture the complex stratigraphic relationship.

4.3. Advantages and Disadvantages of the Proposed Method

The formation and evolution of geological structures and phenomena are challenging to describe because they are accompanied by a high degree of nonlinearity. Deep learning methods can extract high-order features from complex structures [53,54]. Although borehole data provide accurate stratigraphic distribution information, budget and terrain constraints often limit the availability of borehole data, making accurate stratigraphic prediction in sparse data areas a challenge. Traditional interpolation methods fail to account for the anisotropy between borehole data points. The self-attention mechanism in the Transformer model completely replaces the cyclic layer, enabling the analysis of longer input sequences [55]. While the Transformer model has achieved significant success in natural language processing and other fields, its application to 3D geological modeling is rare, particularly under the conditions of sparse borehole data. By using the Transformer model to extract relationships between different boreholes, the proposed method effectively utilizes limited borehole data. It improves modeling accuracy in sparse data areas and provides a more reliable basis for geological analysis and mineral resource estimation. In many fields, only sparse measurement data can be obtained due to the limitation of measurement equipment, cost considerations, or the difficulty of data acquisition. The idea of the proposed method can be extended to solve the problem of modeling with sparse measurement data; for example, groundwater modeling, medical image modeling, traffic flow prediction, etc.

Although the proposed method can learn complex geological laws from borehole data, effectively integrating the prior knowledge and experience of geologists into the model is still a problem to be solved. A possible solution is to introduce multi-source data such as geological profile data, geophysical data, and geological structure data, and convert these data into virtual boreholes and geological constraints [56,57]. The next research will explore the solution further.

5. Conclusions

In this study, a method for 3D geological modeling using the Transformer model under conditions of sparse borehole data is proposed. The borehole context sequence is constructed by using the stratigraphic information from adjacent boreholes and is input into the Transformer encoders. The model calculates the multi-head self-attention for each part of the borehole context sequence to quantify the relationship between the target point and its adjacent borehole strata. The experimental results show an accuracy of 0.86, with better classification accuracy and lower uncertainty compared to an RNN. In addition, the RMSE of this method is smaller than IDW and Kriging. A comparison with the IDW, Kriging, and RNN results demonstrates that this method more accurately reveals the geometric shape and distribution of the geological body, which aligns with the sedimentary laws of the region. Compared with the deterministic modeling method, this method allows for uncertainty assessment. This method holds significant practical value in geological engineering, mineral resource exploration, and other fields.

Author Contributions

Conceptualization, Z.H., T.X., and J.C.; methodology, Z.H., T.X., and J.C.; software, Z.H. and Z.Y.; validation, Z.H., T.X., and J.C.; formal analysis, Z.H., T.X., and Z.Y.; investigation, Z.H., T.X., and Y.S.; resources, T.X. and Y.S.; data curation, Y.S. and Z.Y.; writing—original draft preparation, Z.H. and T.X.; writing—review and editing, Z.H., T.X., J.C., and G.Z.; visualization, Z.H. and Z.C.; supervision, T.X.; project administration, T.X.; funding acquisition, T.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by The Major Basic Survey Project of Shanghai Municipal Bureau of Planning and Natural Resources: Evaluation and Application of Geological Resources and Environment Survey in Shanghai’s Post-Industrialization Period: SHXM-00-20180425-5255, SHXM-00-20190513-1217, and SHXM-00-2020401-0282; “Deep-time Digital Earth” Science and Technology Leading Talents Team Funds for the Central Universities for the Frontiers Science Center for Deep-time Digital Earth, China University of Geosciences (Beijing): 2652023001.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy.

Acknowledgments

We thank the China University of Geosciences (Beijing) for the computing resource support with the high-performance computing platform.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

3D	Three-dimensional
KD	K-dimensional
IDW	Inverse distance weighting
CMC	Coupled Markov chain
MRF	Markov random field
MPS	Multi-point statistics
CNN	Convolutional neural network
RNN	Recurrent neural network
GAN	Generative adversarial network
GPT	Generative pre-trained transformer
ID	Identifier
ROC	Receiver operating characteristic
AUC	Area under the curve
RMSE	Root mean square error

References

Lyu, M.; Ren, B.; Wu, B.; Tong, D.; Ge, S.; Han, S. A parametric 3D geological modeling method considering stratigraphic interface topology optimization and coding expert knowledge. Eng. Geol. 2021, 293, 106300. [Google Scholar] [CrossRef]
Zhou, G.; Chen, J.; An, W.; Liu, C.; Li, W. Three-dimensional mineral prospectivity mapping based on natural language processing and random forests: A case study of the Xiyu diamond deposit, China. Ore Geol. Rev. 2024, 169, 106082. [Google Scholar] [CrossRef]
Ji, G.; Wang, Q.; Zhou, X.; Cai, Z.; Zhu, J.; Lu, Y. An automated method to build 3D multi-scale geological models for engineering sedimentary layers with stratum lenses. Eng. Geol. 2023, 317, 107077. [Google Scholar] [CrossRef]
Zhu, L.; Zhang, C.; Li, M.; Pan, X.; Sun, J. Building 3D solid models of sedimentary stratigraphic systems from borehole data: An automatic method and case studies. Eng. Geol. 2011, 127, 1–13. [Google Scholar] [CrossRef]
Yeh, C.H.; Lu, Y.C.; Khoshnevisan, S.; Juang, C.H.; Tien, Y.M.; Dong, J.Y. LiDAR-based 3D litho-stratigraphic models calibrated with limited boreholes. Engineering Geology. 2024, 332. [Google Scholar] [CrossRef]
Chen, G.; Zhu, J.; Qiang, M.; Gong, W. Three-dimensional site characterization with borehole data—A case study of Suzhou area. Eng. Geol. 2018, 234, 65–82. [Google Scholar] [CrossRef]
Qiu, Y.; Zhang, N.; Yin, Z.; Wang, Y.; Xu, C.; Zhang, P. Novel multi-spatial receptive field (MSRF) XGBoost method for predicting geological cross-section based on sparse borehole data. Eng. Geol. 2024, 338, 107604. [Google Scholar] [CrossRef]
Lyu, B.; Wang, Y.; Shi, C. Multi-scale generative adversarial networks (GAN) for generation of three-dimensional subsurface geological models from limited boreholes and prior geological knowledge. Comput. Geotech. 2024, 170, 106336. [Google Scholar] [CrossRef]
Hou, W.; Yang, L.; Deng, D.; Ye, J.; Clarke, K.; Yang, Z.; Zhuang, W.; Liu, J.; Huang, J. Assessing quality of urban underground spaces by coupling 3D geological models: The case study of Foshan city, South China. Comput. Geosci. 2016, 89, 1–11. [Google Scholar] [CrossRef]
Kessler, H.; Mathers, S.; Sobisch, H.-G. The capture and dissemination of integrated 3D geospatial knowledge at the British Geological Survey using GSI3D software and methodology. Comput. Geosci. 2009, 35, 1311–1321. [Google Scholar] [CrossRef]
Wang, J.; Zhao, H.; Bi, L.; Wang, L. Implicit 3D Modeling of Ore Body from Geological Boreholes Data Using Hermite Radial Basis Functions. Minerals 2018, 8, 443. [Google Scholar] [CrossRef]
Jessell, M.; Aillères, L.; De Kemp, E.; Lindsay, M.; Wellmann, F.; Hillier, M.; Laurent, G.; Carmichael, T.; Martin, R. Next Generation Three-Dimensional Geologic Modeling and Inversion. In Proceedings of the SEG Conference on Keystone—Building Exploration Capability for the 21st Century, Keystone, CO, USA, 27–30 September 2014. [Google Scholar]
Mallet, J.L. Discrete modeling for natural objects. J. Int. Assoc. Math. Geol. 1997, 29, 199–219. [Google Scholar] [CrossRef]
Liu, H.; Chen, S.; Hou, M.; He, L. Improved inverse distance weighting method application considering spatial autocorrelation in 3D geological modeling. Earth Sci. Informatics 2019, 13, 619–632. [Google Scholar] [CrossRef]
Jātnieks, J.; Popovs, K.; Saks, T. A comprehensive approach to the 3D geological modelling of sedimentary basins: Example of Latvia, the central part of the Baltic Basin. Estonian J. Earth Sci. 2015, 64, 173–188. [Google Scholar] [CrossRef]
Calcagno, P.; Chilès, J.P.; Courrioux, G.; Guillen, A. Geological modelling from field data and geological knowledge Part I. Modelling method coupling 3D potential-field interpolation and geological rules. Phys. Earth Planet. Inter. 2008, 171, 147–157. [Google Scholar] [CrossRef]
von Harten, J.; de la Varga, M.; Hillier, M.; Wellmann, F. Informed Local Smoothing in 3D Implicit Geological Modeling. Minerals 2021, 11, 1281. [Google Scholar] [CrossRef]
Lu, G.Y.; Wong, D.W. An adaptive inverse-distance weighting spatial interpolation technique. Comput. Geosci. 2008, 34, 1044–1055. [Google Scholar] [CrossRef]
Zhou, C.; Ouyang, J.; Ming, W.; Zhang, G.; Du, Z.; Liu, Z. A Stratigraphic Prediction Method Based on Machine Learning. Appl. Sci. 2019, 9, 3553. [Google Scholar] [CrossRef]
Smirnoff, A.; Boisvert, E.; Paradis, S.J. Support vector machine for 3D modelling from sparse geological information of various origins. Comput. Geosci. 2008, 34, 127–143. [Google Scholar] [CrossRef]
Sun, L.; Wei, Y.; Cai, H.; Yan, J.; Xiao, J. Improved Fast Adaptive IDW Interpolation Algorithm based on the Borehole Data Sample Characteristic and Its Application. J. Physics Conf. Ser. 2019, 1284, 012074. [Google Scholar] [CrossRef]
Wang, Y.; Akeju, O.V.; Zhao, T. Interpolation of spatially varying but sparsely measured geo-data: A comparative study. Eng. Geol. 2017, 231, 200–217. [Google Scholar] [CrossRef]
Fouedjio, F.; Scheidt, C.; Yang, L.; Achtziger-Zupančič, P.; Caers, J. A geostatistical implicit modeling framework for uncertainty quantification of 3D geo-domain boundaries: Application to lithological domains from a porphyry copper deposit. Comput. Geosci. 2021, 157, 104931. [Google Scholar] [CrossRef]
Qi, X.-H.; Li, D.-Q.; Phoon, K.-K.; Cao, Z.-J.; Tang, X.-S. Simulation of geologic uncertainty using coupled Markov chain. Eng. Geol. 2016, 207, 129–140. [Google Scholar] [CrossRef]
Gong, W.; Zhao, C.; Juang, C.H.; Tang, H.; Wang, H.; Hu, X. Stratigraphic uncertainty modelling with random field approach. Comput. Geotech. 2020, 125, 103681. [Google Scholar] [CrossRef]
Li, Z.; Wang, X.; Wang, H.; Liang, R.Y. Quantifying stratigraphic uncertainties by stochastic simulation techniques based on Markov random field. Eng. Geol. 2016, 201, 106–122. [Google Scholar] [CrossRef]
Chen, Q.; Liu, G.; Ma, X.; Li, X.; He, Z. 3D stochastic modeling framework for Quaternary sediments using multiple-point statistics: A case study in Minjiang Estuary area, southeast China. Comput. Geosci. 2020, 136, 104404. [Google Scholar] [CrossRef]
Zhao, Y.; Chen, J.; Yang, S.; He, K.; Shimada, H.; Sasaoka, T. A Multi-Point Geostatistical Modeling Method Based on 2D Training Image Partition Simulation. Mathematics 2023, 11, 4900. [Google Scholar] [CrossRef]
Abdollahifard, M.J.; Baharvand, M.; Mariéthoz, G. Efficient training image selection for multiple-point geostatistics via analysis of contours. Comput. Geosci. 2019, 128, 41–50. [Google Scholar] [CrossRef]
Kim, H.-S.; Ji, Y. Three-dimensional geotechnical-layer mapping in Seoul using borehole database and deep neural network-based model. Eng. Geol. 2022, 297, 106489. [Google Scholar] [CrossRef]
Guo, J.; Xu, X.; Wang, L.; Wang, X.; Wu, L.; Jessell, M.; Ogarko, V.; Liu, Z.; Zheng, Y. GeoPDNN 1.0: A semi-supervised deep learning neural network using pseudo-labels for three-dimensional shallow strata modelling and uncertainty analysis in urban areas from borehole data. Geosci. Model Dev. 2024, 17, 957–973. [Google Scholar] [CrossRef]
Bi, Z.; Wu, X.; Li, Z.; Chang, D.; Yong, X. DeepISMNet: Three-dimensional implicit structural modeling with convolutional neural network. Geosci. Model Dev. 2022, 15, 6841–6861. [Google Scholar] [CrossRef]
Jessell, M.; Guo, J.; Li, Y.; Lindsay, M.; Scalzo, R.; Giraud, J.; Pirot, G.; Cripps, E.; Ogarko, V. Into the Noddyverse: A massive data store of 3D geological models for machine learning and inversion applications. Earth Syst. Sci. Data 2022, 14, 381–392. [Google Scholar] [CrossRef]
Hillier, M.; Wellmann, F.; Brodaric, B.; de Kemp, E.; Schetselaar, E. Three-Dimensional Structural Geological Modeling Using Graph Neural Networks. Math. Geosci. 2021, 53, 1725–1749. [Google Scholar] [CrossRef]
Han, S.; Zhang, Y.; Wang, J.; Tong, D.; Lyu, M. Graph neural network-based topological relationships automatic identification of geological boundaries. Comput. Geosci. 2024, 188, 105621. [Google Scholar] [CrossRef]
Battalgazy, N.; Valenta, R.; Gow, P.; Spier, C.; Forbes, G. Addressing Geological Challenges in Mineral Resource Estimation: A Comparative Study of Deep Learning and Traditional Techniques. Minerals 2023, 13, 982. [Google Scholar] [CrossRef]
Zhou, Y.Z.; Wang, J.; Zuo, R.G.; Xiao, F.; Shen, W.J.; Wang, S.G. Machine learning, deep learning and Python language in field of geology. Acta Petrologica Sinica. 2018, 34, 3173–3178. [Google Scholar]
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. In Proceedings of the 31st Annual Conference on Neural Information Processing Systems (NIPS), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
Wei, J.; Wang, Z.; Li, Z.; Li, Z.; Pang, S.; Xi, X.; Cribb, M.; Sun, L. Global aerosol retrieval over land from Landsat imagery integrating Transformer and Google Earth Engine. Remote Sens. Environ. 2024, 315, 114404. [Google Scholar] [CrossRef]
Zhao, Y.; Zhang, J.; Zong, C. Transformer: A General Framework from Machine Translation to Others. Mach. Intell. Res. 2023, 20, 514–538. [Google Scholar] [CrossRef]
Li, Y.H.; Yao, T.; Pan, Y.W.; Mei, T. Contextual Transformer Networks for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 1489–1500. [Google Scholar] [CrossRef]
Feng, R.; Li, Z.; Liu, B.; Ding, Y. A Joint Spatiotemporal Prediction and Image Confirmation Model for Vehicle Trajectory Concatenation with Low Detection Rates. IEEE Trans. Intell. Transp. Syst. 2024, 25, 11701–11715. [Google Scholar] [CrossRef]
Tiwari, A.; Singh, R.K.; Shigwan, S.J. SwinDTI: Swin transformer-based generalized fast estimation of diffusion tensor parameters from sparse data. Neural Comput. Appl. 2023, 36, 3179–3196. [Google Scholar] [CrossRef]
Kakde, H.M. Range Searching Using kd Tree; Florida State University: Tallahassee, FL, USA, 2005. [Google Scholar]
Anzola, J.; Pascual, J.; Tarazona, G.; Crespo, R.G. A Clustering WSN Routing Protocol Based on k-d Tree Algorithm. Sensors 2018, 18, 2899. [Google Scholar] [CrossRef] [PubMed]
Tiwari, V.R. Developments in KD tree and KNN searches. Int. J. Comput. Appl. 2023, 975, 8887. [Google Scholar] [CrossRef]
Castellucci, G.; Bellomaria, V.; Favalli, A.; Romagnoli, R. Multi-lingual intent detection and slot filling in a joint bert-based model. arXiv 2019, arXiv:1907.02884. [Google Scholar]
Lan, L.; Zhang, Q.; Zhu, W.; Ye, G.; Shi, Y.; Zhu, H. Geotechnical characterization of deep Shanghai clays. Eng. Geol. 2022, 307, 106794. [Google Scholar] [CrossRef]
Shi, Y.J.; Yan, X.X.; Wang, J.H.; Fang, Z.; Li, B. Evaluation of Engineering Geological Condition in Shanghai Coastal Area. In Proceedings of the International Symposium on Coastal Engineering Geology (ISCEG), Shanghai, China, 20–21 September 2012; Tongji University: Shanghai, China, 2012. [Google Scholar]
Ray, P.; Le Manach, Y.; Riou, B.; Houle, T.T.; Warner, D.S. Statistical Evaluation of a Biomarker. Anesthesiology 2010, 112, 1023–1040. [Google Scholar] [CrossRef]
Tian, T.L.; Song, C.; Ting, J.; Huang, H.Y. A French-to-English Machine Translation Model Using Transformer Network. In Proceedings of the 8th International Conference on Information Technology and Quantitative Management (ITQM)—Developing Global Digital Economy after COVID-19, Chengdu, China, 9–11 July 2021. [Google Scholar]
Huang, Z.; Wang, P.; Wang, J.; Miao, H.; Xu, J.; Zhang, P. Improving Transformer Based End-to-End Code-Switching Speech Recognition Using Language Identification. Appl. Sci. 2021, 11, 9106. [Google Scholar] [CrossRef]
Yang, Z.; Chen, Q.; Cui, Z.; Liu, G.; Dong, S.; Tian, Y. Automatic reconstruction method of 3D geological models based on deep convolutional generative adversarial networks. Comput. Geosci. 2022, 26, 1135–1150. [Google Scholar] [CrossRef]
Fan, W.; Liu, G.; Chen, Q.; Cui, Z.; Yang, Z.; Huang, Q.; Wu, X. Geological model automatic reconstruction based on conditioning Wasserstein generative adversarial network with gradient penalty. Earth Sci. Informatics 2023, 16, 2825–2843. [Google Scholar] [CrossRef]
Castangia, M.; Grajales, L.M.M.; Aliberti, A.; Rossi, C.; Macii, A.; Macii, E.; Patti, E. Transformer neural networks for interpretable flood forecasting. Environ. Model. Softw. 2023, 160, 105581. [Google Scholar] [CrossRef]
Zhu, L.-F.; Li, M.-J.; Li, C.-L.; Shang, J.-G.; Chen, G.-L.; Zhang, B.; Wang, X.-F. Coupled modeling between geological structure fields and property parameter fields in 3D engineering geological space. Eng. Geol. 2013, 167, 105–116. [Google Scholar] [CrossRef]
Zhang, Q.; Zhu, H. Collaborative 3D geological modeling analysis based on multi-source data standard. Eng. Geol. 2018, 246, 233–244. [Google Scholar] [CrossRef]

Figure 1. Algorithm flowchart.

Figure 2. The construction principle of the borehole context sequence. When the number of adjacent boreholes was equal to three (K = 3), the 1st stratum (a), the 2nd stratum (b), the 3rd stratum (c), and the 4th stratum (d) of the training borehole T were treated as the target points to construct the borehole context sequence.

Figure 3. An example of one of the constructed borehole context sequences. This borehole context sequence contains the stratigraphic features of adjacent boreholes A, B, and C, as well as the spatial characteristics of the target point.

Figure 4. Transformer encoder structure [47].

Figure 5. Self-attention calculation for one head.

Figure 6. Prediction of stratigraphic categories of unknown grids.

Figure 7. Distribution map of the boreholes located in Huangpu District, Shanghai. The number in the figure represents the borehole ID, and the purple line represents the profile line.

Figure 8. Confusion matrix of the test data set.

Figure 9. ROC curve for classification performance. This dotted line is a diagonal line from the lower left corner to the upper right corner, indicating that the true positive rate and false positive rate of the classifier are equal. If the ROC curve is above the dotted line, the performance of the classifier is better than the random guess. If the ROC curve is below the dotted line, the performance of the classifier is lower than the random guess.

Figure 10. Three-dimensional geological model (a) and fence diagrams (b).

Figure 11. The distribution of strata.

Figure 12. Histograms of percentage of each stratum.

Figure 13. Comparison between the actual and predicted values for the test set boreholes.

Figure 14. Comparison of profiles obtained by the IDW (a), Kriging (b), Transformer model (c), and the RNN (d).

Figure 15. Accuracy, precision, recall, F1 score, and Kappa coefficient of the Transformer model and the RNN.

Figure 16. Three-dimensional normalized information entropy model for (a) the Transformer model and (b) the RNN and the profile for (c) the Transformer model and (d) the RNN.

Table 1. Comparison of advantages and disadvantages of 3D geological modeling methods.

Modeling Methods		Advantages	Disadvantages
Deterministic methods	Explicit method	It is convenient for geologists to participate directly and maximize the use of geological knowledge, and the results are controllable.	Time-consuming, laborious, and subjective.
Deterministic methods	Implicit method	High modeling efficiency.	This method cannot best use available data or contain sufficient geological constraints.
Stochastic methods	Coupled Markov chain, Markov random field	This method can generate multiple possible geological models, reflecting the uncertainty of geology.	Using this method it is difficult to determine the transition probability matrix; the reliability depends on experience.
	Gaussian simulations	Smooth model.	The ability to represent complex geological structures is limited.
	Multi-point statistics	This method can capture complex geological models; the generated model is highly consistent with the training image.	At least one training image is needed to represent geological knowledge; the tuning of MPS parameters is also needed.
	Machine learning and deep learning	It has strong objectivity, a strong modeling ability for complex nonlinear geological structures, and high modeling efficiency.	High computing power requirements.

Table 2. Engineering geological stratigraphy of Huangpu District, Shanghai (modified after [48,49]).

Geological Era		Engineering Geological Layer
Geological Era		No.	Name
Holocene	$Late phase Q h_{3}$	①	Fill
	$Late phase Q h_{3}$	②	Brown–yellow silty clay
	$Middle phase Q h_{2}$	③	Gray muddy silty clay
	$Middle phase Q h_{2}$	④	Gray muddy clay
	$Early phase Q h_{1}$	⑤1	Gray silty clay
		⑤2	Gray sandy clay
		⑤3	Gray silty clay
		⑤4	Gray–green silty clay
Upper Pleistocene	$Late phase Q p_{3}^{2}$	⑥	Dark green–brown–yellow silty clay
		⑦1	Grass yellow–gray sandy silt
		⑦2	Gray yellow–gray powder sand
		⑧	Gray silty clay
	$Early phase Q p_{3}^{1}$	⑨1	Cyan–gray silty sand
	$Early phase Q p_{3}^{1}$	⑨2	Gray gravelly medium sand
Middle Pleistocene	$Q p_{2}$	⑩	Blue–gray silty clay

Table 3. Transformer model parameter values.

Parameters	Value
Training set: validation set: test set	6:2:2
Embed dims	256
Num heads	8
Num layers	6
Learning rate	$1 × 10^{- 5}$
Number of training epochs	500
Loss function	Cross-entropy loss
Optimizer	Adam
Number of neighbor boreholes (k)	3

Table 4. Accuracy, precision, recall, F1 score, and Kappa Coefficient values for the test data set.

Metric	Value
Accuracy	0.86
Precision	0.88
Recall	0.86
F1 Score	0.86
Kappa Coefficient	0.85

Table 5. Comparison of RMSE of the Transformer model, IDW, Kriging.

Borehole ID	RMSE of IDW	RMSE of Kriging	RMSE of Transformer Model
3	4.135	3.435	2.965
17	4.515	3.947	3.293
36	4.093	3.376	2.936
Average	4.248	3.586	3.065

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hang, Z.; Xue, T.; Chen, J.; Shi, Y.; Yin, Z.; Cui, Z.; Zhou, G. A 3D Geological Modeling Method Using the Transformer Model: A Solution for Sparse Borehole Data. Minerals 2025, 15, 301. https://doi.org/10.3390/min15030301

AMA Style

Hang Z, Xue T, Chen J, Shi Y, Yin Z, Cui Z, Zhou G. A 3D Geological Modeling Method Using the Transformer Model: A Solution for Sparse Borehole Data. Minerals. 2025; 15(3):301. https://doi.org/10.3390/min15030301

Chicago/Turabian Style

Hang, Zhenquan, Tao Xue, Jianping Chen, Yujin Shi, Zehang Yin, Zijia Cui, and Guanyun Zhou. 2025. "A 3D Geological Modeling Method Using the Transformer Model: A Solution for Sparse Borehole Data" Minerals 15, no. 3: 301. https://doi.org/10.3390/min15030301

APA Style

Hang, Z., Xue, T., Chen, J., Shi, Y., Yin, Z., Cui, Z., & Zhou, G. (2025). A 3D Geological Modeling Method Using the Transformer Model: A Solution for Sparse Borehole Data. Minerals, 15(3), 301. https://doi.org/10.3390/min15030301

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A 3D Geological Modeling Method Using the Transformer Model: A Solution for Sparse Borehole Data

Abstract

1. Introduction

2. Materials and Methods

2.1. Data Preprocessing

2.2. Construction of KD-Tree

2.3. Construction of Borehole Context Sequence

2.4. The Training of the Transformer Model

2.5. Model Prediction and Uncertainty Analysis

3. Experiments and Results

3.1. Experiments

3.2. Results

4. Discussion

4.1. Comparison of the Transformer Model and Other Methods

4.2. Analysis of Model Uncertainty

4.3. Advantages and Disadvantages of the Proposed Method

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI