Three-Dimensional Gravity Inversion Based on Attention Feature Fusion

Chen, Chen; Li, Houpu; Zhang, Yujie; Jin, Xiaomei; Liu, Jianfeng

doi:10.3390/s24175697


by Chen Chen ¹, Houpu Li ²,*, Yujie Zhang ¹,*, Xiaomei Jin ¹ and Jianfeng Liu ¹

¹ School of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China

² College of Electrical Engineering, Naval University of Engineering, Wuhan 430074, China

* Authors to whom correspondence should be addressed.

Sensors 2024, 24(17), 5697; https://doi.org/10.3390/s24175697

Submission received: 21 June 2024 / Revised: 31 July 2024 / Accepted: 27 August 2024 / Published: 1 September 2024

(This article belongs to the Section Industrial Sensors)


Abstract:

Three-dimensional gravity inversion is the process of recovering the location, shape, and physical property parameters of underground anomaly sources from gravity anomaly data observed at the surface. In recent years, with the rapid development of data-driven methods, the application of deep learning (DL) to 3D gravity inversion has attracted wide attention and achieved promising results. In this paper, a three-dimensional gravity inversion method based on the U-Net network and an attention feature fusion mechanism is proposed. Using U-Net as the basic framework, the coarse-grained and fine-grained semantic features in the encoder and decoder are connected by long skip connections, and the global and local semantic features are aggregated through the attention feature fusion module, which avoids feature loss during network training. Compared with the inversion results of the U-Net network, the proposed method has a higher vertical resolution and effectively alleviates the influence of the skin effect on three-dimensional gravity inversion. Ablation experiments show that the attention feature fusion module is the key to improving the vertical resolution and prediction accuracy of the inversion results. Noise experiments show that the inversion network has strong anti-noise ability and good generalization performance. When applied to the San Nicolas deposit in Mexico, the inversion network clearly predicts the basic location and general shape of the sulfide deposit, and the results are in good agreement with the known geological data.

Keywords:

gravity anomalies and earth structure; inverse theory; neural networks; attention feature fusion

1. Introduction

Gravity exploration is a method for determining the location and physical property parameters of a geological body from the density difference between the underground geological body and the surrounding rocks [1]. With the development of space exploration technology, satellite gravity, airborne gravity, and microgravity measurements have been widely applied [2]. In the field of gravity exploration, the research hotspot in the quantitative interpretation of gravity anomaly data is three-dimensional gravity physical property inversion, which is mainly used for the exploration of geological structures and salt domes and for the delineation of coal basins in oil and gas prospect areas. It is also used to study deep and regional geological structures, to predict crustal movements, and, in cooperation with other geophysical exploration methods, to search for metallic and non-metallic minerals [3,4].

Traditional geophysical inversion often adopts linear inversion. Linear inversion theory was first proposed by Backus and Gilbert [5] and later applied to geophysical inversion by Parker [6]. In order to address the non-uniqueness of linear inversion and improve the stability of the solutions, Tikhonov and Arsenin [7] introduced regularization into linear inversion. However, the final result of a linear iterative inversion depends to a large extent on the choice of the initial model, so linear inversion is often limited in practical applications. Nonlinear inversion methods were proposed largely to overcome the tendency of the linear inversion objective function to fall into local minima. Common nonlinear inversion methods include the genetic algorithm [8], simulated annealing algorithm [9], ant colony algorithm [10], global particle swarm algorithm [11], Hunger Games Search optimization [12], and neural network algorithm [13]. Essa describes a fast imaging technique, the “R-parameter imaging technique”, for the interpretation of gravity data measured along a profile [14]. Nonlinear inversion, and the deep learning method in particular, has better optimization ability and inversion efficiency.

Due to the breakthrough application of the deep neural network (DNN) in speech recognition and image recognition [15], the DNN has been widely used in a variety of scenarios and has become the basic network for the practical application of artificial intelligence technology [16,17,18,19,20,21].

In recent years, Huang et al. [22,23] constructed simulation data using the random walk method and used the V-Net network for large-scale gravity inversion. Yang et al. [24] used an improved U-Net system for three-dimensional gravity inversion. Li et al. [25] developed GV Net using convolutional neural networks for inverting residual gravity anomalies. Xu et al. [26] proposed a fast reconstruction method for underground density models based on ResUnet. Lv et al. [27] introduced a multi-task approach to accurately locate geological bodies first and then performed precise inversions on them. Wang et al. [28] proposed using multi-scale functional MS-UNet networks to alleviate the problem. Zhang et al. proposed a novel 3D gravity inversion method based on encoder–decoder neural networks [29]. The successful application of these methods indicates that as a technology with adaptive learning ability and nonlinear mapping ability, the deep learning method may become a supplement or substitute for traditional methods in the future.

Although the current gravity inversion technology is relatively mature in development, due to the inherent properties of the geophysical field, there are still many problems unsolved in the process of solving, such as the skin effect. Since the shallow model unit makes a greater contribution to the observation surface than the deep model unit, when the test set is tested with the trained neural network, a clearer outline is often shown for shallow units, while the outline is more blurred for deep units. Although U-Net is an effective means to carry out three-dimensional gravity inversion, it still has some limitations. The inversion results have a poor effect in the deep part of the model and a low resolution in the vertical direction.

In order to solve the above problems, this paper refers to the framework of U-Net and uses residual network and attention mechanism to build a neural network containing an attention feature fusion module to improve the accuracy of three-dimensional gravity inversion in the vertical direction.

The rest of this paper is organized as follows. In Section 2, the basic principles of gravity forward modeling and gravity inversion are briefly introduced. In Section 3, the structure of the inversion network designed in this paper is introduced in detail, including the residual module, the attention feature fusion module, and the multi-scale channel attention module. Section 4 introduces the dataset, the implementation details, and the experimental results. The noise, comparison, and ablation experiments are designed and carried out, and the inversion network is applied to the processing of real data. The conclusions are given in Section 5.

2. Background

The purpose of gravity forward modeling is to compute the gravity anomalies caused by geological bodies of different sizes, occurrences, and densities. Gravity inversion is used to obtain the density, shape, and other physical property parameters of the geological body that causes the anomaly. In current methods for solving inversion problems, forward modeling is an important part of the inversion process.

2.1. Forward Modeling

To calculate the gravity anomaly produced in the surface observation area by a geological body of arbitrary shape and size, the body is usually divided into a large number of small volume elements. Assume that a volume element is located at (ξ, η, ζ) and has density ρ (ξ, η, ζ). The gravity field generated by the body at the measuring point P(x, y, z) can be expressed as follows [30]:

F = γ ∭_{V}^{} ρ (ξ, η, ζ) \frac{1}{r^{3}} d v,

(1)

where V represents the volume of the geological body, r represents the distance from the volume element Q(ξ, η, ζ) to the observation point P, and γ represents the gravitational constant. Then, the gravity anomaly of the body is the component of the gravity field in the vertical direction, which can be expressed as follows:

g = \frac{\partial V}{\partial z} = γ ∭_{V} ρ (ξ, η, ζ) \frac{z - ζ}{r^{3}} d v .

(2)

In the actual forward modeling of gravity anomalies, the underground anomalous body is usually divided into several cuboid cells, as shown in Figure 1b, and the anomaly generated by each cuboid cell at the measuring point is calculated. The sum of the anomalies generated by all cuboid cells at the measuring point is the anomaly generated at that point by the whole model body in the underground half space. When the number of cells is large enough and their volume is small enough, this method can fit a geological body of any shape. The approximate formula for the gravity anomaly can then be expressed as follows:

d = G m,

(3)

where d is a p \times 1 vector composed of the gravity anomaly values at the observation points; m is a q \times 1 vector composed of the densities of the cells into which the geological body is partitioned; and G is the p \times q sensitivity matrix, whose element g_{i j} quantifies the contribution of the jth model cell to the gravity anomaly at the ith observation point [31].
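The relation d = G m can be illustrated with a small numerical sketch. The code below is not the forward operator used in the paper: it replaces each cuboid cell by a point mass at its centre, takes the z axis as positive downward so that a positive density contrast gives a positive anomaly, and uses a tiny 4 × 4 × 2 grid. The function name `sensitivity_matrix` and all grid values are assumptions made for illustration.

```python
import numpy as np

GAMMA = 6.674e-11  # gravitational constant, m^3 kg^-1 s^-2


def sensitivity_matrix(obs_xyz, cell_xyz, cell_volume):
    """Point-mass approximation of G in d = G m (assumed simplification).

    obs_xyz:  (p, 3) observation points; cell_xyz: (q, 3) cell centres.
    z is assumed positive downward, so buried cells have larger z.
    Returns G with shape (p, q), in m/s^2 per (kg/m^3).
    """
    diff = cell_xyz[None, :, :] - obs_xyz[:, None, :]  # (p, q, 3)
    r = np.linalg.norm(diff, axis=2)                   # (p, q)
    return GAMMA * cell_volume * diff[:, :, 2] / r**3


# Tiny grid: 4 x 4 surface stations, 4 x 4 x 2 cells with a side of 50 m.
side = 50.0
xs = (np.arange(4) + 0.5) * side
ox, oy = np.meshgrid(xs, xs, indexing="ij")
obs = np.column_stack([ox.ravel(), oy.ravel(), np.zeros(ox.size)])
zs = (np.arange(2) + 0.5) * side
cx, cy, cz = np.meshgrid(xs, xs, zs, indexing="ij")
cells = np.column_stack([cx.ravel(), cy.ravel(), cz.ravel()])

G = sensitivity_matrix(obs, cells, side**3)
m = np.zeros(cells.shape[0])
m[5] = 1000.0          # one anomalous cell: 1 g/cm^3 = 1000 kg/m^3
d = G @ m              # gravity anomaly at the 16 stations
```

With this cell ordering, even column indices of `G` belong to the shallow layer and odd indices to the deep layer. Comparing the two column groups shows how quickly sensitivity decays with depth, which is the origin of the skin effect discussed in Section 2.2.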

2.2. Inversion Modeling

The gravity inversion problem can be understood as the inverse of the forward modeling problem: m in Equation (3) is unknown, and the density parameter m of the geological body in the model space must be deduced from the known observation data d. Since the number of observation points is far smaller than the number of model cells, that is, the dimension of d is less than that of m, solving the gravity inversion problem amounts to solving an underdetermined system of equations, whose solution is non-unique and unstable. In this case, the vertical resolution of the inversion result is greatly reduced, and the skin effect appears.

The least-squares method is often used to define the objective function in inversion modeling, which can be expressed as follows:

ϕ = {‖d - G m‖}^{2} .

(4)

In order to reduce the impact of measurement errors and multi-solution problems, Tikhonov proposed regularization inversion [32], adding constraints to the objective function to limit the scope of the model. The objective function can be rewritten as follows:

ϕ = ϕ_{d} + α ϕ_{m} = {‖d - G m‖}^{2} + α {‖m - m_{0}‖}^{2},

(5)

where ϕ_{d} denotes the data-fitting term, ϕ_{m} denotes the model-fitting term, and α denotes the regularization parameter. m_{0} is the initial model and is associated with prior information; when the prior information is insufficient or absent, m_{0} is set to 0. The regularization function introduced here uses a two-norm constraint. In practical problems, many other forms of regularization function are used [7].
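For the two-norm objective of Equation (5), setting the gradient to zero gives the linear system (G^{T} G + α I) m = G^{T} d + α m_{0}. The sketch below solves this system directly on a small random underdetermined problem. It is an illustration of the regularized objective only, not the inversion method proposed in this paper, and the function name `tikhonov_inversion` and the test sizes are assumptions.

```python
import numpy as np


def tikhonov_inversion(G, d, alpha, m0=None):
    """Minimise ||d - G m||^2 + alpha * ||m - m0||^2 in closed form."""
    q = G.shape[1]
    if m0 is None:
        m0 = np.zeros(q)              # no prior information: m0 = 0
    lhs = G.T @ G + alpha * np.eye(q)
    rhs = G.T @ d + alpha * m0
    return np.linalg.solve(lhs, rhs)


# Underdetermined toy problem: 8 data, 20 unknowns (sizes are illustrative).
rng = np.random.default_rng(0)
G = rng.normal(size=(8, 20))
m_true = np.zeros(20)
m_true[3] = 1.0
d = G @ m_true
m_est = tikhonov_inversion(G, d, alpha=1e-3)
```

The estimate reproduces the data closely but generally differs from `m_true`, because many models fit eight data values equally well. This is the non-uniqueness that motivates additional constraints or, in this paper, a learned mapping.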

This paper focuses on the DL data-driven inversion method, which deduces the density model directly from the observed gravity anomaly data and depicts the underground anomaly with sharp boundaries. The DNN represents the mapping relationship from the data domain to the model domain through a complex, nonlinear network [28], which can be expressed as follows:

m = F (d) .

(6)

Unlike the regularization inversion proposed by Tikhonov, which minimizes the error between the predicted and observed anomalies, DNN inversion minimizes the error between the predicted density model and the theoretical density model.

In the field of gravity inversion based on DL, Huang et al. [23] used the U-Net network to perform sparse gravity inversion. The experimental results show that the U-Net network can better reflect the location, shape, and boundary of underground geological bodies, but the influence of the skin effect still exists.

Li et al. [33] advocate feature fusion after the skip connection from the perspective of attention. Inspired by this work, this paper uses the attention feature fusion mechanism to fuse the encoded and decoded features. On this basis, a three-dimensional inversion network based on the attention feature fusion mechanism is proposed, which enables the network to learn the physical property information of the deep part of the underground model and improves the reconstruction accuracy of the deep part. In this way, the skin effect problem in three-dimensional gravity inversion can be alleviated. The encoder and decoder are constructed by stacking residual modules, and the features of the encoder and decoder are joined using the attention fusion mechanism. Training the inversion network requires a large amount of data, but the data samples used by most methods are of a single type and the shapes of the anomaly sources are relatively regular, which results in poor generalization of the inversion network. In order to improve the generalization of the network, in addition to the conventional models, this paper adds random models generated by random walk to the dataset to increase the randomness and diversity of the training data. The proposed method can better recover the physical property information of the deep model and alleviates the influence of the skin effect to a certain extent. At the same time, the noise test and real data experiment show that the proposed method can effectively improve the generalization of the inversion network and obtain a clear and focused anomaly source, which provides a basis for practical application.

3. Methodology

The encoder–decoder structure of U-Net excels in extracting intricate two-dimensional characteristics from gravity anomaly data, facilitating the reconstruction of a precise three-dimensional density model. Nevertheless, the simplistic combination of encoded and decoded features within U-Net often leads to the erosion of vital feature information, consequently hindering the prediction model’s capability to accurately capture nuanced, in-depth details. Therefore, this paper refers to the framework of U-Net and uses residual network and attention mechanism to build a neural network containing an attention feature fusion module to improve the accuracy of three-dimensional gravity inversion in the vertical direction.

This paper designs an inversion network based on the U-Net structure and combined with an attention feature fusion module, with the aim of making the network fully learn the global and local features during training so as to alleviate the influence of the skin effect to a greater extent. A density model dataset named GravInv is constructed, which consists of seven types of models. Experiments show that the inversion network has a better vertical resolution on both synthetic data and real data. The comparison and ablation experiments verify that adding the attention feature fusion module to the neural network is the key to improving the resolution and inversion accuracy of the deeper part of the model, and that the skin effect is alleviated to a large extent.

3.1. Construction of the Inversion Network

In order to better reconstruct the deep part of the density model in the three-dimensional gravity inversion task, high-resolution inversion must be achieved in the vertical direction. Our inversion network is based on the framework of U-Net [34], which includes a contraction path for capturing context and a symmetric expansion path for supporting accurate localization; the convolution layers are replaced by Basic Block [35] so that a deeper network can be trained. The simple feature concatenation between the encoder and decoder is replaced with the attention feature fusion (AFF) module [36], which uses the idea of the attention mechanism for feature fusion, and the features are then extracted again. In order to obtain a better feature representation, which helps improve the resolution of the prediction results, the multi-scale channel attention module (MS-CAM) [36] further aggregates the global and local features, and the features extracted at different scales are finally fused. The network structure is shown in Figure 2.

The encoder on the left uses Basic Block to extract features from the gravity anomaly observation data and then changes the size of the feature map through the max pooling layer. After the convolutional layer in each Basic Block, there is a batch normalization (BN) layer and a ReLU activation layer to prevent vanishing or exploding gradients.

The decoder on the right upsamples the gravity anomaly features extracted by the encoder to the same size as the features of the previous layer. Then, the features extracted by the encoder and the features sampled by the decoder are fused in AFF through the long-skip connection. The features after fusion are further extracted by Basic Block. AFF can be expressed as follows:

Z = M (X ⨄ Y) \otimes X + (1 - M (X ⨄ Y)) \otimes Y,

(7)

where Z \in R^{C \times H \times W} is the fused feature with C channels and size H × W, ⨄ denotes the initial feature integration, \otimes denotes element-wise multiplication, and M denotes the attention weights generated by the MS-CAM. The AFF module [36] is shown in Figure 3. After the initial feature integration, the features X and Y are passed through the MS-CAM to obtain the attention fusion weights. The fused feature Z is the sum of the initial features X and Y, each multiplied element-wise by its corresponding weight, M (X ⨄ Y) and 1 - M (X ⨄ Y), respectively. In the AFF module, for simplicity, we chose element-wise addition as the initial feature integration.
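Equation (7) is a soft selection between the two inputs: at every position the output is a weighted average of X and Y with weights that sum to one. The sketch below shows only this fusion step. The attention function is passed in as a parameter, and the plain element-wise sigmoid used here (`sigmoid_gate`) is a stand-in assumption, not the MS-CAM of the paper.

```python
import numpy as np


def aff_fuse(x, y, attention):
    """Eq. (7): Z = M(X + Y) * X + (1 - M(X + Y)) * Y, element-wise."""
    m = attention(x + y)          # weights in (0, 1), same shape as x
    return m * x + (1.0 - m) * y


def sigmoid_gate(u):
    # Stand-in for MS-CAM (assumption): a plain element-wise sigmoid.
    return 1.0 / (1.0 + np.exp(-u))


rng = np.random.default_rng(0)
x = rng.normal(size=(8, 16, 16))   # encoder feature, C x H x W
y = rng.normal(size=(8, 16, 16))   # upsampled decoder feature
z = aff_fuse(x, y, sigmoid_gate)
```

Because the weights lie between 0 and 1, each element of `z` stays between the corresponding elements of `x` and `y`, and fusing a feature with itself returns it unchanged.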

In order to make the network pay more attention to the features of small geological bodies, the local and global features extracted by Basic Block are input into the MS-CAM to learn the attention weights, so as to aggregate contextual semantic features over different receptive fields for targets of different scales. To keep the module as lightweight as possible, only the local context is added to the global context within it. The global channel context G(X) is computed from the input feature X after global average pooling, and the local channel context L (X) \in R^{C \times H \times W} is computed by Point-wise Convolution (PWConv) as follows:

L (X) = B (\mathrm{PWConv}_{2} (δ (B (\mathrm{PWConv}_{1} (X))))),

(8)

where B denotes batch normalization (BN), δ denotes the Rectified Linear Unit (ReLU) activation, and \mathrm{PWConv}_{1} and \mathrm{PWConv}_{2} denote Point-wise Convolutions. Given the global channel context G(X) and the local channel context L(X), the refined feature X^{'} \in R^{C \times H \times W} with C channels and size H × W is obtained by the MS-CAM as follows:

X^{'} = X \otimes M (X) = X \otimes σ (L (X) \oplus G (X)),

(9)

where M (X) denotes the attention weights generated by the MS-CAM, σ denotes the sigmoid function, \oplus denotes broadcasting addition, and \otimes denotes element-wise multiplication. The structure of the MS-CAM is shown in Figure 4. When the MS-CAM aggregates local and global context information, the local channel context L(X) has the same shape as the input feature, which preserves and highlights fine details in the underlying feature.
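A minimal NumPy sketch of Equations (8) and (9) follows. It makes several assumptions that are not taken from the paper: the batch normalization layers are omitted, the global branch reuses the same bottleneck structure as the local branch, the weights are random rather than trained, and the channel reduction ratio `r` is chosen arbitrarily. A point-wise convolution is written as a channel-mixing matrix applied at every pixel.

```python
import numpy as np


def pwconv(x, w):
    """Point-wise (1x1) convolution: mixes channels at every pixel."""
    return np.einsum("oc,chw->ohw", w, x)


def bottleneck(x, w1, w2):
    # PWConv -> ReLU -> PWConv; the BN layers of Eq. (8) are omitted here.
    return pwconv(np.maximum(pwconv(x, w1), 0.0), w2)


def ms_cam(x, local_w, global_w):
    """Eq. (9): X' = X * sigmoid(L(X) + G(X))."""
    local = bottleneck(x, *local_w)                   # (C, H, W)
    pooled = x.mean(axis=(1, 2), keepdims=True)       # global average pooling, (C, 1, 1)
    glob = bottleneck(pooled, *global_w)              # (C, 1, 1)
    weights = 1.0 / (1.0 + np.exp(-(local + glob)))   # broadcasting addition
    return x * weights, weights


rng = np.random.default_rng(0)
C, r = 8, 4                                           # r: channel reduction (assumed)


def make_weights():
    # Random, untrained weights for illustration only.
    return rng.normal(size=(C // r, C)), rng.normal(size=(C, C // r))


x = rng.normal(size=(C, 16, 16))
x_refined, w = ms_cam(x, make_weights(), make_weights())
```

The global branch yields one value per channel, while the local branch keeps the full H × W resolution, so the summed weights vary per pixel but share a channel-wide offset.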

In the decoding process, after Basic Block and MS-CAM feature extraction four times for each level, the features of each level are upsampled to the same size, AFF is used again for attention feature fusion, and finally, the predicted model density value is output through a 1 × 1 convolution and the ReLU activation function.

Basic Block is introduced in the encoding block to maintain accuracy as the network becomes deeper and to prevent exploding or vanishing gradients during training. In addition, the skip connection between the encoder and the decoder allows the network to combine more feature information, reduces information loss in the encoding and decoding process, and helps the network obtain high-resolution semantic features.

3.2. Loss Function

In the training of deep neural networks, the optimization problem is usually solved by minimizing a loss function. For image regression and reconstruction problems, the L1 or L2 norm is the most common metric used to define the loss function. However, we assume that the solution of the three-dimensional gravity inversion problem is sparse [37], that is, the anomalous bodies of interest occupy only a small part of the underground space. Under this assumption, an L1 or L2 loss can cause the network to converge to a local minimum during gradient descent, so that the predictions are biased toward the background. In order to describe the edge contours of geological bodies more clearly, the Dice loss function, which can describe the shape and position of small targets even when the numbers of target and background voxels are unbalanced, is selected as the loss function of the inversion network [38].

Dice loss is widely used in medical image segmentation tasks to measure the similarity of two sets. The Dice coefficient is defined as follows [39]:

\mathrm{DiceCoefficient} = \frac{2 |A \cap B|}{|A| + |B|} .

(10)

In this study, sets A and B represent the density model predicted by the network and the real density model, respectively. The Dice coefficient can be expressed as follows [38]:

\mathrm{DiceCoefficient} = \frac{2 \sum_{i = 1}^{N} m_{i}^{T} {\hat{m}}_{i}}{\sum_{i = 1}^{N} m_{i}^{T} m_{i} + \sum_{i = 1}^{N} {\hat{m}}_{i}^{T} {\hat{m}}_{i}},

(11)

where m_{i} and {\hat{m}}_{i} represent the ith true density model and the ith reconstructed density model, respectively, and N represents the total number of density models.

The loss function used for network training in this paper, the Dice loss, is equal to one minus the Dice coefficient:

\mathrm{DiceLoss} = 1 - \mathrm{DiceCoefficient} = 1 - \frac{2 \sum_{i = 1}^{N} m_{i}^{T} {\hat{m}}_{i}}{\sum_{i = 1}^{N} m_{i}^{T} m_{i} + \sum_{i = 1}^{N} {\hat{m}}_{i}^{T} {\hat{m}}_{i}} .

(12)

The value of the Dice loss ranges from 0 to 1, and the smaller the value, the smaller the error between the network inversion results and the real label, and the better the performance of the model.
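Equation (12) can be evaluated directly on a batch of flattened density models, as in the sketch below. The small constant `eps` is an assumption added here to avoid division by zero when both models are empty; it does not appear in the paper.

```python
import numpy as np


def dice_loss(m_true, m_pred, eps=1e-8):
    """Eq. (12) for a batch of flattened density models, shape (N, q)."""
    inter = np.sum(m_true * m_pred)                      # sum_i m_i^T m_hat_i
    denom = np.sum(m_true * m_true) + np.sum(m_pred * m_pred)
    return 1.0 - 2.0 * inter / (denom + eps)             # eps (assumed) avoids 0/0


a = np.array([[1.0, 1.0, 0.0, 0.0]])
b = np.array([[0.0, 1.0, 1.0, 0.0]])
loss_same = dice_loss(a, a)      # identical models
loss_half = dice_loss(a, b)      # one of two cells overlaps
```

Here `loss_same` is essentially 0 and `loss_half` is essentially 0.5. A prediction that is all background gives a loss of 1 regardless of how many background cells it gets right, which is why this loss is less biased toward the background than a voxel-wise norm.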

In summary, the whole inversion process can be summarized as Algorithm 1.

Algorithm 1: A 3D gravity inversion method based on the attention fusion mechanism

Input: training data pairs {d}_{p = 1}^{P}, {m}_{p = 1}^{P}; batch size bs; learning rate η

Initialization: weight W^{(t)}, offset b^{(t)}, t = 0

Repeat:

  for s = 1 : P // bs (or P // bs + 1)

    loss = 0

    for i = 1 : bs_{s}

      (1) Forward propagation

      Encoding

      for j = 1 : 4

        {d_{j}^{i}}^{'} \leftarrow ReLU (BN (W_{2 j - 1}^{(t)} * d_{j - 1}^{i} + b_{2 j - 1}^{(t)}))

        c_{j} \leftarrow ReLU (BN (W_{2 j}^{(t)} * {d_{j}^{i}}^{'} + b_{2 j}^{(t)}))

        d_{j}^{i} \leftarrow Downsampling (c_{j}) with Dropout (0.2)

      {d_{5}^{i}}^{'} \leftarrow ReLU (BN (W_{9}^{(t)} * d_{4}^{i} + b_{9}^{(t)}))

      d_{5}^{i} \leftarrow ReLU (BN (W_{10}^{(t)} * {d_{5}^{i}}^{'} + b_{10}^{(t)}))

      Decoding

      for j = 4 : -1 : 1

        {d_{j}^{i}}^{'} \leftarrow Upsampling (d_{j + 1}^{i}) with Dropout (0.2)

        a_{j}^{'} \leftarrow ReLU (BN (W_{24 - 3 j}^{(t)} * AFF ({d_{j}^{i}}^{'}, c_{j}) + b_{24 - 3 j}^{(t)}))

        a_{j} \leftarrow ReLU (BN (W_{25 - 3 j}^{(t)} * a_{j}^{'} + b_{25 - 3 j}^{(t)}))

        d_{j}^{i} \leftarrow Sigmoid (MS-CAM (a_{j}))

      {\hat{m}}^{i} \leftarrow ReLU (AFF (d_{1}^{i}, d_{2}^{i}, d_{3}^{i}, d_{4}^{i}, d_{5}^{i}))

      loss \leftarrow loss + L_{i} ({\hat{m}}^{i}, m^{i})

    (2) Back propagation

    W^{(t + 1)} \leftarrow W^{(t)} + Adam (η, loss / {bs}_{s})

    b^{(t + 1)} \leftarrow b^{(t)} + Adam (η, loss / {bs}_{s})

Until the neural network converges

Prediction: the gravity anomaly data d of the area to be reconstructed are input to the trained network, and the reconstructed model \hat{m} is predicted.

4. Experiments

4.1. Simulation Datasets

In the DL inversion method, in addition to designing a suitable network, it is necessary to construct a large and varied set of simulation data to train the network. We divided the square observation area, with a surface edge length of 1600 m, into a 32 × 32 grid with a sampling interval of 50 m. The underground area, with a length and width of 1600 m and a depth of 800 m, was divided into 32 × 32 × 16 cubic cells with a side length of 50 m. The density of the geological body was set to 1 g / c m^{3} and the background density to 0 g / c m^{3}, so that the three-dimensional inversion problem can be treated approximately as a 3D segmentation problem in which the underground space is divided into the background region (0 g / c m^{3}) and the geological region (1 g / c m^{3}).

Six types of conventional models were constructed, and, considering the diversity of underground geological bodies, random walk [23] was used to generate random models so that the network has a certain generalization ability. The resulting simulation dataset, composed of the six conventional models, the random models, and their corresponding anomaly data, was used to train the neural network and was named the GravInv dataset. The GravInv dataset consists of a training set and a test set. The training set contains 10,000 random models and 2000 models of each conventional type, totaling 22,000 models. The test set contains 100 random models and 100 models of each conventional type, totaling 700 models. The six conventional models and the random model are shown in Figure 5.
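The random walk rule of [23] is not described in this section, so the sketch below is a hypothetical variant and not the generator used to build GravInv. It starts from a random cell of the 32 × 32 × 16 grid, moves one cell at a time along a randomly chosen axis direction, and sets every visited cell to 1. The function name `random_walk_model`, the number of steps, and the boundary handling are all assumptions.

```python
import numpy as np


def random_walk_model(shape=(32, 32, 16), n_steps=200, seed=0):
    """Binary density model (1 = body, 0 = background) traced by a 3D walk."""
    rng = np.random.default_rng(seed)
    model = np.zeros(shape, dtype=np.float32)
    pos = np.array([rng.integers(0, n) for n in shape])   # random start cell
    moves = np.array([[1, 0, 0], [-1, 0, 0], [0, 1, 0],
                      [0, -1, 0], [0, 0, 1], [0, 0, -1]])
    upper = np.array(shape) - 1
    for _ in range(n_steps):
        model[tuple(pos)] = 1.0
        # Take one step and stay inside the grid (assumed boundary rule).
        pos = np.clip(pos + moves[rng.integers(0, 6)], 0, upper)
    return model


model = random_walk_model()
```

Because each step moves to a face-adjacent cell or stays in place at a boundary, the marked cells form a single connected body of irregular shape, in contrast to the regular outlines of the conventional models.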

4.2. Implementation Details

In the training phase, we set the number of training epochs to 50, the batch size to 32, and the learning rate to 3 × 10⁻⁴ and adopted the Adam algorithm to optimize the network parameters (PC configuration: 11th Gen Intel(R) Core(TM) i5-11320H @3.20 GHz, RAM: 16 GB; the same as below).

4.3. Evaluation Metrics

Following a large number of studies on 3D gravity inversion, this paper uses the following indicators to evaluate the reconstruction accuracy of the three-dimensional gravity inversion model:

\mathrm{MAE} = \frac{1}{q} \sum_{i = 1}^{q} |\tilde{m} (i) - m (i)|,

(13)

E_{m} = \frac{1}{q} \sum_{i = 1}^{q} \frac{{‖\tilde{m} (i) - m (i)‖}_{2}}{{‖m (i)‖}_{2}},

(14)

E_{acc} = \frac{\mathcal{C} (|\tilde{m} (i) - m (i)| < t)}{q},

(15)

R^{2} = 1 - \frac{\sum_{i = 1}^{p} {(\tilde{d} (i) - d (i))}^{2}}{\sum_{i = 1}^{p} {(d (i) - \bar{d} (i))}^{2}},

(16)

{\bar{E}}_{acc} = \frac{\sum_{i = 1}^{w} {E_{acc}}_{i}}{w},

(17)

where \tilde{m} (i) and m (i) represent the ith element of the predicted model vector and of the theoretical model vector, respectively, and p and q represent the dimensions of the vectors d and m, respectively. The MAE is the mean absolute error, reflecting the average error between the reconstructed model and the theoretical model; E_{m} is the average relative error, which reflects the relative error between the real density model and the predicted model based on the L2 norm. E_{acc} [31] is used to measure the inversion accuracy of the underground area. t represents the threshold value: when the absolute value of the error between \tilde{m} (i) and m (i) is less than t, the prediction for that cell is considered correct, and \mathcal{C} counts the correctly predicted cells. Dividing this count by the total number of model cells q gives the accuracy. In this study, t is set to 0.01. R^{2} is the coefficient of determination, which reflects how well the predicted gravity anomaly data fit the observed gravity anomaly data. \tilde{d} (i) represents the predicted gravity anomaly value, d (i) represents the observed gravity anomaly value, and \bar{d} (i) represents the mean of the observed anomaly values. {\bar{E}}_{acc} represents the average value of E_{acc} for a specific model type in the test set, and w represents the number of models of that type in the test set.
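Three of these indicators can be computed as in the sketch below, which assumes flattened NumPy vectors for the models and the data. E_{m} is left out because Equation (14) as printed divides by m (i), which is zero in background cells, and this section does not say how that case is handled. The function names and the toy values are assumptions.

```python
import numpy as np


def mae(m_pred, m_true):
    """Eq. (13): mean absolute error over the q model cells."""
    return np.mean(np.abs(m_pred - m_true))


def accuracy(m_pred, m_true, t=0.01):
    """Eq. (15): fraction of cells whose absolute error is below t."""
    return np.mean(np.abs(m_pred - m_true) < t)


def r_squared(d_pred, d_obs):
    """Eq. (16): coefficient of determination of the data fit."""
    ss_res = np.sum((d_pred - d_obs) ** 2)
    ss_tot = np.sum((d_obs - d_obs.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


m_true = np.array([0.0, 0.0, 1.0, 1.0])
m_pred = np.array([0.0, 0.005, 1.0, 0.5])
d_obs = np.array([1.0, 2.0, 3.0])
scores = (mae(m_pred, m_true), accuracy(m_pred, m_true), r_squared(d_obs, d_obs))
```

In this toy case three of the four cells are within the threshold t = 0.01, so the accuracy is 0.75, and a perfect data fit gives R^{2} = 1. Averaging `accuracy` over the w test models of one type gives the quantity in Equation (17).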

4.4. Simulation Datasets Experiment

4.4.1. Ternary Inversion

This section assesses the feasibility and validity of the proposed inversion network on the GravInv test set. After 50 epochs of training, the neural network reconstructs the 3D density model. Figure 6 shows the loss curve of the inversion network: the loss function stabilizes at around the 20th epoch, indicating that training converges quickly and stably.

In order to facilitate the observation of the inverted shape of the anomalous body, all 3D views except those of the ablation experiment show only the part with a residual density value greater than 0.5 g / c m^{3}. In the following, we select two more complex models and a random model from the test set to demonstrate the 3D reconstruction performance of the inversion network.

Figure 7 shows the 3D inversion results of the inclined levee model and the vertical section. By comparing the 3D views of the theoretical model and the predicted model, it can be observed that the proposed network can not only recover the basic position and shape of the simulated density model well but also identify the buried depth range of the model. Figure 7c shows that the neural network clearly recovers the contours and position of the deep part of the body and has a high depth resolution. The inversion accuracy is calculated according to Equation (15). In this paper, 100 test samples are selected for each model, and the inversion accuracies of the 100 test samples are averaged to obtain the average inversion accuracy {\bar{E}}_{acc}. The average inversion accuracy {\bar{E}}_{acc} of this model reaches 96.43%.

Figure 8 shows the 3D inversion results of the syncline model and the vertical section. By comparing the 3D views of the theoretical model and the predicted model, it can be observed that the proposed network clearly reproduces the shape and trend of the model. Figure 8c shows that the density model predicted by the neural network has a clear boundary in the vertical direction, which is basically consistent with the contour of the theoretical model. The average inversion accuracy {\bar{E}}_{acc} of this model reaches 96.47%.

Figure 9 shows the 3D inversion results of the random model and the vertical section. By comparing the 3D views of the theoretical model and the predicted model, it can be observed that the proposed network still performs well on the random model and basically reproduces its general shape and position. However, some information about the model is lost as the depth increases. Figure 9c shows that the predicted model has a relatively clear outline in the shallow part; as the depth increases, the boundary of the model becomes blurred, but the basic information of the density model can still be roughly distinguished. The average inversion accuracy {\bar{E}}_{acc} of this model is 92.71%, which suggests that the proposed inversion network has good generalization performance.

4.4.2. Noise Experiment

In practical gravity inversion, the observed gravity anomaly data are often noisy. Therefore, to verify the robustness of the proposed inversion network, Gaussian noise is added to the gravity anomaly data constructed in Section 4.1, and the noisy data are used to train the network, simulating real-case scenarios. As the noise level increases, the inversion accuracy inevitably decreases. To test robustness while keeping the network predictions meaningful, noise is added to the gravity data following Equation (18) [23]:

$$d^{noise} = d + \lambda \times \max(d) \times \mathrm{random}(0, 1, \mathrm{size}(d)), \qquad (18)$$

where $d^{noise}$ denotes the gravity data after adding noise, $\lambda$ denotes the noise weight coefficient that controls the amplitude of the added noise, and $\mathrm{random}(0, 1, \mathrm{size}(d))$ denotes Gaussian noise of the same size as $d$. To ensure that the added noise reflects the noisy character of real data without masking the original features of the gravity data, Gaussian noise with a noise weight coefficient of 5% is added. The gravity anomalies with 5% Gaussian noise added are shown in Figure 10.
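The noise model of Equation (18) can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' code: the function name `add_gaussian_noise`, the synthetic 32 x 32 anomaly grid, and the reading of random(0, 1, size(d)) as zero-mean, unit-variance Gaussian samples are all assumptions.

```python
import numpy as np

def add_gaussian_noise(d, weight=0.05, rng=None):
    """Apply Equation (18): d + weight * max(d) * N(0, 1) noise shaped like d."""
    rng = np.random.default_rng() if rng is None else rng
    d = np.asarray(d, dtype=float)
    noise = rng.standard_normal(d.shape)  # assumed: zero mean, unit variance
    return d + weight * d.max() * noise

# Hypothetical smooth 32 x 32 anomaly grid (mGal), for illustration only.
x, y = np.meshgrid(np.linspace(-1.0, 1.0, 32), np.linspace(-1.0, 1.0, 32))
d = 2.0 * np.exp(-(x**2 + y**2) / 0.2)

# weight=0.05 corresponds to the 5% noise weight coefficient used in the text.
d_noise = add_gaussian_noise(d, weight=0.05, rng=np.random.default_rng(0))
print(d_noise.shape, float(np.std(d_noise - d)))
```

Because the noise is scaled by the maximum of the data rather than by each value, low-amplitude regions of the anomaly map receive the same absolute perturbation as the peak.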

The noisy data are used to train the inversion network, and the trained network is then tested on noise-free synthetic data. The inversion results predicted by the network are shown in Figure 11, Figure 12 and Figure 13.

Figure 11, Figure 12 and Figure 13 show that the inversion network based on the attention feature fusion mechanism has good anti-noise ability. Figure 11b, Figure 12b and Figure 13b show the 3D views of the prediction models; both the shape and position of the predicted geological bodies are in good agreement with the theoretical models. Figure 11c, Figure 12c and Figure 13c show that the prediction models all have relatively clear boundaries that are consistent with the theoretical models in both the horizontal and vertical directions. Although the boundary clarity and vertical resolution are reduced compared with the inversion results in Section 4.4.1, the average accuracy of the predictions remains high: in the noise experiment, the average inversion accuracies $\bar{E}_{acc}$ of the three models are 94.92%, 95.07%, and 92.68%, respectively. These results show that the inversion network based on the attention feature fusion mechanism can still describe the shape, location, and physical properties of the anomaly source accurately when trained on noisy data.

The advantages of DL-based 3D gravity inversion come from the guidance of the loss function and the ability of neural networks to fit complex functions [31]. Because deep learning methods are data-driven, the generalization and accuracy of the network tend to improve as more training data are supplied. The noise experiment shows that DL-based 3D gravity inversion has good anti-noise performance, which lays a foundation for processing actual geological data.

4.4.3. Contrast Experiment

To show that the proposed inversion network alleviates the skin effect to a certain extent, this subsection compares its inversion results with those of the U-Net network. The inversion results of the U-Net network are shown in Figure 14, Figure 15 and Figure 16.

As can be seen in Figure 14, Figure 15 and Figure 16, the boundary definition and the accuracy of the physical property parameters in the deeper part of the U-Net prediction models are not as good as those obtained by the proposed network in Section 4.4.1. Figure 14b, Figure 15b and Figure 16b show the 3D views of the U-Net prediction models, which are surrounded by many low-density cells, so their shapes are less clearly visible than in Figure 7b, Figure 8b and Figure 9b. Figure 14c, Figure 15c and Figure 16c show that the resolution of the U-Net predictions gradually decreases with depth: the boundary of the anomaly source is not clearly reflected and the accuracy of the physical property parameters declines, indicating a more serious skin effect. In contrast, the inversion results in Section 4.4.1 still clearly recover the boundary of the anomaly source in the deep part of the model, and the accuracy of the physical property parameters is also higher than that of U-Net. The average inversion accuracies $\bar{E}_{acc}$ of the three models predicted by U-Net are 93.51%, 94.81%, and 93.09%, respectively. For the inclined levee and syncline models, these values are lower than the corresponding accuracies of the proposed network (96.43% and 96.47%), while for the stochastic model the U-Net value is marginally higher than the 92.71% obtained in Section 4.4.1. The experimental results show that the inversion network with the AFF module as its core effectively improves the 3D inversion, especially the vertical resolution, which is clearly better than that of U-Net, and alleviates the influence of the skin effect to a certain extent.

4.4.4. Ablation Experiment

The key point of the proposed inversion network is that the AFF module aggregates the global and local context of the features through channel attention, which reduces the loss of feature information during training, so that the network learns the features of the input data more comprehensively and achieves high-precision inversion. To demonstrate that the AFF module effectively improves the inversion accuracy, this section compares the inversion results of the network with and without the AFF module. The network without the AFF module was trained on the GravInv dataset with the same hyperparameter settings. For ease of observation, only the cells with a residual density greater than $0.3\ \mathrm{g/cm^{3}}$ are displayed in the 3D views of the ablation experiments, and the prediction results are shown in Figure 17, Figure 18 and Figure 19.
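The aggregation of global and local context that the ablation removes can be sketched as follows. This is a simplified NumPy illustration of the general attentional feature fusion form given by Dai et al. [36], Z = M(X + Y) ⊗ X + (1 − M(X + Y)) ⊗ Y, and not the network used in this paper: batch normalization is omitted, the weights are random, and the function names, channel counts, tensor shapes, and the assignment of `x` and `y` to encoder and decoder features are assumptions made for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bottleneck(x, w_down, w_up):
    """Point-wise conv -> ReLU -> point-wise conv over the channel axis.

    x has shape (C, D, H, W); w_down is (C_r, C); w_up is (C, C_r).
    Batch normalization is omitted to keep the sketch short.
    """
    hidden = np.maximum(np.einsum("rc,cdhw->rdhw", w_down, x), 0.0)
    return np.einsum("cr,rdhw->cdhw", w_up, hidden)

def ms_cam(x, local_w, global_w):
    """Attention weights in (0, 1) from local and global channel context."""
    local = bottleneck(x, *local_w)                    # per-voxel context
    pooled = x.mean(axis=(1, 2, 3), keepdims=True)     # global average pooling
    global_ctx = bottleneck(pooled, *global_w)         # shape (C, 1, 1, 1)
    return sigmoid(local + global_ctx)                 # broadcast over D, H, W

def aff(x, y, local_w, global_w):
    """Fuse two feature maps: M(x + y) * x + (1 - M(x + y)) * y."""
    m = ms_cam(x + y, local_w, global_w)
    return m * x + (1.0 - m) * y

# Hypothetical sizes and random weights, for illustration only.
rng = np.random.default_rng(0)
C, C_r = 4, 2
local_w = (rng.normal(size=(C_r, C)), rng.normal(size=(C, C_r)))
global_w = (rng.normal(size=(C_r, C)), rng.normal(size=(C, C_r)))
x = rng.normal(size=(C, 3, 5, 5))   # e.g. encoder (skip-connection) features
y = rng.normal(size=(C, 3, 5, 5))   # e.g. decoder features
z = aff(x, y, local_w, global_w)
print(z.shape)
```

Each output value is a convex combination of the two inputs, so the fusion acts as a learned soft selection between the two feature maps rather than a plain concatenation or sum.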

As can be seen in Figure 17, Figure 18 and Figure 19, the network predictions are poor after the AFF module is removed, which indicates that the AFF module plays a crucial role in the proposed network and is the key to high-precision 3D gravity inversion. Figure 17b, Figure 18b and Figure 19b show the 3D views of the prediction results: without the AFF module, the inversion network can predict only the general location and burial depth of the underground anomaly source, while the errors in its shape and physical property parameters are large, in marked contrast to the predictions of the full network. Figure 17c, Figure 18c and Figure 19c show that, without the AFF module, the network cannot accurately reproduce the shape of the anomaly source, let alone obtain clear boundaries and accurate model densities. The ablation experiment therefore shows that the AFF module enables the inversion network to learn the features of the input data more effectively and contributes greatly to high-precision three-dimensional gravity inversion.

According to the evaluation indicators in Section 4.3, the errors of the above experiments are shown in Table 1.

Table 1 shows that the proposed inversion network has the smallest MAE and the smallest model relative error $E_m$, outperforming U-Net, the network without the AFF module, and the network trained on noisy data. It also attains the highest $R^2$ (0.9813), and it accurately fits the surface gravity data.
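For readers who want to reproduce this kind of comparison, two of the indicators in Table 1 can be computed as sketched below. The sketch assumes the standard textbook definitions of MAE and the coefficient of determination; the exact forms used in this paper are those of Section 4.3, and $E_m$ is left out because its definition is not restated here. The function names and the toy arrays are hypothetical.

```python
import numpy as np

def mae(reference, predicted):
    """Mean absolute error between two arrays of equal shape."""
    reference = np.asarray(reference, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.mean(np.abs(reference - predicted)))

def r2_score(reference, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    reference = np.asarray(reference, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    ss_res = np.sum((reference - predicted) ** 2)
    ss_tot = np.sum((reference - reference.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)

# Toy values, for illustration only (not data from the paper).
reference = np.array([0.0, 0.0, 1.0, 1.0])
predicted = np.array([0.0, 0.1, 0.9, 1.0])
print(mae(reference, predicted), r2_score(reference, predicted))
```

The same two functions apply to flattened density models or to gravity data, whichever pair of reference and predicted arrays is being compared.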

The average inversion accuracy $\bar{E}_{acc}$ of each model is shown in Table 2. Because the predictions in the ablation experiment differ significantly from the theoretical models, those results are not listed.

As can be seen in Table 2, for the inclined levee model and the syncline model, the proposed network achieves a clearly higher average inversion accuracy than U-Net and the network trained on noisy data (96.43% versus 93.51% and 94.92%, and 96.47% versus 94.81% and 95.07%, respectively). For the stochastic model, its accuracy (92.71%) is slightly lower than that of U-Net (93.09%), but the gap of 0.38 percentage points is basically negligible. Overall, the average accuracy of the proposed network for model reconstruction is better than that of U-Net.

4.4.5. Comparison of Reconstruction Effects at Different Depths

To observe more clearly how the proposed inversion network alleviates the skin effect, this section displays the $xoy$ profiles of the inversion results at depths of 400 m and 600 m.
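Extracting such a horizontal profile from a gridded density model amounts to selecting one depth layer. The sketch below assumes a model stored as an array of shape (nz, ny, nx) with uniform cell thickness, where layer k spans depths from k times the thickness up to (k + 1) times the thickness. The 16 x 32 x 32 grid, the 50 m cell thickness, and the function name `depth_slice` are hypothetical and are not taken from the paper.

```python
import numpy as np

def depth_slice(model, depth_m, cell_thickness_m):
    """Return the horizontal (xoy) layer of a (nz, ny, nx) model that contains depth_m."""
    k = int(depth_m // cell_thickness_m)
    if not 0 <= k < model.shape[0]:
        raise ValueError("depth lies outside the model")
    return model[k]

# Hypothetical grid: 16 layers of 50 m, 32 x 32 cells per layer.
cell_thickness_m = 50.0
model = np.zeros((16, 32, 32))
model[6:13, 10:20, 10:20] = 1.0   # block occupying depths 300-650 m

slice_400 = depth_slice(model, 400.0, cell_thickness_m)
slice_600 = depth_slice(model, 600.0, cell_thickness_m)
print(slice_400.shape, slice_400.sum(), slice_600.sum())
```

Applying the same function to the theoretical and predicted models at the same depth gives directly comparable panels.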

Figure 20 shows that the proposed inversion network accurately inverts the shape, position, and density of the anomalous body at a depth of 400 m. At a depth of 600 m, the accuracy of the density parameters decreases to a certain extent, but the predicted shape, position, and boundary are still relatively clear, and the geological body can still be identified accurately.

As shown in Figure 21, at a depth of 400 m the neural network accurately inverts the shape, position, and density of the anomalous body, although some low-density artifacts appear around the geological body. At a depth of 600 m, the inversion results are basically consistent with the theoretical model and the density parameters are almost the same, indicating that the proposed inversion network reflects the characteristics of geological bodies in deeper areas well and alleviates the influence of the skin effect to a certain extent.

Figure 22 shows that, at a depth of 400 m, the neural network basically recovers the shape and position of the anomalous body, and the density parameters of the inversion results are close to those of the theoretical model. At a depth of 600 m, the network accurately recovers the location of the anomalous body but only roughly recovers its shape; the physical boundary is relatively blurred, and there is a certain gap between the predicted density parameters and those of the theoretical model.

In summary, comparing the inversion results of the above models at different depths shows the advantage of the inversion network introduced in this paper. The network improves the reconstruction accuracy in the deep part of the models relative to the U-Net baseline and mitigates the skin effect to a notable extent, contributing to more accurate and reliable inversions.

4.5. Field Example

To verify that the inversion network designed in this paper is useful for processing real data, it is applied to the field gravity anomaly data of the San Nicolas deposit.

The San Nicolas deposit in central Mexico is an unexploited massive sulfide deposit formed by volcanic eruptions [40]. It occurs in the feldspathic magnesian volcanic rock sequence from the Upper Jurassic to the Lower Cretaceous and contains gold, silver, copper, zinc, and other substances [41]. Drilling data show that the top of the sulfide ore body is located 150–220 m below the surface, and the bottom of the ore body extends to 400–450 m below the surface, with a thickness of about 280 m [41]. The residual gravity anomaly map of the study area is shown in Figure 23. The gravity anomaly in the center of the region is caused by massive sulfides, while the other anomalies are caused by mafic volcanic rocks. The residual gravity anomaly reaches up to 2.2 mGal. As can be seen in Table 3, the density of the sulfide is 3.5 $\mathrm{g/cm^{3}}$, and the density of the surrounding rock lies between 2.3 $\mathrm{g/cm^{3}}$ and 2.7 $\mathrm{g/cm^{3}}$ [42]. Therefore, the residual density value of the study area is taken as 1.1 $\mathrm{g/cm^{3}}$.
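The residual density quoted above follows from simple subtraction of a host-rock density from the sulfide density. As a worked check under the assumption that a single representative host density is used, the surrounding-rock range of 2.3 to 2.7 g/cm³ bounds the contrast between 0.8 and 1.2 g/cm³, and the stated value of 1.1 g/cm³ corresponds to a host density of 2.4 g/cm³. That host value is inferred here from the arithmetic and is not stated in the text.

```latex
\Delta\rho = \rho_{\text{sulfide}} - \rho_{\text{host}}, \qquad
3.5 - 2.7 = 0.8\ \mathrm{g/cm^{3}} \;\le\; \Delta\rho \;\le\; 3.5 - 2.3 = 1.2\ \mathrm{g/cm^{3}}, \qquad
\Delta\rho = 3.5 - 2.4 = 1.1\ \mathrm{g/cm^{3}} \ \text{for}\ \rho_{\text{host}} = 2.4\ \mathrm{g/cm^{3}}.
```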

The density prior information was set to 1.1 $\mathrm{g/cm^{3}}$. To enable the inversion network to reflect the shape, location, and physical properties of the underground sulfide in the San Nicolas mining area more accurately, a dataset with a model density of 1.1 $\mathrm{g/cm^{3}}$ was regenerated to retrain the inversion network. Figure 24 shows the predictions of the retrained network for the measured data of the San Nicolas deposit along two vertical profiles, line $AA'$ at $Northing = -400$ m and line $BB'$ at $Easting = -1700$ m; the black line represents the actual outline of the ore body. Figure 24 shows that the density distribution predicted by the proposed network is focused, the predicted position is basically consistent with the drilling information, and the shape and physical boundary of the predicted anomaly source are also consistent with the actual situation. By contrast, although the position predicted by U-Net is consistent with the actual situation, its physical boundary is relatively blurred; in particular, the resolution of the prediction declines with depth, and the prediction at the bottom of the anomaly source differs considerably from the actual situation. Therefore, for the San Nicolas field data, the proposed inversion network has a clear advantage in predicting deep anomalies.

5. Conclusions

In this paper, we propose a 3D gravity inversion method based on U-Net that uses an attention feature fusion mechanism. With U-Net as the basic framework, residual modules replace the convolutional layers, and the feature concatenation that follows the skip connections in U-Net is replaced by the fusion of contextual semantic features through the attention feature fusion module, so that the network can fully learn the features of the input data and avoid feature loss during training. The Dice loss function, which can clearly describe the shape and position of small targets even when the numbers of target and background voxels are unbalanced, is used as the loss function. Compared with the inversion results of the U-Net network, the proposed method has a higher vertical resolution. Ablation experiments show that the attention feature fusion module is the key to improving the vertical resolution and prediction accuracy of the inversion results. Noise experiments show that the inversion network has strong anti-noise ability and good generalization, which lays a foundation for applying it to field data. Applied to the San Nicolas deposit in Mexico, the inversion network delineates the basic position and shape of the sulfide deposit, which indicates a certain generalization performance and makes it a promising tool for subsurface density estimation.

Author Contributions

Methodology, C.C. and Y.Z.; software, C.C.; validation, C.C. and X.J.; resources, J.L.; supervision, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This study was financially supported by the National Science Foundation for Outstanding Young Scholars (No. 42122025) and the National Natural Science Foundation of China (No. 42374174).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data underlying this paper cannot be shared publicly. The data will be shared upon reasonable request to the corresponding author.

Acknowledgments

Special thanks to all the scholars who participated in this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Li, Y.; Oldenburg, D.W. 3-D inversion of gravity data. Geophysics 1998, 63, 109–119.
2. Zhang, M.-H.; Qiao, J.-H.; Zhao, G.-X.; Lan, X.-Y. Regional gravity survey and application in oil and gas exploration in China. China Geol. 2019, 2, 382–390.
3. Gross, L. Weighted cross-gradient function for joint inversion with the application to regional 3-D gravity and magnetic anomalies. Geophys. J. Int. 2019, 217, 2035–2046.
4. Ghalenoei, E.; Dettmer, J.; Ali, M.Y. Trans-dimensional gravity and magnetic joint inversion for 3-D earth models. Geophys. J. Int. 2022, 230, 363–376.
5. Backus, G.E.; Gilbert, J. Numerical applications of a formalism for geophysical inverse problems. Geophys. J. Int. 1967, 13, 247–276.
6. Parker, R.L. Understanding inverse theory. Annu. Rev. Earth Planet. Sci. 1977, 5, 35–64.
7. Zhdanov, M.S. Geophysical Inverse Theory and Regularization Problem; Elsevier: Amsterdam, The Netherlands, 2002; pp. 29–88.
8. Mirjalili, S. Genetic Algorithm; Springer: Berlin, Germany, 2019; pp. 43–55.
9. Kirkpatrick, S.; Gelatt, C.D.; Vecchi, M.P. Optimization by simulated annealing. Science 1983, 220, 671–680.
10. Dorigo, M.; Stützle, T. Ant colony optimization: Overview and recent advances. Handb. Metaheuristics 2019, 272, 311–351.
11. Essa, K.S.; Geraud, Y. Parameters estimation from the gravity anomaly caused by the two-dimensional horizontal thin sheet applying the global particle swarm algorithm. J. Pet. Sci. Eng. 2020, 193, 107421.
12. Ai, H.; Essa, K.S.; Ekinci, Y.L.; Balkaya, Ç.; Géraud, Y. Hunger Games Search optimization for the inversion of gravity anomalies of active mud diapir from SW Taiwan using inclined anticlinal source approximation. J. Appl. Geophys. 2024, 227, 105443.
13. Li, H.; Wang, X.; Ding, S. Research and development of neural network ensembles: A survey. Artif. Intell. Rev. 2018, 49, 455–479.
14. Essa, K.S.; Mehanee, S.A.; Soliman, K.S.; Diab, Z.E. Gravity profile interpretation using the R-parameter imaging technique with application to ore exploration. Ore Geol. Rev. 2020, 126, 103695.
15. Araya-Polo, M.; Jennings, J. Deep-learning tomography. Lead. Edge 2018, 37, 58–66.
16. Yang, F.; Ma, J. Deep-learning inversion: A next-generation seismic velocity model building method. Geophysics 2019, 84, R583–R599.
17. Kattenborn, T.; Leitloff, J.; Schiefer, F.; Hinz, S. Review on Convolutional Neural Networks (CNN) in vegetation remote sensing. ISPRS J. Photogramm. Remote Sens. 2021, 173, 24–49.
18. Li, S.C.; Liu, B.; Ren, Y.X. Deep-learning inversion of seismic data. IEEE Trans. Geosci. Remote Sens. 2020, 58, 2135–2149.
19. Puzyrev, V. Deep learning electromagnetic inversion with convolutional neural networks. Geophys. J. Int. 2019, 218, 817–832.
20. Oh, S.; Noh, K.; Seol, S.J.; Byun, J. Cooperative deep learning inversion of controlled-source electromagnetic data for salt delineation. Geophysics 2020, 85, E121–E137.
21. Jiao, J.; Dong, S.; Zhou, S.; Zeng, Z.; Lin, T. 3-D Gravity and Magnetic Joint Inversion Based on Deep Learning Combined with Measurement Data Constraint. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5900814.
22. Huang, R.; Liu, S.; Qi, R.; Zhang, Y. Deep Learning 3D Sparse Inversion of Gravity Data. J. Geophys. Res. Solid Earth 2021, 126, e2021JB022476.
23. Huang, R.; Zhang, Y.; Vatankhah, S.; Liu, S.; Qi, R. Inversion of large-scale gravity data with application of VNet. Geophys. J. Int. 2022, 231, 306–318.
24. Yang, Q.; Hu, X.; Liu, S.; Jie, Q.; Wang, H.; Chen, Q. 3-D Gravity Inversion Based on Deep Convolution Neural Networks. IEEE Geosci. Remote Sens. Lett. 2022, 19, 3001305.
25. Li, Y.; Chen, S.; Zhang, B.; Li, H. Fast imaging for the 3D density structures by machine learning approach. Front. Earth Sci. 2023, 10, 1028399.
26. Xu, Z.; Wang, R.; Zhdanov, M.S.; Wang, X.; Li, J.; Zhang, B.; Wang, Y. Inversion of the Gravity Gradiometry Data by ResUnet Network: An Application in Nordkapp Basin, Barents Sea. IEEE Trans. Geosci. Remote Sens. 2023, 61, 4502410.
27. Lv, M.Z.; Zhang, Y.J.; Liu, S. Fast forward approximation and multitask inversion of gravity anomaly based on UNet3+. Geophys. J. Int. 2023, 234, 972–984.
28. Wang, R.; Ding, Y.; Xu, Z.; Zhdanov, M.S.; Xian, M.; Zhang, Y.; Li, J.; Jiang, C.; Guo, Z. Employing MS-UNets Networks for Multiscale 3-D Gravity Data Inversion: A Case Study in the Nordkapp Basin, Barents Sea. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4502813.
29. Zhang, L.; Zhang, G.; Liu, Y.; Fan, Z. Deep learning for 3-D inversion of gravity data. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5905918.
30. Rezaie, M.; Moradzadeh, A.; Kalate, A.N. Fast 3D focusing inversion of gravity data using reweighted regularized Lanczos bidiagonalization method. Pure Appl. Geophys. 2017, 174, 359–374.
31. Zhang, L.; Zhang, G.; Fan, Z.; Ma, J. A Multitask Deep Learning for Simultaneous Denoising and Inversion of 3-D Gravity Data. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5923117.
32. Tikhonov, A.N.; Arsenin, V.Y. Solutions of Ill-Posed Problems; Wiley: Hoboken, NJ, USA, 1977; pp. 1–30.
33. Li, Y.; Jia, Z.; Lu, W. Self-Supervised Deep Learning for 3D Gravity Inversion. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5924311.
34. Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015, Proceedings of the 18th International Conference, Munich, Germany, 5–9 October 2015; Springer International Publishing: Cham, Switzerland, 2015; pp. 234–241.
35. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778.
36. Dai, Y.; Gieseke, F.; Oehmcke, S.; Wu, Y.; Barnard, K. Attentional Feature Fusion. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Snowmass Village, CO, USA, 1–5 March 2020; pp. 3559–3568.
37. Li, Z.; Yao, C.; Zheng, Y.; Wang, J.; Zhang, Y. 3D magnetic sparse inversion using an interior-point method. Geophysics 2018, 83, J15–J32.
38. Milletari, F.; Navab, N.; Ahmadi, S.-A. V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation. In Proceedings of the 2016 Fourth International Conference on 3D Vision (3DV), Stanford, CA, USA, 25–28 October 2016; pp. 565–571.
39. Li, X.; Sun, X.; Meng, Y.; Liang, J.; Wu, F.; Li, J. Dice Loss for Data-imbalanced NLP Tasks. Annu. Meet. Assoc. Comput. Linguist. 2019.
40. Phillips, N.; Oldenburg, D.; Chen, J.; Li, Y.; Routh, P. Cost effectiveness of geophysical inversions in mineral exploration: Applications at San Nicolas. Lead. Edge 2001, 20, 1351–1360.
41. Johnson, B.J.; Montante-Martinez, A.; Canela-Barboza, M.A.R.I.O.; Danielson, T.J.; Sherlock, R.; Logan, M.A.V. Geology of the San Nicolas deposit, Zacatecas, Mexico, VMS Deposits of Latin America: Geological Association of Canada. Miner. Depos. Div. Spec. Publ. 2000, 2, 71–85.
42. Vassallo, L.F.; Aranda-Gómez, J.J.; Solorio-Munguía, J.G. Hydrothermal alteration of volcanic rocks hosting the Late Jurassic-Early Cretaceous San Nicolas VMS deposit, southern Zacatecas, Mexico. Rev. Mex. De Cienc. Geológicas 2015, 32, 254–272.

Figure 1. Schematic diagram of gravity anomaly calculation: (a) arbitrary geological body, (b) cuboid model.

Figure 2. Inversion network structure diagram.

Figure 3. AFF structure.

Figure 4. Structure of the MS-CAM.

Figure 5. Density models of the GravInv dataset: (a) rectangular prism model, (b) inclined dike model, (c) vertical pinch-out model, (d) vertical parallel prism model, (e) syncline model, (f) fault model, (g) single random model, (h) combination random model.

Figure 6. Training loss curve of the inversion network.

Figure 7. Comparison of the theoretical model and prediction model for the inclined levee model: (a) theoretical model in the validation set; (b) prediction model obtained through network inversion; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 8. Comparison of the theoretical model and prediction model for the syncline model: (a) theoretical model in the validation set; (b) prediction model obtained through network inversion; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 9. Comparison of the theoretical model and prediction model for the stochastic model: (a) theoretical model in the validation set; (b) prediction model obtained through network inversion; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 10. Gravity anomaly with 5% Gaussian noise added: (a1) is the gravity anomaly diagram of the cube model, (a2) is the gravity anomaly diagram of the cube model after adding 5% Gaussian noise, (b1) is the gravity anomaly diagram of the vertical parallel prism model, (b2) is the gravity anomaly diagram of the vertical parallel prism model after adding 5% Gaussian noise.

Figure 11. Comparison of the theoretical model and prediction model for the inclined levee model (noise experiment): (a) theoretical model in the validation set; (b) prediction model obtained by the inversion network trained on noisy data; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 12. Comparison of the theoretical model and prediction model for the syncline model (noise experiment): (a) theoretical model in the validation set; (b) prediction model obtained by the inversion network trained on noisy data; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 13. Comparison of the theoretical model and prediction model for the stochastic model (noise experiment): (a) theoretical model in the validation set; (b) prediction model obtained by the inversion network trained on noisy data; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 14. Comparison of the theoretical model and prediction model for the inclined levee model (U-Net): (a) theoretical model in the validation set; (b) prediction model obtained by U-Net inversion; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 15. Comparison of the theoretical model and prediction model for the syncline model (U-Net): (a) theoretical model in the validation set; (b) prediction model obtained by U-Net inversion; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 16. Comparison of the theoretical model and prediction model for the stochastic model (U-Net): (a) theoretical model in the validation set; (b) prediction model obtained by U-Net inversion; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 17. Comparison of the theoretical model and prediction model for the inclined levee model (ablation experiment): (a) theoretical model in the validation set; (b) prediction model obtained by the inversion network without the AFF module; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 18. Comparison of the theoretical model and prediction model for the syncline model (ablation experiment): (a) theoretical model in the validation set; (b) prediction model obtained by the inversion network without the AFF module; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 19. Comparison of the theoretical model and prediction model for the stochastic model (ablation experiment): (a) theoretical model in the validation set; (b) prediction model obtained by the inversion network without the AFF module; (c) vertical section of the geological body, with the theoretical model on the left and the prediction model on the right.

Figure 20. Comparison of neural network inversion profiles at different depths for the inclined levee model: (a1) theoretical model at 400 m and (b1) prediction model inverted by the neural network at 400 m; (a2) theoretical model at 600 m and (b2) prediction model inverted by the neural network at 600 m.

Figure 21. Comparison of neural network inversion profiles at different depths for the syncline model: (a1) theoretical model at 400 m and (b1) prediction model inverted by the neural network at 400 m; (a2) theoretical model at 600 m and (b2) prediction model inverted by the neural network at 600 m.

Figure 22. Comparison of neural network inversion profiles at different depths for the stochastic model: (a1) theoretical model at 400 m and (b1) prediction model inverted by the neural network at 400 m; (a2) theoretical model at 600 m and (b2) prediction model inverted by the neural network at 600 m.

Figure 23. Residual gravity anomaly map and geological section: (a) residual gravity anomaly map of the SAN Nicholas deposit, including AA’ and BB’, which represent two drilling lines, and (b) is the AA’ line drilling geological section [40].

Figure 24. Inversion profiles of the SAN Nicolas deposit: (a) prediction results of the inversion network in this paper; (b) prediction results of U-Net; (Ⅰ) $\mathrm{Easting} = -1700$ m; (Ⅱ) $\mathrm{Northing} = -400$ m. The black line is the actual outline of the ore body.

Table 1. Comparison of prediction errors of the U-Net, inversion network, noise experiment, and ablation experiment.

	U-Net	Our Method (W/O AFF)	Our Method (Add Noise)	Our Method
MAE	0.0074	0.0175	0.0114	0.0072
E_m	0.3648	1.9173	0.4649	0.3605
R²	0.9447	0.2859	0.9162	0.9813

The bold font in the table represents the numerical values of the evaluation indicators corresponding to the model with the best reproduction effect.
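
For readers who want to reproduce metrics of the kind reported in Table 1, the following is a minimal sketch of MAE and R² computed between a theoretical model and a predicted model. It assumes the standard textbook definitions and flattened lists of cell values; the function names `mae` and `r_squared` and the toy values are hypothetical, and the paper's own formulas in Section 4.3 (including the definition of E_m, which is not sketched here) may differ in normalization.

```python
def mae(true_vals, pred_vals):
    """Mean absolute error between two equal-length sequences of cell values."""
    if len(true_vals) != len(pred_vals) or not true_vals:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(true_vals, pred_vals)) / len(true_vals)


def r_squared(true_vals, pred_vals):
    """Coefficient of determination: 1 - SS_res / SS_tot (standard definition)."""
    if len(true_vals) != len(pred_vals) or not true_vals:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = sum(true_vals) / len(true_vals)
    ss_res = sum((t - p) ** 2 for t, p in zip(true_vals, pred_vals))
    ss_tot = sum((t - mean_true) ** 2 for t in true_vals)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined for a constant reference model")
    return 1.0 - ss_res / ss_tot


# Toy example (hypothetical values, not data from the paper):
# a flattened four-cell density model and a slightly imperfect prediction.
theoretical = [0.0, 0.0, 1.0, 1.0]
predicted = [0.0, 0.1, 0.9, 1.0]

print("MAE:", mae(theoretical, predicted))
print("R^2:", r_squared(theoretical, predicted))
```

A three-dimensional model grid would be flattened to a one-dimensional sequence before being passed to these functions; lower MAE and R² closer to 1 indicate a closer match to the theoretical model.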

Table 2. $\bar{E}_{acc}$ of the U-Net, inversion network, and noise-added experiment.

	U-Net	Our Method (Add Noise)	Our Method
Inclined levee model	93.51%	94.92%	96.43%
Syncline model	94.81%	95.07%	96.47%
Stochastic model	93.09%	92.68%	92.71%

The bold font in the table represents the numerical values of the evaluation indicators corresponding to the model with the best reproduction effect.

Table 3. Physical properties of major rock units in the SAN Nicolas deposit.

Rock Type	Density (g/cm³)
Tertiary breccia	2.3
Mafic volcanics	2.7
Sulphide	3.5
Quartz rhyolite	2.4
Graphitic mudstone	2.1

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
