DACLnet: A Dual-Attention-Mechanism CNN-LSTM Network for the Accurate Prediction of Nonlinear InSAR Deformation

Lu, Junyu; Wang, Yuedong; Zhu, Yafei; Liu, Jingtao; Xu, Yang; Yang, Honglei; Wang, Yuebin

doi:10.3390/rs16132474

Open AccessArticle

DACLnet: A Dual-Attention-Mechanism CNN-LSTM Network for the Accurate Prediction of Nonlinear InSAR Deformation

by

Junyu Lu

¹,

Yuedong Wang

^1,*

,

Yafei Zhu

²,

Jingtao Liu

¹,

Yang Xu

³,

Honglei Yang

¹

and

Yuebin Wang

¹

School of Land Science and Technology, China University of Geosciences, Beijing 100083, China

²

School of Engineering and Technology, China University of Geosciences, Beijing 100083, China

³

School of Information Engineering, China University of Geosciences, Beijing 100083, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2024, 16(13), 2474; https://doi.org/10.3390/rs16132474

Submission received: 9 May 2024 / Revised: 29 June 2024 / Accepted: 4 July 2024 / Published: 5 July 2024

(This article belongs to the Special Issue SAR Data Processing and Applications Based on Machine Learning Method)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Nonlinear deformation is a dynamically changing pattern of multiple surface deformations caused by groundwater overexploitation, underground coal mining, landslides, urban construction, etc., which are often accompanied by severe damage to surface structures or lead to major geological disasters; therefore, the high-precision monitoring and prediction of nonlinear surface deformation is significant. Traditional deep learning methods encounter challenges such as long-term dependencies or difficulty capturing complex spatiotemporal patterns when predicting nonlinear deformations. In this study, we developed a dual-attention-mechanism CNN-LSTM network model (DACLnet) to monitor and accurately predict nonlinear surface deformations precisely. Using advanced time series InSAR results as input, the DACLnet integrates the spatial feature extraction capability of a convolutional neural network (CNN), the advantages of the time series learning of a long short-term memory (LSTM) network, and the enhanced focusing effect of the dual-attention mechanism on crucial information, significantly improving the prediction accuracy of nonlinear surface deformations. The groundwater overexploitation area of the Turpan Basin, China, is selected to test the nonlinear deformation prediction effect of the proposed DACLnet. The results demonstrate that the DACLnet accurately captures developmental trends in historical surface deformations and effectively predicts surface deformations for the next two months in the study area. Compared to traditional LSTM and CNN-LSTM methods, the root mean square error (RMSE) of the DACLnet improved by 85.09% and 68.57%, respectively. These research results can provide crucial technical support for the early warning and prevention of geological disasters and can serve as an effective alternative tool for short-term ground subsidence prediction in areas lacking hydrogeological and other related data.

Keywords:

dual-attention mechanism; DACLnet; time series InSAR; nonlinear deformation; deformation prediction

1. Introduction

Nonlinear InSAR (interferometric synthetic aperture radar) deformation refers to the measurement and analysis of land surface deformations that deviate from simple, predictable linear trends. Such nonlinear deformations arise from the complex interplay of dynamic forces affecting surface or subsurface structures, exhibiting patterns that change in unpredictable ways over time. They can be triggered by various factors, including the overextraction of groundwater, the exploitation of underground resources, landslides, urban construction activities, earthquakes, and underground mining operations [1,2,3]. Moreover, they pose severe and even catastrophic risks to ground structures, potentially leading to geological disasters such as ground collapse and large-scale landslides [4]. These phenomena can profoundly impact ecological systems, society, and economies; therefore, it is imperative to predict them accurately, promptly, and efficiently. Geodetic survey methods enable stability surveys and the monitoring of surface deformations with early detection capabilities. Meanwhile, deep learning techniques can learn from monitored surface deformation data for analysis and interpretation purposes while predicting future trends in surface deformations [5]; therefore, integrating geodetic survey methods with deep learning to achieve high-precision monitoring and the accurate prediction of nonlinear surface deformation is a vital frontier issue in contemporary geodesy.

Traditional ground deformation measurement methods, like leveling and GNSS, can achieve high-precision in situ monitoring; however, their spatial resolution is generally low, and they are both time-consuming and labor-intensive. In contrast, interferometric synthetic aperture radar (InSAR) technology has emerged as a crucial technique for monitoring surface deformation due to its advantages, including its high spatial resolution, comprehensive coverage, high monitoring accuracy, and facilitation of efficient, noncontact measurements. It has demonstrated remarkable achievements in various fields, including earthquakes [6,7,8], volcanoes [9,10,11], landslides [12,13,14], urban infrastructure monitoring [15,16,17], mining subsidence monitoring [18,19], and frozen ground deformation monitoring [20,21]. With the increasing abundance of spaceborne synthetic aperture radar (SAR) satellites in recent years [22] and the maturation of InSAR technology [23,24,25], it will play an even more important role in surveying and investigating geological disasters; however, due to the fixed revisit cycle of spaceborne SAR satellites, achieving the dynamic monitoring and timely warning of nonlinear surface deformations is difficult.

In the past decade, deep learning technology has been extensively utilized in time series data analyses [26], showcasing remarkable capabilities in specialized areas such as surface deformation monitoring [27]. Among these techniques, convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have gained widespread adoption in imaging geodesy owing to their excellent performance in image processing and sequence data analyses [28,29]. For example, some researchers have employed convolutional neural network (CNN) technology to detect subtle deformations in volcanoes and successfully predict short-term InSAR deformation maps [29,30]. Furthermore, Prabhakar et al. demonstrated outstanding predictive performance by effectively predicting InSAR time series deformation maps using a multiscale attention-directed RNN method [31]. Meanwhile, as an improved version of RNNs, long short-term memory (LSTM) networks have demonstrated unparalleled superiority in this field [32,33]. Several research teams have validated their efficiency and accuracy in sequence prediction tasks. For example, Chen et al. employed LSTM models to simulate and predict the InSAR deformation time series of Beijing Capital International Airport and compared them with other benchmark models such as multilayer perceptron (MLP) and RNNs; LSTM yielded more accurate outcomes [34]. Similarly, Bao et al. constructed a ground deformation prediction model based on LSTM, specifically for the short-term prediction of severe deformations in the SPIA area of Shanghai Pudong International Airport, successfully revealing the spatiotemporal evolution pattern of ground deformation in the SPIA reclamation area [35].

However, traditional deep learning models have certain limitations when facing long-term sequence dependencies and complex spatiotemporal pattern analyses. Although CNNs perform well in handling various types of data, especially in extracting local features, their inherent limitations in convolutional operations make it challenging to capture long-distance dependencies in time series data [36]. On the other hand, while RNN architectures are designed explicitly for ordered data streams, they often encounter the problem of vanishing or exploding gradients when dealing with long-term dependencies [37,38]. Furthermore, LSTM has greatly improved the issue of long-term memory retention and has achieved significant results in various sequence prediction tasks [32,33,34,35]; however, due to the design of LSTM internal units, the output is constrained within a specific numerical range. Even in the multilayer structure, gradient disappearance is still possible. Constructing large and deep LSTM networks may risk overfitting the model [39,40]. Due to the inherent limitations of the models above, they may face obstacles when addressing prediction tasks involving complex and long-term spatiotemporal evolution characteristics of surface nonlinear deformation. In recent years, transformer models have been proposed as an innovative sequence modeling technique and have gained rapid popularity [41]. With their unique self-attention mechanism, these models can efficiently capture both short-term and long-term dependencies in sequence data, thereby avoiding the vanishing or exploding gradient problem encountered by traditional RNNs when dealing with long-range dependencies.

Despite the extensive utilization of CNN or LSTM models in existing studies for surface deformation prediction on InSAR time series data, there remains a lack of research on quantitatively evaluating and predicting complex nonlinear surface deformation using attention mechanism methods. Although some success has been achieved, research has largely focused on relatively simple deformation patterns. These models typically lack a deep understanding of and precise prediction capabilities for complex nonlinear surface deformation dynamics, especially in large-scale subsidence areas with nonlinear characteristics. Additionally, despite the growing body of deep learning research on subsidence prediction, practical predictive applications for large-area surface deformation are still relatively rare. This is mainly because large-area predictions require models not only to process large amounts of data but also to effectively integrate information across different temporal and spatial scales. Considering the spatiotemporal characteristics of complex nonlinear surface deformation, this study draws inspiration from transformers’ attention mechanism and innovatively constructs a CNN-LSTM model called the DACLnet by integrating dual-attention mechanisms to more accurately simulate and predict surface nonlinear deformation. By integrating dual-attention mechanisms into the CNN-LSTM network and combining it with advanced interferometric point target analysis (IPTA) InSAR technology [42], the DACLnet enhances the focus on crucial nonlinear information, enabling the refined monitoring and accurate prediction of surface nonlinear deformation. Finally, taking the oasis area in the Turpan Basin, Xinjiang, as the experimental region, the DACLnet is employed to predict and test surface nonlinear deformation caused by periodic groundwater exploitation.

2. Methodology

The method mainly includes the following steps: (a) Use advanced IPTA-InSAR technology during the data acquisition stage to obtain the ground deformation time series. (b) Design and construct the DACLnet model. (c) Divide the dataset into training sets and test sets for model training and optimization.

2.1. Deformation Signals Monitored Using Advanced IPTA-InSAR

Permanent scatterer interferometric synthetic aperture radar (PS-InSAR) technology, with its high precision in monitoring land surface deformations, has been widely applied in various surface monitoring scenarios. This technique mainly relies on permanent scatterers (PS points) on the surface, such as artificial buildings and exposed rocks. By analyzing the differences in radar echo signals from these points at different times, it can accurately measure minute deformations of the ground or buildings, with precision up to the millimeter level.

However, PS-InSAR technology faces certain applicational limitations in nonurban areas, especially in regions with dense vegetation and complex terrain. In these areas, due to the scarcity of suitable targets that can serve as PS points, traditional PS-InSAR methods struggle to obtain a sufficient density of PS points for effective monitoring.

To overcome this limitation, we use an improved IPTA-InSAR method [42,43] to monitor nonlinear deformations, thereby enhancing the anti-interference ability of InSAR against spatial–temporal incoherence errors in monitoring geological hazard-prone areas with dense vegetation coverage and increasing the number of effectively monitored points. The data processing flowchart of this method is shown in Figure 1. Specifically, the improved method introduces differential interferometric SAR (DInSAR) based on multiview processing in the traditional PS selection process. This is carried out by processing single-look complex (SLC) images through multiview processing. Subsequently, we set appropriate spatiotemporal baseline thresholds to obtain sufficient interferometric pairs. Then, PS points are selected based on the coherence of the differential interferogram. The differential phase,

δ φ_{i, j}

, corresponding to two adjacent point targets,

i, j

, is expressed as follows:

δ φ_{i, j} = \frac{4 π}{λ} t ∆ υ_{i, j} + \frac{4 π}{λ} \frac{B_{⊥} {∆ z}_{i, j}}{R s i n θ} + ∆ φ_{i, j, r e s}

(1)

In the formula,

∆ υ_{i, j}

represents the linear deformation rate between two point targets;

{∆ z}_{i, j}

represents the elevation residual between two point targets;

t

is the time interval;

R, B_{⊥}

,

θ

, and

λ

are the orbital parameters of the satellite; and

∆ φ_{i, j, r e s}

represents the differential residual phase between two point targets, primarily including the differences in the atmospheric phase, noise phase, and nonlinear deformation phase between the two points.

During the point selection process, the analysis is focused on the points with low spectral diversity [44] and low amplitude dispersion [45]. In monitoring regions with significant terrain variation, InSAR is considerably affected by terrain-related atmospheric delay errors; therefore, before the regression analysis, we used an iterative removal method for elevation-correlated atmospheric delay signals [42,46] to mitigate their influence on subsequent computations while suppressing this signal in highly coherent point targets within the unwrapped interferogram. Subsequently, we used multivariate linear regression to determine linear deformation, topographic errors, and residual phases.

After linear regression, we employed singular value decomposition (SVD) to extract the original time series displacement. The residual phase obtained after removing linear deformation primarily includes nonlinear deformation, atmospheric delays, and phase noise. As atmospheric delays are highly correlated spatially but less so temporally, and phase noise is random in both space and time, we utilize temporal high-pass filtering and spatial low-pass filtering to suppress atmospheric delays. Ultimately, both linear and nonlinear deformations contribute to the final time series displacement, resulting in the observation of temporal surface deformations.

2.2. CNN-LSTM Model Embedded within Dual-Attention Mechanisms

The DACLnet is a CNN-LSTM network model that integrates a dual-attention mechanism, combining the spatial feature extraction capability of a convolutional neural network (CNN) with the time series processing advantage of long short-term memory (LSTM). The structure diagram of the DACLnet model is depicted in Figure 2. Additionally, it introduces a dual-attention mechanism to effectively focus on recognizing and utilizing features in complex spatiotemporal data. By collaboratively applying the dual-attention mechanism within both the CNN and LSTM layers, the model not only emphasizes crucial local and global information but also identifies and enhances key spatial details and moments in the time series. This significantly reduces interference from irrelevant information and markedly improves the model’s ability to predict complex patterns and sequence data. Furthermore, the dual-attention mechanism grants the DACLnet exceptional adaptability, allowing it to flexibly adjust its focus in response to various types of nonlinear surface deformations, thus enhancing its generalization capability and robustly predicting unknown data. These features make the DACLnet highly effective in the precise prediction of nonlinear surface deformations, demonstrating its capability to efficiently handle and analyze complex data.

In this model, the CNN layer is primarily responsible for extracting the task of local spatial features from the input data. Specifically, the data are fed into a convolutional neural network (CNN), which effectively captures complex information across various spatial dimensions through its multilayered convolutional structure, subsequently transforming it into high-dimensional feature vectors that can be understood and captured using machine learning models. The operation of its convolutional layer can be expressed as follows [47]:

y [i] = b + \sum_{j = 0}^{W - 1} x [i + j] \cdot h [j]

(2)

where

W

represents the width of the convolutional kernel (window size),

b

denotes the bias term,

i

is the index of the output sequence,

j

refers to the index of elements in the convolutional kernel,

x [i + j]

signifies the value of the input sequence at the position of

i + j

, and

h [j]

indicates the weight of the convolutional kernel at the position of

j

. This method not only leverages the local information of time series data but also enhances model sensitivity to temporal changes by covering the complete annual cycle with a sliding window, thereby improving prediction accuracy and model generalization ability.

After the CNN successfully extracts the local spatial features of the time series data, it is essential to comprehend the complex dependencies among different time windows or sequences. We employ a multi-head attention mechanism to explore and integrate potential global dependencies between these features. This mechanism transforms these data into a high-quality feature vector set reflecting the overall structural characteristics, meticulously reflecting the overall structural characteristics. Operating on different data segments in parallel, the framework proficiently captures the intricate relationships and interactions among these segments. This parallel processing capability allows the mechanism to analyze complex interactions and correlations both among different time windows and across various sequences. By simultaneously focusing on different time windows or sequences of data in parallel, this mechanism yields crucial insights into the inherent correlations and the inter-relationships among external sequences within the same subsidence time series. This enhanced understanding improves our predictive accuracy and deepens our comprehension of dynamic geological changes.

For each attention head,

h

(a total of H heads), the Q, K, and V are calculated individually:

\begin{array}{l} Q_{h} = X W_{Q, h} \\ K_{h} = X W_{K, h} \\ V_{h} = X W_{V, h} \end{array}

(3)

Then, the attention score is computed and subsequently normalized:

A_{h} = s o f t m a x (\frac{Q_{h} K_{h}^{T}}{\sqrt{d_{k, h}}})

(4)

Finally, the attention output of each head is obtained:

C_{h} = A_{h} V_{h}

(5)

The outputs of all heads are combined and the ultimate output is derived through a fully connected layer:

C = C o n c a t (C_{1}, C_{2}, \dots, C_{H}) W_{O}

(6)

where

X

denotes the input sequence;

W_{Q, h}, W_{K, h}, W_{V, h}

represent the weight matrices of different heads;

W_{O}

corresponds to the weight matrix for the merged fully connected layer; and

d_{k, h}

signifies the dimension of the key vector.

Although introducing a multi-head attention mechanism can significantly improve the comprehensiveness of feature expression, it may also lead to an exponential increase in the number of feature vectors. Certain data may contribute minimally to the final deformation prediction accuracy or even be redundant; therefore, to ensure computational efficiency and effective resource utilization, we incorporate gating units into LSTM for feature selection and filtering [48]. These units can intelligently identify and eliminate the features with negligible influence on the prediction results, thus simplifying the feature vector set. In LSTM, the “discarding” process is achieved through a forget gate operation, which determines which parts of the cell state should be “forgotten”:

f_{t} = σ (W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f})

(7)

where

f_{t}

represents the retention degree and is the output of the forget gate at the current time step, with a value range between 0 and 1;

σ

is the sigmoid function used to generate activation values between 0 and 1;

W_{f}

is the weight matrix of the forget gate;

b_{f}

is the bias term of the forget gate; and

h_{t - 1}

is the hidden state from the previous time step and the input at the current time step. The “retention” is accomplished by combining input gates and new candidates’ cell states, which selectively add new information to the cell state.

Subsequently, the feature vectors filtered via LSTM gating units undergo additional recalculations through an attention mechanism. This framework is designed to extract and analyze temporal relationships critical for processing time series data. As a specialized type of recurrent neural network (RNN), LSTM is well equipped to handle long-term dependencies through its internal memory units, which effectively store and transmit relevant information. However, challenges such as vanishing or exploding gradients can still arise, particularly in deeper network architectures. To mitigate these issues, the DACLnet integrates attention strategies within the LSTM layer, which enables the model to focus selectively on the most significant segments of the input sequences at different timesteps. This precise focus improves the accuracy of capturing long-term dependencies in time series data, significantly boosting the model’s predictive accuracy. This process refines a more compact and highly abstract representation of global relationships:

A = s o f t m a x (\frac{Q K^{T}}{\sqrt{d_{k}}})

(8)

where

d_{k}

is the dimension of the key vector.

This batch of optimized feature vectors can be fed into the encoding and decoding layers for deep learning processing. Following deep fusion and mapping through the fully connected layer, the model ultimately generates accurate predictions of surface deformation, thereby achieving the efficient utilization of InSAR time series data while ensuring robustness and accuracy in prediction performance.

2.3. Network Training

Prior to inputting the InSAR time series into the DACLnet model for formal training, we initially filter and preprocess the original time series dataset. The filtering stage includes two steps: (1) eliminating weak subsidence points to exclude data with insignificant changes that may affect the efficiency of model learning and result accuracy and (2) implementing data dilution processing to reduce the overall dataset size, thereby decreasing computational load during processing and training while retaining crucial information for maintaining the integrity and effectiveness of the analysis, ultimately ensuring accurate model training and improving training efficiency. Subsequently, in the data preprocessing stage, the sliding window method is used to extract the time series features from the processed dataset. The size of the sliding window is generally selected to cover a certain time range (e.g., one year), enabling the capture of the periodic and seasonal changes of deformation. Through this method, the subsequences generated by each window can reflect the important features and dynamic variations within the specified timeframe and provide abundant input samples for model training.

During the model training stage, the data are divided into training and testing sets. The former are used for model training, while the latter are employed to evaluate the model’s learning and generalization capability. The root mean square error (RMSE) serves as the loss function for model training, effectively measuring the difference between predicted and actual values. In terms of parameter optimization, the Adam optimizer [49] is adopted, which adaptively adjusts the learning rate and optimizes model parameters based on the gradient’s first-order and second-order moment estimates. Dropout techniques [50] are introduced during model training to prevent overfitting, and cross-validation is used to ensure the stability and generalization of the model across different data subsets. The model constantly learns and adjusts through iterative training to optimize its predictive capacity for dynamic deformation time series.

3. Study Area and Data Processing

3.1. The Study Area

The Turpan Basin is located in the eastern part of the Xinjiang Uygur Autonomous Region, China, adjacent to the Hami depressions in the southern region of the eastern Tianshan Mountains, as shown in Figure 3. It possesses a unique geographical location and exhibits a diverse ecological environment. With an extensive geological history, this basin has undergone multiple geological periods and accumulated abundant rock sequences. Its strata are characterized by substantial sedimentary material and possess significant mineral resources, such as oil and natural gas [51].

The region has a typical inland arid climate, featuring low annual precipitation and high evaporation rates, resulting in significant water evaporation and salt accumulation. The scarcity of local water resources primarily stems from the aridity of the climate. Water resources in the basin mainly rely on the melting snow and ice from the Tianshan Mountains and groundwater extraction. The unique geological hydrological features have given rise to this area’s distinctive landforms and ecosystems. Notably, Aiding Lake, situated at an altitude of −155 m [52], represents the lowest point in China.

The Turpan Basin has developed agriculture and is a crucial production area for grain and cash crops in Northwest China. It is the oldest wine-producing region in China and currently serves as the largest grape and Hami melon production base. Moreover, this area is adorned with thousands of kilometers of traditional water conservancy facilities known as “Karez”, which represent a form of unique, ancient hydraulic engineering designed to combat arid environments; however, with escalating agricultural water demands coupled with declining groundwater levels, there is an alarming prevalence of seasonal overextraction of groundwater for farmland irrigation purposes. Consequently, this unsustainable practice has resulted in prolonged nonlinear surface deformation and disrupted the delicate ecological hydrological balance within the region.

3.2. InSAR Datasets

From the Sentinel-1 satellite, we collected ascending/landing InSAR datasets encompassing data from the Turpan Basin between March 2015 and April 2020 (Figure 3). Table 1 presents the basic parameters of both the ascending and descending SAR image datasets. The AT41F135 image set was observed from 25 March 2015 to 27 April 2020. The DT121F449 image set was observed from 19 March 2015 to 27 April 2020. The spatial coverage and temporal span of both the ascending and descending orbit data exhibit substantial consistency, which ensures the uniformity of the spatiotemporal reference for deformation results as well as accuracy verification among outcomes.

3.3. Data Processing

The improved IPTA-InSAR technique is employed in this study to sequentially process ascending and descending InSAR datasets, with the results uniformly encoded into the geographic coordinate system of an external DEM. Firstly, we establish the thresholds for temporal and spatial baselines and construct an interferometric pair network based on a small baseline set (Figure 4). Then, based on the minimum cost flow (MCF) phase unwrapping technique [53], “two-pass” DInSAR processing is applied to each interferometric pair with multiple viewing numbers of 20:4 [54,55]. A shuttle radar topography mission (SRTM) digital elevation model (DEM) with a resolution of 30 m [56] was employed to remove the topographic phases. In the DInSAR data processing, point targets with a coherence lower than 0.3 were eliminated. The points with low spectral diversity [44] and low amplitude dispersion [45] were selected for the IPTA analysis. We employed a window-iterative estimation method to mitigate local topographically correlated atmospheric delay errors [42,46]. Finally, the time series deformation results were obtained for each map dataset, as shown in Figure 5.

4. Results

4.1. Monitoring Results

4.1.1. Deformation in the Turpan Basin

We used the improved IPTA-InSAR technique described in Section 2.1. to process the ascending and descending Sentinel-1 satellite data from March 2015 to April 2020. We obtained the time series deformation within the study area (Figure 5). As depicted in Figure 4, the InSAR datasets for rail ascension and rail descent exhibited excellent consistency in their monitoring outcomes, revealing substantial subsidence across the oasis region of the Turpan Basin, especially within the plain oasis area located south of the Flaming Mountains fault zone. To illustrate the temporal–spatial development characteristics of deformation, we selected two feature points, P1 and P2, from regions displaying significant deformation monitored via the ascending and descending rails, respectively (Figure 5c,d). It is evident that both ascending and descending orbit-based time series deformations demonstrate consistent patterns and magnitudes of change. Over the monitoring period, point P1 experienced an accumulated subsidence approaching 500 mm, while point P2 exhibited a cumulative subsidence exceeding 150 mm. The time series deformation reveals an overall subsidence trend in this area with notable nonlinear variation characteristics.

From the spatial and temporal characteristics of deformation, it can be seen that the land subsidence in the oasis area near point P1, closer to the southern Flaming Mountains fault zone, exhibits significantly greater magnitude compared to its vicinity at point P2. This region predominantly comprises extensive farmlands. The water source for agricultural irrigation in the Turpan Basin mainly relies on the melting water from the Tianshan glaciers and precipitation; however, due to the obstruction of the Flaming Mountains fault zone, the effective supplementation of the surface runoff and aquifers in the southern area of the Flaming Mountains fault zone with the Tianshan water source becomes challenging, resulting in a high dependence on groundwater in the agricultural irrigation. Consequently, the continuous overexploitation of aquifers occurs periodically. Consequently, the local land surface shows an overall subsidence trend and exhibits periodic accelerated subsidence characteristics consistent with the agricultural planting and irrigation cycles [57].

4.1.2. Reliability Evaluation of InSAR Results

After preprocessing the ascending data, we obtained a total of 574,662 cumulative subsidence deformation time series, and after similarly processing the descending data, we obtained 596,863 series. To quantitatively evaluate the accuracy of the monitoring results, we converted the deformation velocity results independently calculated from ascending and descending orbits into the vertical direction [58]. Subsequently, we plotted the distribution statistics of the ascending and descending rates, as shown in Figure 6. By comparing the histograms of the ascending rates (blue graph a) with the descending rates (red graph b), we observed that the overall distributions of the two datasets are fairly similar; however, the ascending rates have a larger proportion of data near zero, while the descending data exhibit more significant extreme values.

As shown in Figure 6, the differences in deformation monitoring results between ascending and descending tracks can be attributed to their distinct geometric configurations and atmospheric conditions. To further evaluate and verify the reliability of the data solution results and the effectiveness of our processing methods, we filtered for homonymous points based on latitude and longitude, ultimately selecting 45,002 homonymous points. This selection was followed by the creation of a comparative plot (Figure 6c). The RMSE between AT41F135 and DT121F449 results is 5.7 mm/yr. By performing statistical analysis and fitting procedures, we obtained a linear regression equation of

y = 1.1 x + 0.94

, with a coefficient of determination R² of 0.97. Notably, most data points exhibit differences that are less than three times the RMSE (between the black dashed lines in Figure 6c). These findings demonstrate the reliability of the InSAR monitoring results in our study.

4.2. DACLnet Results

4.2.1. Network Training Results

According to the training method described in Section 2.3., we filtered out points with cumulative subsidence of less than 10 mm from the original InSAR deformation time series dataset, resulting in a large-scale deformation dataset containing 574,662 deformation time series. To improve the accuracy and efficiency of model training and prediction, we employed an equidistant data sparsification strategy (interval set to 100) to obtain a dataset containing 5748 deformation time series. Considering the 12-day revisit period of the Sentinel-1 satellite, we input data with a sliding window size of 30 to cover approximately one year of InSAR deformation time series. This window configuration allowed us to better capture the dynamic changes in nonlinear deformation on the Earth’s surface within each window over a continuous year while effectively preserving the seasonality of and annual trends in the data. Subsequently, we randomly divided the initial 70% of the 5748 deformation time series as the training set. We allocated the remaining 30% for testing to ensure a balance between model training accuracy and efficiency. This division ensures that the model can learn key feature patterns of nonlinear surface deformations from a sufficient number of training samples, while the testing set is used to independently assess the model’s generalization performance. This ratio of splitting the training and testing sets follows widely accepted best practices in the field of deep learning, aiming to balance the adequacy and effectiveness of model training with the assessment of its generalization capabilities. Finally, we trained and tested the constructed DACLnet model using the abovementioned dataset.

During model training (as shown in Figure 7), the model is determined to be fully trained based on a downward trend in the loss function. When the loss function rapidly decreases and gradually stabilizes at zero, this typically indicates that the model has learned the intrinsic patterns of the training data, the parameters have been optimized, and the training process is nearing saturation.

During the parameter optimization process, the Adam optimizer is selected due to its adaptive learning rate feature, which dynamically adjusts the learning rate based on the first and second moment estimates of the parameter gradients. This capability accelerates model convergence and demonstrates superior optimization performance when dealing with complex datasets, ensuring the efficient updates of model parameters. For the training process, we utilized the Adam optimizer with a learning rate of 0.005. The entire training process consisted of five epochs, comprising 14,615 iterations. Such settings ensured sufficient exposure to data and facilitated profound learning to accurately capture and predict intricate dynamics within the deformation time series. Additionally, a dropout rate of 0.2 was set during training to mitigate overfitting risks and enhance the model’s generalization ability. This technique enhances the model’s ability to adapt to unseen data by randomly dropping a portion of the neuron outputs, forcing the model to learn more generalized features. To ensure the stability and generalization performance of the model, multiple cross-validations were conducted to ensure the stability and generalization performance of the model. In each iteration, the model assimilated spatiotemporal patterns of the deformation sequence based on the data within the input window to improve the accuracy of surface deformation prediction. Detailed information regarding the key parameters utilized in the model training process is provided in Table 2.

4.2.2. Model Performance Testing

To test the performance of the DACLnet, we used the same training scheme to train the existing LSTM and CNN-LSTM models. Evaluation metrics, including the MAE (mean absolute error), RMSE (root mean square error), and MAPE (mean absolute percentage error), were utilized to compare and analyze the prediction results of these three models. The findings demonstrated that the DACLnet model exhibited outstanding performance in predicting nonlinear surface deformation, particularly in capturing complex spatiotemporal patterns and long-term dependencies, as depicted by the black rectangular boxes in Figure 8. The DACLnet more accurately simulated the local fluctuation features of complex time series compared to the pure LSTM and CNN-LSTM models.

The results in Table 3 present the MAE, RMSE, and MAPE for each model’s predicted outcomes compared to the actual observations. The LSTM model exhibited a moderate level of prediction accuracy, with MAE, RMSE, and MAPE values of 0.0197, 0.0369, and 0.6103, respectively. It displayed higher error rates than the other models. The CNN-LSTM model effectively captured the spatiotemporal features of the time series data by incorporating convolutional layers, resulting in improved performance with MAE, RMSE, and MAPE values of 0.0164, 0.0175, and 0.1345, respectively. The proposed DACLnet model had MAE, RMSE, and MAPE values of 0.0015, 0.0055, and 0.0750, respectively. Compared to the LSTM and CNN-LSTM models, the DACLnet model exhibited a significant reduction in MAE of 92.39% and 90.85%, RMSE of 85.09% and 68.57%, and MAPE of 87.71% and 44.24%, respectively. By introducing a dual-attention mechanism, the DACLnet effectively enhanced the focus on critical temporal features, thereby improving the accuracy of predicting nonlinear deformations.

4.2.3. Prediction Result

After confirming the prediction reliability of the DACLnet model, we employed it to forecast nonlinear surface deformations in the future. Figure 9 illustrates the projected surface deformations for the upcoming two months utilizing the DACLnet. A comparison with actual observed data indicates that the discrepancy between the predicted outcome for the next two months and the actual observation values remains within a controlled margin of 0.5 mm. This demonstrates a high degree of agreement between the nonlinear deformation predicted with the DACLnet model and the actual observations.

A further analysis of Table 4, which details the DACLnet model’s prediction errors across intervals of 12, 24, 36, 48, and 60 days using metrics such as the MAE, RMSE, and MAPE, indicates that despite a gradual increase in error values over extended periods, the highest MAPE remains notably low at 8.23% at 60 days. The sustained accuracy within acceptable error thresholds underscores the robustness of the DACLnet model for long-term forecasting in infrastructure monitoring and geological assessments. Moreover, this consistent prediction accuracy affirms the model’s practical utility in real-world scenarios.

Figure 10 shows the surface time series deformation rate in the main deformation area of the Turpan Basin, monitored with InSAR and predicted with the DACLnet using IDW (inverse distance weighting) interpolation. The predicted values are highly similar to the actual values in spatial distribution. By analyzing historical data, the DACLnet model can capture the nonlinear characteristics of surface deformation and provide reliable predictions for future deformation trends.

The experimental results demonstrate that the DACLnet model can accurately predict the development trends of nonlinear deformation in the temporal domain and achieve excellent prediction outcomes in the spatial domain, thereby showcasing the robust applicability and accuracy of the DACLnet for predicting nonlinear surface deformation tasks.

4.2.4. Reliability Evaluation of the DACLnet Results

We calculated the observed and predicted deformation rates and then conducted statistical and correlation analyses between the observed deformation rates per subsidence point and the DACLnet prediction results, as shown in Figure 11. The monitoring and prediction results exhibit a strong correlation. The linear regression equation is

y = 0.98 x + 0.18

, with an RMSE of 1.01 mm/yr and a correlation coefficient of R² 0.99. Most point discrepancies are less than three times the RMSE (between the black dashed lines in Figure 9). This indicates a high degree of fit between the model and the data, demonstrating robust predictions’ reliability.

5. Discussion

5.1. Analysis of the Temporal Variability in the Correlation between Observed and DACLnet-Simulated Deformations

In Figure 9, we observe variations in the correlation between actual deformations and DACLnet-simulated deformations across different time spans. For the data presented in Figure 8, we employed a differencing method to test for stationarity. We found that data prior to 28 March 2019 did not exhibit stationarity at a 5% significance level, whereas data afterward showed good stationarity. This indicates that the statistical characteristics of the data vary significantly over different periods, which may be one of the main reasons for the changes in the correlation between the simulated and observed data. We believe that the DACLnet model exhibits certain limitations when handling strongly non-stationary data, particularly under rapidly changing environmental conditions, where the nonlinear and non-stationary characteristics of the data can impact the model’s simulation performance.

Furthermore, the seasonal and cyclical characteristics of surface deformations also significantly affect the model’s predictive performance. Surface deformations often show distinct seasonal patterns due to cyclical changes in natural factors, such as precipitation and temperature. Although the DACLnet model captures seasonal features in time series well through its dual-attention mechanism, the predictive accuracy still fluctuates during transitional seasons due to potential limitations in the training data. This limits the model’s performance during specific periods.

5.2. Model Performance and Data Sparsity Issues

In our study, by analyzing the correlation between actual satellite measurement data and predicted position data, we confirmed the reliability and accuracy of the DACLnet model in the short-term prediction of ground subsidence; however, in the accuracy analysis of surface deformation prediction results (see Figure 11), there is a certain deviation between the DACLnet prediction results and the actual observed values for the annual subsidence rates near −150 mm/yr. This discrepancy primarily arises from the limited number of relevant points assessed (see Figure 6), resulting in insufficient deformation data within the dataset, which contains evenly spaced samples. Consequently, during training, the model fails to thoroughly learn the deformation patterns under these exceptional circumstances. In addition, the DACLnet may lack sufficient generalization ability to handle atypical or extreme surface subsidence phenomena. In future research, we will strive to improve the dataset’s balance and completeness to enhance the model’s adaptability and prediction accuracy for complex surface deformation events.

5.3. Long-Term Applicability of the Model and Future Directions for Optimization

Although the DACLnet model, which integrates deep learning and InSAR technology, has shown promising practicability and effectiveness in predicting nonlinear surface deformation, further exploration is needed to assess its applicability in long-term prediction and complex geographical conditions. The extension of the prediction time window may introduce error accumulation that could potentially impact the accuracy of long-term predictions. Additionally, the DACLnet lacks an established internal connection between surface nonlinear deformation and local geological hydrological conditions. This may limit its predictive accuracy under longer time scales and more complex environmental conditions; therefore, the model currently serves as an effective alternative tool for short-term ground subsidence prediction in areas lacking hydrogeological data. In the future, our research will focus on optimizing the model structure and incorporating additional environmental factors to improve the accuracy and robustness of the DACLnet in nonlinear surface deformation prediction.

6. Conclusions

This study proposes a novel DACLnet model to address the challenge of predicting nonlinear surface deformation. By leveraging the spatial feature extraction capability of a CNN and the time series learning advantage of LSTM, along with an intensive focus on crucial information provided by an attention mechanism, it enables an in-depth understanding and dynamic capture of nonlinear deformation features. Through the practical prediction of nonlinear surface deformation in the Turpan Basin, the DACLnet model performs well and accurately in processing complex spatiotemporal series data. Compared with the LSTM and CNN-LSTM models, the DACLnet achieves respective improvements of 85.09% and 68.57% in prediction accuracy for nonlinear deformation (measured using the RMSE of the test set). Moreover, through small-sample learning and training, the DACLnet achieves the high-precision and short-term prediction of large-scale and nonlinear surface deformations, providing a reliable and efficient tool for early dynamic geological disaster warnings, which are crucial for disaster prevention and reduction.

Author Contributions

J.L. (Junyu Lu): Conceptualization, Methodology, Formal Analysis, and Writing—Original Draft Preparation. Y.W. (Yuedong Wang): Conceptualization, Resources, Supervision, Project administration, and Writing—Review and Editing. Y.Z.: Methodology, Conceptualization, Investigation, and Visualization. J.L. (Jingtao Liu): Data Curation and Investigation. Y.X.: Visualization and Investigation. H.Y.: Conceptualization, Formal Analysis, and Supervision. Y.W. (Yuebin Wang): Methodology, Formal Analysis, and Supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This work is financially supported by the China University of Geosciences (Beijing) University Student Innovation and Entrepreneurship Training Program, and the project title is ‘Time-series InSAR Surface Deformation Prediction Based on Deep Learning’; in part by the Fundamental Research Funds for the Central Universities under Grant (2652023069); and in part by the National Natural Science Foundation of China (42174026).

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.

Acknowledgments

The authors thank the European Space Agency (ESA) for providing free and open Sentinel-1 data.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

Abbreviation	Full Name
CNN	Convolutional Neural Network
LSTM	Long Short-Term Memory Network
DACLnet	CNN-LSTM Model Embedded within Dual-Attention Mechanisms
InSAR	Interferometric Synthetic Aperture Radar
SAR	Synthetic Aperture Radar
RNN	Recurrent Neural Network
MLP	Multilayer Perceptron
SPIA	Shanghai Pudong International Airport
IPTA	Interferometric Point Target Analysis
DEM	Digital Elevation Model
SLC	Single-Look Complex
DInSAR	Differential Interferometric Synthetic Aperture Radar
EDAD	Elevation-dependent Atmospheric Delay
SVD	Singular Value Decomposition
SRTM	Shuttle Radar Topography Mission
MCF	Minimum Cost Flow
MAE	Mean Absolute Error
RMSE	Root Mean Square Error
MAPE	Mean Absolute Percentage Error
R²	Coefficient of Determination

References

Scigala, R.; Szafulera, K. Linear discontinuous deformations created on the surface as an effect of underground mining and local geological conditions-case study. Bull. Eng. Geol. Environ. 2020, 79, 2059–2068. [Google Scholar] [CrossRef]
Chen, G.; Yang, J.; Liu, Y.; Kitahara, T.; Beer, M. An energy-frequency parameter for earthquake ground motion intensity measure. Earthq. Eng. Struct. Dyn. 2023, 52, 271–284. [Google Scholar] [CrossRef]
Chen, G.; Li, Q.; Li, D.; Wu, Z.; Liu, Y. Main frequency band of blast vibration signal based on wavelet packet transform. Appl. Math. Model. 2019, 74, 569–585. [Google Scholar] [CrossRef]
Diao, X.; Wu, K.; Chen, R.; Yang, J. Identifying the Cause of Abnormal Building Damage in Mining Subsidence Areas Using InSAR Technology. IEEE Access 2019, 7, 172296–172304. [Google Scholar] [CrossRef]
Zhang, W. Geological disaster monitoring and early warning system based on big data analysis. Arab. J. Geosci. 2020, 13, 946. [Google Scholar] [CrossRef]
Massonnet, D.; Rossi, M.; Carmona, C.; Adragna, F.; Peltzer, G.; Feigl, K.; Rabaute, T. The displacement field of the landers earthquake mapped by radar interferometry. Nature 1993, 364, 138–142. [Google Scholar] [CrossRef]
Hu, J.; Liu, J.; Li, Z.; Zhu, J.; Wu, L.; Sun, Q.; Wu, W. Estimating three-dimensional coseismic deformations with the SM-VCE method based on heterogeneous SAR observations: Selection of homogeneous points and analysis of observation combinations. Remote Sens. Environ. 2021, 255, 112298. [Google Scholar] [CrossRef]
Wu, Z.; Zhao, L.; Liu, L.; Zhu, R.; Gao, Z.; Qiao, Y.; Tian, L.; Zhou, H.; Xie, M. Surface-deformation monitoring in the permafrost regions over the Tibetan Plateau, using Sentinel-1 data. Sci. Cold Arid Reg. 2018, 10, 114–125. [Google Scholar]
Xu, W.; Ruch, J.; Jónsson, S. Birth of two volcanic islands in the southern Red Sea. Nat. Commun. 2015, 6, 7104. [Google Scholar] [CrossRef]
Babu, A.; Kumar, S. SBAS interferometric analysis for volcanic eruption of Hawaii island. J. Volcanol. Geotherm. Res. 2019, 370, 31–50. [Google Scholar] [CrossRef]
Xu, W.; Xie, L.; Aoki, Y.; Rivalta, E.; Jónsson, S. Volcano-Wide Deformation After the 2017 Erta Ale Dike Intrusion, Ethiopia, Observed with Radar Interferometry. J. Geophys. Res.-Solid Earth 2020, 125, e2020JB019562. [Google Scholar] [CrossRef]
Dong, J.; Zhang, L.; Tang, M.; Liao, M.; Xu, Q.; Gong, J.; Ao, M. Mapping landslide surface displacements with time series SAR interferometry by combining persistent and distributed scatterers: A case study of Jiaju landslide in Danba, China. Remote Sens. Environ. 2018, 205, 180–198. [Google Scholar] [CrossRef]
Song, C.; Yu, C.; Li, Z.; Utili, S.; Frattini, P.; Crosta, G.; Peng, J. Triggering and recovery of earthquake accelerated landslides in Central Italy revealed by satellite radar observations. Nat. Commun. 2022, 13, 7278. [Google Scholar] [CrossRef]
Xiong, Z.; Zhang, M.; Ma, J.; Xing, G.; Feng, G.; An, Q. InSAR-based landslide detection method with the assistance of C-index. Landslides 2023, 20, 2709–2723. [Google Scholar] [CrossRef]
Ma, P.; Lin, H.; Wang, W.; Yu, H.; Chen, F.; Jiang, L.; Zhou, L.; Zhang, Z.; Shi, G.; Wang, J. Toward Fine Surveillance: A Review of Multitemporal Interferometric Synthetic Aperture Radar for Infrastructure Health Monitoring. IEEE Geosci. Remote Sens. Mag. 2022, 10, 207–230. [Google Scholar] [CrossRef]
Ma, P.; Wang, W.; Zhang, B.; Wang, J.; Shi, G.; Huang, G.; Chen, F.; Jiang, L.; Lin, H. Remotely sensing large- and small-scale ground subsidence: A case study of the Guangdong-Hong Kong-Macao Greater Bay Area of China. Remote Sens. Environ. 2019, 232, 111282. [Google Scholar] [CrossRef]
Liu, P.; Chen, X.; Li, Z.; Zhang, Z.; Xu, J.; Feng, W.; Wang, C.; Hu, Z.; Tu, W.; Li, H. Resolving Surface Displacements in Shenzhen of China from Time Series InSAR. Remote Sens. 2018, 10, 1162. [Google Scholar] [CrossRef]
Yang, Z.; Li, Z.; Zhu, J.; Wang, Y.; Wu, L. Use of SAR/InSAR in Mining Deformation Monitoring, Parameter Inversion, and Forward Predictions: A Review. IEEE Geosci. Remote Sens. Mag. 2020, 8, 71–90. [Google Scholar] [CrossRef]
Wang, Y.; Yang, Z.; Li, Z.; Zhu, J.; Wu, L. Fusing adjacent-track InSAR datasets to densify the temporal resolution of time-series 3-D displacement estimation over mining areas with a prior deformation model and a generalized weighting least-squares method. J. Geod. 2020, 94, 47. [Google Scholar] [CrossRef]
Yang, H.; Jiang, Q.; Han, J.; Kang, K.; Peng, J. InSAR measurements of surface deformation over permafrost on Fenghuoshan Mountains section, Qinghai-Tibet Plateau. J. Syst. Eng. Electron. 2021, 32, 1284–1303. [Google Scholar] [CrossRef]
Zhao, R.; Li, Z.; Feng, G.; Wang, Q.; Hu, J. Monitoring surface deformation over permafrost with an improved SBAS-InSAR algorithm: With emphasis on climatic factors modeling. Remote Sens. Environ. 2016, 184, 276–287. [Google Scholar] [CrossRef]
Liao, M.; Balz, T.; Rocca, F.; Li, D. Paradigm Changes in Surface-Motion Estimation From SAR: Lessons From 16 Years of Sino-European Cooperation in the Dragon Program. IEEE Geosci. Remote Sens. Mag. 2020, 8, 8–21. [Google Scholar] [CrossRef]
Xue, F.; Lv, X.; Dou, F.; Yun, Y. A Review of Time-Series Interferometric SAR Techniques: A Tutorial for Surface Deformation Analysis. IEEE Geosci. Remote Sens. Mag. 2020, 8, 22–42. [Google Scholar] [CrossRef]
Even, M.; Schulz, K. InSAR Deformation Analysis with Distributed Scatterers: A Review Complemented by New Advances. Remote Sens. 2018, 10, 744. [Google Scholar] [CrossRef]
Yu, H.; Lan, Y.; Yuan, Z.; Xu, J.; Lee, H. Phase Unwrapping in InSAR A review. IEEE Geosci. Remote Sens. Mag. 2019, 7, 40–58. [Google Scholar] [CrossRef]
Passalis, N.; Tefas, A.; Kanniainen, J.; Gabbouj, M.; Iosifidis, A. Deep Adaptive Input Normalization for Time Series Forecasting. IEEE Trans. Neural Netw. Learn. Syst. 2020, 31, 3760–3765. [Google Scholar] [CrossRef]
Hu, X.; Burgmann, R.; Xu, X.; Fielding, E.; Liu, Z. Machine-Learning Characterization of Tectonic, Hydrological and Anthropogenic Sources of Active Ground Deformation in California. J. Geophys. Res.-Solid Earth 2021, 126, e2021JB022373. [Google Scholar] [CrossRef]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
Anantrasirichai, N.; Biggs, J.; Albino, F.; Bull, D. The Application of Convolutional Neural Networks to Detect Slow, Sustained Deformation in InSAR Time Series. Geophys. Res. Lett. 2019, 46, 11850–11858. [Google Scholar] [CrossRef]
Valade, S.; Ley, A.; Massimetti, F.; D’Hondt, O.; Laiolo, M.; Coppola, D.; Loibl, D.; Hellwich, O.; Walter, T. Towards Global Volcano Monitoring Using Multisensor Sentinel Missions and Artificial Intelligence: The MOUNTS Monitoring System. Remote Sens. 2019, 11, 1528. [Google Scholar] [CrossRef]
Prabhakar, K.R.; Nukala, V.H.; Nayak, M.; Gubbi, J.; Purushothaman, B. Multi-scale Attention Guided Recurrent Neural Network for Deformation Map Forecasting. In Proceedings of the Image and Signal Processing for Remote Sensing XXVII, Online, 13–17 September 2021; p. 11862. [Google Scholar] [CrossRef]
Yazbeck, J.; Rundle, J.B. Predicting Short-Term Deformation in the Central Valley Using Machine Learning. Remote Sens. 2023, 15, 449. [Google Scholar] [CrossRef]
Liu, Q.; Zhang, Y.; Wei, J.; Wu, H.; Deng, M. HLSTM: Heterogeneous Long Short-Term Memory Network for Large-Scale InSAR Ground Subsidence Prediction. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 8679–8688. [Google Scholar] [CrossRef]
Chen, Y.; He, Y.; Zhang, L.; Chen, Y.; Pu, H.; Chen, B.; Gao, L. Prediction of InSAR deformation time-series using a long short-term memory neural network. Int. J. Remote Sens. 2021, 42, 6921–6944. [Google Scholar] [CrossRef]
Bao, X.; Zhang, R.; Shama, A.; Li, S.; Xie, L.; Lv, J.; Fu, Y.; Wu, R.; Liu, G. Ground Deformation Pattern Analysis and Evolution Prediction of Shanghai Pudong International Airport Based on PSI Long Time Series Observations. Remote Sens. 2022, 14, 610. [Google Scholar] [CrossRef]
Ali, O.; Saif-ur-Rehman, M.; Glasmachers, T.; Iossifidis, I.; Klaes, C. ConTraNet: A hybrid network for improving the classification of EEG and EMG signals with limited training data. Comput. Biol. Med. 2024, 168, 107649. [Google Scholar] [CrossRef]
Wisdom, S.; Powers, T.; Hershey, J.R.; Le Roux, J.; Atlas, L. Full-Capacity Unitary Recurrent Neural Networks. In Proceedings of the Advances in Neural Information Processing Systems 29 (NIPS 2016), Barcelona, Spain, 5–10 December 2016; p. 29. [Google Scholar]
Arjovsky, M.; Shah, A.; Bengio, Y. Unitary Evolution Recurrent Neural Networks. In International Conference on Machine Learning; PMLR: London, UK, 2016; p. 48. [Google Scholar]
Kang, J.; Zhang, W.-Q.; Liu, W.-W.; Liu, J.; Johnson, M.T. Advanced recurrent network-based hybrid acoustic models for low resource speech recognition. EURASIP J. Audio Speech Music. Process. 2018, 6, 1–15. [Google Scholar] [CrossRef]
Kang, J.; Zhang, W.-Q.; Liu, J. Gated Recurrent Units Based Hybrid Acoustic Models for Robust Speech Recognition. In Proceedings of the 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP), Tianjin, China, 17–20 October 2016. [Google Scholar]
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. In Proceedings of the Advances in Neural Information Processing Systems 30 (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017; p. 30. [Google Scholar]
Xiong, Z.; Feng, G.; Feng, Z.; Miao, L.; Wang, Y.; Yang, D.; Luo, S. Pre- and post-failure spatial-temporal deformation pattern of the Baige landslide retrieved from multiple radar and optical satellite images. Eng. Geol. 2020, 279, 105880. [Google Scholar] [CrossRef]
Wang, Y.; Feng, G.; Li, Z.; Xu, W.; Zhu, J.; He, L.; Xiong, Z.; Qiao, X. Retrieving the displacements of the Hutubi (China) underground gas storage during 2003-2020 from multi-track InSAR. Remote Sens. Environ. 2022, 268, 112768. [Google Scholar] [CrossRef]
Werner, C.; Wegmüller, U.; Strozzi, T.; Wiesmann, A. Interferometric point target analysis for deformation mapping. In Proceedings of the IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No. 03CH37477), Toulouse, France, 21–25 July 2003; pp. 4362–4364. [Google Scholar]
Ferretti, A.; Prati, C.; Rocca, F. Permanent scatterers in SAR interferometry. IEEE Trans. Geosci. Remote Sens. 2001, 39, 8–20. [Google Scholar] [CrossRef]
Dong, J.; Zhang, L.; Liao, M.; Gong, J. Improved correction of seasonal tropospheric delay in InSAR observations for landslide deformation monitoring. Remote Sens. Environ. 2019, 233, 111370. [Google Scholar] [CrossRef]
Ghiasi-Shirazi, K. Generalizing the Convolution Operator in Convolutional Neural Networks. Neural Process. Lett. 2019, 50, 2627–2646. [Google Scholar] [CrossRef]
Zhou, Q.; Zhou, C.; Wang, X. Stock prediction based on bidirectional gated recurrent unit with convolutional neural network and feature selection. PLoS ONE 2022, 17, e0262501. [Google Scholar] [CrossRef]
Chang, Z.; Zhang, Y.; Chen, W. Effective Adam-Optimized LSTM Neural Network for Electricity Price Forecasting. In Proceedings of the 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 23–25 November 2018; Li, W., Babu, M., Eds.; IEEE: Piscataway, NJ, USA, 2018; pp. 245–248. [Google Scholar]
Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A Simple Way to Prevent Neural Networks from Overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar]
Zhang, M.; Philp, P. Geochemical characterization of aromatic hydrocarbons in crude oils from the Tarim, Qaidam and Turpan Basins, NW China. Pet. Sci. 2010, 7, 448–457. [Google Scholar] [CrossRef]
Yan, N.; Wu, B.; Zhu, W. Assessment of Agricultural Water Productivity in Arid China. Water 2020, 12, 1161. [Google Scholar] [CrossRef]
Pepe, A.; Lanari, R. On the extension of the minimum cost flow algorithm for phase unwrapping of multitemporal differential SAR interferograms. IEEE Trans. Geosci. Remote Sens. 2006, 44, 2374–2383. [Google Scholar] [CrossRef]
Lee, J.; Hoppel, K.; Mango, S.; Miller, A. Intensity and phase statistics of multilook polarimetric and interferometric sar imagery. IEEE Trans. Geosci. Remote Sens. 1994, 32, 1017–1028. [Google Scholar] [CrossRef]
Mestre-Quereda, A.; Lopez-Sanchez, J.; Ballester-Berman, J.; Gonzalez, P.; Hooper, A.; Wright, T. Evaluation of the Multilook Size in Polarimetric Optimization of Differential SAR Interferograms. IEEE Geosci. Remote Sens. Lett. 2018, 15, 1407–1411. [Google Scholar] [CrossRef]
Farr, T.; Rosen, P.; Caro, E.; Crippen, R.; Duren, R.; Hensley, S.; Kobrick, M.; Paller, M.; Rodriguez, E.; Roth, L.; et al. The shuttle radar topography mission. Rev. Geophys. 2007, 45, RG2004. [Google Scholar] [CrossRef]
Wang, Y.; Feng, G.; Li, Z.; Luo, S.; Wang, H.; Xiong, Z.; Zhu, J.; Hu, J. A Strategy for Variable-Scale InSAR Deformation Monitoring in a Wide Area: A Case Study in the Turpan-Hami Basin, China. Remote Sens. 2022, 14, 3832. [Google Scholar] [CrossRef]
Grenerczy, G.; Wegmüller, U. Deformation analysis of a burst red mud reservoir using combined descending and ascending pass ENVISAT ASAR data. Nat. Hazards 2013, 65, 2205–2214. [Google Scholar] [CrossRef]

Figure 1. Flowchart of the improved IPTA method. The full names of abbreviations such as SLC, DEM, EDAD, and SVD can be found in Abbreviations Section.

Figure 2. Structure diagram of the DACLnet model.

Figure 3. The Turpan Basin and SAR data coverage.

Figure 4. Spatiotemporal baseline maps, where (a) shows the spatiotemporal baseline map for AT41F135 and (b) shows the spatiotemporal baseline map for DT121F449.

Figure 5. Surface subsidence velocity maps for the Turpan–Hami Basin: (a) AT orbit; (b) DT orbit. Panels (c,d) depict the temporal deformations of feature points P1 and P2, monitored using the AT and DT orbit datasets.

Figure 6. (a,b) Distribution statistics and (c) correlation between the subsidence rate results on AT143F135 and DT121F449. The black dashed line represents three times the RMSE.

Figure 7. Loss decrease chart during model training.

Figure 8. A comparison of sedimentation simulation results between the DACLnet and LSTM as well as CNN-LSTM models under the same training strategy.

Figure 9. Prediction of nonlinear deformations from historical deformation sequences from 25 March 2015 to 27 February 2020, spanning from 27 February 2020 to 27 April 2020.

Figure 10. A spatial variation map of surface deformation velocity in the Turpan Basin. (a) Actual deformation results processed with InSAR from the ascending track (AT41F135) of Sentinel-1. (b) The InSAR deformation results predicted using the DACLnet model.

Figure 11. A comparison between the InSAR deformation velocity observations and DACLnet prediction results, derived from a comprehensive dataset of 574,662 ascending track observations and predictions. The black dashed line indicates a range three times greater than the RMSE.

Table 1. The image parameters of the study areas.

Frame	Heading	Incidence	Pixel Spacing (Rg × Az)	Time	Number
AT41F135	−9.21°	33.65°	2.33 × 13.95 m	25/03/2015–27/04/2020	122
DT121F449	−170.36°	33.57°	2.33 × 13.95 m	19/03/2015–27/04/2020	107

Table 2. Parameter information for the DACLnet model configuration.

Parameter	Configuration
Optimizer	Adam
Dropout	0.2
Learning rate	0.005
Training epochs	5
Training iterations	14,615
Input window length	30
Output window length	1
Number of attention heads	8

Table 3. Accuracy evaluation results of LSTM, CNN-LSTM, and the DACLnet.

Model	MAE	RMSE	MAPE
LSTM	0.0197	0.0369	0.6103
CNN-LSTM	0.0164	0.0175	0.1345
DACLnet	0.0015	0.0055	0.0750
vs. LSTM	92.39%	85.09%	87.71%
vs. CNN-LSTM	90.85%	68.57%	44.24%

Table 4. A comparison of prediction errors across different time periods (the data in this table come from the corresponding time points and predicted data of 574,662 deformation time series).

Period	MAE	RMSE	MAPE
12 days	0.0045	0.0068	0.0146
24 days	0.0108	0.0161	0.0318
36 days	0.0151	0.0220	0.0426
48 days	0.0215	0.0306	0.0592
60 days	0.0303	0.0430	0.0823

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lu, J.; Wang, Y.; Zhu, Y.; Liu, J.; Xu, Y.; Yang, H.; Wang, Y. DACLnet: A Dual-Attention-Mechanism CNN-LSTM Network for the Accurate Prediction of Nonlinear InSAR Deformation. Remote Sens. 2024, 16, 2474. https://doi.org/10.3390/rs16132474

AMA Style

Lu J, Wang Y, Zhu Y, Liu J, Xu Y, Yang H, Wang Y. DACLnet: A Dual-Attention-Mechanism CNN-LSTM Network for the Accurate Prediction of Nonlinear InSAR Deformation. Remote Sensing. 2024; 16(13):2474. https://doi.org/10.3390/rs16132474

Chicago/Turabian Style

Lu, Junyu, Yuedong Wang, Yafei Zhu, Jingtao Liu, Yang Xu, Honglei Yang, and Yuebin Wang. 2024. "DACLnet: A Dual-Attention-Mechanism CNN-LSTM Network for the Accurate Prediction of Nonlinear InSAR Deformation" Remote Sensing 16, no. 13: 2474. https://doi.org/10.3390/rs16132474

APA Style

Lu, J., Wang, Y., Zhu, Y., Liu, J., Xu, Y., Yang, H., & Wang, Y. (2024). DACLnet: A Dual-Attention-Mechanism CNN-LSTM Network for the Accurate Prediction of Nonlinear InSAR Deformation. Remote Sensing, 16(13), 2474. https://doi.org/10.3390/rs16132474

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

DACLnet: A Dual-Attention-Mechanism CNN-LSTM Network for the Accurate Prediction of Nonlinear InSAR Deformation

Abstract

1. Introduction

2. Methodology

2.1. Deformation Signals Monitored Using Advanced IPTA-InSAR

2.2. CNN-LSTM Model Embedded within Dual-Attention Mechanisms

2.3. Network Training

3. Study Area and Data Processing

3.1. The Study Area

3.2. InSAR Datasets

3.3. Data Processing

4. Results

4.1. Monitoring Results

4.1.1. Deformation in the Turpan Basin

4.1.2. Reliability Evaluation of InSAR Results

4.2. DACLnet Results

4.2.1. Network Training Results

4.2.2. Model Performance Testing

4.2.3. Prediction Result

4.2.4. Reliability Evaluation of the DACLnet Results

5. Discussion

5.1. Analysis of the Temporal Variability in the Correlation between Observed and DACLnet-Simulated Deformations

5.2. Model Performance and Data Sparsity Issues

5.3. Long-Term Applicability of the Model and Future Directions for Optimization

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI