A Multicomponent Linear Frequency Modulation Signal-Separation Network for Multi-Moving-Target Imaging in the SAR-Ground-Moving-Target Indication System

Ding, Chang; Mu, Huilin; Zhang, Yun

doi:10.3390/rs16040605

Open AccessArticle

A Multicomponent Linear Frequency Modulation Signal-Separation Network for Multi-Moving-Target Imaging in the SAR-Ground-Moving-Target Indication System

by

Chang Ding

¹

,

Huilin Mu

^2,*

and

Yun Zhang

³

¹

Shaanxi Key Laboratory of Artificially Structured Functional Materials and Devices, Air Force Engineering University, Xi’an 710051, China

²

Air and Missile Defense College, Air Force Engineering University, Xi’an 710051, China

³

Department of Electronic Engineering, Harbin Institute of Technology, Harbin 150001, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2024, 16(4), 605; https://doi.org/10.3390/rs16040605

Submission received: 9 January 2024 / Accepted: 4 February 2024 / Published: 6 February 2024

(This article belongs to the Special Issue Exploitation of SAR Data Using Deep Learning Approaches)

Download

Browse Figures

Versions Notes

Abstract

:

Multi-moving-target imaging in a synthetic aperture radar (SAR) system poses a significant challenge owing to target defocusing and being contaminated by strong background clutter. Aiming at this problem, a new deep-convolutional-neural-network (CNN)-assisted method is proposed for multi-moving-target imaging in a SAR-GMTI system. The multi-moving-target signal can be modeled by a multicomponent LFM signal with additive perturbation. A fully convolutional network named MLFMSS-Net was designed based on an encoder–decoder architecture to extract the most-energetic LFM signal component from the multicomponent LFM signal in the time domain. Without prior knowledge of the target number, an iterative signal-separation framework based on the well-trained MLFMSS-Net is proposed to separate the multi-moving-target signal into multiple LFM signal components while eliminating the residual clutter. It works well, exhibiting high imaging robustness and low dependence on the system parameters, making it a suitable solution for practical imaging applications. Consequently, a well-focused multi-moving-target image can be obtained by parameter estimation and secondary azimuth compression for each separated LFM signal component. The simulations and experiments on both airborne and spaceborne SAR data showed that the proposed method is superior to traditional imaging methods in both imaging quality and efficiency.

Keywords:

synthetic aperture radar (SAR); multi-moving-target imaging; ground-moving-target indication (GMTI); clutter suppression; multicomponent LFM signal-separation network

Graphical Abstract

1. Introduction

Ground-moving-target indication (GMTI) in a synthetic aperture radar (SAR) system has attracted much interest from wide-area traffic monitoring, as well as military surveillance activities due to its long-range, all-day, and all-weather imaging capability [1,2,3,4]. However, the SAR system is typically designed for imaging stationary scenes, so when a target is in motion, it can cause the defocusing and misplacement of the target in the SAR image. This is because the target moves during the synthetic aperture time, causing a phase shift in the received signal [5,6,7]. Especially for adjacent multiple targets with overlapping images, it brings additional difficulties to the imaging task in the SAR-GMTI system. Furthermore, the moving-target signal may be corrupted by undesirable clutter and noise, which is not conducive to target imaging.

An efficient method to solve this problem is time–frequency-representation (TFR)-based methods, such as the Wigner–Ville distribution (WVD) [8,9], Chirplet decomposition [10,11], the fractional Fourier transform [12,13], and Lv’s distribution (LVD) [14,15], where the signal of the moving target during the coherent processing interval (CPI) can be represented as a linear frequency modulation (LFM) form. The refocused image can be generated through parameter estimation, as well as azimuth filtering. However, owing to the corrupted moving-target signal due to the strong clutter and the cross-term induced by the multiple LFM signal components, the imaging performance of TFR-based methods is limited.

With the development of deep learning technology, deep neural networks have been widely utilized in the SAR moving-target-imaging task [16,17,18,19,20,21,22]. In [19], a deep CNN-based method was first explored and applied for multi-moving-target imaging in a SAR-GMTI system. A SAR moving-target-imaging method based on the improved U-net network was presented in [20]. An approach based on long short-term memory networks was proposed for tracking moving targets with consecutive SAR images. Then, the refocusing process could be optimized by utilizing the predicted trajectories [22]. However, the above imaging networks may be sensitive to radar system parameters given that they are trained on synthetic datasets under a specific SAR system, limiting the applicability of deep-learning-based methods in practical systems. The imaging performance of the previously trained networks might be compromised when handling new datasets with different system parameters. Besides, in the ship-target-imaging application, a complex-valued channel fusion U-shaped network was designed for ship target refocusing [23], and a complex-valued convolutional autoencoder based on the attention mechanism was proposed to improve the imaging of ship targets in the GEO SA-Bi SAR system [24]. A novel Omega-KA-net based on sparse optimization was proposed to realize moving-target imaging [21]. The high-quality imaging results can be obtained under down-sampling and a low signal-to-noise ratio (SNR). However, these methods mainly focus on ship target imaging and do not take into account the interference of ground clutter in complex ground scenes.

The signal separation and the residual clutter removal are crucial for accurate parameter estimation and high-quality imaging of multiple moving targets. Recently, deep learning has achieved superior performance for audio separation and chirp-signal-parameter-estimation tasks [25,26,27,28,29]. Considering the multicomponent LFM signal characteristics of the radar echo for the SAR-GMTI system, one might wonder whether this deep-learning-based technique can be applied to multicomponent LFM signal separation and can bring huge advancements for target imaging in both accuracy and efficiency. A complex-valued deep neural network was designed for the parameter estimation of chirp signals [28]. Moreover, a framework combining the fractional Fourier transform and the alternating direction method of multipliers network was reported to achieve the parameter estimation of chirp signals under sub-Nyquist sampling [29]. However, these methods are based on the assumption that the component number is a priori known. Thus, they may not perform optimally for practical imaging applications.

To solve the difficulties of the obvious residual clutter and cross-term interferences, the high sensitivity to the system parameters, as well as the unknown target number in existing multi-moving-target-imaging methods, a deep CNN-assisted method is proposed for multi-moving-target imaging in the SAR-GMTI system. The multi-moving-target-signal model was first analyzed and formulated by a multicomponent LFM signal form with additive perturbation after the basic SAR imaging processing and clutter suppression. The SAR system parameters and target motion information are implicitly embedded within the multicomponent LFM signal parameters, which means that the signal model is capable of comprehensively representing multi-moving-target signals under different SAR systems. Given the unknown target number, an iterative signal-separation framework based on a deep CNN is proposed. In this framework, a network named MLFMSS-Net was designed to extract the most-energetic LFM signal component from the multicomponent LFM signal and iteratively applied multiple times until all the LFM signal component separation and residual clutter suppression were achieved. The network MLFMSS-Net was designed based on a convolutional encoder–decoder architecture and trained on the dataset with various SAR system parameters, target information, and clutter types, which was generated by the multicomponent LFM-signal model. This allows the network to exhibit strong robustness and makes it a suitable solution for practical imaging applications. Consequently, a well-focused multi-moving-target image can be obtained by parameter estimation and secondary azimuth compression for each separated LFM signal component. The signal separation performance of MLFMSS-Net was explored by a simulated multicomponent LFM signal. Simulations and experiments on both airborne and spaceborne SAR data were further performed to verify the effectiveness of the MLFMSS-Net-assisted imaging method. The experimental results showed that, compared with traditional imaging methods, the proposed method achieved high-quality and high-efficiency imaging without prior knowledge of the target number in different SAR systems.

Overall, the main contributions of this paper are presented as follows:

(1): We designed the MLFMSS-Net based on a convolutional encoder–decoder architecture to separate the most-energetic LFM signal component from the multicomponent LFM signal with additive perturbation. The network exhibited strong robustness and low dependence on the system parameters, making it more suitable for practical imaging tasks;
(2): We propose a MLFMSS-Net-assisted multi-moving-target-imaging method for the SAR-GMTI system. Given the unknown target number, an iterative signal-separation framework based on the trained MLFMSS-Net is presented to separate the multi-moving-target signal into multiple LFM signal components while eliminating the residual clutter. Both the imaging quality and efficiency of the proposed method were greatly improved.

The remainder of this paper is organized as follows. The multi-moving-target-signal model for the SAR-GMTI system is first established in Section 2. On this basis, Section 3 proposes the MLFMSS-Net-assisted multi-moving-target-imaging method. The iterative signal-separation framework based on MLFMSS-Net and the corresponding network details are provided in this section. Section 4 discusses the experimental results and performance analysis, including the results on the simulated multicomponent LFM signal and simulations and experiments on both airborne and spaceborne SAR data. Finally, the discussion and conclusion are given in Section 5 and Section 6, respectively.

2. Multi-Moving-Target-Signal Model for the SAR-Ground-Moving-Target-Indication System

The geometry of the 3D broadside SAR configuration with a ground moving target is first established in Section 2.1. Then, the multi-moving-target-signal model is theoretically derived and analyzed in Section 2.2.

2.1. Geometric Configuration

As depicted in Figure 1, the dual-channel SAR-GMTI system is installed on an airborne or spaceborne platform that moves along a predetermined flight track on the X-axis. The platform flies with a constant velocity V while maintaining a height of

z = H

in the 3D Cartesian coordinate system OXYZ. In the SAR-GMTI system, a phased-array antenna transmitting the LFM signal is positioned along the movement direction of the platform. The antenna is equally divided into two parts to receive radar echoes. According to the equivalent principle of the antenna phase center (APC) [30], the bistatic transmitter–receiver can be treated as a monostatic channel. The equivalent APC of the ith monostatic channel can be represented by

q_{i} (t_{m}) = [V t_{m} - (i - 1) d] \vec{x} + H \vec{z}, i = 1, 2

, where

t_{m}

is the slow time.

\vec{x}

and

\vec{z}

represent the unit vectors of the X-axis and Z-axis, respectively. The spacing between

q_{1} (t_{m})

and

q_{2} (t_{m})

is denoted as d.

A ground target moves on the ground plane (

z = 0

) with the constant acceleration within the CPI. The position vector of the moving target can be expressed as

p (t_{m}) = (x_{0} + v_{x} t_{m} + \frac{1}{2} a_{x} t_{m}^{2}) \vec{x} + (y_{0} + v_{y} t_{m} + \frac{1}{2} a_{y} t_{m}^{2}) \vec{y}

(1)

where

\vec{y}

represents the unit vector of the Y-axis. At

t_{m} = 0

, the target is located at

x_{0}

and

y_{0}

, which correspond to the along- and across-track ground coordinates, respectively. The along- and across-track ground velocities of the target are represented as

v_{x}

and

v_{y}

, whereas the corresponding accelerations are denoted by

a_{x}

and

a_{y}

, respectively. According to the SAR-GMTI acquisition geometry, the radial velocity and acceleration can be, respectively, expressed as

v_{r} = v_{y} s i n ϕ

and

a_{r} = a_{y} s i n ϕ

, where

ϕ

denotes the incident angle. The instantaneous slant range from the ith channel to the moving target can be given by

R_{i} (t_{m}) = ∥R_{i} (t_{m})∥ = ∥p (t_{m}) - q_{i} (t_{m})∥, i = 1, 2 .

(2)

The second-order Taylor series expansion of

R_{1} (t_{m})

can be expressed by

\begin{array}{l} R_{1} (t_{m}) & = \sqrt{{(x_{0} + (v_{x} - V) t_{m} + \frac{1}{2} a_{x} t_{m}^{2})}^{2} + {(y_{0} + v_{y} t_{m} + \frac{1}{2} a_{y} t_{m}^{2})}^{2} + H^{2}} \\ \approx R_{B} + v_{r} (t_{m} - t_{a c}) + \frac{{(V - v_{x})}^{2} + a_{r} R_{B}}{2 R_{B}} {(t_{m} - t_{a c})}^{2} \end{array}

(3)

where

t_{a c}

is the broadside time and

R_{B} = R_{1} (t_{a c})

. For the dual-channel SAR system where two channels move along the same trajectory, it is observed that the second channel lags behind the first channel by a time delay

Δ t = d / V

. According to the literature [31], the difference between

R_{2} (t_{m} + Δ t)

and

R_{1} (t_{m})

is mainly influenced by the radial velocity

v_{r}

. Therefore,

R_{2} (t_{m} + Δ t)

can be reasonably approximated as

R_{2} (t_{m} + Δ t) \approx R_{1} (t_{m}) + v_{r} Δ t

.

2.2. Signal Model

Assume that an LFM signal with the carrier frequency

f_{0}

is transmitted as follows:

s (\hat{t}) = rect (\frac{\hat{t}}{T_{p}}) exp [j 2 π (f_{0} \hat{t} + \frac{1}{2} μ {\hat{t}}^{2})]

(4)

where

\hat{t}

is the fast time.

T_{p}

and

μ

denote the pulse duration and the frequency modulation rate, respectively. The received signal of the moving target for the ith channel after demodulation can be represented by

s_{i t} (\hat{t}, t_{m}) = A_{t} rect (\frac{t_{m}}{T_{a}}) rect (\frac{\hat{t} - 2 R_{i} (t_{m}) / c}{T_{p}}) exp [j π μ {(\hat{t} - \frac{2 R_{i} (t_{m})}{c})}^{2}] exp [- j \frac{4 π}{λ} R_{i} (t_{m})]

(5)

where

A_{t}

denotes the reflection coefficient of the moving target.

T_{a}

is the synthetic aperture time.

λ

and c represent the wavelength and the light speed, respectively. Then, the range compression matched to

μ

is implemented to obtain the moving-target signal as follows:

s_{i t} (\hat{t}, t_{m}) = A_{t} rect (\frac{t_{m}}{T_{a}}) sinc [B (\hat{t} - \frac{2 R_{i} (t_{m})}{c})] exp [- j \frac{4 π}{λ} R_{i} (t_{m})]

(6)

where

B = μ T_{p}

denotes the transmitted signal bandwidth. According to Equation (6), it can be seen that the unavoidable range cell migration is introduced by the system velocity and target motion. The Keystone transformation, an effective and widely used method, is employed to correct the arbitrary range migration without prior knowledge of the target motion information [32]. After the range cell migration correction (RCMC), the moving-target signal can be expressed as

s_{i t} (\hat{t}, t_{m}) = A_{t} rect (\frac{t_{m}}{T_{a}}) sinc [B (\hat{t} - \frac{2 R_{B}}{c})] exp [- j \frac{4 π}{λ} R_{i} (t_{m})]

(7)

It is widely acknowledged that the SAR echo consists of the desired moving-target signal and the unwanted background clutter, as well as the unavoidable noise as follows:

s_{i} (\hat{t}, t_{m}) = s_{i t} (\hat{t}, t_{m}) + s_{i c} (\hat{t}, t_{m}) + n_{i} (\hat{t}, t_{m})

(8)

where

s_{i c} (\hat{t}, t_{m})

is the strong background clutter and

n_{i} (\hat{t}, t_{m})

denotes the inevitable noise. According to Equation (8), it is a challenging task to directly extract the moving-target signal from the SAR echo. To overcome this difficulty, clutter suppression is required to reduce the clutter signal while preserving the desired moving-target signal, leading to a high signal-to-clutter plus noise ratio (SCNR). One commonly used technique for clutter suppression is the displaced phase center antenna (DPCA) method, which has been proven to be effective and simple to implement [1,33]. The DPCA technique is performed by subtracting the second channel signal obtained at the delay time

(t_{m} + Δ t)

from the first channel signal acquired at

t_{m}

, and the clutter-suppressed signal can be expressed by

\begin{matrix} \begin{matrix} s_{12} (\hat{t}, t_{m}) = & s_{1} (\hat{t}, t_{m}) - s_{2} (\hat{t}, t_{m} + Δ t) \\ = & s_{1 t} (\hat{t}, t_{m}) [1 - exp (j 2 π f_{d t} Δ t)] + c n_{12} (\hat{t}, t_{m}) \end{matrix} \end{matrix}

(9)

where

f_{d t} = - 2 v_{r} / λ

denotes the Doppler centroid frequency of the moving target.

c n_{12} (\hat{t}, t_{m})

represents the additive perturbation, which is the sum of the residual clutter and the noise. It cannot be completely eliminated owing to the channel imbalance induced by small errors in the fabrication and excitation of the antenna. Considering the unknown motion parameters of the target, azimuth compression matched to stationary target parameters is performed to obtain the SAR image as follows:

s (\hat{t}, t_{m}) = A \cdot rect (\frac{t_{m} + Δ t_{m} - t_{a c}}{Δ T_{a}}) sinc [B (\hat{t} - \frac{2 R_{B}}{c})] exp [j π γ {(t_{m} - t_{a c})}^{2}] + c n_{12} (\hat{t}, t_{m})

(10)

where

A = A_{t} [1 - exp (j 2 π f_{d t} Δ t)]

and

Δ t_{m} = f_{d t} / γ_{d t}

is the azimuth shift caused by the radial velocity of the target. As can be seen from Equation (10), the moving-target signal in the SAR image can be formulated as the form of an LFM signal, where the Doppler chirp rate is

γ = γ_{d c} γ_{d t} / (γ_{d c} - γ_{d t})

and the signal length is denoted as

Δ T_{a} = B_{a} / γ

.

B_{a}

is the Doppler bandwidth. The Doppler chirp rates of the stationary and moving targets can be, respectively, represented by

γ_{d c} = - \frac{2 V^{2}}{λ R_{B}}, γ_{d t} = - \frac{2 {(V - v_{x})}^{2} + 2 a_{r} R_{B}}{λ R_{B}}

(11)

Due to the

γ

induced by the motion parameter of the target, the moving-target image appears defocused in the azimuth direction. It is important to note that multiple moving targets may appear in the SAR imaging scenario. SAR is a linear system, which means that the multi-moving-target signal can be generated by the linear superposition of the signal contributions from each target. Therefore, the azimuth signal for each range cell can be described as a multicomponent LFM signal as follows:

\begin{matrix} \begin{matrix} s (t_{m}) = \sum_{k = 1}^{K} & s_{k} (t_{m}) \\ = \sum_{k = 1}^{K} & A^{k} rect (\frac{t_{m} + Δ t_{m}^{k} - t_{a c}^{k}}{Δ T_{a}^{k}}) exp [j π γ^{k} {(t_{m} - t_{a c}^{k})}^{2}] + c n_{12} (t_{m}) \end{matrix} \end{matrix}

(12)

where

s_{k} (t_{m})

is the LFM signal contributed by the kth moving target.

A^{k}

,

Δ t_{m}^{k}

,

t_{a c}^{k}

,

Δ T_{a}^{k}

, and

γ^{k}

denote the parameters of the kth moving target, respectively. K is the number of targets. The multicomponent LFM signal can be also modeled in the discrete domain as follows:

\begin{matrix} \begin{matrix} s (n) = \sum_{k = 1}^{K} s_{k} (n) = \sum_{k = 1}^{K} & A^{k} rect (\frac{n + Δ n^{k} - n_{a c}^{k}}{Δ N_{a}^{k}}) exp [j a^{k} {(n - n_{a c}^{k})}^{2}] + c n_{12} (n), \\ n = - N / 2, - N / 2 + 1, \dots, N / 2 - 1 \end{matrix} \end{matrix}

(13)

where

n = t_{m} PRF

,

Δ n^{k} = Δ t_{m}^{k} PRF

,

n_{a c}^{k} = t_{a c}^{k} PRF

,

Δ N_{a}^{k} = Δ T_{a}^{k} PRF

, and

a^{k} = π γ^{k} / {PRF}^{2}

.

s_{k} (n)

represents the discrete form of the kth moving-target signal.

PRF

denotes the pulse repetition frequency of the SAR system. N is the sampling number and is assumed to be even. As can be seen from Equation (13), the SAR system parameters (including

λ

, v,

PRF

, and

R_{B}

) and target motion information are implicitly embedded within the multicomponent LFM signal parameters. Therefore, signals from different SAR systems (including airborne SAR and spaceborne SAR) and moving targets can be obtained by adjusting various signal parameters. This means that the signal model provided by Equation (13) possesses the capacity to comprehensively represent multi-moving-target signals under different SAR systems.

3. MLFMSS-Net-Assisted Multi-Moving-Target Imaging

To realize the simultaneous refocusing of multiple moving targets, a MLFMSS-Net-assisted multi-moving-target-imaging method is proposed in this section. In the following, the overall scheme is presented in Section 3.1. In Section 3.2, Section 3.3 and Section 3.4, the dataset, the architecture, and the training procedure of the designed network named MLFMSS-Net will be discussed in detail, respectively.

3.1. Overall Scheme

The flowchart of the proposed MLFMSS-Net-assisted multi-moving-target-imaging method is illustrated in Figure 2. In the preprocessing procedure, basic SAR imaging processing (including demodulation, range compression, RCMC, and azimuth compression) and clutter suppression are performed on the SAR raw data. Then, a constant false alarm rate (CFAR) detector is utilized to detect and extract the defocused moving-target image, each range cell of which can be modeled as a multicomponent LFM signal form given by Equation (13). Considering that different motion parameters of targets lead to different values of

a^{k}

in Equation (13), it is difficult to directly perform the parameter estimation and the azimuth compression matched to each target parameter on

s (n)

. Therefore, the multicomponent LFM signal separation during the residual clutter elimination, i.e.,

s (n)

separation into

s_{k} (n), k = 1, \dots, K

and

c n_{12} (n)

elimination from

s (n)

, is a critical step for the multi-moving-target imaging before the parameter estimation and secondary azimuth compression. However, the unknown target numbers pose significant challenges for the signal-separation task. To address this problem, an iterative signal-separation framework based on a deep CNN is proposed. In this framework, a network named MLFMSS-Net is designed to extract the most-energetic LFM signal component from the multicomponent LFM signal, iteratively applied multiple times until all LFM signal components are successfully separated.

The most-energetic LFM signal-component-extraction task can be formulated as a regression problem, i.e.,

s_{k} (n) = f_{ϕ} (s (n))

, which can be solved by MLFMSS-Net.

f_{ϕ} (\cdot)

is the network with parameters

ϕ

. Utilizing a set of training data, MLFMSS-Net is trained through the minimization of the loss function. This training process enables the determination of optimal network parameters

ϕ

, leading to improved performance of the model. The network details will be discussed in subsequent subsections. Then, the well-trained MLFMSS-Net is applied in the iterative signal-separation framework. Specifically, the number of iterations and the residual are first initialized by

k = 1

and

r s_{k} (n) = s (n)

, respectively. At each iterative step, the well-trained MLFMSS-Net is employed to

r s_{k} (n)

for the most-energetic LFM signal component

s_{k} (n)

extraction. Then, the new signal component

s_{k} (n)

is eliminated from

r s_{k} (n)

to obtain the updated residual as follows:

r s_{k + 1} (n) = r s_{k} (n) - s_{k} (n) = s (n) - \sum_{j = 1}^{j = k} s_{j} (n)

(14)

The number of iterations is updated by

k \leftarrow k + 1

. The aforementioned steps are repeated to extract a new LFM signal component from the residual

r s_{k} (n)

until k either exceeds the maximum number

K_{m a x}

of moving targets or the average power of the residual

∥ r s_{k} (n) ∥_{2}^{2}

is less than an empirical threshold value

ε

. Given that

ε

is feasible for a new LFM signal component declaration, it is advisable to set the value of

ε

equal to the average power of the background clutter surrounding the moving targets for the residual clutter suppression. Then, let the target number

K \leftarrow k - 1

. As a consequence, all LFM signal components’,

s_{k} (n), k = 1, \dots, K

, separation and residual clutter

c n_{12} (n)

suppression from

s (n)

have been achieved based on the proposed framework.

To realize K-moving-target imaging, in the imaging procedure, WVD is applied for the quadratic coefficient

{\hat{a}}^{k}

estimation of

s_{k} (n)

. Secondary azimuth compression matched to

{\hat{a}}^{k}

is further performed on the corresponding signal component

s_{k} (n)

to generate the well-focused kth moving-target image as follows:

s_{o k} (n) = F^{- 1} \{F [s_{k} (n)] \cdot F [s_{a} (n)]\}

(15)

where

F (\cdot)

and

F^{- 1} (\cdot)

denote the Fourier transform and the inverse Fourier transform, respectively. The azimuth matching function is expressed as

s_{a} (n) = rect (n) exp (- j π {\hat{a}}^{k} n^{2})

. The azimuth multi-moving-target image of each range cell can be generated by the linear superposition of each target image as follows:

s_{o} (n) = \sum_{k = 1}^{k = K} s_{o k} (n)

(16)

Therefore, the iterative signal-separation framework based on MLFMSS-Net, as well as the imaging procedure are implemented on each of the range cell data containing multiple targets to obtain the whole well-refocused moving-target image. The specific implementation procedures of the proposed method can be summarized in Algorithm 1.

Algorithm 1: The MLFMSS-Net-assisted multi-moving-target-imaging method

1: Preprocessing: Implement the basic SAR imaging processing and clutter suppression to obtain the defocused image of multiple moving targets
2: Input: Defocused multi-moving-target image
3: Output: Refocused multi-moving-target image
4: for $i \leftarrow 1$ to $N_{r}$ do
5: select the signal $s (n)$ from the ith range cell;
6: initialize $k \leftarrow 1$ and $r s_{k} (n) \leftarrow s (n)$ ;
7: while $k \leq K_{m a x}$ or $∥ r s_{k} (n) ∥_{2}^{2} \geq ε$ do
8: $s_{k} (n) \leftarrow f_{ϕ} (s (n))$ by the well-trained MLFMSS-Net;
/* design MLFMSS-Net $f_{ϕ}$ , where the network parameters $ϕ$ are
optimized by minimizing the loss function through the
training data */
9: $r s_{k + 1} (n) \leftarrow r s_{k} (n) - s_{k} (n)$ ;
10: $k \leftarrow k + 1$ ;
11: end
12: $K \leftarrow k - 1$ ;
13: for $k \leftarrow 1$ to K do
14: estimate ${\hat{a}}^{k}$ of $s_{k} (n)$ by WVD;
15: $s_{o k} (n) \leftarrow F^{- 1} \{F [s_{k} (n)] \cdot F [s_{a} (n)]\}$ ;
16: end
17: $s_{o} (n) \leftarrow \sum_{k = 1}^{k = K} s_{o k} (n)$ ;
18: end

3.2. Dataset Description

Data pairs consisting of the multicomponent LFM signal and the corresponding most-energetic LFM signal component are required for the training of MLFMSS-Net. The multicomponent LFM signal from different SAR systems can be simulated according to Equation (13). It is worth mentioning that this network primarily focuses on refocusing the moving targets, rather than imaging the SAR observation scene. If all the large-sized SAR data are directly utilized as the network input, it would require a large amount of memory and computation time for training. Moreover, many clutter areas without moving targets are not useless for training. Instead of all the SAR data, therefore, numerous small-sized data patches with the length

N = 512

containing moving targets should be extracted from the SAR data and processed by the proposed MLFMSS-Net-assisted imaging method. The number of LFM signal components is a random integer from 0 to 5 for each signal sample. The signal parameters, respectively, obey the following uniform distributions:

| A^{k} | \sim U (0, 1)

,

Δ n^{k} \sim U (0, N)

,

n_{a c}^{k} \sim U (0, N)

,

Δ N_{a}^{k} \sim U (0, N)

. Given that the SAR system parameters are implicitly embedded within the

a^{k}

of

s (n)

,

a^{k}

follows a uniform distribution over the reasonable range

U (- 0.05, 0.05)

. This ensures that the dataset contains multicomponent LFM signals from various SAR systems.

Because of the terrain’s heterogeneity, according to the analysis in [34], the additive perturbation

c n_{12} (n)

was simulated under the assumption of a compound clutter model. This model accounts for heterogeneity in the data by assuming that each sampling cell is the product of a speckle random variable (RV)

\bar{X}

and a statistically independent texture RV

Δ \in [0, \infty)

, i.e.,

Y = Δ \times \bar{X}

.

\bar{X}

is characterized by a complex Gaussian distribution

N^{C} (0, (1 - ρ) (σ_{c}^{2} + σ_{n}^{2}))

, where

ρ

is the correlation coefficient between two channels.

σ_{c}^{2}

and

σ_{c}^{2}

are the clutter and noise powers, respectively. It is convenient to normalize

\bar{X}

to its expectation, i.e.,

X = \bar{X} / ((1 - ρ) (σ_{c}^{2} + σ_{n}^{2}))

.

Δ

can be physically interpreted as the fluctuating variance (power) of the speckle. It is independent of the radar position and depends on the spatial distribution, orientation, and type of clutter under measurement.

Δ

obeys the following distribution [34]:

f_{Δ} (δ) = \frac{2 {(ν - 1)}^{ν}}{Γ (ν)} δ^{- (2 ν + 1)} exp (\frac{(1 - ν)}{δ^{2}})

(17)

where

ν

represents the texture parameter indicating the level of clutter heterogeneity in the imaging scene. A higher value of

ν

suggests a more-homogeneous scene, while a lower value indicates a more-heterogeneous scene. The density function of

c n_{12} (n)

can be calculated by Bayes rule as follows [35]:

f_{Y} (y; ν) = \frac{Γ (ν + 1)}{π Γ (ν)} \frac{{(ν - 1)}^{ν}}{{({| y |}^{2} + ν - 1)}^{(ν + 1)}}

(18)

According to the aforementioned distribution,

c n_{12} (n)

can be simulated with a given

ν

. To obtain the dataset containing the clutter with various heterogeneity levels,

ν

should follow the uniform distribution

ν \sim U (3, 50)

. Consequently, the multicomponent LFM signal can be obtained by summing the LFM signal components and the additive perturbation with different input SCNRs obeying the uniform distribution

U (- 5 dB, 15 dB)

in the dataset. The real and imaginary parts of the multicomponent LFM signal are divided into two independent channels as the network input

I \in R^{N \times 2}

.

In addition, the most-energetic LFM signal component in the multicomponent LFM signal will be adopted as the output of MLFMSS-Net. Similarly, the real and imaginary parts of the signal component are treated as two independent channels, i.e.,

O \in R^{N \times 2}

.

Therefore, thousands of data pairs with diverse SAR system parameters, target motion characteristics, and clutter types have been generated for training and testing. It is worth noting that the minimax normalization strategy is utilized to normalize the dataset into the range [0, 1] for better network training and performance optimization.

3.3. Architecture of Multicomponent Linear Frequency Modulation Signal-Separation-Net

As shown in Figure 3, MLFMSS-Net consists of the encoder, the signal-separation module, and the decoder. The notation

k 16 n 32 s 8

represents a 1D convolutional layer with a kernel size of 16, a filter number of 32, and a stride of 8. A convolutional layer without zero padding is utilized as the encoder to transform short segments of the network input

I \in R^{N \times 2}

into their corresponding representations

F \in R^{\hat{N} \times 512}

in an intermediate feature space. The signal-separation module is then implemented by estimating a weighting function (mask)

M \in R^{\hat{N} \times 512}

, which is elementwise multiplied with the encoder output, i.e.,

S = M ⊙ F

, where ⊙ denotes the elementwise multiplication. Finally, a transposed convolutional layer as the decoder is exploited to transform the masked encoder feature into the LFM signal component with the highest energy

\hat{O} \in R^{N \times 2}

. We describe the details of the signal-separation module in the following.

3.3.1. Signal-Separation Module

For the separation mask

M

estimation, the global layer normalization (gLN) is first exploited to normalize the feature over both the channel and time dimensions [36]. A convolutional layer of kernel size 1 is added as a bottleneck layer to adaptively fuse the effective features and stabilize the training of the deeper network. Inspired by the network for audio separation, the mask can be generated by a temporal convolutional network (TCN) [26,37]. This TCN is composed of multiple 1D dilated convolutional blocks (1D Convs), which enable the network to capture long-term dependencies within the input signal while maintaining a compact model size. The 1D Conv with an exponentially increasing dilation factor ensures a sufficiently large temporal context window to take advantage of the long-range dependencies of the signal. In the signal-separation module, eight 1D Convs with dilation factors

d = 1, 2, \dots, 2^{7}

are repeated 3 times. A residual path and a skip-connection path of each 1D Conv are applied: the residual path of the current 1D Conv is directly fed into the next block, while the skip-connection paths for all blocks are summed up and utilized as the output of the TCN, which is passed to a parametric rectified linear unit (PReLU) activation function [38]. Finally, a convolutional layer with a Sigmoid function is applied to estimate the mask.

3.3.2. One-Dimensional Dilated Convolutional Block

In each 1D Conv, as shown in Figure 3, a convolutional layer with a kernel size of 1 is first added to increase the complexity and richness of the features. The PReLU activation function is utilized to introduce nonlinearity and is followed by a gLN. Subsequently, to further reduce the number of parameters while maintaining a certain level of feature representation capability, the depthwise separable convolution can be employed as a substitute for the standard convolutional operation [39]. The depthwise separable convolution operator decouples the standard convolution operation into a depthwise convolution (D-Conv) with dilation factor d and a pointwise convolution. Besides, a PReLU activation function together with a normalization operation is added after the D-Conv. Each 1D Conv consists of a residual path and a skip-connection path, which improve the flow of information and gradient throughout the network.

3.4. Training Procedure

To achieve the desired signal separation performance of MLFMSS-Net, the network is required to be trained by minimizing a signal separation loss, denoted as

L_{s p} (\hat{O}, O)

. Here,

\hat{O}

represents the predicted output of the network

f_{ϕ} (I)

and

O

is the ground truth LFM signal component with the highest energy. The signal-separation task can be formulated as a regression problem, where the goal is to minimize the difference between the predicted and ground truth signals. This is typically achieved by applying the pixelwise squared Euclidean distance as the loss function:

L_{s p} (\hat{O}, O) = {∥ \hat{O} - O ∥}_{2}^{2}

(19)

An NVIDIA GeForce GTX3090 GPU was utilized to train the MLFMSS-Net model. The model was trained with 500,000 training samples and 150,000 validation samples. Then, 150,000 samples were used for testing. The batch size was configured to 32 during the training process. Furthermore, Adam [40] was exploited as the optimization algorithm to update

ϕ

at each step. The learning rate of the network was initially set to

5 \times 10^{- 4}

and decayed by 0.5 for every 5 epochs. This allowed the network to gradually adjust its parameters towards the optimal values that minimize the loss function.

During training, MLFMSS-Net underwent 125,000 backpropagation iterations within 1.46 h, resulting in a good fitting performance. The well-trained network is capable of achieving accurate predictions of outputs for unseen inputs. It has learned and captured the mapping relationship between its input and output, allowing it to generate the LFM signal component by simply calculating

\hat{O} = f_{ϕ} (I)

.

4. Experimental Results and Performance Analysis

In this section, the signal separation performance of the trained MLFMSS-Net is analyzed by the multicomponent LFM signal simulation in Section 4.1. Moreover, the imaging results on both the simulated and real SAR data are further presented to verify the feasibility and effectiveness of the proposed MLFMSS-Net-assisted multi-moving-target-imaging method in Section 4.2 and Section 4.3.

4.1. Results on Simulated Multicomponent Linear Frequency Modulation Signal

In this subsection, the signal separation performance of the trained MLFMSS-Net is explored by a three-component LFM signal, the parameters of which are listed in Table 1. The additive perturbation with

ν = 5

was added with the SCNR

= 10 dB

. The real and imaginary parts of the three-component LFM signal are shown in Figure 4a,b, respectively. The three LFM signal components overlap with each other and are contaminated by the additive perturbation, which makes the signal separation more challenging. Utilizing the iterative signal-separation framework based on MLFMSS-Net, the separation results of the three LFM signal components are given in Figure 4c–h. It can be seen that the separated signal components are similar to the ground truth signals. Therefore, MLFMSS-Net achieved a good signal separation performance.

The correlation coefficient between the ground truth and predicted signal components was utilized to numerically evaluate the separation accuracy in various SCNR scenarios as follows:

ρ = \frac{\sum_{i = 1}^{N} |\hat{O} (i) O^{*} (i)|}{\sqrt{\sum_{i = 1}^{N} {|\hat{O} (i)|}^{2} \sum_{i = 1}^{N} {|O^{*} (i)|}^{2}}} .

(20)

where

O^{*}

is the conjugate of

O

. The correlation between the two signals becomes stronger as the value of

ρ

approaches 1. A total of 500 trials were conducted for each input SCNR within the range [−5 dB, 10 dB]. The Chirplet decomposition method [10,11] was utilized to compare the separation accuracy of MLFMSS-Net.

As depicted in Figure 5, the correlation coefficient rose as the SCNR increased. Compared with Chirplet decomposition, it was evident that MLFMSS-Net exhibited superior accuracy in signal separation. Notably, when the SCNR exceeded 0 dB, MLFMSS-Net achieved a correlation coefficient exceeding 95%. This is because MLFMSS-Net was trained by minimizing the squared Euclidean distance, i.e.,

min ∥ f_{ϕ} (I) {- O ∥}_{2}^{2}

. Once this training process was complete, the optimal parameters

ϕ

could be obtained. The trained MLFMSS-Net was capable of generating the predicted signal component

\hat{O} = f_{ϕ} (I)

that had the minimum difference from the ground truth signal component

O

.

4.2. Results on Simulated SAR Data

To effectively analyze the imaging quality of the proposed MLFMSS-Net-assisted method in different SAR systems, two relatively complicated imaging scenarios were simulated in airborne and spaceborne SAR systems for testing, respectively. The parameters of the airborne and spaceborne SAR systems are listed in Table 2 and Table 3, respectively.

The first scenario containing four moving point targets and the additive perturbation with the input SCNR

= 5 dB

was simulated in the airborne SAR system, as shown in Figure 6a. It can be observed that the responses of moving targets are defocused in azimuth and overlap with each other in the SAR image. Chirplet decomposition [10,11] was utilized to separate the multi-moving-target signal and estimate the corresponding Doppler chirp rate. The refocused target image can be obtained by the secondary azimuth compression matched to the estimated Doppler chirp rate. As shown in Figure 6b, all targets are refocused in azimuth with some residual clutter around them. In Figure 6c, the well-refocused SAR image can be generated by the proposed MLFMSS-Net-based imaging method. Moreover, it provided better residual clutter suppression performance than Chirplet decomposition. Besides, the correlation coefficients of targets MT2-MT4 with different methods are observed in Figure 7. It can be seen that MLFMSS-Net achieved better separation accuracy than Chirplet decomposition, especially in the low SCNR case.

Furthermore, another scenario with the input SCNR

= 0 dB

in the spaceborne SAR system is simulated in Figure 8a. The imaging results based on Chirplet decomposition and MLFMSS-Net are, respectively, depicted in Figure 8b,c. The moving-target image obtained by the Chirplet-decomposition-based imaging method is submerged in the strong background clutter, which brings difficulty to target detection. In comparison, the MLFMSS-Net-based imaging method realizes image focusing and residual clutter suppression in a strong clutter environment. Therefore, the proposed MLFMSS-Net-based imaging method is capable of achieving multi-moving-target imaging with different radar system parameters, moving target characteristics, and clutter types.

The image focusing performance of the proposed method was quantitatively evaluated by the image entropy [41], with smaller values indicating better focus and clarity. In addition, the output SCNR was utilized to numerically measure the clutter suppression performance. As listed in Table 4 for the two scenarios, the imaging quality of the Chirplet-decomposition-based imaging method is greatly enhanced compared with the original image. The MLFMSS-Net-based imaging method achieves the best image focusing and clutter suppression performance, which further verifies the effectiveness of the proposed method. In addition, MLFMSS-Net costs a much shorter running time of 6.876 s compared with the 273.6 s required by Chirplet decomposition. Therefore, the proposed imaging method based on MLFMSS-Net achieves great improvements in both imaging quality and efficiency.

4.3. Results on Real SAR Data

In this subsection, real airborne SAR data and satellite TerraSAR-X data are both utilized to further demonstrate the effectiveness of the proposed imaging method. The parameters of the two SAR systems were the same as those of the simulations, as listed in Table 2 and Table 3. Figure 9a shows the original image containing six moving targets MT1-MT6 in the airborne SAR system. All moving targets are defocused owing to the target motion and obscured by the background clutter. MT3-MT6 are close in distance and overlap with each other, making it difficult to distinguish them. The imaging results based on Chirplet decomposition and MLFMSS-Net are shown in Figure 9b,c, respectively. Six moving targets can be well refocused, making them easy to distinguish. Compared with Chirplet decomposition, the MLFMSS-Net-based imaging method obtains few virtual targets and provides superior imaging performance in azimuth.

Moreover, the imaging results of TerraSAR-X data containing three moving targets were obtained as shown in Figure 10. As shown in Figure 10b,c, the moving targets are well refocused while the background residual clutter is suppressed based on Chirplet decomposition and MLFMSS-Net. The entropy and SCNR for TerraSAR-X data are listed in Table 5. Compared with other methods, the imaging method based on MLFMSS-Net has lower image entropy and a larger SCNR, which achieves better refocusing and clutter suppression performance.

5. Discussion

In Section 4.2, two simulated data acquired by the airborne and spaceborne SAR systems were both utilized to verify the applicability of the proposed imaging method. The results showed that the trained MLFMSS-Net realizes the separation of multicomponent LFM signals from different SAR systems. With the assistance of MLFMSS-Net, the proposed imaging method achieves high-quality and high-efficiency imaging without prior knowledge of the system parameters. The reason behind this success lies in the fact that the training dataset was generated based on a multicomponent LFM signal model, the parameters of which implicitly embed the radar system parameter information. By training MLFMSS-Net on this abundant dataset containing different signal parameters, the network becomes adaptable to various radar systems. In Section 4.3, experiments on two sets of real data acquired from airborne SAR systems and the TerraSAR-X satellite further confirmed the ability of the proposed method to improve the robustness of moving-target imaging, while reducing sensitivity to the system parameters. This is of great significance for the application of the proposed method in real-world scenarios.

Meanwhile, experiments with different target numbers in Section 4 demonstrated that the proposed method is capable of achieving the multi-moving-target imaging without prior knowledge of the target number, exhibiting its suitability for practical imaging scenarios.

However, it is worthwhile to remark that the length of the separated signals in the experiments was fixed owing to the fixed training sample length in the dataset, limiting the flexibility of moving-target imaging processing. Therefore, our future work will focus on how to improve the adaptability of the network model to the separated signals with different lengths, such as more-diverse data sample generation and multi-scale model building.

6. Conclusions

In this paper, we propose a MLFMSS-Net-assisted multi-moving-target-imaging method for the SAR-GMTI system. The network MLFMSS-Net was designed to extract the most-energetic LFM signal component from the multicomponent LFM signal and trained on the dataset with various SAR system parameters, target information, and clutter types, which was generated by the multicomponent LFM signal model. The well-trained MLFMSS-Net was iteratively applied multiple times to realize all LFM signal component separation and residual clutter suppression. Without prior knowledge of the target number, compared with other methods, the target imaging and residual clutter suppression performance were improved with high processing efficiency, especially in low-input SCNR scenarios. It provides further potential to implement the subsequent image interpretation including target classification and recognition. Meanwhile, the proposed method enhances the robustness of moving-target imaging, while reducing sensitivity to the system parameters, preliminarily addressing a significant challenge faced by deep-learning-based imaging methods. This allows the deep-learning-based imaging method to be a suitable solution for practical imaging applications.

In future work, we will further investigate how to incorporate the parameter estimation and pulse compression tasks into the deep network, aiming to overcome the limitations of the inherent resolution on estimation accuracy for traditional TFR-based methods and the signal bandwidth on imaging resolution for matched filtering methods. Additionally, the proposed method focuses on the imaging of ground moving targets with accelerated motions, the radar echoes of which can be characterized as the multicomponent LFM signal model. However, this signal model and the corresponding imaging method are not suitable for maritime targets with complex motions. A multicomponent higher-order phase signal model needs to be established. Subsequent research will explore how to fully utilize the powerful feature extraction and signal separation capabilities of MLFMSS-Net and generalize its applications to complex signal models.

Author Contributions

Conceptualization, C.D. and H.M.; methodology, H.M.; software, C.D.; validation, H.M. and Y.Z.; formal analysis, C.D.; investigation, H.M.; resources, C.D.; data curation, Y.Z.; writing—original draft preparation, C.D. and H.M.; writing—review and editing, H.M.; visualization, C.D.; supervision, Y.Z.; project administration, H.M.; funding acquisition, C.D. and H.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under Grant No.62201612 and U22A2014, China Postdoctoral Science Foundation Funded Project under Grant No.2022M723880, Natural Science Foundation of Shaanxi Province under Grant No.2023-JC-QN-0740, and Open Fund of Shaanxi Key Laboratory of Artificially Structured Functional Materials and Devices under Grant No.AFMD-KFJJ-22104 and AFMD-KFJJ-22201.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Raney, R.K. Synthetic Aperture Imaging Radar and Moving Targets. IEEE Trans. Aerosp. Electron. Syst. 1971, AES-7, 499–505. [Google Scholar] [CrossRef]
Chen, L.; Ni, J.; Luo, Y.; He, Q.; Lu, X. Sparse SAR Imaging Method for Ground Moving Target via GMTSI-Net. Remote Sens. 2022, 14, 4404. [Google Scholar] [CrossRef]
Chen, Y.; Li, G.; Zhang, Q.; Sun, J. Refocusing of Moving Targets in SAR Images via Parametric Sparse Representation. Remote Sens. 2017, 9, 795. [Google Scholar] [CrossRef]
He, Z.; Chen, X.; Yi, T.; He, F.; Dong, Z.; Zhang, Y. Moving Target Shadow Analysis and Detection for ViSAR Imagery. Remote Sens. 2021, 13, 3012. [Google Scholar] [CrossRef]
Tong, X.; Bao, M.; Sun, G.; Han, L.; Zhang, Y.; Xing, M. Refocusing of Moving Ships in Squint SAR Images Based on Spectrum Orthogonalization. Remote Sens. 2021, 13, 2807. [Google Scholar] [CrossRef]
Thammakhoune, S.; Yonel, B.; Mason, E.; Yazici, B.; Eldar, Y.C. Phase-Space Function Recovery for Moving Target Imaging in SAR by Convex Optimization. IEEE Trans. Comput. Imaging 2021, 7, 1018–1030. [Google Scholar] [CrossRef]
Shi, H.; Zhang, L.; Liu, D.; Yang, T.; Guo, J. SAR Imaging Method for Moving Targets Based on Omega-k and Fourier Ptychographic Microscopy. IEEE Geosci. Remote Sens. Lett. 2022, 19, 4509205. [Google Scholar] [CrossRef]
Park, J.W.; Won, J.S. An Efficient Method of Doppler Parameter Estimation in the Time-Frequency Domain for a Moving Object from TerraSAR-X Data. IEEE Trans. Geosci. Remote Sens. 2011, 49, 4771–4787. [Google Scholar] [CrossRef]
Huang, P.; Liao, G.; Yang, Z.; Xia, X.G.; Ma, J.T.; Zhang, X. A Fast SAR imaging method for ground moving target using a second-order WVD transform. IEEE Trans. Geosci. Remote Sens. 2016, 54, 1940–1956. [Google Scholar] [CrossRef]
Peng, Z.K.; Meng, G.; Chu, F.L.; Lang, Z.Q.; Zhang, W.M.; Yang, Y. Polynomial chirplet transform with application to instantaneous frequency estimation. IEEE Trans. Instrum. Meas. 2011, 60, 3222–3229. [Google Scholar] [CrossRef]
Zhang, Y.; Mu, H.; Xiao, T.; Jiang, Y.; Ding, C. SAR imaging of multiple maritime moving targets based on sparsity Bayesian learning. IET Radar Sonar Navig. 2020, 14, 1717–1725. [Google Scholar] [CrossRef]
Sun, H.B.; Liu, G.S.; Gu, H.; Su, W.M. Application of the fractional Fourier transform to moving target detection in airborne SAR. IEEE Trans. Aerosp. Electron. Syst. 2002, 38, 1416–1424. [Google Scholar]
Li, Z.; Zhang, X.; Yang, Q.; Xiao, Y.; An, H.; Yang, H.; Wu, J.; Yang, J. Hybrid SAR-ISAR image formation via joint FrFT-WVD processing for BFSAR ship target high-resolution imaging. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5215713. [Google Scholar] [CrossRef]
Yang, L.; Bi, G.; Xing, M.; Zhang, L. Airborne SAR moving target signatures and imagery based on LVD. IEEE Trans. Geosci. Remote Sens. 2015, 53, 5958–5971. [Google Scholar] [CrossRef]
Yang, L.; Zhao, L.; Bi, G.; Zhang, L. SAR ground moving-target imaging algorithm based on parametric and dynamic sparse Bayesian learning. IEEE Trans. Geosci. Remote Sens. 2016, 54, 2254–2267. [Google Scholar] [CrossRef]
Oveis, A.H.; Giusti, E.; Ghio, S.; Martorella, M. A Survey on the Applications of Convolutional Neural Networks for Synthetic Aperture Radar: Recent Advances. IEEE Aerosp. Electron. Syst. Mag. 2021, 37, 18–42. [Google Scholar] [CrossRef]
Gao, J.; Deng, B.; Qin, Y.; Wang, H.; Li, X. Enhanced radar imaging using a complex-valued convolutional neural network. IEEE Geosci. Remote Sens. Lett. 2019, 16, 35–39. [Google Scholar] [CrossRef]
Mu, H.; Zhang, Y.; Jiang, Y.; Ding, C. CV-GMTINet: GMTI using a deep complex-valued convolutional neural network for multichannel SAR-GMTI system. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5201115. [Google Scholar] [CrossRef]
Mu, H.; Zhang, Y.; Ding, C.; Jiang, Y.; Er, M.H.; Kot, A.C. DeepImaging: A ground moving-target imaging based on CNN for SAR-GMTI system. IEEE Geosci. Remote Sens. Lett. 2021, 18, 117–121. [Google Scholar] [CrossRef]
Lu, Z.J.; Qin, Q.; Shi, H.Y.; Huang, H. SAR moving-target imaging based on convolutional neural network. Digit. Signal Prog. 2020, 106, 1–10. [Google Scholar] [CrossRef]
Zhang, H.; Ni, J.; Xiong, S.; Luo, Y.; Zhang, Q. Omega-KA-Net: A SAR ground moving-target imaging network based on trainable Omega-K algorithm and sparse optimization. Remote Sens. 2022, 14, 1664. [Google Scholar] [CrossRef]
Zhou, Y.; Shi, J.; Wang, C.; Hu, Y.; Zhou, Z.; Yang, X.; Zhang, X.; Wei, S. SAR Ground Moving Target Refocusing by Combining mRe³ Network and TV^β-LSTM. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5200814. [Google Scholar] [CrossRef]
Hua, Q.; Zhang, Y.; Jiang, Y.; Xu, D. CV-CFUNet: Complex-Valued Channel Fusion UNet for Refocusing of Ship Targets in SAR Images. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 4478–4492. [Google Scholar] [CrossRef]
Lian, M.; Bolic, M. An Attention Based Complex-valued Convolutional Autoencoder for GEO SA-Bi SAR Ship Target Refocusing. In Proceedings of the IEEE Sensors Applications Symposium (SAS), Ottawa, ON, Canada, 18–20 July 2023; pp. 1–6. [Google Scholar]
Luo, Y.; Chen, Z.; Yoshioka, T. Dual-path RNN: Efficient long sequence modeling for time-domain single-channel speech separation. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Barcelona, Spain, 4–8 May 2020; pp. 46–50. [Google Scholar]
Luo, Y.; Mesgarani, N. Conv-TasNet: Surpassing ideal time–frequency magnitude masking for speech separation. IEEE-ACM Trans. Audio Speech Lang. 2019, 27, 1256–1266. [Google Scholar] [CrossRef]
Luo, Y.; Mesgarani, N. TasNet: Time-domain audio separation network for real-time, single-channel speech separation. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, AB, Canada, 15–20 April 2018; pp. 696–700. [Google Scholar]
Su, H.; Bao, Q.; Chen, Z. Parameter estimation processor for chirp signals based on a complex-valued deep neural network. IEEE Access 2019, 7, 176278–176290. [Google Scholar] [CrossRef]
Su, H.; Bao, Q.; Chen, Z. ADMM–Net: A Deep Learning Approach for Parameter Estimation of Chirp Signals under sub-Nyquist Sampling. IEEE Access 2020, 8, 75714–75727. [Google Scholar] [CrossRef]
Jao, J.K.; Yegulalp, A. Multichannel synthetic aperture radar signatures and imaging of a moving target. Inverse Probl. 2013, 29, 054009. [Google Scholar] [CrossRef]
Deming, R.W. Along-track interferometry for simultaneous SAR and GMTI: Application to Gotcha challenge data. In Algorithms for Synthetic Aperture Radar Imagery XVIII; SPIE: Bellingham, WA, USA, 2011; Volume 8051, pp. 201–218. [Google Scholar]
Zhu, D.; Li, Y.; Zhu, Z. A keystone transform without interpolation for SAR ground moving-target imaging. IEEE Geosci. Remote Sens. Lett. 2007, 4, 18–22. [Google Scholar] [CrossRef]
Cerutti-Maori, D.; Sikaneta, I. A generalization of DPCA processing for multichannel SAR/GMTI radars. IEEE Trans. Geosci. Remote Sens. 2013, 51, 560–572. [Google Scholar] [CrossRef]
Cerutti-Maori, D.; Sikaneta, I.; Cerutti-Maori, D. Two-step detector for RADARSAT-2’s experimental GMTI mode. IEEE Trans. Geosci. Remote Sens. 2013, 51, 436–454. [Google Scholar]
Gradshteyn, I.S.; Ryzhik, I.M. Table of Integrals, Series, and Products; Academic Press: Cambridge, MA, USA, 2014. [Google Scholar]
Ba, J.L.; Kiros, J.R.; Hinton, G.E. Layer Normalization. arXiv 2016, arXiv:1607.06450. [Google Scholar]
Lea, C.; Flynn, M.D.; Vidal, R.; Reiter, A.; Hager, G.D. Temporal convolutional networks for action segmentation and detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 156–165. [Google Scholar]
He, K.; Zhang, X.; Ren, S.; Sun, J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In Proceedings of the 2015 IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 1026–1034. [Google Scholar]
Howard, A.G.; Zhu, M.; Chen, B.; Kalenichenko, D.; Wang, W.; Weyand, T.; Andreetto, M.; Adam, H. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv 2017, arXiv:1704.04861. [Google Scholar]
Kingma, D.; Ba, J. Adam: A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations, Banff, AB, Canada, 14–16 April 2014. [Google Scholar]
Kragh, T.J.; Kharbouch, A.A. Monotonic iterative algorithm for minimum-entropy autofocus. In Proceedings of the Adaptive Sensor Array Processing Workshop, Lexington, MA, USA, 6–7 June 2006; Volume 40, pp. 1147–1159. [Google Scholar]

Figure 1. Geometry of the dual-channel SAR-GMTI system.

Figure 2. Flowchart of the MLFMSS-Net-assisted multi-moving-target-imaging method.

Figure 3. Architecture of MLFMSS-Net.

Figure 4. Three-component LFM signal separation results based on MLFMSS-Net. (a,b) The real and imaginary parts of the three-component LFM signal, respectively. (c,d) The real and imaginary parts of LFM signal component 1, respectively. (e,f) The real and imaginary parts of LFM signal component 2, respectively. (g,h) The real and imaginary parts of LFM signal component 3, respectively.

Figure 5. Correlation coefficients of three LFM signal components versus input SCNR using different methods for the simulated three-component LFM signal.

Figure 6. Multiple moving target simulation for airborne SAR system. (a) Original image. (b,c) Imaging results based on Chirplet decomposition and MLFMSS-Net, respectively.

Figure 7. Correlation coefficients of three LFM signal components versus input SCNR using different methods for the simulated airborne SAR data.

Figure 8. Multiple moving target simulation for spaceborne SAR system. (a) Original image. (b,c) Imaging results based on Chirplet decomposition and MLFMSS-Net, respectively.

Figure 9. Airborne SAR data imaging results. (a) Original image. (b,c) Refocused images based on Chirplet decomposition and MLFMSS-Net, respectively.

Figure 10. TerraSAR-X data imaging results. (a) Original image. (b,c) Refocused images based on Chirplet decomposition and MLFMSS-Net, respectively.

Table 1. Parameters of simulated three-component LFM signal.

Components	$\| A^{k} \|$	$Δ n^{k}$	$n_{ac}^{k}$	$Δ N_{a}^{k}$	$a^{k}$
1	1	100	100	100	−0.008
2	0.8	100	200	80	0.01
3	0.5	50	0	100	0.002

Table 2. Parameters of airborne SAR system.

Parameter	Notation	Value
Wavelength	$λ$	0.03 m
Platform velocity	V	150 m/s
Pulse repetition frequency	PRF	800 Hz
Slant range	$R_{B}$	10 km

Table 3. Parameters of spaceborne SAR system.

Parameter	Notation	Value
Wavelength	$λ$	0.03 m
Platform velocity	V	7068 m/s
Pulse repetition frequency	PRF	4560 Hz
Slant range	$R_{B}$	700 km

Table 4. Imaging quality for simulated SAR data.

Platform	Metric	Original	Chirplet	MLFMSS-Net
Airborne	Entropy	9.849	6.682	4.200
Airborne	SCNR (dB)	5.000	31.46	41.07
Spaceborne	Entropy	9.859	8.249	5.095
Spaceborne	SCNR (dB)	0.000	18.29	38.42

Table 5. Imaging quality for real SAR data.

Platform	Metric	Original	Chirplet	MLFMSS-Net
Airborne	Entropy	9.233	6.333	5.570
Airborne	SCNR (dB)	−0.671	29.28	34.88
TerraSAR-X	Entropy	8.418	4.867	4.485
TerraSAR-X	SCNR (dB)	5.036	31.20	47.83

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ding, C.; Mu, H.; Zhang, Y. A Multicomponent Linear Frequency Modulation Signal-Separation Network for Multi-Moving-Target Imaging in the SAR-Ground-Moving-Target Indication System. Remote Sens. 2024, 16, 605. https://doi.org/10.3390/rs16040605

AMA Style

Ding C, Mu H, Zhang Y. A Multicomponent Linear Frequency Modulation Signal-Separation Network for Multi-Moving-Target Imaging in the SAR-Ground-Moving-Target Indication System. Remote Sensing. 2024; 16(4):605. https://doi.org/10.3390/rs16040605

Chicago/Turabian Style

Ding, Chang, Huilin Mu, and Yun Zhang. 2024. "A Multicomponent Linear Frequency Modulation Signal-Separation Network for Multi-Moving-Target Imaging in the SAR-Ground-Moving-Target Indication System" Remote Sensing 16, no. 4: 605. https://doi.org/10.3390/rs16040605

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Multicomponent Linear Frequency Modulation Signal-Separation Network for Multi-Moving-Target Imaging in the SAR-Ground-Moving-Target Indication System

Abstract

1. Introduction

2. Multi-Moving-Target-Signal Model for the SAR-Ground-Moving-Target-Indication System

2.1. Geometric Configuration

2.2. Signal Model

3. MLFMSS-Net-Assisted Multi-Moving-Target Imaging

3.1. Overall Scheme

3.2. Dataset Description

3.3. Architecture of Multicomponent Linear Frequency Modulation Signal-Separation-Net

3.3.1. Signal-Separation Module

3.3.2. One-Dimensional Dilated Convolutional Block

3.4. Training Procedure

4. Experimental Results and Performance Analysis

4.1. Results on Simulated Multicomponent Linear Frequency Modulation Signal

4.2. Results on Simulated SAR Data

4.3. Results on Real SAR Data

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI