2.1. Multi-Look SAR Tomography: Problem Formulation and Filter Design
Let us consider a stack of N azimuth-range focused SAR images, co-registered with respect to a given reference (master) image. We assume that the dataset has compensated for the atmospheric phase screen (APS) as well as for possible nonlinear deformation resulting from a small scale (lower resolution) analysis. In a given image pixel, the signal is a noisy version of the integrated backscattering function over the so-called multi-dimensional parameter space.
The vector collecting the parameters of interest (i.e., RT for 3-D case; RT and MDV for 4-D case; and RT, MDV, and TD for the 5-D case) is referred to as parameter vector p. It spans the parameter space, which is discretized in K bins, corresponding to the parameter vectors . RT can be referred to the elevation direction (orthogonal to the range and azimuth) or to the vertical direction: in the following, both definitions are used and specified according to the context.
In a given pixel, the
N-length data vector, e.g.,
g, is modeled as
where the dependence on the pixel has been omitted for sake of simplicity. In Equation (1), the
K-length vector
where (·)
T is the transposition operator, collects the samples of the backscattering distribution function over the bins (backscattering coefficients), whereas
w is the additive noise contribution. Furthermore,
A is the
N ×
K system matrix whose columns are referred to as
steering vectors.
With reference to the
kth bin, the corresponding steering vector, e.g.,
, is a structured versor (
) whose
nth component is
where
is the vector collecting the Fourier mate variables of the parameter vector. Such variables, whose spans determine the Rayleigh resolution (i.e., the resolution capability of the imaging system), depend on the adopted system parameters (transmitted wavelength, spatial baseline distribution, acquisition epochs, and so on) (see references [
4,
24,
25] for the 3-D, 4-D, and 5-D cases, respectively).
At this point, it is worth noting that Equation (1) models the signal component as a superposition over the bins of contributions (steering vectors) weighted according to the corresponding backscattering coefficients. On the one hand, along the elevation dimension, a real scenario can certainly involve the superposition of contributions associated to different bins corresponding to physically distinct scatterers. The same reasoning cannot be extended along the velocity and thermal dilation. For instance, for the 4-D case, the velocity can be interpreted as a spectral variable describing the harmonic content composing a generic, non-linear scatterer deformation. For PSI/TomoSAR it follows that, along the velocity (4-D) and thermal dilation (5-D) directions, the presence of backscattering distribution over multiple bins does not imply the presence of multiple (physical) PS, but only a spreading of the backscattering contribution associated with a deformation of a (physical) PS that cannot be described by a linear motion (4-D) or linear motion and a thermal dilation (5-D) according to the available temperatures. For the sequel, we assume that, for a given elevation, the backscattering is impulsive along the velocity and thermal dilation directions.
It is as well as understood that Equation (1) neglects the presence of decorrelation across the data-stack.
From a statistical point of view, the data vector is typically modeled as a zero-mean complex circular Gaussian random vector, with covariance matrix
where
and
are the variance of the backscattering coefficient corresponding to the
kth bin contribution and the power spectral density (PSD) of the (white) noise contribution, respectively;
E(·) and
are the statistical expectation operator and the
N ×
N identity matrix, respectively; and (·)
H stands for hermitian operator. Notice that, according to the previous assumption on the backscattering distribution, the variance
turns out to be concentrated in a single bin along the additional directions with respect to the elevation (4-D/5-D spaces).
Multi-look SAR tomography is aimed at reconstructing, pixel by pixel, the backscattering distribution along the bins (γ) from a set of L independent and homogeneous looks, e.g., .
A proper filter, e.g.,
, is exploited to carry out, look by look, an estimate
of the backscattering coefficient
γk. To mitigate the noise effect, such an estimate is subsequently averaged over all the looks, although this leads to an unavoidable spatial (range-azimuth) resolution loss. The multi-look reconstruction is thus
where
is the sampling covariance matrix of the data. It is worth noting that, under the Gaussian assumption, the sampling covariance matrix in Equation (6) is also the Maximum Likelihood Estimate (MLE) of the statistical covariance matrix in Equation (4).
Since the data and the unknowns in Equation (1) are related by a (typically non-uniformly sampled) Fourier operator, some filter design criteria coming from the spectral estimation theory have been effectively exploited. Among them, the minimum output energy (MOE) criterion allows recovering the spectral component corresponding to the
kth bin while limiting as much as possible the effects of interfering contribution. The problem is cast as:
being the statistical covariance matrix in Equation (4).
The Capon filter [
26] is the solution of the problem in Equation (7). Standard Lagrangian optimizations lead to the expression
showing that the Capon is an adaptive (data dependent) filter, because of the presence of the (inverse) statistical covariance matrix. In practical situations, the matrix
is unknown and, therefore, its sampling (multi-look) estimation
, defined as in Equation (6), is exploited. Accordingly, by substituting the filter expression in Equation (8) within the multi-look reconstruction in Equation (5), the Capon reconstruction can be written as:
A consideration is now in order.
The use of the inverse sampling covariance matrix makes the Capon filter to be intrinsically a multi-look processing. In other words, the Capon reconstruction in Equation (9) cannot be straightforwardly specialized to the single-look case, although a single-look Capon based TomoSAR algorithm has been proposed in paper [
11].
A much simpler nonparametric filter usually exploited for the reconstruction of the backscattering profile is Beam Forming (BF). Interestingly, BF can be considered as the solution of the problem in Equation (7) when the data are assumed to be a white process, that is, when
is proportional to the identity matrix. Such a condition leads in fact to the solution:
which provides the multi-look reconstruction
obtained by substituting Equation (10) into Equation (5).
Differently from the Capon, the BF in Equation (10) is a non-adaptive (data independent filter). To emphasize this aspect, the Capon is sometimes referred to as adaptive BF (ABF). Moreover, the BF reconstruction in Equation (11) can be straightforwardly specialized to the single-look case (), which makes it preferable to the Capon when a full spatial resolution analysis is required.
However, despite its implementation complexity compared to the BF, the ABF is expected to provide better sidelobes suppression and, as a consequence, mitigation of the leakage between interfering scatterers as well as (tomographic) super-resolution [
5,
23].
2.2. Multi-Look Detection in SAR Tomography: Problem Formulation and Solution Strategies
The assumed correspondence between (physical) PSs and bins allows identifying the presence of the stronger PSs interfering in the same pixel and estimating their tomographic parameters by selecting the highest peaks of the tomographic reconstruction.
However, the disturbance (noise and clutter) in the processed data along with the leakage level introduced by the exploited reconstruction technique could determine a misinterpretation of the results that makes necessary a further processing aimed at testing the reliability of the revealed PSs.
In this context is framed the TomoSAR detection problem, which in the multi-look case consists of determining, pixel by pixel, the number
of present PSs and estimating the corresponding parameters vectors from a set of
L independent and homogeneous looks. Such a problem can be conveniently cast in terms of the multiple (composite) hypothesis test
where
are the exploited looks. The
mth hypothesis in Equation (12) assumes the presence of
m PSs, characterized by the (unknown) parameters vectors
which are look-independent because of the looks homogeneity. The corresponding steering vectors are collected by the system matrix
whereas the backscattering coefficients form the look-dependent vector
Finally, each look is corrupted by an additive noise contribution , usually modelled as a white complex circular Gaussian random vectors, with (unknown) PSD .
In the following, Equation we deal with the problem of detecting up to two interfering scatterers (i.e.,).
From a statistical point of view, the joint probability density function (pdf) of the looks can be modeled according to two different complex multivariate Gaussian distributions. The first one leads to the zero-mean model, which characterizes the backscattering coefficients in Equation (14) as uncorrelated zero-mean complex circular Gaussian random variables with (unknown) variances , The second one corresponds to the nonzero-mean model, which considers, instead, the backscattering coefficients in Equation (14) as (unknown) deterministic parameters.
Different detection strategies can be followed.
A first possibility is to start from the tomographic reconstruction and exploit proper indexes to extract the information about peaks corresponding to possible persistent scatterers. In this case, the exploited reconstruction technique plays a key role in terms of achievable tomographic resolution and estimation accuracy: a particular interest is thus for the Capon based algorithms.
A second strategy is strictly framed in the detection theory context and exploits schemes based on the Generalized Likelihood Ratio Test (GLRT). In this case, according to the assumed statistical model, the estimates of the unknown parameters are based on the Least Squares (LS) criterion that, in the case of single scatterers, provides the same solution achievable by selecting the highest peak of the BF reconstruction. Moreover, this strategy also has the advantage of allowing to control the false alarm rate (FAR).
2.3. GLRT Detection
The multi-look GLRT for the binary hypotheses test
is:
where, under
(
),
is the joint pdf of the looks and
is the vector collecting all the unknown parameters. Moreover,
T is the detection threshold, set according to the desired FAR.
As for the detection of single scatterers, the single-look and, more recently, the multi-look GLRT detector for the binary hypothesis test
have been derived in references [
14,
22] respectively.
By assuming the zero-mean model for the joint pdf of the looks, the test in Equation (15) leads to the multi-look GLRT detector [
22]:
where
is the multi-look MLE under
of the parameter vector
p associated with the present PS and
is the sample covariance matrix, defined as in Equation (6).
It is worth noting that the test in Equation (16) is a BF-based detector, since its statistic represents the highest normalized peak (belonging to the interval [0, 1]) of the BF reconstruction in Equation (11). It can be rewritten as
which is the correlation index (according to the Frobenius inner product) between the estimated (sample) covariance matrix
and the estimated, so-called
signature matrix , associated with the ideal response
(signature)
of a PS with parameter vector
[
22]. Equation (18) is also referred to as BF correlation index, because of the relation
that defines the BF filter at the bin
.
It can be easily shown that, under
, the test statistic
in Equation (18) increases with the signal to noise ratio (SNR) of the present PS, defined as
As final remarks, it is worth underlining that the detector in Equation (16) has the CFAR property that reflects in the possibility to use the same detection threshold for processing all pixels with the same (constant) FAR. Moreover, the reduction of the noise effect induced by the multi-look processing translates in a higher detection rate (DR). It has been demonstrated, indeed, that, for a fixed FAR, the DR increases with the number
L of the exploited looks [
22].
As for the double scatterers case, a GLRT-based detector for the ternary hypothesis test
has been proposed for the single-look case in [
15] and subsequently extended to the multi-look case in [
22]. It is a simple and efficient two-stage scheme, referred to as sequential GLRT with cancelation (SGLRTC), that sequentially tests the pairs of hypotheses
and
, where
denotes the complement of
(i.e.,
or
). At the first stage, aimed at testing the presence of two scatterers, the decision rule from Equation (16) is applied to the vector obtained by canceling from the data the contribution of the dominant scatterer. The latter is in turn provided by the highest peak of the BF reconstruction. If hypothesis
is selected, the final decision is demanded to the second stage that select among the null and the single scatterer hypothesis, again according to the rule in Equation (16).
The SGLRTC results to be in practice CFAR. Moreover, from the computational point of view, it is a very efficient detection scheme, since the cancelation step allows just doubling the effort required by the GLRT detector for single scatterers. Unfortunately, this is paid for by a reduction of the (tomographic) resolution capabilities, since the cancellation process does not allow locating scatterers whose distance in the parameter space is below the Rayleigh resolution. Moreover, it has limited capabilities in contrasting the leakage.
To provide super-resolution capabilities, a single-look double-stage GLRT-based detector, referred to as sup-GLRT (since it deals with a signal support estimation problem), has been proposed in [
16]. Similar to the single-look SGLRTC, the sup-GLRT assumes the nonzero-mean model for the pdf of the data. However, differently from the SGLRTC that implements the binary tests starting from the higher hypothesis, the sup-GLRT sequentially tests the pairs of hypotheses
and
(
being the complement of
), thus starting from the lower hypothesis. More specifically, letting
g be the (single) exploited look, the first stage implements the rule
where
is the joint MLE of the parameter vectors associated with the two present PSs under
,
is the projector in the noise subspace, which is the orthogonal complement to the signal subspace spanned by the two estimated directions
and
, and
is the projector in the signal subspace,
being the matrix collecting the two estimated directions.
The second stage acts when the first one selects
. It implements the rule
where
and
are still given by Equation (21), and
>is the MLE of the parameter vector associated with the present PS under
. Moreover,
is the projector in the noise subspace, which is the orthogonal complement to the signal subspace spanned by the estimated direction
.
The sup-GLRT is a CFAR detection scheme, and very effective compared to SGLRTC in terms of resolution capability. Indeed, by exploiting the join estimation procedure from Equation (21), it is also able to detect scatterers whose separation in the parameter space is below the Rayleigh resolution. Moreover, for separation above the Rayleigh resolution, the avoidance of the cancellation step exploited in the SGLTRC allows improving the handling of leakage associated to sidelobes. However, the super-resolution capability is paid for by a high computational effort, which turns out to be combinatorial [
16]. Accordingly, the sup-GLRT is very computationally demanding, especially when the dimensionality of the parameter space increases and its discretization is carried out with a high number of bins [
27].
To overcome the high computational complexity limitation of the sup-GLRT, the so-called (single-look) fast-sup-GLRT is introduced in [
17]. The basic idea is on splitting the joint estimate
of the two parameters vectors, performed as in Equation (21), into two decoupled estimates
and
. To this aim, the estimate
is firstly carried out by Equation (25). Subsequently, the estimate
is performed as
The reduction of the computational effort is paid for by a higher leakage effect on the dominant scatterer when a secondary scatterer is present. Nevertheless, the fast sup-GLRT has been demonstrated to achieve better detection performances with respect to the SGLRTC. This improvement depends on the fact that, differently from the cancelation carried out by the SGLRTC, the estimation procedure in Equation (27) does not constrain the steering vectors associated with and to be orthogonal.
2.4. Capon-Based Detection
The attractive characteristic of the Capon reconstruction related to the mitigation of the leakage effect has encouraged the derivation of multi-look Capon-based detectors which should be able to achieve better performances in presence of multiple PSs interfering in the same pixel. One possibility could be the modification of the GLRT in Equation (16), which is a BF-based detector, in a Capon-based detection scheme. To this aim, the BF correlation index from Equation (18) could be substituted with the Capon correlation index
where
is the Capon filter at the bin
corresponding to the highest peak of the Capon reconstruction. Unfortunately, because of the presence of the inverse sampling covariance matrix in the filter (see Equation (8)), the use of the index in Equation (28) results to be critical in terms of numerical instability related to the finite precision number representation, especially for high values of SNR.
Another possibility is given by the multi-look Capon-based iterative detector addressed in [
23]. It exploits the
L looks
to sequentially test the hypotheses in Equation (12), that is
, where the matrix
accounts for the vector parameters
corresponding to the
m highest peaks of the Capon reconstruction.
The
mth iteration verifies the presence of a further scatterer with respect to the
m−1 already detected in the previous one, by ending the algorithm if the additional scatterer is declared to be absent. To this aim, the following joint condition is tested
where
εm is a properly defined fitting error exploited to limit the FAR,
is the SNR corresponding to the
mth scatterer, and
and
are two fixed (independent from
m) thresholds.
The fitting error is a normalized index accounting for the data contribution within the noise subspace, i.e., the orthogonal complement to the signal subspace
where
is the least squares (LS) estimate of the vector
.
It is worth noting that, by increasing the iteration number
m, which represents the dimensionality of the signal subspace, the fitting error from Equation (32) decreases. Accordingly, it is enough to verify the fitting error condition just for
, whereas for
the detection rule in Equation (30) reduces to
As for the SNR in Equation (31), the power
of the
mth scattarer is evaluated as
where
is the last (
mth) component of
in Equation (33).
Regarding to the noise power
, in [
23] the authors did not provide any information on the adopted estimation strategy. However, assuming the nonzero-mean Gaussian model for the looks (see
Section 2.1), Equation (33) turns out to be the MLE of the backscattering coefficients and, thus, the MLE of the noise level can also be exploited [
15]
Some considerations are now in order.
It can be shown that, for , the fitting error condition is equivalent to the one on the SNR, thus the test turns out to always be equivalent to that in Equation (34) or, in other words, the condition on the fitting error is redundant.
Furthermore, it can be shown that the test in Equation (34) is equivalent to the multi-look GLRT for the pair derived by assuming the looks following the nonzero-mean Gaussian model. However, differently from the GLRT, which exploits the MLEs of all the unknown parameters, the test in Equation (34) makes use of the parameter vectors provided by the highest peaks of the Capon reconstruction. It has been empirically demonstrated, however, that simple peaks location leads to the worst results in terms of resolution even with respect to the method based on signal orthogonal projections. Additionally, the corresponding test statistic is always lower than the one of the “full” GLRT rule (which is maximized by the MLEs).
These considerations make evident the difficulty to derive, in the context of super-resolution multi-look detection, an effective Capon-based detection scheme. On the other hand, multi-look SGLRTC does not provide super-resolution capabilities, differently from the sup-GLRT proposed in article [
16], which however has been designed for the single-look case. Accordingly, in the next subsection, we derive the multi-look version of the sup-GLRT.
2.5. Proposed Multi-Look Sup-GLRT Detection Algorithm
The single-look sup-GLRT detector and its fast implementation proposed in references [
16,
17], respectively, are described in
Section 2.3. Such schemes carry out the estimation of the parameters vector without imposing the orthogonally of the corresponding directions, thus achieving satisfactory performances also when multiple interfering PSs below the (tomographic) Rayleigh resolution are present in the same pixel. Accordingly, to profitably exploit such feature also when the processed datasets are characterized by low SNRs, in this section, we propose the extension of the sup-GLRT detector to the multi-look case. Additionally, for the fast implementation of the multi-look sup-GLRT, a Capon reconstruction is also proposed.
Let us consider a set of
L independent and homogeneous looks, modeled according to the nonzero-mean statistical characterization (see
Section 2.1), which extends the model exploited in [
16] to the multi-look case. By supposing the presence of up to two scatterers, the first stage of the multi-look sup-GLRT implements the decision rule
where
, defined as in Equation (6), represents the sampled correlation matrix of the exploited looks, and
is given by Equation (22). Moreover,
is the joint MLE of the parameters vector under
.
The second stage, instead, operates the decision according to the rule
where
and
are still given by Equation (21), and
is the MLE of the parameter vector under
.
It is worth noting that, assuming the zero-mean statistical model for the exploited looks, Equations (37) and (39) can be shown to be still the decision rules of the multi-look sup-GLRT only if (scatterers with the same power level).
Similar to the single-look counterpart, a fast implementation of the proposed multi-look detector can be obtained by splitting the joint estimate
into two decoupled estimates
and
, the latter performed as
The estimate
is given, instead, by the position corresponding to the highest peak of the multi-look tomographic reconstruction. However, differently from the (single-look) fast implementation proposed in [
17,
27] which exploits the Beam-Forming reconstruction, we prefer to use the Capon reconstruction carried out on the selected looks. Indeed, the capability of the Capon filter to mitigate the leakage effect should guarantee a better estimate of the parameters vector
when multiple interfering PSs are actually present in the same pixel, thus improving also the subsequent estimate in Equation (41).