1. Introduction
Synthetic aperture radar (SAR) shows a strongly growing trend towards remote sensing applications such as rescue, maritime monitoring, resource surveys, and agricultural estimation due to its all-weather conditions, cloud-resistance, and day-and-night imaging capabilities compared with the traditional remote sensing mode. Ship detection in maritime activities is a typical SAR remote sensing application. At present, many high-resolution SAR satellites have been launched into orbit, such as TerraSAR-X, Sentinel-1, Capella, and GaoFen-3. These satellites generate SAR data continuously for various industries. SAR ship images are also produced and added to various ship detection datasets.
Compared with sea clutter, ship targets have higher intensities in SAR images. This means that ship targets are brighter than sea clutter due to the stronger corner reflection of the ship’s metal structure [
1]. However, the azimuth ambiguities, the state of the sea, and the wake of the ship all result in significant interference, which can cause a bright and heterogeneous sea background. In addition, phase errors often lead to a degradation in the focusing quality of SAR imagery [
2], leading to an extended back-projection (EBP) algorithm to compensate for the phase errors being proposed. In [
3], the authors performed parametrization of the minimum cost flow (MCF) algorithm to address the problem of phase unwrapping in a SAR radar. The bright high-intensity outliers significantly affect the accuracy of ship detection. Meanwhile, tiny ships show little discernible texture and shape information in high-resolution SAR imagery. Especially in complex environments, such as multi-target areas, shores, narrow waterways, and breakwaters, the probability of detection degrades sharply. It can be said that the challenge of SAR ship detection still exists.
SAR ships in complex environments from a DSSDD dataset [
4] are shown in
Figure 1, which shows some typical scenes such as different sizes of ships, the azimuth ambiguities, multi-target areas, and breakwaters, as well as inshore ships and different sea states. These complex environmental interferences, with their brightness and shape being close to the ship targets, make it difficult to distinguish between them. At the same time, the sidelobe and azimuth ambiguity even cause the loss of ship features such as shape and texture.
In the past decade, a large amount of research on SAR ship detection has been carried out. Meanwhile, it was revealed that the most widely used methods are the CFAR-based algorithms that are constantly developed in research [
5]. CFAR detectors estimate clutter-truncated thresholds adaptively to segment ship target and sea clutter pixels. Mathematically, sea clutter can be described by a theoretical probability density function (PDF). The adaptive thresholds can be calculated from the statistical parameters of the clutter samples according to the PDF, which sustains a constant probability of false alarm.
CFAR detectors have been practically applied to control the high false alarm rate (FAR) of radar receivers and have shown remarkable effectiveness [
6]. Up until now, many CFAR-based detectors have been proposed. SAR image pixels are detected one by one through a small detection window that is divided into a test region, a guard region, and a background region. To a large extent, CFAR detectors depend on the statistical models of sea clutter to estimate the thresholds. For different sea states, the statistical models are different, such as Log-normal [
7], K [
8], Weibull [
9], G [
10], generalized Gamma [
11], and alpha-stable [
12]. According to a statistical model and a given PFA, some parameters of the sea clutter samples in a sliding window are calculated to estimate the adaptive threshold. Then, the central (or tested) pixel in the sliding window is compared with the estimated threshold.
In the past few years, artificial intelligence technology has developed rapidly and is being applied in many fields such as image interpretation, autonomous driving, and neural machine translation. SAR ship detection is part of image interpretation tasks. For this task, convolutional neural networks (CNNs) demonstrate strong recognition capability [
13,
14,
15,
16,
17]. CNNs learn the features of SAR ships autonomously and point out the probabilities of the suspected ships in SAR imagery. However, similar to the traditional methods, CNN-based ship detection algorithms also face the common problem that the detection probability deteriorates in complex environments. To solve this, with the efforts of many scholars, CNN-based methods have achieved significant ship detection performance on high-resolution SAR images [
18,
19,
20,
21].
In recent years, there has been an obvious trend that more and more research on SAR ship detection has focused on artificial intelligence algorithms. Nevertheless, traditional methods based on threshold segmentation are still used with vigor.
The differences between them are expressed in the following aspects:
Algorithm principles. The threshold estimation of the traditional methods is based on a theoretical clutter statistical model such as Log-normal, Rayleigh, and Otsu. This means that the threshold is adaptive to changes in sea clutter as long as the clutter statistics fit the known model. CNN-based methods learn ship target features autonomously and then fix the weight parameters. However, as the CNN deepens, the learned features become unexplainable gradually.
Detection rules. In the traditional methods, each pixel is determined by comparing it with an adaptive threshold. CNN-based methods determine ship targets according to the confidence levels calculated from the statistical features of SAR ship images such as shape and texture.
Detection results. Binary segmentation images are the output results of traditional methods. For most CNN-based methods, the ships are marked by external boxes.
Parameters. Artificial intelligence algorithms must be trained towards the dataset and save the necessary number of parameters. Traditional methods are the opposite.
The threshold estimation of the traditional methods is mainly affected by the clutter statistical model and the specified PFA. Complex scenes often produce many outliers, which result in large statistical parameters. As a consequence, the threshold is overestimated. A ship pixel tends to be determined as a clutter pixel when comparing it with a large threshold. In addition, an inappropriate PFA may cause a larger threshold deviation due to incorrect statistical parameters.
To solve the probability of detection degradation in complex environments, we proposed a novel SAR ship detector based on clutter intensity statistics (CIS), which is irrelevant to the clutter statistical model and PFA, and estimates the adaptive threshold using clutter intensity information and simple calculations. Firstly, the statistical parameters of outlier-contaminated clutter samples, including mean, standard deviation, and the maximum intensity of the background clutter, are calculated in a sliding window; secondly, the three statistical parameters are used to calculate the adaptive threshold according to the novel intensity statistical model; and finally, the pixel under test is determined by comparing it with the adaptive threshold.
The major contributions of CIS are listed below:
Clutter intensity statistics (CIS) are proposed to detect SAR ships in complex environments. CIS establishes the relationship between the ship target and the outlier, which expands their difference. The influence of outliers is effectively alleviated, especially for complex scenes. Although the CIS detector is irrelevant to traditional clutter statistical distribution models and PFA, it still projects outstanding performance.
The structure of the detection window is no longer a sensitive factor for SAR ship detection. As the max intensity of outliers in a sliding window becomes one term of the adaptive threshold estimation formula, the threshold estimation is less affected by the intensities and quantity of outliers in clutter samples.
Adjustment factor λ is an adjustor that is utilized to adjust thresholds to raise the probability of detection or decrease false alarms. λ is the only global parameter. The optimal λ is determined according to the experimental results on detection performance under the different simulated clutters.
The other parts of this paper are listed as follows.
Section 2 introduces the related work on SAR ship detection. The CIS detection methodology details and detection rule are described in
Section 3. The analysis of experimental results is given in
Section 4, including the analysis of the detecting accuracy, the selection of the optimal adjustment factor, and the structure of the detection window as well as the computational efficiency. The experimental results sufficiently demonstrate the outstanding performance of the CIS detector in complex environments.
Section 5 discusses some key derivations of the CIS model and the influence of the size of the detection window on CIS detection accuracy. Finally, the conclusion is presented in
Section 6.
2. Related Work
In real SAR images, sea clutter is nonideal. It tends to be contaminated by outliers such as speckle noise, sidelobes, and other objects. The outliers cause inaccurate clutter parameter estimation. As a result, the thresholds are overestimated, and then ship targets are removed due to such a large threshold. To solve the interference from outliers, many CFAR detectors are proposed. CA-CFAR [
22] and TP-CFAR [
7] are widely applied due to their simple calculations, but both have large detection losses on homogeneous backgrounds. The OS-CFAR [
23] based on order statistics shows a markable advantage in multi-target detection but is seriously interfered with by outliers and computational inefficiency. GO-CFAR [
24] and SO-CFAR [
25] improve on CA-CFAR and use the maximum and the minimum means of four blocks that make up the background region to estimate the local detection thresholds, respectively. However, GO-CFAR suffers from detection loss, while the false alarm of SO-CFAR has increased. VI-CFAR [
26] shows a good performance by utilizing the special clutter samples of the background window to estimate the detection threshold, but the detection probability degrades in heterogeneous clutter. In [
27], the statistics-truncated CFAR detector is proposed, which raises the detection probability, but the false alarm increases due to the low threshold resulting from the clutter with the removal of high-intensity samples. In [
28], the correlation between adjacent pixels and CFAR detection is combined to preserve target signals. Consequently, the parameters estimated with the truncated sea clutter are smaller than those estimated with the real sea clutter, resulting in a high false alarm rate. TS-LNCFAR [
29] improves [
27] and shows a better detection performance. Its detection results are greatly affected by the truncation depth. In [
30], a feature group based on the invariant area ratio is designed to eliminate the capture effect. However, the heterogeneous sea clutter deteriorates the detection probability. In [
31], an iterative censoring scheme is proposed, which censors the clutter samples iteratively based on an automatically generated target feature map. However, it takes much time to converge. NLM-CFAR [
32] suppresses coherent speckle noise with non-local mean filtering. However, the iterative determination of the proper non-local mean also needs much time. Ray-CFAR [
33] only utilizes the outermost samples of the detection window for parameter estimation, ensuring that the signals do not overflow, but it still cannot fully adapt to complex environments, resulting in ship target pixel loss. This is also a common problem for CFAR detectors. IB-CFAR [
34] uses spatial and intensity information to detect ships. It shows a significant detection performance and robustness. However, IB-CFAR heavily relies on the details of SAR images. For medium- or low-resolution SAR images, its performance is weakened. OR-CFAR [
35] raises the probability of detection in multi-target backgrounds. It removes high-intensity clutter samples first and then uses the remaining samples for parameter estimation and threshold calculation. However, the detection window without a guard region cannot always adapt to complex environments. Further, the probability of detection declines sharply for scenes with few sea clutter samples around ships such as narrow waterways. In [
36], a trimodal discrete (3MD) radar clutter model is proposed based on the idea that sea texture can be statistically modeled as discrete in nature instead of using continuous texture statistics. It achieves robust results and low computational complexity according to computing the texture parameters.
It is known that a CFAR detector requires a clear clutter statistical model and a specified PFA. And the detection results are strongly affected by outliers. Complex environments, such as multi-target areas, shores, and sea states, make the clutter statistical model inconstant [
37]. This situation makes CFAR detectors obtain inconstant detection performance. In order to raise SAR ship detecting probability and reduce the interference from complex environments, a new SAR ship detector based on clutter intensity statistics (CIS) is proposed.
3. Methods
Figure 2 shows the CIS detection flowchart and some detection details. CIS detection contains two main parts: (1) adaptive detection threshold estimation using simple calculations, such as the standard deviation, mean, maximum intensity, and adjustment depth, and (2) the detection rule. After a comparison with the adaptive threshold, the tested pixel is determined as a ship pixel or a clutter. After all the pixels are examined, a binary image containing ship targets only is the output.
TP-CFAR [
7] is described in
Section 3.1 first. On the one hand, the CIS detector is inspired by TP-CFAR. On the other hand, CFAR detection is briefly introduced using TP-CFAR as an example.
Section 3.2 and
Section 3.3 introduce the CIS detection methodology and the detection rule, respectively.
3.1. TP-CFAR Detector
For TP-CFAR, the adaptive threshold is calculated using the mean, standard deviation, and scale factor. Let
I be the clutter samples and
I = {
I1,
I2, ···,
IN−1,
IN}. The mean and the standard deviation are computed by
where
μ is the mean and
σ is the standard deviation.
If the intensity of the tested pixel
I satisfies formula (3), the tested pixel is considered as a ship pixel; otherwise, it is considered a clutter pixel.
where
T is the threshold and
ĸ is the scale factor, which is calculated using the statistical model of the clutter samples
I.
The probability density function of
I is supposed to obey Gaussian in TP-CFAR. When a PFA is specified,
ĸ can be deduced as
where Θ (∙) is the cumulative distribution function of the standard normal distribution.
ĸ is calculated using the inverse function of Θ (∙).
3.2. CIS Detection
For the TP-CFAR detector, the PFA must be given, and the PDF of clutter distribution should be consistent with Gaussian. However, the statistics of the outlier-contaminated clutter suffer from instability. Meanwhile, outliers in sea clutter make it hard for TP-CFAR to obtain constant detection performance against a fixed PFA. An outlier is the non-negligible point that affects detection performance. Hence, the threshold estimation of the CIS detector considers interference from outliers.
In the TP-CFAR algorithm, formula (3) shows the relationship between the mean, the standard deviation of clutter samples, and the ship target. Inspired by it, the relation can be expressed through an approximate signal-to-clutter ratio:
The maximum intensity
ξ of outliers in the sea background window can be expressed as
For the CIS detector, the pixels except for the test (or center) pixel or protection window in a small detection window are collectively referred to as “clutter pixels”, although these pixels may contain ship pixels.
Sea clutter with outliers and ship targets are inhomogeneous, and the intensity values are different. It is known that
ξ is larger than
μ, which is calculated using
I. As mentioned in [
1], the ship’s target intensity is larger than the average intensity of the sea clutter around it. Let
Is be a ship pixel and
Is >
μ. Ideally,
Is >
ξ, and then
Is −
μ can be divided into many equal parts, and each one is 0 < Ω
k ≤ 1 and Ω
1 = … = Ω
n. Expression (5) can be deduced as
where 1 <
k ≤
n and
ĸ’ is the ideal threshold factor. Note that the ideal situation,
Is >
ξ, is discussed, but the final formula also handles non-ideal situations. It is proved in the complex scene experiments of
Section 4. Computing the geometric average of (7), the new inequality is described as
where
v represents
n −
k + 2.
v is hard to determine simply, which can be understood that
v is determined while
μ + (
k − 1) ∙ Ω
k-1 is approximately equal to
ξ. Therefore, (8) is proceeded to deduce until eliminating the uncertain term of Ω
k. By computing the geometric average of the terms in (
Is −
μ)/
σ and Ω
k/σ, (9) can be obtained:
The larger the
n, the smaller the Ω
k. If Ω
k is small enough, it can be satisfied that the ratio of Ω
k to
σ is less than 1.0. In real clutter samples, high-intensity outliers make the sea background inhomogeneous, and their intensity values are higher than the mean of sea clutter. Then, the standard deviation of the outlier-contaminated clutter tends to be far larger than 1.0. Therefore, the situation that the ratio of Ω
k to
σ is less than 1.0 exists. Then, formula (9) can be deduced as
where
λ represents
v/2 and 1 ≤
λ.
Formula (9) is obtained by the calculation of the geometric average twice. The uncertain relationship between (8) and (10) is established. It eliminates the outliers whose intensities are close to
Is, which is discussed in
Section 5.
Formula (10) eliminates the uncertain term of Ω
k to raise the computing efficiency. Formulas (8) and (10) express similar physical meanings: the ratio of signal to clutter and the ratio of high-intensity outlier to clutter, respectively. Thus, by combining (7) and (10), the ideal formula (11) can be obtained:
where
λ is a global parameter determined by Monte Carlo simulation experiments. When (
Is –
μ)/
σ >
ĸ′,
Is is considered a ship pixel. Therefore, the CIS detection threshold
Tc is described as
The adjustment factor λ is an adjustor by which Tc can be changed. Therefore, the fact can be seen that the PFA and clutter statistical model are not prerequisites for the adaptive threshold estimation of the CIS detector. The only thing to consider is the adjustment factor λ. This means that the complexity of the threshold estimation has decreased.
3.3. The Rule of the CIS Detection
The main steps of CIS are elaborated below:
The adjustment factor λ is initialized. The size of the test window is determined according to the sizes of the ships in a SAR image.
The parameters μ, ξ, and σ of clutter samples in the background window are computed according to formulas (1), (2), and (6).
The adaptive threshold Tc is estimated using the CIS model, as shown in formula (12). Tc is used to separate the ship pixels from sea clutter in a local window.
The intensity value of the tested pixel is compared with Tc. The rule of CIS detection is
where
H1 represents the situation that the detected pixels are ship pixels. Meanwhile,
H0 represents the opposite situation.
If I > Tc, the detected pixel is regarded as a ship pixel; otherwise, it is regarded as clutter.
- 5.
If all the pixels are detected, the final detection result is the output. Otherwise, the next pixel is moved it and steps from (2) to (4) are repeated.