Small Target Detection Method Based on Low-Rank Sparse Matrix Factorization for Side-Scan Sonar Images

He, Ju; Chen, Jianfeng; Xu, Hu; Ayub, Muhammad Saad

doi:10.3390/rs15082054

Open AccessArticle

Small Target Detection Method Based on Low-Rank Sparse Matrix Factorization for Side-Scan Sonar Images

by

Ju He

,

Jianfeng Chen

^*

,

Hu Xu

and

Muhammad Saad Ayub

School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2023, 15(8), 2054; https://doi.org/10.3390/rs15082054

Submission received: 16 March 2023 / Revised: 2 April 2023 / Accepted: 11 April 2023 / Published: 13 April 2023

(This article belongs to the Section Ocean Remote Sensing)

Download

Browse Figures

Versions Notes

Abstract

:

Target detection in side-scan sonar images plays a significant role in ocean engineering. However, the target images are usually severely interfered by the complex background and strong environmental noise, which makes it difficult to extract robust features from small targets and makes the target detection task quite challenging. In this paper, a novel small target detection method in sonar images is proposed based on the low-rank sparse matrix factorization. Initially, the side-scan sonar images are preprocessed so as to highlight the individual differences of the target. Then, the problems of target feature extraction and noise removal are characterized as the problem of matrix decomposition. An improved Robust Principal Component Analysis algorithm is used to extract target information, and the fast proximal gradient method is used to optimize the solution. The original sonar image is reconstructed into the low-rank background matrix, the sparse target matrix, and the noise matrix. Eventually, a morphological operation is used to filter out the noise and refine the target edges in the target matrix for improving the accuracy of target detection. Experimental results show that the proposed method not only achieves better detection performance in comparison to the conventional baseline algorithms but also performs robustly in various signal-to-clutter ratio conditions.

Keywords:

matrix factorization-based detector; small target detection; side-scan sonar; sonar image

Graphical Abstract

1. Introduction

With the development of side-scan sonar (SSS) image systems and image processing technology, underwater target detection has gained increasing importance in oceanographic research [1,2,3,4]. Due to the long detection range and strong penetrability of underwater SSS images, they have been widely used in fields such as marine surveying and mapping, suspicious target detection, and marine resources exploration [5,6,7,8]. However, there are two major challenges in the development of a detection algorithm based on underwater SSS images.

Firstly, the complexity and variability of underwater scenes and the influence of noise have brought great difficulties to the automatic interpretation of SSS images. Due to the transmission and scattering characteristics of acoustic waves, the collected SSS images (low frequency) often have low spatial resolution and image quality problems including serious noise and reverberation [9,10,11]. Small targets are usually buried in complex background clutter with a low signal-to-clutter ratio (SCR), which increases the difficulty of automatic target detection. Furthermore, SSS images weaken the essential characteristics of the target, such as texture and contour. This exacerbates the difficulty of improving the robustness of the algorithms.

Secondly, it is hard to build a universal training dataset for the SSS image. Although the side-scan sonar image data are quite rich, there are currently few public side-scan sonar image datasets since the labeling images require experts with extensive domain knowledge which raises labeling costs. To reduce the difficulty of data annotation, many recent papers address the sample problem through deep learning methods with automatic sample augmentation [12]. However, different seabed environments and observation conditions lead to large differences in the distribution of the target image features. Different seabed backgrounds, different navigation routes, different equipment, viewpoints, and different parameters usually lead to variant shapes of the same target in the image. Detection using data-driven supervised learning algorithms faces the problem of detection robustness in different environments. Figure 1 shows the side-scan sonar images and their 3D visualization results, in which the columnar target is the red rectangular box.

With the development of artificial intelligence, there have been a large number of sonar image processing algorithms based on deep learning [13,14,15], such as real-time underwater maritime object detection algorithm in side-scan sonar images based on transformer-Yolov5 [13], seagrass classification algorithm [16], segmentation algorithm based on pulse coupled neural network [17], and underwater obstacle detection method based on Yolov3 [10]. These kinds of algorithms have high accuracy, especially in harsh environments. However, the detection approaches based on deep learning require a large amount of training data to obtain excellent detection results, meaning a long training time, powerful computational resources, and expensive manual labeling costs. The traditional methods do not need a lot of labeled training data and hardware configuration, which still have high research value in sonar target detection.

Some conventional target detection methods for sonar images can usually achieve good performance in applications by focusing on feature extraction in the pixel domain, such as TOPHAT filtering [18,19], Accumulated Cell Average Constant False Alarm Rate [8] and so on. However, they are sensitive to environmental noise and do not perform robustly when the target size varies over a large range. Similarly, maximizing detection probability in low SCR images without training data remains a problem.

To combat the aforementioned challenges, this paper introduces low-rank sparse matrix factorization in the sonar target detection technology. We proposed an end-to-end sonar small target detection algorithm robust to high background noise, which can directly detect the foreground target without the need to perform image filtering. The innovations of the proposed algorithm are as follows:

(1): A small target detection algorithm for side-scan sonar based on low-rank sparse matrix factorization is proposed. The proposed framework has good performance in the removal of environmental noise with different variances and the accuracy of the small target detection;
(2): An improved Robust Principal Component Analysis (RPCA) algorithm is used to extract target information and the fast proximal gradient method is initially employed to optimize the solution in sonar target detection. We explicitly consider the noise information based on the RPCA algorithm, and estimate the low-rank matrix, sparse matrix, and noise matrix at the same time in matrix factorization, and do not need training data;
(3): Extensive experimental evaluation on real-world SSS image datasets with different noise backgrounds is conducted to show that the algorithm is not only more robust to different target types but also has better detection performance compared with other traditional detection algorithms.

2. Related Work

The underwater target detection algorithm of a sonar image mainly uses spatial domain information and frequency domain information to capture the difference between the target and the background in several appearances, such as contour, gray distribution state, and local contrast, to separate the target from the background. The mainstream sonar target detection algorithms can be grouped into two categories: the pixel domain and the feature domain. Among them, the algorithms based on pixel domain include the frame difference method, the optical flow method, the morphological filtering, and the constant false alarm rate. Feature-based algorithms can be divided into detection algorithms based on local feature representation, principal component analysis, and deep learning theory.

2.1. Target Detection Based on Pixel Domain

The underwater sonar target detection algorithms based on the pixel domain realize detection through the operation between pixels. In pixel operation, the frame difference method and the optical flow method [20,21] are suitable for target detection. Morphological filtering [22,23] is also a classic target detection algorithm, and Rahnemoonfar et al. proposed a framework to enhance the sonar image based on top-hat transformation and utilized morphological operators to identify seagrass blowouts [16]. The constant False Alarm Rate (CFAR) algorithm [24,25,26] is often used in underwater sonar target detection, which utilizes a set detection threshold to determine the object distances from different pixel cells. In order to improve accuracy, researchers have made some improvements to CFAR, such as the Cell Average Constant False Alarm Rate (CA_CFAR) [27], Ordered Statistical Constant False Alarm Rate (OS_CFAR) [28], and the Accumulated Cell Average Constant False Alarm Rate (ACA_CFAR) [8] algorithm. The underlying principle of these algorithms is relatively simple and does not need to extract the deep features of the target. However, inhomogeneous intensity and complex seabed textures such as sand ripples in sonar imagery cause severe degradation in the performance of existing approaches.

2.2. Target Detection Based on Feature Domain

The underwater sonar target detection algorithms based on feature domain realize detectors by extracting object features. These algorithms extract color, contour, shape, and other information about the targets, and then classify the targets. There are some classical object detection methods such as Markov random field [29], and sonar image segmentation based on level set [30]. In recent years, a large number of deep learning detection algorithms have emerged. The algorithms have excellent performance in large image processing and are widely used in the field of object detection because of their powerful feature extraction abilities. Zhou proposed an image segmentation method based on a simplified version of the pulse coupled neural network [17]. Palomeras designed a convolutional neural network to detect mine-like objects in forward-looking sonar data [31]. Cao applied a one-stage detection network named YOLOv3 to determine obstacle candidate areas in a sonar image [10]. However, most neural network algorithms require training with a large number of sonar images. It should be noted that not only is it difficult to obtain learning samples, but training a neural network requires a long time, which causes the application bottleneck of the algorithm.

2.3. Traditional Detection for Small Targets

Considering the above problems, we investigate traditional small object detection algorithms that do not require training data. Small target detection algorithms based on local feature representation and principal component analysis achieve robust performance. Several algorithms based on local feature representation are proposed including local contrast method (LCM) [32], improved local contrast method (ILCM) [33], and accelerated multiscale weighted local contrast measure (AMWLCM) [34]. Some algorithms separate sparse components from low-rank components based on principal component analysis and then search for the target in the sparse components. Gao et al. [35] proposed an IPI model for weak and small target detection through image matrix construction and RPCA application to reconstruct the target and background matrices.

Although the IPI method has obtained great success in small target detection, we note that there are still two limitations that prevent it from achieving better performance in sonar image detection. One limitation is that the algorithm does not consider the noise matrix, as it will reduce target signal strength and remain clutter background edges in the detection result. The other limitation is that the local patch image is first constructed from the original image, which will be time-consuming.

Aiming to eliminate the aforementioned limitations, this paper explores the prior of spatial correlation between the target and the background in a SSS image, designs a low-rank sparse matrix factorization model to detect the small target, and estimates the target and noise matrix simultaneously, which can effectively reduce the false alarm rate in SSS images with heavy environment noise and reverberation, thus improving the precision of small target detection and performing robustly in various SCR conditions.

3. The Proposed Method

Considering the sparsity of the small target in the SSS images and the different imaging mechanisms, we design a matrix factorization method for small target detection in the side-scan sonar image. We aim at the problems of low SCR and insufficient target appearance features in SSS images where our proposed matrix factorization method decomposes the origin SSS image into the target matrix, the background matrix, and the noise background.

In the non-target region of the SSS image, there are background obstructions and noise disturbances. While the background obstruction is continuous and generated by the reflection of the sea floor (reverberant noise), the independent Gaussian noise occurs due to the sonar equipment and electronic noise [36]. We uniformly divide the non-Gaussian part into the background. As the small underwater target represents a small proportion of the entire scanned image, we choose the sparse matrix to construct the object model. Besides considering that the SSS image background has a certain correlation between the intensities of neighboring pixels, the image background is modeled as a low-rank matrix. Therefore, the SSS image observation model using the sparse, low-rank, and Gaussian noise is reconstructed. In this paper, the noise parameters are estimated iteratively, so that the low-rank matrix can be recovered accurately from noisy observations.

We will introduce in detail the proposed method for detecting small underwater targets in sonar image sequences. Figure 2 shows the complete framework of the method of small target detection proposed in this paper. Initially, the sonar image D is constructed from the original SSS image

D_{i}

using image preprocessing. Then the improved convex optimization algorithm is applied to image D to simultaneously estimate the low-rank background image B, the sparse target T, and the noise image N. Following that, a series of morphological operations are carried out on the target image T. With the assistance of the components given above, our algorithm can be divided into the following parts.

3.1. Image Preprocessing

From the perspective of Principal Component Analysis (PCA), the mean is subtracted to regulate the features of the data. For a SSS image, we are not interested in the illuminance of background and noise but focus on salient regions such as the targets. Equation (1) can remove the average brightness value of the image and highlight the significant target, which is more conducive to subsequent detection. Figure 3 shows the result of subtracting the mean from the input image

D_{i}

, and the image preprocessing is computed as:

D = m a x \{D_{i} - m e a n (D_{i}), 0\},

(1)

where

D_{i}

is the input SSS image and

m e a n

denotes to calculate the global gray mean of the input image. After subtracting the mean, the negative values were replaced with zero to remove some noise. The first row of Figure 3 shows a representative SSS image and the preprocessed image, and the second row shows the corresponding histograms. We can see from the figures that the preprocessed image can remove a significant part of the noise.

3.2. Low-Rank Matrix and Sparse Matrix

Due to the reason mentioned above, the complex SSS image background satisfies the low-rank characteristics and the small target meets the sparse feature, which is the precondition of our matrix factorization model. First, we take the gray SSS image for a two-dimensional matrix and further analyze the feature of the complex background images. The singular values of the image matrix can present the correlation in different rows, therefore, we select the low-rank matrix to model the background matrix, which contains constant seafloor reflection. As shown in Figure 4, we analyze the feature of the complex background in the preprocessed SSS image, and the first row of Figure 4 shows three representative sonar background images with the same size of 300 × 300 pixels. The second row of Figure 4 shows the singular values of the corresponding preprocessed images; it can be seen that although the original background images of the three images are extremely different, their singular values decrease rapidly to zero. Based on the above discussion, we can consider the background image B as a low-rank matrix given by:

r a n k (B) \leq r,

(2)

where r is a constant,

r < < m, n

, the size of a input image is m × n pixels.

Subsequently, according to the few pixels occupation of the small target in the SSS image, we can ignore the appearances of the objects and regard the target matrix as a sparse matrix T. Figure 5 shows the average area proportion of the target in the whole image from the three sequences. The proportion of small targets in SSS images occupies very few pixels compared to the total images, so the sparse target matrix T satisfies the following conditions:

\frac{{∥T∥}_{1}}{{∥D∥}_{1}} < k,

(3)

where

{∥T∥}_{1}

represents

ℓ_{1} - norm

of the target matrix T,

{∥D∥}_{1}

represents

ℓ_{1} - norm

of the target matrix D, and k is a constant much less than 0.01, which means that the matrix T satisfies this sparseness property.

The characteristic of a non-local low-rank exists universally in natural images, which propels many preeminent non-local methods in various fields, such as a non-local low-rank technique for the hyperspectral image (HSI) denoising [37,38,39], compressed HSI reconstruction [40], inpainting [41,42], a non-local low-rank model for infrared small target detection [43], and so on.

It is noteworthy that we can adopt a more general low-rank assumption that all background images come from a mixture of low-rank subspace clusters. The inherent spatial correlation between the pixels of an image indicates that the background is expressed continuously, and the pixels are highly correlated [19]. It can be seen intuitively in Figure 4 that the background has non-local self-correlation property. In such scenarios, the sonar background image can be constructed as one low-rank subspace. Hence we only exploit one low-rank subspace assumption in the matrix decomposition to make it a more simple expression.

3.3. Optimization Method

Based on the property of the side-scan sonar image, the convex optimization problem is to simultaneously separate background, target, and noise images from sonar images. The total optimization problem is translated into three sub-problems: (1) the low-rank background matrix estimation; (2) the sparse target matrix estimation; (3) the noise matrix estimation. To solve the problems efficiently, the soft thresholding operator [44] for target estimation, the singular value thresholding operator [45] for background estimation, and the gradient descent method for noise estimation are utilized in our optimization method. The singular value thresholding operator is the proximity operator associated with the nuclear norm. Details about the proximity operator can be found in Ref. [46].

In this paper, it is assumed that random noise satisfies the independent and identically distributed condition, D is the SSS preprocessed image, T is the target matrix, B is the background matrix, and N is the noise matrix. Then the problem can be described as an optimization problem:

min {∥B∥}_{*} + λ_{1} {∥T∥}_{1} + λ_{2} {∥N∥}_{F}^{2}, s . t . D = B + T + N,

(4)

where

{∥B∥}_{*}

represents the nuclear norm of the matrix B, which is the sum of its singular values.

{∥T∥}_{1}

denotes the of

ℓ_{1} - norm

of the matrix T, and

{∥N∥}_{F}

denotes the Frobenius norm of the matrix N.

The above convex optimization problem is to simultaneously separate background, target, and noise images from SSS images. In SSS images with low SCR, background estimation and noise estimation are significant in many pieces of research, which can be used to evaluate the reliability of the detection method as well as the reverberation estimation in sonar image systems. To solve the background and noise estimation problem, Equation (4) can be further evolved into a dual problem:

min {∥B∥}_{*} + λ_{1} {∥T∥}_{1} + λ_{2} {∥N∥}_{F}^{2} + \frac{1}{2 μ} {∥D - B - T - N∥}_{F}^{2},

(5)

where

μ

is a given positive parameter, Formula (5) remains an optimization problem and we could solve it based on the accelerated proximal gradient (APG) method because it has a good balance between efficiency and accuracy in solving related RPCA methods [47,48,49,50]. The constraints of Formula (5) are convex relaxed into the objective function, so the function is defined as:

\begin{matrix} min F (B, T, N, μ) = & μ ({∥B∥}^{*} + λ_{1} {∥T∥}_{1} + λ_{2} {∥N∥}_{F}^{2}) + \frac{1}{2} {∥D - B - T - N∥}_{F}^{2} . \end{matrix}

(6)

This is necessary with an appropriate choice of the regularizing parameter

λ_{1} > 0

and

λ_{2} > 0

. Throughout this paper, unless otherwise stated, we will fix

λ_{1} = λ_{2} = \frac{1}{\sqrt{max (m, n)}}

. The size of an input image is m×n pixels. For simplicity, we hypothesize

g (B, T, N, μ) = μ ({∥B∥}^{*} + λ_{1} {∥T∥}_{1} + λ_{2} {∥N∥}_{F}^{2})

and

f (B, T, N) = \frac{1}{2} {∥D - B - T - N∥}_{F}^{2}

, therefore, Equation (6) can be rewritten as:

min F (B, T, N, μ) = g (B, T, N, μ) + f (B, T, N) .

(7)

Instead of directly optimizing the formula, the quadratic model is used to approximate Equation (7).

f (Y_{}^{B}, Y_{}^{T}, Y_{}^{N})

has a known value and is closer to

f (B, T, N)

. The partial derivatives of

f (B, T, N)

with respect to B, T, and N are solved alternately. It means that only one variable is solved at a time and the other variables are set to constants. Therefore, the optimization problem is transformed into three sub-problems until convergence. For the exact global optimal solution, Ref. [51] proves that the alternatively stepwise result is the exact global optimal solution.

(1): Estimate the low-rank matrix B.

$B_{k + 1} = \underset{}{arg min_{B}} μ^{k} {∥B∥}_{*} + \frac{L_{f}}{2} {∥B - G_{k}^{B}∥}_{F}^{2},$

(8)

where $G_{k}^{B} = Y_{k}^{B} + \frac{1}{L_{f}} (D - Y_{k}^{B} - Y_{k}^{T} - Y_{k}^{N})$ and $L_{f}$ is a constant. This is a nuclear norm minimization problem that can be obtained by solving the singular values of the observed data matrix Y using a soft-thresholding operation. Assuming the singular value decomposition of $G_{k}^{B}$ where $G_{k}^{B} = U S V^{T}$ , Equation (8) can be solved as:

$B_{k + 1} = U S_{\frac{μ_{k}}{2}} [S] V^{T},$

(9)

where $V^{T}$ is the transpose of the matrix V,

$S_{\frac{μ_{k}}{2}} [x] = \{\begin{matrix} s i g n (x) (|x| - \frac{μ_{k}}{2}) & i f |x| > \frac{μ_{k}}{2} \\ 0 & o t h e r w i s e \end{matrix},$

(10)
(2): Estimate the sparse matrix T.

$T_{k + 1} = arg min_{T} λ_{1} μ^{k} {∥T∥}_{1} + \frac{L_{f}}{2} {∥T - G_{k}^{T}∥}_{F}^{2},$

(11)

$S_{\frac{λ μ_{k}}{2}} [x] = \{\begin{matrix} s i g n (x) (|x| - \frac{λ μ_{k}}{2}) & i f |x| > \frac{λ μ_{k}}{2} \\ 0 & o t h e r w i s e \end{matrix},$

(12)
(3): Estimate the noise matrix N.

$N_{k + 1} = arg min_{N} λ_{2} μ^{k} {∥N∥}_{F}^{2} + \frac{L_{f}}{2} {∥N - G_{k}^{N}∥}_{F}^{2},$

(13)

where $G_{k}^{N} = Y_{k}^{N} + \frac{1}{L_{f}} (D - Y_{k}^{B} - Y_{k}^{T} - Y_{k}^{N}) .$ The above equation can be solved using the gradient descent method, taking the derivative of Equation (14) by N and making the result equal to 0; the final representation is then obtained:

$L_{f} (N - G_{N}^{k}) + 2 λ μ_{k} N = 0 \Rightarrow N_{k + 1} = \frac{L_{f}}{2 λ μ_{k} + L_{f}} G_{k}^{N} .$

(14)

The pseudocode of low-rank and sparse and noise matrix decomposition is shown in Algorithm 1. It can be seen that this algorithm takes the sonar image matrix

D_{i}

for input, initializes iteration variables, and searches for the optimal solution according to the enhanced RPCA algorithm. The output of this algorithm is the background matrix B, the target matrix T, and the noise matrix N.

Algorithm 1 Low-Rank and Sparsity and Noise Matrix Decomposition

Input:: Sonar image matrix $D_{i} \in R^{m \times n}$
Output:: $B = B_{k}, T = T_{k}, N = N_{k}$
1:: $D = m a x \{D_{i} - m e a n (D_{i}), 0\}$
2:: $B_{0} = B_{- 1} = 0, T_{0} = T_{- 1} = 0, t_{0} = t_{- 1} = 1$
3:: while not converged do
4:: $Y_{k}^{B} = B_{k} + \frac{t_{k - 1} - 1}{t_{k}} (B_{k} - B_{k - 1})$ , $Y_{k}^{T} = T_{k} + \frac{t_{k - 1} - 1}{t_{k}} (T_{k} - T_{k - 1})$ , $Y_{k}^{N} = N_{k} + \frac{t_{k - 1} - 1}{t_{k}} (N_{k} - N_{k - 1})$
5:: $G_{k}^{B} = Y_{k}^{B} + \frac{1}{L_{f}} (D - Y_{k}^{B} - Y_{k}^{T} - Y_{k}^{N})$
6:: $(U, S, V) = svd (G_{k}^{B}), B_{k + 1} = U S_{\frac{μ_{k}}{2}} [S] V^{T}$
7:: $G_{k}^{T} = Y_{k}^{T} + \frac{1}{L_{f}} (D - Y_{k}^{B} - Y_{k}^{T} - Y_{k}^{N})$
8:: $G_{k}^{N} = Y_{k}^{N} + \frac{1}{L_{f}} (D - Y_{k}^{B} - Y_{k}^{T} - Y_{k}^{N})$
9:: Estimate the low-rank matrix B: $B_{k + 1} = U S_{\frac{μ_{k}}{2}} [S] V^{T}$
10:: Estimate the sparse matrix T: $T_{k + 1} = S_{\frac{λ μ_{k}}{2}} [G_{k}^{T}]$
11:: Estimate the noise matrix N: $N_{k + 1} = \frac{L_{f}}{2 λ μ_{k} + L_{f}} G_{k}^{N}$
12:: $t_{k + 1} = \frac{1 + \sqrt{4 {t_{k}}^{2} + 1}}{2}$ , $μ_{k + 1} = max (η μ_{k + 1}, \bar{μ})$
13:: $k = k + 1$
14:: end while
15:: return $B_{k}, T_{k}, N_{k}$

3.4. Morphological Operation

In estimating the optimal solution of the target matrix and the noise matrix at the same time, few losses incur in the object outline features of estimating the target matrix. Therefore, we use the greatest similarity matching to restore the target after determining its approximate position from the optimal target matrix. By visualizing the partial sparse matrix (the target image), it is discovered that the target contains some isolated noise or holes that necessitate mathematical morphology operations such as cleaning and filling. We employ morphological approaches to solve it to increase the precision of target identification since mathematical morphology has high filtering qualities and can sharpen the edge of the target.

After getting the target matrix T, the image erosion operation is conducted on the target matrix to generate the eroded image

T_{1}

, which is defined as follows:

T_{1} = T ⊖ o_{1} = {z | {(o_{1})}_{z} \subseteq T},

(15)

where

o_{1}

is a structuring element created as a flat disk with a radius of 1 in this paper.

The image morphological dilation technique is conducted on

T_{1}

to obtain the dilated target image

T_{2}

. Dilation is the dual operation of erosion while the operation “dilation” is defined as follows:

T_{2} = T_{1} \oplus o_{2} = {z | {({\hat{o}}_{2})}_{z} \cap T_{1}] \neq \emptyset},

(16)

where

o_{2}

is a structuring element created as a rectangular matrix with size 3 × 3 in experiments.

T_{2}

is the final target result. The experiment found that the morphological operation is necessary in order to get a clear and complete target.

4. Experimental Results and Discussion

Real data experiments were undertaken to demonstrate the effectiveness of the proposed method for sonar object detection. We selected seven different detection methods for comparison, i.e., IPI [35], LCM [32], AMWLCM [34], ILCM [33], TOPHAT [18,19], MAXMEAN [52,53], and ACA_CFAR [8]. For all the experiments, the parameter selection was consistent with the description from the original papers.

4.1. Real Sonar Datasets Description

We conducted real-world experiments to obtain actual underwater SSS images for the validation of the performance of our proposed method. In the real-world datasets, three sequences of sonar images were adopted. Side-scan sonar is mounted on an underwater unmanned platform, and the experimental scheme with only one side of the side-scan sonar transmit beam is shown in Figure 6.

The beam emission direction is perpendicular to the carrier navigation direction, and the experiment collects SSS images under various conditions according to the experimental conditions. The working underwater depth of the Autonomous Underwater Vehicle (AUV) was changed for the sonar scanner system. The tilt angle of scanning was adjusted constantly and the target images were collected under different beam angles. At the same depth, the heading angles are 0°, 30°, 60°, and 90°, respectively. In addition, we set an autonomous mode for the AUV’s running mode, and the initial direction of navigation is different, hence the motion direction of the side-scan sonar was changed to scan the target from multiple angles. Figure 7 shows the appearance of our AUV platform. The collected targets were randomly distributed in real field lakes with a maximum depth of more than 100 m, and the maximum range of our sonar system was set to 80 m (up to 100 m) for a 0.04 m range resolution.

Table 1 lists the specifications of the sonar. Next, we captured images by changing the tilt angle and altitude of the SSS. We conducted 11 experiments scanning the underwater objects and took 300 frames of SSS images. Three groups of images with different signal-to-clutter ratios were collected, 100 images were collected in each group, and the targets contain cylinders, spheres, and a model of the human body. The targets are arranged in different areas. Some obtained images are shown in Figure 8, which shows examples of images obtained in different conditions.

4.2. Evaluation

To get an accurate evaluation of the proposed method, the criteria of the receiver operating characteristic (ROC) curve are adopted, and the probability of detection

P_{d}

and false alarm rate

F_{a}

are defined as:

P_{d} = \frac{N}{M}, F_{a} = \frac{F}{S},

(17)

in the

P_{d}

calculation, N is the total number of true targets detected, and M is the number of actual targets. In the

F_{a}

calculation, F is the total number of false detection, and S is the total number of images. When the detection box overlaps with the ground truth box and the center point distance of the two boxes is lower than the set threshold with 4 pixels [53], we regard that the detection object is correctly detected. For quantitative analysis, both experiments have fixed false-alarm rates

F_{a}

by changing the binarization thresholds for each group.

4.3. Comparison with Other Methods

In this section, we first introduce the evaluation metrics and the data for comparison. Then, we use real sonar image sequences to demonstrate the effectiveness and practicality of the proposed method. The experiments were conducted on a computer with 16-GB memory, Intel Core CPU i7-11370H, 3.3 GHz processor, and the code was implemented in MATLAB.

In order to visually show the effectiveness of the algorithm in improving image SCR and suppressing the background, simulation tests are carried out under three typical sequences with different SCR of sequence 1, sequence 2, and sequence 3 with 100 images in each sequence. The specific image properties and introduction are shown in Table 2.

We conduct testing experiments and quantitative analysis on actual datasets, appropriately comparing them with classical and state-of-the-art methods: IPI, LCM, AMWLCM, ILCM, TOPHAT, MAXMEAN, and ACA_CFAR. It should be highlighted that IPI designs a sliding window to traverse the original image, constructs a block image matrix after vectorization of the block image, and applies RPCA to reconstruct the component matrix of the target and background from this matrix. LCM is based on local feature representation and introduces multi-scale theory. AMWLCM and ILCM are improved algorithms based on LCM. MAXMEAN is designed to filter horizontally and vertically, calculating the mean and median values in each direction. Top-hat calculates the difference between the original image and its morphology to eliminate the background and extract the regional edge and target information. ACA_CFAR has better detection performance in a uniform environment by adaptively selecting reference units. The results of the proposed and comparison algorithms are demonstrated in Figure 9, which clarifies that the proposed algorithm exhibits better performance visually for almost all sequences, especially when the noise interference is serious. The experimental parameters

L_{f} = 2

,

λ = 1 / \sqrt{max (m, n)}

,

η = 0.9

, and the radius of the morphological structure is set to 1. The results of the evaluation metrics obtained by the proposed method and the comparison algorithms are listed in Table 3.

From Table 3, we can see that the proposed method has better performance with a low false alarm rate and is more robust to different SCR conditions compared with other baseline methods. Table 3 also illustrates that all detection methods have superior detection capability for high SCR sonar images than low SCR ones. Despite this, when the background noise is low in sequence 1, some baseline methods such as IPI and LCM also achieve a high probability of detection, but the proposed method gets the highest detection rate of 0.920 with the lowest false alarm rate of 1.0. When the detector is accurate and the false alarm rate is near 0, our method can still maintain a detection rate of 0.8. Besides, Table 3 also illustrates the performance comparison of baseline methods in a quite challenging environment (sequence 3) where the background noise is loud and includes seafloor reverberation, and the detection rates of baseline method TOPHAT, MAXMEAN, and AMWLCM are all less than 0.5 with the lowest false alarm rate of 0.5, which can be deemed as detection failure. Additionally, our algorithm’s detection rate is 0.732 with the same false alarm rate, which is higher than the other comparative methods. The ROC curves for our method in Figure 9 have a bigger region contained by the horizontal axis and a better capacity to detect. The suggested algorithm achieves a high detection rate with a low false alarm rate.

The 3D visualization result of our algorithm is shown in Figure 10, where Line 1 is the original sonar image, and the red box in the image is the ground truth of the target. Line 2 is the 3D visualization result of the original image, and Line 3 is the detection result of the proposed algorithm.

In order to evaluate the visual performance of the proposed method, we conduct qualitative analysis. Figure 11 shows the sonar images of three cylindrical targets and the resultant visualization of the eight detection methods. Among them, the first line is the original SSS image samples of the three sequences and the red box in the image is the target ground truth. The detection result comparison in lines 2–9 is OURS, IPI, LCM, AMWLCM, ILCM, TOPHAT, MAXMEAN, and ACA_CFAR.

It can be seen in the visualization results of all algorithms (Figure 11) that most baseline algorithms can detect the object in severe reverberation sonar images, but there are some problems in the high false detection rate of the background noises and poor integrity detection of the target area. Due to weak detection performance, some baseline methods (TOPHAT, MAXMEAN, and AMWLCM) have a high false alarm rate and misclassify a large number of noisy backgrounds as targets. While the ILCM algorithm based on local contrast can enhance the real target, it has a weak ability to suppress the background and cannot effectively distinguish the real target and bright spot noise in the background. In addition, the ACA_CFAR method can detect the objects well in weak noise, with the enhancement of background noise in sequence 3. The algorithm is affected by speckles, which leads to the degradation of detection performance. Besides, as a general rule for the appearance of the detected target, the more accurate model cannot accurately segment foreground and background in low SCR scenes. The baseline methods (LCM, AMWLCM) can roughly detect the location of the target, but the completeness of the target is poor. Compared with other baseline algorithms, the proposed algorithm eliminates a large number of clutter and false alarms, and is robust to small object detection in sonar images with different SCR.

4.4. Ablation Experiments

In order to better understand the performance of the algorithm, we analyze the effect of each of the steps. First, to evaluate the effectiveness of the decomposition for the noise matrix (improved RPCA) in matrix factorization, we compared the algorithm to detect the foreground without considering the decomposition for the noise matrix (RPCA) and considering noise matrix (OURS), and tested it on images with different SCR. Second, the performance of the algorithm with image preprocessing and improved RPCA was analyzed. Finally, the performance of the algorithm containing image preprocessing, improved RPCA, and morphological operation was analyzed. The resultant ROC curve obtained can be seen in Figure 12.

In Figure 12, when the background noise is small (sequence 1), a higher detection rate can be achieved with RPCA. However, when the background noise is severe and has bottom reverberation (sequence 2), and the algorithm only does preprocess operation and uses the original RPCA, the result will be affected by noise. In this circumstance, the proposed algorithm has the lowest misinformation rate. When the false alarm rate is 0.5 or tends to 0, the proposed algorithm carries out image preprocessing and improved RPCA at the same time, and the detection rate is above 0.8. In sequence 3, the SCR is low, so when the noise matrix is not considered, the detector fails because the detection rate is below 0.5. The detection rate of the proposed algorithm reaches 0.732 when the false alarm rate is 0.5, which is better than the RPCA method.

For a more intuitive comparison of ablation experiments, the detection rate of each sequence image is shown in Figure 13 where the false alarm rate

F_{a}

is 0.5, 1, and 1.8, respectively. The results of evaluation metrics obtained by the proposed and comparative algorithms are tabulated in Table 4.

As shown in Figure 13, among the three detection sequences with different SCR, for the same false-alarm rate, the method with image preprocessing and improved RPCA performs the best. To compare the effects of the algorithms more intuitively, the following figure shows the visual results of the detection algorithm. Among them, the first line is the original side-scan sonar image, the other lines are the detection results of the ablation experiment, and the red box in the images is the ground truth result.

From the visualization results of the ablation experiment shown in Figure 14, the performance of the algorithm is poor when using the RPCA method, and the detection result is good when using the improved RPCA method without image preprocessing, however, some noisy background may not be completely filtered out. When the image preprocessing, improved RPCA method, and morphological operations are carried out at the same time, the algorithm eliminates a large number of clutter and false alarms, and is robust to small object detection in sonar images with different SCR.

Finally, we compare the average computational cost of the eight detection methods (see Table 5). From the experimental results, we can get the conclusion that TOPHAT method and MAXMEAN algorithm can achieve a fast detection speed, ACA_CFAR requires more sorting operations, and our algorithm can achieve a faster detection speed than ACA_CFAR and AMWLCM.

5. Conclusions

In this paper, an effective sonar target detection algorithm employing matrix decomposition was presented. The key idea of the proposed method is to use the improved Robust Principal Component Analysis algorithm to extract target information, and use the fast proximal gradient method to optimize the solution. The problems of sonar image feature extraction and noise removal are characterized as matrix decomposition. The experimental results showed that our method improves the probability of detection and significantly outperforms the conventional methods including TOPHAT, MAXMEAN, and ACA_CFAR. The proposed method does not need to rely on big data training and performs robustly in strong noise. Consequently, we can conclude that our method is suitable for SSS small target detection.

However, there are some limitations to the proposed method, the detected target must satisfy the sparse property, and some strong clutter signals or abnormal signals may have sparse characteristics similar to the target and cause false alarm in some cases. In the future, our research will focus on the robustness of detection in the presence of noise.

Author Contributions

Conceptualization, J.H. and J.C.; methodology, J.H. and J.C.; validation, J.H. and H.X.; writing—original draft preparation, J.H.; writing—review and editing, J.H. and M.S.A.; visualization, J.H.; supervision, J.C.; project administration, J.C.; funding acquisition, J.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China grant number 62071383.

Data Availability Statement

The data presented in this study are available on request from the author (J.H.).

Acknowledgments

The authors acknowledge editors and reviewers for comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

Lurton, X. An Introduction to Underwater Acoustics: Principles and Applications; Springer: Berlin/Heidelberg, Germany, 2002; Volume 2. [Google Scholar]
Sledge, I.J.; Emigh, M.S.; King, J.L.; Woods, D.L.; Cobb, J.T.; Príncipe, J.C. Target Detection and Segmentation in Circular-Scan Synthetic Aperture Sonar Images Using Semisupervised Convolutional Encoder–Decoders. IEEE J. Ocean. Eng. 2022, 47, 1099–1128. [Google Scholar] [CrossRef]
Song, Z.; Bian, H.; Zielinski, A. Application of acoustic image processing in underwater terrain aided navigation. Ocean Eng. 2016, 121, 279–290. [Google Scholar] [CrossRef]
Grządziel, A. Results from Developments in the Use of a Scanning Sonar to Support Diving Operations from a Rescue Ship. Remote Sens. 2020, 12, 693. [Google Scholar] [CrossRef] [Green Version]
Wang, V.T.; Hayes, M.P. Synthetic Aperture Sonar Track Registration Using SIFT Image Correspondences. IEEE J. Ocean. Eng. 2017, 42, 901–913. [Google Scholar] [CrossRef]
Cook, D.A.; Brown, D.C. Synthetic Aperture Sonar Image Contrast Prediction. IEEE J. Ocean. Eng. 2018, 43, 523–535. [Google Scholar] [CrossRef]
Reed, S.; Ruiz, I.; Capus, C.; Petillot, Y. The fusion of large scale classified side-scan sonar image mosaics. IEEE Trans. Image Process. 2006, 15, 2049–2060. [Google Scholar] [CrossRef] [PubMed]
Acosta, G.G.; Villar, S.A. Accumulated CA–CFAR Process in 2-D for Online Object Detection From Sidescan Sonar Data. IEEE J. Ocean. Eng. 2015, 40, 558–569. [Google Scholar] [CrossRef]
Abu, A.; Diamant, R. Enhanced Fuzzy-Based Local Information Algorithm for Sonar Image Segmentation. IEEE Trans. Image Process. 2020, 29, 445–460. [Google Scholar] [CrossRef]
Cao, X.; Ren, L.; Sun, C. Research on Obstacle Detection and Avoidance of Autonomous Underwater Vehicle Based on Forward-Looking Sonar. IEEE Trans. Neural Netw. Learn. Syst. 2022, 1–11. [Google Scholar] [CrossRef] [PubMed]
Jiang, Y.; Ku, B.; Kim, W.; Ko, H. Side-scan sonar image synthesis based on generative adversarial network for images in multiple frequencies. IEEE Geosci. Remote Sens. Lett. 2020, 18, 1505–1509. [Google Scholar] [CrossRef]
Huo, G.; Wu, Z.; Li, J. Underwater object classification in sidescan sonar images using deep transfer learning and semisynthetic training data. IEEE Access 2020, 8, 47407–47418. [Google Scholar] [CrossRef]
Yu, Y.; Zhao, J.; Gong, Q.; Huang, C.; Zheng, G.; Ma, J. Real-time underwater maritime object detection in side-scan sonar images based on transformer-YOLOv5. Remote Sens. 2021, 13, 3555. [Google Scholar] [CrossRef]
Sun, Y.; Zheng, H.; Zhang, G.; Ren, J.; Xu, H.; Xu, C. DP-ViT: A Dual-Path Vision Transformer for Real-Time Sonar Target Detection. Remote Sens. Remote Sens. 2022, 14, 5807. [Google Scholar] [CrossRef]
Cheng, Z.; Huo, G.; Li, H. A Multi-Domain Collaborative Transfer Learning Method with Multi-Scale Repeated Attention Mechanism for Underwater Side-Scan Sonar Image Classification. Remote Sens. 2022, 14, 355. [Google Scholar] [CrossRef]
Rahnemoonfar, M.; Rahman, A.F.; Kline, R.J.; Greene, A. Automatic Seagrass Disturbance Pattern Identification on Sonar Images. IEEE J. Ocean. Eng. 2019, 44, 132–141. [Google Scholar] [CrossRef]
Zhou, T.; Si, J.; Wang, L.; Xu, C.; Yu, X. Automatic Detection of Underwater Small Targets Using Forward-Looking Sonar Images. IEEE Trans. Geosci. Remote Sens. 2022, 60, 4207912. [Google Scholar] [CrossRef]
Toet, A.; Wu, T. Small maritime target detection through false color fusion. In Proceedings of the Optics and Photonics in Global Homeland Security IV, Orlando, FL, USA, 17–20 March 2008; Volume 6945, pp. 171–179. [Google Scholar]
Zhu, H.; Liu, S.; Deng, L.; Li, Y.; Xiao, F. Infrared Small Target Detection via Low-Rank Tensor Completion With Top-Hat Regularization. IEEE Trans. Geosci. Remote Sens. 2020, 58, 1004–1016. [Google Scholar] [CrossRef]
Lane, D.; Chantler, M.; Dai, D. Robust tracking of multiple objects in sector-scan sonar image sequences using optical flow motion estimation. IEEE J. Ocean. Eng. 1998, 23, 31–46. [Google Scholar] [CrossRef]
Rao, A.S.; Gubbi, J.; Marusic, S.; Palaniswami, M. Crowd Event Detection on Optical Flow Manifolds. IEEE Trans. Cybern. 2016, 46, 1524–1537. [Google Scholar] [CrossRef]
Maussang, F.; Chanussot, J.; Hetet, A.; Amate, M. Mean–Standard Deviation Representation of Sonar Images for Echo Detection: Application to SAS Images. IEEE J. Ocean. Eng. 2007, 32, 956–970. [Google Scholar] [CrossRef]
Ekstrom, M.P. Digital Image Processing Techniques; Academic Press: Cambridge, MA, USA, 2012; Volume 2. [Google Scholar]
Zheng, L.; Tian, K. Detection of small objects in sidescan sonar images based on POHMT and Tsallis entropy. Signal Process. 2018, 142, 168–177. [Google Scholar] [CrossRef]
Richards, M.A. Fundamentals of Radar Signal Processing; McGraw-Hill Education: New York, NY, USA, 2014. [Google Scholar]
Schwegmann, C.P.; Kleynhans, W.; Salmon, B.P. Manifold Adaptation for Constant False Alarm Rate Ship Detection in South African Oceans. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 3329–3337. [Google Scholar] [CrossRef] [Green Version]
Cao, Z.; Li, J.; Song, C.; Xu, Z.; Wang, X. Compressed Sensing-Based Multitarget CFAR Detection Algorithm for FMCW Radar. IEEE Trans. Geosci. Remote Sens. 2021, 59, 9160–9172. [Google Scholar]
Villar, S.A.; De Paula, M.; Solari, F.J.; Acosta, G.G. A Framework for Acoustic Segmentation Using Order Statistic-Constant False Alarm Rate in Two Dimensions From Sidescan Sonar Data. IEEE J. Ocean. Eng. 2018, 43, 735–748. [Google Scholar] [CrossRef]
Mignotte, M.; Collet, C.; Perez, P.; Bouthemy, P. Sonar image segmentation using an unsupervised hierarchical MRF model. IEEE Trans. Image Process. 2000, 9, 1216–1231. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Ye, X.F.; Zhang, Z.H.; Liu, P.X.; Guan, H.L. Sonar image segmentation based on GMRF and level-set models. Ocean Eng. 2010, 37, 891–901. [Google Scholar] [CrossRef]
Palomeras, N.; Furfaro, T.; Williams, D.P.; Carreras, M.; Dugelay, S. Automatic Target Recognition for Mine Countermeasure Missions Using Forward-Looking Sonar Data. IEEE J. Ocean. Eng. 2022, 47, 141–161. [Google Scholar] [CrossRef]
Chen, C.L.P.; Li, H.; Wei, Y.; Xia, T.; Tang, Y.Y. A Local Contrast Method for Small Infrared Target Detection. IEEE Trans. Geosci. Remote Sens. 2014, 52, 574–581. [Google Scholar] [CrossRef]
Han, J.; Ma, Y.; Zhou, B.; Fan, F.; Liang, K.; Fang, Y. A Robust Infrared Small Target Detection Algorithm Based on Human Visual System. IEEE Geosci. Remote Sens. Lett. 2014, 11, 2168–2172. [Google Scholar]
Liu, J.; He, Z.; Chen, Z.; Shao, L. Tiny and Dim Infrared Target Detection Based on Weighted Local Contrast. IEEE Geosci. Remote Sens. Lett. 2018, 15, 1780–1784. [Google Scholar] [CrossRef]
Gao, C.; Meng, D.; Yang, Y.; Wang, Y.; Zhou, X.; Hauptmann, A.G. Infrared Patch-Image Model for Small Target Detection in a Single Image. IEEE Trans. Image Process. 2013, 22, 4996–5009. [Google Scholar] [CrossRef] [PubMed]
Cobb, J.T. Sonar Image Modeling for Texture Discrimination and Classification; University of Florida: Gainesville, FL, USA, 2011. [Google Scholar]
He, W.; Yao, Q.; Li, C.; Yokoya, N.; Zhao, Q.; Zhang, H.; Zhang, L. Non-local meets global: An iterative paradigm for hyperspectral image restoration. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 2089–2107. [Google Scholar] [CrossRef] [PubMed]
Chang, Y.; Yan, L.; Zhong, S. Hyper-Laplacian Regularized Unidirectional Low-Rank Tensor Recovery for Multispectral Image Denoising. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 5901–5909. [Google Scholar]
Xie, T.; Li, S.; Fang, L.; Liu, L. Tensor Completion via Nonlocal Low-Rank Regularization. IEEE Trans. Cybern. 2019, 49, 2344–2354. [Google Scholar] [CrossRef]
Wang, L.; Xiong, Z.; Shi, G.; Wu, F.; Zeng, W. Adaptive Nonlocal Sparse Representation for Dual-Camera Compressive Hyperspectral Imaging. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 2104–2111. [Google Scholar] [CrossRef] [PubMed]
Ng, M.K.P.; Yuan, Q.; Yan, L.; Sun, J. An Adaptive Weighted Tensor Completion Method for the Recovery of Remote Sensing Images With Missing Data. IEEE Trans. Geosci. Remote Sens. 2017, 55, 3367–3381. [Google Scholar] [CrossRef]
Xie, Q.; Zhao, Q.; Meng, D.; Xu, Z. Kronecker-Basis-Representation Based Tensor Sparsity and Its Applications to Tensor Recovery. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 40, 1888–1902. [Google Scholar] [CrossRef]
Zhu, H.; Ni, H.; Liu, S.; Xu, G.; Deng, L. TNLRS: Target-Aware Non-Local Low-Rank Modeling With Saliency Filtering Regularization for Infrared Small Target Detection. IEEE Trans. Image Process. 2020, 29, 9546–9558. [Google Scholar] [CrossRef]
Wright, J.; Ganesh, A.; Rao, S.; Peng, Y.; Ma, Y. Robust principal component analysis: Exact recovery of corrupted low-rank matrices via convex optimization. Adv. Neural Inf. Process. Syst. 2009, 22, 1–9. [Google Scholar]
Cai, J.F.; Candès, E.J.; Shen, Z. A Singular Value Thresholding Algorithm for Matrix Completion. SIAM J. Optim. 2010, 20, 1956–1982. [Google Scholar] [CrossRef]
Hiriart-Urruty, J.B.; Lemaréchal, C. Convex Analysis and Minimization Algorithms I: Fundamentals; Springer Science & Business Media: Cham, Switzerland, 2013; Volume 305. [Google Scholar]
Li, L.; Li, W.; Du, Q.; Tao, R. Low-Rank and Sparse Decomposition With Mixture of Gaussian for Hyperspectral Anomaly Detection. IEEE Trans. Cybern. 2021, 51, 4363–4372. [Google Scholar] [CrossRef]
Gao, Z.; Cheong, L.F.; Wang, Y.X. Block-Sparse RPCA for Salient Motion Detection. IEEE Trans. Pattern Anal. Mach. Intell. 2014, 36, 1975–1987. [Google Scholar] [CrossRef] [PubMed]
Xu, Y.; Wu, Z.; Chanussot, J.; Wei, Z. Joint Reconstruction and Anomaly Detection From Compressive Hyperspectral Images Using Mahalanobis Distance-Regularized Tensor RPCA. IEEE Trans. Geosci. Remote Sens. 2018, 56, 2919–2930. [Google Scholar] [CrossRef]
Vaswani, N.; Bouwmans, T.; Javed, S.; Narayanamurthy, P. Robust Subspace Learning: Robust PCA, Robust Subspace Tracking, and Robust Subspace Recovery. IEEE Signal Process. Mag. 2018, 35, 32–55. [Google Scholar] [CrossRef] [Green Version]
Beck, A.; Teboulle, M. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imaging Sci. 2009, 2, 183–202. [Google Scholar] [CrossRef] [Green Version]
Deshpande, S.D.; Er, M.H.; Venkateswarlu, R.; Chan, P. Max-mean and max-median filters for detection of small targets. In Proceedings of the Signal and Data Processing of Small Targets 1999, Denver, CO, USA, 19–23 July 1999; Volume 3809, pp. 74–83. [Google Scholar]
Deng, H.; Sun, X.; Liu, M.; Ye, C.; Zhou, X. Small Infrared Target Detection Based on Weighted Local Difference Measure. IEEE Trans. Geosci. Remote Sens. 2016, 54, 4204–4214. [Google Scholar] [CrossRef]

Figure 1. Side-scan sonar images and their 3D visualization. (a–c) The first row shows three side-scan sonar images and the second row shows the 3D visualization results of the corresponding images with different SCR conditions, in which the red rectangles are small target samples from different areas of the seafloor.

Figure 2. The flowchart of our proposed algorithm based on low-rank sparse matrix factorization. The algorithm can be divided into three parts: image preprocessing, matrix factorization, and morphological operation. The red rectangular box is the detection result.

Figure 3. Original SSS image and preprocessed image. (a) A representative SSS image. (b) The preprocessed image. (c) The histogram of the original SSS image (a). (d) The histogram of the preprocessed image (b).

Figure 4. Sonar background images and their singular value results. The first row (a–c) shows three sonar background images and the second row (d–f) shows the singular values of the corresponding preprocessed images. (a–c) The background images with a different submarine environment, and the black area is an acoustic shadow in subfigure (c).

Figure 5. The area proportion of the target in the whole image for three sequences. The proportion of small targets in SSS images occupies very few pixels compared to the total image.

Figure 6. Illustration of the experimental scheme. The experiment collects SSS images under various conditions including various scanning heights, scanning angles, and orientation angles.

Figure 7. Overall structure of AUV. The AUV consists of seven parts: head section, side-scan sonar, navigation and control unit, propulsion unit I, propulsion unit II, tail section, and towrope.

Figure 8. The obtained SSS image containing the small targets. They were collected in different waters.

Figure 9. The ROC curves of eight algorithms for the three real image sequences. (a) Real Seq. 1, (b) Real Seq. 2, (c) Real Seq. 3.

Figure 10. 3D visualization of the sonar image detection result. The first row shows three input sonar images (the red box in the image is the target), the second row shows the input SSS images plotted in 3D (across-track, along-track, and backscattering strength), and the third row shows the detection results plotted in 3D (across-track, along-track, and backscattering strength).

Figure 11. Visualization result of different algorithms in three sequences. The red rectangular box is the ground truth and the yellow rectangular box is the detection result.

Figure 12. ROC curves of (a) Sequence 1, (b) Sequence 2, (c) Sequence 3.

Figure 13. Probability of detection for the (a)

F_{a}

= 0.5, (b)

F_{a}

= 1, (c)

F_{a}

= 1.8.

Figure 13. Probability of detection for the (a)

F_{a}

= 0.5, (b)

F_{a}

= 1, (c)

F_{a}

= 1.8.

Figure 14. Visualization results of the ablation experiment. The red rectangular box is the ground truth.

Table 1. Specifications of the side-scan sonar.

Parameter	Value
Operating frequency	260 kHz/800 kHz
Transducer beam width	260 kHz: 2.2° × 75°/800 kHz: 0.7° × 30°
Range resolution	0.2% of range
Maximum imaging range	100 m
Image size	2000 × 500 pixels
Frame rate	6–20 fps
Depth rating	300 m

Table 2. Details of the three sequences.

Sequence	Number	The Target Size in the Whole Image	The Mean of SCR	The Variances of SCR
Seq1 (High SCR)	100	0.011	6.36	0.16
Seq2 (Medium SCR)	100	0.012	4.91	0.53
Seq3 (Low SCR)	100	0.009	1.08	0.37

Table 3. Probability of detection for different algorithms.

$F_{a}$	Method	$P_{d}$ (Seq1)	$P_{d}$ (Seq2)	$P_{d}$ (Seq3)
0.5	IPI	0.850	0.852	0.711
	LCM	0.825	0.855	0.666
	AMWLCM	0.599	0.778	0.140
	ILCM	0.640	0.853	0.583
	TOPHAT	0.598	0.245	0.350
	MAXMEAN	0.622	0.511	0.347
	ACA_CFAR	0.661	0.781	0.525
	OURS	0.895	0.900	0.732
1.0	IPI	0.877	0.875	0.743
	LCM	0.861	0.887	0.754
	AMWLCM	0.625	0.823	0.210
	ILCM	0.733	0.877	0.665
	TOPHAT	0.702	0.315	0.379
	MAXMEAN	0.721	0.547	0.465
	ACA_CFAR	0.750	0.799	0.665
	OURS	0.920	0.902	0.783

Table 4. Probability of detection for the ablation experiment.

$F_{a}$	Image Preprocessing	RPCA	Improved RPCA	Morphological Operation	$P_{d}$ (Seq1)	$P_{d}$ (Seq2)	$P_{d}$ (Seq3)
	✘	✔	✘	✘	0.474	0.368	0.301
0.5	✘	✘	✔	✘	0.526	0.672	0.579
	✔	✘	✔	✘	0.862	0.840	0.706
	✔	✘	✔	✔	0.895	0.900	0.732
	✘	✔	✘	✘	0.737	0.605	0.403
1.0	✘	✘	✔	✘	0.837	0.805	0.753
	✔	✘	✔	✘	0.899	0.881	0.737
	✔	✘	✔	✔	0.920	0.902	0.783
	✘	✔	✘	✘	0.868	0.642	0.516
1.8	✘	✘	✔	✘	0.842	0.811	0.758
	✔	✘	✔	✘	0.921	0.887	0.780
	✔	✘	✔	✔	0.947	0.903	0.815

Table 5. Computational cost comparison among the proposed algorithm and other algorithms. (unit: s).

Sequence	IPI	LCM	AMWLCM	ILCM	TOPHAT	MAXMEAN	ACA_CFAR	OURS
1	1.832	0.309	4.902	0.270	0.260	0.316	33.499	1.853
2	2.799	0.371	6.360	0.319	0.275	0.304	46.8	2.943
3	1.486	0.303	3.972	0.280	0.257	0.288	28.754	1.495

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

He, J.; Chen, J.; Xu, H.; Ayub, M.S. Small Target Detection Method Based on Low-Rank Sparse Matrix Factorization for Side-Scan Sonar Images. Remote Sens. 2023, 15, 2054. https://doi.org/10.3390/rs15082054

AMA Style

He J, Chen J, Xu H, Ayub MS. Small Target Detection Method Based on Low-Rank Sparse Matrix Factorization for Side-Scan Sonar Images. Remote Sensing. 2023; 15(8):2054. https://doi.org/10.3390/rs15082054

Chicago/Turabian Style

He, Ju, Jianfeng Chen, Hu Xu, and Muhammad Saad Ayub. 2023. "Small Target Detection Method Based on Low-Rank Sparse Matrix Factorization for Side-Scan Sonar Images" Remote Sensing 15, no. 8: 2054. https://doi.org/10.3390/rs15082054

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Small Target Detection Method Based on Low-Rank Sparse Matrix Factorization for Side-Scan Sonar Images

Abstract

1. Introduction

2. Related Work

2.1. Target Detection Based on Pixel Domain

2.2. Target Detection Based on Feature Domain

2.3. Traditional Detection for Small Targets

3. The Proposed Method

3.1. Image Preprocessing

3.2. Low-Rank Matrix and Sparse Matrix

3.3. Optimization Method

3.4. Morphological Operation

4. Experimental Results and Discussion

4.1. Real Sonar Datasets Description

4.2. Evaluation

4.3. Comparison with Other Methods

4.4. Ablation Experiments

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI