Article

A Multichannel-Based Deep Learning Framework for Ocean SAR Scene Classification

1 Beijing Institute of Applied Meteorology, Beijing 100029, China
2 Key Laboratory of Smart Earth, Beijing 100029, China
3 College of Engineering, Ocean University of China, Qingdao 266100, China
* Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(4), 1489; https://doi.org/10.3390/app14041489
Submission received: 6 December 2023 / Revised: 30 January 2024 / Accepted: 8 February 2024 / Published: 12 February 2024
(This article belongs to the Section Marine Science and Engineering)

Abstract

High-resolution synthetic aperture radar (SAR) is becoming an indispensable environmental monitoring system for capturing important geophysical phenomena on the Earth's land and sea surfaces. However, there is a lack of comprehensive models that can orchestrate the large-scale datasets produced by numerous satellite missions such as GaoFen-3 and Sentinel-1. In addition, SAR images of different ocean scenes need to convey a variety of high-level classification features of oceanic and atmospheric phenomena. In this study, we propose a multichannel neural network (MCNN) that supports oceanic SAR scene classification from limited oceanic data samples by means of multi-feature fusion, data augmentation, and multichannel feature extraction. To exploit the multichannel semantics of SAR scenes, the multi-feature fusion module effectively combines and reshapes the spatiotemporal SAR images to preserve their structural properties. A fine-grained feature augmentation policy is applied to improve the data quality so that the classification model is less vulnerable to both small- and large-scale data. The multichannel feature extraction module also aggregates different oceanic features convolutionally extracted from ocean SAR scenes to improve the classification accuracy of oceanic phenomena with different scales. Through extensive experimental analysis, our MCNN framework demonstrates a commendable classification performance, achieving an average precision rate of 96%, an average recall rate of 95%, and an average F-score of 95% across ten distinct oceanic phenomena. Notably, it surpasses two state-of-the-art classification techniques, namely, AlexNet and CMwv, by margins of 23.7% and 18.3%, respectively.

1. Introduction

Synthetic aperture radar (SAR), with its multi-dimensional imaging capability, has proven to be a vital technology for studying a wide variety of air–sea interaction processes [1,2]. It allows various satellite missions to obtain thousands of spatiotemporal images, e.g., ERS-1/2 (1991–2003), Environmental Satellite (ENVISAT)/Advanced Synthetic Aperture Radar (ASAR) (2002–2012), TerraSAR-X (2007–present), RADARSAT-1/2 (1995–present), GaoFen (GF)-3 (2016–present), and Sentinel-1 (S-1) (2014–present). Because SAR imaging is independent of atmospheric conditions and sunlight, the accessibility of SAR images offers unprecedented convenience for exploration in research directions such as bathymetric mapping [3], ocean current analysis [4], wind direction estimation [5], ship target detection [6], and ice shelf monitoring [7]. Moreover, SAR data have been used in diverse applications, including fish species recognition [8], urban land classification [9], marine spatial planning [10], and earthquake disaster prediction [11]. To further broaden the versatility of SAR technology and highlight its potential to contribute to an array of scientific and practical domains, geophysical researchers have employed machine learning and deep learning techniques to support fish species recognition, military objective protection, and natural hazard prevention [12,13,14]. However, SAR images inherently exhibit speckle noise due to the attenuation of echo signals, which manifests as a distribution of granular patterns [15]. This noise is typically characterized as multiplicative and affects not only SAR images but all coherent imaging systems. Therefore, the primary challenge in the automated classification of SAR images is speckle noise interference. On the one hand, the noise poses difficulties for tasks such as feature extraction and edge detection [16]. This interference not only diminishes the contrast of the images but also modifies the spatial statistics of the underlying scene backscatter, thereby compromising the interpretation and recognition of targets. On the other hand, speckle noise poses a serious challenge for oceanic scene classification with multiple SAR images and degrades the reliability of prediction tasks such as forecasting meteorological conditions and analyzing atmospheric patterns [17,18,19]. As depicted in Figure 1, the filtered image (Figure 1b) reveals enhanced details in contrast to the original SAR image (Figure 1a), illustrating the improvements achieved through noise removal.
Due to the small differences in backscattering characteristics [21], it is difficult to precisely distinguish a specific ocean phenomenon from similar marine environments. Moreover, surface current features cannot be extracted because of the dominant wind contribution to the backscatter data in dual-polarized mode [22]. Furthermore, there is presently a lack of comprehensive annotated datasets for SAR targets. This limitation arises from the expensive nature of acquiring SAR data and the high costs involved in ensuring quality annotation [23]. Hence, relatively insufficient labeled data are available for training and modeling due to the long acquisition period and the expensive labeling process [24]. Two recent works report that most previous ocean SAR image-based applications involve only limited regional case studies, such as those conducted in the East China Sea and the Malacca Strait, where SAR images are employed for the identification of ocean eddies and the analysis of the spatiotemporal distribution of internal solitary waves [25,26].
To tackle the aforementioned issues, deep neural network techniques have been well-studied to alleviate the influence of speckle noise and backscattering features [27]. For example, a deep encoder–decoder convolutional neural network (CNN) architecture is established for the speckle filtering tasks [28], which outperformed classical algorithms on both simulated and real data. A noisy reference-based SAR deep learning filter was proposed [29], achieving a better despeckling performance over certain state-of-the-art deep learning techniques. But when these deep learning methods are simply applied to SAR image-based detection, some important SAR domain knowledge and corresponding oceanic features are eliminated by specific feature extraction and transformation [30]. Compared to deep learning strategies, manual classification and recognition significantly depend on visual features [31], introducing inefficiencies in processing marine big data. Each visual task necessitates predefined artificial features, resulting in time-consuming efforts. Furthermore, these manually defined features are relatively low-level when compared to those extracted by deep learning methods, thereby constraining the comprehensive representation of marine objects. Therefore, for effective utilization of SAR images by scientific users, it is critical to develop an appropriate deep learning framework for standardizing the use of SAR images so that they can be seamlessly integrated into the repertoire of the public oceanic datasets for category information analysis.
Recently, data augmentation has been extensively studied to synthetically enhance the diversity of training datasets, by a factor of up to 1000, through image rotation, random transformation, and horizontal flipping [32]. Furthermore, several efforts have explored the use of data augmentation to further boost the performance of artificial intelligence methods for automatic SAR image classification and oceanic phenomenon detection [33,34]. In addition, the transfer-learning strategy has been verified as an efficient way to develop a robust CNN model for specific applications in the case of limited datasets [23]. Hence, an effective oceanic phenomenon recognition system also needs to provide an augmentation mechanism for SAR scene detection applications to efficiently expand the data size without costly data collection.
In addition, the routine SAR wave mode (WM) measurements from multiple satellite missions typically exhibit fine spatial resolution (4 m), large scene footprints (20 km × 20 km), and high signal-to-noise ratios, which facilitates SAR image despeckling [35]. Because the massive wave mode data at the global scale, approximately 120 k images per month, contain redundant geophysical information, it is feasible to capture the useful geophysical properties with a method that combines feature extraction using the wavelet–radon transform (WR) with classification employing a neural network technique, specifically a dynamic neural network (DNN) [36]. Conventional image processing methods are commonly based on visual inspection and handcrafted features [37,38], which are notably impractical for generalizing the structural patterns of different geophysical phenomena [39,40].
CNN architectures serve as an automated framework for feature learning and representation, facilitating the extraction of features from images in a unified manner, as demonstrated in the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) [41]. AlexNet, introduced by Krizhevsky et al. in 2012 [42], emerged as the pioneering large-scale convolutional neural network (CNN) in the field of computer vision. It achieved a notable image classification performance, attaining an 83.6% top-five accuracy, compared to manual classification methods (e.g., histograms of oriented gradients (HoG) [43], scale-invariant feature transform (SIFT) [44], and local binary patterns (LBP) [45]). The architecture known as GoogLeNet or Inception-v1, as detailed in the work by [46], won the ILSVRC 2014 challenge with an accuracy rate of 93.7%. The classical Inception architecture then evolved into Inception-v2 and Inception-v3 [47], achieving a 94.4% accuracy on the ILSVRC 2012 classification dataset, in contrast to the manual classification method SIFT [44]. A variant of Inception-v3 CNNs (denoted CMwv by [48]) has been employed for automatic SAR image classification and achieved the best accuracy for geophysical phenomenon detection on the public dataset. However, CMwv exhibits limited generalization capability in capturing oceanic signatures, as evidenced by its notably low precision, approaching zero, in the detection of oceanic fronts (OF), and a recall below 50% in the identification of pure ocean waves (PW). Moreover, CMwv's reliance on Inception-v3's image input techniques leaves the distinctive features of various oceanic phenomena and the domain-specific knowledge embedded in SAR images unconsidered.
Taken together, to improve model reliability and guide spatiotemporal analysis, it is imperative to integrate oceanic scene classification with the sophisticated characteristics of natural images in a combined scheme that can overcome the inefficiency of geophysical phenomenon prediction observed in several WV datasets. In this study, we present a multichannel neural network (MCNN) management scheme to mitigate the impact of insufficient data samples on ocean SAR scene classification. Based on multichannel data augmentation, MCNN involves multiple versions of standard CNN models equipped with convolution kernels of different sizes (1 × 1, 3 × 3, 5 × 5), conducting convolution operations to capture image features at different scales. This design enables the model to emphasize both local and global features, thereby improving its capacity to recognize ocean SAR scenes of varying scales. Moreover, MCNN employs a comprehensive approach to extract multiple features from the original SAR images. These extracted features are then organized into multiple channels within the intricately designed Inception CNN module, effectively fulfilling the goal of capturing deeper features. Additionally, several techniques have been incorporated into MCNN, such as multi-feature fusion, data augmentation, and multichannel feature extraction, to address the challenges arising from the aforementioned inadequate data. We use a number of SAR image datasets to evaluate the performance of MCNN and validate our design strategies through experimental analysis.
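As a minimal sketch of the multi-scale idea described above, the following PyTorch module runs 1 × 1, 3 × 3, and 5 × 5 convolutions in parallel and concatenates their outputs; the module name, channel counts, and activation are illustrative assumptions rather than the exact MCNN configuration.

```python
import torch
import torch.nn as nn

class MultiScaleBlock(nn.Module):
    """Illustrative multi-branch block: parallel 1x1, 3x3, and 5x5 convolutions
    whose outputs are concatenated so that local and larger-scale features are
    captured side by side. Channel counts are arbitrary placeholders."""
    def __init__(self, in_ch: int, branch_ch: int = 32):
        super().__init__()
        self.b1 = nn.Conv2d(in_ch, branch_ch, kernel_size=1)
        self.b3 = nn.Conv2d(in_ch, branch_ch, kernel_size=3, padding=1)
        self.b5 = nn.Conv2d(in_ch, branch_ch, kernel_size=5, padding=2)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Padding keeps every branch at the same spatial size, so channel-wise
        # concatenation is valid.
        return self.act(torch.cat([self.b1(x), self.b3(x), self.b5(x)], dim=1))

# Example: a 3-channel 299x299 input produces a 96-channel feature map.
features = MultiScaleBlock(3)(torch.randn(1, 3, 299, 299))
print(features.shape)  # torch.Size([1, 96, 299, 299])
```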
In summary, our paper makes the following contributions:
  • We first illustrate and describe the wave mode data from the Sentinel-1 and TenGeoP-SARwv datasets. To cover the whole stereoscopic structure of the geophysical phenomena, we construct a new multi-dimensional dataset and data processing module that efficiently extracts, concatenates, and processes different area data from multiple satellite missions.
  • We propose a novel deep learning framework, namely, MCNN, including multi-feature fusion for capturing valuable information, data augmentation for perturbing feature embedding, and multichannel feature extraction for accurately distinguishing different types of oceanic phenomena.
  • We evaluate the performance of MCNN with a broad set of WV data and deep learning methods. Our experimental results demonstrate that MCNN achieves high classification accuracy and significantly improves the data quality, outperforming two state-of-the-art methods by 23.7% and 18.3%, respectively.
The rest of the paper is organized as follows: Section 2 illustrates the Sentinel-1 wave mode and TenGeoP-SARwv datasets and Section 3 details the design of the MCNN framework. Section 4 provides an analysis of ocean SAR scene classification accuracy and presents the experimental results, followed by Section 5, which concludes this paper.

2. Dataset and Data Processing

In this study, we mainly use ocean SAR images from Sentinel-1 (S-1) in different wave modes because of its routine wave mode measurements at the global scale. To train an effective deep learning classification architecture, we construct training datasets from the TenGeoP-SARwv database. All the related datasets are described in the following.

2.1. Sentinel-1 Wave Mode

The Sentinel-1 (S-1) mission is designed as a two-satellite constellation of polar-orbiting, sun-synchronous satellites [35,49], namely, Sentinel-1A and Sentinel-1B, launched by the European Space Agency (ESA) in April of 2014 and 2016, respectively. In order to provide an effective 6-day repeat cycle, the two satellites share the same orbital plane offset by a 180° phase difference. The S-1 microwave SAR instruments operate at a wavelength of approximately 5.5 cm. These satellites are equipped with identical microwave SAR instruments that operate in four imaging modes: WV, strip map, extra-wide swath, and interferometric wide swath. WV is the primary imaging mode used over the open ocean, unless there are specific requests for other imaging modes. S-1 WV small SAR image scenes are acquired in a leapfrog scheme at two alternating center incidence angles of 23° (WV1) and 36.5° (WV2). The small SAR image scenes are referred to as imagettes in this research. Each imagette has dimensions of 20 km × 20 km and a spatial resolution of 5 m. Although some horizontal (HH) polarization images have been acquired during special mission phases, acquisitions at both incidence angles are routinely made in vertical (VV) transmit and receive polarization. This study mainly focuses on S-1A WV data, considering that S-1B imagettes exhibit essentially equivalent characteristics to those of S-1A imagettes [48]. The joint utilization of the S-1A and S-1B satellites will enhance the coverage in both temporal and spatial dimensions for various applications related to geophysical phenomena. Furthermore, the designed classification models can also be applied to S-1B.

2.2. TenGeoP-SARwv Dataset

TenGeoP-SARwv is a comprehensive dataset containing over 37,000 labeled SAR images (20 km × 20 km) captured over the ocean [50], each associated with one of ten well-defined geophysical phenomena described in Figure 2 that are frequently observed and expertly categorized. This dataset holds significant utility for researchers and practitioners engaged in the fields of oceanography, meteorology, and remote sensing, as well as those involved in deep learning endeavors. The dataset comprises 16,068 imagettes acquired in WV1 mode and 21,485 imagettes acquired in WV2 mode. Each imagette within this dataset is annotated with one of ten geophysical phenomena, encompassing both oceanic and meteorological characteristics. Figure 2a–j illustrates these ten conventional geophysical phenomena: atmospheric fronts (AF), biological slicks (BS), icebergs (IB), low-wind areas (LWA), microconvective cells (MCC), oceanic fronts (OF), pure ocean waves (PW), rain cells (RC), sea ice (SI), and windstreaks (WS). Furthermore, data quality assurance is meticulously conducted through a two-stage process. First, the raw data are transformed using the European Space Agency's nominal calibration method, resulting in the generation of the normalized radar cross section (NRCS). Second, the NRCS is recalibrated using the CMOD5n model function [51] to mitigate the impact of incidence angle variations.
This dataset is generously supplied by the French Research Institute for the Exploitation of the Sea (IFREMER). It is available at http://www.seanoe.org/data/00456/56796/ (accessed on 7 February 2024). Figure 3 displays the distribution of the TenGeoP-SARwv dataset on a 5° × 5° global spatial grid. Although this SAR dataset covers the majority of WV acquisition regions, such as the Pacific, Indian, and South Atlantic oceans, the SAR image density in each region is quite low, with even the most densely sampled grid cell containing fewer than 40 WV images. The task of recognizing multiple oceanic phenomena can easily be hindered by this lack of high-quality data. To satisfy the requirement of training an effective classification model, these imagettes are chosen with the criterion that one geophysical phenomenon dominates with its special characteristics. It is worth mentioning that the format of SAR images is important for visual interpretation and dynamic detectability.

2.3. Data Processing

In order to avoid overfitting in deep neural networks for oceanic phenomenon prediction, we expand the geographic scope to the full range of the operational maritime domain. In the TenGeoP-SARwv dataset of Figure 2a–j provided by the ESA, these ten grayscale images collectively encompass an extensive span of the ocean's surface, encapsulating a diverse array of features. Employing despeckling techniques such as the Lee filter, the enhanced Lee filter, and supervised denoising methods [20,52], these images strive to achieve an optimal equilibrium between mitigating speckle noise and retaining intricate feature details. However, it is imperative to acknowledge that the pixel intensity within these images inherently exhibits a limited range, a characteristic attributed to the underlying principles of SAR imaging system theory. This means that their features are not readily apparent because of low contrast. All original global data records are mixed in a disorderly manner. Thus, we first create nested loops and conditionals to iterate over all records and judge whether each oceanic phenomenon falls within the above geographic region. Each oceanic phenomenon is identified according to a specific signature or pattern. The C-band radar backscatter response from the sea surface is complicated; it is predominantly contingent upon the incidence angle and the relative azimuthal angle between the radar look direction and the surface wind direction. In certain atmospheric circumstances, notably under conditions characterized by elevated wind speeds exceeding 15 m/s, the backscatter is influenced by the combined effects of winds and oceanic wave states. As a result, the mentioned phenomena other than ocean waves (i.e., wind waves, swell, surface waves) may be inadequately represented or captured in the dataset.
SAR images exhibit inherent statistical characteristics, and a majority of the statistical models employed in their analysis originate from the multiplicative noise model. Hence, these imagettes are characterized by the presence of multiplicative noise, which leads to variability and interference in the form of undesired fluctuations, thereby affecting the fidelity of the acquired data [53]. It is obvious that the varied noise causes rapid changes in image grayscale and serious speckle in bright regions. To fully characterize SAR image speckle, we construct a noise model and an image intensity definition based on coherent radiation as follows:
$$N(t) = B(t)\, s(t), \tag{1}$$
$$C(i,j) = x(i,j)\, n(i,j). \tag{2}$$
In Equation (1), the variable $N(t)$ denotes the signal with noise embedded into it, while $B(t)$ represents the radar backscatter property of ground targets without noise. The selection of an optimal distribution for each of these random variables relies on the types of ground targets. The variable $s(t)$ denotes speckle noise, which is independent of $B(t)$. During the modeling process, we opt to model SAR image speckle noise as multiplicative noise with a mean of 1. This modeling principle ensures that the multiplicative noise does not introduce an overall bias, preserving the average brightness and intensity of the image unaffected by speckle noise. This is advantageous for various image processing tasks, allowing us to assume that the average reflection intensity remains unaltered during image processing. Thus, in Equation (1), the mean value of $s(t)$ is one and its variance is related to the equivalent number of looks. In addition, in consideration of the multiplicative noise model, we define the intensity of an image with speckle noise by Equation (2). $C(i,j)$ stands for the intensity of SAR image pixels at coordinate $(i,j)$. The variable $x(i,j)$ represents the intensity of SAR image pixels without speckle noise, and $n(i,j)$ represents the intensity of speckle noise.
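A minimal NumPy sketch of the multiplicative model in Equation (2) is given below; the gamma-distributed speckle with mean 1 and the number of looks are illustrative modeling assumptions, not values taken from the paper.

```python
import numpy as np

def add_multiplicative_speckle(image, looks=4, seed=0):
    """Simulate C(i, j) = x(i, j) * n(i, j): multiply a clean intensity image by
    gamma-distributed speckle with mean 1 and variance 1 / looks."""
    rng = np.random.default_rng(seed)
    speckle = rng.gamma(shape=looks, scale=1.0 / looks, size=image.shape)
    return image * speckle

clean = np.full((128, 128), 0.5)          # hypothetical uniform backscatter
noisy = add_multiplicative_speckle(clean)
print(noisy.mean(), noisy.var())          # mean stays near 0.5; variance comes from speckle
```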
To eliminate speckle noise in SAR images, a Lee filter is designed to preserve edges and point features during speckle noise reduction [54]. In a Lee filter, a variable-sized window moves consistently from left to right and top to bottom across the SAR image to compute local statistics. To sustain the characteristics of oceanic phenomena, our Lee filter is based on a linear speckle noise model and minimum mean square error methods to enhance the data quality as follows:
$$\hat{Y}(t) = \bar{I}(t) + W(t)\left(I(t) - \bar{I}(t)\right), \tag{3}$$
where $\bar{I}(t)$ stands for the mean value of the intensity within the filter window and $W(t)$ represents the adaptive filter coefficient defined in Equation (4):
$$W(t) = 1 - \frac{C_v}{C_I}. \tag{4}$$
The variable $\hat{Y}(t)$ represents the intensity of the filtered SAR images in Equation (3). $I(t)$ is the intensity of the global pixels without noise. Typically, the value of $W(t)$ tends to zero in homogeneous regions, yielding outcomes analogous to those achieved by the mean filter. Meanwhile, the value of $W(t)$ tends towards one near edges, so that pixels in close proximity to these edges are altered the least. The impact of the Lee filter is clearly exhibited in Figure 1.
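A compact sketch of Equations (3)–(4) is shown below; the window size and the speckle coefficient of variation `cv_noise` are illustrative choices, not parameters reported in the paper.

```python
import numpy as np
from scipy.ndimage import uniform_filter

def lee_filter(img, window=7, cv_noise=0.5):
    """Minimal Lee filter: Y = I_mean + W * (I - I_mean) with W = 1 - C_v / C_I
    computed from local window statistics."""
    local_mean = uniform_filter(img, size=window)
    local_sq = uniform_filter(img * img, size=window)
    local_std = np.sqrt(np.maximum(local_sq - local_mean ** 2, 0.0))
    c_i = local_std / np.maximum(local_mean, 1e-12)        # local coefficient of variation
    w = np.clip(1.0 - cv_noise / np.maximum(c_i, 1e-12), 0.0, 1.0)
    # ~mean filter in homogeneous areas (w -> 0), ~identity near edges (w -> 1)
    return local_mean + w * (img - local_mean)
```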
To improve the overall visual quality of imagettes, we apply global histogram equalization in image processing by redistributing the intensity values of the pixels in an image [55], aiming to maximize the full dynamic range available. This method involves computing the cumulative distribution function of the image’s pixel intensities and then adjusting the intensity values based on this cumulative distribution. In our data processing, we first calculate a probability density function (PDF) of the pixel intensity of an image, as defined in Equation (5):
$$p(X_k) = \frac{n_k}{N}, \quad \text{for } k = 0, 1, \ldots, L-1. \tag{5}$$
The variable $n_k$ stands for the number of image pixels with an intensity equal to $X_k$, and $N$ represents the total number of image pixels. To ensure that specific intensity values are assigned to each pixel in our imagettes, we apply a ceiling operation to round the intensity values up to the nearest integer level. This rounding operation ensures that each pixel is associated with a discrete integer intensity value, contributing to the creation of a PDF based on these rounded values. For instance, the rounding method converts continuous intensity values (e.g., 0.831, 0.832, 0.842) into discrete integer values (e.g., 1, 1, 1), thereby creating a set of distinguishable intensity levels. Because the intensity values of pixels are rounded up to integers, the number of distinct levels in the resulting PDF can be smaller than the number of original intensity values. We then accumulate the PDF elements to obtain the cumulative density values through the cumulative density function (CDF), defined as follows:
$$c(X_k) = \sum_{j=0}^{k} p(X_j), \quad \text{for } k = 0, 1, \ldots, L-1. \tag{6}$$
To further enhance the overall brightness and contrast of imagettes, we map the original image intensity into the range $[0, X_{max}]$ by the transform function:
$$f(x) = X_{max}\, c(x), \tag{7}$$
where x is the intensity of image pixels. We apply the intensity transformation function uniformly across all pixels to facilitate the mapping of the original image into a new representation characterized by the desired contrast.
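The procedure of Equations (5)–(7) can be sketched in a few lines of NumPy; the number of discrete levels (256) and the assumption that input intensities lie in [0, 1] are illustrative choices rather than settings from the paper.

```python
import numpy as np

def equalize(img, levels=256):
    """Global histogram equalization: build the PDF of discretized pixel
    intensities, accumulate the CDF, and map each pixel through f(x) = X_max * c(x)."""
    x_max = levels - 1
    quantized = np.clip(np.ceil(img * x_max), 0, x_max).astype(np.int64)
    pdf = np.bincount(quantized.ravel(), minlength=levels) / quantized.size
    cdf = np.cumsum(pdf)
    return x_max * cdf[quantized]          # equalized intensities spread over [0, X_max]

bright = equalize(np.random.rand(64, 64) * 0.2)   # low-contrast (dark) test image
```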
Our dataset comprises TenGeoP-SARwv images characterized by low contrast, frequently manifesting as dark imagery. To enhance the performance of subsequent feature extraction, we employ the aforementioned algorithms to generate brighter images.

3. MCNN Framework for Geophysical Phenomenon Classification

In this section, we introduce a novel neural network, referred to as the multichannel neural network (MCNN), which integrates the multichannel features derived from the original input for the purpose of classifying the aforementioned ten predefined geophysical phenomena shown in Figure 2. The comprehensive architecture of the MCNN algorithm, encompassing multi-feature fusion, data augmentation, and multichannel feature extraction, is visually depicted in Figure 4.
The multi-feature fusion module incorporates an efficient image edge filter to selectively preserve intricate details pertaining to oceanic phenomena present in SAR images while concurrently mitigating the disruptive effects of speckle noise. Hence, the multichannel characteristics of the input are formed by integrating the original SAR imagettes with the associated gradient magnitude derived from the applied filter. Then, feature augmentation such as horizontal translation and reflection is implemented to address the challenge of limited sample size. This approach aims to mitigate the small-sample problem by expanding both the volume and diversity of the training dataset. Furthermore, the feature extraction module utilizes a multichannel approach employing transfer-learning techniques to address the constraints imposed by limited dataset sizes. Specifically, the Inception-ResNet architecture [56] is strategically chosen to strike a favorable balance between a substantial parameter count and optimal classification performance. Finally, important weights are optimized with our softmax function to implement optimal parameter decisions for oceanic phenomenon classification. The subsequent sections comprehensively detail the different parts of our MCNN algorithm.

3.1. Multi-Feature Fusion

For each original input SAR image, the MCNN algorithm requires resizing the corresponding dimensions to 299 pixels in both height and width. Nevertheless, in a direct implementation of oceanic phenomenon classification, there is a potential for the loss of certain meaningful features and the attenuation of information that is considered less pertinent [57]. Hence, an image edge filter is implemented to retain the critical structural attributes inherent in SAR imagettes. Table 1 lists the gradient magnitudes of different image edge filters with their specified operators. The resultant gradient magnitude and the initial image are then concatenated to form the input image channels. Experiments are conducted on the TenGeoP-SARwv dataset to determine the optimal channel combination for oceanic phenomenon classification. The empirical findings indicate that the optimal performance is achieved when utilizing a multichannel input image reconstructed from both the original image and the central difference gradient. When employing a channel set combined only from gradient magnitudes, the classification accuracy decreases sharply. This can be attributed to the redundancy contained in the gradient magnitude, which does not contribute significantly to the enhancement of classification accuracy and instead introduces interfering information. Hence, the input image in this study is resized to the dimensions of 299 × 299 × 3, incorporating three channels with two original SAR image channels and a central difference gradient.
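The following sketch assembles such a three-channel input; the exact channel ordering (two copies of the resized imagette plus one gradient channel) is an assumption for illustration, and OpenCV is used only as a convenient resizing tool.

```python
import numpy as np
import cv2

def build_mcnn_input(imagette):
    """Assemble a 299 x 299 x 3 input: the resized SAR imagette duplicated on two
    channels plus its central-difference gradient magnitude on the third."""
    img = cv2.resize(imagette.astype(np.float32), (299, 299))
    gy, gx = np.gradient(img)                 # central differences along rows and columns
    grad_mag = np.sqrt(gx ** 2 + gy ** 2)
    return np.dstack([img, img, grad_mag])    # shape (299, 299, 3)
```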
The combination of multiple SAR images facilitates the creation of an intricate oceanic scene representation, consequently enhancing the precision of decision-making in subsequent tasks. Nonetheless, the efficacy of image fusion critically depends on the stringent geometric alignment of imagettes. Hence, we design and implement a feature-based image registration method to establish an accurate correlation between imagettes and assess the spatial transformation accordingly. In our design, we assume that the spatial resolution of both images is known and remains constant in our scenario. Because pixel-by-pixel computations and variable-resolution correlation can result in a time-intensive implementation, we initially conduct a normalized cross-correlation between the images at a reduced resolution, aiming to approximately delineate the common area of overlap between the two images. High-resolution correlation is subsequently assessed in close proximity to each potential registration point. The variables $IM_1$ and $IM_2$ denote two SAR images, both with dimensions $M \times N$, and $f$ represents the image zoom factor. Let $IM_{l1}$ and $IM_{l2}$ denote the low-resolution versions of the two SAR images with dimensions $M_l \times N_l$. The edged versions of the corresponding images are represented by $IM_{1e}$, $IM_{2e}$, $IM_{l1e}$, and $IM_{l2e}$, respectively. In this context, the generalized cross-correlation can be mathematically expressed as follows:
$$\phi_1(u,v) = \frac{\sum_{i=0}^{M_l}\sum_{j=0}^{N_l} IM_{l1e}^{\,u,v}(i,j)\, IM_{l2e}(i,j)}{\sigma_{1l}^{\,u,v}\,\sigma_{2l}}, \tag{8}$$
$$\phi(k,l) = \frac{\sum_{i=0}^{M}\sum_{j=0}^{N} IM_{1e}^{\,k,l}(i,j)\, IM_{2e}(i,j)}{\sigma_{1}^{\,k,l}\,\sigma_{2}}, \tag{9}$$
where the variable $\phi_1(u,v)$ represents the cross-correlation coefficient derived from the low-resolution image pair $IM_{l1e}$ and $IM_{l2e}$. The pair $(u,v)$ denotes the coordinate index of the image $IM_{l1e}$. The image $IM_{l1e}^{\,u,v}$ is a sub-image situated at the $(u,v)$-th position within the image $IM_{l2e}$, with dimensions identical to those of image $IM_{l2}$. $\sigma_{1l}^{\,u,v}$ and $\sigma_{2l}$ stand for the standard deviations of the respective images. Similarly, $\phi(k,l)$ corresponds to the cross-correlation coefficient calculated from the full-resolution image pair $IM_{1e}$ and $IM_{2e}$, where $(k,l)$ signifies the coordinate index of the image $IM_{1e}$. The image $IM_{1e}^{\,k,l}$ is an image segment located at the $(k,l)$-th pixel of $IM_2$, with dimensions matching those of $IM_{2e}$. $\sigma_{1}^{\,k,l}$ and $\sigma_{2}$ denote the standard deviations of the corresponding images.
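A simplified coarse-to-fine search in the spirit of Equations (8)–(9) is sketched below. It restricts the transformation to non-negative integer translations and uses simple subsampling for the low-resolution stage; these are assumptions made for brevity, not the paper's full registration procedure.

```python
import numpy as np

def ncc(a, b):
    """Normalized cross-correlation of two equally sized patches."""
    a = a - a.mean()
    b = b - b.mean()
    denom = a.std() * b.std() * a.size
    return float((a * b).sum() / denom) if denom > 0 else 0.0

def register(im1, im2, factor=4, radius=2):
    """Score candidate shifts of im1 relative to im2 on downsampled (edge) images,
    then refine around the best candidate at full resolution."""
    lo1, lo2 = im1[::factor, ::factor], im2[::factor, ::factor]
    h, w = lo1.shape
    coarse = max(((ncc(lo1[du:, dv:], lo2[:h - du, :w - dv]), du, dv)
                  for du in range(h // 4) for dv in range(w // 4)), key=lambda t: t[0])
    cu, cv = coarse[1] * factor, coarse[2] * factor
    H, W = im1.shape
    fine = max(((ncc(im1[du:, dv:], im2[:H - du, :W - dv]), du, dv)
                for du in range(max(0, cu - radius), cu + radius + 1)
                for dv in range(max(0, cv - radius), cv + radius + 1)), key=lambda t: t[0])
    return fine[1], fine[2]   # estimated (row, column) offset
```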
In the context of oceanic phenomenon detection, the extraction of edge information is significant for capturing structural features [59]. The basic idea of the image edge filter is to detect changes in image intensity by using a discrete differentiation operator [60]. To preserve more useful oceanic characteristics in SAR imagettes, we modify the standard Sobel operator to enhance the correlation between two fused SAR images [61]. In this study, we utilize four new operators instead of the standard two operators for retaining global edges and disregarding smaller edges as follows (Table 1):
$$S_1 = (\Delta_y)^T = \begin{bmatrix} 1 & 0 & -1 \\ 2 & 0 & -2 \\ 1 & 0 & -1 \end{bmatrix}, \quad S_2 = (S_1)^T = \Delta_y = \begin{bmatrix} 1 & 2 & 1 \\ 0 & 0 & 0 \\ -1 & -2 & -1 \end{bmatrix}, \tag{10}$$
$$S_3 = \begin{bmatrix} 2 & 1 & 0 \\ 1 & 0 & -1 \\ 0 & -1 & -2 \end{bmatrix}, \quad S_4 = \begin{bmatrix} 0 & 1 & 2 \\ -1 & 0 & 1 \\ -2 & -1 & 0 \end{bmatrix}. \tag{11}$$
By convolving the four operators with the original images, we obtain:
$$E_k(i,j) = \sum_{m=1}^{3}\sum_{n=1}^{3} IM(i+m-1,\, j+n-1)\, S_k(m,n), \quad k = 1, 2, 3, 4, \tag{12}$$
$$E(i,j) = \max_{k}\left(E_k(i,j)\right). \tag{13}$$
In Equation (13), $E(i,j)$ stands for the selected edge response at the point $(i,j)$ in the whole image $IM$. Hence, a better correlation of the two fused images can be achieved, in contrast to the correlation between them generated from the standard Sobel operators or other edge detection methods.
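The sketch below applies the four operators of Equations (10)–(13); the sign pattern of the kernels follows the usual Sobel convention and the use of absolute responses before taking the maximum is an assumption made for illustration.

```python
import numpy as np
from scipy.ndimage import convolve

# Four directional operators (horizontal, vertical, and two diagonals).
S1 = np.array([[1, 0, -1], [2, 0, -2], [1, 0, -1]])
S2 = S1.T
S3 = np.array([[2, 1, 0], [1, 0, -1], [0, -1, -2]])
S4 = np.array([[0, 1, 2], [-1, 0, 1], [-2, -1, 0]])

def edge_response(img):
    """Convolve the image with all four operators and keep the strongest
    absolute response per pixel, as in Equations (12)-(13)."""
    responses = [np.abs(convolve(img.astype(np.float64), k)) for k in (S1, S2, S3, S4)]
    return np.max(responses, axis=0)
```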

3.2. Data Augmentation

A sufficient amount of training data is crucial to obtaining state-of-the-art results in deep learning-based image classification tasks. Unfortunately, many application scenarios, such as oceanography and medicine [62,63], cannot ensure enough samples for algorithm training purposes. Data augmentation has proven to be an effective way not only to expand the quantity of data but also to improve the quality of the training process by diversifying the dataset. In this study, we address the issue of limited sample size in the TenGeoP-SARwv dataset for ocean phenomenon detection by employing image warping and oversampling techniques. When utilizing SAR images for training models, we encountered a challenge due to the scarcity of samples. The shortage of training samples poses obstacles to the model in capturing robust feature representations, ultimately impacting its ability to generalize effectively. This limitation may lead to suboptimal predictive performance, especially for categories with insufficient sample representation.
In this study, we tackle the challenges of limited samples and class imbalance, both critical aspects of effective model training. Abundant samples are essential for robust model training and for mitigating the risk of overfitting. To enhance dataset diversity, we first employ data warping techniques. Data warping refers to altering existing samples; typical operations include geometric transformations, random cropping, and other alterations. This type of method is straightforward and effective, and a random cropping algorithm is designed for this investigation. Parameters such as the maximum crop size and position are first defined as constraints. Then, the starting point and destination point are randomly generated to reconstruct a new sample. In the majority of interactive warping systems, the researchers specify the warp parameters in a broadly defined manner or indicate a point-to-point correspondence. The automated system then conducts geometric interpolation procedures to generate a cartographic representation based on the specified geometric parameters, with a mapping $M_s: D^2 \to D^2$ of the plane to itself. We apply the mapping to our input SAR images $I(x,y)$ and define the output images as $O(x,y) = I(M_s^{-1}(x,y))$, where $M_s^{-1}$ is the functional inverse of $M_s$. Note that $M_s$ does not depend on the input $I$, and the geometric specification $S$ is based on SAR image features. To obtain $M_s$ from the specification $S$, we compute the unique thin-plate spline satisfying $M_s$ with the solution $u$ as follows:
$$\arg\min_{u} \iint_{(x,y)\in D^2} \left(u_{xx}^2 + 2u_{xy}^2 + u_{yy}^2\right) dx\, dy, \tag{14}$$
where $u_{xx}$, $u_{xy}$, and $u_{yy}$ represent $\frac{\partial^2 u}{\partial x^2}$, $\frac{\partial^2 u}{\partial x \partial y}$, and $\frac{\partial^2 u}{\partial y^2}$, respectively. In fact, $M_s$ is independent of the input $I$, which can cause suboptimal outcomes. The mapping function ensures the functional minimization across the specified domain. The mapping procedure serves as an intermediate component and is then applied to image $I$ after iterative computation. To enhance the robustness and separability of SAR images, we adopt contemporary warping procedures that treat $M_s = (M_s^X, M_s^Y)$ as a pair of bivariate real functions handled independently. When invoking the thin-plate regularization paradigm, the procedure divides the interpolation constraints $S$ into x-constraints and y-constraints and calculates the two mapping functions $M_s^X$ and $M_s^Y$ to minimize the warping functional of Equation (14). To further squeeze the edges of SAR images while stretching other areas, we use the following image warp $M_{S,I}(x,y)$ to optimize the image quality:
$$\min_{u} \iint_{(x,y)\in[0,1]^2} \left[u_x^2 + u_y^2 + \mu\left(u(x,y) - m(x,y)\right)^2\right]\phi(x,y)\, dx\, dy. \tag{15}$$
For the x-component, we replace $m(x,y)$ with $m_x$ and set $\phi(x,y) = 1 + \lambda E_x(x,y)$. For the y-component, we have $m(x,y) = m_y$ and $\phi(x,y) = 1 + \lambda E_y(x,y)$. $E_x(x,y)$ and $E_y(x,y)$ are the x- and y-components, respectively, of the Canny edge detector response [59].
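The random cropping part of the data-warping strategy described above can be sketched very simply; the maximum crop fraction and the nearest-neighbour rescaling are illustrative assumptions rather than the exact parameters used in this study.

```python
import numpy as np

def random_crop_resize(img, max_crop=0.2, seed=None):
    """Randomly crop up to `max_crop` of each side, then rescale back to the
    original size with nearest-neighbour indexing to create a new sample."""
    rng = np.random.default_rng(seed)
    h, w = img.shape[:2]
    top = rng.integers(0, int(h * max_crop) + 1)
    left = rng.integers(0, int(w * max_crop) + 1)
    bottom = h - rng.integers(0, int(h * max_crop) + 1)
    right = w - rng.integers(0, int(w * max_crop) + 1)
    crop = img[top:bottom, left:right]
    rows = np.linspace(0, crop.shape[0] - 1, h).astype(int)
    cols = np.linspace(0, crop.shape[1] - 1, w).astype(int)
    return crop[np.ix_(rows, cols)]
```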
To adjust the sample distribution across all classes, we apply an oversampling technique. Oversampling approaches generate synthetic instances to further benefit this task; typical techniques include image mixing, feature augmentation, and deep learning-based methods such as generative adversarial networks (GANs) [63]. For the sake of diversity, we choose GANs as our data oversampling method. This strategy for data augmentation follows a typical neural network development process of data preparation, training, and generating new samples. To perform oversampling on a SAR dataset, an initial step involves training a generative adversarial network (GAN) to estimate the underlying distribution of the oceanic data. Subsequently, the trained generator component of the GAN is employed to generate supplementary samples specifically focused on augmenting the representation of the minority classes. In this design, we use a GAN to model the conditional distribution $P(y \mid x)$. The GAN facilitates the training process by enabling robust discriminators (D) that yield informative gradients to the generator, even in scenarios where the quality of the generated samples is suboptimal. This characteristic enhances training stability, contributing to more effective and reliable model convergence. To further optimize the GAN's loss function for categorical variables in GAN-based oversampling, we adopt the Wasserstein-1 distance (WD) by modifying the GAN objective to the following equation:
$$\min_{G}\max_{D}\; \mathbb{E}_{x \sim p_{\mathrm{data}}}\!\left[D(x)\right] - \mathbb{E}_{y \sim p_{y}}\!\left[D(G(y))\right] - \lambda\, \mathbb{E}_{\hat{x} \sim p_{\hat{x}}}\!\left[\left(\left\|\nabla_{\hat{x}} D(\hat{x})\right\|_2 - 1\right)^2\right]. \tag{16}$$
In this model, the generator $G$ produces synthetic data samples that are indiscernible from authentic data, and the discriminator $D$ differentiates between authentic instances from the training dataset and synthetic instances generated by the generative model $G$. The variable $\lambda$ is the penalty coefficient. $D$ is a k-Lipschitz function, and this Lipschitz continuity condition is ensured through the process of clipping [64], where the weights of $D$ are constrained to a bounded interval $[-c, c]$. The gradients are computed based on linear interpolations $\hat{x} \sim p_{\hat{x}}$ between actual and synthetic samples, where $p_{\hat{x}}$ represents the probability distribution of these linear interpolations.
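The gradient-penalty term of Equation (16) can be written in a few lines of PyTorch; the penalty coefficient of 10 and the 4-D image layout are common defaults assumed here, not values stated in the paper.

```python
import torch

def gradient_penalty(discriminator, real, fake, lam=10.0):
    """Evaluate D on random linear interpolations between real and generated
    samples and penalise deviations of the gradient norm from 1."""
    eps = torch.rand(real.size(0), 1, 1, 1, device=real.device)
    x_hat = (eps * real + (1.0 - eps) * fake).requires_grad_(True)
    d_hat = discriminator(x_hat)
    grads = torch.autograd.grad(outputs=d_hat.sum(), inputs=x_hat, create_graph=True)[0]
    grad_norm = grads.flatten(start_dim=1).norm(2, dim=1)
    return lam * ((grad_norm - 1.0) ** 2).mean()
```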

3.3. Multichannel Feature Extraction

Despite previous endeavors to augment the dataset, the imperative remains to devise an innovative feature extraction architecture. While employing SAR images for identifying ocean phenomena, we confronted the challenge of the model lacking robustness. The model proved sensitive to the impact of noise and abnormal samples. Given the constraints of the constructed dataset with limited original training data, the construction of a medium-sized network, such as Inception-ResNet, becomes indispensable to fulfill the demands of multichannel feature extraction.
To capture the structural characteristics of ocean SAR imagettes, we implement the Inception-ResNet architecture, which is widely recognized for its intricate multi-branch design, shown in Figure 5. As depicted in Figure 5, the data first pass through a stem network composed of pooling layers and convolutional layers. They then undergo dimension reduction in the network's Inception-ResNet and reduction parts, which serve to reduce the spatial dimensions.
After feature extraction achieved by the aforementioned structure, the output section maps the one-dimensional vector to the predicted results. The stem configuration assumes a critical function in both feature extraction and the consequential reduction in spatial dimensions in the input. Within this network, a series of convolutional layers and filters are integrated through concatenation, indicating the network’s capacity for complicated feature representation. Inception-ResNet architectures denoted as A, B, and C in Figure 5 consist of both pure and residual inception blocks. The pure configuration, characterized by a split–transform–merge architecture, endows a potent representational capability owing to the presence of dense layers.
To expedite gradient propagation through multiple layers, we leverage the residual function, renowned for its efficacy in facilitating information flow and accelerating gradient propagation through layers. It achieves this by approximating a mapping function from the input:
$$y = F(x) + x. \tag{17}$$
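A minimal residual unit illustrating Equation (17) is given below; the two-convolution body is an illustrative choice, whereas Inception-ResNet wraps multi-branch Inception blocks with the same identity shortcut.

```python
import torch
import torch.nn as nn

class ResidualUnit(nn.Module):
    """y = F(x) + x: a residual mapping with an identity shortcut that eases
    gradient flow through many layers."""
    def __init__(self, channels: int):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.body(x) + x   # identity shortcut added to the learned residual
```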
Following the establishment of the MCNN architecture, it becomes imperative to specify the loss function, quantifying the disparity between the model predictions and the ground truth associated with the input data. In order to mitigate potential class imbalance issues inherent in the classification task, a weighted balanced cross-entropy loss function, as proposed by [21], is adopted for training purposes in Equation (18):
$$L_X(y, \hat{y}) = -\frac{1}{M}\sum_{i=1}^{M}\sum_{j=1}^{C} \theta_j\, y_{ij} \log \hat{y}_{ij}. \tag{18}$$
Let $y$ and $\hat{y}$ represent the true label and the label predicted by the neural network, respectively, on the training dataset $X$ sampled from the TenGeoP-SARwv database. For the true label, we assign numerical values to the categorical classes (i.e., AF → 0, BS → 1, IB → 2, LWA → 3, MCC → 4, OF → 5, PW → 6, RC → 7, SI → 8, WS → 9, shown in Figure 2). For the predicted label, the whole MCNN model takes input data and produces output predictions between 0 and 9. Here, $M$ signifies the total number of samples in $X$, and $C$ denotes the number of distinct oceanic phenomenon types. Additionally, $\theta_j$ denotes the weight assigned to the samples belonging to the $j$-th class. The computation of a loss value for the entire set $X$ is a computationally intensive task. To address this challenge, a transfer-learning strategy is implemented in this study. Specifically, half of the layers within the Inception-ResNet architecture are kept in a frozen state. Subsequently, the parameters $\theta_j^t$ of the unfrozen layers and the final classifier layer undergo iterative optimization via the gradient descent method of Equation (19), using the Adam optimization algorithm, as described in [65]:
$$\theta_j^t = \theta_j^{t-1} - \alpha\, \frac{\hat{m}_t}{\sqrt{\hat{v}_t} + \varepsilon}. \tag{19}$$
Here, the terms $\hat{m}_t$ and $\hat{v}_t$ represent the bias-corrected first and second moments, respectively, and are iteratively updated according to the following expression (20):
$$\hat{m}_t = \frac{\beta_1 m_{t-1} + (1 - \beta_1)\, g_t}{1 - \beta_1^{\,t}}. \tag{20}$$
Both $\beta_1$ and $\beta_2$ represent the exponential decay rates associated with the moment estimates. The gradient $g_t$ can be calculated using the stochastic objective function $f(\theta)$:
$$g_t = \nabla_{\theta} f_t(\theta_j^{t-1}). \tag{21}$$
Appropriate parameters for the classification of ocean SAR imagettes have been determined through a series of experiments, resulting in the optimal settings of $\varepsilon = 10^{-8}$, $\alpha = 10^{-3}$, $\beta_1 = 0.9$, and $\beta_2 = 0.999$. The choice of employing the Adam optimization algorithm is grounded in its efficacy in handling nonstationary objectives and effectively addressing challenges associated with highly noisy and sparse gradients commonly encountered in the context of ocean SAR imagery.
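A sketch of the transfer-learning setup is shown below: roughly half of a pretrained backbone is frozen and the rest is trained with Adam using the hyperparameters listed above. Since torchvision ships no Inception-ResNet, Inception-v3 stands in here purely as an illustrative proxy.

```python
import torch
from torchvision import models

# Pretrained backbone with the final classifier replaced for the ten phenomena.
backbone = models.inception_v3(weights="IMAGENET1K_V1")
backbone.fc = torch.nn.Linear(backbone.fc.in_features, 10)

# Freeze roughly the first half of the parameters (transfer-learning assumption).
params = list(backbone.named_parameters())
for _, p in params[: len(params) // 2]:
    p.requires_grad = False

# Adam with the hyperparameters reported in the text.
optimizer = torch.optim.Adam(
    (p for p in backbone.parameters() if p.requires_grad),
    lr=1e-3, betas=(0.9, 0.999), eps=1e-8)
```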
Following the extraction of multichannel features using the Inception-ResNet architecture, feature concentration is performed to derive the classification outcomes. In this investigation, the softmax function is employed, leveraging its capacity to acquire discriminative yet generative compact vector representations for image classification, as discussed in prior work [66]. Let there be $C$ classes in the assessment dataset. Denoting the original image input as $K$ and the MCNN parameters as $\theta$, the softmax function is applied to ascertain the probability of $y$ belonging to the $k$-th class:
$$P(y_k = 1 \mid K, \theta) = \frac{e^{K^T \theta_k}}{\sum_{j=1}^{C} e^{K^T \theta_j}}. \tag{22}$$
Here, $P(y_k = 1 \mid K, \theta)$ represents the probability of the image belonging to the $k$-th class, $e$ denotes the exponential function, and $\theta_k$ signifies the parameters corresponding to the $k$-th class. This formulation allows for the derivation of a probability distribution over the multiple classes for effective image classification. For inputs $K$ not falling within the confines of any predefined class ($P(y_k = 1 \mid K, \theta) < 0.5$ for $k = 1, 2, \ldots, C$), a categorization process is implemented to assign such inputs to a distinct and specific category.
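The decision rule of Equation (22) combined with the rejection threshold described above can be sketched as follows; the class ordering follows the label mapping given earlier, and the name of the reject category is an assumption.

```python
import numpy as np

CLASSES = ["AF", "BS", "IB", "LWA", "MCC", "OF", "PW", "RC", "SI", "WS"]

def classify(logits):
    """Softmax over the class scores plus the rejection rule: if no class reaches
    probability 0.5, the imagette is assigned to a separate category."""
    z = logits - logits.max()              # shift for numerical stability
    probs = np.exp(z) / np.exp(z).sum()
    k = int(probs.argmax())
    return CLASSES[k] if probs[k] >= 0.5 else "unclassified"
```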

4. Experimental Evaluation

To assess the performance of our MCNN method, we choose AlexNet [42] as the baseline due to its noteworthy achievements in image classification tasks. Additionally, for implementing comparative analysis, we incorporate another popular method, CMwv [48], to evaluate the effectiveness of the MCNN technique. In order to ensure an equitable comparison between MCNN and the two alternative CNN-based methodologies, as delineated in Section 4.1, we adopt identical training methodologies, following the protocols introduced by Wang et al. [48]. A subset of 320 images per class is randomly selected as input from the annotated TenGeoP-SARwv dataset, categorized by WV1 and WV2 modes. It is pertinent to acknowledge that the input size is relatively modest in comparison to the overall dataset. Seventy percent (70%) and thirty percent (30%) of the input dataset are designated for the training and validation subsets, respectively, in accordance with the previously outlined methodologies for feature extraction and neural network weight optimization. A distinct evaluation dataset, consisting of 5000 vignettes captured under the WV1 and WV2 modes, has been established to quantitatively evaluate the effect of different classification models. Subsequently, a geophysical application is exemplified through the utilization of a classified map depicting rain cells (RCs). This map is then juxtaposed with precipitation data obtained from the Global Precipitation Measurement (GPM) Program (available on the NASA data archive website: https://pmm.nasa.gov/data-access/downloads/gpm (accessed on 7 February 2024)).

4.1. Experimental Results of Ocean SAR Scene Classification

The TenGeoP-SARwv dataset has been employed for training our MCNN algorithm through a series of iterations exceeding 2000. As depicted in Figure 6, the comprehensive accuracy and loss metrics exhibit a notable and rapid evolution within the initial 270 iterations, stabilizing thereafter around the 1000th iteration. Consequently, the neural network weights derived from the training process up to 1100 iterations are incorporated into the ultimate configuration of the MCNN model.
Figure 7 illustrates the normalized confusion matrix obtained from the MCNN model. The diagonal and off-diagonal elements of the matrix correspond to accurately and inaccurately classified observations, respectively, normalized with respect to the total number of observations. The percentages indicated in the blue cells within the row (column) summaries on the right (bottom) pertain to recall (precision) metrics. Additionally, F-score parameters, as defined by Sokolova et al. [67], are employed as evaluation metrics, offering a comprehensive assessment considering both precision and recall statistics. The anticipated values for these three parameters are all in proximity to unity.
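The per-class metrics reported alongside the confusion matrix can be derived as sketched below; the row/column layout of the matrix (rows as true labels, columns as predictions) is an assumption about Figure 7, not a statement from the paper.

```python
import numpy as np

def per_class_metrics(cm):
    """Derive per-class precision, recall, and F-score from a confusion matrix
    whose rows are true labels and columns are predicted labels."""
    tp = np.diag(cm).astype(float)
    precision = tp / np.maximum(cm.sum(axis=0), 1)
    recall = tp / np.maximum(cm.sum(axis=1), 1)
    f_score = 2 * precision * recall / np.maximum(precision + recall, 1e-12)
    return precision, recall, f_score
```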
Table 2 provides a comparative analysis of distinct classification models based on metrics such as recall, precision, and F-score. The interpretation of the table is as follows:
(1)
Despite the established robustness of AlexNet in image classification, its performance in ocean SAR scene classification falls short of acceptability without the incorporation of sophisticated domain knowledge.
(2)
The experimental results evaluate the effectiveness of the CMwv model in accurately identifying biological slicks (BS), low-wind areas (LWA), microconvective cells (MCC), rain cells (RC), sea ice (SI), and windstreaks (WS). However, suboptimal results are evident when employing the CMwv model to detect other geophysical phenomena, with a notable instance being the lowest precision value recorded at 0.05 for the oceanic front (OF) class.
(3)
The MCNN algorithm demonstrates superior performance in terms of recall, precision, and F-score, all surpassing 0.9 for both WV1 and WV2 imagettes, with the exception of the LWA class. Despite the limited number of samples available for training, the MCNN algorithm leverages data augmentation and transfer-learning strategies effectively, mitigating the risk of model overfitting. Furthermore, the utilization of an image edge filter optimally encodes discriminant information present in SAR imagettes. On average, the MCNN algorithm yields recall, precision, and F-score values that are 7.9% (8.0%), 43.2% (35.7%), and 25.0% (21.7%) higher, respectively, compared to CMwv when applied to WV1 (WV2) imagette classification. These results signify the potential of the proposed CNN-based model as a viable operational tool for automated ocean SAR scene classification.

4.2. Geophysical Application

To further validate the robustness of our MCNN algorithm, this classification model was systematically applied to all SAR imagettes spanning the period from March 2016 to February 2017 within the TenGeoP-SARwv dataset. Subsequently, a geophysical map delineating detected RCs was generated and meticulously compared with precipitation data sourced from the GPM Program. Within the context of S-1 imagettes, a classified RC was defined as comprising several kilometer-scale semicircular to circular-shaped patches, characteristic of rain downdrafts in convective RCs, as elaborated in [50]. Over the specified timeframe, encompassing March 2016 to February 2017, it was observed that approximately 5% of all SAR imagettes were successfully classified as RCs by the MCNN algorithm. The cartographic outcome of the classified RC occurrences, as depicted in the central panel of Figure 8, indicates discernible spatial patterns. The geophysical map presenting the yearly averaged GPM precipitation, along with the classified outcomes derived from the CMwv algorithm, is concurrently displayed in the right and left panels, respectively. It is imperative to acknowledge that the GPM product is designed to furnish gridded global multisatellite precipitation estimates, resulting in notable disparities in coverage resolutions between the SAR-classified RC occurrences and the GPM precipitation results. Despite observable distinctions in extratropical regions, areas characterized by heightened RC occurrences, as identified by the MCNN and CMwv algorithms, exhibit significant alignment with regions characterized by elevated GPM precipitation, particularly within the Intertropical Convergence Zone and South Pacific Convergence Zone. This concordance is consistent with the precipitation climatology outlined in prior studies [68,69].

5. Conclusions

To address the challenges associated with low efficiency in global ocean SAR scene classification, a novel robust classification model, denoted MCNN, that included multi-feature fusion, data augmentation, and multichannel feature extraction was successfully developed based on the Inception-ResNet CNN architecture. In a departure from conventional deep learning methodologies, a meticulously selected image edge filter was employed to distinguish distinct oceanic phenomena. Furthermore, several data augmentation and feature extraction techniques were implemented to mitigate the limitations posed by small-sample training datasets. The model performance was rigorously validated on a meticulously annotated dataset [70], focusing on the detection of ten geophysical phenomena prevalent on the ocean surface. Comparative analyses were conducted against classical and state-of-the-art methods. Additionally, our MCNN algorithm was applied to generate a geophysical map of RCs, demonstrating acceptable consistency with precipitation data obtained from the GPM Program. These results underscore the potential utility of the MCNN tool in obtaining global/regional and annual/seasonal statistics of diverse geophysical phenomena and enhancing specific aspects of numerical ocean models. While this study exclusively utilized S-1 WV SAR imagettes, the adaptability of the MCNN algorithm to other SAR data archives, such as GF-3 and TerraSAR-X, is acknowledged.
Despite the notable contributions of this study, it is essential to acknowledge certain limitations. The analysis is constrained to SAR images featuring ten common types of ocean phenomena, and there may be occurrences beyond the scope of these datasets that could influence the generalizability of the findings. Moreover, the effectiveness of the proposed methods may vary in the context of diverse datasets or under conditions not represented in the current study. As a result, the robustness and accuracy of the model may be limited when recognizing ocean phenomena beyond the scope of our dataset. These limitations emphasize the importance of future research to explore a more comprehensive range of datasets and scenarios for a thorough understanding of the model’s capabilities.
In future studies, our objective is to augment the dataset by including SAR images of additional ocean phenomena, thereby bolstering the model’s robustness. The expansion of the dataset will enable us to tackle challenges associated with imbalanced categories and enhance the model’s adaptability to new samples. This approach seeks to improve the model’s generalization capabilities and contribute to more effective and comprehensive recognition of diverse ocean phenomena.
This methodology holds potential for broader applications, such as weather prediction and hazard forecasting, through improved classification rates. The enhanced accuracy of recognition and classification of ocean phenomena in SAR images may contribute more reliable inputs to weather prediction models in data-scarce regions, ultimately leading to increased precision in forecasting and hazard prediction.

Author Contributions

The authors confirm contribution to the paper as follows: conceptualization, C.B. and S.Z.; methodology, C.B. and S.Z.; software, S.Z. and J.W.; validation, S.Z., C.B. and X.W.; formal analysis, C.L.; investigation, J.W.; resources, C.B. and S.Z.; data curation, C.L. and J.W.; writing—original draft preparation, C.B. and S.Z.; writing—review and editing, X.W., J.W. and C.L.; visualization, J.W.; supervision, C.L.; project administration, X.W.; funding acquisition, C.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work is funded by the National Natural Science Foundation of China, Natural Science Foundation of Qingdao, and Natural Science Foundation of Shandong Province under No. 42106200, No. 20230921, No. ZR2023QF042 and No. ZR2020QF058, respectively.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
SAR   Synthetic Aperture Radar
MCNN   Multichannel Neural Network
CNN   Convolutional Neural Network
WM   Wave Mode
ILSVRC   ImageNet Large-Scale Visual Recognition Challenge
OF   Oceanic Front
CMwv   Sentinel-1 WV SAR vignette classification model
PW   Pure Ocean Waves
S-1   Sentinel-1
ESA   European Space Agency
AF   Atmospheric Front
BS   Biological Slicks
IB   Icebergs
LWA   Low-Wind Areas
MCC   Microconvective Cells
RC   Rain Cells
SI   Sea Ice
WS   Windstreaks
NRCS   Normalized Radar Cross Section
IFREMER   French Research Institute for the Exploitation of the Sea
PDF   Probability Density Function
CDF   Cumulative Density Function
GPM   Global Precipitation Measurement
GAN   Generative Adversarial Network
D   Discriminators
G   Generators
WD   Wasserstein-1 Distance

References

  1. Korosov, A.A.; Rampal, P. A Combination of Feature Tracking and Pattern Matching with Optimal Parametrization for Sea Ice Drift Retrieval from SAR Data. Remote Sens. 2017, 9, 258. [Google Scholar] [CrossRef]
  2. Shao, W.; Lai, Z.; Nunziata, F.; Buono, A.; Jiang, X.; Zuo, J. Wind Field Retrieval with Rain Correction from Dual-Polarized Sentinel-1 SAR Imagery Collected during Tropical Cyclones. Remote Sens. 2022, 14, 5006. [Google Scholar] [CrossRef]
  3. Bian, X.; Shao, Y.; Zhang, C.; Xie, C.; Tian, W. The feasibility of assessing swell-based bathymetry using SAR imagery from orbiting satellites. ISPRS J. Photogramm. Remote Sens. 2020, 168, 124–130. [Google Scholar] [CrossRef]
  4. Fang, H.; Xie, T.; Perrie, W.; Zhao, L.; Yang, J.; He, Y. Ocean wind and current retrievals based on satellite SAR measurements in conjunction with buoy and HF radar data. Remote Sens. 2017, 9, 1321. [Google Scholar] [CrossRef]
  5. Wineteer, A.; Perkovic-Martin, D.; Monje, R.; Rodríguez, E.; Gál, T.; Niamsuwan, N.; Nicaise, F.; Srinivasan, K.; Baldi, C.; Majurec, N.; et al. Measuring winds and currents with Ka-band Doppler scatterometry: An airborne implementation and progress towards a spaceborne mission. Remote Sens. 2020, 12, 1021. [Google Scholar] [CrossRef]
  6. Amoon, M.; Bozorgi, A.; Rezai-rad, G.A. New method for ship detection in synthetic aperture radar imagery based on the human visual attention system. J. Appl. Remote Sens. 2013, 7, 071599. [Google Scholar] [CrossRef]
  7. Wang, Y.R.; Li, X.M. Arctic sea ice cover data from spaceborne SAR by deep learning. Earth Syst. Sci. Data Discuss 2020, 2020, 1–30. [Google Scholar]
  8. Zhang, H.; Wang, T.; Liu, M.; Jia, M.; Lin, H.; Chu, L.; Devlin, A.T. Potential of combining optical and dual polarimetric SAR data for improving mangrove species discrimination using rotation forest. Remote Sens. 2018, 10, 467. [Google Scholar] [CrossRef]
  9. Lv, Q.; Dou, Y.; Niu, X.; Xu, J.; Xu, J.; Xia, F. Urban land use and land cover classification using remotely sensed SAR data through deep belief networks. J. Sens. 2015, 2015, 538063. [Google Scholar] [CrossRef]
  10. Benassai, G.; Di Luccio, D.; Corcione, V.; Nunziata, F.; Migliaccio, M. Marine spatial planning using high-resolution synthetic aperture radar measurements. IEEE J. Ocean. Eng. 2018, 43, 586–594. [Google Scholar] [CrossRef]
  11. Alizadeh Zakaria, Z.; Ebadi, H.; Farnood Ahmadi, F. Investigation of the application of geospatial artificial intelligence for integration of earthquake precursors extracted from remotely sensed SAR and thermal images for earthquake prediction. Multimed. Tools Appl. 2023, 82, 22853–22870. [Google Scholar] [CrossRef]
  12. Alaba, S.Y.; Nabi, M.M.; Shah, C.; Prior, J.; Campbell, M.D.; Wallace, F.; Ball, J.E.; Moorhead, R. Class-Aware Fish Species Recognition Using Deep Learning for an Imbalanced Dataset. Sensors 2022, 22, 8268. [Google Scholar] [CrossRef]
  13. Tian, W.; Fang, L.; Li, W.; Ni, N.; Wang, R.; Hu, C.; Liu, H.; Luo, W. Deep-Learning-Based Multiple Model Tracking Method for Targets with Complex Maneuvering Motion. Remote Sens. 2022, 14, 3276. [Google Scholar] [CrossRef]
  14. Hussain, M.A.; Chen, Z.; Zheng, Y.; Zhou, Y.; Daud, H. Deep Learning and Machine Learning Models for Landslide Susceptibility Mapping with Remote Sensing Data. Remote Sens. 2023, 15, 4703. [Google Scholar] [CrossRef]
  15. Singh, P.; Diwakar, M.; Shankar, A.; Shree, R.; Kumar, M. A Review on SAR Image and its Despeckling. Arch. Comput. Methods Eng. 2021, 28, 4633–4653. [Google Scholar] [CrossRef]
  16. Zhu, X.X.; Montazeri, S.; Ali, M.; Hua, Y.; Wang, Y.; Mou, L.; Shi, Y.; Xu, F.; Bamler, R. Deep learning meets SAR: Concepts, models, pitfalls, and perspectives. IEEE Geosci. Remote Sens. Mag. 2021, 9, 143–172. [Google Scholar] [CrossRef]
  17. Zhang, Y.; Wang, C.; Chen, J.; Wang, F. Shape-Constrained Method of Remote Sensing Monitoring of Marine Raft Aquaculture Areas on Multitemporal Synthetic Sentinel-1 Imagery. Remote Sens. 2022, 14, 1249. [Google Scholar] [CrossRef]
  18. Scafetta, N. Distribution of the SARS-CoV-2 pandemic and its monthly forecast based on seasonal climate patterns. Int. J. Environ. Res. Public Health 2020, 17, 3493. [Google Scholar] [CrossRef]
  19. Doin, M.P.; Lasserre, C.; Peltzer, G.; Cavalié, O.; Doubre, C. Corrections of stratified tropospheric delays in SAR interferometry: Validation with global atmospheric models. J. Appl. Geophys. 2009, 69, 35–50. [Google Scholar] [CrossRef]
  20. Fracastoro, G.; Magli, E.; Poggi, G.; Scarpa, G.; Valsesia, D.; Verdoliva, L. Deep learning methods for synthetic aperture radar image despeckling: An overview of trends and perspectives. IEEE Geosci. Remote Sens. Mag. 2021, 9, 29–51. [Google Scholar] [CrossRef]
  21. Yan, Z.; Chong, J.; Zhao, Y.; Sun, K.; Wang, Y.; Li, Y. Multifeature fusion neural network for oceanic phenomena detection in SAR images. Sensors 2019, 20, 210. [Google Scholar] [CrossRef]
  22. Zhang, G.; Perrie, W. Dual-Polarized Backscatter Features of Surface Currents in the Open Ocean during Typhoon Lan (2017). Remote Sens. 2018, 10, 875. [Google Scholar] [CrossRef]
  23. Huang, Z.; Pan, Z.; Lei, B. Transfer learning with deep convolutional neural network for SAR target classification with limited labeled data. Remote Sens. 2017, 9, 907. [Google Scholar] [CrossRef]
  24. Xie, D.; Ma, J.; Li, Y.; Liu, X. Data augmentation of sar sensor image via information maximizing generative adversarial net. In Proceedings of the 2021 IEEE 4th International Conference on Electronic Information and Communication Technology (ICEICT), Xi’an, China, 18–20 August 2021; pp. 454–458. [Google Scholar]
  25. Ji, Y.; Xu, G.; Dong, C.; Yang, J.; Xia, C. Submesoscale eddies in the East China Sea detected from SAR images. Acta Oceanol. Sin. 2021, 40, 18–26. [Google Scholar] [CrossRef]
  26. Ning, J.; Sun, L.; Cui, H.; Lu, K.; Wang, J. Study on characteristics of internal solitary waves in the Malacca Strait based on Sentinel-1 and GF-3 satellite SAR data. Acta Oceanol. Sin. 2020, 39, 151–156. [Google Scholar] [CrossRef]
  27. Liu, C.; Li, Z.; Wu, Z.; Huang, L.; Zhang, P.; Li, G. An Unsupervised Snow Segmentation Approach Based on Dual-Polarized Scattering Mechanism and Deep Neural Network. IEEE Trans. Geosci. Remote Sens. 2023, 61, 4300614. [Google Scholar] [CrossRef]
  28. Lattari, F.; Gonzalez Leon, B.; Asaro, F.; Rucci, A.; Prati, C.; Matteucci, M. Deep learning for SAR image despeckling. Remote Sens. 2019, 11, 1532. [Google Scholar] [CrossRef]
  29. Ma, X.; Wang, C.; Yin, Z.; Wu, P. SAR image despeckling by noisy reference-based deep learning method. IEEE Trans. Geosci. Remote Sens. 2020, 58, 8807–8818. [Google Scholar] [CrossRef]
  30. Zhang, L.; Liu, Y.; Zhao, W.; Wang, X.; Li, G.; He, Y. Frequency-Adaptive Learning for SAR Ship Detection in Clutter Scenes. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5215514. [Google Scholar] [CrossRef]
  31. Wang, N.; Wang, Y.; Er, M.J. Review on deep learning techniques for marine object recognition: Architectures and algorithms. Control Eng. Pract. 2022, 118, 104458. [Google Scholar]
  32. Gan, Z.; Henao, R.; Carlson, D.; Carin, L. Learning deep sigmoid belief networks with data augmentation. In Artificial Intelligence and Statistics; PMLR: London, UK, 2015; pp. 268–276. [Google Scholar]
  33. Du, Y.; Song, W.; He, Q.; Huang, D.; Liotta, A.; Su, C. Deep learning with multi-scale feature fusion in remote sensing for automatic oceanic eddy detection. Inf. Fusion 2019, 49, 89–99. [Google Scholar] [CrossRef]
  34. Wang, C.; Mouche, A.; Tandeo, P.; Stopa, J.; Chapron, B.; Foster, R.; Vandemark, D. Automated geophysical classification of sentinel-1 wave mode sar images through deep-learning. In Proceedings of the IGARSS 2018—2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain, 22–27 July 2018; pp. 1776–1779. [Google Scholar]
  35. Torres, R.; Snoeij, P.; Geudtner, D.; Bibby, D.; Davidson, M.; Attema, E.; Potin, P.; Rommen, B.; Floury, N.; Brown, M.; et al. GMES Sentinel-1 mission. Remote Sens. Environ. 2012, 120, 9–24. [Google Scholar]
  36. Nejad, F.M.; Zakeri, H. An optimum feature extraction method based on wavelet–radon transform and dynamic neural network for pavement distress classification. Expert Syst. Appl. 2011, 38, 9442–9460. [Google Scholar]
  37. Engen, G.; Johnsen, H. SAR-ocean wave inversion using image cross spectra. IEEE Trans. Geosci. Remote Sens. 1995, 33, 1047–1056. [Google Scholar]
  38. Lloret, J.; Bosch, I.; Sendra, S.; Serrano, A. A Wireless Sensor Network for Vineyard Monitoring That Uses Image Processing. Sensors 2011, 11, 6165–6196. [Google Scholar] [CrossRef]
  39. Topouzelis, K.; Kitsiou, D. Detection and classification of mesoscale atmospheric phenomena above sea in SAR imagery. Remote Sens. Environ. 2015, 160, 263–272. [Google Scholar]
  40. Zhang, L.; Zhang, L.; Du, B. Deep learning for remote sensing data: A technical tutorial on the state of the art. IEEE Geosci. Remote Sens. Mag. 2016, 4, 22–40. [Google Scholar] [CrossRef]
  41. Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M.; et al. Imagenet large scale visual recognition challenge. Int. J. Comput. Vis. 2015, 115, 211–252. [Google Scholar] [CrossRef]
  42. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 2012, 25, 1–9. [Google Scholar]
  43. Dalal, N.; Triggs, B. Histograms of oriented gradients for human detection. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, CA, USA, 20–25 June 2005; Volume 1, pp. 886–893. [Google Scholar]
  44. Lowe, D.G. Object recognition from local scale-invariant features. In Proceedings of the Seventh IEEE International Conference on Computer Vision, Kerkyra, Greece, 20–27 September 1999; Volume 2, pp. 1150–1157. [Google Scholar]
  45. Ojala, T.; Pietikäinen, M.; Harwood, D. A comparative study of texture measures with classification based on featured distributions. Pattern Recognit. 1996, 29, 51–59. [Google Scholar] [CrossRef]
  46. Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1–9. [Google Scholar]
  47. Szegedy, C.; Ioffe, S.; Vanhoucke, V.; Alemi, A. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning. arXiv 2016, arXiv:1602.07261. [Google Scholar] [CrossRef]
  48. Wang, C.; Tandeo, P.; Mouche, A.; Stopa, J.E.; Gressani, V.; Longepe, N.; Vandemark, D.; Foster, R.C.; Chapron, B. Classification of the global Sentinel-1 SAR vignettes for ocean surface process studies. Remote Sens. Environ. 2019, 234, 111457. [Google Scholar] [CrossRef]
  49. Wang, C.; Vandemark, D.; Mouche, A.; Chapron, B.; Li, H.; Foster, R.C. An assessment of marine atmospheric boundary layer roll detection using Sentinel-1 SAR data. Remote Sens. Environ. 2020, 250, 112031. [Google Scholar] [CrossRef]
  50. Wang, C.; Mouche, A.; Tandeo, P.; Stopa, J.E.; Longépé, N.; Erhard, G.; Foster, R.C.; Vandemark, D.; Chapron, B. A labelled ocean SAR imagery dataset of ten geophysical phenomena from Sentinel-1 wave mode. Geosci. Data J. 2019, 6, 105–115. [Google Scholar] [CrossRef]
  51. Verspeek, J.; Stoffelen, A.; Verhoef, A.; Portabella, M. Improved ASCAT wind retrieval using NWP ocean calibration. IEEE Trans. Geosci. Remote Sens. 2012, 50, 2488–2494. [Google Scholar] [CrossRef]
  52. Cozzolino, D.; Parrilli, S.; Scarpa, G.; Poggi, G.; Verdoliva, L. Fast adaptive nonlocal SAR despeckling. IEEE Geosci. Remote Sens. Lett. 2013, 11, 524–528. [Google Scholar] [CrossRef]
  53. Arsenault, H.; April, G. Properties of speckle integrated with a finite aperture and logarithmically transformed. J. Opt. Soc. Am. 1976, 66, 1160–1163. [Google Scholar] [CrossRef]
  54. Lee, J.S. Digital image enhancement and noise filtering by use of local statistics. IEEE Trans. Pattern Anal. Mach. Intell. 1980, PAMI-2, 165–168. [Google Scholar] [CrossRef]
  55. Kong, N.S.P.; Ibrahim, H.; Hoo, S.C. A literature review on histogram equalization and its variations for digital image enhancement. Int. J. Innov. Manag. Technol. 2013, 4, 386. [Google Scholar] [CrossRef]
  56. Szegedy, C.; Vanhoucke, V.; Ioffe, S.; Shlens, J.; Wojna, Z. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2818–2826. [Google Scholar]
  57. Hasimoto-Beltran, R.; Canul-Ku, M.; Díaz Méndez, G.M.; Ocampo-Torres, F.J.; Esquivel-Trava, B. Ocean oil spill detection from SAR images based on multi-channel deep learning semantic segmentation. Mar. Pollut. Bull. 2023, 188, 114651. [Google Scholar] [CrossRef]
  58. Maini, R.; Aggarwal, H. Study and comparison of various image edge detection techniques. Int. J. Image Process. IJIP 2009, 3, 1–11. [Google Scholar]
  59. Zhang, D.; Gade, M.; Zhang, J. SAR eddy detection using mask-RCNN and edge enhancement. In Proceedings of the IGARSS 2020—2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, HI, USA, 26 September–2 October 2020; pp. 1604–1607. [Google Scholar]
  60. Spontón, H.; Cardelino, J. A review of classic edge detectors. Image Process. Line 2015, 5, 90–123. [Google Scholar] [CrossRef]
  61. Yang, W.; Wang, X.; Moran, B.; Wheaton, A.; Cooley, N. Efficient registration of optical and infrared images via modified Sobel edging for plant canopy temperature estimation. Comput. Electr. Eng. 2012, 38, 1213–1221. [Google Scholar] [CrossRef]
  62. Bolton, T.; Zanna, L. Applications of deep learning to ocean data inference and subgrid parameterization. J. Adv. Model. Earth Syst. 2019, 11, 376–399. [Google Scholar] [CrossRef]
  63. Shorten, C.; Khoshgoftaar, T.M. A survey on image data augmentation for deep learning. J. Big Data 2019, 6, 60. [Google Scholar] [CrossRef]
  64. Gulrajani, I.; Ahmed, F.; Arjovsky, M.; Dumoulin, V.; Courville, A.C. Improved training of wasserstein gans. Adv. Neural Inf. Process. Syst. 2017, 30, 1–11. [Google Scholar]
  65. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
  66. Chen, L.C.; Papandreou, G.; Kokkinos, I.; Murphy, K.; Yuille, A.L. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 40, 834–848. [Google Scholar] [CrossRef] [PubMed]
  67. Sokolova, M.; Lapalme, G. A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 2009, 45, 427–437. [Google Scholar] [CrossRef]
  68. Adler, R.F.; Huffman, G.J.; Chang, A.; Ferraro, R.; Xie, P.P.; Janowiak, J.; Rudolf, B.; Schneider, U.; Curtis, S.; Bolvin, D.; et al. The version-2 global precipitation climatology project (GPCP) monthly precipitation analysis (1979–present). J. Hydrometeorol. 2003, 4, 1147–1167. [Google Scholar] [CrossRef]
  69. Kidd, C. Satellite rainfall climatology: A review. Int. J. Climatol. J. R. Meteorol. Soc. 2001, 21, 1041–1066. [Google Scholar] [CrossRef]
  70. Wang, Z.; Du, L.; Mao, J.; Liu, B.; Yang, D. SAR target detection based on SSD with data augmentation and transfer learning. IEEE Geosci. Remote Sens. Lett. 2018, 16, 150–154. [Google Scholar] [CrossRef]
Figure 1. Comparison of an original SAR image with its denoised version. Image (a), depicting an aerial perspective of the terrain, represents the original version sourced from the TerraSAR-X dataset. This image is characterized by severe speckle noise, impairing the clarity of image features. In contrast, image (b) showcases the denoised version, revealing more intricate details and highlighting features such as edges and texture. The comparison indicates that the denoised image exhibits enhanced features, making it better suited for utilization in image recognition tasks [20].
Figure 2. Ten illustrative instances of expertly delineated geophysical phenomena in the form of imagettes: (aj) show atmospheric fronts (AFs), biological slicks (BSs), icebergs (IBs), low-wind areas (LWAs), microconvective cells (MCCs), oceanic fronts (OFs), pure ocean waves (PWs), rain cells (RCs), sea ice (SI), and windstreaks (WSs), respectively.
Figure 3. Comprehensive coverage of SAR data across diverse geographical regions for the TenGeoP-SARwv dataset under VV polarization in 2016 with WV1 (a) and WV2 (b). The chromatic representation signifies the quantity of WV images contained within individual spatial bins measuring 5° × 5°.
Figure 4. Architecture of the MCNN algorithm. The architecture includes multi-feature fusion, data augmentation, and multichannel feature extraction modules. In the data augmentation stage, ‘D’ and ‘G’ stand for the discriminator and generator, respectively. In the multichannel feature extraction module, the stem block contains 9 convolutional and max pooling layers, and Inception-ResNet A to C are modularized blocks comprising a series of convolutional and max pooling layers.
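The data augmentation stage in Figure 4 pairs a generator (‘G’) with a discriminator (‘D’), and the abbreviation list includes the Wasserstein-1 distance (see [64]). Assuming the augmentation follows a WGAN-GP-style objective, which the citation of [64] suggests but which is not spelled out in this excerpt, a minimal sketch of the critic loss with gradient penalty could look as follows; the dummy critic and tensor shapes are placeholders, not the paper's configuration.

```python
import torch

def gradient_penalty(critic, real, fake):
    """WGAN-GP term: push the critic's gradient norm towards 1 on random
    interpolates between real and generated imagettes (as in [64])."""
    eps = torch.rand(real.size(0), 1, 1, 1, device=real.device)
    interp = (eps * real + (1.0 - eps) * fake).requires_grad_(True)
    scores = critic(interp)
    grads, = torch.autograd.grad(
        outputs=scores, inputs=interp,
        grad_outputs=torch.ones_like(scores),
        create_graph=True, retain_graph=True)
    grads = grads.view(grads.size(0), -1)
    return ((grads.norm(2, dim=1) - 1.0) ** 2).mean()

def critic_loss(critic, real, fake, lam=10.0):
    """Wasserstein critic loss with gradient penalty (lambda = 10 in [64])."""
    return critic(fake).mean() - critic(real).mean() + lam * gradient_penalty(critic, real, fake)

# Toy usage with a dummy critic on 1-channel 64x64 "imagettes".
critic = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(64 * 64, 1))
real = torch.randn(8, 1, 64, 64)
fake = torch.randn(8, 1, 64, 64)
loss = critic_loss(critic, real, fake)
loss.backward()
```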
Figure 5. Multichannel feature extraction. The network involves a stem structure composed of convolutional and pooling layers, a reduction part composed of Inception-ResNet and reduction structures, and an output part. Inception A to C and reductions A and B are described at the bottom of the figure, where Cv and MP stand for convolutional layer and max pooling layer, respectively.
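To make the Inception-ResNet units of Figure 5 more tangible, below is a schematic PyTorch sketch of an Inception-ResNet-style residual block: parallel convolution branches whose outputs are concatenated, projected by a 1 × 1 convolution, and added back to the input. The branch widths and kernel sizes are illustrative and do not reproduce the exact Inception-ResNet A to C configurations of the paper.

```python
import torch
import torch.nn as nn

class InceptionResBlock(nn.Module):
    """Schematic Inception-ResNet-style block: parallel branches -> concat
    -> 1x1 projection -> residual addition (widths are illustrative)."""
    def __init__(self, channels: int):
        super().__init__()
        self.branch1 = nn.Conv2d(channels, 32, kernel_size=1)
        self.branch3 = nn.Sequential(
            nn.Conv2d(channels, 32, kernel_size=1),
            nn.Conv2d(32, 32, kernel_size=3, padding=1))
        self.branch5 = nn.Sequential(
            nn.Conv2d(channels, 32, kernel_size=1),
            nn.Conv2d(32, 48, kernel_size=3, padding=1),
            nn.Conv2d(48, 64, kernel_size=3, padding=1))
        self.project = nn.Conv2d(32 + 32 + 64, channels, kernel_size=1)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        branches = torch.cat(
            [self.branch1(x), self.branch3(x), self.branch5(x)], dim=1)
        return self.act(x + self.project(branches))

# Toy forward pass on feature maps produced by a hypothetical stem.
features = torch.randn(1, 128, 32, 32)          # (batch, channels, H, W)
print(InceptionResBlock(128)(features).shape)   # torch.Size([1, 128, 32, 32])
```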
Figure 6. Variation of accuracy (%) and loss values with different iterations of the MCNN algorithm for WV1 (a,b) and WV2 (c,d) imagette classification. In the upper part, the accuracy curve illustrates the network’s training process, demonstrating a consistent increase in accuracy until reaching nearly 90%. In the lower part, the loss curve depicts the network’s training progress, consistently decreasing until approaching nearly 0.
Figure 7. Normalized confusion matrix of the MCNN algorithm when applied to WV1 (a) and WV2 (b). From the confusion matrix, it is evident that the true positive rates for most classes are above 90%, except for the class BS with WV1 applied. Additionally, with WV2 applied, the true positive rates for most classes are again above 90%, except for the class WS.
Figure 8. Annual comparison of MCNN-detected RCs (b) with CMwv-detected RCs (a) and GPM precipitation measurements (c). The rain occurrence percentages are calculated within each 5° by 5° spatial bin based on the TenGeoP-SARwv database from March 2016 to February 2017. The associated average yearly precipitation is obtained from the GPM late-run product.
Table 1. Description of four different image edge filters [58].

No. | Edge Filter | Description (x and y denote the horizontal and vertical directions, respectively; matrix rows are separated by semicolons)
1 | Central | Weighted difference of neighboring pixels: dI/dy = (I(y + 1) − I(y − 1))/2 and dI/dx = (I(x + 1) − I(x − 1))/2
2 | Prewitt | Convolution masks: Δy = [1 1 1; 0 0 0; −1 −1 −1] and Δx = [−1 0 1; −1 0 1; −1 0 1]
3 | Roberts | Weighted difference between diagonally adjacent pixels: Δ1 = [0 1; −1 0]
4 | Sobel | Convolution masks: Δy = [1 2 1; 0 0 0; −1 −2 −1] and Δx = [−1 0 1; −2 0 2; −1 0 1]
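As an illustration of Table 1, each mask can be applied to an imagette with an ordinary 2-D convolution. The sketch below reconstructs the listed kernels (with the conventional negative entries of these operators) and uses a random array in place of a real imagette; it is a demonstration, not the filtering code used in this study.

```python
import numpy as np
from scipy.ndimage import convolve

# Kernels from Table 1 (central differences are handled separately above).
KERNELS = {
    "prewitt_y": np.array([[ 1,  1,  1], [ 0, 0, 0], [-1, -1, -1]], float),
    "prewitt_x": np.array([[-1,  0,  1], [-1, 0, 1], [-1,  0,  1]], float),
    "roberts_1": np.array([[ 0,  1], [-1,  0]], float),
    "sobel_y":   np.array([[ 1,  2,  1], [ 0, 0, 0], [-1, -2, -1]], float),
    "sobel_x":   np.array([[-1,  0,  1], [-2, 0, 2], [-1,  0,  1]], float),
}

def edge_responses(img: np.ndarray) -> dict:
    """Convolve the imagette with each mask from Table 1."""
    return {name: convolve(img.astype(float), k, mode="reflect")
            for name, k in KERNELS.items()}

responses = edge_responses(np.random.rand(128, 128))  # dummy imagette
for name, r in responses.items():
    print(name, float(np.abs(r).mean()))
```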
Table 2. Classification performance in WV1 and WV2 imagette detection for the different methods. Each cell reports recall/precision/F-score (R/P/F); for each class, the first row corresponds to WV1 and the second row to WV2.

Class | Mode | AlexNet (R/P/F) | CMwv (R/P/F) | MCNN (R/P/F)
Atmospheric fronts (AFs) | WV1 | 0.46/0.72/0.56 | 0.95/0.40/0.56 | 0.93/0.83/0.88
 | WV2 | 0.13/0.81/0.22 | 0.95/0.38/0.54 | 0.92/0.91/0.91
Biological slicks (BSs) | WV1 | 0.97/0.67/0.79 | 0.95/0.88/0.91 | 0.89/0.99/0.94
 | WV2 | 0.63/0.70/0.66 | 0.89/0.91/0.90 | 0.93/1.00/0.96
Icebergs (IBs) | WV1 | 0.48/0.66/0.56 | 0.97/0.16/0.27 | 0.96/0.98/0.97
 | WV2 | 0.04/0.82/0.08 | 0.92/0.18/0.30 | 0.97/0.97/0.97
Low-wind areas (LWAs) | WV1 | 0.76/0.98/0.86 | 1.00/0.87/0.93 | 0.92/0.94/0.93
 | WV2 | 0.74/0.78/0.76 | 1.00/0.79/0.88 | 0.96/0.95/0.95
Microconvective cells (MCCs) | WV1 | 0.38/0.88/0.53 | 0.80/0.76/0.78 | 0.98/0.98/0.98
 | WV2 | 0.41/0.82/0.55 | 0.85/0.94/0.89 | 0.95/0.93/0.94
Oceanic fronts (OFs) | WV1 | 0.41/0.88/0.56 | 1.00/0.05/0.10 | 0.91/0.92/0.91
 | WV2 | 0.67/0.48/0.56 | 1.00/0.05/0.10 | 0.95/0.87/0.91
Pure ocean waves (PWs) | WV1 | 0.91/0.81/0.86 | 0.47/1.00/0.64 | 0.98/0.95/0.96
 | WV2 | 0.12/0.95/0.21 | 0.39/0.98/0.56 | 0.96/0.95/0.95
Rain cells (RCs) | WV1 | 0.66/0.88/0.75 | 0.93/0.88/0.90 | 0.97/0.99/0.98
 | WV2 | 0.73/0.88/0.80 | 0.93/0.80/0.86 | 0.98/0.96/0.97
Sea ice (SI) | WV1 | 0.75/0.93/0.83 | 0.90/0.96/0.93 | 0.96/1.00/0.98
 | WV2 | 0.85/0.74/0.79 | 0.96/0.96/0.96 | 0.99/0.98/0.98
Windstreaks (WSs) | WV1 | 0.53/0.72/0.61 | 0.83/0.77/0.80 | 0.99/0.98/0.98
 | WV2 | 0.79/0.58/0.67 | 0.83/0.96/0.89 | 0.83/1.00/0.91
Average | WV1 | 0.63/0.81/0.71 | 0.88/0.67/0.76 | 0.95/0.96/0.95
 | WV2 | 0.51/0.76/0.61 | 0.87/0.70/0.78 | 0.94/0.95/0.95
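The recall, precision, and F-score values in Table 2 follow the standard per-class definitions (cf. [67]). The sketch below shows how they can be computed from a confusion matrix; the 3 × 3 toy matrix is made up, whereas the paper's matrices cover ten classes. Row-normalizing the matrix (conf / conf.sum(axis=1, keepdims=True)) yields the true positive rates visualized in Figure 7.

```python
import numpy as np

def per_class_scores(conf: np.ndarray):
    """Precision, recall, and F-score per class from a confusion matrix
    whose rows are true classes and columns are predicted classes."""
    tp = np.diag(conf).astype(float)
    precision = tp / np.maximum(conf.sum(axis=0), 1)  # column sums = predicted counts
    recall = tp / np.maximum(conf.sum(axis=1), 1)     # row sums = true counts
    f_score = 2 * precision * recall / np.maximum(precision + recall, 1e-12)
    return precision, recall, f_score

# Toy 3-class example (illustrative only).
conf = np.array([[50,  2,  3],
                 [ 4, 45,  1],
                 [ 2,  3, 40]])
p, r, f = per_class_scores(conf)
print(np.round(p, 2), np.round(r, 2), np.round(f, 2))
```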
