Article

Wave Height and Period Estimation from X-Band Marine Radar Images Using Convolutional Neural Network

1 School of Mechanical Engineering, Dalian University of Technology, Dalian 116081, China
2 State Key Laboratory of High-Performance Precision Manufacturing, Dalian University of Technology, Dalian 116024, China
3 Ningbo Institute, Dalian University of Technology, Ningbo 315000, China
4 Department of Navigation, Dalian Naval Academy, Dalian 116081, China
* Author to whom correspondence should be addressed.
J. Mar. Sci. Eng. 2024, 12(2), 311; https://doi.org/10.3390/jmse12020311
Submission received: 29 December 2023 / Revised: 30 January 2024 / Accepted: 7 February 2024 / Published: 9 February 2024
(This article belongs to the Section Physical Oceanography)

Abstract

In this study, a deep learning network that extracts spatial-temporal features is proposed to estimate the significant wave height (H_s) and wave period (T_s) from X-band marine radar images. Since the shore-based radar images in this study are contaminated by radial noise lines from other radars and by solid target objects, the images must be pre-processed to remove this interference so that the proposed convolutional neural network (CNN) can extract image features accurately. Firstly, a pre-trained GoogLeNet is used to extract multi-scale deep spatial features from the radar images to estimate H_s and T_s. Since CNN-based models cannot analyze the temporal behavior of wave features in radar image sequences, a self-attention layer is connected after the deep convolutional layers of the CNN to construct a convolutional self-attention (CNNSA)-based model that generates spatial-temporal features for H_s and T_s estimation. H_s and T_s measured by a nearby buoy are used for model training and as references. The experimental results show that, compared with the traditional SNR-based and CNN-based methods, the proposed CNNSA model reduces the RMSD of H_s estimation by 0.24 m and 0.11 m, respectively, and the RMSD of T_s estimation by 0.3 s and 0.08 s, respectively.

1. Introduction

X-band radar has become a valuable tool for oceanographic studies due to its high spatial and temporal resolution [1]. The Bragg resonance interaction between X-band electromagnetic waves and the centimeter-scale ripples induced by local winds generates radar backscatter from the sea surface, so changes in the sea surface are imaged through the backscattered electromagnetic waves [2]. Marine radar images can thus be used to estimate sea surface features effectively, such as wind [3,4], wave parameters [2,5,6,7], and currents [8,9,10]. Different sea surface parameters, such as the significant wave height (H_s), wave period (T_s), and wave direction, are essential for the safety of various marine activities and the development of coastal areas [11]. In situ sensors, such as wave buoys, have traditionally been used for wave measurements; in contrast, X-band marine radars can detect and measure wave parameters over a broader area at relatively low maintenance cost.
The echo signal produced by the X-band radar, which appears as an image of light and dark stripes, is called sea clutter; it is formed by the backscattering of the electromagnetic waves emitted by the radar from the sea surface. The base signal (short waves) is modulated by longer waves through several mechanisms, such as hydrodynamic modulation, tilt modulation, and shadowing, which makes the longer waves visible in radar images [12]. The conventional spectral analysis method obtains the image spectrum via a 3-D fast Fourier transform (FFT) of the radar image sequence [2], using the dispersion relation as a filter to separate the wave signal from the background noise. The wave period, wave direction, and wavelength can then be deduced from the filtered spectrum [2,6]. Based on the estimation relation developed for synthetic aperture radar [13], the signal-to-noise ratio (SNR) of the wave signal is calculated from the wave spectrum, and H_s is estimated through a linear relationship between H_s and the square root of the SNR, whose coefficients can be determined using in situ measurements. Later studies found that H_s is not exactly linearly proportional to the square root of the SNR, owing to variations in sea state, different methods of calculating the SNR, and differences between radar systems. In addition to SNR-based methods, several alternative methods have been proposed to estimate wave parameters, such as empirical orthogonal function-based methods [14], iterative least squares-based methods [1], 2D continuous wavelet transform-based methods [15], array-beamforming-based methods [16], shadowing mitigation-based methods [17], and synchrosqueezed wavelet transform-based methods [18]. Further methods for estimating H_s include shadowing-based methods [19,20], ensemble empirical mode decomposition-based methods [21], correlation analysis-based methods [22], and variational mode decomposition-based methods [23].
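The classical SNR-based relation described above amounts to a simple calibration step. A minimal sketch follows; the calibration data and coefficients are illustrative assumptions, not values from the paper:

```python
import numpy as np

# Hypothetical calibration data: sqrt(SNR) values from radar image spectra
# paired with buoy-measured Hs (all values are illustrative).
sqrt_snr = np.array([0.8, 1.0, 1.3, 1.5, 1.9, 2.2])
hs_buoy = 0.5 + 1.4 * sqrt_snr + np.random.default_rng(0).normal(0, 0.05, 6)

# Least-squares fit of the linear model Hs = a + b * sqrt(SNR).
b, a = np.polyfit(sqrt_snr, hs_buoy, 1)

def estimate_hs(snr):
    """Estimate Hs from a wave-spectrum SNR using the fitted coefficients."""
    return a + b * np.sqrt(snr)
```

Once a and b are fixed from in situ measurements, each new image sequence only requires computing the spectral SNR and applying the linear relation.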
It is worth noting that machine learning algorithms have been applied to H_s estimation; they can simplify the cumbersome steps of earlier algorithms and improve computational efficiency. More accurate estimates can also be obtained using methods such as a support vector regression (SVR)-based method [24], artificial neural network (ANN)-based methods [25,26], a convolutional neural network (CNN)-based method [27], a convolutional gated recurrent unit network (CGRU)-based method [14], and a temporal convolutional network (TCN)-based method [28]. In addition, random forest (RF)-based machine learning methods have been used to estimate wave directions and periods [29].
CNNs can extract spatial features from image sequences but are incapable of temporal analysis. In this study, a novel H_s and T_s estimation model combining a CNN with self-attention (abbreviated as CNNSA) is proposed. The method extracts the spatial features of radar image sequences via the CNN and introduces a self-attention layer on the CNN feature vectors to capture dependencies in the time series, achieving spatial-temporal feature extraction from radar images. Section 2 introduces the data pre-processing methods used, including median filtering based on a two-layer decision and an adaptive region-growing repair method. The structure and components of the proposed CNN- and convolutional self-attention (CNNSA)-based models are illustrated in Section 3. The training and testing results obtained from shore-based marine radar data using the SNR-, CNN-, and CNNSA-based models are presented and compared in Section 4. The results are discussed in Section 5. Finally, the conclusion and outlook of this work appear in Section 6.

2. Data Pre-Processing

Interference from other marine radars produces radial noise lines in radar images, which appear as dense radial pixel lines in Figure 1. The radar images collected in this study come from a shore-based X-band radar, which is also affected by interference from targets such as ships; these appear as bright spots in the images. The pre-processing workflow for the radar images is shown in Figure 2.

2.1. Median Filtering Based on the Two-Layer Decision

According to the characteristics of the wave texture in the radar image, an improved median filtering method is used to screen pixels for noise. The region involved in H_s and T_s estimation is first extracted from the radar image in polar coordinates. In this study, a sub-image with an azimuth span of ±30° and a range of 600–2500 m is selected and converted into a grey-scale image with values 0–255, denoted I(x, y), of size n × n with n = 256. First, a sliding window L1 of size m × 1 is set, with m = 3 in this study. I(x, y) is padded according to the window size, giving an image of size (n + m − 1), and the window starts at the first position of the padded image in order to locate the noise points to be processed. For each window position, it is checked whether the grey value of the center pixel equals the median of the window; if so, the pixel is left unchanged. Otherwise, a threshold C1 = (1/5) A_avg is used to decide whether the pixel is a noise point: the difference B = B_max(i, j) − B_min(i, j) between the maximum and minimum grey values in the window is computed, and if B > C1, the pixel is declared a noise point. Finally, each noise point is replaced by the median value of its window L1. A_avg is defined as follows:
A_avg = (1/(x · y)) Σ_{j=1}^{y} Σ_{i=1}^{x} I(i, j)
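The two-layer decision above can be sketched as follows. This is a minimal illustration under stated assumptions (edge padding, a vertical m × 1 window); the authors' exact padding and traversal details may differ:

```python
import numpy as np

def two_layer_median_filter(img, m=3):
    """Two-layer decision median filter (sketch of Section 2.1).

    A pixel is replaced only when (1) it is not already the median of its
    m x 1 window and (2) the window's max-min spread exceeds C1 = A_avg / 5,
    i.e. the pixel is judged to be a noise point.
    """
    img = img.astype(np.float64)
    c1 = img.mean() / 5.0                      # threshold from the global mean A_avg
    pad = m // 2
    padded = np.pad(img, ((pad, pad), (0, 0)), mode="edge")
    out = img.copy()
    for i in range(img.shape[0]):
        for j in range(img.shape[1]):
            window = padded[i:i + m, j]        # m x 1 sliding window
            med = np.median(window)
            spread = window.max() - window.min()
            if img[i, j] != med and spread > c1:
                out[i, j] = med                # replace the noise point
    return out
```

The two-layer check keeps genuine wave texture intact: a pixel that merely differs from its neighbors is left alone unless the local spread also exceeds the global noise threshold.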

2.2. Adaptive Region Growing Repair Method

Targets on the sea surface can cause a loss of wave texture, leading to errors when estimating H_s and T_s from radar images using deep learning-based methods. To avoid this problem, this study uses an adaptive region-growing repair method to recover the image regions disturbed by targets. The process is divided into four parts. The first part uses an adaptive threshold to decide whether a target is present. An initial parameter C2 and a threshold D1 are set, where A_max is the maximum grey value over all pixels in I(x, y). If A_max does not exceed D1, the original grey values are output unchanged; otherwise, the following processing is required.
D1 = A_avg + C2
C2 = (A_max + 1) / 2
The second part locates the initial growth point of the target and identifies the target area in I(x, y). The grayscale values of all pixels in I(x, y) are sorted, and the location of the maximum value A_max is selected as the growth point of the target. A sliding window L2 = m × m is set with A_max as its center point and initial growth point. Starting from this point, pixels inside the window whose features are similar to those of the window's center are searched for, and the traversal stops when no similar pixels remain in the window, which determines the area occupied by the target. The similarity criterion is |C(i, j) − C_cen| < D2, where C(i, j) denotes the grayscale value of a pixel inside L2, C_cen denotes the grayscale value of the center of L2, and D2 is the filtering threshold. If the inequality holds, similar pixels are searched for with C(i, j) as a new starting point, until the inequality no longer holds anywhere inside the sliding window. These steps are repeated to find the remaining targets in the image. D2 is defined as follows:
D2 = (1/3) A_avg
The third part determines whether a detected region is a real target. The number of pixel locations N_i occupied by each candidate target is counted, and it is checked whether this number is below the upper limit for recognizing a single target, area = m × m (with m = N_i / 6, rounded to an integer). If this condition holds, the region is considered a candidate target. The average grey value B_avg of the pixels in the target area and the target threshold D3 are then calculated; if B_avg > D3, the region is considered a real target. D3 is defined as follows:
D3 = (5/4) A_avg
The fourth part restores the target region and recovers the underlying wave texture using a mean-filling transition algorithm. The grey values of the pixels occupied by the target are set to 0. According to the size of the sliding window L2, I(x, y) is padded to (n + 2m) × (n + 2m), and each target pixel is replaced by the mean of the four pixels located m points away from it in the range and azimuth directions. Figure 3 shows the radar sub-image, the processed radial noise line, and the image with the target removed.
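A minimal sketch of the four-part repair, assuming a simple 4-connected region growing and the thresholds D1, D2 defined above; the traversal details and the target-validation step (D3) are simplified relative to the authors' implementation:

```python
import numpy as np
from collections import deque

def repair_target_region(img, m=3):
    """Sketch of the adaptive region-growing repair (Section 2.2).

    Grows a region from the brightest pixel while neighbours stay within
    D2 = A_avg / 3 of the current point, then fills each region pixel with
    the mean of the four pixels m steps away in the range/azimuth directions.
    """
    img = img.astype(np.float64)
    a_avg = img.mean()
    d1 = a_avg + (img.max() + 1) / 2.0         # target-presence threshold D1
    if img.max() <= d1:
        return img                             # no target detected
    d2 = a_avg / 3.0                           # similarity threshold D2
    h, w = img.shape
    seed = np.unravel_index(np.argmax(img), img.shape)
    region, queue, seen = [], deque([seed]), {seed}
    while queue:                               # 4-connected region growing
        i, j = queue.popleft()
        region.append((i, j))
        for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            ni, nj = i + di, j + dj
            if 0 <= ni < h and 0 <= nj < w and (ni, nj) not in seen \
                    and abs(img[ni, nj] - img[i, j]) < d2:
                seen.add((ni, nj))
                queue.append((ni, nj))
    padded = np.pad(img, m, mode="edge")
    out = img.copy()
    for i, j in region:                        # mean-filling transition
        pi, pj = i + m, j + m
        out[i, j] = np.mean([padded[pi - m, pj], padded[pi + m, pj],
                             padded[pi, pj - m], padded[pi, pj + m]])
    return out
```

Because the fill values are sampled m pixels away from the target region, the repaired patch blends into the surrounding wave texture rather than leaving a flat hole.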

3. The CNNSA-Based Estimation Model

3.1. CNN

CNNs are deep learning models designed to process and analyze data with a grid structure, such as images and videos. Their distinctive feature is the convolutional layer, which is locally aware of the input data through sliding windows and thus effectively captures the spatial relationships of image or video features [30]. The basic building blocks of CNNs include convolutional operations, pooling operations, and fully connected layers. Convolutional operations detect various features in images, such as edges, textures, and shapes, whereas pooling operations reduce the size of the feature map, lower the computational complexity, and extract higher-level features. In this study, GoogLeNet [31] is selected as the pre-trained model from among the many convolutional network models. Compared with other classical CNNs, such as AlexNet [32] and VGGNet [33], GoogLeNet has a lighter and more efficient structure, adopting the “Inception” module to reduce the number of parameters through multi-scale convolutional kernels, while its deeper network can learn more complex feature representations.
GoogLeNet consists of several “Inception” modules, each containing a series of convolutional kernels of different scales to capture image features in parallel. Its overall structure consists of the following key components:
(1) Convolutional and pooling layers. These extract the basic features of the image, such as edges and texture;
(2) Inception modules. Each “Inception” module contains multiple parallel convolutional kernels and pooling operations to capture features at different scales and levels. The results of these parallel operations are concatenated to form the module output, with the primary goal of improving the feature representation of the model without adding too many parameters;
(3) Global average pooling layer. This layer averages the values of each channel of the feature map to generate a fixed-size feature vector, which reduces the dimensionality fed to the fully connected layer and helps reduce overfitting;
(4) Fully connected layer. This layer integrates information from different features to capture complex relationships in the data. In this study, the fully connected layer is used as a regression head to estimate the H_s and T_s of the radar image.
The structure of the GoogLeNet-based estimation model is shown in Figure 4.

3.2. Self-Attention

The CNN-based model performs well in capturing the deep features of a radar image, but it has no mechanism for capturing the temporal correlation across the image sequence. Self-attention allows the model to dynamically capture long-distance dependencies throughout the input sequence and thus relate the context of the image sequence. The computation process of self-attention, which includes the computation of the Query, Key, Value, and Attention Score and the final weighted summation, is shown in Figure 5. Firstly, the feature map x obtained by deep convolution is multiplied by the weight matrices W_Q, W_K, and W_V to obtain the feature spaces f(x), g(x), and h(x), respectively:
f(x) = W_Q · x,   g(x) = W_K · x,   h(x) = W_V · x
Then, the transpose of the feature vector f(x_i) is multiplied by the feature vector g(x_j), and the products are normalized with a softmax to obtain the attention map β_{j,i}:
β_{j,i} = exp(s_{ij}) / Σ_{i=1}^{N} exp(s_{ij}),   where s_{ij} = f(x_i)^T · g(x_j)
Finally, the feature space h(x) is weighted by the attention map and passed through a 1 × 1 convolution v to obtain the self-attention feature map o:
o_j = v( Σ_{i=1}^{N} β_{j,i} · h(x_i) ),   where h(x_i) = W_V · x_i
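The three equations above can be checked numerically. The sketch below uses random projection matrices in NumPy purely for illustration; dimensions N and C, and all weights, are assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

N, C = 8, 16                    # sequence length and channel dimension (assumed)
x = rng.normal(size=(C, N))     # deep-feature map, one column per position

# Learned projections W_Q, W_K, W_V (random here for illustration).
W_Q, W_K, W_V = (rng.normal(size=(C, C)) for _ in range(3))
f, g, h = W_Q @ x, W_K @ x, W_V @ x        # f(x), g(x), h(x)

# s_ij = f(x_i)^T g(x_j); softmax over i yields the attention map beta_{j,i}.
s = f.T @ g                                 # s[i, j] = f(x_i)^T g(x_j)
e = np.exp(s - s.max(axis=0))               # max-shift for numerical stability
beta = e / e.sum(axis=0)                    # columns sum to 1

# o_j = sum_i beta_{j,i} h(x_i); the 1x1 convolution v acts as a CxC matrix.
v = rng.normal(size=(C, C))
o = v @ (h @ beta)                          # self-attention feature map, (C, N)
```

Each output position o_j is a v-transformed mixture of all value vectors h(x_i), with weights β_{j,i} set by how strongly query i matches key j, which is exactly what lets the layer relate distant positions in the sequence.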

3.3. CNN-SA Model

In this study, a novel deep learning model with temporal and spatial feature extraction is proposed by embedding self-attention as a feature extraction module in the CNN model. The model captures the spatial information of radar images through the convolutional layers, pooling layers, and Inception structure of GoogLeNet, while adding self-attention after the “Inception” structure dynamically captures long-range dependencies in the image sequence. The model can therefore capture both spatial and temporal relationships in radar image sequences, which benefits the estimation of H_s and T_s. The flowchart of the CNNSA-based estimation model for H_s and T_s is shown in Figure 6, and detailed parameter information is given in Table 1.

4. Results

4.1. Data Overview

The X-band marine radar data used in this study were collected in Pingtan, Fujian Province, from 16 December 2010 to 10 January 2011. The detailed parameters of the radar are shown in Table 2. The rotation period of the radar antenna was about 2.5 s, and a radar image sequence contained 32 images, i.e., each image sequence spanned 80 s. The wave buoy that provided the reference data was deployed approximately 0.85 km from the radar. Since the reference H_s and T_s were measured every 20 min, temporal interpolation is required to provide synchronized references for each radar image sequence. Figure 1 shows the positional relationship between a radar image in polar coordinates and the wave buoy. During data collection, H_s was between 1.3 and 3.5 m and T_s was between 6.5 and 10 s, while the synchronized local wind speeds ranged from 5.8 to 18.5 m/s, as shown in Figure 7, so the proposed method was evaluated under typical wind and wave conditions. Synchronized buoy references and radar images were not available at every moment; the corresponding data are shown in Figure 7.

4.2. Model Training

In this study, the first image of each sequence was input into the pre-processing workflow in Figure 2, which selects the radar image sub-area and filters out the radial noise lines and targets, so that high-quality sub-images could be provided for the deep learning-based estimation of H_s and T_s. The significant wave heights used in this study were 1.3–3.5 m, and the training sample contained images from all ranges. Since the reference H_s and T_s were measured about every 20 min, before training, all the image sequences (excluding rainy days) were first sorted according to the simultaneous H_s and T_s obtained from the interpolated buoy measurements. Then, within each 0.5 m interval between 1.3 and 3.5 m, 70% of the samples were taken for training and 30% for testing. The proposed CNNSA-based model was trained using the PyTorch framework on a Windows 10 PC with two 2.10 GHz Xeon (R) Gold 6230R CPUs. The initial learning rate was 0.0003 with the Adam optimizer and the MSE loss function; the batch size was set to 36, and the number of epochs was 50.
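The optimizer configuration above can be sketched as a standard PyTorch training loop. A tiny placeholder network and random tensors stand in for the CNNSA model and radar images; only the hyperparameters (Adam, lr = 0.0003, MSE, batch size 36, 50 epochs) come from the paper:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Stand-in for the CNNSA model; the real model is GoogLeNet + self-attention.
model = nn.Sequential(nn.Flatten(), nn.Linear(64, 2))
optimizer = torch.optim.Adam(model.parameters(), lr=0.0003)   # lr from Section 4.2
criterion = nn.MSELoss()

images = torch.randn(36, 64)          # one batch of 36 (flattened) samples
targets = torch.randn(36, 2)          # interpolated buoy references [Hs, Ts]

losses = []
for epoch in range(50):               # 50 epochs, as in the paper
    optimizer.zero_grad()
    loss = criterion(model(images), targets)
    loss.backward()
    optimizer.step()
    losses.append(loss.item())
```

In the real setup, the single random batch would be replaced by a DataLoader over the pre-processed radar sub-images.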

4.3. Result Analysis

To demonstrate the effectiveness of the CNNSA-based H_s and T_s estimation model, the SNR-based method [7] and the CNN-based method were applied to the same testing set for comparison. In this study, the H_s and T_s obtained in two consecutive buoy measurement intervals were smoothed using a time-moving average with a window equal to the buoy measurement interval. The root-mean-square differences (RMSDs), correlation coefficients (CCs), and biases between the reference values and the values estimated by the SNR-based, CNN-based, and CNNSA-based methods are summarized in Table 3 and Table 4, respectively. The estimation results of the three methods on the test sample before and after the moving average are shown in Figure 8, and Figure 9 shows the estimation results as time series.
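The three evaluation metrics can be computed as follows; the sample values are illustrative only, not the paper's data:

```python
import numpy as np

def rmsd(est, ref):
    """Root-mean-square difference between estimates and references."""
    return float(np.sqrt(np.mean((np.asarray(est) - np.asarray(ref)) ** 2)))

def bias(est, ref):
    """Mean difference (estimate minus reference)."""
    return float(np.mean(np.asarray(est) - np.asarray(ref)))

def cc(est, ref):
    """Pearson correlation coefficient."""
    return float(np.corrcoef(est, ref)[0, 1])

# Illustrative Hs values in meters (hypothetical, not from the paper).
ref = np.array([1.5, 2.0, 2.5, 3.0, 3.5])
est = np.array([1.6, 1.9, 2.7, 2.9, 3.6])
```

These are the same quantities reported in Table 3 and Table 4 for each method, computed per test set before and after the 30-min moving average.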
From Table 3, it can be seen that, for H_s estimation without averaging, the RMSD of the proposed method is reduced by 0.21 m and 0.1 m compared with the SNR-based and CNN-based methods, and the CC is improved to 0.85. When the H_s estimates are smoothed with a 30-min moving average, Table 3 shows that the results improve further, with an RMSD of 0.3 m and a CC of 0.86 for the CNNSA-based method. Similarly, Table 4 shows the T_s estimation: without averaging, the RMSD of the proposed method is reduced by 0.23 s and 0.09 s compared to the SNR-based and CNN-based methods, and the CC is improved to 0.89. After the 30-min moving average, the estimated T_s also improves, with an RMSD of 0.27 s and a CC of 0.91 for the proposed method. To visualize the results of the three methods, Figure 8 shows the H_s and T_s estimates with and without the moving average.

5. Discussion

As can be seen from Figure 8a, H_s is underestimated when it is between 2 and 3 m and overestimated when it is greater than 3 m. In Figure 8b, the SNR-based T_s estimates show a large overall dispersion, owing to the sensitivity of SNR-based methods to radar image noise. Figure 8c,e show that the CNNSA-based method achieves a better linear correlation in H_s estimation than the CNN-based method, and Figure 8d,f show the same for T_s estimation. From Table 3 and Table 4, the correlation coefficients of the CNNSA-based H_s and T_s estimates after the 30-min moving average are 0.86 and 0.91, respectively, both higher than those of the CNN-based and SNR-based methods. Further, Figure 9a,b show the variation of the estimated H_s and T_s over time for the three methods; the CNNSA-based estimates follow the trend of the buoy references most closely.
In summary, the proposed CNNSA-based regression model improves the estimation of H_s and T_s to different degrees compared with the other two methods. Figure 8 shows the goodness of fit of the three methods; the CNNSA-based estimates of H_s and T_s lie closest to the black line after the time-moving average is applied. The closeness of the three methods to the buoy reference is further demonstrated in Figure 9, where the H_s and T_s estimates of the proposed method track the buoy references most closely. These results show that the proposed method improves the accuracy of H_s and T_s estimation from X-band marine radar images and validate the reasonableness and effectiveness of the CNNSA-based estimation model.

6. Conclusions

In this study, a deep neural network was used to estimate H_s and T_s from X-band marine radar backscatter image sequences. Since the shore-based radar images in this study were contaminated by radial noise lines from other radars and by solid target objects, the images had to be pre-processed to eliminate the interference so that the convolutional neural network (CNN) could extract image features accurately. Firstly, a pre-trained GoogLeNet was used to extract multi-scale deep spatial features from the radar images to estimate H_s and T_s. Since CNN-based models cannot analyze the temporal behavior of wave features in radar image sequences, self-attention was connected after the deep convolutional layers of the CNN to construct a convolutional self-attention (CNNSA)-based model that generated spatial-temporal features for H_s and T_s estimation. Both models were trained and tested using the collected shore-based radar data, with interpolated buoy measurements serving as reference values for performance evaluation. The experimental results showed that both the CNN-based and CNNSA-based models improve the estimation of H_s and T_s: compared with the traditional SNR-based method, the RMSD of H_s estimation is reduced by 0.13 m and 0.24 m, respectively, and the RMSD of T_s estimation by 0.22 s and 0.3 s, respectively. Overall, the CNNSA-based model gave better results for estimating H_s and T_s simultaneously; using temporal information alongside multi-scale deep spatial features further improves the estimation accuracy.
Due to the data limitations of this experiment, future work should collect sea state and radar data with a larger number of images covering a wider range of conditions for model training and validation; radar and reference data should also be collected at different locations to validate the proposed model further.

Author Contributions

Conceptualization, S.Z., D.W., X.W. and L.S.; methodology, S.Z.; software, S.Z.; validation, S.Z., S.L., Y.Z. and D.L.; formal analysis, S.Z., D.W., X.W. and L.S.; investigation, S.Z.; resources, D.W. and X.W.; data curation, S.Z., S.L., Y.Z. and D.L.; writing—original draft preparation, S.Z.; writing—review and editing, S.Z.; visualization, S.Z.; supervision, S.Z., D.W., X.W. and L.S.; project administration, S.Z., D.W., X.W. and L.S.; funding acquisition, D.W. and X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant Nos. 62074138 and 51975104), the Natural Science Foundation of Ningbo (2022J008), the Fundamental Research Funds for the Central Universities (DUT22LAB405 and DUT22QN227) and Ningbo Institute of Dalian University of Technology.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw/processed data required to reproduce these findings cannot be shared at this time as the data are also part of an ongoing study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Huang, W.M.; Gill, E.; An, J.Q. Iterative least-squares-based wave measurement using X-band nautical radar. IET Radar Sonar Nav. 2014, 8, 853–863. [Google Scholar] [CrossRef]
  2. Young, I.R.; Rosenthal, W.; Ziemer, F. A three-dimensional analysis of marine radar images for the determination of ocean wave directionality and surface currents. J. Geophys. Res. Ocean. 1985, 90, 1049–1059. [Google Scholar] [CrossRef]
  3. Lund, B.; Graber, H.C.; Romeiser, R. Wind Retrieval From Shipborne Nautical X-Band Radar Data. IEEE Trans. Geosci. Remote Sens. 2012, 50, 3800–3811. [Google Scholar] [CrossRef]
  4. Dankert, H.; Horstmann, J.; Rosenthal, W. Ocean wind fields retrieved from radar-image sequences. J. Geophys. Res Ocean. 2003, 108, 3352. [Google Scholar] [CrossRef]
  5. Lund, B.; Graber, H.C.; Xue, J.; Romeiser, R. Analysis of Internal Wave Signatures in Marine Radar Data. IEEE Trans. Geosci. Remote Sens. 2013, 51, 4840–4852. [Google Scholar] [CrossRef]
  6. Nieto-Borge, J.C.; Hessner, K.; Jarabo-Amores, P.; de la Mata-Moya, D. Signal-to-noise ratio analysis to estimate ocean wave heights from X-band marine radar image time series. IET Radar Sonar Nav. 2008, 2, 35–41. [Google Scholar] [CrossRef]
  7. Borge, J.C.N.; Soares, C.G. Analysis of directional wave fields using X-band navigation radar. Coast. Eng. 2000, 40, 375–391. [Google Scholar] [CrossRef]
  8. Gangeskar, R. Ocean current estimated from X-band radar sea surface images. IEEE Trans. Geosci. Remote Sens. 2002, 40, 783–792. [Google Scholar] [CrossRef]
  9. Senet, C.M.; Seemann, J.; Ziemer, F. The near-surface current velocity determined from image sequences of the sea surface. IEEE Trans. Geosci. Remote Sens. 2001, 39, 492–505. [Google Scholar] [CrossRef]
  10. Shen, C.; Huang, W.; Gill, E.W.; Carrasco, R.; Horstmann, J. An Algorithm for Surface Current Retrieval from X-band Marine Radar Images. Remote Sens. 2015, 7, 7753–7767. [Google Scholar] [CrossRef]
  11. Greenwood, C.; Vogler, A.; Morrison, J.; Murray, A. The approximation of a sea surface using a shore mounted X-band radar with low grazing angle. Remote Sens. Environ. 2018, 204, 439–447. [Google Scholar] [CrossRef]
  12. Huang, W.M.; Liu, X.L.; Gill, E.W. Ocean Wind and Wave Measurements Using X-Band Marine Radar: A Comprehensive Review. Remote Sens. 2017, 9, 39. [Google Scholar] [CrossRef]
  13. Alpers, W.; Hasselmann, K. Spectral signal to clutter and thermal noise properties of ocean wave imaging synthetic aperture radars. Int. J. Remote Sens. 1982, 3, 423–446. [Google Scholar] [CrossRef]
  14. Chen, X.W.; Huang, W.M. Spatial-Temporal Convolutional Gated Recurrent Unit Network for Significant Wave Height Estimation From Shipborne Marine Radar Data. IEEE Trans. Geosci. Remote Sens. 2022, 60, 4201711. [Google Scholar] [CrossRef]
  15. An, J.Q.; Huang, W.M.; Gill, E.W. A Self-Adaptive Wavelet-Based Algorithm for Wave Measurement Using Nautical Radar. IEEE Trans. Geosci. Remote Sens. 2015, 53, 567–577. [Google Scholar]
  16. Ma, K.; Wu, X.; Yue, X.; Wang, L.; Liu, J. Array Beamforming Algorithm for Estimating Waves and Currents From Marine X-Band Radar Image Sequences. IEEE Trans. Geosci. Remote Sens. 2017, 55, 1262–1272. [Google Scholar] [CrossRef]
  17. Navarro, W.; Velez, J.C.; Orfila, A.; Lonin, S. A Shadowing Mitigation Approach for Sea State Parameters Estimation Using X-Band Remotely Sensing Radar Data in Coastal Areas. IEEE Trans. Geosci. Remote Sens. 2019, 57, 6292–6310. [Google Scholar] [CrossRef]
  18. Wei, Y.B.; Zheng, Y.; Lu, Z.Z. A Method for Retrieving Wave Parameters From Synthetic X-Band Marine Radar Images. IEEE Access 2020, 8, 204880–204890. [Google Scholar] [CrossRef]
  19. Gangeskar, R. An Algorithm for Estimation of Wave Height From Shadowing in X-Band Radar Sea Surface Images. IEEE Trans. Geosci. Remote Sens. 2014, 52, 3373–3381. [Google Scholar] [CrossRef]
  20. Liu, X.L.; Huang, W.M.; Gill, E.W. Wave Height Estimation from Shipborne X-Band Nautical Radar Images. J. Sens. 2016, 2016, 1078053. [Google Scholar] [CrossRef]
  21. Liu, X.L.; Huang, W.M.; Gill, E.W. Estimation of Significant Wave Height From X-Band Marine Radar Images Based on Ensemble Empirical Mode Decomposition. IEEE Geosci. Remote Sens. 2017, 14, 1740–1744. [Google Scholar] [CrossRef]
  22. Ludeno, G.; Serafino, F. Estimation of the Significant Wave Height from Marine Radar Images without External Reference. J. Mar. Sci. Eng. 2019, 7, 432. [Google Scholar] [CrossRef]
  23. Yang, Z.D.; Huang, W.M. Wave Height Estimation From X-Band Radar Data Using Variational Mode Decomposition. IEEE Geosci. Remote Sens. 2022, 19, 1505405. [Google Scholar] [CrossRef]
  24. Cornejo-Bueno, L.; Borge, J.N.; Alexandre, E.; Hessner, K.; Salcedo-Sanz, S. Accurate estimation of significant wave height with Support Vector Regression algorithms and marine radar images. Coast. Eng. 2016, 114, 233–243. [Google Scholar] [CrossRef]
  25. Park, J.; Ahn, K.; Oh, C.; Chang, Y.S. Estimation of Significant Wave Heights from X-Band Radar Using Artificial Neural Network. J. Korean Soc. Coast. Ocean Eng. 2020, 32, 561–568. [Google Scholar] [CrossRef]
  26. Kim, H.; Ahn, K.; Oh, C. Estimation of Significant Wave Heights from X-Band Radar Based on ANN Using CNN Rainfall Classifier. J. Korean Soc. Coast. Ocean Eng. 2021, 33, 101–109. [Google Scholar] [CrossRef]
  27. Duan, W.; Yang, K.; Huang, L.; Ma, X. Numerical Investigations on Wave Remote Sensing from Synthetic X-Band Radar Sea Clutter Images by Using Deep Convolutional Neural Networks. Remote Sens. 2020, 12, 1117. [Google Scholar] [CrossRef]
  28. Huang, W.M.; Yang, Z.D.; Chen, X.W. Wave Height Estimation From X-Band Nautical Radar Images Using Temporal Convolutional Network. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 11395–11405. [Google Scholar] [CrossRef]
  29. Yang, Z.D.; Huang, W.M.; Chen, X.W. Evaluation and Mitigation of Rain Effect on Wave Direction and Period Estimation From X-Band Marine Radar Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 5207–5219. [Google Scholar] [CrossRef]
  30. Gu, J.; Wang, Z.; Kuen, J.; Ma, L.; Shahroudy, A.; Shuai, B.; Liu, T.; Wang, X.; Wang, G.; Cai, J.; et al. Recent Advances in Convolutional Neural Networks. Pattern Recognit. 2018, 77, 354–377. [Google Scholar] [CrossRef]
  31. Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going Deeper with Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015. [Google Scholar]
  32. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
  33. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
Figure 1. Radar image in polar coordinates, showing the buoy position (black dot) and the sub-image areas (red boxes).
Figure 2. Pre-processing of radar images.
Figure 3. Pre-processing of a radar image sub-area. (a) Original sub-image containing radial noise lines and a target. (b) After removal of the radial noise lines. (c) After removal of the target.
Figure 4. Flowchart of the GoogLeNet-based HS and TS estimation model.
Figure 5. The structure of self-attention.
Figure 6. Flowchart of the CNNSA-based estimation model for HS and TS.
Figure 7. Synchronized anemometer-measured wind speed and buoy-measured data.
Figure 8. Scatter plots of the buoy-measured interpolated references and the radar-derived results. Blue and red dots correspond to HS and TS, respectively. (a,b) Testing results obtained from the SNR-based method. (c,d) Testing results obtained from the CNN-based method. (e,f) Testing results obtained from the proposed CNNSA-based method.
Figure 9. Time series of radar-derived results from the three methods: (a) HS estimation results; (b) TS estimation results.
Table 1. Detailed parameter information for the CNNSA-based estimation model.

Type             Input Size        Output Size       Patch Size/Stride   Filters
Input            256 × 256         256 × 256 × 1
Convolution      256 × 256 × 1     128 × 128 × 64    7 × 7/2             64
Max pool         128 × 128 × 64    64 × 64 × 64      3 × 3/2
Convolution      64 × 64 × 64      64 × 64 × 64      1 × 1/1             64
Convolution      64 × 64 × 64      64 × 64 × 192     3 × 3/1             192
9 × Inception    64 × 64 × 192     8 × 8 × 1024
Self-attention   8 × 8 × 1024      8 × 8 × 1024
Average pool     8 × 8 × 1024      1 × 1 × 1024      7 × 7/1
Linear                             1 × 1 × 2
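As a sanity check on Table 1, the spatial dimensions of each stage can be reproduced with the standard convolution/pooling output-size formula, out = floor((in + 2·pad − kernel)/stride) + 1. The padding values below are assumptions (the "same"-style paddings GoogLeNet conventionally uses); the table itself lists only patch size and stride, so this is an illustrative sketch rather than the authors' exact configuration.

```python
def out_size(n: int, kernel: int, stride: int, pad: int) -> int:
    """Spatial output size of a conv/pool layer on an n x n input."""
    return (n + 2 * pad - kernel) // stride + 1

n = 256
n = out_size(n, kernel=7, stride=2, pad=3)   # Convolution 7x7/2 -> 128
assert n == 128
n = out_size(n, kernel=3, stride=2, pad=1)   # Max pool 3x3/2 -> 64
assert n == 64
n = out_size(n, kernel=1, stride=1, pad=0)   # Convolution 1x1/1 -> 64
n = out_size(n, kernel=3, stride=1, pad=1)   # Convolution 3x3/1 -> 64
assert n == 64
# The nine Inception modules span three stride-2 pooling stages (an
# assumption consistent with GoogLeNet), taking 64 -> 32 -> 16 -> 8,
# which matches the 8 x 8 x 1024 input to the self-attention layer.
for _ in range(3):
    n = out_size(n, kernel=3, stride=2, pad=1)
assert n == 8
```
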
Table 2. Radar information.

Parameters                Value
Transmit frequency        9.41 GHz
Polarization              Horizontal
Antenna rotation speed    22 r/min
Range resolution          7.5 m
Antenna height            45 m
Horizontal beam width
Azimuth coverage          360°
Table 3. Comparison of results using different methods for HS estimation.

          Testing (Without Averaging)        Testing (With Averaging)
Method    RMSD (m)    CC      Bias           RMSD (m)    CC      Bias
SNR       0.56        0.64    −0.20          0.54        0.65    −0.20
CNN       0.45        0.76    −0.04          0.41        0.77    −0.04
CNNSA     0.35        0.85    −0.03          0.30        0.86    0.04
Table 4. Comparison of results using different methods for TS estimation.

          Testing (Without Averaging)        Testing (With Averaging)
Method    RMSD (s)    CC      Bias           RMSD (s)    CC      Bias
SNR       0.60        0.65    0.12           0.57        0.65    0.11
CNN       0.46        0.74    −0.10          0.35        0.77    −0.10
CNNSA     0.37        0.89    0.03           0.27        0.91    0.03
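The three metrics reported in Tables 3 and 4 can be sketched as follows; this is a minimal illustration of the standard definitions (root-mean-square difference, Pearson correlation coefficient, and mean bias between radar-derived estimates and buoy references), and the toy arrays are invented for demonstration, not data from the study.

```python
import math

def rmsd(est, ref):
    """Root-mean-square difference between estimates and references."""
    return math.sqrt(sum((e - r) ** 2 for e, r in zip(est, ref)) / len(est))

def bias(est, ref):
    """Mean difference (estimate minus reference)."""
    return sum(e - r for e, r in zip(est, ref)) / len(est)

def cc(est, ref):
    """Pearson correlation coefficient."""
    me, mr = sum(est) / len(est), sum(ref) / len(ref)
    cov = sum((e - me) * (r - mr) for e, r in zip(est, ref))
    var_e = sum((e - me) ** 2 for e in est)
    var_r = sum((r - mr) ** 2 for r in ref)
    return cov / math.sqrt(var_e * var_r)

# Illustrative Hs values in metres (hypothetical, not from the study).
radar = [1.2, 1.8, 2.4, 3.1, 2.0]
buoy  = [1.0, 1.9, 2.5, 3.0, 2.2]
print(rmsd(radar, buoy), cc(radar, buoy), bias(radar, buoy))
```
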
Zuo, S.; Wang, D.; Wang, X.; Suo, L.; Liu, S.; Zhao, Y.; Liu, D. Wave Height and Period Estimation from X-Band Marine Radar Images Using Convolutional Neural Network. J. Mar. Sci. Eng. 2024, 12, 311. https://doi.org/10.3390/jmse12020311
