Journal of Marine Science and Engineering

Research

13 pages, 3352 KiB

Open AccessArticle

Dual-CycleGANs with Dynamic Guidance for Robust Underwater Image Restoration

by Yu-Yang Lin, Wan-Jen Huang and Chia-Hung Yeh

J. Mar. Sci. Eng. 2025, 13(2), 231; https://doi.org/10.3390/jmse13020231 - 25 Jan 2025

Viewed by 541

The field of underwater image processing has gained significant attention recently, offering great potential for enhanced exploration of underwater environments, including applications such as underwater terrain scanning and autonomous underwater vehicles. However, underwater images frequently face challenges such as light attenuation, color distortion, [...] Read more.

The field of underwater image processing has gained significant attention recently, offering great potential for enhanced exploration of underwater environments, including applications such as underwater terrain scanning and autonomous underwater vehicles. However, underwater images frequently face challenges such as light attenuation, color distortion, and noise introduced by artificial light sources. These degradations not only affect image quality but also hinder the effectiveness of related application tasks. To address these issues, this paper presents a novel deep network model for single under-water image restoration. Our model does not rely on paired training images and incorporates two cycle-consistent generative adversarial network (CycleGAN) structures, forming a dual-CycleGAN architecture. This enables the simultaneous conversion of an underwater image to its in-air (atmospheric) counterpart while learning a light field image to guide the underwater image towards its in-air version. Experimental results indicate that the proposed method provides superior (or at least comparable) image restoration performance, both in terms of quantitative measures and visual quality, when compared to existing state-of-the-art techniques. Our model significantly reduces computational complexity, resulting in a more efficient approach that maintains superior restoration capabilities, ensuring faster processing times and lower memory usage, making it highly suitable for real-world applications. Full article

(This article belongs to the Special Issue Application of Deep Learning in Underwater Image Processing)

► Show Figures

Figure 1

22 pages, 4204 KiB

Open AccessArticle

AquaYOLO: Enhancing YOLOv8 for Accurate Underwater Object Detection for Sonar Images

by Yanyang Lu, Jingjing Zhang, Qinglang Chen, Chengjun Xu, Muhammad Irfan and Zhe Chen

J. Mar. Sci. Eng. 2025, 13(1), 73; https://doi.org/10.3390/jmse13010073 - 3 Jan 2025

Viewed by 1143

Abstract

Object detection in underwater environments presents significant challenges due to the inherent limitations of sonar imaging, such as noise, low resolution, lack of texture, and color information. This paper introduces AquaYOLO, an enhanced YOLOv8 version specifically designed to improve object detection accuracy in [...] Read more.

Object detection in underwater environments presents significant challenges due to the inherent limitations of sonar imaging, such as noise, low resolution, lack of texture, and color information. This paper introduces AquaYOLO, an enhanced YOLOv8 version specifically designed to improve object detection accuracy in underwater sonar images. AquaYOLO replaces traditional convolutional layers with a residual block in the backbone network to enhance feature extraction. In addition, we introduce Dynamic Selection Aggregation Module (DSAM) and Context-Aware Feature Selection (CAFS) in the neck network. These modifications allow AquaYOLO to capture intricate details better and reduce feature redundancy, leading to improved performance in underwater object detection tasks. The model is evaluated on two standard underwater sonar datasets, UATD and Marine Debris, demonstrating superior accuracy and robustness compared to baseline models. Full article

(This article belongs to the Special Issue Application of Deep Learning in Underwater Image Processing)

► Show Figures

Figure 1

20 pages, 6798 KiB

Open AccessArticle

SS-YOLO: A Lightweight Deep Learning Model Focused on Side-Scan Sonar Target Detection

by Na Yang, Guoyu Li, Shengli Wang, Zhengrong Wei, Hu Ren, Xiaobo Zhang and Yanliang Pei

J. Mar. Sci. Eng. 2025, 13(1), 66; https://doi.org/10.3390/jmse13010066 - 2 Jan 2025

Cited by 2 | Viewed by 1093

Abstract

As seabed exploration activities increase, side-scan sonar (SSS) is being used more widely. However, distortion and noise during the acoustic pulse’s travel through water can blur target details and cause feature loss in images, making target recognition more challenging. In this paper, we [...] Read more.

As seabed exploration activities increase, side-scan sonar (SSS) is being used more widely. However, distortion and noise during the acoustic pulse’s travel through water can blur target details and cause feature loss in images, making target recognition more challenging. In this paper, we improve the YOLO model in two aspects: lightweight design and accuracy enhancement. The lightweight design is essential for reducing computational complexity and resource consumption, allowing the model to be more efficient on edge devices with limited processing power and storage. Thus, meeting our need to deploy SSS target detection algorithms on unmanned surface vessel (USV) for real-time target detection. Firstly, we replace the original complex convolutional method in the C2f module with a combination of partial convolution (PConv) and pointwise convolution (PWConv), reducing redundant computations and memory access while maintaining high accuracy. In addition, we add an adaptive scale spatial fusion (ASSF) module using 3D convolution to combine feature maps of different sizes, maximizing the extraction of invariant features across various scales. Finally, we use an improved multi-head self-attention (MHSA) mechanism in the detection head, replacing the original complex convolution structure, to enhance the model’s ability to focus on important features with low computational load. To validate the detection performance of the model, we conducted experiments on the combined side-scan sonar dataset (SSSD). The results show that our proposed SS-YOLO model achieves average accuracies of 92.4% (mAP 0.5) and 64.7% (mAP 0.5:0.95), outperforming the original YOLOv8 model by 4.4% and 3%, respectively. In terms of model complexity, the improved SS-YOLO model has 2.55 M of parameters and 6.4 G of FLOPs, significantly lower than those of the original YOLOv8 model and similar detection models. Full article

(This article belongs to the Special Issue Application of Deep Learning in Underwater Image Processing)

► Show Figures

Figure 1

32 pages, 6380 KiB

Open AccessArticle

Application and Analysis of the MFF-YOLOv7 Model in Underwater Sonar Image Target Detection

by Kun Zheng, Haoshan Liang, Hongwei Zhao, Zhe Chen, Guohao Xie, Liguo Li, Jinghua Lu and Zhangda Long

J. Mar. Sci. Eng. 2024, 12(12), 2326; https://doi.org/10.3390/jmse12122326 - 18 Dec 2024

Viewed by 983

Abstract

The need for precise identification of underwater sonar image targets is growing in areas such as marine resource exploitation, subsea construction, and ocean ecosystem surveillance. Nevertheless, conventional image recognition algorithms encounter several obstacles, including intricate underwater settings, poor-quality sonar image data, and limited [...] Read more.

The need for precise identification of underwater sonar image targets is growing in areas such as marine resource exploitation, subsea construction, and ocean ecosystem surveillance. Nevertheless, conventional image recognition algorithms encounter several obstacles, including intricate underwater settings, poor-quality sonar image data, and limited sample quantities, which hinder accurate identification. This study seeks to improve underwater sonar image target recognition capabilities by employing deep learning techniques and developing the Multi-Gradient Feature Fusion YOLOv7 model (MFF-YOLOv7) to address these challenges. This model incorporates the Multi-Scale Information Fusion Module (MIFM) as a replacement for YOLOv7’s SPPCSPC, substitutes the Conv of CBS following ELAN with RFAConv, and integrates the SCSA mechanism at three junctions where the backbone links to the head, enhancing target recognition accuracy. Trials were conducted using datasets like URPC, SCTD, and UATD, encompassing comparative studies of attention mechanisms, ablation tests, and evaluations against other leading algorithms. The findings indicate that the MFF-YOLOv7 model substantially surpasses other models across various metrics, demonstrates superior underwater target detection capabilities, exhibits enhanced generalization potential, and offers a more dependable and precise solution for underwater target identification. Full article

(This article belongs to the Special Issue Application of Deep Learning in Underwater Image Processing)

► Show Figures

Figure 1

19 pages, 8118 KiB

Open AccessArticle

Research on the Identification and Classification of Marine Debris Based on Improved YOLOv8

by Wenbo Jiang, Lusong Yang and Yun Bu

J. Mar. Sci. Eng. 2024, 12(10), 1748; https://doi.org/10.3390/jmse12101748 - 3 Oct 2024

Cited by 4 | Viewed by 1829

Abstract

Autonomous underwater vehicles equipped with target recognition algorithms are a primary means of removing marine debris. However, due to poor underwater visibility, light scattering by suspended particles, and the coexistence of organisms and debris, current methods have problems such as poor recognition and [...] Read more.

Autonomous underwater vehicles equipped with target recognition algorithms are a primary means of removing marine debris. However, due to poor underwater visibility, light scattering by suspended particles, and the coexistence of organisms and debris, current methods have problems such as poor recognition and classification effects, slow recognition speed, and weak generalization ability. In response to these problems, this article proposes a marine debris identification and classification algorithm based on improved YOLOv8. The algorithm incorporates the CloFormer module, a context-aware local enhancement mechanism, into the backbone network, fully utilizing shared and context-aware weights. Consequently, it enhances high- and low-frequency feature extraction from underwater debris images. The proposed C2f-spatial and channel reconstruction (C2f-SCConv) module combines the SCConv module with the neck C2f module to reduce spatial and channel redundancy in standard convolutions and enhance feature representation. WIoU v3 is employed as the bounding box regression loss function, effectively managing low- and high-quality samples to improve overall model performance. The experimental results on the TrashCan-Instance dataset indicate that compared to the classical YOLOv8, the mAP@0.5 and F1 scores are increased by 5.7% and 6%, respectively. Meanwhile, on the TrashCan-Material dataset, the mAP@0.5 and F1 scores also improve, by 5.5% and 5%, respectively. Additionally, the model size has been reduced by 12.9%. These research results are conducive to maintaining marine life safety and ecosystem stability. Full article

(This article belongs to the Special Issue Application of Deep Learning in Underwater Image Processing)

► Show Figures

Figure 1

12 pages, 3550 KiB

Open AccessArticle

Deep Learning Based Characterization of Cold-Water Coral Habitat at Central Cantabrian Natura 2000 Sites Using YOLOv8

by Alberto Gayá-Vilar, Alberto Abad-Uribarren, Augusto Rodríguez-Basalo, Pilar Ríos, Javier Cristobo and Elena Prado

J. Mar. Sci. Eng. 2024, 12(9), 1617; https://doi.org/10.3390/jmse12091617 - 11 Sep 2024

Cited by 1 | Viewed by 1172

Abstract

Cold-water coral (CWC) reefs, such as those formed by Desmophyllum pertusum and Madrepora oculata, are vital yet vulnerable marine ecosystems (VMEs). The need for accurate and efficient monitoring of these habitats has driven the exploration of innovative approaches. This study presents a [...] Read more.

Cold-water coral (CWC) reefs, such as those formed by Desmophyllum pertusum and Madrepora oculata, are vital yet vulnerable marine ecosystems (VMEs). The need for accurate and efficient monitoring of these habitats has driven the exploration of innovative approaches. This study presents a novel application of the YOLOv8l-seg deep learning model for the automated detection and segmentation of these key CWC species in underwater imagery. The model was trained and validated on images collected at two Natura 2000 sites in the Cantabrian Sea: the Avilés Canyon System (ACS) and El Cachucho Seamount (CSM). Results demonstrate the model’s high accuracy in identifying and delineating individual coral colonies, enabling the assessment of coral cover and spatial distribution. The study revealed significant variability in coral cover between and within the study areas, highlighting the patchy nature of CWC habitats. Three distinct coral community groups were identified based on percentage coverage composition and abundance, with the highest coral cover group being located exclusively in the La Gaviera canyon head within the ACS. This research underscores the potential of deep learning models for efficient and accurate monitoring of VMEs, facilitating the acquisition of high-resolution data essential for understanding CWC distribution, abundance, and community structure, and ultimately contributing to the development of effective conservation strategies. Full article

(This article belongs to the Special Issue Application of Deep Learning in Underwater Image Processing)

► Show Figures

Figure 1

18 pages, 22988 KiB

Open AccessArticle

MEvo-GAN: A Multi-Scale Evolutionary Generative Adversarial Network for Underwater Image Enhancement

by Feiran Fu, Peng Liu, Zhen Shao, Jing Xu and Ming Fang

J. Mar. Sci. Eng. 2024, 12(7), 1210; https://doi.org/10.3390/jmse12071210 - 18 Jul 2024

Cited by 2 | Viewed by 1148

Abstract

In underwater imaging, achieving high-quality imagery is essential but challenging due to factors such as wavelength-dependent absorption and complex lighting dynamics. This paper introduces MEvo-GAN, a novel methodology designed to address these challenges by combining generative adversarial networks with genetic algorithms. The key [...] Read more.

In underwater imaging, achieving high-quality imagery is essential but challenging due to factors such as wavelength-dependent absorption and complex lighting dynamics. This paper introduces MEvo-GAN, a novel methodology designed to address these challenges by combining generative adversarial networks with genetic algorithms. The key innovation lies in the integration of genetic algorithm principles with multi-scale generator and discriminator structures in Generative Adversarial Networks (GANs). This approach enhances image details and structural integrity while significantly improving training stability. This combination enables more effective exploration and optimization of the solution space, leading to reduced oscillation, mitigated mode collapse, and smoother convergence to high-quality generative outcomes. By analyzing various public datasets in a quantitative and qualitative manner, the results confirm the effectiveness of MEvo-GAN in improving the clarity, color fidelity, and detail accuracy of underwater images. The results of the experiments on the UIEB dataset are remarkable, with MEvo-GAN attaining a Peak Signal-to-Noise Ratio (PSNR) of 21.2758, Structural Similarity Index (SSIM) of 0.8662, and Underwater Color Image Quality Evaluation (UCIQE) of 0.6597. Full article

(This article belongs to the Special Issue Application of Deep Learning in Underwater Image Processing)

► Show Figures

Figure 1

17 pages, 7982 KiB

Open AccessArticle

Deep Dynamic Weights for Underwater Image Restoration

by Hafiz Shakeel Ahmad Awan and Muhammad Tariq Mahmood

J. Mar. Sci. Eng. 2024, 12(7), 1208; https://doi.org/10.3390/jmse12071208 - 18 Jul 2024

Viewed by 961

Abstract

Underwater imaging presents unique challenges, notably color distortions and reduced contrast due to light attenuation and scattering. Most underwater image enhancement methods first use linear transformations for color compensation and then enhance the image. We observed that linear transformation for color compensation is [...] Read more.

Underwater imaging presents unique challenges, notably color distortions and reduced contrast due to light attenuation and scattering. Most underwater image enhancement methods first use linear transformations for color compensation and then enhance the image. We observed that linear transformation for color compensation is not suitable for certain images. For such images, non-linear mapping is a better choice. This paper introduces a unique underwater image restoration approach leveraging a streamlined convolutional neural network (CNN) for dynamic weight learning for linear and non-linear mapping. In the first phase, a classifier is applied that classifies the input images as Type I or Type II. In the second phase, we use the Deep Line Model (DLM) for Type-I images and the Deep Curve Model (DCM) for Type-II images. For mapping an input image to an output image, the DLM creatively combines color compensation and contrast adjustment in a single step and uses deep lines for transformation, whereas the DCM employs higher-order curves. Both models utilize lightweight neural networks that learn per-pixel dynamic weights based on the input image’s characteristics. Comprehensive evaluations on benchmark datasets using metrics like peak signal-to-noise ratio (PSNR) and root mean square error (RMSE) affirm our method’s effectiveness in accurately restoring underwater images, outperforming existing techniques. Full article

(This article belongs to the Special Issue Application of Deep Learning in Underwater Image Processing)

► Show Figures

Figure 1

Journal Menu

Journal Browser

Application of Deep Learning in Underwater Image Processing

Share This Special Issue

Special Issue Editors

Special Issue Information

Keywords

Benefits of Publishing in a Special Issue

Published Papers (8 papers)

Research

Further Information

Guidelines

MDPI Initiatives

Follow MDPI