Next Article in Journal
Flood Susceptibility Assessment in Urban Areas via Deep Neural Network Approach
Previous Article in Journal
Spatial and Temporal Matching Measurement of Ecosystem Service Supply, Demand and Human Well-Being and Its Coordination in the Great Rivers Economic Belt—Evidence from China’s Yangtze River Economic Belt
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Enhancing Underwater Object Detection and Classification Using Advanced Imaging Techniques: A Novel Approach with Diffusion Models

by
Prabhavathy Pachaiyappan
1,*,
Gopinath Chidambaram
2,
Abu Jahid
3 and
Mohammed H. Alsharif
4,*
1
Department of Computer Science and Engineering, College of Engineering Guindy (CEG) Campus, Anna University, Chennai 600025, India
2
Department of Electrical and Electronics Engineering, Sri Venkateswara College of Engineering, Sriperumbudur 602117, India
3
School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON K1N 6N5, Canada
4
Department of Electrical Engineering, College of Electronics and Information Engineering, Sejong University, Seoul 05006, Republic of Korea
*
Authors to whom correspondence should be addressed.
Sustainability 2024, 16(17), 7488; https://doi.org/10.3390/su16177488
Submission received: 19 July 2024 / Revised: 24 August 2024 / Accepted: 27 August 2024 / Published: 29 August 2024

Abstract

Underwater object detection and classification pose significant challenges due to environmental factors such as water turbidity and variable lighting conditions. This research proposes a novel approach that integrates advanced imaging techniques with diffusion models to address these challenges effectively, aligning with Sustainable Development Goal (SDG) 14: Life Below Water. The methodology leverages the Convolutional Block Attention Module (CBAM), Modified Swin Transformer Block (MSTB), and Diffusion model to enhance the quality of underwater images, thereby improving the accuracy of object detection and classification tasks. This study utilizes the TrashCan dataset, comprising diverse underwater scenes and objects, to validate the proposed method’s efficacy. This study proposes an advanced imaging technique YOLO (you only look once) network (AIT-YOLOv7) for detecting objects in underwater images. This network uses a modified U-Net, which focuses on informative features using a convolutional block channel and spatial attentions for color correction and a modified swin transformer block for resolution enhancement. A novel diffusion model proposed using modified U-Net with ResNet understands the intricate structures in images with underwater objects, which enhances detection capabilities under challenging visual conditions. Thus, AIT-YOLOv7 net precisely detects and classifies different classes of objects present in this dataset. These improvements are crucial for applications in marine ecology research, underwater archeology, and environmental monitoring, where precise identification of marine debris, biological organisms, and submerged artifacts is essential. The proposed framework advances underwater imaging technology and supports the sustainable management of marine resources and conservation efforts. The experimental results demonstrate that state-of-the-art object detection methods, namely SSD, YOLOv3, YOLOv4, and YOLOTrashCan, achieve mean accuracies ([email protected]) of 57.19%, 58.12%, 59.78%, and 65.01%, respectively, whereas the proposed AIT-YOLOv7 net reaches a mean accuracy ([email protected]) of 81.4% on the TrashCan dataset, showing a 16.39% improvement. Due to this improvement in the accuracy and efficiency of underwater object detection, this research contributes to broader marine science and technology efforts, promoting the better understanding and management of aquatic ecosystems and helping to prevent and reduce the marine pollution, as emphasized in SDG 14.
Keywords: underwater object detection; Sustainable Development Goal (SDG) 14; diffusion models; Convolutional Block Attention Module (CBAM); Modified Swin Transformer Block (MSTB); marine debris detection underwater object detection; Sustainable Development Goal (SDG) 14; diffusion models; Convolutional Block Attention Module (CBAM); Modified Swin Transformer Block (MSTB); marine debris detection

Share and Cite

MDPI and ACS Style

Pachaiyappan, P.; Chidambaram, G.; Jahid, A.; Alsharif, M.H. Enhancing Underwater Object Detection and Classification Using Advanced Imaging Techniques: A Novel Approach with Diffusion Models. Sustainability 2024, 16, 7488. https://doi.org/10.3390/su16177488

AMA Style

Pachaiyappan P, Chidambaram G, Jahid A, Alsharif MH. Enhancing Underwater Object Detection and Classification Using Advanced Imaging Techniques: A Novel Approach with Diffusion Models. Sustainability. 2024; 16(17):7488. https://doi.org/10.3390/su16177488

Chicago/Turabian Style

Pachaiyappan, Prabhavathy, Gopinath Chidambaram, Abu Jahid, and Mohammed H. Alsharif. 2024. "Enhancing Underwater Object Detection and Classification Using Advanced Imaging Techniques: A Novel Approach with Diffusion Models" Sustainability 16, no. 17: 7488. https://doi.org/10.3390/su16177488

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop