Next Article in Journal
Detection of Cervical Lesion Cell/Clumps Based on Adaptive Feature Extraction
Previous Article in Journal
Effects of Acute Mental Stress on Choroidal Thickness
Previous Article in Special Issue
Auscultation-Based Pulmonary Disease Detection through Parallel Transformation and Deep Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Reshaping Bioacoustics Event Detection: Leveraging Few-Shot Learning (FSL) with Transductive Inference and Data Augmentation

by
Nouman Ijaz
1,
Farhad Banoori
2,3 and
Insoo Koo
1,*
1
Department of Electrical, Electronics and Computer Engineering, University of Ulsan, Ulsan 44610, Republic of Korea
2
School of Electronics and Information Engineering, South China University of Technology, Guangzhou 510641, China
3
Faculty of Computer Sciences, Department of Computer Science, ILMA University, Karachi City 74900, Pakistan
*
Author to whom correspondence should be addressed.
Bioengineering 2024, 11(7), 685; https://doi.org/10.3390/bioengineering11070685
Submission received: 21 May 2024 / Revised: 30 June 2024 / Accepted: 3 July 2024 / Published: 5 July 2024
(This article belongs to the Special Issue IoT Technology in Bioengineering Applications)

Abstract

Bioacoustic event detection is a demanding endeavor involving recognizing and classifying the sounds animals make in their natural habitats. Traditional supervised learning requires a large amount of labeled data, which are hard to come by in bioacoustics. This paper presents a few-shot learning (FSL) method incorporating transductive inference and data augmentation to address the issues of too few labeled events and small volumes of recordings. Here, transductive inference iteratively alters class prototypes and feature extractors to seize essential patterns, whereas data augmentation applies SpecAugment on Mel spectrogram features to augment training data. The proposed approach is evaluated by using the Detecting and Classifying Acoustic Scenes and Events (DCASE) 2022 and 2021 datasets. Extensive experimental results demonstrate that all components of the proposed method achieve significant F-score improvements of 27% and 10%, for the DCASE-2022 and DCASE-2021 datasets, respectively, compared to recent advanced approaches. Moreover, our method is helpful in FSL tasks because it effectively adapts to sounds from various animal species, recordings, and durations.
Keywords: few-shot learning (FSL); bioacoustics event detection; transductive inference; data augmentation few-shot learning (FSL); bioacoustics event detection; transductive inference; data augmentation

Graphical Abstract

Share and Cite

MDPI and ACS Style

Ijaz, N.; Banoori, F.; Koo, I. Reshaping Bioacoustics Event Detection: Leveraging Few-Shot Learning (FSL) with Transductive Inference and Data Augmentation. Bioengineering 2024, 11, 685. https://doi.org/10.3390/bioengineering11070685

AMA Style

Ijaz N, Banoori F, Koo I. Reshaping Bioacoustics Event Detection: Leveraging Few-Shot Learning (FSL) with Transductive Inference and Data Augmentation. Bioengineering. 2024; 11(7):685. https://doi.org/10.3390/bioengineering11070685

Chicago/Turabian Style

Ijaz, Nouman, Farhad Banoori, and Insoo Koo. 2024. "Reshaping Bioacoustics Event Detection: Leveraging Few-Shot Learning (FSL) with Transductive Inference and Data Augmentation" Bioengineering 11, no. 7: 685. https://doi.org/10.3390/bioengineering11070685

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop