Next Article in Journal
Vision-Guided Tracking and Emergency Landing for UAVs on Moving Targets
Previous Article in Journal
Dual-Driven Learning-Based Multiple-Input Multiple-Output Signal Detection for Unmanned Aerial Vehicle Air-to-Ground Communications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Fine-Grained Feature Perception for Unmanned Aerial Vehicle Target Detection Algorithm

1
College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
2
College of Information Engineering, Harbin University, Harbin 150086, China
3
College of Artificial Intelligence and Big Data, Hulunbuir University, Hulunbuir 021008, China
4
Heilongjiang Forestry Intelligent Equipment Engineering Research Center, Harbin 150040, China
*
Author to whom correspondence should be addressed.
Drones 2024, 8(5), 181; https://doi.org/10.3390/drones8050181
Submission received: 7 March 2024 / Revised: 26 April 2024 / Accepted: 27 April 2024 / Published: 3 May 2024

Abstract

Unmanned aerial vehicle (UAV) aerial images often present challenges such as small target sizes, high target density, varied shooting angles, and dynamic poses. Existing target detection algorithms exhibit a noticeable performance decline when confronted with UAV aerial images compared to general scenes. This paper proposes an outstanding small target detection algorithm for UAVs, named Fine-Grained Feature Perception YOLOv8s-P2 (FGFP-YOLOv8s-P2), based on YOLOv8s-P2 architecture. We specialize in improving inspection accuracy while meeting real-time inspection requirements. First, we enhance the targets’ pixel information by utilizing slice-assisted training and inference techniques, thereby reducing missed detections. Then, we propose a feature extraction module with deformable convolutions. Decoupling the learning process of offset and modulation scalar enables better adaptation to variations in the size and shape of diverse targets. In addition, we introduce a large kernel spatial pyramid pooling module. By cascading convolutions, we leverage the advantages of large kernels to flexibly adjust the model’s attention to various regions of high-level feature maps, better adapting to complex visual scenes and circumventing the cost drawbacks associated with large kernels. To match the excellent real-time detection performance of the baseline model, we propose an improved Random FasterNet Block. This block introduces randomness during convolution and captures spatial features of non-linear transformation channels, enriching feature representations and enhancing model efficiency. Extensive experiments and comprehensive evaluations on the VisDrone2019 and DOTA-v1.0 datasets demonstrate the effectiveness of FGFP-YOLOv8s-P2. This achievement provides robust technical support for efficient small target detection by UAVs in complex scenarios.
Keywords: unmanned aerial vehicle; small object detection; Fine-Grained Feature; YOLOv8 unmanned aerial vehicle; small object detection; Fine-Grained Feature; YOLOv8

Share and Cite

MDPI and ACS Style

Liu, S.; Zhu, M.; Tao, R.; Ren, H. Fine-Grained Feature Perception for Unmanned Aerial Vehicle Target Detection Algorithm. Drones 2024, 8, 181. https://doi.org/10.3390/drones8050181

AMA Style

Liu S, Zhu M, Tao R, Ren H. Fine-Grained Feature Perception for Unmanned Aerial Vehicle Target Detection Algorithm. Drones. 2024; 8(5):181. https://doi.org/10.3390/drones8050181

Chicago/Turabian Style

Liu, Shi, Meng Zhu, Rui Tao, and Honge Ren. 2024. "Fine-Grained Feature Perception for Unmanned Aerial Vehicle Target Detection Algorithm" Drones 8, no. 5: 181. https://doi.org/10.3390/drones8050181

APA Style

Liu, S., Zhu, M., Tao, R., & Ren, H. (2024). Fine-Grained Feature Perception for Unmanned Aerial Vehicle Target Detection Algorithm. Drones, 8(5), 181. https://doi.org/10.3390/drones8050181

Article Metrics

Back to TopTop