Next Article in Journal
Multi-Modal Compliant Quadruped Robot Based on CPG Control Network
Next Article in Special Issue
An In-Depth Analysis of 2D and 3D Pose Estimation Techniques in Deep Learning: Methodologies and Advances
Previous Article in Journal
Enhanced Sagger Crack Detection Integrating Deep Learning and Machine Vision
Previous Article in Special Issue
A Classification and Segmentation Model for Diamond Abrasive Grains Based on Improved Swin-Unet-SAM
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images

1
Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin 150080, China
2
School of Astronautics, Harbin Institute of Technology, Harbin 150001, China
3
The 54th Research Institute of China Electronics Technology Group Corporation, Shijiazhuang 050081, China
*
Author to whom correspondence should be addressed.
Electronics 2024, 13(24), 5014; https://doi.org/10.3390/electronics13245014
Submission received: 5 November 2024 / Revised: 17 December 2024 / Accepted: 19 December 2024 / Published: 20 December 2024
(This article belongs to the Special Issue New Insights in 2D and 3D Object Detection and Semantic Segmentation)

Abstract

Currently, embedded unmanned aerial vehicle (UAV) systems face significant challenges in balancing detection accuracy and computational efficiency when processing remote sensing images with complex backgrounds, small objects, and occlusions. This paper proposes the Hybrid-DETR model based on a real-time end-to-end Detection Transformer (RT-DETR), featuring a novel HybridNet backbone network that implements a differentiated hybrid structure through lightweight RepConv Cross-stage Partial Efficient Layer Aggregation Network (RCSPELAN) modules and the Heat-Transfer Cross-stage Fusion (HTCF) modules, effectively balancing feature extraction efficiency and global perception capabilities. Additionally, we introduce a Small-Object Detection Module (SODM) and an EIFI module to enhance the detection capability of small objects in complex scenarios, while employing the Focaler-Shape-IoU loss function to optimize bounding box regression. Experimental results on the VisDrone2019 dataset demonstrate that Hybrid-DETR achieves mAP50 and mAP50:95 scores of 52.2% and 33.3%, respectively, representing improvements of 5.2% and 4.3% compared to RT-DETR-R18, while reducing model parameters by 29.33%. The effectiveness and robustness of our improved method are further validated on multiple challenging datasets, including AI-TOD and HIT-UAV.
Keywords: UAV remote sensing images; RT-DETR; small-object detection; feature fusion; Focaler-Shape-IoU UAV remote sensing images; RT-DETR; small-object detection; feature fusion; Focaler-Shape-IoU

Share and Cite

MDPI and ACS Style

Yang, M.; Xu, R.; Yang, C.; Wu, H.; Wang, A. Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images. Electronics 2024, 13, 5014. https://doi.org/10.3390/electronics13245014

AMA Style

Yang M, Xu R, Yang C, Wu H, Wang A. Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images. Electronics. 2024; 13(24):5014. https://doi.org/10.3390/electronics13245014

Chicago/Turabian Style

Yang, Mingji, Rongyu Xu, Chunyu Yang, Haibin Wu, and Aili Wang. 2024. "Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images" Electronics 13, no. 24: 5014. https://doi.org/10.3390/electronics13245014

APA Style

Yang, M., Xu, R., Yang, C., Wu, H., & Wang, A. (2024). Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images. Electronics, 13(24), 5014. https://doi.org/10.3390/electronics13245014

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop