Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images

Yang, Mingji; Xu, Rongyu; Yang, Chunyu; Wu, Haibin; Wang, Aili

doi:10.3390/electronics13245014

Open AccessArticle

Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images

by

Mingji Yang

^1,*,

Rongyu Xu

¹,

Chunyu Yang

^2,3,

Haibin Wu

¹

and

Aili Wang

¹

Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin 150080, China

²

School of Astronautics, Harbin Institute of Technology, Harbin 150001, China

³

The 54th Research Institute of China Electronics Technology Group Corporation, Shijiazhuang 050081, China

^*

Author to whom correspondence should be addressed.

Electronics 2024, 13(24), 5014; https://doi.org/10.3390/electronics13245014

Submission received: 5 November 2024 / Revised: 17 December 2024 / Accepted: 19 December 2024 / Published: 20 December 2024

(This article belongs to the Special Issue New Insights in 2D and 3D Object Detection and Semantic Segmentation)

Download

Browse Figures

Versions Notes

Abstract

Currently, embedded unmanned aerial vehicle (UAV) systems face significant challenges in balancing detection accuracy and computational efficiency when processing remote sensing images with complex backgrounds, small objects, and occlusions. This paper proposes the Hybrid-DETR model based on a real-time end-to-end Detection Transformer (RT-DETR), featuring a novel HybridNet backbone network that implements a differentiated hybrid structure through lightweight RepConv Cross-stage Partial Efficient Layer Aggregation Network (RCSPELAN) modules and the Heat-Transfer Cross-stage Fusion (HTCF) modules, effectively balancing feature extraction efficiency and global perception capabilities. Additionally, we introduce a Small-Object Detection Module (SODM) and an EIFI module to enhance the detection capability of small objects in complex scenarios, while employing the Focaler-Shape-IoU loss function to optimize bounding box regression. Experimental results on the VisDrone2019 dataset demonstrate that Hybrid-DETR achieves mAP50 and mAP50:95 scores of 52.2% and 33.3%, respectively, representing improvements of 5.2% and 4.3% compared to RT-DETR-R18, while reducing model parameters by 29.33%. The effectiveness and robustness of our improved method are further validated on multiple challenging datasets, including AI-TOD and HIT-UAV.

Keywords: UAV remote sensing images; RT-DETR; small-object detection; feature fusion; Focaler-Shape-IoU

Share and Cite

MDPI and ACS Style

Yang, M.; Xu, R.; Yang, C.; Wu, H.; Wang, A. Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images. Electronics 2024, 13, 5014. https://doi.org/10.3390/electronics13245014

AMA Style

Yang M, Xu R, Yang C, Wu H, Wang A. Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images. Electronics. 2024; 13(24):5014. https://doi.org/10.3390/electronics13245014

Chicago/Turabian Style

Yang, Mingji, Rongyu Xu, Chunyu Yang, Haibin Wu, and Aili Wang. 2024. "Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images" Electronics 13, no. 24: 5014. https://doi.org/10.3390/electronics13245014

APA Style

Yang, M., Xu, R., Yang, C., Wu, H., & Wang, A. (2024). Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images. Electronics, 13(24), 5014. https://doi.org/10.3390/electronics13245014

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Hybrid-DETR: A Differentiated Module-Based Model for Object Detection in Remote Sensing Images

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI