This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
YOLO-SRMX:A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles
1
School of Computer and Communication Engineering, Northeastern University, Qinhuangdao 066004, China
2
School of Computer Science and Engineering, Northeastern University, Shenyang 110819, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(13), 2313; https://doi.org/10.3390/rs17132313 (registering DOI)
Submission received: 22 May 2025
/
Revised: 24 June 2025
/
Accepted: 3 July 2025
/
Published: 5 July 2025
Abstract
Unmanned Aerial Vehicles (UAVs) face a significant challenge in balancing high accuracy and high efficiency when performing real-time object detection tasks, especially amidst intricate backgrounds, diverse target scales, and stringent onboard computational resource constraints. To tackle these difficulties, this study introduces YOLO-SRMX, a lightweight real-time object detection framework specifically designed for infrared imagery captured by UAVs. Firstly, the model utilizes ShuffleNetV2 as an efficient lightweight backbone and integrates the novel Multi-Scale Dilated Attention (MSDA) module. This strategy not only facilitates a substantial 46.4% reduction in parameter volume but also, through the flexible adaptation of receptive fields, boosts the model’s robustness and precision in multi-scale object recognition tasks. Secondly, within the neck network, multi-scale feature extraction is facilitated through the design of novel composite convolutions, ConvX and MConv, based on a “split–differentiate–concatenate” paradigm. Furthermore, the lightweight GhostConv is incorporated to reduce model complexity. By synthesizing these principles, a novel composite receptive field lightweight convolution, DRFAConvP, is proposed to further optimize multi-scale feature fusion efficiency and promote model lightweighting. Finally, the Wise-IoU loss function is adopted to replace the traditional bounding box loss. This is coupled with a dynamic non-monotonic focusing mechanism formulated using the concept of outlier degrees. This mechanism intelligently assigns elevated gradient weights to anchor boxes of moderate quality by assessing their relative outlier degree, while concurrently diminishing the gradient contributions from both high-quality and low-quality anchor boxes. Consequently, this approach enhances the model’s localization accuracy for small targets in complex scenes. Experimental evaluations on the HIT-UAV dataset corroborate that YOLO-SRMX achieves an of 82.8%, representing a 7.81% improvement over the baseline YOLOv8s model; an F1 score of 80%, marking a 3.9% increase; and a substantial 65.3% reduction in computational cost (GFLOPs). YOLO-SRMX demonstrates an exceptional trade-off between detection accuracy and operational efficiency, thereby underscoring its considerable potential for efficient and precise object detection on resource-constrained UAV platforms.
Share and Cite
MDPI and ACS Style
Weng, S.; Wang, H.; Wang, J.; Xu, C.; Zhang, E.
YOLO-SRMX:A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles. Remote Sens. 2025, 17, 2313.
https://doi.org/10.3390/rs17132313
AMA Style
Weng S, Wang H, Wang J, Xu C, Zhang E.
YOLO-SRMX:A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles. Remote Sensing. 2025; 17(13):2313.
https://doi.org/10.3390/rs17132313
Chicago/Turabian Style
Weng, Shimin, Han Wang, Jiashu Wang, Changming Xu, and Ende Zhang.
2025. "YOLO-SRMX:A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles" Remote Sensing 17, no. 13: 2313.
https://doi.org/10.3390/rs17132313
APA Style
Weng, S., Wang, H., Wang, J., Xu, C., & Zhang, E.
(2025). YOLO-SRMX:A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles. Remote Sensing, 17(13), 2313.
https://doi.org/10.3390/rs17132313
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.