This is an early access version; the complete PDF, HTML, and XML versions will be available soon.
Article

YOLO-SRMX: A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles

1
School of Computer and Communication Engineering, Northeastern University, Qinhuangdao 066004, China
2
School of Computer Science and Engineering, Northeastern University, Shenyang 110819, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(13), 2313; https://doi.org/10.3390/rs17132313 (registering DOI)
Submission received: 22 May 2025 / Revised: 24 June 2025 / Accepted: 3 July 2025 / Published: 5 July 2025

Abstract

Unmanned Aerial Vehicles (UAVs) must balance accuracy against efficiency when performing real-time object detection, particularly given complex backgrounds, widely varying target scales, and tight onboard computational budgets. To address these difficulties, this study introduces YOLO-SRMX, a lightweight real-time object detection framework designed for infrared imagery captured by UAVs. First, the model uses ShuffleNetV2 as a lightweight backbone and integrates the novel Multi-Scale Dilated Attention (MSDA) module. This combination reduces the parameter count by 46.4% and, by adapting receptive fields flexibly, improves the model’s robustness and precision in multi-scale object recognition. Second, in the neck network, two novel composite convolutions, ConvX and MConv, perform multi-scale feature extraction following a “split–differentiate–concatenate” paradigm, and the lightweight GhostConv is incorporated to reduce model complexity. Building on these principles, a novel composite receptive field lightweight convolution, DRFAConvP, is proposed to further improve the efficiency of multi-scale feature fusion and make the model lighter. Finally, the Wise-IoU loss function replaces the traditional bounding box loss. It is coupled with a dynamic non-monotonic focusing mechanism based on outlier degree: the mechanism assigns larger gradient weights to anchor boxes of moderate quality, as measured by their relative outlier degree, while reducing the gradient contributions of both high-quality and low-quality anchor boxes. This improves the model’s localization accuracy for small targets in complex scenes.
Experiments on the HIT-UAV dataset show that YOLO-SRMX achieves an mAP50 of 82.8%, a 7.81% improvement over the baseline YOLOv8s model; an F1 score of 80%, a 3.9% increase; and a 65.3% reduction in computational cost (GFLOPs). YOLO-SRMX thus offers a favorable trade-off between detection accuracy and operational efficiency, indicating its potential for efficient and precise object detection on resource-constrained UAV platforms.
Keywords: Unmanned Aerial Vehicle (UAV); object detection; lightweight model; YOLO; real-time detection
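
The non-monotonic focusing behaviour described in the abstract can be illustrated with a short sketch. The abstract does not say which Wise-IoU variant or hyperparameters YOLO-SRMX uses, so the code below assumes the focusing coefficient r = β / (δ · α^(β − δ)) from the Wise-IoU v3 formulation, with α = 1.9 and δ = 3 as illustrative values. The function names are hypothetical and are not taken from the article.

```python
import math


def outlier_degree(iou_loss, mean_iou_loss):
    """Outlier degree beta: an anchor box's IoU loss relative to the
    running mean IoU loss. Small beta = high-quality box, large beta =
    low-quality box. (Assumed definition, following Wise-IoU v3.)"""
    if mean_iou_loss <= 0:
        raise ValueError("mean_iou_loss must be positive")
    return iou_loss / mean_iou_loss


def focusing_coefficient(beta, alpha=1.9, delta=3.0):
    """Non-monotonic gradient gain r = beta / (delta * alpha**(beta - delta)).
    alpha and delta are assumed hyperparameters, not values from the article."""
    if beta < 0:
        raise ValueError("beta must be non-negative")
    return beta / (delta * alpha ** (beta - delta))


def peak_outlier_degree(alpha=1.9):
    """Outlier degree at which the gain is largest: setting dr/dbeta = 0
    gives beta = 1 / ln(alpha)."""
    return 1.0 / math.log(alpha)


if __name__ == "__main__":
    # Tabulate the gain for high-, moderate-, and low-quality boxes.
    for beta in (0.1, 0.5, peak_outlier_degree(), 3.0, 6.0, 10.0):
        print(f"beta={beta:6.3f}  gain={focusing_coefficient(beta):.4f}")
```

Under these assumptions the gain rises from near zero for very small β, peaks at β = 1 / ln α, and decays again for large β, which is the sense in which moderate-quality anchor boxes receive the largest gradient weight.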

Share and Cite

MDPI and ACS Style

Weng, S.; Wang, H.; Wang, J.; Xu, C.; Zhang, E. YOLO-SRMX:A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles. Remote Sens. 2025, 17, 2313. https://doi.org/10.3390/rs17132313

AMA Style

Weng S, Wang H, Wang J, Xu C, Zhang E. YOLO-SRMX:A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles. Remote Sensing. 2025; 17(13):2313. https://doi.org/10.3390/rs17132313

Chicago/Turabian Style

Weng, Shimin, Han Wang, Jiashu Wang, Changming Xu, and Ende Zhang. 2025. "YOLO-SRMX:A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles" Remote Sensing 17, no. 13: 2313. https://doi.org/10.3390/rs17132313

APA Style

Weng, S., Wang, H., Wang, J., Xu, C., & Zhang, E. (2025). YOLO-SRMX:A Lightweight Model for Real-Time Object Detection on Unmanned Aerial Vehicles. Remote Sensing, 17(13), 2313. https://doi.org/10.3390/rs17132313

