Next Article in Journal
A Method for Airborne Small-Target Detection with a Multimodal Fusion Framework Integrating Photometric Perception and Cross-Attention Mechanisms
Previous Article in Journal
Comparison of the Reflectivities from Precipitation Measurement Radar Onboard the FY-3G Satellite and Ground-Based S-Band Dual-Polarization Radars
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

DLNet: A Dual-Level Network with Self- and Cross-Attention for High-Resolution Remote Sensing Segmentation

1
School of Computer Science and Technology, Xi’an University of Posts and Telecommunications, Xi’an 710121, China
2
School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 101408, China
3
Department of Management, Kean University, Union, NJ 07083, USA
4
Department of Computer Science and Technology, Kean University, Union, NJ 07083, USA
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(7), 1119; https://doi.org/10.3390/rs17071119
Submission received: 23 February 2025 / Revised: 18 March 2025 / Accepted: 19 March 2025 / Published: 21 March 2025
(This article belongs to the Section Remote Sensing Image Processing)

Abstract

With advancements in remote sensing technologies, high-resolution imagery has become increasingly accessible, supporting applications in urban planning, environmental monitoring, and precision agriculture. However, semantic segmentation of such imagery remains challenging due to complex spatial structures, fine-grained details, and land cover variations. Existing methods often struggle with ineffective feature representation, suboptimal fusion of global and local information, and high computational costs, limiting segmentation accuracy and efficiency. To address these challenges, we propose the dual-level network (DLNet), an enhanced framework incorporating self-attention and cross-attention mechanisms for improved multi-scale feature extraction and fusion. The self-attention module captures long-range dependencies to enhance contextual understanding, while the cross-attention module facilitates bidirectional interaction between global and local features, improving spatial coherence and segmentation quality. Additionally, DLNet optimizes computational efficiency by balancing feature refinement and memory consumption, making it suitable for large-scale remote sensing applications. Extensive experiments on benchmark datasets, including DeepGlobe and Inria Aerial, demonstrate that DLNet achieves state-of-the-art segmentation accuracy while maintaining computational efficiency. On the DeepGlobe dataset, DLNet achieves a 76.9% mean intersection over union (mIoU), outperforming existing models such as GLNet (71.6%) and EHSNet (76.3%), while requiring lower memory (1443 MB) and maintaining a competitive inference speed of 518.3 ms per image. On the Inria Aerial dataset, DLNet attains an mIoU of 73.6%, surpassing GLNet (71.2%) while reducing computational cost and achieving an inference speed of 119.4 ms per image. These results highlight DLNet’s effectiveness in achieving precise and efficient segmentation in high-resolution remote sensing imagery.
Keywords: high-resolution imagery; remote sensing; semantic segmentation; self-attention; cross-attention high-resolution imagery; remote sensing; semantic segmentation; self-attention; cross-attention
Graphical Abstract

Share and Cite

MDPI and ACS Style

Meng, W.; Shan, L.; Ma, S.; Liu, D.; Hu, B. DLNet: A Dual-Level Network with Self- and Cross-Attention for High-Resolution Remote Sensing Segmentation. Remote Sens. 2025, 17, 1119. https://doi.org/10.3390/rs17071119

AMA Style

Meng W, Shan L, Ma S, Liu D, Hu B. DLNet: A Dual-Level Network with Self- and Cross-Attention for High-Resolution Remote Sensing Segmentation. Remote Sensing. 2025; 17(7):1119. https://doi.org/10.3390/rs17071119

Chicago/Turabian Style

Meng, Weijun, Lianlei Shan, Sugang Ma, Dan Liu, and Bin Hu. 2025. "DLNet: A Dual-Level Network with Self- and Cross-Attention for High-Resolution Remote Sensing Segmentation" Remote Sensing 17, no. 7: 1119. https://doi.org/10.3390/rs17071119

APA Style

Meng, W., Shan, L., Ma, S., Liu, D., & Hu, B. (2025). DLNet: A Dual-Level Network with Self- and Cross-Attention for High-Resolution Remote Sensing Segmentation. Remote Sensing, 17(7), 1119. https://doi.org/10.3390/rs17071119

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop