Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (73)

Search Parameters:
Keywords = CBAM-UNet

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
18 pages, 3628 KB  
Article
Extraction of Cotton Cultivation Areas Based on Deep Learning and Sentinel-2 Image Data
by Liyuan Li, Hongfei Tao, Yan Xu, Lixiran Yu, Qiao Li, Hong Xie and Youwei Jiang
Agriculture 2025, 15(16), 1783; https://doi.org/10.3390/agriculture15161783 - 20 Aug 2025
Viewed by 323
Abstract
Cotton is a crucial economic crop, and timely and accurate acquisition of its spatial distribution information is of great significance for yield prediction, as well as for the formulation and adjustment of agricultural policies. To accurately and efficiently extract cotton cultivation areas at [...] Read more.
Cotton is a crucial economic crop, and timely and accurate acquisition of its spatial distribution information is of great significance for yield prediction, as well as for the formulation and adjustment of agricultural policies. To accurately and efficiently extract cotton cultivation areas at a large scale, in this study, we focused on the Santun River Irrigation District in Xinjiang as the research area. Utilizing Sentinel-2 satellite imagery from 2019 to 2024, four cotton extraction models—U-Net, SegNet, DeepLabV3+, and CBAM-UNet—were constructed. The models were evaluated using metrics, including the mean intersection over union (mIoU), precision, recall, F1-score, and over accuracy (OA), to assess the models’ performances in cotton extraction. The results demonstrate that the CBAM-UNet model achieved the highest accuracy, with an mIoU, precision, recall, F1-score, and OA of 84.02%, 88.99%, 94.75%, 91.78%, and 95.56%, respectively. The absolute error of the extracted cotton areas from 2019 to 2024 ranged between 923.69 and 1445.46 hm2, with absolute percentage errors of less than 10%. The coefficient of determination (R2) between the extracted results and statistical data was 0.9817, indicating the best fit. The findings of this study provide technical support for rapid cotton identification and extraction in large- and medium-sized irrigation districts. Full article
Show Figures

Figure 1

16 pages, 1319 KB  
Article
Improved U-Shaped Convolutional Neural Network with Convolutional Block Attention Module and Feature Fusion for Automated Segmentation of Fine Roots in Field Rhizotron Imagery
by Yufan Wang, Fuhao Lu and Changfu Huo
Sensors 2025, 25(16), 4956; https://doi.org/10.3390/s25164956 - 11 Aug 2025
Viewed by 339
Abstract
Accurate segmentation of fine roots in field rhizotron imagery is essential for high-throughput root system analysis but remains challenging due to limitations of traditional methods. Traditional methods for root quantification (e.g., soil coring, manual counting) are labor-intensive, subjective, and low-throughput. These limitations are [...] Read more.
Accurate segmentation of fine roots in field rhizotron imagery is essential for high-throughput root system analysis but remains challenging due to limitations of traditional methods. Traditional methods for root quantification (e.g., soil coring, manual counting) are labor-intensive, subjective, and low-throughput. These limitations are exacerbated in in situ rhizotron imaging, where variable field conditions introduce noise and complex soil backgrounds. To address these challenges, this study develops an advanced deep learning framework for automated segmentation. We propose an improved U-shaped Convolutional Neural Network (U-Net) architecture optimized for segmenting larch (Larix olgensis) fine roots under heterogeneous field conditions, integrating both in situ rhizotron imagery and open-source multi-species minirhizotron datasets. Our approach integrates (1) a Convolutional Block Attention Module (CBAM) to enhance feature representation for fine-root detection; (2) an additive feature fusion strategy (UpAdd) during decoding to preserve morphological details, particularly in low-contrast regions; and (3) a transfer learning protocol to enable robust cross-species generalization. Our model achieves state-of-the-art performance with a mean intersection over union (mIoU) of 70.18%, mean Recall of 86.72%, and mean Precision of 75.89%—significantly outperforming PSPNet, SegNet, and DeepLabV3+ by 13.61%, 13.96%, and 13.27% in mIoU, respectively. Transfer learning further elevates root-specific metrics, yielding absolute gains of +0.47% IoU, +0.59% Precision, and +0.35% F1-score. The improved U-Net segmentation demonstrated strong agreement with the manual method for quantifying fine-root length, particularly for third-order roots, though optimization of lower-order root identification is required to enhance overall accuracy. This work provides a scalable approach for advancing automated root phenotyping and belowground ecological research. Full article
(This article belongs to the Section Smart Agriculture)
Show Figures

Figure 1

26 pages, 7857 KB  
Article
Investigation of an Efficient Multi-Class Cotton Leaf Disease Detection Algorithm That Leverages YOLOv11
by Fangyu Hu, Mairheba Abula, Di Wang, Xuan Li, Ning Yan, Qu Xie and Xuedong Zhang
Sensors 2025, 25(14), 4432; https://doi.org/10.3390/s25144432 - 16 Jul 2025
Viewed by 465
Abstract
Cotton leaf diseases can lead to substantial yield losses and economic burdens. Traditional detection methods are challenged by low accuracy and high labor costs. This research presents the ACURS-YOLO network, an advanced cotton leaf disease detection architecture developed on the foundation of YOLOv11. [...] Read more.
Cotton leaf diseases can lead to substantial yield losses and economic burdens. Traditional detection methods are challenged by low accuracy and high labor costs. This research presents the ACURS-YOLO network, an advanced cotton leaf disease detection architecture developed on the foundation of YOLOv11. By integrating a medical image segmentation model, it effectively tackles challenges including complex background interference, the missed detection of small targets, and restricted generalization ability. Specifically, the U-Net v2 module is embedded in the backbone network to boost the multi-scale feature extraction performance in YOLOv11. Meanwhile, the CBAM attention mechanism is integrated to emphasize critical disease-related features. To lower the computational complexity, the SPPF module is substituted with SimSPPF. The C3k2_RCM module is appended for long–range context modeling, and the ARelu activation function is employed to alleviate the vanishing gradient problem. A database comprising 3000 images covering six types of cotton leaf diseases was constructed, and data augmentation techniques were applied. The experimental results show that ACURS-YOLO attains impressive performance indicators, encompassing a mAP_0.5 value of 94.6%, a mAP_0.5:0.95 value of 83.4%, 95.5% accuracy, 89.3% recall, an F1 score of 92.3%, and a frame rate of 148 frames per second. It outperforms YOLOv11 and other conventional models with regard to both detection precision and overall functionality. Ablation tests additionally validate the efficacy of each component, affirming the framework’s advantage in addressing complex detection environments. This framework provides an efficient solution for the automated monitoring of cotton leaf diseases, advancing the development of smart sensors through improved detection accuracy and practical applicability. Full article
(This article belongs to the Section Smart Agriculture)
Show Figures

Figure 1

20 pages, 3602 KB  
Article
Dust Aerosol Classification in Northwest China Using CALIPSO Data and an Enhanced 1D U-Net Network
by Xin Gong, Delong Xiu, Xiaoling Sun, Ruizhao Zhang, Jiandong Mao, Hu Zhao and Zhimin Rao
Atmosphere 2025, 16(7), 812; https://doi.org/10.3390/atmos16070812 - 2 Jul 2025
Viewed by 388
Abstract
Dust aerosols significantly affect climate and air quality in Northwest China (30–50° N, 70–110° E), where frequent dust storms complicate accurate aerosol classification when using CALIPSO satellite data. This study introduces an Enhanced 1D U-Net model to enhance dust aerosol retrieval, incorporating Inception [...] Read more.
Dust aerosols significantly affect climate and air quality in Northwest China (30–50° N, 70–110° E), where frequent dust storms complicate accurate aerosol classification when using CALIPSO satellite data. This study introduces an Enhanced 1D U-Net model to enhance dust aerosol retrieval, incorporating Inception modules for multi-scale feature extraction, Transformer blocks for global contextual modeling, CBAM attention mechanisms for improved feature selection, and residual connections for training stability. Using CALIPSO Level 1B and Level 2 Vertical Feature Mask (VFM) data from 2015 to 2020, the model processed backscatter coefficients, polarization characteristics, and color ratios at 532 nm and 1064 nm to classify aerosol types. The model achieved a precision of 94.11%, recall of 99.88%, and F1 score of 96.91% for dust aerosols, outperforming baseline models. Dust aerosols were predominantly detected between 0.44 and 4 km, consistent with observations from CALIPSO. These results highlight the model’s potential to improve climate modeling and air quality monitoring, providing a scalable framework for future atmospheric research. Full article
(This article belongs to the Section Aerosols)
Show Figures

Figure 1

31 pages, 6788 KB  
Article
A Novel Dual-Modal Deep Learning Network for Soil Salinization Mapping in the Keriya Oasis Using GF-3 and Sentinel-2 Imagery
by Ilyas Nurmemet, Yang Xiang, Aihepa Aihaiti, Yu Qin, Yilizhati Aili, Hengrui Tang and Ling Li
Agriculture 2025, 15(13), 1376; https://doi.org/10.3390/agriculture15131376 - 27 Jun 2025
Viewed by 546
Abstract
Soil salinization poses a significant threat to agricultural productivity, food security, and ecological sustainability in arid and semi-arid regions. Effectively and timely mapping of different degrees of salinized soils is essential for sustainable land management and ecological restoration. Although deep learning (DL) methods [...] Read more.
Soil salinization poses a significant threat to agricultural productivity, food security, and ecological sustainability in arid and semi-arid regions. Effectively and timely mapping of different degrees of salinized soils is essential for sustainable land management and ecological restoration. Although deep learning (DL) methods have been widely employed for soil salinization extraction from remote sensing (RS) data, the integration of multi-source RS data with DL methods remains challenging due to issues such as limited data availability, speckle noise, geometric distortions, and suboptimal data fusion strategies. This study focuses on the Keriya Oasis, Xinjiang, China, utilizing RS data, including Sentinel-2 multispectral and GF-3 full-polarimetric SAR (PolSAR) images, to conduct soil salinization classification. We propose a Dual-Modal deep learning network for Soil Salinization named DMSSNet, which aims to improve the mapping accuracy of salinization soils by effectively fusing spectral and polarimetric features. DMSSNet incorporates self-attention mechanisms and a Convolutional Block Attention Module (CBAM) within a hierarchical fusion framework, enabling the model to capture both intra-modal and cross-modal dependencies and to improve spatial feature representation. Polarimetric decomposition features and spectral indices are jointly exploited to characterize diverse land surface conditions. Comprehensive field surveys and expert interpretation were employed to construct a high-quality training and validation dataset. Experimental results indicate that DMSSNet achieves an overall accuracy of 92.94%, a Kappa coefficient of 79.12%, and a macro F1-score of 86.52%, positively outperforming conventional DL models (ResUNet, SegNet, DeepLabv3+). The results confirm the superiority of attention-guided dual-branch fusion networks for distinguishing varying degrees of soil salinization across heterogeneous landscapes and highlight the value of integrating Sentinel-2 optical and GF-3 PolSAR data for complex land surface classification tasks. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)
Show Figures

Figure 1

25 pages, 3081 KB  
Article
A Fire Segmentation Method with Flame Detail Enhancement U-Net in Multispectral Remote Sensing Images Under Category Imbalance
by Rui Zou, Zhihui Xin, Guisheng Liao, Penghui Huang, Rui Wang and Yuhu Qiao
Remote Sens. 2025, 17(13), 2175; https://doi.org/10.3390/rs17132175 - 25 Jun 2025
Viewed by 661
Abstract
Fire poses a serious threat to the global economy, environment, and social stability, highlighting the need for rapid and accurate fire detection. Remote sensing combined with deep learning has outperformed traditional fire assessment methods. However, in early fire stages, small flame areas, class [...] Read more.
Fire poses a serious threat to the global economy, environment, and social stability, highlighting the need for rapid and accurate fire detection. Remote sensing combined with deep learning has outperformed traditional fire assessment methods. However, in early fire stages, small flame areas, class imbalance, and weak feature extraction hinder detection accuracy. This study proposes an end-to-end segmentation model called Flame Detail Enhancement U-Net (FDE U-Net), using Landsat-8 multispectral remote sensing data. The model incorporates the self-Attention and Convolutional mixture (ACmix) module and the Convolutional Block Attention Module (CBAM) into the encoder of the Residual U-Net. ACmix integrates self-attention and convolution to capture global semantic features while maintaining computational efficiency, improving both contextual awareness and local detail. CBAM enhances flame recognition by weighting important channel features and focusing spatially on small flame areas, helping address the class imbalance problem. Additionally, Haar wavelet downsampling is applied to retain image detail and improve the detection of small-scale flame regions. Experimental results show that the FDE U-Net model exhibits robust performance in fire detection, accurately extracting flame regions even when their proportion is low and the background is complex. The F1 score reaches 95.97%, significantly improving the class imbalance problem. Full article
Show Figures

Figure 1

22 pages, 3823 KB  
Article
Large-Scale Apple Orchard Identification from Multi-Temporal Sentinel-2 Imagery
by Chunxiao Wu, Yundan Liu, Jianyu Yang, Anjin Dai, Han Zhou, Kaixuan Tang, Yuxuan Zhang, Ruxin Wang, Binchuan Wei and Yifan Wang
Agronomy 2025, 15(6), 1487; https://doi.org/10.3390/agronomy15061487 - 19 Jun 2025
Cited by 1 | Viewed by 729
Abstract
Accurately extracting large-scale apple orchards from remote sensing imagery is of importance for orchard management. Most studies lack large-scale, high-resolution apple orchard maps due to sparse orchard distribution and similar crops, making mapping difficult. Using phenological information and multi-temporal feature-selected imagery, this paper [...] Read more.
Accurately extracting large-scale apple orchards from remote sensing imagery is of importance for orchard management. Most studies lack large-scale, high-resolution apple orchard maps due to sparse orchard distribution and similar crops, making mapping difficult. Using phenological information and multi-temporal feature-selected imagery, this paper proposed a large-scale apple orchard mapping method based on the AOCF-SegNet model. First, to distinguish apples from other crops, phenological information was used to divide time periods and select optimal phases for each spectral feature, thereby obtaining spectral features integrating phenological and temporal information. Second, semantic segmentation models (FCN-8s, SegNet, U-Net) were com-pared, and SegNet was chosen as the base model for apple orchard identification. Finally, to address the issue of the low proportion of apple orchards in remote sensing images, a Convolutional Block Attention Module (CBAM) and Focal Loss function were integrated into the SegNet model, followed by hyperparameter optimization, resulting in AOCF-SegNet. The results from mapping the Yantai apple orchards indicate that AOCF-SegNet achieved strong segmentation performance, with an overall accuracy of 89.34%. Compared to the SegNet, U-Net, and FCN-8s models, AOCF-SegNet achieved an improvement in overall accuracy by 3%, 6.1%, and 9.6%, respectively. The predicted orchard area exhibited an approximate area consistency of 71.97% with the official statistics. Full article
(This article belongs to the Section Precision and Digital Agriculture)
Show Figures

Figure 1

20 pages, 1669 KB  
Article
Automated Pneumothorax Segmentation with a Spatial Prior Contrast Adapter
by Yiming Jia and Essam A. Rashed
Appl. Sci. 2025, 15(12), 6598; https://doi.org/10.3390/app15126598 - 12 Jun 2025
Viewed by 648
Abstract
Pneumothorax is a critical condition that requires rapid and accurate diagnosis from standard chest radiographs. Identifying and segmenting the location of the pneumothorax are essential for developing an effective treatment plan. nnUNet is a self-configuring, deep learning-based framework for medical image segmentation. Despite [...] Read more.
Pneumothorax is a critical condition that requires rapid and accurate diagnosis from standard chest radiographs. Identifying and segmenting the location of the pneumothorax are essential for developing an effective treatment plan. nnUNet is a self-configuring, deep learning-based framework for medical image segmentation. Despite adjusting its parameters automatically through data-driven optimization strategies and offering robust feature extraction and segmentation capabilities across diverse datasets, our initial experiments revealed that nnUNet alone struggled to achieve consistently accurate segmentation for pneumothorax, particularly in challenging scenarios where subtle intensity variations and anatomical noise obscure the target regions. This study aims to enhance the accuracy and robustness of pneumothorax segmentation in low-contrast chest radiographs by integrating spatial prior information and attention mechanism into the nnUNet framework. In this study, we introduce the spatial prior contrast adapter (SPCA)-enhanced nnUNet by implementing two modules. First, we integrate an SPCA utilizing the MedSAM foundation model to incorporate spatial prior information of the lung region, effectively guiding the segmentation network to focus on anatomically relevant areas. In the meantime, a probabilistic atlas, which shows the probability of an area prone to pneumothorax, is generated based on the ground truth masks. Both the lung segmentation results and the probabilistic atlas are used as attention maps in nnUNet. Second, we combine the two attention maps as additional input into nnUNet and integrate an attention mechanism into standard nnUNet by using a convolutional block attention module (CBAM). We validate our method by experimenting on the dataset CANDID-PTX, a benchmark dataset representing 19,237 chest radiographs. By introducing spatial awareness and intensity adjustments, the model reduces false positives and improves the precision of boundary delineations, ultimately overcoming many of the limitations associated with low-contrast radiographs. Compared with standard nnUNet, SPCA-enhanced nnUNet achieves an average Dice coefficient of 0.81, which indicates an improvement of standard nnUNet by 15%. This study provides a novel approach toward enhancing the segmentation performance of pneumothorax with low contrast in chest X-ray radiographs. Full article
(This article belongs to the Special Issue Applications of Computer Vision and Image Processing in Medicine)
Show Figures

Figure 1

18 pages, 2325 KB  
Article
Enhanced Rail Surface Defect Segmentation Using Polarization Imaging and Dual-Stream Feature Fusion
by Yucheng Pan, Jiasi Chen, Peiwen Wu, Hongsheng Zhong, Zihao Deng and Daozong Sun
Sensors 2025, 25(11), 3546; https://doi.org/10.3390/s25113546 - 4 Jun 2025
Viewed by 652
Abstract
Rail surface defects pose significant risks to the operational efficiency and safety of industrial equipment. Traditional visual defect detection methods typically rely on high-quality RGB images; however, they struggle in low-light conditions due to small, low-contrast defects that blend into complex backgrounds. Therefore, [...] Read more.
Rail surface defects pose significant risks to the operational efficiency and safety of industrial equipment. Traditional visual defect detection methods typically rely on high-quality RGB images; however, they struggle in low-light conditions due to small, low-contrast defects that blend into complex backgrounds. Therefore, this paper proposes a novel defect segmentation method leveraging a dual-stream feature fusion network that combines polarization images with DeepLabV3+. The approach utilizes the pruned MobileNetV3 as the backbone network, incorporating a coordinate attention mechanism for feature extraction. This reduces the number of model parameters and enhances computational efficiency. The dual-stream module implements cascade and addition strategies to effectively merge shallow and deep features from both the original and polarization images. This enhances the detection of low-contrast defects in complex backgrounds. Furthermore, the CBAM is integrated into the decoding area to refine feature fusion and mitigate the issue of missing small-target defects. Experimental results demonstrate that the enhanced DeepLabV3+ model outperforms existing models such as U-Net, PSPNet, and the original DeepLabV3+ in terms of MIoU and MPA metrics, achieving 73.00% and 80.59%, respectively. The comprehensive detection accuracy reaches 97.82%, meeting the demanding requirements for effective rail surface defect detection. Full article
(This article belongs to the Section Industrial Sensors)
Show Figures

Figure 1

20 pages, 2913 KB  
Article
Lightweight Semantic Segmentation Network with Multi-Level Feature Fusion and Dual Attention Collaboration
by Yulong Ma, Xiaoyu Wang, Bo Deng and Yue Yu
Electronics 2025, 14(11), 2244; https://doi.org/10.3390/electronics14112244 - 30 May 2025
Viewed by 988
Abstract
Deep neural networks provide a powerful driving force for breakthroughs in semantic segmentation technology. However, the current mainstream architecture generally falls into the “parameter redundancy trap” in pursuit of accuracy improvement, which brings a large number of calculations and model parameters, forcing researchers [...] Read more.
Deep neural networks provide a powerful driving force for breakthroughs in semantic segmentation technology. However, the current mainstream architecture generally falls into the “parameter redundancy trap” in pursuit of accuracy improvement, which brings a large number of calculations and model parameters, forcing researchers to seek a new structural paradigm balance between pixel-level parsing accuracy and the limited computing power of embedded devices. We propose a lightweight semantic segmentation network with multi-level feature fusion and dual attention coordination. In view of the large number of parameters in the traditional backbone network and the fact that it only outputs semantic features at the end of the network but lacks shallow feature information, it will cause significant information loss in the decoder stage, which may lead to fuzzy segmentation results and the misclassification of categories. We design a lightweight backbone network with multi-level feature fusion capability. The detail recovery capability is enhanced in the reconstruction process layer by constructing a cross-stage feature aggregation module system; secondly, in view of the lack of effective feature attention in previous methods, we propose a new DCA module in the proposed network and introduce CBAM in the multi-level special fusion network at a shallow level, which improves the model’s category discrimination ability with minimal parameter overhead, thereby optimizing feature expression and improving segmentation performance. The results show that in the Cityscapes dataset, the mIoU reaches 75.29% with only 5.82 M parameters. In the Pascal VOC 2012 dataset experiment, the proposed model achieves an mIoU of 74.24% with only 5.869 M parameters. Compared with DCN-Deeplabv3+ network, the parameters comprise 48% of it, but the accuracy is improved by 1.66%. Compared with the UNet and PSPNet models, the parameters are reduced by 86.63% and 87.44%, respectively. Full article
Show Figures

Figure 1

25 pages, 6037 KB  
Article
Extraction of Levees from Paddy Fields Based on the SE-CBAM UNet Model and Remote Sensing Images
by Hongfu Ai, Xiaomeng Zhu, Yongqi Han, Shinai Ma, Yiang Wang, Yihan Ma, Chuan Qin, Xinyi Han, Yaxin Yang and Xinle Zhang
Remote Sens. 2025, 17(11), 1871; https://doi.org/10.3390/rs17111871 - 28 May 2025
Viewed by 659
Abstract
During rice cultivation, extracting levees helps to delineate effective planting areas, thereby enhancing the precision of management zones. This approach is crucial for devising more efficient water field management strategies and has significant implications for water-saving irrigation and fertilizer optimization in rice production. [...] Read more.
During rice cultivation, extracting levees helps to delineate effective planting areas, thereby enhancing the precision of management zones. This approach is crucial for devising more efficient water field management strategies and has significant implications for water-saving irrigation and fertilizer optimization in rice production. The uneven distribution and lack of standardization of levees pose significant challenges for their accurate extraction. However, recent advancements in remote sensing and deep learning technologies have provided viable solutions. In this study, Youyi Farm in Shuangyashan City, Heilongjiang Province, was chosen as the experimental site. We developed the SCA-UNet model by optimizing the UNet algorithm and enhancing its network architecture through the integration of the Convolutional Block Attention Module (CBAM) and Squeeze-and-Excitation Networks (SE). The SCA-UNet model leverages the channel attention strengths of SE while incorporating CBAM to emphasize spatial information. Through a dual-attention collaborative mechanism, the model achieves a synergistic perception of the linear features and boundary information of levees, thereby significantly improving the accuracy of levee extraction. The experimental results demonstrate that the proposed SCA-UNet model and its additional modules offer substantial performance advantages. Our algorithm outperforms existing methods in both computational efficiency and precision. Significance analysis revealed that our method achieved overall accuracy (OA) and F1-score values of 88.4% and 90.6%, respectively. These results validate the efficacy of the multimodal dataset in addressing the issue of ambiguous levee boundaries. Additionally, ablation experiments using 10-fold cross-validation confirmed the effectiveness of the proposed SCA-UNet method. This approach provides a robust technical solution for levee extraction and has the potential to significantly advance precision agriculture. Full article
Show Figures

Graphical abstract

17 pages, 10577 KB  
Article
Research on the Method of Crop Pest and Disease Recognition Based on the Improved YOLOv7-U-Net Combined Network
by Wenchao Xiang, Zitao Du, Xinran Liu, Zehui Lu and Yuna Yin
Appl. Sci. 2025, 15(9), 4864; https://doi.org/10.3390/app15094864 - 27 Apr 2025
Viewed by 541
Abstract
This paper proposes an improved YOLOv7-U-Net combined network for crop pest and disease recognition, aiming to address the issue of insufficient accuracy in existing methods. For the YOLOv7 network, a self-attention mechanism is integrated into the SPPCSPC module to dynamically adjust channel weights [...] Read more.
This paper proposes an improved YOLOv7-U-Net combined network for crop pest and disease recognition, aiming to address the issue of insufficient accuracy in existing methods. For the YOLOv7 network, a self-attention mechanism is integrated into the SPPCSPC module to dynamically adjust channel weights and suppress redundant information while optimizing the PAFPN structure to enhance cross-scale feature fusion and improve small-object detection capabilities. For the U-Net network, the CBAM attention module is added before decoder skip connections, and depth-separable convolutions replace traditional kernels to strengthen feature fusion and detail attention. Experimental results show the improved algorithm achieves 97.49% detection accuracy, with mean average precision (mAP) reaching 96.91% and detection speed increasing to 90.41 FPS. The loss function of the improved U-Net network decreases towards 0 with training iterations, validating its effectiveness. The study shows that the improved YOLOv7-U-Net combined network provides a more effective solution for crop pest and disease detection. Full article
(This article belongs to the Special Issue Advances in Machine Vision for Industry and Agriculture)
Show Figures

Figure 1

26 pages, 9389 KB  
Article
Unravelling the Characteristics of Microhabitat Alterations in Floodplain Inundated Areas Based on High-Resolution UAV Imagery and Remote Sensing: A Case Study in Jingjiang, Yangtze River
by Yichen Zheng, Dongshuo Lu, Zongrui Yang and Jianbo Chang
Drones 2025, 9(4), 315; https://doi.org/10.3390/drones9040315 - 18 Apr 2025
Viewed by 611
Abstract
The floodplain of a large river plays a crucial role in the river’s ecosystem and serves as an essential microhabitat for river fish to complete their life history events. Over the past four decades, the floodplain represented by the Jingjiang section in the [...] Read more.
The floodplain of a large river plays a crucial role in the river’s ecosystem and serves as an essential microhabitat for river fish to complete their life history events. Over the past four decades, the floodplain represented by the Jingjiang section in the middle reaches of the Yangtze River has experienced a significant reduction in area, complexity, and diversity of fish microhabitats. This study quantitatively analyzed the dynamic changes and geomorphological structure of the floodplain in the Jingjiang reach (JJR) of the Yangtze River using satellite remote sensing images and high-resolution unmanned aerial vehicle (UAV) optical images. We built an enhanced U-Net model incorporating both the CBAM and SE parallel attention mechanisms to classify these images and identify environmental structural units. The accuracy of the enhanced model was 16.39% higher compared to original U-Net model. At the same time, the improved normalized difference water index (mNDWI), enhanced vegetation index (EVI), and normalized difference vegetation index (NDVI) were utilized to extract the flood frequency of the floodplain and analyze the area changes of the floodplain in the JJR. The trend of the flood area in the JJR during the flood season was consistent with the overall trend of flood areas in the flood season, which generally exhibits a downward tendency. In 2022, the floodplain of the JJR underwent substantial anthropogenic disturbances, with 40% of its area comprising anthropogenic environmental units. Compared to historical periods, the impervious surface within the floodplain has increased annually, while ecological units such as riparian forests and trees have gradually diminished or even disappeared, leading to a simplification of structural complexity. These findings provide a critical background and robust data foundation for the protection and restoration of fish habitats and the formulation of strategies for fish population reconstruction in the Yangtze River. Full article
Show Figures

Figure 1

25 pages, 10770 KB  
Article
Lung Segmentation with Lightweight Convolutional Attention Residual U-Net
by Meftahul Jannat, Shaikh Afnan Birahim, Mohammad Asif Hasan, Tonmoy Roy, Lubna Sultana, Hasan Sarker, Samia Fairuz and Hanaa A. Abdallah
Diagnostics 2025, 15(7), 854; https://doi.org/10.3390/diagnostics15070854 - 27 Mar 2025
Cited by 1 | Viewed by 1846
Abstract
Background: Examining chest radiograph images (CXR) is an intricate and time-consuming process, sometimes requiring the identification of many anomalies at the same time. Lung segmentation is key to overcoming this challenge through different deep learning (DL) techniques. Many researchers are working to improve [...] Read more.
Background: Examining chest radiograph images (CXR) is an intricate and time-consuming process, sometimes requiring the identification of many anomalies at the same time. Lung segmentation is key to overcoming this challenge through different deep learning (DL) techniques. Many researchers are working to improve the performance and efficiency of lung segmentation models. This article presents a DL-based approach to accurately identify the lung mask region in CXR images to assist radiologists in recognizing early signs of high-risk lung diseases. Methods: This paper proposes a novel technique, Lightweight Residual U-Net, combining the strengths of the convolutional block attention module (CBAM), the Atrous Spatial Pyramid Pooling (ASPP) block, and the attention module, which consists of only 3.24 million trainable parameters. Furthermore, the proposed model has been trained using both the RELU and LeakyReLU activation functions, with LeakyReLU yielding superior performance. The study indicates that the Dice loss function is more effective in achieving better results. Results: The proposed model is evaluated on three benchmark datasets: JSRT, SZ, and MC, achieving a Dice score of 98.72%, 97.49%, and 99.08%, respectively, outperforming the state-of-the-art models. Conclusions: Using the capabilities of DL and cutting-edge attention processes, the proposed model improves current efforts to enhance lung segmentation for the early identification of many serious lung diseases. Full article
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)
Show Figures

Figure 1

17 pages, 4721 KB  
Article
Deep Learning Model for Precipitation Nowcasting Based on Residual and Attention Mechanisms
by Zhan Zhang, Qingping Song, Minzheng Duan, Hailei Liu, Juan Huo and Congzheng Han
Remote Sens. 2025, 17(7), 1123; https://doi.org/10.3390/rs17071123 - 21 Mar 2025
Cited by 3 | Viewed by 2056
Abstract
Nowcasting is a critical technology for disaster prevention and mitigation, and the accuracy of radar echo extrapolation directly impacts forecasting performance. In most deep learning-based models, accurately predicting heavy precipitation remains a challenging task. Focusing on the region of China, this study proposes [...] Read more.
Nowcasting is a critical technology for disaster prevention and mitigation, and the accuracy of radar echo extrapolation directly impacts forecasting performance. In most deep learning-based models, accurately predicting heavy precipitation remains a challenging task. Focusing on the region of China, this study proposes an improved model based on residual and attention mechanisms—RA-UNet—for precipitation nowcasting with a lead time of 3 h. The model introduces the residual neural network (ResNet) and the convolutional block attention module (CBAM) to integrate multi-scale features into the U-Net encoder–decoder architecture, enhancing its ability to capture the spatiotemporal evolution of precipitation systems. Meanwhile, depthwise separable convolutions are employed to replace conventional convolutions, significantly improving computational efficiency while preserving model performance. To evaluate the model’s performance, experiments were conducted using 6 min resolution radar echo data from China in 2024, with comparisons made against the optical flow (OF) method and the U-Net model. The experimental results show that RA-UNet demonstrates significant advantages in 3 h forecasting: its mean absolute error (MAE) is reduced by approximately 7%, the false alarm rate (FAR) decreases by about 20%, and it outperforms the comparison models in metrics such as the critical success index (CSI) and structural similarity index (SSIM). Notably, RA-UNet effectively mitigates intensity degradation in long-term forecasts, successfully predicting the trend of >40 dBZ strong echo cores in two typical cases and significantly improving the premature dissipation problem of precipitation fields. This study provides a new approach to refined forecasting of complex precipitation systems, and future work will combine multi-source data fusion with physical constraint mechanisms to further enhance precipitation event prediction capabilities. Full article
Show Figures

Figure 1

Back to TopTop