MDPI - Publisher of Open Access Journals

18 pages, 3641 KB

Open AccessArticle

A Wavelet-Enhanced Detector for Tiny Objects in Remote-Sensing Images

by Weifan Xu and Yong Hu

Remote Sens. 2026, 18(8), 1109; https://doi.org/10.3390/rs18081109 - 8 Apr 2026

Accurate and efficient detection is pivotal for tiny objects in remote sensing. However, achieving a favorable accuracy-efficiency trade-off remains challenging due to the few informative pixels of small targets, frequent occlusions, cluttered backgrounds, and detail degradation introduced by downsampling and multi-scale fusion. To [...] Read more.

Accurate and efficient detection is pivotal for tiny objects in remote sensing. However, achieving a favorable accuracy-efficiency trade-off remains challenging due to the few informative pixels of small targets, frequent occlusions, cluttered backgrounds, and detail degradation introduced by downsampling and multi-scale fusion. To address these challenges, we propose WEYOLO, a wavelet-enhanced detector that explicitly models frequency components and adaptively strengthens high-frequency cues to improve tiny-object robustness while maintaining competitive efficiency in inference speed and model size for remote-sensing deployment. To preserve edges and textures when spatial resolution is reduced, we design a Frequency-Aware Lifting Haar (FaLH) backbone that decomposes features into directional sub-bands and retains them during downsampling, preventing the loss of high-frequency information. Next, to address the blurring and detail loss caused by conventional pooling during multi-scale fusion, we introduce a Frequency-Domain Pyramid-Pooling (FDPP) module that performs wavelet-based multi-resolution analysis for frequency-aware feature-pyramid fusion. Additionally, we propose a stable size-aware quality focal regression loss that unifies Focaler-CIoU and size-aware DFL into a single objective, improving robustness and overall accuracy for small objects. Comprehensive experiments show that WEYOLO improves precision and recall over the baseline by 3.2%/4.2% on VisDrone and 2.6%/9.7% on TT100K; on AI-TOD, it achieves 47.5% mAP@0.5 and 21.3% mAP@0.5:0.95. Meanwhile, it reduces the parameter count by 60%, achieving a strong accuracy-efficiency balance for practical aerial sensing deployment. Full article

(This article belongs to the Section AI Remote Sensing)

► Show Figures

Figure 1

21 pages, 4058 KB

Open AccessArticle

Transient Voltage Stability Assessment Method Based on CWT-ResNet

by Chong Shao, Yongsheng Jin, Bolin Zhang, Xin He, Chen Zhou and Haiying Dong

Energies 2026, 19(7), 1804; https://doi.org/10.3390/en19071804 - 7 Apr 2026

Abstract

Accurate and rapid transient voltage stability assessment is crucial for the safe and stable operation of new energy bases in desert and grassland regions. Existing deep learning methods fail to adequately capture the high-dimensional dynamic coupling features of transient voltage signals in large-scale [...] Read more.

Accurate and rapid transient voltage stability assessment is crucial for the safe and stable operation of new energy bases in desert and grassland regions. Existing deep learning methods fail to adequately capture the high-dimensional dynamic coupling features of transient voltage signals in large-scale renewable energy bases with UHVDC transmission, and suffer from poor performance under class-imbalanced sample conditions. This paper proposes a transient voltage stability assessment method utilizing continuous wavelet transform (CWT) time–frequency images and a deep residual network (ResNet-50). CWT with the Morlet wavelet basis converts voltage time-series signals into multi-scale time–frequency images to simultaneously capture temporal and frequency-domain transient features. An improved focal loss (FL) function is introduced to dynamically adjust category weights based on actual sample distribution, enhancing model robustness under extreme class imbalance. The proposed method is validated on a modified IEEE 39-bus system incorporating the Qishao UHVDC line and wind/photovoltaic integration in Northwest China, using 1490 simulation samples under diverse fault scenarios. Results demonstrate that the proposed CWT-ResNet achieves 98.88% accuracy, 94.74% precision, 100% recall, and 97.29% F1-score, outperforming SVM, 1D-CNN, and 1D-ResNet baselines. Under 5 dB noise conditions, the method maintains over 90% accuracy, demonstrating strong noise robustness. Full article

(This article belongs to the Special Issue Challenges and Innovations in Stability and Control of Power Systems)

► Show Figures

Figure 1

24 pages, 67497 KB

Open AccessArticle

A Physics-Guided Dual-Stream Vibration Feature Fusion Network for Chatter-Induced Surface Mark Diagnosis in Wafer Thinning

by Heng Li, Hua Liu, Liang Zhu, Xiangyu Zhao, Lemiao Qiu and Shuyou Zhang

Machines 2026, 14(4), 404; https://doi.org/10.3390/machines14040404 - 7 Apr 2026

Abstract

Ultra-precision thinning of hard and brittle materials like monocrystalline silicon demands high dynamic stability in thinning spindle. To address the challenge of accurately detecting subtle spindle chatter anomalies in industrial environments characterized by high noise and limited data, this paper proposes a physics-guided [...] Read more.

Ultra-precision thinning of hard and brittle materials like monocrystalline silicon demands high dynamic stability in thinning spindle. To address the challenge of accurately detecting subtle spindle chatter anomalies in industrial environments characterized by high noise and limited data, this paper proposes a physics-guided dual-stream attention fusion transfer network (PG-AFNet). First, a physics-guided signal preprocessing method was developed. Using variational mode decomposition (VMD) and continuous wavelet transform (CWT) masking, one-dimensional dynamic features and high-frequency regions of interest (ROIs) rich in transient impact features were extracted. Second, the PG-AFNet architecture was designed. By introducing an attention mechanism, it achieves deep integration of one-dimensional purely dynamic sequences with two-dimensional spatiotemporal visual textures to capture surface damage features caused by subtle vibrations. Finally, systematic validations were conducted using a real silicon wafer thinning dataset with 197 real samples. By overcoming small-sample limitations via physical augmentation, PG-AFNet achieved an 82.45% (86.64% after data augmentation) diagnostic accuracy, significantly outperforming traditional baselines. Furthermore, a large-scale cross-load validation on the diverse CWRU dataset yielded an exceptional 99.68% accuracy under mixed-load conditions, conclusively verifying the model’s robust domain generalization. Lastly, a rigorous ablation study explicitly quantified the indispensable contributions of the physics-guided dual-stream architecture and attention fusion. This research provides a feasible theoretical foundation for intelligent surface quality monitoring in semiconductor hard-brittle material processing. Full article

(This article belongs to the Special Issue Monitoring and Control of Machining Processes)

► Show Figures

Figure 1

18 pages, 5384 KB

Open AccessArticle

Experimental Investigation on Pressure Pulsation Characteristics Induced by Vortex Rope Evolution in a Centrifugal Pump Under Runaway Condition

by Jing Dai, Wenjie Wang, Chunbing Shao, Yang Cao, Fan Meng and Qixiang Hu

Processes 2026, 14(7), 1175; https://doi.org/10.3390/pr14071175 - 5 Apr 2026

Viewed by 196

Abstract

To investigate the characteristics of pressure pulsation induced by vortex ropes in the draft tube of a centrifugal pump under runaway conditions, a closed double-layer hydraulic test bench was established in this study. Runaway characteristic experiments were conducted, and pressure pulsation signals were [...] Read more.

To investigate the characteristics of pressure pulsation induced by vortex ropes in the draft tube of a centrifugal pump under runaway conditions, a closed double-layer hydraulic test bench was established in this study. Runaway characteristic experiments were conducted, and pressure pulsation signals were acquired at heads of 7.6 m, 9.6 m, and 11.9 m. The measured pressure data were analyzed in the time–frequency domain using Fast Fourier Transform (FFT) and Wavelet Transform (WT). The results show that both the runaway rotational speed and the reverse flow rate increase with increasing head. Under all three heads, the dominant frequency upstream of the elbow section of the draft tube is 0.53 times the rotational frequency, confirming that the vortex rope in the draft tube serves as the primary excitation source of the flow field. As the vortex rope is conveyed by the main flow through the elbow, it undergoes impingement and fragmentation, causing the dominant frequency downstream of the elbow to decrease to 0.1 times the rotational frequency. The dominant frequency induced by the vortex rope remains continuous over time, whereas the frequency arising from the coupling between the vortex rope and rotor–stator interaction exhibits pronounced time-varying oscillations. These oscillations intensify with increasing head, and their frequency oscillation range broadens from 4 to 6 times the rotational frequency at low head to 2–8 times at high head. These findings provide a theoretical foundation for the preventive and protective design of centrifugal pumps under runaway conditions. Full article

(This article belongs to the Section Process Control and Monitoring)

► Show Figures

Figure 1

24 pages, 2841 KB

Open AccessArticle

Enhancing Data Quality with a Novel Neural Parameter Diffusion Approach

by Jun Yang, Kehan Hu, Zijing Yu and Zhiyang Zhang

Data 2026, 11(4), 72; https://doi.org/10.3390/data11040072 - 2 Apr 2026

Viewed by 224

Abstract

This study presents a novel neural parameter diffusion approach (FWA-PDiff) designed to enhance data quality. To address the limitations of conventional diffusion models—such as inefficient sampling and insufficient feature sensitivity, which may compromise output fidelity—this study introduces four key innovations. First, the proposed [...] Read more.

This study presents a novel neural parameter diffusion approach (FWA-PDiff) designed to enhance data quality. To address the limitations of conventional diffusion models—such as inefficient sampling and insufficient feature sensitivity, which may compromise output fidelity—this study introduces four key innovations. First, the proposed model introduces an adaptive recalibration of the sampling frequency in the Fourier domain to optimize feature extraction for image data. Second, a dual-channel autoencoder architecture is employed, featuring a multi-scale, fine-grained encoder (MFE) that enables the simultaneous capture of features at multiple resolutions. Third, a wavelet-attention mechanism (WA) is incorporated into the decoder to highlight subtle high-frequency details. Fourth, the proposed model introduces a hybrid loss function that combines Mean Squared Error (MSE) and Kullback–Leibler (KL) divergence to improve data reconstruction. Collectively, these improvements enable the generation of high-fidelity parameters, thereby contributing to enhanced data quality. Extensive experiments conducted on benchmark datasets—including MNIST, CIFAR-10, CIFAR-100, and STL-10—demonstrate the effectiveness of the proposed approach, which consistently achieves superior performance in improving data quality. Full article

(This article belongs to the Topic Data Stream Mining and Processing)

► Show Figures

Figure 1

42 pages, 10149 KB

Open AccessArticle

Radon-Guided Wavelet-Domain Attention U-Net for Periodic Artifact Suppression in Brain MRI

by Jesus David Rios-Perez, German Sanchez-Torres, John W. Branch-Bedoya and Camilo Andres Laiton-Bonadiez

J. Imaging 2026, 12(4), 153; https://doi.org/10.3390/jimaging12040153 - 2 Apr 2026

Viewed by 349

Abstract

Periodic artifacts such as ringing (Gibbs), herringbone (spike/corduroy), and zipper patterns degrade the quality of brain MRI. We present a reproducible framework that (i) synthetically generates periodic artifacts with controllable severity directly in k-space, (ii) normalizes pattern orientation through a Radon-guided alignment step, [...] Read more.

Periodic artifacts such as ringing (Gibbs), herringbone (spike/corduroy), and zipper patterns degrade the quality of brain MRI. We present a reproducible framework that (i) synthetically generates periodic artifacts with controllable severity directly in k-space, (ii) normalizes pattern orientation through a Radon-guided alignment step, and (iii) corrects them in the wavelet domain using a 2D DWT (AA/AD/DA/DD) with a band-weighted loss. The evaluation was conducted using DLBS T1-weighted 3T MRI volumes with synthetically generated periodic artifacts. It combined global image-quality metrics (SSIM, PSNR) with per-band metrics to quantify how correction concentrates on high-frequency components, and included ablation studies, mixed-artifact stress tests, and structural preservation analyses. Compared with several baseline architectures, the proposed approach shows improvements in structural fidelity and a reduction in periodic patterns (SSIM:

0.985 \pm 0.022;

PSNR:

43.337 \pm 5.364

; reduction in concentrated error in high-frequency bands), while preserving unaffected structures. These findings indicate that, within a controlled synthetic benchmark, aligning the pattern orientation prior to learning and optimizing correction in the wavelet domain enables suppression of synthetically generated periodic artifacts while limiting over-smoothing. Full article

(This article belongs to the Section Medical Imaging)

► Show Figures

Figure 1

27 pages, 4041 KB

Open AccessFeature PaperArticle

Fault Diagnosis Method for Mechanical Components Fusing RPM, DAM-IResNet, and Transfer Learning

by Xingwei Ge, Ziyang Chen, Yachao Cao, Zhe Wu and Qi Li

Sensors 2026, 26(7), 2162; https://doi.org/10.3390/s26072162 - 31 Mar 2026

Viewed by 345

Abstract

This paper proposes a novel fault diagnosis method that integrates a Relative Position Matrix (RPM), a Downsampling Attention Module (DAM), an Improved Residual Network (IResNet), and transfer learning to address the challenges of scarce fault data and poor generalization under variable working conditions. [...] Read more.

This paper proposes a novel fault diagnosis method that integrates a Relative Position Matrix (RPM), a Downsampling Attention Module (DAM), an Improved Residual Network (IResNet), and transfer learning to address the challenges of scarce fault data and poor generalization under variable working conditions. The RPM converts 1D vibration signals into 2D images to enhance feature representation. The DAM achieves lossless feature compression and selection via Haar wavelet downsampling and convolutional attention. An IResNet then performs deep feature learning and classification. A transfer learning strategy further enables effective knowledge adaptation from data-rich source domains to data-scarce target domains, significantly improving performance in cross-condition and small-sample scenarios. Experiments on multiple bearing and gear datasets demonstrate that the proposed method achieves over 99.5% accuracy, with 100% in key transfer tasks, outperforming existing state-of-the-art approaches. The main contributions of this work include the unified RPM-DAM-IResNet framework, a targeted small-sample transfer strategy, and comprehensive validation of its superior accuracy and robustness. Full article

(This article belongs to the Special Issue Deep Learning Based Intelligent Fault Diagnosis)

► Show Figures

Figure 1

23 pages, 6865 KB

Open AccessArticle

Integrating Hyperspectral Data and Deep Learning for Non-Destructive Prediction of Tea Quality Parameters Across Different Physical States of Tea Leaves and Growth Periods

by Guanzi Zhou, Haotian Ji, Rongyu Pan, Xiaowei Yang, Suhui Zhao, Lei Yang, Xiaohan Shang, Huijie Zhang, Hanchi Zhang, Xiaojun Liu, Yuanchun Ma, Xujun Zhu, Jie Jiang and Wanping Fang

Plants 2026, 15(7), 1071; https://doi.org/10.3390/plants15071071 - 31 Mar 2026

Viewed by 315

Abstract

Achieving rapid and non-destructive assessment of tea quality is essential for intelligent tea production and quality control. In this study, an integrated hyperspectral and deep learning framework was developed to estimate tea quality constituents across seasons and physical states. Samples included field fresh [...] Read more.

Achieving rapid and non-destructive assessment of tea quality is essential for intelligent tea production and quality control. In this study, an integrated hyperspectral and deep learning framework was developed to estimate tea quality constituents across seasons and physical states. Samples included field fresh leaves, dried tea leaves, and tea powder, were collected in spring, summer, and autumn. Tea polyphenols and catechins were predicted using original reflectance, harmonic features, and wavelet features fused into multi-domain indices. Extreme gradient boosting, Gaussian process regression, and convolutional neural networks (CNN) were systematically compared to construct the quality estimation models. The result showed that three-feature indices consistently outperformed two-feature indices, yielding R² from 0.48 to 0.71. CNN achieved the best overall performance among the three modeling approaches, with its optimal accuracy obtained for tea powder samples in autumn, yielding R² values of 0.81 and 0.76 for tea polyphenols and catechins, respectively. This framework provides an accurate, non-destructive tool for tea quality evaluation and traceability, offering technical support for intelligent agriculture and quality control across the tea industry chain. Full article

(This article belongs to the Special Issue Machine Learning for Plant Phenotyping in Crops)

► Show Figures

Figure 1

14 pages, 2547 KB

Open AccessArticle

A Real Maritime Infrared Image Denoising Network Based on Joint Spatial and Wavelet Domains

by He Xu, Lili Dong, Mengge Wang, Yingjie Ji and Fang Tang

J. Mar. Sci. Eng. 2026, 14(7), 644; https://doi.org/10.3390/jmse14070644 - 31 Mar 2026

Viewed by 147

Abstract

High-quality maritime infrared images are crucial for accurate object detection, classification, and segmentation in maritime environments. However, maritime infrared images are often degraded by various types of noise, including non-uniform noise and detector non-uniformity-induced fixed-pattern noise (e.g., vertical stripe noise), which pose significant [...] Read more.

High-quality maritime infrared images are crucial for accurate object detection, classification, and segmentation in maritime environments. However, maritime infrared images are often degraded by various types of noise, including non-uniform noise and detector non-uniformity-induced fixed-pattern noise (e.g., vertical stripe noise), which pose significant challenges for the aforementioned high-level vision tasks. A novel network, termed SWDNet (Spatial–Wavelet Joint Denoising Network), is proposed to jointly model spatial- and wavelet-domain features, enabling the effective enhancement of maritime infrared image quality while preserving fine image details. Two parallel sub-networks with distinct architectures are employed to extract complementary information for maritime infrared image denoising. In the upper branch, hierarchical spatial attention aggregation (HSAA) modules are employed at multiple scales to extract spatial features and adaptively assign importance weights to different spatial locations. The lower branch employs a Haar-based DWT for sub-band decomposition, a pixel-grouped self-attention module for boundary refinement, and parallel multi-scale horizontal convolutions to suppress vertical stripe noise in the HL sub-band. Finally, the directional edge enhancement (DEE) module employs learnable Sobel operators in conjunction with multi-layer convolutions to effectively extract and enhance directional edge features. Experimental results demonstrate that, compared with state-of-the-art methods, the proposed SWDNet achieves superior denoising performance on both synthetic and real maritime infrared datasets. Full article

(This article belongs to the Section Ocean Engineering)

► Show Figures

Figure 1

24 pages, 1020 KB

Open AccessArticle

Research on the Diagnosis of Abnormal Sound Defects in Automobile Engines Based on Fusion of Multi-Modal Images and Audio

by Yi Xu, Wenbo Chen and Xuedong Jing

Electronics 2026, 15(7), 1406; https://doi.org/10.3390/electronics15071406 - 27 Mar 2026

Viewed by 291

Abstract

Against the global carbon neutrality target, predictive maintenance (PdM) of automotive engines represents a core technical strategy to advance the sustainable development of the automotive industry. Conventional single-modal diagnostic approaches for engine abnormal sound defects suffer from low accuracy and weak anti-interference capability. [...] Read more.

Against the global carbon neutrality target, predictive maintenance (PdM) of automotive engines represents a core technical strategy to advance the sustainable development of the automotive industry. Conventional single-modal diagnostic approaches for engine abnormal sound defects suffer from low accuracy and weak anti-interference capability. Existing multi-modal fusion methods fail to deeply mine the physical coupling between cross-modal features and often entail excessive model complexity, hindering deployment on resource-constrained on-board edge devices. To resolve these limitations, this study proposes a Physical Prior-Embedded Cross-Modal Attention (PPE-CMA) mechanism for lightweight multi-modal fusion diagnosis of engine abnormal sound defects. First, wavelet packet decomposition (WPD) and mel-frequency cepstral coefficients (MFCC) are integrated to extract time-frequency features from engine audio signals, while a channel-pruned ResNet18 is employed to extract spatial features from engine thermal imaging and vibration visualization images. Second, the PPE-CMA module is designed to adaptively assign attention weights to audio and image features by exploiting the physical coupling between engine fault acoustic and visual characteristics, enabling efficient cross-modal feature fusion with redundant information suppression. A rigorous theoretical derivation is provided to link cosine similarity with the physical correlation of engine fault acoustic-visual features, justifying the attention weight constraint (β = 1 − α) from the perspective of fault feature physical coupling. Third, an improved lightweight XGBoost classifier is constructed for fault classification, and a hybrid data augmentation strategy customized for engine multi-modal data is proposed to address the small-sample challenge in industrial applications. Ablation experiments on ResNet18 pruning ratios verify the optimal trade-off between diagnostic performance and computational efficiency, while feature distribution analysis validates the authenticity and effectiveness of the hybrid augmentation strategy. Experimental results on a self-constructed multi-modal dataset show that the proposed method achieves 98.7% diagnostic accuracy and a 98.2% F1-score, retaining 96.5% accuracy under 90 dB high-level environmental noise, with an end-to-end inference speed of 0.8 ms per sample (including preprocessing, feature extraction, and classification). Cross-engine and cross-domain validation on a 2.0T diesel engine small-sample dataset and the open-source SEMFault-2024 dataset yield average accuracies of 94.8% and 95.2%, respectively, demonstrating strong generalization. This method effectively enhances the accuracy and robustness of engine abnormal sound defect diagnosis, offering a lightweight technical solution for on-board real-time fault diagnosis and in-plant online quality inspection. By reducing engine fault-induced energy loss and spare parts waste, it further promotes energy conservation and emission reduction in the automotive industry. Quantified experimental data on fuel efficiency improvement and carbon emission reduction are provided to substantiate the ecological benefits of the proposed framework. Full article

► Show Figures

Figure 1

27 pages, 3220 KB

Open AccessArticle

A Novel Load-Dependent Multimodal Vibration Signal Enhancement and Fusion Framework (LD-MVSEFF) for Load-Specific Condition Monitoring

by Shahd Ziad Hejazi and Michael Packianather

Machines 2026, 14(4), 372; https://doi.org/10.3390/machines14040372 - 27 Mar 2026

Viewed by 348

Abstract

This paper presents a Load-Dependent Multimodal Vibration Signal Enhancement and Fusion Framework (LD-MVSEFF) for load-specific condition monitoring, building on the Customised Load Adaptive Framework (CLAF). The proposed approach enhances the classification of CLAF load-dependent subclasses, namely, Healthy, Mild, Moderate, and Severe, by integrating [...] Read more.

This paper presents a Load-Dependent Multimodal Vibration Signal Enhancement and Fusion Framework (LD-MVSEFF) for load-specific condition monitoring, building on the Customised Load Adaptive Framework (CLAF). The proposed approach enhances the classification of CLAF load-dependent subclasses, namely, Healthy, Mild, Moderate, and Severe, by integrating complementary information from raw vibration signals and encoded signal representations. Three input channels are employed, combining time–frequency domain features with Continuous Wavelet Transform (CWT) and Gramian Angular Difference Field (GADF) image encodings, with each channel independently trained and evaluated to identify its most effective classifiers. To address the reduced separability of the Mild and Moderate fault subclasses under varying load conditions, a weighted decision-fusion strategy is introduced, assigning classifier contributions according to their class-specific strengths. Experimental evaluation over five runs demonstrates high and stable performance, with the best configuration achieving an overall accuracy of 99.04% ± 0.22% and an average training time of 18 min and 30 s. The results confirm the effectiveness of LD-MVSEFF as a robust multimodal methodology for load-specific condition monitoring. Full article

(This article belongs to the Special Issue Towards Electric Motors and Drives: Condition Monitoring, Performance Prediction and Fault Diagnosis, 2nd Edition)

► Show Figures

Figure 1

27 pages, 9112 KB

Open AccessArticle

MSWKN: Multi-Scale Wavelet Kolmogorov–Arnold Network with Spectral–Spatial and Frequency Domain Optimization for Hyperspectral Crop Classification

by Ziwei Li, Bingjie Liang, Weizhen Zhang, Zhenqiang Xu, Baowei Zhang, Ning Li, Weiran Luo and Jianzhong Guo

Agriculture 2026, 16(7), 740; https://doi.org/10.3390/agriculture16070740 - 27 Mar 2026

Viewed by 297

Abstract

Accurate crop classification provides fundamental data for agricultural resource management and ecological research. Hyperspectral image (HSI) classification is the core technique for achieving precise crop mapping. However, existing models often suffer from excessive parameters, limited robustness under few-shot conditions, and a trade-off between [...] Read more.

Accurate crop classification provides fundamental data for agricultural resource management and ecological research. Hyperspectral image (HSI) classification is the core technique for achieving precise crop mapping. However, existing models often suffer from excessive parameters, limited robustness under few-shot conditions, and a trade-off between efficiency and robustness. To address these issues, this paper proposes a Multi-Scale Wavelet Kolmogorov–Arnold Network (MSWKN). The model employs a Two-Branch Feature Extractor (TBFE) to capture both spectral correlations and spatial textures. a Channel Cross-Spatial (CCS) module to suppress background clutter and highlight discriminative regions. A group convolution-based Fixed Wavelet Multi-Scale Convolutional Layer (FW-MSCL) that leverages the time–frequency localization of wavelets and learnable linear combinations to enhance robustness against spectral distortion while reducing parameters. And a Fourier-based Transformer encoder to enable global frequency–space modeling. Experiments on the WHU-Hi-HanChuan and WHU-Hi-HongHu hyperspectral crop datasets show that MSWKN achieves high overall accuracy and performs favorably on few-shot categories. Under lower parameter counts and fast inference conditions, the model demonstrates a reasonable trade-off between accuracy and computational efficiency. Ablation studies and wavelet kernel comparisons further confirm the contribution of each module and the advantage of the wavelet. The proposed framework provides an efficient and robust solution for fine-grained hyperspectral crop classification. Full article

(This article belongs to the Section Artificial Intelligence and Digital Agriculture)

► Show Figures

Figure 1

27 pages, 10703 KB

Open AccessArticle

WE-KAN: SAR Image Rotated Object Detection Method Based on Wavelet Domain Feature Enhancement and KAN Prediction Head

by Mingchun Li, Yang Liu, Qiang Wang and Dali Chen

Sensors 2026, 26(7), 2011; https://doi.org/10.3390/s26072011 - 24 Mar 2026

Viewed by 202

Abstract

Synthetic aperture radar (SAR) imagery plays a vital role in critical applications such as military reconnaissance and disaster monitoring. These applications require high detection accuracy. Therefore, rotated object detection has gained increasing attention. By predicting an object orientation angle, it offers advantages over [...] Read more.

Synthetic aperture radar (SAR) imagery plays a vital role in critical applications such as military reconnaissance and disaster monitoring. These applications require high detection accuracy. Therefore, rotated object detection has gained increasing attention. By predicting an object orientation angle, it offers advantages over horizontal bounding boxes, especially for elongated structures such as ships and bridges in SAR scenes. However, challenges such as speckle noise and complex backgrounds in SAR imagery still hinder high-precision detection. To address this, we propose WE-KAN, a novel rotated object detection framework based on wavelet features and Kolmogorov–Arnold network (KAN) prediction. First, we enhance the backbone by incorporating wavelet domain features from SAR grayscale images. The extracted wavelet domain features and image features are fused by a proposed attention module. Second, considering the sensitivity to angle prediction, we design a angle predictor based on KAN. This architecture provides a powerful and dedicated solution for accurate angle regression. Finally, for precise rotated bounding box regression, we employ a joint loss function combining a rotated intersection over union (RIoU) with a Gaussian distance loss function. These designs improve the model’s robustness to noise and its perception of fine object structures. When evaluated on the large-scale public RSAR dataset, our method achieves an AP50 of 70.1 and a mAP of 35.9 under the same training schedule and backbone network, significantly outperforming existing baselines. This demonstrates the effectiveness and robustness of our method for dense, small, and highly oriented objects in complex SAR scenes. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

18 pages, 6071 KB

Open AccessArticle

DFENet: A Novel Dual-Path Feature Extraction Network for Semantic Segmentation of Remote Sensing Images

by Li Cao, Zishang Liu, Yan Wang and Run Gao

J. Imaging 2026, 12(3), 141; https://doi.org/10.3390/jimaging12030141 - 23 Mar 2026

Viewed by 303

Abstract

Semantic segmentation of remote sensing images (RSIs) is a fundamental task in geoscience research. However, designing efficient feature fusion modules remains challenging for existing dual-branch or multi-branch architectures. Furthermore, existing deep learning-based architectures predominantly concentrate on spatial feature modeling and context capturing while [...] Read more.

Semantic segmentation of remote sensing images (RSIs) is a fundamental task in geoscience research. However, designing efficient feature fusion modules remains challenging for existing dual-branch or multi-branch architectures. Furthermore, existing deep learning-based architectures predominantly concentrate on spatial feature modeling and context capturing while inherently neglecting the exploration and utilization of critical frequency-domain features, which is crucial for addressing issues of semantic confusion and blurred boundaries in complex remote sensing scenes. To address the challenges of feature fusion and the lack of frequency-domain information, we propose a novel dual-path feature extraction network (DFENet) in this paper. Specifically, a dual-path module (DPM) is developed in DFENet to extract global and local features, respectively. In the global path, after applying the channel splitting strategy, four feature extraction strategies are innovatively integrated to extract global features from different granularities. According to the strategy of supplementing frequency-domain information, a frequency-domain feature extraction block (FFEB) dominated by discrete Wavelet transform (DWT) is designed to effectively captures both high- and low-frequency components. Experimental results show that our method outperforms existing state-of-the-art methods in terms of segmentation performance, achieving a mean intersection over union (mIoU) of 83.09% on the ISPRS Vaihingen dataset and 86.05% on the ISPRS Potsdam dataset. Full article

(This article belongs to the Section Image and Video Processing)

► Show Figures

Figure 1

17 pages, 3958 KB

Open AccessArticle

Mscon-C: A Model to Integrate Multiscale Convolution and CBAM Attention Mechanism for Bearing Fault Diagnosis

by Molin Wang, Cheng Cheng and Mengning Chu

Electronics 2026, 15(6), 1335; https://doi.org/10.3390/electronics15061335 - 23 Mar 2026

Viewed by 184

Abstract

Nowadays, Convolutional Neural Networks (CNNs) are the mainstream model in the bearing fault diagnosis area. Aiming at the problems of “incomplete capture of multiscale fault features and insufficient use of key time and frequency domain information”, a bearing fault diagnosis model combining multiscale [...] Read more.

Nowadays, Convolutional Neural Networks (CNNs) are the mainstream model in the bearing fault diagnosis area. Aiming at the problems of “incomplete capture of multiscale fault features and insufficient use of key time and frequency domain information”, a bearing fault diagnosis model combining multiscale convolution and Convolutional Block Attention Module (CBAM) is proposed, referred to as Mscon-C (standing for Multiscale convolution and CBAM) in this paper. Firstly, the time-frequency domain joint features of bearing faults are obtained by continuous wavelet transform. Then, the fine-grained and global fault features are synchronously extracted by multiscale parallel convolution. Finally, the weight of key features is enhanced by CBAM attention. The performance of Mscon-C was verified on two datasets, and its accuracy reached more than 95% on 12 tasks in two datasets, which was significantly improved compared with traditional CNN and Squeeze-and-Excitation-CNN (SE-CNN), and verified the effectiveness of the combination of multiscale parallel convolution and CBAM. Full article

(This article belongs to the Special Issue Advanced Online Monitoring and Fault Diagnosis of Power Equipment)

► Show Figures

Figure 1

Search Results (911)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (911)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI