Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (743)

Search Parameters:
Keywords = residual network ResNet18

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
31 pages, 3949 KB  
Article
A Railway Mobile Terminal Malware Detection Method Based on SE-ResNet
by Honglei Yao, Yijie Yang, Ning Dong and Wenjia Niu
Appl. Sci. 2025, 15(19), 10760; https://doi.org/10.3390/app151910760 - 6 Oct 2025
Viewed by 99
Abstract
This paper proposes a residual network model integrated with an attention mechanism module for the detection and classification of malware on railway mobile terminals. To address the issues of insufficient and imbalanced samples, Wasserstein Generative Adversarial Networks (WGANs) are utilized to synthesize grayscale [...] Read more.
This paper proposes a residual network model integrated with an attention mechanism module for the detection and classification of malware on railway mobile terminals. To address the issues of insufficient and imbalanced samples, Wasserstein Generative Adversarial Networks (WGANs) are utilized to synthesize grayscale image data of malware with high similarity to real samples. The performance of the model is evaluated on the publicly available CIC-InvesAndMal2019 dataset and an extended balanced dataset. Experimental results demonstrate that the synergistic integration of residual networks, WGANs, and attention mechanisms significantly enhances the performance of the malware detection model. In the context of railway applications, the proposed approach also achieves favorable classification performance when applied to image datasets derived from malware samples of railway mobile terminals. Multiple ablation studies are conducted to individually validate the contributions of each technical component in improving the classification model’s efficacy. The adoption of the SE-ResNet architecture combined with WGAN-based data augmentation presents a practical and efficient technical solution. Full article
14 pages, 3118 KB  
Article
Reconstruction Modeling and Validation of Brown Croaker (Miichthys miiuy) Vocalizations Using Wavelet-Based Inversion and Deep Learning
by Sunhyo Kim, Jongwook Choi, Bum-Kyu Kim, Hansoo Kim, Donhyug Kang, Jee Woong Choi, Young Geul Yoon and Sungho Cho
Sensors 2025, 25(19), 6178; https://doi.org/10.3390/s25196178 - 6 Oct 2025
Viewed by 163
Abstract
Fish species’ biological vocalizations serve as essential acoustic signatures for passive acoustic monitoring (PAM) and ecological assessments. However, limited availability of high-quality acoustic recordings, particularly for region-specific species like the brown croaker (Miichthys miiuy), hampers data-driven bioacoustic methodology development. In this [...] Read more.
Fish species’ biological vocalizations serve as essential acoustic signatures for passive acoustic monitoring (PAM) and ecological assessments. However, limited availability of high-quality acoustic recordings, particularly for region-specific species like the brown croaker (Miichthys miiuy), hampers data-driven bioacoustic methodology development. In this study, we present a framework for reconstructing brown croaker vocalizations by integrating fk14 wavelet synthesis, PSO-based parameter optimization (with an objective combining correlation and normalized MSE), and deep learning-based validation. Sensitivity analysis using a normalized Bartlett processor identified delay and scale (length) as the most critical parameters, defining valid ranges that maintained waveform similarity above 98%. The reconstructed signals matched measured calls in both time and frequency domains, replicating single-pulse morphology, inter-pulse interval (IPI) distributions, and energy spectral density. Validation with a ResNet-18-based Siamese network produced near-unity cosine similarity (~0.9996) between measured and reconstructed signals. Statistical analyses (95% confidence intervals; residual errors) confirmed faithful preservation of SPL values and minor, biologically plausible IPI variations. Under noisy conditions, similarity decreased as SNR dropped, indicating that environmental noise affects reconstruction fidelity. These results demonstrate that the proposed framework can reliably generate acoustically realistic and morphologically consistent fish vocalizations, even under data-limited scenarios. The methodology holds promise for dataset augmentation, PAM applications, and species-specific call simulation. Future work will extend this framework by using reconstructed signals to train generative models (e.g., GANs, WaveNet), enabling scalable synthesis and supporting real-time adaptive modeling in field monitoring. Full article
Show Figures

Figure 1

25 pages, 2295 KB  
Article
Vehicle Wind Noise Prediction Using Auto-Encoder-Based Point Cloud Compression and GWO-ResNet
by Yan Ma, Jifeng Wang, Zuofeng Pan, Hongwei Yi, Shixu Jia and Haibo Huang
Machines 2025, 13(10), 920; https://doi.org/10.3390/machines13100920 - 5 Oct 2025
Viewed by 147
Abstract
In response to the inability to quickly assess wind noise performance during the early stages of automotive styling design, this paper proposes a method for predicting interior wind noise by integrating automotive point cloud models with the Gray Wolf Optimization Residual Network model [...] Read more.
In response to the inability to quickly assess wind noise performance during the early stages of automotive styling design, this paper proposes a method for predicting interior wind noise by integrating automotive point cloud models with the Gray Wolf Optimization Residual Network model (GWO-ResNet). Based on wind tunnel test data under typical operating conditions, the point cloud model of the test vehicle is compressed using an auto-encoder and used as input features to construct a nonlinear mapping model between the whole vehicle point cloud and the wind noise level at the driver’s left ear. Through adaptive optimization of key hyperparameters of the ResNet model using the gray wolf optimization algorithm, the accuracy and generalization of the prediction model are improved. The prediction results on the test set indicate that the proposed GWO-ResNet model achieves prediction results that are consistent with the actual measured values for the test samples, thereby validating the effectiveness of the proposed method. A comparative analysis with traditional ResNet models, GWO-LSTM models, and LSTM models revealed that the GWO-ResNet model achieved Mean Absolute Percentage Error (MAPE) and mean squared error (MSE) of 9.72% and 20.96, and 9.88% and 19.69, respectively, on the sedan and SUV test sets, significantly outperforming the other comparison models. The prediction results on the independent validation set also demonstrate good generalization ability and stability (MAPE of 10.14% and 10.15%, MSE of 23.97 and 29.15), further proving the reliability of this model in practical applications. The research results provide an efficient and feasible technical approach for the rapid evaluation of wind noise performance in vehicles and provide a reference for wind noise control in the early design stage of vehicles. At the same time, due to the limitations of the current test data, it is impossible to predict the wind noise during the actual driving of the vehicle. Subsequently, the wind noise during actual driving can be predicted by the test data of multiple working conditions. Full article
(This article belongs to the Section Vehicle Engineering)
Show Figures

Figure 1

25 pages, 8881 KB  
Article
Evaluating Machine Learning Techniques for Brain Tumor Detection with Emphasis on Few-Shot Learning Using MAML
by Soham Sanjay Vaidya, Raja Hashim Ali, Shan Faiz, Iftikhar Ahmed and Talha Ali Khan
Algorithms 2025, 18(10), 624; https://doi.org/10.3390/a18100624 - 2 Oct 2025
Viewed by 227
Abstract
Accurate brain tumor classification from MRI is often constrained by limited labeled data. We systematically compare conventional machine learning, deep learning, and few-shot learning (FSL) for four classes (glioma, meningioma, pituitary, no tumor) using a standardized pipeline. Models are trained on the Kaggle [...] Read more.
Accurate brain tumor classification from MRI is often constrained by limited labeled data. We systematically compare conventional machine learning, deep learning, and few-shot learning (FSL) for four classes (glioma, meningioma, pituitary, no tumor) using a standardized pipeline. Models are trained on the Kaggle Brain Tumor MRI Dataset and evaluated across dataset regimes (100%→10%). We further test generalization on BraTS and quantify robustness to resolution changes, acquisition noise, and modality shift (T1→FLAIR). To support clinical trust, we add visual explanations (Grad-CAM/saliency) and report per-class results (confusion matrices). A fairness-aligned protocol (shared splits, optimizer, early stopping) and a complexity analysis (parameters/FLOPs) enable balanced comparison. With full data, Convolutional Neural Networks (CNNs)/Residual Networks (ResNets) perform strongly but degrade with 10% data; Model-Agnostic Meta-Learning (MAML) retains competitive performance (AUC-ROC ≥ 0.9595 at 10%). Under cross-dataset validation (BraTS), FSL—particularly MAML—shows smaller performance drops than CNN/ResNet. Variability tests reveal FSL’s relative robustness to down-resolution and noise, although modality shift remains challenging for all models. Interpretability maps confirm correct activations on tumor regions in true positives and explain systematic errors (e.g., “no tumor”→pituitary). Conclusion: FSL provides accurate, data-efficient, and comparatively robust tumor classification under distribution shift. The added per-class analysis, interpretability, and complexity metrics strengthen clinical relevance and transparency. Full article
(This article belongs to the Special Issue Machine Learning Models and Algorithms for Image Processing)
Show Figures

Figure 1

19 pages, 13644 KB  
Article
Rock Surface Crack Recognition Based on Improved Mask R-CNN with CBAM and BiFPN
by Yu Hu, Naifu Deng, Fan Ye, Qinglong Zhang and Yuchen Yan
Buildings 2025, 15(19), 3516; https://doi.org/10.3390/buildings15193516 - 29 Sep 2025
Viewed by 267
Abstract
To address the challenges of multi-scale distribution, low contrast and background interference in rock crack identification, this paper proposes an improved Mask R-CNN model (CBAM-BiFPN-Mask R-CNN) that integrates the convolutional block attention mechanism (CBAM) module and the bidirectional feature pyramid network (BiFPN) module. [...] Read more.
To address the challenges of multi-scale distribution, low contrast and background interference in rock crack identification, this paper proposes an improved Mask R-CNN model (CBAM-BiFPN-Mask R-CNN) that integrates the convolutional block attention mechanism (CBAM) module and the bidirectional feature pyramid network (BiFPN) module. A dataset of 1028 rock surface crack images was constructed. The robustness of the model was improved by dynamically combining Gaussian blurring, noise overlay, and color adjustment to enhance data augmentation strategies. The model embeds the CBAM module after the residual block of the ResNet50 backbone network, strengthens the crack-related feature response through channel attention, and uses spatial attention to focus on the spatial distribution of cracks; at the same time, it replaces the traditional FPN with BiFPN, realizes the adaptive fusion of cross-scale features through learnable weights, and optimizes multi-scale crack feature extraction. Experimental results show that the improved model significantly improves the crack recognition effect in complex rock mass scenarios. The mAP index, precision and recall rate are improved by 8.36%, 9.1% and 12.7%, respectively, compared with the baseline model. This research provides an effective solution for rock crack detection in complex geological environments, especially the missed detection of small cracks and complex backgrounds. Full article
(This article belongs to the Special Issue Recent Scientific Developments in Structural Damage Identification)
Show Figures

Figure 1

19 pages, 1027 KB  
Article
A Convolutional-Transformer Residual Network for Channel Estimation in Intelligent Reflective Surface Aided MIMO Systems
by Qingying Wu, Junqi Bao, Hui Xu, Benjamin K. Ng, Chan-Tong Lam and Sio-Kei Im
Sensors 2025, 25(19), 5959; https://doi.org/10.3390/s25195959 - 25 Sep 2025
Viewed by 390
Abstract
Intelligent Reflective Surface (IRS)-aided Multiple-Input Multiple-Output (MIMO) systems have emerged as a promising solution to enhance spectral and energy efficiency in future wireless communications. However, accurate channel estimation remains a key challenge due to the passive nature and high dimensionality of IRS channels. [...] Read more.
Intelligent Reflective Surface (IRS)-aided Multiple-Input Multiple-Output (MIMO) systems have emerged as a promising solution to enhance spectral and energy efficiency in future wireless communications. However, accurate channel estimation remains a key challenge due to the passive nature and high dimensionality of IRS channels. This paper proposes a lightweight hybrid framework for cascaded channel estimation by combining a physics-based Bilinear Alternating Least Squares (BALS) algorithm with a deep neural network named ConvTrans-ResNet. The network integrates convolutional embeddings and Transformer modules within a residual learning architecture to exploit both local and global spatial features effectively while ensuring training stability. A series of ablation studies is conducted to optimize architectural components, resulting in a compact configuration with low parameter count and computational complexity. Extensive simulations demonstrate that the proposed method significantly outperforms state-of-the-art neural models such as HA02, ReEsNet, and InterpResNet across a wide range of SNR levels and IRS element sizes in terms of the Normalized Mean Squared Error (NMSE). Compared to existing solutions, our method achieves better estimation accuracy with improved efficiency, making it suitable for practical deployment in IRS-aided systems. Full article
(This article belongs to the Section Communications)
Show Figures

Figure 1

25 pages, 8868 KB  
Article
AttenResNet18: A Novel Cross-Domain Fault Diagnosis Model for Rolling Bearings
by Gangjin Huang, Shanshan Wu, Yingxiao Zhang, Wuguo Wei, Weigang Fu, Junjie Zhang, Yuxuan Yang and Junheng Fu
Sensors 2025, 25(19), 5958; https://doi.org/10.3390/s25195958 - 24 Sep 2025
Viewed by 484
Abstract
To tackle the difficulties in cross-domain fault diagnosis for rolling bearings, researchers have devised numerous domain adaptation strategies to align feature distributions across varied domains. Nevertheless, current approaches tend to be vulnerable to noise disruptions and often neglect the distinctions between marginal and [...] Read more.
To tackle the difficulties in cross-domain fault diagnosis for rolling bearings, researchers have devised numerous domain adaptation strategies to align feature distributions across varied domains. Nevertheless, current approaches tend to be vulnerable to noise disruptions and often neglect the distinctions between marginal and conditional distributions during feature transfer. To resolve these shortcomings, this study presents an innovative fault diagnosis technique for cross-domain applications, leveraging the Attention-Enhanced Residual Network (AttenResNet18). This approach utilizes a one-dimensional attention mechanism to dynamically assign importance to each position within the input sequence, thereby capturing long-range dependencies and essential features, which reduces vulnerability to noise and enhances feature representation. Furthermore, we propose a Dynamic Balance Distribution Adaptation (DBDA) mechanism, which develops an MMD-CORAL Fusion Metric (MCFM) by combining CORrelation ALignment (CORAL) with Maximum Mean Discrepancy (MMD). Moreover, an adaptive factor is employed to dynamically regulate the balance between marginal and conditional distributions, improving adaptability to new and untested tasks. Experimental validation demonstrates that AttenResNet18 achieves an average accuracy of 99.89% on two rolling bearing datasets, representing a significant improvement in fault detection precision over existing methods. Full article
(This article belongs to the Section Fault Diagnosis & Sensors)
Show Figures

Figure 1

19 pages, 4231 KB  
Article
Deep Feature Decoupling Network for Ball Mill Load Signals
by Xiaoyan Luo, Wei Huang, Saisai He, Wencong Xiao and Zhihong Jiang
Machines 2025, 13(10), 881; https://doi.org/10.3390/machines13100881 - 24 Sep 2025
Viewed by 296
Abstract
Accurately identifying the load status of a ball mill is critical for optimizing grinding efficiency and ensuring operational stability. However, the one-dimensional vibration signals collected from ball mills exhibit strong non-stationarity and a high degree of entanglement between multi-scale local transient features and [...] Read more.
Accurately identifying the load status of a ball mill is critical for optimizing grinding efficiency and ensuring operational stability. However, the one-dimensional vibration signals collected from ball mills exhibit strong non-stationarity and a high degree of entanglement between multi-scale local transient features and long-range temporal evolution patterns. To address this, rather than relying on a purely black-box approach, this paper introduces a novel Deep Multi-scale Spatial–Temporal Feature Decoupling Network (DMSTFD-Net) guided by a clear feature decoupling philosophy to enhance model interpretability. The core of DMSTFD-Net lies in its hierarchical collaborative feature refinement mechanism. It first utilizes a one-dimensional residual network (ResNet) to adaptively capture and preliminarily decouple multi-scale spatial characteristics from the raw signal. Subsequently, the extracted high-level feature sequences are fed into a bidirectional gated recurrent unit (Bi-GRU) to decouple high-order temporal dynamic patterns. Experiments on a multi-condition dataset demonstrate that the proposed network achieves a state-of-the-art accuracy of 97.65%. Furthermore, dedicated cross-condition experiments and t-SNE visualizations validate the framework’s effectiveness. The results confirm that DMSTFD-Net provides a powerful, robust, and more interpretable solution for ball mill load identification. Full article
(This article belongs to the Section Advanced Manufacturing)
Show Figures

Figure 1

27 pages, 5776 KB  
Article
R-SWTNet: A Context-Aware U-Net-Based Framework for Segmenting Rural Roads and Alleys in China with the SQVillages Dataset
by Jianing Wu, Junqi Yang, Xiaoyu Xu, Ying Zeng, Yan Cheng, Xiaodong Liu and Hong Zhang
Land 2025, 14(10), 1930; https://doi.org/10.3390/land14101930 - 23 Sep 2025
Viewed by 292
Abstract
Rural road networks are vital for rural development, yet narrow alleys and occluded segments remain underrepresented in digital maps due to irregular morphology, spectral ambiguity, and limited model generalization. Traditional segmentation models struggle to balance local detail preservation and long-range dependency modeling, prioritizing [...] Read more.
Rural road networks are vital for rural development, yet narrow alleys and occluded segments remain underrepresented in digital maps due to irregular morphology, spectral ambiguity, and limited model generalization. Traditional segmentation models struggle to balance local detail preservation and long-range dependency modeling, prioritizing either local features or global context alone. Hypothesizing that integrating hierarchical local features and global context will mitigate these limitations, this study aims to accurately segment such rural roads by proposing R-SWTNet, a context-aware U-Net-based framework, and constructing the SQVillages dataset. R-SWTNet integrates ResNet34 for hierarchical feature extraction, Swin Transformer for long-range dependency modeling, ASPP for multi-scale context fusion, and CAM-Residual blocks for channel-wise attention. The SQVillages dataset, built from multi-source remote sensing imagery, includes 18 diverse villages with adaptive augmentation to mitigate class imbalance. Experimental results show R-SWTNet achieves a validation IoU of 54.88% and F1-score of 70.87%, outperforming U-Net and Swin-UNet, and with less overfitting than R-Net and D-LinkNet. Its lightweight variant supports edge deployment, enabling on-site road management. This work provides a data-driven tool for infrastructure planning under China’s Rural Revitalization Strategy, with potential scalability to global unstructured rural road scenes. Full article
(This article belongs to the Section Land Innovations – Data and Machine Learning)
Show Figures

Figure 1

17 pages, 4643 KB  
Article
Deep Learning Emulator Towards Both Forward and Adjoint Modes of Atmospheric Gas-Phase Chemical Process
by Yulong Liu, Meicheng Liao, Jiacheng Liu and Zhen Cheng
Atmosphere 2025, 16(9), 1109; https://doi.org/10.3390/atmos16091109 - 21 Sep 2025
Viewed by 448
Abstract
Gas-phase chemistry has been identified as a major computational bottleneck in both the forward and adjoint modes of chemical transport models (CTMs). Although previous studies have demonstrated the potential of deep learning models to simulate and accelerate this process, few studies have examined [...] Read more.
Gas-phase chemistry has been identified as a major computational bottleneck in both the forward and adjoint modes of chemical transport models (CTMs). Although previous studies have demonstrated the potential of deep learning models to simulate and accelerate this process, few studies have examined the applicability and performance of these models in adjoint sensitivity analysis. In this study, a deep learning emulator for gas-phase chemistry is developed and trained on a diverse set of forward-mode simulations from the Community Multiscale Air Quality (CMAQ) model. The emulator employs a residual neural network (ResNet) architecture referred to as FiLM-ResNet, which integrates Feature-wise Linear Modulation (FiLM) layers to explicitly account for photochemical and non-photochemical conditions. Validation within a single timestep indicates that the emulator accurately predicts concentration changes for 74% of gas-phase species with coefficient of determination (R2) exceeding 0.999. After embedding the emulator into the CTM, multi-timestep simulation over one week shows close agreement with the numerical model. For the adjoint mode, we compute the sensitivities of ozone (O3) with respect to O3, nitric oxide (NO), nitrogen dioxide (NO2), hydroxyl radical (OH) and isoprene (ISOP) using automatic differentiation, with the emulator-based adjoint results achieving a maximum R2 of 0.995 in single timestep evaluations compared to the numerical adjoint sensitivities. A 24 h adjoint simulation reveals that the emulator maintains spatially consistent adjoint sensitivity distributions compared to the numerical model across most grid cells. In terms of computational efficiency, the emulator achieves speed-ups of 80×–130× in the forward mode and 45×–102× in the adjoint mode, depending on whether inference is executed on Central Processing Unit (CPU) or Graphics Processing Unit (GPU). These findings demonstrate that, once the emulator is accurately trained to reproduce forward-mode gas-phase chemistry, it can be effectively applied in adjoint sensitivity analysis. This approach offers a promising alternative approach to numerical adjoint frameworks in CTMs. Full article
(This article belongs to the Section Atmospheric Techniques, Instruments, and Modeling)
Show Figures

Figure 1

19 pages, 2940 KB  
Article
Monitoring and Diagnostics of Mining Electromechanical Equipment Based on Machine Learning
by Eduard Muratbakeev, Yuriy Kozhubaev, Diana Novak, Roman Ershov and Zhou Wei
Symmetry 2025, 17(9), 1548; https://doi.org/10.3390/sym17091548 - 16 Sep 2025
Viewed by 333
Abstract
Induction motors are a common component of electromechanical equipment in mining operations, yet they are susceptible to failures resulting from frequent start–stops, overloading, wear and tear, and component failure. It is evident that such failures can result in severe ramifications, encompassing industrial accidents [...] Read more.
Induction motors are a common component of electromechanical equipment in mining operations, yet they are susceptible to failures resulting from frequent start–stops, overloading, wear and tear, and component failure. It is evident that such failures can result in severe ramifications, encompassing industrial accidents and economic losses. The present paper proposes a detailed study of engine fault diagnosis technology. It has been demonstrated that prevailing intelligent engine diagnosis algorithms exhibit a limited diagnostic efficacy under variable operating conditions, and the reliability of diagnostic outcomes based on individual signals is questionable. The present paper puts forward the proposition of an investigation into a fault diagnosis algorithm for induction motors. This investigation utilized a range of analytical methods, including signal analysis, deep learning, transfer learning, and information fusion. Currently, the methods employed for fault diagnosis based on traditional machine learning are reliant on the selection of statistical features by those with expertise in the field, resulting in outcomes that are significantly influenced by human factors. This paper is the first to integrate a multi-branch ResNet strategy combining three-phase and single-phase currents. A range of three-phase current input strategies were developed, and a deep learning-based motor fault diagnosis model with adaptive feature extraction was established. This enables the deep residual network to extract fault depth features from the motor current signal more effectively. The experimental findings demonstrate that deep learning possesses the capacity to automatically extract depth features, thereby exceeding the capabilities of conventional machine learning algorithms with regard to the accuracy of motor fault diagnosis. Full article
(This article belongs to the Special Issue Symmetry/Asymmetry in Motor Control, Drives and Power Electronics)
Show Figures

Figure 1

25 pages, 29311 KB  
Article
Abnormal Vibration Signal Detection of EMU Motor Bearings Based on VMD and Deep Learning
by Yanjie Cui, Weijiao Zhang and Zhongkai Wang
Sensors 2025, 25(18), 5733; https://doi.org/10.3390/s25185733 - 14 Sep 2025
Viewed by 557
Abstract
To address the challenge of anomaly detection in vibration signals from high-speed electric multiple unit (EMU) motor bearings, characterized by strong non-stationarity and multi-component coupling, this study proposes a synergistic approach integrating variational mode decomposition (VMD) and deep learning. Unlike datasets focused on [...] Read more.
To address the challenge of anomaly detection in vibration signals from high-speed electric multiple unit (EMU) motor bearings, characterized by strong non-stationarity and multi-component coupling, this study proposes a synergistic approach integrating variational mode decomposition (VMD) and deep learning. Unlike datasets focused on fault diagnosis (identifying known fault types), anomaly detection identifies deviations into unknown states. The method utilizes real-world, non-real-time vibration data from ground monitoring systems to detect anomalies from early signs to significant deviations. Firstly, adaptive VMD parameter selection, guided by power spectral density (PSD), optimizes the number of modes and penalty factors to overcome mode mixing and bandwidth constraints. Secondly, a hybrid deep learning model integrates convolutional neural networks (CNNs), bidirectional long- and short-term memory (BiLSTM), and residual network (ResNet), enabling precise modal component prediction and signal reconstruction through multi-scale feature extraction and temporal modeling. Finally, the root mean square (RMS) features of prediction errors from normal operational data train a one-class support vector machine (OC-SVM), establishing a normal-state decision boundary for anomaly identification. Validation using CR400AF EMU motor bearing data demonstrates exceptional performance: under normal conditions, root mean square error (RMSE=0.005), Mean Absolute Error (MAE=0.002), and Coefficient of Determination (R2=0.999); for anomaly detection, accuracy = 95.2% and F1-score = 0.909, significantly outperforming traditional methods like Isolation Forest (F1-score = 0.389). This provides a reliable technical solution for intelligent operation and maintenance of EMU motor bearings in complex conditions. Full article
(This article belongs to the Section Fault Diagnosis & Sensors)
Show Figures

Figure 1

21 pages, 4721 KB  
Article
Automated Brain Tumor MRI Segmentation Using ARU-Net with Residual-Attention Modules
by Erdal Özbay and Feyza Altunbey Özbay
Diagnostics 2025, 15(18), 2326; https://doi.org/10.3390/diagnostics15182326 - 13 Sep 2025
Viewed by 592
Abstract
Background/Objectives: Accurate segmentation of brain tumors in Magnetic Resonance Imaging (MRI) scans is critical for diagnosis and treatment planning due to their life-threatening nature. This study aims to develop a robust and automated method capable of precisely delineating heterogeneous tumor regions while improving [...] Read more.
Background/Objectives: Accurate segmentation of brain tumors in Magnetic Resonance Imaging (MRI) scans is critical for diagnosis and treatment planning due to their life-threatening nature. This study aims to develop a robust and automated method capable of precisely delineating heterogeneous tumor regions while improving segmentation accuracy and generalization. Methods: We propose Attention Res-UNet (ARU-Net), a novel Deep Learning (DL) architecture integrating residual connections, Adaptive Channel Attention (ACA), and Dimensional-space Triplet Attention (DTA) modules. The encoding module efficiently extracts and refines relevant feature information by applying ACA to the lower layers of convolutional and residual blocks. The DTA is fixed to the upper layers of the decoding module, decoupling channel weights to better extract and fuse multi-scale features, enhancing both performance and efficiency. Input MRI images are pre-processed using Contrast Limited Adaptive Histogram Equalization (CLAHE) for contrast enhancement, denoising filters, and Linear Kuwahara filtering to preserve edges while smoothing homogeneous regions. The network is trained using categorical cross-entropy loss with the Adam optimizer on the BTMRII dataset, and comparative experiments are conducted against baseline U-Net, DenseNet121, and Xception models. Performance is evaluated using accuracy, precision, recall, F1-score, Dice Similarity Coefficient (DSC), and Intersection over Union (IoU) metrics. Results: Baseline U-Net showed significant performance gains after adding residual connections and ACA modules, with DSC improving by approximately 3.3%, accuracy by 3.2%, IoU by 7.7%, and F1-score by 3.3%. ARU-Net further enhanced segmentation performance, achieving 98.3% accuracy, 98.1% DSC, 96.3% IoU, and a superior F1-score, representing additional improvements of 1.1–2.0% over the U-Net + Residual + ACA variant. Visualizations confirmed smoother boundaries and more precise tumor contours across all six tumor classes, highlighting ARU-Net’s ability to capture heterogeneous tumor structures and fine structural details more effectively than both baseline U-Net and other conventional DL models. Conclusions: ARU-Net, combined with an effective pre-processing strategy, provides a highly reliable and precise solution for automated brain tumor segmentation. Its improvements across multiple evaluation metrics over U-Net and other conventional models highlight its potential for clinical application and contribute novel insights to medical image analysis research. Full article
(This article belongs to the Special Issue Advances in Functional and Structural MR Image Analysis)
Show Figures

Figure 1

18 pages, 5871 KB  
Article
Inversion of Shear and Longitudinal Acoustic Wave Propagation Parameters in Sea Ice Using SE-ResNet
by Jin Bai, Yi Liu, Xuegang Zhang, Wenmao Yin and Ziye Deng
Sensors 2025, 25(18), 5663; https://doi.org/10.3390/s25185663 - 11 Sep 2025
Viewed by 309
Abstract
With the advancement of scientific research, understanding the physical parameters governing acoustic wave propagation in sea ice has become increasingly important. Among these parameters, shear wave velocity plays a crucial role. However, as measurements progressed, it became apparent that there was a large [...] Read more.
With the advancement of scientific research, understanding the physical parameters governing acoustic wave propagation in sea ice has become increasingly important. Among these parameters, shear wave velocity plays a crucial role. However, as measurements progressed, it became apparent that there was a large discrepancy between measured values of shear waves and predictions based on empirical formulas or existing models. These inconsistencies stem primarily from the complex internal structure of natural sea ice, which significantly influences its physical behavior. Research reveals that shear wave velocity is not only influenced by bulk properties such as density, temperature, and stress state but is also sensitive to microstructural features, including air bubbles, inclusions, and ice crystal orientation. Compared to longitudinal wave velocity, the characterization of shear wave velocity is far more challenging due to these inherent complexities, underscoring the need for more precise measurement and modeling techniques. To address the challenges posed by the complex internal structure of natural sea ice and improve prediction accuracy, this study introduces a novel, integrated approach combining simulation, measurement, and inversion intelligent learning model. First, a laboratory-based method for generating sea ice layers under controlled formation conditions is developed. The produced sea ice layers align closely with measured values for Poisson’s ratio, multi-year sea ice density, and uniaxial compression modulus, particularly in the high-temperature range. Second, enhancements to shear wave velocity measurement equipment have been implemented. The improved device achieves measurement accuracy exceeding 1%, offers portability, and meets the demands of high-precision experiments conducted in harsh polar environments. Finally, according to the characteristics of small sample data. The ANN neural network was improved to a deep residual neural network with the addition of Squeeze-and-Excitation Attention (SE-ResNet) to predict longitudinal and transverse wave velocities. This prediction method improves the accuracy of shear and longitudinal wave velocity prediction by 24.87% and 39.59%, respectively, compared to the ANN neural network. Full article
Show Figures

Figure 1

33 pages, 4751 KB  
Article
U-ResNet, a Novel Network Fusion Method for Image Classification and Segmentation
by Wenkai Li, Zhe Gao and Yaqing Song
Sensors 2025, 25(17), 5600; https://doi.org/10.3390/s25175600 - 8 Sep 2025
Viewed by 805
Abstract
Image classification and segmentation are important tasks in computer vision. ResNet and U-Net are representative networks for image classification and image segmentation, respectively. Although many scholars used to fuse these two networks, most integration focuses on image segmentation with U-Net, overlooking the capabilities [...] Read more.
Image classification and segmentation are important tasks in computer vision. ResNet and U-Net are representative networks for image classification and image segmentation, respectively. Although many scholars used to fuse these two networks, most integration focuses on image segmentation with U-Net, overlooking the capabilities of ResNet for image classification. In this paper, we propose a novel U-ResNet structure by combining U-Net’s convolution–deconvolution structure (UBlock) with ResNet’s residual structure (ResBlock) in a parallel manner. This novel parallel structure achieves rapid convergence and high accuracy in image classification and segmentation while also efficiently alleviating the vanishing gradient problem. Specifically, in the UBlock, the pixel-level features of both high- and low-resolution images are extracted and processed. In the ResBlock, a Selected Upsampling (SU) module was introduced to enhance performance on low-resolution datasets, and an improved Efficient Upsampling Convolutional Block (EUCB*) with a Channel Shuffle mechanism was added before the output of the ResBlock to enhance network convergence. Features from both the ResBlock and UBlock were merged for better decision making. This architecture outperformed the state-of-the-art (SOTA) models in both image classification and segmentation tasks on open-source and private datasets. Functions of individual modules were further verified via ablation studies. The superiority of the proposed U-ResNet displays strong feasibility and potential for advanced cross-paradigm tasks in computer vision. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

Back to TopTop