Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (278)

Search Parameters:
Keywords = shot noise

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
43 pages, 8627 KB  
Article
Fault Diagnosis of Rolling Bearings Based on HFMD and Dual-Branch Parallel Network Under Acoustic Signals
by Hengdi Wang, Haokui Wang and Jizhan Xie
Sensors 2025, 25(17), 5338; https://doi.org/10.3390/s25175338 - 28 Aug 2025
Abstract
This paper proposes a rolling bearing fault diagnosis method based on HFMD and a dual-branch parallel network, aiming to address the issue of diagnostic accuracy being compromised by the disparity in data quality across different source domains due to sparse feature separation in [...] Read more.
This paper proposes a rolling bearing fault diagnosis method based on HFMD and a dual-branch parallel network, aiming to address the issue of diagnostic accuracy being compromised by the disparity in data quality across different source domains due to sparse feature separation in rolling bearing acoustic signals. Traditional methods face challenges in feature extraction, sensitivity to noise, and difficulties in handling coupled multi-fault conditions in rolling bearing fault diagnosis. To overcome these challenges, this study first employs the HawkFish Optimization Algorithm to optimize Feature Mode Decomposition (HFMD) parameters, thereby improving modal decomposition accuracy. The optimal modal components are selected based on the minimum Residual Energy Index (REI) criterion, with their time-domain graphs and Continuous Wavelet Transform (CWT) time-frequency diagrams extracted as network inputs. Then, a dual-branch parallel network model is constructed, where the multi-scale residual structure (Res2Net) incorporating the Efficient Channel Attention (ECA) mechanism serves as the temporal branch to extract key features and suppress noise interference, while the Swin Transformer integrating multi-stage cross-scale attention (MSCSA) acts as the time-frequency branch to break through local perception bottlenecks and enhance classification performance under limited resources. Finally, the time-domain graphs and time-frequency graphs are, respectively, input into Res2Net and Swin Transformer, and the features from both branches are fused through a fully connected layer to obtain comprehensive fault diagnosis results. The research results demonstrate that the proposed method achieves 100% accuracy in open-source datasets. In the experimental data, the diagnostic accuracy of this study demonstrates significant advantages over other diagnostic models, achieving an accuracy rate of 98.5%. Under few-shot conditions, this study maintains an accuracy rate no lower than 95%, with only a 2.34% variation in accuracy. HFMD and the dual-branch parallel network exhibit remarkable stability and superiority in the field of rolling bearing fault diagnosis. Full article
(This article belongs to the Section Fault Diagnosis & Sensors)
20 pages, 1320 KB  
Article
A Method for Few-Shot Modulation Recognition Based on Reinforcement Metric Meta-Learning
by Fan Zhou, Xiao Han, Jinyang Ren, Wei Wang, Yang Wang, Peiying Zhang and Shaolin Liao
Computers 2025, 14(9), 346; https://doi.org/10.3390/computers14090346 - 22 Aug 2025
Viewed by 210
Abstract
In response to the problem where neural network models fail to fully learn signal sample features due to an insufficient number of signal samples, leading to a decrease in the model’s ability to recognize signal modulation methods, a few-shot signal modulation mode recognition [...] Read more.
In response to the problem where neural network models fail to fully learn signal sample features due to an insufficient number of signal samples, leading to a decrease in the model’s ability to recognize signal modulation methods, a few-shot signal modulation mode recognition method based on reinforcement metric meta-learning (RMML) is proposed. This approach, grounded in meta-learning techniques, employs transfer learning to building a feature extraction network that effectively extracts the data features under few-shot conditions. Building on this, by integrating the measurement of features of similar samples and the differences between features of different classes of samples, the metric network’s target loss function is optimized, thereby improving the network’s ability to distinguish between features of different modulation methods. The experimental results demonstrate that this method exhibits a good performance in processing new class signals that have not been previously trained. Under the condition of 5-way 5-shot, when the signal-to-noise ratio (SNR) is 0 dB, this method can achieve an average recognition accuracy of 91.8%, which is 2.8% higher than that of the best-performing baseline method, whereas when the SNR is 18 dB, the model’s average recognition accuracy significantly improves to 98.5%. Full article
(This article belongs to the Special Issue Wireless Sensor Networks in IoT)
Show Figures

Figure 1

14 pages, 2144 KB  
Article
Analogs of the Prime Number Problem in a Shot Noise Suppression of the Soft-Reset Process
by Yutaka Hirose
Nanomaterials 2025, 15(17), 1297; https://doi.org/10.3390/nano15171297 - 22 Aug 2025
Viewed by 151
Abstract
The soft-reset process, or a sequence of charge emissions from a floating storage node through a transistor biased in a subthreshold bias condition, is modeled by a master (Kolmogorov–Bateman) equation. The Coulomb interaction energy after each one-charge emission leads to a stepwise potential [...] Read more.
The soft-reset process, or a sequence of charge emissions from a floating storage node through a transistor biased in a subthreshold bias condition, is modeled by a master (Kolmogorov–Bateman) equation. The Coulomb interaction energy after each one-charge emission leads to a stepwise potential increase, giving correlated emission rates represented by Boltzmann factors. The governing probability distribution function is a hypoexponential type, and its cumulants describe characteristics of the single-charge Coulomb interaction at room temperature on a mesoscopic scale. The cumulants are further extended into a complex domain. Starting from three fundamental assumptions, i.e., the generation of non-degenerated states due to single-charge Coulomb energy, the Markovian property of each emission event, and the independence of each state, a moment function is identified as a product of mutually prime elements (algebraically termed as prime ideals) comprising the eigenvalues or the lifetimes of the emission states. Then, the algebraic structure of the moment function is found to be highly analogous to that of an integer uniquely factored into prime numbers. Treating the lifetimes as analogs of the prime numbers, two types of zeta functions are constructed. Standard analyses of the zeta functions analogous to the prime number problem or the Riemann Hypothesis are performed. For the zeta functions, the analyticity and poles are specified, and the functional equations are derived. Also, the zeta functions are found to be equivalent to the analytic extension of the cumulants. Finally, between the number of emitted charges and the lifetime, a logarithmic relation analogous to the prime number theorem is derived. Full article
(This article belongs to the Special Issue The Interaction of Electron Phenomena on the Mesoscopic Scale)
Show Figures

Figure 1

24 pages, 2115 KB  
Article
MHD-Protonet: Margin-Aware Hard Example Mining for SAR Few-Shot Learning via Dual-Loss Optimization
by Marii Zayani, Abdelmalek Toumi and Ali Khalfallah
Algorithms 2025, 18(8), 519; https://doi.org/10.3390/a18080519 - 16 Aug 2025
Viewed by 438
Abstract
Synthetic aperture radar (SAR) image classification under limited data conditions faces two major challenges: inter-class similarity, where distinct radar targets (e.g., tanks and armored trucks) have nearly identical scattering characteristics, and intra-class variability, caused by speckle noise, pose changes, and differences in depression [...] Read more.
Synthetic aperture radar (SAR) image classification under limited data conditions faces two major challenges: inter-class similarity, where distinct radar targets (e.g., tanks and armored trucks) have nearly identical scattering characteristics, and intra-class variability, caused by speckle noise, pose changes, and differences in depression angle. To address these challenges, we propose MHD-ProtoNet, a meta-learning framework that extends prototypical networks with two key innovations: margin-aware hard example mining to better separate confusable classes by enforcing prototype distance margins, and dual-loss optimization to refine embeddings and improve robustness to noise-induced variations. Evaluated on the MSTAR dataset in a five-way one-shot task, MHD-ProtoNet achieves 76.80% accuracy, outperforming the Hybrid Inference Network (HIN) (74.70%), as well as standard few-shot methods such as prototypical networks (69.38%), ST-PN (72.54%), and graph-based models like ADMM-GCN (61.79%) and DGP-NET (68.60%). By explicitly mitigating inter-class ambiguity and intra-class noise, the proposed model enables robust SAR target recognition with minimal labeled data. Full article
Show Figures

Figure 1

20 pages, 8858 KB  
Article
Compressed Sensing Reconstruction with Zero-Shot Self-Supervised Learning for High-Resolution MRI of Human Embryos
by Kazuma Iwazaki, Naoto Fujita, Shigehito Yamada and Yasuhiko Terada
Tomography 2025, 11(8), 88; https://doi.org/10.3390/tomography11080088 - 2 Aug 2025
Viewed by 472
Abstract
Objectives: This study investigates whether scan time in the high-resolution magnetic resonance imaging (MRI) of human embryos can be reduced without compromising spatial resolution by applying zero-shot self-supervised learning (ZS-SSL), a deep-learning-based reconstruction method. Methods: Simulations using a numerical phantom were [...] Read more.
Objectives: This study investigates whether scan time in the high-resolution magnetic resonance imaging (MRI) of human embryos can be reduced without compromising spatial resolution by applying zero-shot self-supervised learning (ZS-SSL), a deep-learning-based reconstruction method. Methods: Simulations using a numerical phantom were conducted to evaluate spatial resolution across various acceleration factors (AF = 2, 4, 6, and 8) and signal-to-noise ratio (SNR) levels. Resolution was quantified using a blur-based estimation method based on the Sparrow criterion. ZS-SSL was compared to conventional compressed sensing (CS). Experimental imaging of a human embryo at Carnegie stage 21 was performed at a spatial resolution of (30 μm)3 using both retrospective and prospective undersampling at AF = 4 and 8. Results: ZS-SSL preserved spatial resolution more effectively than CS at low SNRs. At AF = 4, image quality was comparable to that of fully sampled data, while noticeable degradation occurred at AF = 8. Experimental validation confirmed these findings, with clear visualization of anatomical structures—such as the accessory nerve—at AF = 4; there was reduced structural clarity at AF = 8. Conclusions: ZS-SSL enables significant scan time reduction in high-resolution MRI of human embryos while maintaining spatial resolution at AF = 4, assuming an SNR above approximately 15. This trade-off between acceleration and image quality is particularly beneficial in studies with limited imaging time or specimen availability. The method facilitates the efficient acquisition of ultra-high-resolution data and supports future efforts to construct detailed developmental atlases. Full article
Show Figures

Figure 1

18 pages, 7965 KB  
Article
Identification of Environmental Noise Traces in Seismic Recordings Using Vision Transformer and Mel-Spectrogram
by Qianlong Ding, Shuangquan Chen, Jinsong Shen and Borui Wang
Appl. Sci. 2025, 15(15), 8586; https://doi.org/10.3390/app15158586 - 1 Aug 2025
Viewed by 371
Abstract
Environmental noise is inevitable during seismic data acquisition, with major sources including heavy machinery, rivers, wind, and other environmental factors. During field data acquisition, it is important to assess the impact of environmental noise and evaluate data quality. In subsequent seismic data processing, [...] Read more.
Environmental noise is inevitable during seismic data acquisition, with major sources including heavy machinery, rivers, wind, and other environmental factors. During field data acquisition, it is important to assess the impact of environmental noise and evaluate data quality. In subsequent seismic data processing, these noise components also need to be eliminated. Accurate identification of noise traces facilitates rapid quality control (QC) during fieldwork and provides a reliable basis for targeted noise attenuation. Conventional environmental noise identification primarily relies on amplitude differences. However, in seismic data, high-amplitude signals are not necessarily caused by environmental noise. For example, surface waves or traces near the shot point may also exhibit high amplitudes. Therefore, relying solely on amplitude-based criteria has certain limitations. To improve noise identification accuracy, we use the Mel-spectrogram to extract features from seismic data and construct the dataset. Compared to raw time-series signals, the Mel-spectrogram more clearly reveals energy variations and frequency differences, helping to identify noise traces more accurately. We then employ a Vision Transformer (ViT) network to train a model for identifying noise in seismic data. Tests on synthetic and field data show that the proposed method performs well in identifying noise. Moreover, a denoising case based on synthetic data further confirms its general applicability, making it a promising tool in seismic data QC and processing workflows. Full article
Show Figures

Figure 1

25 pages, 4296 KB  
Article
StripSurface-YOLO: An Enhanced Yolov8n-Based Framework for Detecting Surface Defects on Strip Steel in Industrial Environments
by Haomin Li, Huanzun Zhang and Wenke Zang
Electronics 2025, 14(15), 2994; https://doi.org/10.3390/electronics14152994 - 27 Jul 2025
Viewed by 547
Abstract
Recent advances in precision manufacturing and high-end equipment technologies have imposed ever more stringent requirements on the accuracy, real-time performance, and lightweight design of online steel strip surface defect detection systems. To reconcile the persistent trade-off between detection precision and inference efficiency in [...] Read more.
Recent advances in precision manufacturing and high-end equipment technologies have imposed ever more stringent requirements on the accuracy, real-time performance, and lightweight design of online steel strip surface defect detection systems. To reconcile the persistent trade-off between detection precision and inference efficiency in complex industrial environments, this study proposes StripSurface–YOLO, a novel real-time defect detection framework built upon YOLOv8n. The core architecture integrates an Efficient Cross-Stage Local Perception module (ResGSCSP), which synergistically combines GSConv lightweight convolutions with a one-shot aggregation strategy, thereby markedly reducing both model parameters and computational complexity. To further enhance multi-scale feature representation, this study introduces an Efficient Multi-Scale Attention (EMA) mechanism at the feature-fusion stage, enabling the network to more effectively attend to critical defect regions. Moreover, conventional nearest-neighbor upsampling is replaced by DySample, which produces deeper, high-resolution feature maps enriched with semantic content, improving both inference speed and fusion quality. To heighten sensitivity to small-scale and low-contrast defects, the model adopts Focal Loss, dynamically adjusting to sample difficulty. Extensive evaluations on the NEU-DET dataset demonstrate that StripSurface–YOLO reduces FLOPs by 11.6% and parameter count by 7.4% relative to the baseline YOLOv8n, while achieving respective improvements of 1.4%, 3.1%, 4.1%, and 3.0% in precision, recall, mAP50, and mAP50:95. Under adverse conditions—including contrast variations, brightness fluctuations, and Gaussian noise—SteelSurface-YOLO outperforms the baseline model, delivering improvements of 5.0% in mAP50 and 4.7% in mAP50:95, attesting to the model’s robust interference resistance. These findings underscore the potential of StripSurface–YOLO to meet the rigorous performance demands of real-time surface defect detection in the metal forging industry. Full article
Show Figures

Figure 1

24 pages, 2508 KB  
Article
Class-Discrepancy Dynamic Weighting for Cross-Domain Few-Shot Hyperspectral Image Classification
by Chen Ding, Jiahao Yue, Sirui Zheng, Yizhuo Dong, Wenqiang Hua, Xueling Chen, Yu Xie, Song Yan, Wei Wei and Lei Zhang
Remote Sens. 2025, 17(15), 2605; https://doi.org/10.3390/rs17152605 - 27 Jul 2025
Viewed by 464
Abstract
In recent years, cross-domain few-shot learning (CDFSL) has demonstrated remarkable performance in hyperspectral image classification (HSIC), partially alleviating the distribution shift problem. However, most domain adaptation methods rely on similarity metrics to establish cross-domain class matching, making it difficult to simultaneously account for [...] Read more.
In recent years, cross-domain few-shot learning (CDFSL) has demonstrated remarkable performance in hyperspectral image classification (HSIC), partially alleviating the distribution shift problem. However, most domain adaptation methods rely on similarity metrics to establish cross-domain class matching, making it difficult to simultaneously account for intra-class sample size variations and inherent inter-class differences. To address this problem, existing studies have introduced a class weighting mechanism within the prototype network framework, determining class weights by calculating inter-sample similarity through distance metrics. However, this method suffers from a dual limitation: susceptibility to noise interference and insufficient capacity to capture global class variations, which may lead to distorted weight allocation and consequently result in alignment bias. To solve these issues, we propose a novel class-discrepancy dynamic weighting-based cross-domain FSL (CDDW-CFSL) framework. It integrates three key components: (1) the class-weighted domain adaptation (CWDA) method dynamically measures cross-domain distribution shifts using global class mean discrepancies. It employs discrepancy-sensitive weighting to strengthen the alignment of critical categories, enabling accurate domain adaptation while maintaining feature topology; (2) the class mean refinement (CMR) method incorporates class covariance distance to compute distribution discrepancies between support set samples and class prototypes, enabling the precise capture of cross-domain feature internal structures; (3) a novel multi-dimensional feature extractor that captures both local spatial details and continuous spectral characteristics simultaneously, facilitating deep cross-dimensional feature fusion. The results in three publicly available HSIC datasets show the effectiveness of the CDDW-CFSL. Full article
Show Figures

Figure 1

17 pages, 6827 KB  
Article
Deep Learning-Based Min-Entropy-Accelerated Evaluation for High-Speed Quantum Random Number Generation
by Xiaomin Guo, Wenhe Zhou, Yue Luo, Xiangyu Meng, Jiamin Li, Yaoxing Bian, Yanqiang Guo and Liantuan Xiao
Entropy 2025, 27(8), 786; https://doi.org/10.3390/e27080786 - 24 Jul 2025
Viewed by 254
Abstract
Secure communication is critically dependent on high-speed and high-security quantum random number generation (QRNG). In this work, we present a responsive approach to enhance the efficiency and security of QRNG by leveraging polarization-controlled heterodyne detection to simultaneously measure the quadrature amplitude and phase [...] Read more.
Secure communication is critically dependent on high-speed and high-security quantum random number generation (QRNG). In this work, we present a responsive approach to enhance the efficiency and security of QRNG by leveraging polarization-controlled heterodyne detection to simultaneously measure the quadrature amplitude and phase fluctuations of vacuum shot noise. To address the practical non-idealities inherent in QRNG systems, we investigate the critical impacts of imbalanced heterodyne detection, amplitude–phase overlap, finite-size effects, and security parameters on quantum conditional min-entropy derived from the entropy uncertainty principle. It effectively mitigates the overestimation of randomness and fortifies the system against potential eavesdropping attacks. For a high-security parameter of 1020, QRNG achieves a true random bit extraction ratio of 83.16% with a corresponding real-time speed of 37.25 Gbps following a 16-bit analog-to-digital converter quantization and 1.4 GHz bandwidth extraction. Furthermore, we develop a deep convolutional neural network for rapid and accurate entropy evaluation. The entropy evaluation of 13,473 sets of quadrature data is processed in 68.89 s with a mean absolute percentage error of 0.004, achieving an acceleration of two orders of magnitude in evaluation speed. Extracting the shot noise with full detection bandwidth, the generation rate of QRNG using dual-quadrature heterodyne detection exceeds 85 Gbps. The research contributes to advancing the practical deployment of QRNG and expediting rapid entropy assessment. Full article
(This article belongs to the Section Quantum Information)
Show Figures

Figure 1

16 pages, 3084 KB  
Article
Generating Large Time–Bandwidth Product RF-Chirped Waveforms Using Vernier Dual-Optical Frequency Combs
by Mohammed S. Alshaykh
Photonics 2025, 12(7), 700; https://doi.org/10.3390/photonics12070700 - 11 Jul 2025
Viewed by 364
Abstract
Chirped radio-frequency signals are essential waveforms in radar systems. To enhance resolution and improve the signal-to-noise ratio through higher energy transmission, chirps with high time–bandwidth products are highly desirable. Photonic technologies, with their ability to handle broad electrical bandwidths, have been widely employed [...] Read more.
Chirped radio-frequency signals are essential waveforms in radar systems. To enhance resolution and improve the signal-to-noise ratio through higher energy transmission, chirps with high time–bandwidth products are highly desirable. Photonic technologies, with their ability to handle broad electrical bandwidths, have been widely employed in the generation, filtering, processing, and detection of broadband electrical waveforms. In this work, we propose a photonics-based large-TBWP RF chirp generator utilizing dual optical frequency combs with a small difference in the repetition rate. By employing dispersion modules for frequency-to-time mapping, we convert the spectral interferometric patterns into a temporal RF sinusoidal carrier signal whose frequency is swept through the optical shot-to-shot delay. We derive analytical expressions to quantify the system’s performance under various design parameters, including the comb repetition rate and its offset, the second-order dispersion, the transform-limited optical pulse width, and the photodetector’s bandwidth limitations. We benchmark the expected system performance in terms of RF bandwidth, chirp duration, chirp rate, frequency step size, and TBWP. Using realistic dual-comb source parameters, we demonstrate the feasibility of generating RF chirps with a duration of 284.44 μs and a bandwidth of 234.05 GHz, corresponding to a TBWP of 3.3×107. Full article
Show Figures

Figure 1

19 pages, 17180 KB  
Article
Adaptive Support Weight-Based Stereo Matching with Iterative Disparity Refinement
by Alexander Richter, Till Steinmann, Andreas Reichenbach and Stefan J. Rupitsch
Sensors 2025, 25(13), 4124; https://doi.org/10.3390/s25134124 - 2 Jul 2025
Viewed by 509
Abstract
Real-time 3D reconstruction in minimally invasive surgery improves depth perception and supports intraoperative decision-making and navigation. However, endoscopic imaging presents significant challenges, such as specular reflections, low-texture surfaces, and tissue deformation. We present a novel, deterministic and iterative stereo-matching method based on adaptive [...] Read more.
Real-time 3D reconstruction in minimally invasive surgery improves depth perception and supports intraoperative decision-making and navigation. However, endoscopic imaging presents significant challenges, such as specular reflections, low-texture surfaces, and tissue deformation. We present a novel, deterministic and iterative stereo-matching method based on adaptive support weights that is tailored to these constraints. The algorithm is implemented in CUDA and C++ to enable real-time performance. We evaluated our method on the Stereo Correspondence and Reconstruction of Endoscopic Data (SCARED) dataset and a custom synthetic dataset using the mean absolute error (MAE), root mean square error (RMSE), and frame rate as metrics. On SCARED datasets 8 and 9, our method achieves MAEs of 3.79 mm and 3.61 mm, achieving 24.9 FPS on a system with an AMD Ryzen 9 5950X and NVIDIA RTX 3090. To the best of our knowledge, these results are on par with or surpass existing deterministic stereo-matching approaches. On synthetic data, which eliminates real-world imaging errors, the method achieves an MAE of 140.06 μm and an RMSE of 251.9 μm, highlighting its performance ceiling under noise-free, idealized conditions. Our method focuses on single-shot 3D reconstruction as a basis for stereo frame stitching and full-scene modeling. It provides accurate, deterministic, real-time depth estimation under clinically relevant conditions and has the potential to be integrated into surgical navigation, robotic assistance, and augmented reality workflows. Full article
(This article belongs to the Special Issue Stereo Vision Sensing and Image Processing)
Show Figures

Figure 1

17 pages, 489 KB  
Review
Experimental Advances in Phase Estimation with Photonic Quantum States
by Laura T. Knoll, Agustina G. Magnoni and Miguel A. Larotonda
Entropy 2025, 27(7), 712; https://doi.org/10.3390/e27070712 - 1 Jul 2025
Viewed by 839
Abstract
Photonic quantum metrology has emerged as a leading platform for quantum-enhanced precision measurements. By taking advantage of quantum resources such as entanglement, quantum metrology enables parameter estimation with sensitivities surpassing classical limits. In this review, we describe the basic tools and recent experimental [...] Read more.
Photonic quantum metrology has emerged as a leading platform for quantum-enhanced precision measurements. By taking advantage of quantum resources such as entanglement, quantum metrology enables parameter estimation with sensitivities surpassing classical limits. In this review, we describe the basic tools and recent experimental progress in the determination of an optical phase with a precision that may exceed the shot-noise limit, enabled by the use of nonclassical states of light. We review the state of the art and discuss the challenges and trends in the field. Full article
(This article belongs to the Section Quantum Information)
Show Figures

Figure 1

19 pages, 3484 KB  
Article
Rolling Bearing Fault Diagnosis Model Based on Multi-Scale Depthwise Separable Convolutional Neural Network Integrated with Spatial Attention Mechanism
by Zhixin Jin, Xudong Hu, Hongli Wang, Shengyu Guan, Kaiman Liu, Zhiwen Fang, Hongwei Wang, Xuesong Wang, Lijie Wang and Qun Zhang
Sensors 2025, 25(13), 4064; https://doi.org/10.3390/s25134064 - 30 Jun 2025
Cited by 1 | Viewed by 420
Abstract
In response to the challenges posed by complex and variable operating conditions of rolling bearings and the limited availability of labeled data, both of which hinder the effective extraction of key fault features and reduce diagnostic accuracy, this study introduces a model that [...] Read more.
In response to the challenges posed by complex and variable operating conditions of rolling bearings and the limited availability of labeled data, both of which hinder the effective extraction of key fault features and reduce diagnostic accuracy, this study introduces a model that combines a spatial attention (SA) mechanism with a multi-scale depthwise separable convolution module. The proposed approach first employs the Gramian angular difference field (GADF) to convert raw signals. This conversion maps the temporal characteristics of the signal into an image format that intrinsically preserves both temporal dynamics and phase relationships. Subsequently, the model architecture incorporates a spatial attention mechanism and a multi-scale depthwise separable convolutional module. Guided by the attention mechanism to concentrate on discriminative feature regions and to suppress noise, the convolutional component efficiently extracts hierarchical features in parallel through the multi-scale receptive fields. Furthermore, the trained model serves as a pre-trained network and is transferred to novel variable-condition environments to enhance diagnostic accuracy in few-shot scenarios. The effectiveness of the proposed model was evaluated using bearing datasets and field-collected industrial data. Experimental results confirm that the proposed model offers outstanding fault recognition performance and generalization capability across diverse working conditions, small-sample scenarios, and real industrial environments. Full article
(This article belongs to the Section Fault Diagnosis & Sensors)
Show Figures

Figure 1

24 pages, 3716 KB  
Article
HRRPGraphNet++: Dynamic Graph Neural Network with Meta-Learning for Few-Shot HRRP Radar Target Recognition
by Lingfeng Chen, Zhiliang Pan, Qi Liu and Panhe Hu
Remote Sens. 2025, 17(12), 2108; https://doi.org/10.3390/rs17122108 - 19 Jun 2025
Cited by 1 | Viewed by 758
Abstract
High-Resolution Range Profile (HRRP) radar recognition suffers from data scarcity challenges in real-world applications. We present HRRPGraphNet++, a framework combining dynamic graph neural networks with meta-learning for few-shot HRRP recognition. Our approach generates graph representations dynamically through multi-head self attention (MSA) mechanisms that [...] Read more.
High-Resolution Range Profile (HRRP) radar recognition suffers from data scarcity challenges in real-world applications. We present HRRPGraphNet++, a framework combining dynamic graph neural networks with meta-learning for few-shot HRRP recognition. Our approach generates graph representations dynamically through multi-head self attention (MSA) mechanisms that adapt to target-specific scattering characteristics, integrated with a specialized meta-learning framework employing layer-wise learning rates. Experiments demonstrate state-of-the-art performance in 1-shot (82.3%), 5-shot (91.8%), and 20-shot (94.7%) settings, with enhanced noise robustness (68.7% accuracy at 0 dB SNR). Our hybrid graph mechanism combines physical priors with learned relationships, significantly outperforming conventional methods in challenging scenarios. Full article
(This article belongs to the Special Issue Advanced AI Technology for Remote Sensing Analysis)
Show Figures

Graphical abstract

20 pages, 10753 KB  
Article
Physics-Guided Self-Supervised Learning Full Waveform Inversion with Pretraining on Simultaneous Source
by Qiqi Zheng, Meng Li and Bangyu Wu
J. Mar. Sci. Eng. 2025, 13(6), 1193; https://doi.org/10.3390/jmse13061193 - 19 Jun 2025
Viewed by 656
Abstract
Full waveform inversion (FWI) is an established precise velocity estimation tool for seismic exploration. Machine learning-based FWI could plausibly circumvent the long-standing cycle-skipping problem of traditional model-driven methods. The physics-guided self-supervised FWI is appealing in that it avoids having to make tedious efforts [...] Read more.
Full waveform inversion (FWI) is an established precise velocity estimation tool for seismic exploration. Machine learning-based FWI could plausibly circumvent the long-standing cycle-skipping problem of traditional model-driven methods. The physics-guided self-supervised FWI is appealing in that it avoids having to make tedious efforts in terms of label generation for supervised methods. One way is to employ an inversion network to convert the seismic shot gathers into a velocity model. The objective function is to minimize the difference between the recorded seismic data and the synthetic data by solving the wave equation using the inverted velocity model. To further improve the efficiency, we propose a two-stage training strategy for the self-supervised learning FWI. The first stage is to pretrain the inversion network using a simultaneous source for a large-scale velocity model with high efficiency. The second stage is switched to modeling the separate shot gathers for an accurate measurement of the seismic data to invert the velocity model details. The inversion network is a partial convolution attention modified UNet (PCAMUNet), which combines local feature extraction with global information integration to achieve high-resolution velocity model estimation from seismic shot gathers. The time-domain 2D acoustic wave equation serves as the physical constraint in this self-supervised framework. Different loss functions are used for the two stages, that is, the waveform loss with time weighting for the first stage (simultaneous source) and the hybrid waveform with time weighting and logarithmic envelope loss for the second stage (separate source). Comparative experiments demonstrate that the proposed approach improves both inversion accuracy and efficiency on the Marmousi2 model, Overthrust model, and BP model tests. Moreover, the method exhibits excellent noise resistance and stability when low-frequency data component is missing. Full article
(This article belongs to the Special Issue Modeling and Waveform Inversion of Marine Seismic Data)
Show Figures

Figure 1

Back to TopTop