Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (3,996)

Search Parameters:
Keywords = multi-task network

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
25 pages, 1336 KB  
Article
Construction and Reasoning Method of Knowledge Graph for Aircraft Skin Spraying Process
by Danyang Yu, Chengzhi Su, Huilin Tian, Wenyu Song, Yuxin Yue and Haifeng Bao
Processes 2026, 14(4), 581; https://doi.org/10.3390/pr14040581 (registering DOI) - 7 Feb 2026
Abstract
To address the heavy reliance on experiential knowledge, fragmented multi-source information, and limited intelligence in decision-making for aircraft skin spraying processes, this paper proposes a knowledge reasoning method based on a knowledge graph. The authors construct a knowledge graph that integrates multi-structure ontology [...] Read more.
To address the heavy reliance on experiential knowledge, fragmented multi-source information, and limited intelligence in decision-making for aircraft skin spraying processes, this paper proposes a knowledge reasoning method based on a knowledge graph. The authors construct a knowledge graph that integrates multi-structure ontology and physical rule constraints. This graph systematically organizes and manages multi-dimensional knowledge, including painting object attributes, paint performance indicators, and spraying parameters. On this basis, a three-stage reasoning mechanism with multi-granularity semantic understanding, knowledge enhancement, feature fusion, and multi-constraint intelligent matching (MKM) is designed. The model can perform semantic analysis of the user’s fuzzy query, implicit knowledge completion, and dynamic subgraph matching, so as to give the aircraft skin spraying process plan that meets the constraints of safety, compatibility, and feasibility. The experimental results show that the proposed method is superior to the traditional case-based reasoning method, graph convolutional network method, and knowledge graph embedding method in the key evaluation indices of Hit@1, Hit@3, and MRR in the knowledge reasoning task of aircraft skin spraying process. It also has good robustness and promotion value when data are scarce and parameters are uncertain. This study provides a feasible method of intelligent management and dynamic decision-making in terms of aircraft skin spraying process knowledge, and may be applied to other manufacturing fields. Full article
(This article belongs to the Section AI-Enabled Process Engineering)
21 pages, 1814 KB  
Article
TreeSeg-Net: An End-to-End Instance Segmentation Network for Leaf-Off Forest Point Clouds Using Global Context and Spatial Proximity
by Xingmei Xu, Ruihang Zhang, Shunfu Xiao, Jiayuan Li, Xinyue Zhang, Liying Cao, Helong Yu, Yuntao Ma, Jian Zhang and Xiyang Zhao
Plants 2026, 15(4), 525; https://doi.org/10.3390/plants15040525 (registering DOI) - 7 Feb 2026
Abstract
Forest ecosystems play a pivotal role in maintaining the balance of the global carbon cycle and conserving biodiversity. High-density point clouds derived from unmanned aerial vehicle (UAV) structure from motion (SfM) and multi-view stereo (MVS) technologies offer a cost-effective solution for data acquisition. [...] Read more.
Forest ecosystems play a pivotal role in maintaining the balance of the global carbon cycle and conserving biodiversity. High-density point clouds derived from unmanned aerial vehicle (UAV) structure from motion (SfM) and multi-view stereo (MVS) technologies offer a cost-effective solution for data acquisition. These technologies have become efficient tools for facilitating precision forest resource management and extracting individual tree structural parameters. However, in complex forest scenarios during the leaf-off season, canopies exhibit unstructured branch network morphologies due to the absence of leaf occlusion, and adjacent crowns are heavily interlaced. Consequently, existing segmentation methods struggle to overcome challenges associated with fuzzy boundaries and instance adhesion. To address these challenges, this study proposes TreeSeg-Net, an end-to-end instance segmentation network designed to precisely separate individual trees directly from raw point clouds. The network incorporates a global context attention module (GCAM) to capture long-range feature dependencies, thereby compensating for the limitations of sparse convolution in perceiving global information. Simultaneously, a spatial proximity weighting module (SPWM) is designed. By introducing geometric center constraints and a distance penalty mechanism, this module effectively mitigates under-segmentation issues caused by the feature similarity of adjacent branches in high-canopy-density environments. Experimental results demonstrate that TreeSeg-Net achieves an average precision (AP) of 97.2% in instance segmentation tasks and a mean intersection over union (mIoU) of 99.7% in semantic segmentation tasks. Compared to mainstream networks, the proposed method exhibits superior segmentation accuracy, providing an efficient and automated technical solution for precise resource inventory in complex forest environments. Full article
43 pages, 805 KB  
Article
Enhanced Deep Reinforcement Learning for Robustness Falsification of Partially Observable Cyber-Physical Systems
by Yangwei Xing, Ting Shu, Xuesong Yin and Jinsong Xia
Symmetry 2026, 18(2), 304; https://doi.org/10.3390/sym18020304 (registering DOI) - 7 Feb 2026
Abstract
Robustness falsification is a critical verification task for ensuring the safety of cyber-physical systems (CPS). Under partially observable conditions, where internal states are hidden and only input–output data is accessible, existing deep reinforcement learning (DRL) approaches for CPS robustness falsification face two key [...] Read more.
Robustness falsification is a critical verification task for ensuring the safety of cyber-physical systems (CPS). Under partially observable conditions, where internal states are hidden and only input–output data is accessible, existing deep reinforcement learning (DRL) approaches for CPS robustness falsification face two key limitations: inadequate temporal modeling due to unidirectional network architectures, and sparse reward signals that impede efficient exploration. These limitations severely undermine the efficacy of DRL in black-box falsification, leading to low success rates and high computational costs. This study addresses these limitations by proposing DRL-BiT-MPR, a novel framework whose core innovation is the synergistic integration of a bidirectional temporal network with a multi-granularity reward function. Specifically, the bidirectional temporal network captures bidirectional temporal dependencies, remedies inadequate temporal modeling, and complements unobservable state information. The multi-granularity reward function includes fine-grained, medium-grained and coarse-grained layers, corresponding to single-step local feedback, phased progress feedback, and global result feedback, respectively, providing multi-time-scale incentives to resolve reward sparsity. Experiments are conducted on three benchmark CPS models: the continuous CARS model, the hybrid discrete-continuous AT model, and the controller-based PTC model. Results show that DRL-BiT-MPR increases the falsification success rate by an average of 39.6% compared to baseline methods and reduces the number of simulations by more than 50.2%. The framework’s robustness is further validated through theoretical analysis of convergence and soundness properties, along with systematic parameter sensitivity studies. Full article
30 pages, 5650 KB  
Article
An Intelligent Multi-Task Supply Chain Model Based on Bio-Inspired Networks
by Mehdi Khaleghi, Sobhan Sheykhivand, Nastaran Khaleghi and Sebelan Danishvar
Biomimetics 2026, 11(2), 123; https://doi.org/10.3390/biomimetics11020123 - 6 Feb 2026
Abstract
Acknowledging recent breakthroughs in the context of deep bio-inspired neural networks, several architectural deep network options have been deployed to create intelligent systems. The foundations of convolutional neural networks are influenced by hierarchical processing in the visual cortex. The graph neural networks mimic [...] Read more.
Acknowledging recent breakthroughs in the context of deep bio-inspired neural networks, several architectural deep network options have been deployed to create intelligent systems. The foundations of convolutional neural networks are influenced by hierarchical processing in the visual cortex. The graph neural networks mimic the communication of biological neurons. Considering these two computation methods, a novel deep ensemble network is used to propose a bio-inspired deep graph network for creating an intelligent supply chain model. An automated smart supply chain helps to create a more agile, resilient and sustainable system. Improving the sustainability of the network plays a key role in the efficiency of the supply chain’s performance. The proposed bio-inspired Chebyshev ensemble graph network (Ch-EGN) is hybrid learning for creating an intelligent supply chain. The functionality of the proposed deep network is assessed on two different databases including SupplyGraph and DataCo for risk administration, enhancing supply chain sustainability, identifying hidden risks and increasing the supply chain’s transparency. An average accuracy of 98.95% is obtained using the proposed network for automatic delivery status prediction. The performance metrics regarding multi-class categorization scenarios of the intelligent supply chain confirm the efficiency of the proposed bio-inspired approach for sustainability and risk management. Full article
Show Figures

Figure 1

35 pages, 2737 KB  
Article
Joint Trajectory and Power Optimization for Loosely Coupled Tasks: A Decoupled-Critic MAPPO Approach
by Xiangyu Wu, Changbo Hou, Guojing Meng, Zhichao Zhou and Qin Liu
Drones 2026, 10(2), 116; https://doi.org/10.3390/drones10020116 - 6 Feb 2026
Abstract
Multi-unmanned aerial vehicle (UAV) systems are crucial for establishing resilient communication networks in disaster-stricken areas, but their limited energy and dynamic characteristics pose significant challenges for sustained and reliable service provision. Optimizing resource allocation in this situation is a complex sequential decision-making problem, [...] Read more.
Multi-unmanned aerial vehicle (UAV) systems are crucial for establishing resilient communication networks in disaster-stricken areas, but their limited energy and dynamic characteristics pose significant challenges for sustained and reliable service provision. Optimizing resource allocation in this situation is a complex sequential decision-making problem, which is naturally suitable for multi-agent reinforcement learning (MARL). However, the most advanced MARL methods (e.g., multi-agent proximal policy optimization (MAPPO)) often encounter difficulties in the “loosely coupled” multi-UAV environment due to their overly centralized evaluation mechanism, resulting in unclear credit assignment and inhibiting personalized optimization. To overcome this, we propose a novel hierarchical framework supported by MAPPO with decoupled critics (MAPPO-DC). Our framework employs an efficient clustering algorithm for user association in the upper layer, while MAPPO-DC is used in the lower layer to enable each UAV to learn customized trajectories and power control strategies. MAPPO-DC achieves a complex balance between global coordination and personalized exploration by redesigning the update rules of the critic network, allowing for precise and personalized credit assignment in a loosely coupled environment. In addition, we designed a composite reward function to guide the learning process towards the goal of proportional fairness. The simulation results show that our proposed MAPPO-DC outperforms existing baselines, including independent proximal policy optimization (IPPO) and standard MAPPO, in terms of communication performance and sample efficiency, validating the effectiveness of our tailored MARL architecture for the task. Through model robustness experiments, we have verified that our proposed MAPPO-DC still has certain advantages in strongly coupled environments. Full article
(This article belongs to the Section Drone Communications)
Show Figures

Figure 1

76 pages, 1079 KB  
Systematic Review
Mapping Executive Function Performance Based on Resting-State EEG in Healthy Individuals: A Systematic and Mechanistic Review
by James Chmiel and Donata Kurpas
J. Clin. Med. 2026, 15(3), 1306; https://doi.org/10.3390/jcm15031306 - 6 Feb 2026
Abstract
Introduction: Resting-state EEG (rsEEG) is a scalable window onto trait-like “executive readiness,” but findings have been fragmented by task impurity on the executive-function (EF) side and heterogeneous EEG pipelines. This review synthesizes rsEEG features that reliably track EF in healthy samples across [...] Read more.
Introduction: Resting-state EEG (rsEEG) is a scalable window onto trait-like “executive readiness,” but findings have been fragmented by task impurity on the executive-function (EF) side and heterogeneous EEG pipelines. This review synthesizes rsEEG features that reliably track EF in healthy samples across development and aging and evaluates moderators such as cognitive reserve. Materials and methods: Following PRISMA 2020, we defined PECOS-based eligibility (human participants; eyes-closed/eyes-open rsEEG; spectral, aperiodic, connectivity, topology, microstate, and LRTC features; behavioral EF outcomes) and searched MEDLINE/PubMed, Embase, PsycINFO, Web of Science, Scopus, and IEEE Xplore from inception to 30 August 2025. Two reviewers were screened/double-extracted; the risk of bias in non-randomized studies was assessed using the ROBINS-I tool. Sixty-three studies met criteria (plus citation tracking), spanning from childhood to old age. Results: Across domains, tempo, noise, and wiring jointly explained EF differences. Faster individual/peak alpha frequency (IAF/PAF) related most consistently to manipulation-heavy working may and interference control/vigilance in aging; alpha power was less informative once periodic and aperiodic components were separated. Aperiodic 1/f parameters (slope/offset) indexed domain-general efficiency (processing speed, executive composites) with education-dependent sign flips in later life. Connectivity/topology outperformed local power: efficient, small-world-like alpha networks predicted faster, more consistent decisions and higher WM accuracy, whereas globally heightened alpha/gamma synchrony—and rigid high-beta organization—were behaviorally sluggish. Within-frontal beta/gamma coherence supported span maintenance/sequencing, but excessive fronto-posterior theta coherence selectively undermined WM manipulation/updating. A higher frontal theta/beta ratio forecasts riskier, less adaptive choices and poorer reversal learning for decision policy. Age and reserve consistently moderated effects (e.g., child frontal theta supportive for WM; older-adult slow power often detrimental; stronger EO ↔ EC connectivity modulation and faster alpha with higher reserve). Boundary conditions were common: low-load tasks and homogeneous young samples usually yielded nulls. Conclusions: RsEEG does not diagnose EF independently; single-band metrics or simple ratios lack specificity and can be confounded by age/reserve. Instead, a multi-feature signature—faster alpha pace, steeper 1/f slope with appropriate offset, efficient/flexible alpha-band topology with limited global over-synchrony (especially avoiding long-range theta lock), and supportive within-frontal fast-band coherence—best captures individual differences in executive speed, interference control, stability, and WM manipulation. For reproducible applications, recordings should include ≥5–6 min eyes-closed (plus eyes-open), ≥32 channels, vigilant artifact/drowsiness control, periodic–aperiodic decomposition, lag-insensitive connectivity, and graph metrics; analyses must separate speed from accuracy and distinguish WM maintenance vs. manipulation. Clinical translation should prioritize stratification and monitoring (not diagnosis), interpreted through the lenses of development, aging, and cognitive reserve. Full article
(This article belongs to the Special Issue Innovations in Neurorehabilitation—2nd Edition)
Show Figures

Figure 1

22 pages, 1664 KB  
Article
KAN+Transformer: An Explainable and Efficient Approach for Electric Load Forecasting
by Long Ma, Changna Guo, Yangyang Wang, Yan Zhang and Bin Zhang
Sustainability 2026, 18(3), 1677; https://doi.org/10.3390/su18031677 - 6 Feb 2026
Abstract
Short-Term Residential Load Forecasting (STRLF) is a core task in smart grid dispatching and energy management, and its accuracy directly affects the economy and stability of power systems. Current mainstream methods still have limitations in addressing issues such as complex temporal patterns, strong [...] Read more.
Short-Term Residential Load Forecasting (STRLF) is a core task in smart grid dispatching and energy management, and its accuracy directly affects the economy and stability of power systems. Current mainstream methods still have limitations in addressing issues such as complex temporal patterns, strong stochasticity of load data, and insufficient model interpretability. To this end, this paper proposes an explainable and efficient forecasting framework named KAN+Transformer, which integrates Kolmogorov–Arnold Networks (KAN) with Transformers. The framework achieves performance breakthroughs through three innovative designs: constructing a Reversible Mixture of KAN Experts (RMoK) layer, which optimizes expert weight allocation using a load-balancing loss to enhance feature extraction capability while preserving model interpretability; designing an attention-guided cascading mechanism to dynamically fuse the local temporal patterns extracted by KAN with the global dependencies captured by the Transformer; and introducing a multi-objective loss function to explicitly model the periodicity and trend characteristics of load data. Experiments on four power benchmark datasets show that KAN+Transformer significantly outperforms advanced models such as Autoformer and Informer; ablation studies confirm that the KAN module and the specialized loss function bring accuracy improvements of 7.2% and 4.8%, respectively; visualization analysis further verifies the model’s decision-making interpretability through weight-feature correlation, providing a new paradigm for high-precision and explainable load forecasting in smart grids. Collectively, the results demonstrate our model’s superior capability in representing complex residential load dynamics and capturing both transient and stable consumption behaviors. By enabling more accurate, interpretable, and computationally efficient short-term load forecasting, the proposed KAN+Transformer framework provides effective support for demand-side management, renewable energy integration, and intelligent grid operation. As such, it contributes to improving energy utilization efficiency and enhancing the sustainability and resilience of modern power systems. Full article
Show Figures

Figure 1

18 pages, 947 KB  
Article
A Classifier with Unknown Pattern Recognition for Domain Name System Tunneling Detection in Dynamic Networks
by Huijuan Dong, Zengwei Zheng and Shenfei Pei
Electronics 2026, 15(3), 709; https://doi.org/10.3390/electronics15030709 - 6 Feb 2026
Abstract
Domain Name System (DNS) tunneling, a stealthy attack that exploits DNS infrastructure, poses critical threats to dynamic networks and is evolving with emerging attack patterns. This study aims to accurately classify multi-pattern legitimate and malicious traffic and to identify previously unseen attack patterns. [...] Read more.
Domain Name System (DNS) tunneling, a stealthy attack that exploits DNS infrastructure, poses critical threats to dynamic networks and is evolving with emerging attack patterns. This study aims to accurately classify multi-pattern legitimate and malicious traffic and to identify previously unseen attack patterns. We focus on two core research questions: how to accurately classify known-pattern DNS queries and reliably identify unknown-pattern samples. The codified objective is to develop an unsupervised classification approach that integrates multi-pattern adaptation and the recognition of unknown patterns. We formalize the task as Emerging Pattern Classification and propose the Medium Neighbors Forest. It is a forest-based model that uses the “medium neighbor” mechanism and clustering to identify unknown patterns. Experiments verify that the proposed model effectively identifies unseen patterns, offering a new perspective for DNS tunneling detection. Full article
(This article belongs to the Special Issue AI for Cybersecurity and Emerging Technologies for Secure Systems)
Show Figures

Figure 1

22 pages, 3195 KB  
Article
Building Vector Contour Extraction from Remote Sensing Images Based on Multi-Level Contour Refinement and Morphological Perception
by Wenjie Zhao, Ze Meng, Longjie Luo, Liufeng Tao, Bin Hu and Yongyang Xu
Appl. Sci. 2026, 16(3), 1626; https://doi.org/10.3390/app16031626 - 5 Feb 2026
Abstract
Accurate extraction of building vector contours from high-resolution remote sensing images is a fundamental task for urban mapping and geographic information systems. However, existing approaches often suffer from blurred boundaries and geometric distortions when dealing with buildings of complex shapes, limiting the accuracy [...] Read more.
Accurate extraction of building vector contours from high-resolution remote sensing images is a fundamental task for urban mapping and geographic information systems. However, existing approaches often suffer from blurred boundaries and geometric distortions when dealing with buildings of complex shapes, limiting the accuracy and usability of the extracted building footprints. To address these challenges, this paper proposes a multi-level building contour refinement framework based on morphological perception. The proposed framework integrates a three-stage contour optimization strategy, including principal direction extraction, morphology-based contour reconstruction, and geometry-aware regularization, to progressively refine complex building contours under geometric constraints. In addition, a multi-dimensional contour complexity model and an adaptive threshold optimization network are introduced to dynamically adjust refinement parameters according to contour complexity. Experimental results on the WHU-Mix dataset demonstrate that the proposed method outperforms state-of-the-art approaches, achieving 87.52%, 77.43%, and 87.35% in boundary F1, vertex F1, and mIoU, respectively. These results indicate that the proposed framework provides an effective and robust solution for high-precision building vector contour extraction in complex remote sensing scenarios. Full article
Show Figures

Figure 1

31 pages, 2074 KB  
Article
A Multi-Model Dynamic Selection Framework Using Deep Contextual Bandits for Urban Traffic Flow Prediction in Large-Scale Road Networks
by Silai Chen, Shengfeng Mao, Zongcheng Zhang, Xiaoyuan Zhang, Yunxia Wu, Yangsheng Jiang and Zhihong Yao
Mathematics 2026, 14(3), 566; https://doi.org/10.3390/math14030566 - 4 Feb 2026
Viewed by 132
Abstract
To address the challenge of model selection in large-scale traffic flow prediction tasks, this paper proposes a dynamic multi-model selection framework based on Deep Contextual Bandits (DCB). Centered on the optimal combination of sub-models, the framework leverages contextual information of road segments to [...] Read more.
To address the challenge of model selection in large-scale traffic flow prediction tasks, this paper proposes a dynamic multi-model selection framework based on Deep Contextual Bandits (DCB). Centered on the optimal combination of sub-models, the framework leverages contextual information of road segments to select dynamically among candidate predictors, achieving more efficient and accurate traffic flow prediction. Several mechanisms are introduced to improve strategy learning and convergence, including a baseline network, experience replay, double-model estimation, and prioritized experience sampling. A clustering-based strategy is further designed to reduce the search space and enhance the generalization and transferability. Experiments on real-world traffic datasets demonstrate that the proposed framework significantly outperforms traditional static fusion methods, reinforcement learning (RL) baselines, and mainstream spatiotemporal prediction models. In particular, the framework yields a 1.0% improvement in R2 and a 3.2% reduction in MAE compared to state-of-the-art baselines, while reducing inference time by 43.1%. Moreover, the proposed framework shows strong capability in adaptive model selection under varying contexts, with ablation studies confirming the effectiveness of its key components. Full article
16 pages, 1033 KB  
Article
Harnessing Symmetry in Recurrence Plots: A Multi-Scale Detail Boosting Approach for Time Series Similarity Measurement
by Jiancheng Yin, Xuye Zhuang, Wentao Sui and Yunlong Sheng
Symmetry 2026, 18(2), 290; https://doi.org/10.3390/sym18020290 - 4 Feb 2026
Viewed by 64
Abstract
Time series similarity measurement is a fundamental task underpinning clustering, classification, and anomaly detection. Traditional approaches predominantly rely on one-dimensional data representations, which often fail to capture complex structural dependencies. To address this limitation, this paper proposes a novel similarity measurement framework based [...] Read more.
Time series similarity measurement is a fundamental task underpinning clustering, classification, and anomaly detection. Traditional approaches predominantly rely on one-dimensional data representations, which often fail to capture complex structural dependencies. To address this limitation, this paper proposes a novel similarity measurement framework based on two-dimensional image enhancement. The method initially transforms one-dimensional time series into recurrence plots (RPs), converting temporal dynamics into visually symmetric textures, enhancing the temporal information of the one-dimensional time series. To overcome the potential blurring of fine-grained information during transformation, multi-scale detail boosting (MSDB) is introduced to amplify the high-frequency components and textural details of the RP images. Subsequently, a pre-trained ResNet-18 network is utilized to extract deep visual features from the enhanced images, and the similarity is quantified using the Euclidean distance of these feature vectors. Extensive experiments on the UCR Time Series Classification Archive demonstrate that the proposed method effectively leverages image enhancement to reveal latent temporal patterns. This approach leverages the inherent symmetry properties embedded in recurrence plots. By enhancing the texture of these symmetrical structures, the proposed method provides a more robust and informative basis for similarity assessment. Full article
(This article belongs to the Section Mathematics)
22 pages, 2092 KB  
Article
Research on Hot Spot Fault Detection Method Based on Infrared Images of Photovoltaic Modules in Complex Background
by Lei Li, Weili Wu and Zhong Li
Sensors 2026, 26(3), 1024; https://doi.org/10.3390/s26031024 - 4 Feb 2026
Viewed by 102
Abstract
Aiming at the problem that fault characteristics cannot be effectively expressed due to the low pixel proportion of the hot spot target and background interference when detecting hot spot faults in complex environments, a photovoltaic module hot spot fault detection method integrating U-Net [...] Read more.
Aiming at the problem that fault characteristics cannot be effectively expressed due to the low pixel proportion of the hot spot target and background interference when detecting hot spot faults in complex environments, a photovoltaic module hot spot fault detection method integrating U-Net and YOLOv8 is proposed. Firstly, the U-Net segmentation network is introduced to remove pseudo-high-brightness heat sources in the background and highlight the contour features of the photovoltaic panels, laying a good foundation for the subsequent photovoltaic hot spot fault detection tasks. Secondly, a detection network is built based on the YOLOv8 framework. Aiming at the problems that it is difficult to extract the hot spot features of photovoltaic panels of different sizes and to balance the reasoning speed and detection accuracy, a detection network based on deformable convolution and GhostNet is designed. Furthermore, to enhance the adaptability of the convolutional neural network to multi-scale hot spot targets, deformable convolution (DCN) is introduced into the YOLOv8 network. By adaptively adjusting the shape and size of the receptive field, the detection accuracy is further improved. Then, aiming at the issue that it is difficult to balance accuracy and speed in the detection network, the C2f_Ghost module is designed to simplify the network parameters and improve the model inference speed. To verify the effectiveness of the algorithm, a comparison is made with SSD, YOLOv5, YOLOv7, and YOLOv8. The results show that the proposed algorithm can accurately detect hot spot faults, with an accuracy of up to 88.5%. Full article
22 pages, 11216 KB  
Article
A Multi-Scale Remote Sensing Image Change Detection Network Based on Vision Foundation Model
by Shenbo Liu, Dongxue Zhao and Lijun Tang
Remote Sens. 2026, 18(3), 506; https://doi.org/10.3390/rs18030506 - 4 Feb 2026
Viewed by 100
Abstract
As a key technology in the intelligent interpretation of remote sensing, remote sensing image change detection aims to automatically identify surface changes from images of the same area acquired at different times. Although vision foundation models have demonstrated outstanding capabilities in image feature [...] Read more.
As a key technology in the intelligent interpretation of remote sensing, remote sensing image change detection aims to automatically identify surface changes from images of the same area acquired at different times. Although vision foundation models have demonstrated outstanding capabilities in image feature representation, their inherent patch-based processing and global attention mechanisms limit their effectiveness in perceiving multi-scale targets. To address this, we propose a multi-scale remote sensing image change detection network based on a vision foundation model, termed SAM-MSCD. This network integrates an efficient parameter fine-tuning strategy with a cross-temporal multi-scale feature fusion mechanism, significantly improving change perception accuracy in complex scenarios. Specifically, the Low-Rank Adaptation mechanism is adopted for parameter-efficient fine-tuning of the Segment Anything Model (SAM) image encoder, adapting it for the remote sensing change detection task. A bi-temporal feature interaction module(BIM) is designed to enhance the semantic alignment and the modeling of change relationships between feature maps from different time phases. Furthermore, a change feature enhancement module (CFEM) is proposed to fuse and highlight differential information from different levels, achieving precise capture of multi-scale changes. Comprehensive experimental results on four public remote sensing change detection datasets, namely LEVIR-CD, WHU-CD, NJDS, and MSRS-CD, demonstrate that SAM-MSCD surpasses current state-of-the-art (SOTA) methods on several key evaluation metrics, including the F1-score and Intersection over Union(IoU), indicating its broad prospects for practical application. Full article
(This article belongs to the Section AI Remote Sensing)
Show Figures

Figure 1

23 pages, 11674 KB  
Article
High-Precision Individual Identification Method for UAVs Based on FFS-SPWVD and DIR-YOLOv11
by Jian Yu, Mingwei Qin, Liang Han, Song Lu, Yinghui Zhou and Jun Jiang
Electronics 2026, 15(3), 680; https://doi.org/10.3390/electronics15030680 - 4 Feb 2026
Viewed by 82
Abstract
As the threat from malicious UAVs continues to intensify, accurate identification of individual UAVs has become a critical challenge in regulatory and security domains. Existing single-signal analysis methods suffer from limited recognition accuracy. To address this issue, this paper proposes a high-precision individual [...] Read more.
As the threat from malicious UAVs continues to intensify, accurate identification of individual UAVs has become a critical challenge in regulatory and security domains. Existing single-signal analysis methods suffer from limited recognition accuracy. To address this issue, this paper proposes a high-precision individual identification method for UAVs based on FFS-SPWVD and DIR-YOLOv11. The proposed method first employs a frame-by-frame search strategy combined with the smoothing pseudo-Wigner–Ville distribution (SPWVD) algorithm to obtain effective time–frequency feature representations of flight control signals. Building on this foundation, the YOLOv11n network is adopted as the baseline architecture. To enhance the extraction of time–frequency texture features from UAV signals in complex environments, a Multi-Branch Auxiliary Multi-Scale Fusion Network is incorporated into the neck network. Meanwhile, partial space–frequency selective convolutions are introduced into selected C3k2 modules to alleviate the increased computational burden caused by architectural modifications and to reduce the overall number of model parameters. Experimental results on the public DroneRFb-DIR dataset demonstrate that the proposed method effectively extracts flight control frames and performs high-resolution time–frequency analysis. In individual UAV identification tasks, the proposed approach achieves 96.17% accuracy, 97.82% mAP50, and 95.29% recall, outperforming YOLOv11, YOLOv12, and YOLOv13. This study demonstrates that the proposed method achieves both high accuracy and computational efficiency in individual UAV recognition, providing a practical technical solution for whitelist identification and group size estimation in application scenarios such as border patrol, traffic control, and large-scale events. Full article
Show Figures

Figure 1

22 pages, 10079 KB  
Article
FS2-DETR: Transformer-Based Few-Shot Sonar Object Detection with Enhanced Feature Perception
by Shibo Yang, Xiaoyu Zhang and Panlong Tan
J. Mar. Sci. Eng. 2026, 14(3), 304; https://doi.org/10.3390/jmse14030304 - 4 Feb 2026
Viewed by 114
Abstract
In practical underwater object detection tasks, imbalanced sample distribution and the scarcity of samples for certain classes often lead to insufficient model training and limited generalization capability. To address these challenges, this paper proposes FS2-DETR (Few-Shot Detection Transformer for Sonar Images), a transformer-based [...] Read more.
In practical underwater object detection tasks, imbalanced sample distribution and the scarcity of samples for certain classes often lead to insufficient model training and limited generalization capability. To address these challenges, this paper proposes FS2-DETR (Few-Shot Detection Transformer for Sonar Images), a transformer-based few-shot object detection network tailored for sonar imagery. Considering that sonar images generally contain weak, small, and blurred object features, and that data scarcity in some classes can hinder effective feature learning, the proposed FS2-DETR introduces the following improvements over the baseline DETR model. (1) Feature Enhancement Compensation Mechanism: A decoder-prediction-guided feature resampling module (DPGFRM) is designed to process the multi-scale features and subsequently enhance the memory representations, thereby strengthening the exploitation of key features and improving detection performance for weak and small objects. (2) Visual Prompt Enhancement Mechanism: Discriminative visual prompts are generated to jointly enhance object queries and memory, thereby highlighting distinctive image features and enabling more effective feature capture for few-shot objects. (3) Multi-Stage Training Strategy: Adopting a progressive training strategy to strengthen the learning of class-specific layers, effectively mitigating misclassification in few-shot scenarios and enhancing overall detection accuracy. Extensive experiments conducted on the improved UATD sonar image dataset demonstrate that the proposed FS2-DETR achieves superior detection accuracy and robustness under few-shot conditions, outperforming existing state-of-the-art detection algorithms. Full article
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

Back to TopTop