Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (1,795)

Search Parameters:
Keywords = low-dimensional feature

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 5072 KB  
Article
A Dual-Input Dense U-Net-Based Method for Line Spectrum Purification Under Interference Background
by Zixuan Jia, Tingting Teng and Dajun Sun
J. Mar. Sci. Eng. 2026, 14(8), 700; https://doi.org/10.3390/jmse14080700 - 9 Apr 2026
Abstract
Line spectrum purification is a fundamental task in underwater detection and identification tasks. A dual-input architecture based on Dense U-net is introduced to extract clean line spectra from strong interference. The U-net model features a symmetric encoder–decoder structure that accepts two-dimensional data as [...] Read more.
Line spectrum purification is a fundamental task in underwater detection and identification tasks. A dual-input architecture based on Dense U-net is introduced to extract clean line spectra from strong interference. The U-net model features a symmetric encoder–decoder structure that accepts two-dimensional data as both input and output. The DenseBlock, a core component of DenseNets, offers greater parameter efficiency compared to conventional convolutional layers. In this paper, standard convolutional layers inside the original U-net are replaced by DenseBlocks. This model possesses two input channels, thus allowing the time–frequency feature of the interference and that of the interference–target mixture to be fed simultaneously. With supervised learning, the model is capable of eliminating the strong interference components and background noise from the superimposed spectrum, thereby producing a purified target line spectrum. Compared to traditional interference suppression methods, this approach offers higher feature accuracy and greater signal-to-interference-and-noise ratio (SINR) gain. Moreover, the model is trainable using simulation datasets and then deployed to real-world measurements, demonstrating strong generalization capabilities—a valuable property given the limited availability of labeled samples in underwater detection tasks. Being data-driven, this method operates without requiring prior assumptions about the array configuration, and consequently exhibits greater resilience to array imperfections relative to conventional model-based interference suppression techniques. Simulation and experimental results demonstrate that the proposed method achieves an output SINR improvement of more than 8 dB under low SINR conditions and exhibits significantly better robustness to array position errors than conventional methods, verifying its excellent line spectrum purification capability. Full article
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

24 pages, 2013 KB  
Article
Capacity-Enhanced Li-Fi Transmission Using Autoencoder-Based Latent Representation: Performance Analysis Under Practical Optical Links
by Serin Kim, Yong-Yuk Won and Jiwon Park
Photonics 2026, 13(4), 356; https://doi.org/10.3390/photonics13040356 - 8 Apr 2026
Abstract
Visible light communication (VLC)-based Li-Fi systems suffer from limitations in transmission capacity expansion due to the restricted modulation bandwidth of LEDs. In this study, a latent representation-based NRZ-OOK Li-Fi transmission framework that exploits the statistical feature distribution of the latent space is proposed [...] Read more.
Visible light communication (VLC)-based Li-Fi systems suffer from limitations in transmission capacity expansion due to the restricted modulation bandwidth of LEDs. In this study, a latent representation-based NRZ-OOK Li-Fi transmission framework that exploits the statistical feature distribution of the latent space is proposed to improve transmission efficiency without expanding the physical bandwidth. An autoencoder is employed to transform input images into low-dimensional latent vectors, which are then quantized and modulated for transmission. At the receiver, hard decision and inverse quantization are performed, and the image is reconstructed through a trained decoder by leveraging the distribution characteristics of the latent representation. The effective transmission capacity gain Gcap is defined to quantify the amount of representable information relative to the original data under the same physical link resources according to the latent dimension, achieving up to a 49-fold data representation efficiency. The experimental results over practical optical links (0.5–1.5 m) showed that, in short-range conditions, larger latent dimensions maintained higher reconstruction PSNR, whereas under channel degradation conditions, smaller latent dimensions exhibited higher robustness, demonstrating a performance inversion phenomenon. Furthermore, it was confirmed that the dominant factor governing reconstruction performance shifts from the representational capability of the data to error accumulation characteristics depending on the channel condition. These results suggest that the latent representation-based transmission framework is an effective Li-Fi strategy that can simultaneously consider transmission efficiency and channel robustness through information representation optimization in bandwidth-limited environments. Full article
Show Figures

Figure 1

19 pages, 1991 KB  
Article
Multimodal Deep Learning for Prediction of Progression-Free Survival in Patients with Neuroendocrine Tumors Undergoing 177Lu-Based Peptide Receptor Radionuclide Therapy
by Simon Baur, Tristan Ruhwedel, Ekin Böke, Zuzanna Kobus, Gergana Lishkova, Christoph Wetz, Holger Amthauer, Christoph Roderburg, Frank Tacke, Julian M. Rogasch, Wojciech Samek, Henning Jann, Jackie Ma and Johannes Eschrich
Cancers 2026, 18(8), 1194; https://doi.org/10.3390/cancers18081194 - 8 Apr 2026
Abstract
Background/Objectives: Peptide receptor radionuclide therapy (PRRT) is an established treatment for metastatic neuroendocrine tumors (NETs), yet long-term disease control occurs only in a subset of patients. Predicting progression-free survival (PFS) could support individualized treatment planning. This study evaluates laboratory, imaging, and multimodal [...] Read more.
Background/Objectives: Peptide receptor radionuclide therapy (PRRT) is an established treatment for metastatic neuroendocrine tumors (NETs), yet long-term disease control occurs only in a subset of patients. Predicting progression-free survival (PFS) could support individualized treatment planning. This study evaluates laboratory, imaging, and multimodal deep learning models for PFS prediction in PRRT-treated patients. Methods: In this retrospective, single-center study 116 patients with metastatic NETs undergoing [177Lu]Lu-DOTATOC were included. Clinical characteristics, laboratory values, and pretherapeutic somatostatin receptor positron emission tomography/computed tomographies (SR-PET/CTs) were collected. Seven models were trained to classify low- vs. high-PFS groups, including unimodal (laboratory, SR-PET, or CT) and multimodal fusion approaches. Performance was assessed via repeated 3-fold cross-validation with area under the receiver operating characteristic curve (AUROC) and area under the precision–recall curve (AUPRC). Explainability was evaluated by feature importance analysis and gradient based saliency maps. Results: Forty-two patients (36%) displayed short PFS (≤1 year) and 74 patients displayed long PFS (>1 year). Groups were similar in most characteristics, except for higher baseline chromogranin A (p = 0.003), elevated γ-GT (p = 0.002), and fewer PRRT cycles (p < 0.001) in short-PFS patients. The Random Forest model trained only on laboratory biomarkers reached an AUROC of 0.59 ± 0.02. Unimodal three-dimensional convolutional neural networks using SR-PET or CT performed worse (AUROC 0.42 ± 0.03 and 0.54 ± 0.01, respectively). A multimodal fusion model integrating laboratory values, SR-PET, and CT—augmented with a pretrained CT branch—achieved the best results (AUROC 0.72 ± 0.01, AUPRC 0.80 ± 0.01). Explainability analyses provided insights into model predictions, with explainability patterns in the fusion model appearing physiologically plausible and predominantly tumor-focused. Conclusions: Multimodal deep learning combining SR-PET, CT, and laboratory biomarkers outperformed unimodal approaches for PFS prediction after PRRT. Upon external validation, such models may support risk-adapted follow-up strategies. Full article
Show Figures

Figure 1

24 pages, 2056 KB  
Article
Study on the Public Perception Characteristics of Intangible Cultural Heritage in China from the Perspective of Social Media
by Xing Tu and Yu Xia
ISPRS Int. J. Geo-Inf. 2026, 15(4), 159; https://doi.org/10.3390/ijgi15040159 - 7 Apr 2026
Abstract
Exploring public awareness, participation, and emotional inclination toward intangible cultural heritage (ICH) clarifies public attitudes and demands toward traditional culture, providing a crucial basis for targeted ICH protection and inheritance. Based on ICH text big data collected from China’s mainstream social media platform [...] Read more.
Exploring public awareness, participation, and emotional inclination toward intangible cultural heritage (ICH) clarifies public attitudes and demands toward traditional culture, providing a crucial basis for targeted ICH protection and inheritance. Based on ICH text big data collected from China’s mainstream social media platform Weibo, this study improves the TF-IDF algorithm, integrates LDA topic analysis for semantic feature mining, and trains a new sentiment analysis model to explore public emotional attitudes and their formation mechanisms. The study is geographically limited to China and covers the entire year of 2023. The results show that: (1) Public ICH perception is multi-dimensional, with close attention to crafts like paper-cutting and traditional Chinese medicine; action-oriented terms reflect dynamic inheritance demands. Public discussions focus on three dimensions: ICH inheritance and development (39%), introduction and promotion (45%), and public experience and participation (16%), with the latter accounting for a low proportion. (2) Public sentiment toward ICH is predominantly positive, with all regions scoring above 0.730 (full score = 1), and Zhejiang (0.751) and Jiangsu (0.750) ranking significantly higher. (3) Spatial econometric analysis reveals marked regional differences in ICH sentiment distribution, mainly affected by three key factors—the number of ICH projects, the number of inheritors, and regional GDP—with regression coefficients of 0.699, 0.632, and 0.458 (p < 0.01). This finding provides a basis for formulating targeted ICH protection strategies. Full article
(This article belongs to the Topic 3D Documentation of Natural and Cultural Heritage)
25 pages, 1501 KB  
Article
MA-JTATO: Multi-Agent Joint Task Association and Trajectory Optimization in UAV-Assisted Edge Computing System
by Yunxi Zhang and Zhigang Wen
Drones 2026, 10(4), 267; https://doi.org/10.3390/drones10040267 - 7 Apr 2026
Abstract
With the rapid development of applications such as smart cities and the industrial internet, the computation-intensive tasks generated by massive sensing devices pose significant challenges to traditional cloud computing paradigms. Unmanned aerial vehicle (UAV)-assisted edge computing systems, leveraging their high mobility and wide-area [...] Read more.
With the rapid development of applications such as smart cities and the industrial internet, the computation-intensive tasks generated by massive sensing devices pose significant challenges to traditional cloud computing paradigms. Unmanned aerial vehicle (UAV)-assisted edge computing systems, leveraging their high mobility and wide-area coverage capabilities, offer an innovative architecture for low-latency and highly reliable edge services. However, the practical deployment of such systems faces a highly complex multi-objective optimization problem featured by the tight coupling of task offloading decisions, UAV trajectory planning, and edge server resource allocation. Conventional optimization methods are difficult to adapt to the dynamic and high-dimensional characteristics of this problem, leading to suboptimal system performance. To address this critical challenge, this paper constructs an intelligent collaborative optimization framework for UAV-assisted edge computing systems and formulates the system quality of service (QoS) optimization problem as a mixed-integer non-convex programming problem with the dual objectives of minimizing task processing latency and reducing overall system energy consumption. A multi-agent joint task association and trajectory optimization (MA-JTATO) algorithm based on hybrid reinforcement learning is proposed to solve this intractable problem, which innovatively decouples the original coupled optimization problem into three interrelated subproblems and realizes their collaborative and efficient solution. Specifically, the Advantage Actor-Critic (A2C) algorithm is adopted to realize dynamic and optimal task association between UAVs and edge servers for discrete decision-making requirements; the multi-agent deep deterministic policy gradient (MADDPG) method is employed to achieve cooperative and energy-efficient trajectory planning for multiple UAVs to meet the needs of continuous control in dynamic environments; and convex optimization theory is applied to obtain a closed-form optimal solution for the efficient allocation of computational resources on edge servers. Simulation results demonstrate that the proposed MA-JTATO algorithm significantly outperforms traditional baseline algorithms in enhancing overall QoS, effectively validating the framework’s superior performance and robustness in dynamic and complex scenarios. Full article
(This article belongs to the Section Drone Communications)
Show Figures

Figure 1

14 pages, 16245 KB  
Article
Aging State Classification of Lithium-Ion Batteries in a Low-Dimensional Latent Space
by Limei Jin, Franz Philipp Bereck, Rüdiger-A. Eichel, Josef Granwehr and Christoph Scheurer
Batteries 2026, 12(4), 127; https://doi.org/10.3390/batteries12040127 - 7 Apr 2026
Viewed by 29
Abstract
Battery datasets, whether gathered experimentally or through simulation, are typically high-dimensional and complex, which complicates the direct interpretation of degradation behavior or anomaly detection. To overcome these limitations, this study introduces a framework that compresses battery signals into a low-dimensional representation using an [...] Read more.
Battery datasets, whether gathered experimentally or through simulation, are typically high-dimensional and complex, which complicates the direct interpretation of degradation behavior or anomaly detection. To overcome these limitations, this study introduces a framework that compresses battery signals into a low-dimensional representation using an autoencoder, enabling the extraction of informative features for state analysis. A central component of this work is the systematic comparison of latent representations obtained from two fundamentally different data sources: frequency-domain impedance data and time-domain voltage-current data. The close agreement of aging trajectories in both representations suggests that information traditionally derived from impedance analysis can also be captured directly from raw time-series signals. To better approximate real operating conditions, synthetic datasets are augmented with stochastic perturbations. In this context, latent spaces learned from idealized periodic inputs are contrasted with those derived from permuted and noise-contaminated signals. The resulting low-dimensional features are subsequently evaluated through a support vector machine with both linear and nonlinear kernel functions, allowing the categorization of battery states into fresh, aged and damaged conditions. The results demonstrate that the progression of battery degradation is consistently reflected in the latent space, independent of the input domain or signal quality. This robustness indicates that the proposed approach can effectively capture essential aging characteristics even under non-ideal conditions. Consequently, this framework provides a basis for developing advanced diagnostic strategies, including the design of pseudo-random excitation profiles for improved battery state assessment and optimized operational control. Full article
Show Figures

Graphical abstract

20 pages, 4162 KB  
Article
Exponential Function-Based Neural Tangent Kernels for SECM Signal Reconstruction
by Vadimas Ivinskij, Eugenijus Mačerauskas, Laisvidas Striška, Darius Plonis, Vijitashwa Pandey, Sonata Tolvaisiene and Inga Morkvėnaitė
Appl. Sci. 2026, 16(7), 3578; https://doi.org/10.3390/app16073578 - 6 Apr 2026
Viewed by 148
Abstract
Scanning electrochemical microscopy (SECM) provides spatially resolved electrochemical information but is constrained by long acquisition times arising from dense spatial sampling requirements. This work investigates whether physics-informed signal representations can improve neural reconstruction of SECM approach curve signals from sparse measurements. We propose [...] Read more.
Scanning electrochemical microscopy (SECM) provides spatially resolved electrochemical information but is constrained by long acquisition times arising from dense spatial sampling requirements. This work investigates whether physics-informed signal representations can improve neural reconstruction of SECM approach curve signals from sparse measurements. We propose an exponential function-based Neural Tangent Kernel (NTK) framework in which SECM signals are encoded using deterministic exponential feature mappings aligned with diffusion-controlled electrochemical dynamics. A layer-wise NTK checkpointing mechanism is employed to filter covariantly insignificant components during training, reducing redundancy while preserving dominant signal modes. The method is evaluated on synthetically generated SECM signals designed to replicate characteristic approach curve behavior. Quantitative performance is assessed using root mean square error (RMSE), mean absolute error (MAE), relative error (%), and the coefficient of determination (R2). Compared to a random Gaussian (Fourier feature) baseline (RMSE = 0.0952, MAE = 0.0547, Rel.Err = 17.68%), the proposed exponential mappings achieve consistently lower reconstruction error, with the best configuration yielding RMSE = 0.0858, MAE = 0.0375, and relative error = 11.10% under identical training conditions. Results demonstrate that incorporating physically motivated exponential feature representations into NTK-aware learning improves reconstruction fidelity and stability for low-dimensional electrochemical signals, highlighting the potential of physics-informed kernel methods for accelerated SECM data acquisition. Full article
(This article belongs to the Special Issue Advances in Biosignal Processing, 2nd Edition)
Show Figures

Figure 1

17 pages, 2174 KB  
Article
RadarSSM: A Lightweight Spatiotemporal State Space Network for Efficient Radar-Based Human Activity Recognition
by Rubin Zhao, Fucheng Miao and Yuanjian Liu
Sensors 2026, 26(7), 2259; https://doi.org/10.3390/s26072259 - 6 Apr 2026
Viewed by 193
Abstract
Millimeter-wave radar has gradually gained popularity as a sensor mode for Human Activity Recognition (HAR) in recent years because it preserves the privacy of individuals and is resistant to environmental conditions. Nevertheless, the fast inference of high-dimensional and sparse 4D radar data is [...] Read more.
Millimeter-wave radar has gradually gained popularity as a sensor mode for Human Activity Recognition (HAR) in recent years because it preserves the privacy of individuals and is resistant to environmental conditions. Nevertheless, the fast inference of high-dimensional and sparse 4D radar data is still difficult to perform on low-resource edge devices. Current models, including 3D Convolutional Neural Networks and Transformer-based models, are frequently plagued by extensive parameter overhead or quadratic computational complexity, which restricts their applicability to edge applications. The present paper attempts to resolve these issues by introducing RadarSSM as a lightweight spatiotemporal hybrid network in the context of radar-based HAR. The explicit separation of spatial feature extraction and temporal dependency modeling helps RadarSSM decrease the overall complexity of computation significantly. Specifically, a spatial encoder based on depthwise separable 3D convolutions is designed to efficiently capture fine-grained geometric and motion features from voxelized radar data. For temporal modeling, a bidirectional State Space Model is introduced to capture long-range temporal dependencies with linear time complexity O(T), thereby avoiding the quadratic cost associated with self-attention mechanisms. Extensive experiments conducted on public radar HAR datasets demonstrate that RadarSSM achieves accuracy competitive with state-of-the-art methods while substantially reducing parameter count and computational cost relative to representative convolutional baselines. These results validate the effectiveness of RadarSSM and highlight its suitability for efficient radar sensing on edge hardware. Full article
(This article belongs to the Special Issue Radar and Multimodal Sensing for Ambient Assisted Living)
Show Figures

Figure 1

32 pages, 6103 KB  
Article
An Optimal Deep Hybrid Framework with Selective Kernel U-Net for Skin Lesion Detection and Classification
by Guzal Gulmirzaeva, Robert Hudec, Baxtiyorjon Akbaraliev and Batirbek Samandarov
Bioengineering 2026, 13(4), 427; https://doi.org/10.3390/bioengineering13040427 - 6 Apr 2026
Viewed by 215
Abstract
Early and accurate detection of skin cancer is critical for reducing mortality rates, particularly for malignant melanoma. Automated analysis of dermoscopic images has gained significant attention due to its potential to support clinical diagnosis and overcome the limitations of manual inspection. Motivated by [...] Read more.
Early and accurate detection of skin cancer is critical for reducing mortality rates, particularly for malignant melanoma. Automated analysis of dermoscopic images has gained significant attention due to its potential to support clinical diagnosis and overcome the limitations of manual inspection. Motivated by challenges such as image noise, low contrast, lesion variability, and redundant feature representation, this study proposes an optimal deep hybrid framework for skin lesion detection and classification. The objective of this work is to design a robust and efficient system that integrates advanced preprocessing, precise segmentation, optimal feature selection, and accurate classification. Initially, contrast enhancement using Contrast Limited Adaptive Histogram Equalization (CLAHE) and noise reduction using Wiener filtering are applied to improve image quality. Lesion regions are then segmented using a Selective Kernel U-Net (SK-UNet), which adaptively captures multi-scale spatial information. Subsequently, discriminative color, texture, and shape features are extracted and optimized using the Fossa Optimization Algorithm (FOA) to eliminate redundancy. A hybrid one-dimensional Convolutional Neural Network–Gated Recurrent Unit (1D-CNN–GRU) classifier is employed for final classification, learning both spatial and sequential feature patterns. Experimental evaluation on the ISIC and DermMNIST datasets demonstrates that the proposed framework achieves classification accuracies of 97.6% and 95.6%, respectively, outperforming several existing methods. The results confirm that the proposed hybrid framework provides reliable, accurate, and scalable skin cancer diagnosis, highlighting its potential for assisting clinical decision-making and early detection. Full article
(This article belongs to the Special Issue Deep Learning for Medical Applications: Challenges and Opportunities)
Show Figures

Figure 1

25 pages, 3586 KB  
Article
A Classification Algorithm of UAV and Bird Target Based on L/K Dual-Band Micro-Doppler and Mamba
by Tao Zhang and Xiaoru Song
Drones 2026, 10(4), 265; https://doi.org/10.3390/drones10040265 - 6 Apr 2026
Viewed by 144
Abstract
To address the challenge of accurately distinguishing UAVs and birds in a low-altitude detection field, this paper proposes a classification algorithm of UAVs and birds based on L/K dual-band micro-Doppler spectrograms and Mamba. We establish a dual-band radar detection model for unmanned aerial [...] Read more.
To address the challenge of accurately distinguishing UAVs and birds in a low-altitude detection field, this paper proposes a classification algorithm of UAVs and birds based on L/K dual-band micro-Doppler spectrograms and Mamba. We establish a dual-band radar detection model for unmanned aerial vehicles (UAVs) and birds, provide a method for characterizing the Doppler parameters of the echo signals, and research a UAV and bird target classification network model that integrates micro-Doppler and Mamba. Based on a dual-branch encoding framework, we use Patch block decomposition to design a classification model to serialize the two-dimensional spectrogram of the echo signal, and introduce the Mamba state-space backbone network to extract the long-term sequence features of the target’s micro-motion. The main breakthrough of the proposed classification algorithm lies in the feature fusion stage, where a late fusion strategy is adopted to integrate the dual-path high-level representation measures, fully leveraging the sensitivity of the K-band to high-frequency textures and the scale complementarity of the L-band. Then, according to the joint loss function of mutual learning and contrastive learning, we improve the model’s prediction consistency and representation discriminability. Through experimental testing, the results show that the proposed method can effectively classify UAVs and birds, and compared with other algorithms, the accuracy rate reaches 97.5%. Full article
(This article belongs to the Section Drone Communications)
Show Figures

Figure 1

30 pages, 8434 KB  
Review
AI-Assisted Molecular Biosensors: Design Strategies for Wearable and Real-Time Monitoring
by Sishi Zhu, Jie Zhang, Xuming He, Lijun Ding, Xiao Luo and Weijia Wen
Int. J. Mol. Sci. 2026, 27(7), 3305; https://doi.org/10.3390/ijms27073305 - 6 Apr 2026
Viewed by 387
Abstract
Artificial intelligence (AI) has become a transformative tool in the field of molecular biosensing, enabling data-driven optimization in sensor design, signal processing, and real-time monitoring. AI promotes the discovery of biomarkers, the design of high-affinity receptors, and the rational engineering of sensing materials, [...] Read more.
Artificial intelligence (AI) has become a transformative tool in the field of molecular biosensing, enabling data-driven optimization in sensor design, signal processing, and real-time monitoring. AI promotes the discovery of biomarkers, the design of high-affinity receptors, and the rational engineering of sensing materials, thereby enhancing sensitivity, specificity, and detection accuracy. In the development of biosensors, AI-assisted strategies have accelerated the identification of novel molecular targets, guided the design of proteins and aptamers with enhanced binding performance, and optimized plasmonic and nanophotonic structures through forward prediction and inverse design frameworks. The integration of artificial intelligence has significantly enhanced the performance of various biosensing platforms, including optical, electrochemical, and microfluidic biosensors. It also enabled automatic feature extraction, noise reduction, dimensionality reduction, and multimodal data fusion, overcoming the challenges posed by complex signals, environmental interference, and device variations. These capabilities are particularly crucial for wearable molecular biosensors, as low signal strength, motion artifacts, and fluctuations in physiological conditions impose strict requirements on robustness and real-time reliability. This review systematically summarizes the latest advancements in AI-assisted molecular biosensors, highlighting representative sensing strategies and algorithms for wearable and real-time monitoring, and discusses the current challenges and future development opportunities of intelligent biosensing technologies. Full article
(This article belongs to the Special Issue Biosensors: Emerging Technologies and Real-Time Monitoring)
Show Figures

Figure 1

27 pages, 390 KB  
Article
A Comparative Study of Federated Learning and Amino Acid Encoding with IoT Malware Detection as a Case Study
by Thaer AL Ibaisi, Stefan Kuhn, Muhammad Kazim, Ismail Kara, Turgay Altindag and Mujeeb Ur Rehman
Big Data Cogn. Comput. 2026, 10(4), 111; https://doi.org/10.3390/bdcc10040111 - 6 Apr 2026
Viewed by 164
Abstract
The increasing deployment of Internet of Things (IoT) devices introduces significant security challenges, while privacy concerns limit centralized data aggregation for intrusion detection. Federated learning (FL) offers a decentralized alternative, yet the interaction between feature representation, model architecture, and data heterogeneity remains insufficiently [...] Read more.
The increasing deployment of Internet of Things (IoT) devices introduces significant security challenges, while privacy concerns limit centralized data aggregation for intrusion detection. Federated learning (FL) offers a decentralized alternative, yet the interaction between feature representation, model architecture, and data heterogeneity remains insufficiently understood in IoT malware detection. This study provides a controlled comparative analysis of centralized and federated learning, optionally using amino acid encoding, under IID and Non-IID conditions using a 10,000-sample subset of the CTU–IoT–Malware–Capture dataset. First, we evaluate raw tabular features versus amino acid-based feature encoding, followed by a lightweight multi-layer perceptron (2882 parameters) versus a deeper residual network (70,532 parameters), across binary and multi-class classification tasks. In the binary setting, centralized training achieved up to 98.6% accuracy, while federated IID training reached 98.6%, with differences within statistical variance. Under Non-IID conditions, performance decreased modestly (0.1–0.5 percentage points), and accuracy was consistently lower when using encoded features compared with raw features. The degradation is smaller in deeper architectures and may offer improved stability under highly skewed federated conditions. In the four-class setting, the complex network achieved up to 97.8% accuracy with raw features, while amino acid encoding achieves up to 93.3%. The results show that federated learning can achieve performance comparable to centralized training under moderate heterogeneity, that lightweight architectures are sufficient for low-dimensional IoT traffic features, and that feature compression via amino acid encoding does not inherently mitigate Non-IID effects. These findings clarify the relative impact of representation, heterogeneity, and architectural capacity in practical FL-based IoT intrusion detection systems. Full article
(This article belongs to the Special Issue Application of Cloud Computing in Industrial Internet of Things)
Show Figures

Figure 1

22 pages, 4792 KB  
Article
Distracted Driving Behavior Recognition Based on Improved YOLOv8n-Pose and Multi-Feature Fusion
by Zhuzhou Li, Dudu Guo, Zhenxun Wei, Guoliang Chen, Miao Sun and Yuhao Sun
Appl. Sci. 2026, 16(7), 3532; https://doi.org/10.3390/app16073532 - 3 Apr 2026
Viewed by 171
Abstract
Distracted driving is one of the primary causes of road traffic accidents. Behavior recognition technology based on machine vision has emerged as a research hotspot due to its non-contact and high-efficiency nature. To address the challenges of complex lighting conditions in the driver’s [...] Read more.
Distracted driving is one of the primary causes of road traffic accidents. Behavior recognition technology based on machine vision has emerged as a research hotspot due to its non-contact and high-efficiency nature. To address the challenges of complex lighting conditions in the driver’s cabin, low detection accuracy for small-scale keypoints, and the difficulty in effectively characterizing behavioral features, this paper proposes a distracted driving behavior recognition method based on an improved YOLOv8n-Pose model and multi-feature fusion. First, the original YOLOv8n-Pose model is optimized. A P2 detection layer is added to enhance the feature extraction capabilities for small-scale human keypoints, and the SE attention module is incorporated to improve the model’s robustness under complex lighting conditions. In addition, the loss function is replaced with focal loss to tackle the class imbalance problem, thus forming the YOLOv8n-PSF-Pose keypoint detection network. Subsequently, based on the coordinates of 12 human keypoints extracted by this network, a multi-dimensional feature vector is constructed, which takes joint angles as the core and integrates the relative distances between keypoints and the number of valid keypoints. Finally, a BP neural network is adopted to classify the constructed feature vectors, enabling the accurate recognition of six typical distracted driving behaviors (normal driving, drinking or eating, making phone calls, using mobile phones, operating vehicle infotainment systems, and turning around to fetch items). The experimental results show that the improved YOLOv8n-PSF-Pose model achieves an mAP50 of 93.8% in keypoint detection, which is 6.7 percentage points higher than the original model; the BP classification model based on multi-feature fusion achieves an F1-score of 97.7% in the behavior recognition task, which is significantly better than traditional classifiers such as SVM and random forest, and the image processing speed on the NVIDIA RTX 3090TI reaches a high throughput of 45 FPS. This proves that the proposed method achieves an excellent balance between accuracy and speed. This study provides an effective solution for the real-time and accurate recognition of distracted driving behaviors. Full article
Show Figures

Figure 1

41 pages, 35277 KB  
Article
A Multi-Strategy Improved Seagull Optimization Algorithm for Global Optimization and Artistic Image Segmentation
by Yangyang Jiang
Biomimetics 2026, 11(4), 247; https://doi.org/10.3390/biomimetics11040247 - 3 Apr 2026
Viewed by 269
Abstract
Multilevel threshold image segmentation is a key task in image processing, yet it faces challenges such as low search efficiency in high-dimensional spaces, difficulty in balancing segmentation accuracy and stability, and insufficient adaptability to complex scenes. Existing solutions mainly include traditional thresholding methods [...] Read more.
Multilevel threshold image segmentation is a key task in image processing, yet it faces challenges such as low search efficiency in high-dimensional spaces, difficulty in balancing segmentation accuracy and stability, and insufficient adaptability to complex scenes. Existing solutions mainly include traditional thresholding methods and metaheuristic optimization-based schemes, but they still face limitations in high-dimensional and complex segmentation tasks. The standard Seagull Optimization Algorithm (SOA) suffers from shortcomings including a single exploration mechanism, weak local exploitation capability, and a tendency for population diversity to deteriorate, making it difficult to meet the demands of high-dimensional optimization. To address these issues, this paper proposes a multi-strategy fused improved Seagull Optimization Algorithm (MFISOA), which integrates three strategies: adaptive cooperative foraging, differential evolution-driven exploitation, and centroid opposition-based boundary control. These strategies jointly construct a collaborative optimization framework with dynamic resource allocation, fine local search, and population diversity maintenance, thereby improving global exploration efficiency, local exploitation accuracy, and population stability. To evaluate the optimization performance of MFISOA, numerical simulation experiments were conducted on the CEC2017 and CEC2022 benchmark test suites, and comparisons were made with nine other mainstream advanced algorithms. The results show that MFISOA outperforms the competing algorithms in terms of optimization accuracy, convergence speed, and operational stability. Its superiority is further verified by the Wilcoxon rank-sum test and the Friedman test, with statistical significance (p < 0.05). In the multilevel threshold image segmentation task, using the Otsu criterion as the objective function, MFISOA was tested on nine benchmark images under 4-, 6-, 8-, and 10-threshold segmentation scenarios. The results indicate that MFISOA achieves better performance on metrics such as Structural Similarity Index (SSIM), Peak Signal-to-Noise Ratio (PSNR), and Feature Similarity Index (FSIM), enabling more accurate characterization of image grayscale distribution features and producing higher-quality segmentation results. This study provides an efficient and reliable approach for numerical optimization and multilevel threshold image segmentation. Full article
Show Figures

Figure 1

15 pages, 4227 KB  
Article
Amidoxime-Functionalized Wood-Based Adsorbent for Uranium Extraction
by Xiongxiang Wu, Yu Wang, Haoyang Xu, Chunde Jin and Zhe Wang
Processes 2026, 14(7), 1161; https://doi.org/10.3390/pr14071161 - 3 Apr 2026
Viewed by 172
Abstract
Uranium is a critical raw material for the nuclear industry. Given the vast uranium reserves in seawater, the development of efficient adsorbents is central to extraction technologies. Polyamidoxime (PAO)-based adsorbents are widely utilized due to their high affinity for uranium; however, traditional PAO [...] Read more.
Uranium is a critical raw material for the nuclear industry. Given the vast uranium reserves in seawater, the development of efficient adsorbents is central to extraction technologies. Polyamidoxime (PAO)-based adsorbents are widely utilized due to their high affinity for uranium; however, traditional PAO materials often suffer from low mechanical strength and poor recyclability. To address these limitations, this study utilized natural balsa wood as a substrate. A three-dimensional porous cellulose skeleton (DES-W) featuring high porosity, hydrophilicity, and retained mechanical strength was constructed by partially removing lignin using a deep eutectic solvent (DES). Subsequently, polyamidoxime was loaded onto the inner walls of the DES-W via vacuum impregnation, resulting in a polyamidoxime-functionalized wood-based adsorbent (PAO-WA). The results indicated that PAO-WA achieved an equilibrium adsorption capacity of 45.31 mg/g at pH 6.0 with an initial uranium concentration of 50 mg/L, representing a twofold increase compared to the unmodified DES-W. The adsorption kinetics and isotherms followed the pseudo-second-order and Langmuir models, respectively, suggesting a mechanism dominated by monolayer chemisorption. Mechanism analysis confirmed that uranyl ions were primarily captured via coordination with nitrogen and oxygen atoms in the amidoxime groups, with residual carboxyl groups in the wood contributing to the adsorption process. This work offers a novel strategy for developing efficient, environmentally friendly, and mechanically robust adsorbents for uranium extraction from seawater. Full article
(This article belongs to the Section Chemical Processes and Systems)
Show Figures

Figure 1

Back to TopTop