MDPI - Publisher of Open Access Journals

23 pages, 4815 KB

Open AccessArticle

Automatic Diagnosis, Classification, and Segmentation of Abdominal Aortic Aneurysm and Dissection from Computed Tomography Images

by Hakan Baltaci, Sercan Yalcin, Muhammed Yildirim and Harun Bingol

Diagnostics 2025, 15(19), 2476; https://doi.org/10.3390/diagnostics15192476 (registering DOI) - 27 Sep 2025

Abstract

Background/Objectives: Diagnosis of abdominal aortic aneurysm and abdominal aortic dissection (AAA and AAD) is of strategic importance as cardiovascular disease has fatal implications worldwide. This study presents a novel deep learning-based approach for the accurate and efficient diagnosis of abdominal aortic aneurysms (AAAs) [...] Read more.

Background/Objectives: Diagnosis of abdominal aortic aneurysm and abdominal aortic dissection (AAA and AAD) is of strategic importance as cardiovascular disease has fatal implications worldwide. This study presents a novel deep learning-based approach for the accurate and efficient diagnosis of abdominal aortic aneurysms (AAAs) and aortic dissections (AADs) from CT images. Methods: Our proposed convolutional neural network (CNN) architecture effectively extracts relevant features from CT scans and classifies regions as normal or diseased. Additionally, the model accurately delineates the boundaries of detected aneurysms and dissections, aiding in clinical decision-making. A pyramid scene parsing network has been built in a hybrid method. The layer block after the classification layer is divided into two groups: whether there is an AAA or AAD region in the abdominal CT image, and determination of the borders of the detected diseased region in the medical image. Results: In this sense, both detection and segmentation are performed in AAA and AAD diseases. Python programming has been used to assess the accuracy and performance results of the proposed strategy. From the results, average accuracy rates of 83.48%, 86.9%, 88.25%, and 89.64% were achieved using ResDenseUNet, INet, C-Net, and the proposed strategy, respectively. Also, intersection over union (IoU) of 79.24%, 81.63%, 82.48%, and 83.76% have been achieved using ResDenseUNet, INet, C-Net, and the proposed method. Conclusions: The proposed strategy is a promising technique for automatically diagnosing AAA and AAD, thereby reducing the workload of cardiovascular surgeons. Full article

(This article belongs to the Special Issue Artificial Intelligence and Computational Methods in Cardiology 2025)

21 pages, 4379 KB

Open AccessArticle

Deep Learning-Based Super-Resolution Reconstruction of a 1/9 Arc-Second Offshore Digital Elevation Model for U.S. Coastal Regions

by Chenhao Wu, Bo Zhang, Meng Zhang and Chaofan Yang

Remote Sens. 2025, 17(18), 3205; https://doi.org/10.3390/rs17183205 - 17 Sep 2025

Viewed by 358

Abstract

High-resolution offshore digital elevation models (DEMs) are essential for coastal geomorphology, marine resource management, and disaster prevention. While deep learning-based super-resolution (SR) techniques have become a mainstream solution for enhancing DEMs, they often fail to maintain a balance between large-scale geomorphological structure and [...] Read more.

High-resolution offshore digital elevation models (DEMs) are essential for coastal geomorphology, marine resource management, and disaster prevention. While deep learning-based super-resolution (SR) techniques have become a mainstream solution for enhancing DEMs, they often fail to maintain a balance between large-scale geomorphological structure and fine-scale topographic detail due to limitations in modeling spatial dependency. To overcome this challenge, we propose DEM-Asymmetric multi-scale super-resolution network (DEM-AMSSRN), a novel asymmetric multi-scale super-resolution network tailored for offshore DEM reconstruction. Our method incorporates region-level non-local (RL-NL) modules to capture long-range spatial dependencies and residual multi-scale blocks (RMSBs) to extract hierarchical terrain features. Additionally, a hybrid loss function combining pixel-wise, perceptual, and adversarial losses is introduced to ensure both geometric fidelity and visual realism. Experimental evaluations on U.S. offshore DEM datasets demonstrate that DEM-AMSSRN significantly outperforms existing GAN-based models, reducing RMSE by up to 72.47% (vs. SRGAN) and achieving 53.30 dB PSNR and 0.995056 SSIM. These results highlight its effectiveness in preserving both continental shelf-scale bathymetric patterns and detailed terrain textures. Using this model, we also constructed the USA_OD_2025, a 1/9 arc-second high-resolution offshore DEM for U.S. coastal zones, providing a valuable geospatial foundation for future marine research and engineering. Full article

(This article belongs to the Special Issue Machine Learning and GeoAI for Remote Sensing Environmental Monitoring (2nd Edition))

► Show Figures

Figure 1

14 pages, 353 KB

Open AccessArticle

Building Geometry Generation Example Applying GPT Models

by Zsolt Ercsey and Tamás Storcz

Architecture 2025, 5(3), 79; https://doi.org/10.3390/architecture5030079 - 9 Sep 2025

Viewed by 296

Abstract

The emergence of large language models (LLMs) has opened new avenues for integrating artificial intelligence into architectural design workflows. This paper explores the feasibility of applying generative AI to solve a classic combinatorial problem: generating valid building geometries of a modular family house [...] Read more.

The emergence of large language models (LLMs) has opened new avenues for integrating artificial intelligence into architectural design workflows. This paper explores the feasibility of applying generative AI to solve a classic combinatorial problem: generating valid building geometries of a modular family house structure. The problem involves identifying all valid placements of six spatial blocks under strict architectural constraints. The study contrasts the conventional algorithmic solution with generative approaches using ChatGPT-3.5, ChatGPT-4o, and a hybrid expert model. While early GPT models struggled with accuracy and solution completeness, the hybrid expert-guided approach demonstrated a successful synergy between LLM-driven code generation and domain-specific corrections. The findings suggest that, while LLMs alone are insufficient for precise combinatorial tasks, hybrid systems combining classical and AI techniques hold great promise for supporting architectural problem solving including building geometry generation. Full article

(This article belongs to the Special Issue AI as a Tool for Architectural Design and Urban Planning)

► Show Figures

Figure 1

20 pages, 1051 KB

Open AccessArticle

Synthetic Methods of Sugar Amino Acids and Their Application in the Development of Cyclic Peptide Therapeutics

by Chengcheng Bao and Dekai Wang

Processes 2025, 13(9), 2849; https://doi.org/10.3390/pr13092849 - 5 Sep 2025

Viewed by 435

Abstract

Sugar amino acids (SAAs) represent a privileged class of molecular chimeras that uniquely merge the structural rigidity of carbohydrates with the functional display of amino acids. These hybrid molecules have garnered significant attention as programmable conformational constraints, offering a powerful strategy to overcome [...] Read more.

Sugar amino acids (SAAs) represent a privileged class of molecular chimeras that uniquely merge the structural rigidity of carbohydrates with the functional display of amino acids. These hybrid molecules have garnered significant attention as programmable conformational constraints, offering a powerful strategy to overcome the inherent limitations of peptide-based therapeutics, such as proteolytic instability and conformational ambiguity. The strategic incorporation of SAAs into peptide backbones, particularly within cyclic frameworks, allows for the rational design of peptidomimetics with pre-organized secondary structures, enhanced metabolic stability, and improved physicochemical properties. This review provides a comprehensive analysis of the synthetic methodologies developed to access the diverse structural landscape of SAAs, with a focus on modern, stereoselective strategies that yield versatile building blocks for peptide chemistry. A critical examination of the structural impact of SAA incorporation reveals their profound ability to induce and stabilize specific secondary structures, such as β- and γ-turns. Furthermore, a comparative analysis positions SAAs in the context of other widely used peptidomimetic scaffolds, highlighting their unique advantages in combining conformational control with tunable hydrophilicity. We surveyed the application of SAA-containing cyclic peptides as therapeutic agents, with a detailed case study on gramicidin S analogs that underscores the power of SAAs in elucidating complex structure–activity relationships. Finally, this review presents a forward-looking perspective on the challenges and future directions of the field, emphasizing the transformative potential of computational design, artificial intelligence, and advanced bioconjugation techniques to accelerate the development of next-generation SAA-based therapeutics. Full article

(This article belongs to the Special Issue Recent Advances in Bioprocess Engineering and Fermentation Technology)

► Show Figures

Figure 1

18 pages, 2884 KB

Open AccessArticle

Research on Multi-Path Feature Fusion Manchu Recognition Based on Swin Transformer

by Yu Zhou, Mingyan Li, Hang Yu, Jinchi Yu, Mingchen Sun and Dadong Wang

Symmetry 2025, 17(9), 1408; https://doi.org/10.3390/sym17091408 - 29 Aug 2025

Viewed by 438

Abstract

Recognizing Manchu words can be challenging due to their complex character variations, subtle differences between similar characters, and homographic polysemy. Most studies rely on character segmentation techniques for character recognition or use convolutional neural networks (CNNs) to encode word images for word recognition. [...] Read more.

Recognizing Manchu words can be challenging due to their complex character variations, subtle differences between similar characters, and homographic polysemy. Most studies rely on character segmentation techniques for character recognition or use convolutional neural networks (CNNs) to encode word images for word recognition. However, these methods can lead to segmentation errors or a loss of semantic information, which reduces the accuracy of word recognition. To address the limitations in the long-range dependency modeling of CNNs and enhance semantic coherence, we propose a hybrid architecture to fuse the spatial features of original images and spectral features. Specifically, we first leverage the Short-Time Fourier Transform (STFT) to preprocess the raw input images and thereby obtain their multi-view spectral features. Then, we leverage a primary CNN block and a pair of symmetric CNN blocks to construct a symmetric spectral enhancement module, which is used to encode the raw input features and the multi-view spectral features. Subsequently, we design a feature fusion module via Swin Transformer to fuse multi-view spectral embedding and thereby concat it with the raw input embedding. Finally, we leverage a Transformer decoder to obtain the target output. We conducted extensive experiments on Manchu words benchmark datasets to evaluate the effectiveness of our proposed framework. The experimental results demonstrated that our framework performs robustly in word recognition tasks and exhibits excellent generalization capabilities. Additionally, our model outperformed other baseline methods in multiple writing-style font-recognition tasks. Full article

(This article belongs to the Section Computer)

► Show Figures

Figure 1

18 pages, 6467 KB

Open AccessArticle

State-Space Model Meets Linear Attention: A Hybrid Architecture for Internal Wave Segmentation

by Zhijie An, Zhao Li, Saheya Barintag, Hongyu Zhao, Yanqing Yao, Licheng Jiao and Maoguo Gong

Remote Sens. 2025, 17(17), 2969; https://doi.org/10.3390/rs17172969 - 27 Aug 2025

Viewed by 767

Abstract

Internal waves (IWs) play a crucial role in the transport of energy and matter within the ocean while also posing significant risks to marine engineering, navigation, and underwater communication systems. Consequently, effective segmentation methods are essential for mitigating their adverse impacts and minimizing [...] Read more.

Internal waves (IWs) play a crucial role in the transport of energy and matter within the ocean while also posing significant risks to marine engineering, navigation, and underwater communication systems. Consequently, effective segmentation methods are essential for mitigating their adverse impacts and minimizing associated hazards. A promising strategy involves applying remote sensing image segmentation techniques to accurately identify IWs, thereby enabling predictions of their propagation velocity and direction. However, current IWs segmentation models struggle to balance computational efficiency and segmentation accuracy, often resulting in either excessive computational costs or inadequate performance. Motivated by recent developments in the Mamba2 architecture, this paper introduces the state-space model meets linear attention (SMLA), a novel segmentation framework specifically designed for IWs. The proposed hybrid architecture effectively integrates three key components: a feature-aware serialization (FAS) block to efficiently convert spatial features into sequences; a state-space model with linear attention (SSM-LA) block that synergizes a state-space model with linear attention for comprehensive feature extraction; and a decoder driven by hierarchical fusion and upsampling, which performs channel alignment and scale unification across multi-level features to ensure high-fidelity spatial detail recovery. Experiments conducted on a dataset of 484 synthetic-aperture radar (SAR) images containing IWs from the South China Sea achieved a mean Intersection over Union (MIoU) of 74.3%, surpassing competing methods evaluated on the same dataset. These results demonstrate the superior effectiveness of SMLA in extracting features of IWs from SAR imagery. Full article

(This article belongs to the Special Issue Advancements of Vision-Language Models (VLMs) in Remote Sensing)

► Show Figures

Figure 1

18 pages, 5921 KB

Open AccessArticle

Milling Versus Printing: The Effect of Fabrication Technique on the Trueness and Fitness of Fabricated Crowns (A Comparative In Vitro Study)

by Mohammed Hassen Ali and Manhal A. Majeed

Prosthesis 2025, 7(5), 107; https://doi.org/10.3390/prosthesis7050107 - 25 Aug 2025

Viewed by 796

Abstract

Background/Objectives: Computer-aided manufacturing techniques are divided into subtractive (milling) and additive (3D printing) techniques. The accuracy of both techniques is measured only indirectly by testing the fabricated restorations. However, the role of the fabrication technique is masked by the differences in the [...] Read more.

Background/Objectives: Computer-aided manufacturing techniques are divided into subtractive (milling) and additive (3D printing) techniques. The accuracy of both techniques is measured only indirectly by testing the fabricated restorations. However, the role of the fabrication technique is masked by the differences in the materials used. Hence, this study used the same printing resin to print crowns and blocks for milling. Methods: Ten maxillary first premolars were prepared for full crowns and scanned with Primescan Connect IOS, and then crown restorations were designed using Exocad. A CAD/CAM block equal to size C14 was designed in CAD software (Microsoft 3D Builder) (Version 18.0.1931.0). The designed crowns and blocks were printed using three hybrid ceramic materials, namely, Ceramic Crown (SprintRay), Varseosmile Crown plus (Bego), and P-crown (Senertek), using a SprintRay Pro95S 3D-printer. The printed blocks were then used to fabricate the designed crowns using an In-Lab MCXL milling machine. The trueness and marginal and internal gaps of the crowns were then measured using Geomagic Control X metrology software (Version 2022.1). Statistical analysis was performed using the Kruskal–Wallis test, Dunn’s test, one-way ANOVA test, and Tukey’s HSD test. Results: Generally, the milled crowns showed significantly higher trueness but lower fitness than their 3D-printed counterparts (p < 0.05). A significant reverse correlation was found between the trueness and fitness of the fabricated restorations. Conclusions: The fabrication technique significantly influenced the accuracy of the hybrid ceramic crowns. Milling offered superior trueness, whereas 3D printing resulted in better internal and marginal adaptation. Full article

(This article belongs to the Section Prosthodontics)

► Show Figures

Figure 1

19 pages, 4206 KB

Open AccessArticle

A Hybrid UNet with Attention and a Perceptual Loss Function for Monocular Depth Estimation

by Hamidullah Turkmen and Devrim Akgun

Mathematics 2025, 13(16), 2567; https://doi.org/10.3390/math13162567 - 11 Aug 2025

Viewed by 583

Abstract

Monocular depth estimation is a crucial technique in computer vision that determines the depth or distance of objects in a scene using a single 2D image captured by a camera. UNet-based models are a fundamental architecture for monocular depth estimation, due to their [...] Read more.

Monocular depth estimation is a crucial technique in computer vision that determines the depth or distance of objects in a scene using a single 2D image captured by a camera. UNet-based models are a fundamental architecture for monocular depth estimation, due to their effective encoder–decoder structure. This study presents an effective depth estimation model based on a hybrid UNet architecture that incorporates ensemble features. The new model integrates Transformer-based attention blocks to capture global context and an encoder built on ResNet18 to extract spatial features. Additionally, a novel Boundary-Aware Depth Consistency Loss (BADCL) function has been introduced to enhance accuracy. This function features dynamic scaling, smoothness regularization, and boundary-aware weighting, which provides sharper edges, smoother depth transitions, and scale-consistent predictions. The proposed model has been evaluated on the NYU Depth V2 dataset, achieving a Structural Similarity Index Measure (SSIM) of 99.8%. The performance of the proposed model indicates increased depth accuracy compared to state-of-the-art methods. Full article

(This article belongs to the Special Issue Artificial Intelligence and Algorithms with Their Applications)

► Show Figures

Figure 1

30 pages, 20069 KB

Open AccessArticle

Evaluation of CoFe₂O₄-L-Au (L: Citrate, Glycine) as Superparamagnetic–Plasmonic Nanocomposites for Enhanced Cytotoxic Activity Towards Oncogenic (A549) Cells

by Alberto Lozano-López, Mario E. Cano-González, J. Ventura-Juárez, Martín H. Muñoz-Ortega, Israel Betancourt, Juan Antonio Zapien and Iliana E. Medina-Ramirez

Int. J. Mol. Sci. 2025, 26(16), 7732; https://doi.org/10.3390/ijms26167732 - 10 Aug 2025

Viewed by 503

Abstract

We investigated the influence of gold deposition on the magnetic behavior, biocompatibility, and bioactivity of CoFe₂O₄ (MCF) nanomaterials (NMs) functionalized with sodium citrate (Cit) or glycine (Gly). The resulting multifunctional plasmonic nanostructured materials (MCF-Au-L, where L is Cit, Gly) exhibit [...] Read more.

We investigated the influence of gold deposition on the magnetic behavior, biocompatibility, and bioactivity of CoFe₂O₄ (MCF) nanomaterials (NMs) functionalized with sodium citrate (Cit) or glycine (Gly). The resulting multifunctional plasmonic nanostructured materials (MCF-Au-L, where L is Cit, Gly) exhibit superparamagnetic behavior with magnetic saturation of 59 emu/g, 55 emu/g, and 60 emu/g, and blocking temperatures of 259 K, 311 K, and 322 K for pristine MCF, MCF-Au-Gly, and MCF-Au-Cit, respectively. The MCF NMs exhibit a small uniform size (with a mean size of 7.1 nm) and an atomic ratio of Fe:Co (2:1). The gold nanoparticles (AuNPs) show high heterogeneity as determined by high-resolution transmission electron microscopy (HR-TEM) and energy-dispersive X-ray spectroscopy (EDX). The UV-Vis spectroscopy of the composites reveals two localized surface plasmons (LSPs) at 530 nm and 705 nm, while Fourier Transformed-Infrared spectroscopy (FTIR) and thermogravimetric analysis (TGA) confirm the presence of Cit and Gly on their surface. Subsequent biocompatibility tests confirm that MCF-Au-L NMs do not exert hemolytic activity (hemolysis < 5%). In addition, the CCK-8 viability assay tests indicate the higher sensitivity of cancerous cells (A549) to the photoactivity of MCF-Au compared to healthy Detroit 548 (D548) cell lines. We use advanced microscopy techniques, namely atomic force, fluorescence, and holotomography microscopies (AFM, FM, and HTM, respectively) to provide further insights into the nature of the observed photoactivity of MCF-Au-L NMs. In addition, in situ radiation, using a modified HTM microscope with an IR laser accessory, demonstrates the photoactivity of the MCF-Au NMs and their suitability for destroying cancerous cells through photodynamic therapy. The combined imaging capabilities demonstrate clear morphological changes, NMs internalization, and oxidative damage. Our results confirm that the fabricated multifunctional NMs exhibit high stability in aqueous solution, chemical solidity, superparamagnetic behavior, and effective IR responses, making them promising precursors for hybrid cancer therapy. Full article

(This article belongs to the Special Issue Toxicity of Nanoparticles: Second Edition)

► Show Figures

Graphical abstract

26 pages, 1790 KB

Open AccessArticle

A Hybrid Deep Learning Model for Aromatic and Medicinal Plant Species Classification Using a Curated Leaf Image Dataset

by Shareena E. M., D. Abraham Chandy, Shemi P. M. and Alwin Poulose

AgriEngineering 2025, 7(8), 243; https://doi.org/10.3390/agriengineering7080243 - 1 Aug 2025

Viewed by 1124

Abstract

In the era of smart agriculture, accurate identification of plant species is critical for effective crop management, biodiversity monitoring, and the sustainable use of medicinal resources. However, existing deep learning approaches often underperform when applied to fine-grained plant classification tasks due to the [...] Read more.

In the era of smart agriculture, accurate identification of plant species is critical for effective crop management, biodiversity monitoring, and the sustainable use of medicinal resources. However, existing deep learning approaches often underperform when applied to fine-grained plant classification tasks due to the lack of domain-specific, high-quality datasets and the limited representational capacity of traditional architectures. This study addresses these challenges by introducing a novel, well-curated leaf image dataset consisting of 39 classes of medicinal and aromatic plants collected from the Aromatic and Medicinal Plant Research Station in Odakkali, Kerala, India. To overcome performance bottlenecks observed with a baseline Convolutional Neural Network (CNN) that achieved only 44.94% accuracy, we progressively enhanced model performance through a series of architectural innovations. These included the use of a pre-trained VGG16 network, data augmentation techniques, and fine-tuning of deeper convolutional layers, followed by the integration of Squeeze-and-Excitation (SE) attention blocks. Ultimately, we propose a hybrid deep learning architecture that combines VGG16 with Batch Normalization, Gated Recurrent Units (GRUs), Transformer modules, and Dilated Convolutions. This final model achieved a peak validation accuracy of 95.24%, significantly outperforming several baseline models, such as custom CNN (44.94%), VGG-19 (59.49%), VGG-16 before augmentation (71.52%), Xception (85.44%), Inception v3 (87.97%), VGG-16 after data augumentation (89.24%), VGG-16 after fine-tuning (90.51%), MobileNetV2 (93.67), and VGG16 with SE block (94.94%). These results demonstrate superior capability in capturing both local textures and global morphological features. The proposed solution not only advances the state of the art in plant classification but also contributes a valuable dataset to the research community. Its real-world applicability spans field-based plant identification, biodiversity conservation, and precision agriculture, offering a scalable tool for automated plant recognition in complex ecological and agricultural environments. Full article

(This article belongs to the Special Issue Implementation of Artificial Intelligence in Agriculture)

► Show Figures

Figure 1

20 pages, 18416 KB

Open AccessArticle

Swin-FSNet: A Frequency-Aware and Spatially Enhanced Network for Unpaved Road Extraction from UAV Remote Sensing Imagery

by Jiwu Guan, Qingzhan Zhao, Wenzhong Tian, Xinxin Yao, Jingyang Li and Wei Li

Remote Sens. 2025, 17(14), 2520; https://doi.org/10.3390/rs17142520 - 20 Jul 2025

Viewed by 665

Abstract

The efficient recognition of unpaved roads from remote sensing (RS) images holds significant value for tasks such as emergency response and route planning in outdoor environments. However, unpaved roads often face challenges such as blurred boundaries, low contrast, complex shapes, and a lack [...] Read more.

The efficient recognition of unpaved roads from remote sensing (RS) images holds significant value for tasks such as emergency response and route planning in outdoor environments. However, unpaved roads often face challenges such as blurred boundaries, low contrast, complex shapes, and a lack of publicly available datasets. To address these issues, this paper proposes a novel architecture, Swin-FSNet, which combines frequency analysis and spatial enhancement techniques to optimize feature extraction. The architecture consists of two core modules: the Wavelet-Based Feature Decomposer (WBFD) module and the Hybrid Dynamic Snake Block (HyDS-B) module. The WBFD module enhances boundary detection by capturing directional gradient changes at the road edges and extracting high-frequency features, effectively addressing boundary blurring and low contrast. The HyDS-B module, by adaptively adjusting the receptive field, performs spatial modeling for complex-shaped roads, significantly improving adaptability to narrow road curvatures. In this study, the southern mountainous area of Shihezi, Xinjiang, was selected as the study area, and the unpaved road dataset was constructed using high-resolution UAV images. Experimental results on the SHZ unpaved road dataset and the widely used DeepGlobe dataset show that Swin-FSNet performs well in segmentation accuracy and road structure preservation, with an IoU_road of 81.76% and 71.97%, respectively. The experiments validate the excellent performance and robustness of Swin-FSNet in extracting unpaved roads from high-resolution RS images. Full article

(This article belongs to the Special Issue Road Extraction and Distress Assessment by Spaceborne, Airborne and Terrestrial Platforms (Second Edition))

► Show Figures

Figure 1

20 pages, 2926 KB

Open AccessArticle

SonarNet: Global Feature-Based Hybrid Attention Network for Side-Scan Sonar Image Segmentation

by Juan Lei, Huigang Wang, Liming Fan, Qingyue Gu, Shaowei Rong and Huaxia Zhang

Remote Sens. 2025, 17(14), 2450; https://doi.org/10.3390/rs17142450 - 15 Jul 2025

Viewed by 487

Abstract

With the rapid advancement of deep learning techniques, side-scan sonar image segmentation has become a crucial task in underwater scene understanding. However, the complex and variable underwater environment poses significant challenges for salient object detection, with traditional deep learning approaches often suffering from [...] Read more.

With the rapid advancement of deep learning techniques, side-scan sonar image segmentation has become a crucial task in underwater scene understanding. However, the complex and variable underwater environment poses significant challenges for salient object detection, with traditional deep learning approaches often suffering from inadequate feature representation and the loss of global context during downsampling, thus compromising the segmentation accuracy of fine structures. To address these issues, we propose SonarNet, a Global Feature-Based Hybrid Attention Network specifically designed for side-scan sonar image segmentation. SonarNet features a dual-encoder architecture that leverages residual blocks and a self-attention mechanism to simultaneously capture both global structural and local contextual information. In addition, an adaptive hybrid attention module is introduced to effectively integrate channel and spatial features, while a global enhancement block fuses multi-scale global and spatial representations from the dual encoders, mitigating information loss throughout the network. Comprehensive experiments on a dedicated underwater sonar dataset demonstrate that SonarNet outperforms ten state-of-the-art saliency detection methods, achieving a mean absolute error as low as 2.35%. These results highlight the superior performance of SonarNet in challenging sonar image segmentation tasks. Full article

(This article belongs to the Special Issue Ocean Remote Sensing Based on Radar, Sonar and Optical Techniques (Second Edition))

► Show Figures

Graphical abstract

28 pages, 8538 KB

Open AccessArticle

Deep-Learning Integration of CNN–Transformer and U-Net for Bi-Temporal SAR Flash-Flood Detection

by Abbas Mohammed Noori, Abdul Razzak T. Ziboon and Amjed N. AL-Hameedawi

Appl. Sci. 2025, 15(14), 7770; https://doi.org/10.3390/app15147770 - 10 Jul 2025

Viewed by 3020

Abstract

Flash floods are natural disasters that have significant impacts on human life and economic damage. The detection of flash floods using remote-sensing techniques provides essential data for subsequent flood-risk assessment through the preparation of flood inventory samples. In this research, a new deep-learning [...] Read more.

Flash floods are natural disasters that have significant impacts on human life and economic damage. The detection of flash floods using remote-sensing techniques provides essential data for subsequent flood-risk assessment through the preparation of flood inventory samples. In this research, a new deep-learning approach for bi-temporal flash-flood detection in Synthetic Aperture Radar (SAR) is proposed. It combines a U-Net convolutional network with a Transformer model using a compact Convolutional Tokenizer (CCT) to improve the efficiency of long-range dependency learning. The hybrid model, namely CCT-U-ViT, naturally combines the spatial feature extraction of U-Net and the global context capability of Transformer. The model significantly reduces the number of basic blocks as it uses the CCT tokenizer instead of conventional Vision Transformer tokenization, which makes it the right fit for small flood detection datasets. This model improves flood boundary delineation by involving local spatial patterns and global contextual relations. However, the method is based on Sentinel-1 SAR images and focuses on Erbil, Iraq, which experienced an extreme flash flood in December 2021. The experimental comparison results show that the proposed CCT-U-ViT outperforms multiple baseline models, such as conventional CNNs, U-Net, and Vision Transformer, obtaining an impressive overall accuracy of 91.24%. Furthermore, the model obtains better precision and recall with an F1-score of 91.21% and mIoU of 83.83%. Qualitative results demonstrate that CCT-U-ViT can effectively preserve the flood boundaries with higher precision and less salt-and-pepper noise compared with the state-of-the-art approaches. This study underscores the significance of hybrid deep-learning models in enhancing the precision of flood detection with SAR data, providing valuable insights for the advancement of real-time flood monitoring and risk management systems. Full article

► Show Figures

Figure 1

20 pages, 90560 KB

Open AccessArticle

A Hybrid MIL Approach Leveraging Convolution and State-Space Model for Whole-Slide Image Cancer Subtyping

by Dehui Bi and Yuqi Zhang

Mathematics 2025, 13(13), 2178; https://doi.org/10.3390/math13132178 - 3 Jul 2025

Viewed by 550

Abstract

Precise identification of cancer subtypes from whole slide images (WSIs) is pivotal in tailoring patient-specific therapies. Under the weakly supervised multiple instance learning (MIL) paradigm, existing techniques frequently fall short in simultaneously capturing local tissue textures and long-range contextual relationships. To address these [...] Read more.

Precise identification of cancer subtypes from whole slide images (WSIs) is pivotal in tailoring patient-specific therapies. Under the weakly supervised multiple instance learning (MIL) paradigm, existing techniques frequently fall short in simultaneously capturing local tissue textures and long-range contextual relationships. To address these challenges, we introduce ConvMixerSSM, a hybrid model that integrates a ConvMixer block for local spatial representation, a state space model (SSM) block for capturing long-range dependencies, and a feature-gated block to enhance informative feature selection. The model was evaluated on the TCGA-NSCLC dataset and the CAMELYON16 dataset for cancer subtyping tasks. Extensive experiments, including comparisons with state-of-the-art MIL methods and ablation studies, were conducted to assess the contribution of each component. ConvMixerSSM achieved an AUC of 97.83%, an ACC of 91.82%, and an F1 score of 91.18%, outperforming existing MIL baselines on the TCGA-NSCLC dataset. The ablation study revealed that each block contributed positively to performance, with the full model showing the most balanced and superior results. Moreover, our visualization results further confirm that ConvMixerSSM can effectively identify tumor regions within WSIs, providing model interpretability and clinical relevance. These findings suggest that ConvMixerSSM has strong potential for advancing computational pathology applications in clinical decision-making. Full article

(This article belongs to the Special Issue Computational Perspectives on Artificial Intelligence Drive in Medical Decision-Making)

► Show Figures

Figure 1

24 pages, 1991 KB

Open AccessArticle

Robust Deep Neural Network for Classification of Diseases from Paddy Fields

by Karthick Mookkandi and Malaya Kumar Nath

AgriEngineering 2025, 7(7), 205; https://doi.org/10.3390/agriengineering7070205 - 1 Jul 2025

Cited by 1 | Viewed by 708

Abstract

Agriculture in India supports millions of livelihoods and is a major force behind economic expansion. Challenges in modern agriculture depend on environmental factors (such as soil quality and climate variability) and biotic factors (such as pests and diseases). These challenges can be addressed [...] Read more.

Agriculture in India supports millions of livelihoods and is a major force behind economic expansion. Challenges in modern agriculture depend on environmental factors (such as soil quality and climate variability) and biotic factors (such as pests and diseases). These challenges can be addressed by advancements in technology (such as sensors, internet of things, communication, etc.) and data-driven approaches (such as machine learning (ML) and deep learning (DL)), which can help with crop yield and sustainability in agriculture. This study introduces an innovative deep neural network (DNN) approach for identifying leaf diseases in paddy crops at an early stage. The proposed neural network is a hybrid DL model comprising feature extraction, channel attention, inception with residual, and classification blocks. Channel attention and inception with residual help extract comprehensive information about the crops and potential diseases. The classification module uses softmax to obtain the score for different classes. The importance of each block is analyzed via an ablation study. To understand the feature extraction ability of the modules, extracted features at different stages are fed to the SVM classifier to obtain the classification accuracy. This technique was experimented on eight classes with 7857 paddy crop images, which were obtained from local paddy fields and freely available open sources. The classification performance of the proposed technique is evaluated according to accuracy, sensitivity, specificity, F1 score, MCC, area under curve (AUC), and receiver operating characteristic (ROC). The model was fine-tuned by setting the hyperparameters (such as batch size, learning rate, optimizer, epoch, and train and test ratio). Training, validation, and testing accuracies of 99.91%, 99.87%, and 99.49%, respectively, were obtained for 20 epochs with a learning rate of 0.001 and sgdm optimizer. The proposed network robustness was studied via an ablation study and with noisy data. The model’s classification performance was evaluated for other agricultural data (such as mango, maize, and wheat diseases). These research outcomes can empower farmers with smarter agricultural practices and contribute to economic growth. Full article

(This article belongs to the Section Computer Applications and Artificial Intelligence in Agriculture)

► Show Figures

Figure 1

Search Results (163)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (163)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI