Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (1,165)

Search Parameters:
Keywords = gradient-boosting decision tree

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
26 pages, 2975 KB  
Article
CTGAN-Augmented Ensemble Learning Models for Classifying Dementia and Heart Failure
by Pornthep Phanbua, Sujitra Arwatchananukul, Georgi Hristov and Punnarumol Temdee
Inventions 2025, 10(6), 101; https://doi.org/10.3390/inventions10060101 - 6 Nov 2025
Abstract
Research shows that individuals with heart failure are 60% more likely to develop dementia because of their shared metabolic risk factors. Developing a classification model to differentiate between these two conditions effectively is crucial for improving diagnostic accuracy, guiding clinical decision-making, and supporting [...] Read more.
Research shows that individuals with heart failure are 60% more likely to develop dementia because of their shared metabolic risk factors. Developing a classification model to differentiate between these two conditions effectively is crucial for improving diagnostic accuracy, guiding clinical decision-making, and supporting timely interventions in older adults. This study proposes a novel method for dementia classification, distinguishing it from its common comorbidity, heart failure, using blood testing and personal data. A dataset comprising 11,124 imbalanced electronic health records of older adults from hospitals in Chiang Rai, Thailand, was utilized. Conditional tabular generative adversarial networks (CTGANs) were employed to generate synthetic data while preserving key statistical relationships, diversity, and distributions of the original dataset. Two groups of ensemble models were analyzed: the boosting group—extreme gradient boosting, light gradient boosting machine—and the bagging group—random forest and extra trees. Performance metrics, including accuracy, precision, recall, F1-score, and area under the receiver-operating characteristic curve were evaluated. Compared with the synthetic minority oversampling technique, CTGAN-based synthetic data generation significantly enhanced the performance of ensemble learning models in classifying dementia and heart failure. Full article
(This article belongs to the Special Issue Machine Learning Applications in Healthcare and Disease Prediction)
Show Figures

Figure 1

20 pages, 1224 KB  
Article
Explainable AI for Coronary Artery Disease Stratification Using Routine Clinical Data
by Nurdaulet Tasmurzayev, Baglan Imanbek, Assiya Boltaboyeva, Gulmira Dikhanbayeva, Sarsenbek Zhussupbekov, Qarlygash Saparbayeva and Gulshat Amirkhanova
Algorithms 2025, 18(11), 693; https://doi.org/10.3390/a18110693 - 3 Nov 2025
Viewed by 198
Abstract
Background: Coronary artery disease (CAD) remains a leading cause of morbidity and mortality. Early diagnosis reduces adverse outcomes and alleviates the burden on healthcare, yet conventional approaches are often invasive, costly, and not always available. In this context, machine learning offers promising solutions. [...] Read more.
Background: Coronary artery disease (CAD) remains a leading cause of morbidity and mortality. Early diagnosis reduces adverse outcomes and alleviates the burden on healthcare, yet conventional approaches are often invasive, costly, and not always available. In this context, machine learning offers promising solutions. Objective: The objective of this study is to evaluate the feasibility of reliably predicting both the presence and the severity of CAD. The analysis is based on a harmonized, multi-center UCI dataset that includes cohorts from Cleveland, Hungary, Switzerland, and Long Beach. The work aims to assess the accuracy and practical utility of models built exclusively on routine tabular clinical and demographic data, without relying on imaging. These models are designed to improve risk stratification and guide patient routing. Methods and Results: The study is based on a uniform and standardized data processing pipeline. This pipeline includes handling missing values, feature encoding, scaling, an 80/20 train–test split and applying the SMOTE method exclusively to the training set to prevent information leakage. Within this pipeline, a standardized comparison of a wide range of models (including gradient boosting, tree-based ensembles, support vector methods, etc.) was conducted with hyperparameter tuning via GridSearchCV. The best results were demonstrated by the CatBoost model: accuracy—0.8278, recall—0.8407, and F1-score—0.8436. Conclusions: A key distinction of this work is the comprehensive evaluation of the models’ practical suitability. Beyond standard metrics, the analysis of calibration curves confirmed the reliability of the probabilistic predictions. Patient-level interpretability using SHAP showed that the model relies on clinically significant predictors, including ST-segment depression. Calibrated and explainable models based on readily available data are positioned as a practical tool for scalable risk stratification and decision support, especially in resource-constrained settings. Full article
Show Figures

Figure 1

24 pages, 2473 KB  
Article
Estimating Indirect Accident Cost Using a Two-Tiered Machine Learning Algorithm for the Construction Industry
by Ayesha Munira Chowdhury, Jurng-Jae Yee, Sang I. Park, Eun-Ju Ha and Jae-Ho Choi
Buildings 2025, 15(21), 3947; https://doi.org/10.3390/buildings15213947 - 1 Nov 2025
Viewed by 435
Abstract
Accurately estimating total accident costs is essential for managing construction safety budgets. While direct costs are well-documented, indirect costs—such as productivity loss, material damage, and legal expenses—are difficult to predict and often overlooked. Traditional ratio-based methods lack accuracy due to variability across projects [...] Read more.
Accurately estimating total accident costs is essential for managing construction safety budgets. While direct costs are well-documented, indirect costs—such as productivity loss, material damage, and legal expenses—are difficult to predict and often overlooked. Traditional ratio-based methods lack accuracy due to variability across projects and accident types. This study introduces a two-tiered machine learning framework for real-time indirect cost estimation. In the first tier, classification models (decision tree, random forest, k-nearest neighbor, and XGBoost) predict total cost categories; in the second, regression models (decision tree, random forest, gradient boosting, and light-gradient boosting machine) estimate indirect costs. Using a dataset of 1036 construction accidents collected over two years, the model achieved accuracies above 87% in classification and an R2 of 0.95 with a training MSE of 0.21 in regression. Compared to conventional statistical and single-step models, it demonstrated superior predictive performance, reducing average deviations to $362.63 and sometimes achieving zero deviation. This framework enables more precise, real-time estimation of hidden costs, promoting better safety investment, reduced financial risk, and adaptive learning through retraining. When integrated with a national accident cost database, it supports ongoing improvement and informed risk management for construction stakeholders. Full article
Show Figures

Figure 1

33 pages, 5642 KB  
Article
Feature-Optimized Machine Learning Approaches for Enhanced DDoS Attack Detection and Mitigation
by Ahmed Jamal Ibrahim, Sándor R. Répás and Nurullah Bektaş
Computers 2025, 14(11), 472; https://doi.org/10.3390/computers14110472 - 1 Nov 2025
Viewed by 210
Abstract
Distributed denial of service (DDoS) attacks pose a serious risk to the operational stability of a network for companies, often leading to service disruptions and financial damage and a loss of trust and credibility. The increasing sophistication and scale of these threats highlight [...] Read more.
Distributed denial of service (DDoS) attacks pose a serious risk to the operational stability of a network for companies, often leading to service disruptions and financial damage and a loss of trust and credibility. The increasing sophistication and scale of these threats highlight the pressing need for advanced mitigation strategies. Despite the numerous existing studies on DDoS detection, many rely on large, redundant feature sets and lack validation for real-time applicability, leading to high computational complexity and limited generalization across diverse network conditions. This study addresses this gap by proposing a feature-optimized and computationally efficient ML framework for DDoS detection and mitigation using benchmark dataset. The proposed approach serves as a foundational step toward developing a low complexity model suitable for future real-time and hardware-based implementation. The dataset was systematically preprocessed to identify critical parameters, such as packet length Min, Total Backward Packets, Avg Fwd Segment Size, and others. Several ML algorithms, involving Logistic Regression, Decision Tree, Random Forest, Gradient Boosting, and Cat-Boost, are applied to develop models for detecting and mitigating abnormal network traffic. The developed ML model demonstrates high performance, achieving 99.78% accuracy with Decision Tree and 99.85% with Random Forest, representing improvements of 1.53% and 0.74% compared to previous work, respectively. In addition, the Decision Tree algorithm achieved 99.85% accuracy for mitigation. with an inference time as low as 0.004 s, proving its suitability for identifying DDoS attacks in real time. Overall, this research presents an effective approach for DDoS detection, emphasizing the integration of ML models into existing security systems to enhance real-time threat mitigation. Full article
Show Figures

Figure 1

24 pages, 5914 KB  
Article
Identification of Cotton Leaf Mite Damage Stages Using UAV Multispectral Images and a Stacked Ensemble Method
by Shifeng Fan, Qiang He, Yongqin Chen, Xin Xu, Wei Guo, Yanhui Lu, Jie Liu and Hongbo Qiao
Agriculture 2025, 15(21), 2277; https://doi.org/10.3390/agriculture15212277 - 31 Oct 2025
Viewed by 149
Abstract
Cotton leaf mites are pests that cause irreparable damage to cotton and pose a severe threat to the cotton yield, and the application of unmanned aerial vehicles (UAVs) to monitor the incidence of cotton leaf mites across a vast region is important for [...] Read more.
Cotton leaf mites are pests that cause irreparable damage to cotton and pose a severe threat to the cotton yield, and the application of unmanned aerial vehicles (UAVs) to monitor the incidence of cotton leaf mites across a vast region is important for cotton leaf mite prevention. In this work, 52 vegetation indices were calculated based on the original five bands of spliced UAV multispectral images, and six featured indices were screened using Shapley value theory. To classify and identify cotton leaf mite infestation classes, seven machine learning classification models were used: random forest (RF), support vector machine (SVM), extreme gradient boosting (XGB), light gradient boosting machine (LGBM), K-Nearest Neighbors (KNN), decision tree (DT), and gradient boosting decision tree (GBDT) models. The base model and metamodel used in stacked models were built based on a combination of four models, namely, the XGB, GBDT, KNN, and DT models, which were selected in accordance with the heterogeneity principle. The experimental results showed that the stacked classification models based on the XGB, KNN base model, and DT metamodel were the best performers, outperforming other integrated and single individual models, with an overall accuracy of 85.7% (precision: 93.3%, recall: 72.6%, and F1-score: 78.2% in the macro_avg case; precision: 88.6%, recall: 85.7%, and F1 score: 84.7% in the weighted_avg case). This approach provides support for using UAVs to monitor the cotton leaf mite prevalence over vast regions. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)
Show Figures

Figure 1

12 pages, 858 KB  
Article
Machine Learning Approaches for Predicting Intraoperative Blood Transfusion in Partial Hip Arthroplasty
by Mürsel Kahveci
J. Clin. Med. 2025, 14(21), 7657; https://doi.org/10.3390/jcm14217657 - 28 Oct 2025
Viewed by 143
Abstract
Objective: Partial hip arthroplasty (PHA) procedures are often associated with significant blood loss, particularly in elderly patients with comorbidities. Predicting the need for intraoperative transfusion in advance is crucial for patient safety and surgical planning. Machine learning (ML) algorithms offer data-driven solutions to [...] Read more.
Objective: Partial hip arthroplasty (PHA) procedures are often associated with significant blood loss, particularly in elderly patients with comorbidities. Predicting the need for intraoperative transfusion in advance is crucial for patient safety and surgical planning. Machine learning (ML) algorithms offer data-driven solutions to support clinical decision-making in such scenarios. Methods: This retrospective, single-center cohort study evaluated data from 202 patients who underwent PHA between December 2023 and July 2025. Demographic data, as well as preoperative and intraoperative variables, were collected. Six ML algorithms—Logistic Regression, Decision Tree, Support Vector Machines (SVM), Artificial Neural Network (ANN), Random Forest, and Gradient Boosting—were trained and tested to predict intraoperative blood transfusion. Model performance was assessed using accuracy, F1-score, and area under the ROC curve (AUC). SHAP (SHapley Additive exPlanations) analysis was used to evaluate model interpretability. Results: Among the 202 patients, 85 (42.1%) received intraoperative blood transfusions. Significant predictors included low preoperative hemoglobin, high ASA score, prolonged operative time, increased intraoperative blood loss, and elevated INR (all p < 0.05). The Random Forest and Decision Tree models achieved the highest accuracy (95.1%) and F1-score (0.960), while the SVM model yielded the highest AUC (0.992). SHAP analysis identified hemoglobin, age, ASA score, INR, and operative time as the most influential features in model decision-making. Conclusions: Machine learning models—particularly Random Forest, Decision Tree, and SVM—demonstrated high performance in predicting intraoperative transfusion needs during PHA. The incorporation of explainable AI techniques such as SHAP enhanced the clinical interpretability of model outputs, supporting personalized patient management. These findings provide a strong foundation for integrating such models into clinical decision support systems, though external validation through multicenter and prospective studies is warranted. Full article
(This article belongs to the Section Orthopedics)
Show Figures

Figure 1

22 pages, 2297 KB  
Article
Machine Learning-Driven E-Nose-Based Diabetes Detection: Sensor Selection and Feature Reduction Study
by Yavuz Selim Taspinar
Sensors 2025, 25(21), 6607; https://doi.org/10.3390/s25216607 - 27 Oct 2025
Viewed by 497
Abstract
Diabetes is a major global health problem, with a rapidly increasing prevalence and long-term health complications in both developed and developing countries. If not diagnosed early, it can lead to cardiovascular diseases, kidney failure, vision loss, and nervous system disorders. This study aimed [...] Read more.
Diabetes is a major global health problem, with a rapidly increasing prevalence and long-term health complications in both developed and developing countries. If not diagnosed early, it can lead to cardiovascular diseases, kidney failure, vision loss, and nervous system disorders. This study aimed to classify individuals with diabetes or healthy individuals using e-nose sensor data obtained from breath samples taken from 1000 individuals. Six sensor features and one class feature were used in the analysis. Machine learning methods included Artificial Neural Networks (ANN), Decision Trees (DT), Gradient Boosting (GB), Naive Bayes (NB), and AdaBoost (AB). ANOVA and Information Gain analyses were conducted to determine the effectiveness of the sensor data, and the TGS2610 and TGS2611 sensors were found to be critical for classification. Principal Component Analysis (PCA) reduced data size and saved processing time. Experimental results showed that the ANN model provided the most successful classification, with 100% accuracy. AB and GB achieved 99.8% accuracy, while NB achieved 97.6% accuracy. Dimensionality reduction using PCA optimized training and testing times without loss of accuracy. The study presents a data-driven approach to e-nose-based diabetes detection, demonstrates the comparative performance of the models, and highlights the importance of sensor selection and data size optimization. Full article
(This article belongs to the Section Electronic Sensors)
Show Figures

Figure 1

20 pages, 3577 KB  
Article
Hyperspectral Remote Sensing and Artificial Intelligence for High-Resolution Soil Moisture Prediction
by Ki-Sung Kim, Junwon Lee, Jeongjun Park, Gigwon Hong and Kicheol Lee
Water 2025, 17(21), 3069; https://doi.org/10.3390/w17213069 - 27 Oct 2025
Viewed by 374
Abstract
Reliable field estimation of soil moisture supports hydrology and water resources management. This study develops a drone-based hyperspectral approach in which visible and near-infrared reflectance is paired one-to-one with gravimetric water content measured by oven drying, yielding 1000 matched samples. After standardization, outlier [...] Read more.
Reliable field estimation of soil moisture supports hydrology and water resources management. This study develops a drone-based hyperspectral approach in which visible and near-infrared reflectance is paired one-to-one with gravimetric water content measured by oven drying, yielding 1000 matched samples. After standardization, outlier control, ranked wavelength selection, and light feature engineering, several predictors were evaluated. Conventional machine learning methods, including simple and multiple regression and tree-based ensembles, were limited by band collinearity and piecewise approximations and therefore failed to meet the accuracy target. Gradient boosting reached the target but used different trade-offs in variable sensitivity. An artificial neural network with three hidden layers, rectified linear unit activations, and dropout was trained using a feature count sweep and early stopping. With ten predictors, the model achieved a coefficient of determination of 0.9557, demonstrating accurate mapping from hyperspectral reflectance to gravimetric water content and providing a reproducible framework suitable for larger, multi date acquisitions and operational decision support. Full article
Show Figures

Figure 1

23 pages, 1943 KB  
Article
Modeling of New Agents with Potential Antidiabetic Activity Based on Machine Learning Algorithms
by Yevhen Pruhlo, Ivan Iurchenko and Alina Tomenko
AppliedChem 2025, 5(4), 30; https://doi.org/10.3390/appliedchem5040030 - 27 Oct 2025
Viewed by 246
Abstract
Type 2 diabetes mellitus (T2DM) is a growing global health challenge, expected to affect over 600 million people by 2045. The discovery of new antidiabetic agents remains resource-intensive, motivating the use of machine learning (ML) for virtual screening based on molecular structure. In [...] Read more.
Type 2 diabetes mellitus (T2DM) is a growing global health challenge, expected to affect over 600 million people by 2045. The discovery of new antidiabetic agents remains resource-intensive, motivating the use of machine learning (ML) for virtual screening based on molecular structure. In this study, we developed a predictive pipeline integrating two distinct descriptor types: high-dimensional numerical features from the Mordred library (>1800 2D/3D descriptors) and categorical ontological annotations from the ClassyFire and ChEBI systems. These encode hierarchical chemical classifications and functional group labels. The dataset included 45 active compounds and thousands of inactive molecules, depending on the descriptor system. To address class imbalance, we applied SMOTE and created balanced training and test sets while preserving independent validation sets. Thirteen ML models—including regression, SVM, naive Bayes, decision trees, ensemble methods, and others—were trained using stratified 12-fold cross-validation and evaluated across training, test, and validation. Ridge Regression showed the best generalization (MCC = 0.814), with Gradient Boosting following (MCC = 0.570). Feature importance analysis highlighted the complementary nature of the descriptors: Ridge Regression emphasized ClassyFire taxonomies such as CHEMONTID:0000229 and CHEBI:35622, while Mordred-based models (e.g., Random Forest) prioritized structural and electronic features like MAXsssCH and ETA_dEpsilon_D. This study is the first to systematically integrate and compare structural and ontological descriptors for antidiabetic compound prediction. The framework offers a scalable and interpretable approach to virtual screening and can be extended to other therapeutic domains to accelerate early-stage drug discovery. Full article
Show Figures

Figure 1

20 pages, 1297 KB  
Article
Predicting Corrosion Behaviour of Magnesium Alloy Using Machine Learning Approaches
by Tülay Yıldırım and Hüseyin Zengin
Metals 2025, 15(11), 1183; https://doi.org/10.3390/met15111183 - 24 Oct 2025
Viewed by 303
Abstract
The primary objective of this study is to develop a machine learning-based predictive model using corrosion rate data for magnesium alloys compiled from the literature. Corrosion rates measured under different deformation rates and heat treatment parameters were analyzed using artificial intelligence algorithms. Variables [...] Read more.
The primary objective of this study is to develop a machine learning-based predictive model using corrosion rate data for magnesium alloys compiled from the literature. Corrosion rates measured under different deformation rates and heat treatment parameters were analyzed using artificial intelligence algorithms. Variables such as chemical composition, heat treatment temperature and time, deformation state, pH, test method, and test duration were used as inputs in the dataset. Various regression algorithms were compared with the PyCaret AutoML library, and the models with the highest accuracy scores were analyzed with Gradient Extra Trees and AdaBoost regression methods. The findings of this study demonstrate that modelling corrosion behaviour by integrating chemical composition with experimental conditions and processing parameters substantially enhances predictive accuracy. The regression models, developed using the PyCaret library, achieved high accuracy scores, producing corrosion rate predictions that are remarkably consistent with experimental values reported in the literature. Detailed tables and figures confirm that the most influential factors governing corrosion were successfully identified, providing valuable insights into the underlying mechanisms. These results highlight the potential of AI-assisted decision systems as powerful tools for material selection and experimental design, and, when supported by larger databases, for predicting the corrosion life of magnesium alloys and guiding the development of new alloys. Full article
(This article belongs to the Section Computation and Simulation on Metals)
Show Figures

Figure 1

20 pages, 9075 KB  
Article
CatBoost Improves Inversion Accuracy of Plant Water Status in Winter Wheat Using Ratio Vegetation Index
by Bingyan Dong, Shouchen Ma, Zhenhao Gao and Anzhen Qin
Appl. Sci. 2025, 15(21), 11363; https://doi.org/10.3390/app152111363 - 23 Oct 2025
Viewed by 312
Abstract
The accurate monitoring of crop water status is critical for optimizing irrigation strategies in winter wheat. Compared with satellite remote sensing, unmanned aerial vehicle (UAV) technology offers superior spatial resolution, temporal flexibility, and controllable data acquisition, making it an ideal choice for the [...] Read more.
The accurate monitoring of crop water status is critical for optimizing irrigation strategies in winter wheat. Compared with satellite remote sensing, unmanned aerial vehicle (UAV) technology offers superior spatial resolution, temporal flexibility, and controllable data acquisition, making it an ideal choice for the small-scale monitoring of crop water status. During 2023–2025, field experiments were conducted to predict crop water status using UAV images in the North China Plain (NCP). Thirteen vegetation indices were calculated and their correlations with observed crop water content (CWC) and equivalent water thickness (EWT) were analyzed. Four machine learning (ML) models, namely, random forest (RF), decision tree (DT), LightGBM, and CatBoost, were evaluated for their inversion accuracy with regard to CWC and EWT in the 2024–2025 growing season of winter wheat. The results show that the ratio vegetation index (RVI, NIR/R) exhibited the strongest correlation with CWC (R = 0.97) during critical growth stages. Among the ML models, CatBoost demonstrated superior performance, achieving R2 values of 0.992 (CWC) and 0.962 (EWT) in training datasets, with corresponding RMSE values of 0.012% and 0.1907 g cm−2, respectively. The model maintained robust performance in testing (R2 = 0.893 for CWC, and R2 = 0.961 for EWT), outperforming conventional approaches like RF and DT. High-resolution (5 cm) inversion maps successfully identified spatial variability in crop water status across experimental plots. The CatBoost-RVI framework proved particularly effective during the booting and flowering stages, providing reliable references for precision irrigation management in the NCP. Full article
(This article belongs to the Special Issue Advanced Plant Biotechnology in Sustainable Agriculture—2nd Edition)
Show Figures

Figure 1

15 pages, 1536 KB  
Article
Evaluation of the Risk of Urinary System Stone Recurrence Using Anthropometric Measurements and Lifestyle Behaviors in a Developed Artificial Intelligence Model
by Hikmet Yasar, Kadir Yildirim, Mucahit Karaduman, Bayram Kolcu, Mehmet Ezer, Ferhat Yakup Suceken, Fatih Bicaklioğlu, Mehmet Erhan Aydin, Coskun Kaya, Muhammed Yildirim and Kemal Sarica
Diagnostics 2025, 15(20), 2643; https://doi.org/10.3390/diagnostics15202643 - 20 Oct 2025
Viewed by 474
Abstract
Background/Objectives: Urinary system stone disease is an important health problem both clinically and economically due to its high recurrence rates. In this study, an innovative hybrid approach based on deep learning is proposed to predict the recurrence risk of stone disease. Methods: Patient [...] Read more.
Background/Objectives: Urinary system stone disease is an important health problem both clinically and economically due to its high recurrence rates. In this study, an innovative hybrid approach based on deep learning is proposed to predict the recurrence risk of stone disease. Methods: Patient data were divided into three subsets: anthropometric measurements (Part A), derived body composition indices (Part B), and other clinical and demographic information (Part C). Each data subset was processed with autoencoder models, and low-dimensional, meaningful features were extracted. The obtained features were combined, and the classification process was performed using four different machine learning algorithms: Extreme Gradient Boosting (XGBoost), Cubic Support Vector Machines (Cubic SVM), k-Nearest Neighbor algorithm (KNN), and Decision Tree (DT). Results: According to the experimental results, the highest classification performance was obtained with the XGBoost algorithm. The suggested approach adds to the literature by offering a novel solution that makes early risk calculation for stone disease recurrence easier. It also shows how well structural feature engineering and deep representation can be integrated in clinical prediction issues. Conclusions: Prediction of the stone recurrence risk in advance is of great importance both in terms of improving the quality of life of patients and reducing the unnecessary diagnostic evaluations along with lowering treatment costs. Full article
(This article belongs to the Special Issue New Technologies and Tools Used for Risk Assessment of Diseases)
Show Figures

Figure 1

29 pages, 28659 KB  
Article
Assessing Anthropogenic Impacts on the Carbon Sink Dynamics in Tropical Lowland Rainforest Using Multiple Remote Sensing Data: A Case Study of Jianfengling, China
by Shijie Mao, Mingjiang Mao, Wenfeng Gong, Yuxin Chen, Yixi Ma, Renhao Chen, Miao Wang, Xiaoxiao Zhang, Jinming Xu, Junting Jia and Lingbing Wu
Forests 2025, 16(10), 1611; https://doi.org/10.3390/f16101611 - 20 Oct 2025
Viewed by 407
Abstract
Aboveground biomass (AGB) is a key indicator of forest structure and carbon sequestration, yet its dynamics under concurrent anthropogenic disturbances remain poorly understood. This study investigates the spatiotemporal dynamics and driving mechanisms of AGB in the Jianfengling tropical lowland rainforest (JFLTLR) within Hainan [...] Read more.
Aboveground biomass (AGB) is a key indicator of forest structure and carbon sequestration, yet its dynamics under concurrent anthropogenic disturbances remain poorly understood. This study investigates the spatiotemporal dynamics and driving mechanisms of AGB in the Jianfengling tropical lowland rainforest (JFLTLR) within Hainan Tropical Rainforest National Park (NRHTR) from 2015 to 2023. Six machine learning models—Extreme Gradient Boosting (XGBoost), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), k-Nearest Neighbors (KNN), Decision Tree (DT), and Random Forest (RF)—were evaluated, with RF achieving the highest accuracy (R2 = 0.83). Therefore, RF was employed to generate high-resolution annual AGB maps based on Sentinel-1/2 data fusion, field surveys, socio-economic indicators, and topographic variables. Human pressure was quantified using the Human Influence Index (HII). Threshold analysis revealed a critical breakpoint at ΔHII ≈ 0.1712: below this level, AGB remained relatively stable, whereas beyond it, biomass declined sharply (≈−2.65 mg·ha−1 per 0.01 ΔHII). Partial least squares structural equation modeling (PLS-SEM) identified plantation forests as the dominant negative driver, while GDP (−0.91) and road (−1.04) exerted strong indirect effects through HII, peaking in 2019 before weakening under ecological restoration policies. Spatially, biomass remained resilient within central core zones but declined in peripheral regions associated with road expansion. Temporally, AGB exhibited a trajectory of decline, partial recovery, and renewed loss, resulting in a net reduction of ≈ 0.0393 × 106 mg. These findings underscore the urgent need for a “core stabilization–peripheral containment” strategy integrating disturbance early-warning systems, transportation planning that minimizes impacts on high-AGB corridors, and the strengthening of ecological corridors to maintain carbon-sink capacity and guide differentiated rainforest conservation. Full article
(This article belongs to the Special Issue Modelling and Estimation of Forest Biomass)
Show Figures

Figure 1

31 pages, 6615 KB  
Article
A Modular and Explainable Machine Learning Pipeline for Student Dropout Prediction in Higher Education
by Abdelkarim Bettahi, Fatima-Zahra Belouadha and Hamid Harroud
Algorithms 2025, 18(10), 662; https://doi.org/10.3390/a18100662 - 18 Oct 2025
Viewed by 457
Abstract
Student dropout remains a persistent challenge in higher education, with substantial personal, institutional, and societal costs. We developed a modular dropout prediction pipeline that couples data preprocessing with multi-model benchmarking and a governance-ready explainability layer. Using 17,883 undergraduate records from a Moroccan higher [...] Read more.
Student dropout remains a persistent challenge in higher education, with substantial personal, institutional, and societal costs. We developed a modular dropout prediction pipeline that couples data preprocessing with multi-model benchmarking and a governance-ready explainability layer. Using 17,883 undergraduate records from a Moroccan higher education institution, we evaluated nine algorithms (logistic regression (LR), decision tree (DT), random forest (RF), k-nearest neighbors (k-NN), support vector machine (SVM), gradient boosting, Extreme Gradient Boosting (XGBoost), Naïve Bayes (NB), and multilayer perceptron (MLP)). On our test set, XGBoost attained an area under the receiver operating characteristic curve (AUC–ROC) of 0.993, F1-score of 0.911, and recall of 0.944. Subgroup reporting supported governance and fairness: across credit–load bins, recall remained high and stable (e.g., <9 credits: precision 0.85, recall 0.932; 9–12: 0.886/0.969; >12: 0.915/0.936), with full TP/FP/FN/TN provided. A Shapley additive explanations (SHAP)-based layer identified risk and protective factors (e.g., administrative deadlines, cumulative GPA, and passed-course counts), surfaced ambiguous and anomalous cases for human review, and offered case-level diagnostics. To assess generalization, we replicated our findings on a public dataset (UCI–Portugal; tables only): XGBoost remained the top-ranked (F1-score 0.792, AUC–ROC 0.922). Overall, boosted ensembles combined with SHAP delivered high accuracy, transparent attribution, and governance-ready outputs, enabling responsible early-warning implementation for student retention. Full article
Show Figures

Figure 1

22 pages, 11256 KB  
Article
Driving Mechanisms of Urban Form on Anthropogenic Carbon Emissions: An RSG-Net Ensemble Model for Targeted Carbon Reduction Strategies
by Banglong Pan, Jiayi Li, Zhuo Diao, Qi Wang, Qianfeng Gao, Wuyiming Liu, Ying Shu and Shaoru Feng
Appl. Sci. 2025, 15(20), 11175; https://doi.org/10.3390/app152011175 - 18 Oct 2025
Viewed by 235
Abstract
Urban Form (UF), as a synthesis of urban functions and socioeconomic elements, is closely associated with Anthropogenic Carbon Emissions (ACE) and has important implications for low-carbon urban planning. As a key national economic strategy region, the Yangtze River Economic Belt (YREB) exhibits pronounced [...] Read more.
Urban Form (UF), as a synthesis of urban functions and socioeconomic elements, is closely associated with Anthropogenic Carbon Emissions (ACE) and has important implications for low-carbon urban planning. As a key national economic strategy region, the Yangtze River Economic Belt (YREB) exhibits pronounced heterogeneity in urban development, highlighting the urgent need to elucidate the interaction mechanisms between UF and ACE to support carbon reduction strategies. This study employs nighttime light data and carbon emission records from 2002 to 2022 in the YREB. By integrating Support Vector Regression (SVR), Random Forest (RF), and Gradient Boosting Decision Tree (GBDT), we developed a neural network ensemble model (RSG-Net) to analyze the impacts and driving mechanisms of UF on ACE. The results indicate the following: (1) Over the past two decades, total ACE in the YREB increased by 196%, displaying a three-phase trajectory of rapid growth, deceleration, and rebound. (2) The RSG-Net model achieved superior predictive performance, with an R2 of 0.93, an RMSE of 1.96 × 106 t, an RPD of 3.69, and a PBIAS of 4.53%. (3) Based on Pearson correlation analysis and SHAP (Shapley Additive Explanations) feature importance, beyond economic and demographic indicators, the most influential UF indicators are ranked as Number of Urban Patches (NP), Normalized Difference Vegetation Index (NDVI), and Construction Land Concentration (CLC). These findings demonstrate that the RSG-Net model can not only predict ACE but also identify key UF factors and explain their interrelationships, thereby providing technical support for the formulation of urban carbon reduction strategies. Full article
(This article belongs to the Section Environmental Sciences)
Show Figures

Figure 1

Back to TopTop