Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (13,827)

Search Parameters:
Keywords = random forest

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
18 pages, 3674 KiB  
Article
Detection and Quantification of Vegetation Losses with Sentinel-2 Images Using Bi-Temporal Analysis of Spectral Indices and Transferable Random Forest Model
by Alicja Rynkiewicz, Agata Hościło, Linda Aune-Lundberg, Anne B. Nilsen and Aneta Lewandowska
Remote Sens. 2025, 17(6), 979; https://doi.org/10.3390/rs17060979 (registering DOI) - 11 Mar 2025
Abstract
The precise spatially explicit data on land cover and land use changes is one of the essential variables for enhancing the quantification of greenhouse gas emissions and removals, which is relevant for meeting the goal of the European economy and society to become [...] Read more.
The precise spatially explicit data on land cover and land use changes is one of the essential variables for enhancing the quantification of greenhouse gas emissions and removals, which is relevant for meeting the goal of the European economy and society to become climate-neutral by 2050. The accuracy of the machine learning models trained on remote-sensed data suffers from a lack of reliable training datasets and they are often site-specific. Therefore, in this study, we proposed a method that integrates the bi-temporal analysis of the combination of spectral indices that detects the potential changes, which then serve as reference data for the Random Forest classifier. In addition, we examined the transferability of the pre-trained model over time, which is an important aspect from the operational point of view and may significantly reduce the time required for the preparation of reliable and accurate training data. Two types of vegetation losses were identified: woody coverage converted to non-woody vegetation, and vegetated areas converted to sealed surfaces or bare soil. The vegetation losses were detected annually over the period 2018–2021 with an overall accuracy (OA) above 0.97 and a Kappa coefficient of 0.95 for all time intervals in the study regions in Poland and Norway. Additionally, the pre-trained model’s temporal transferability revealed an improvement of the OA by 5 percentage points and the macroF1-Score value by 12 percentage points compared to the original model. Full article
(This article belongs to the Special Issue Women’s Special Issue Series: Remote Sensing 2023-2025)
Show Figures

Figure 1

28 pages, 9801 KiB  
Article
Large-Scale Monitoring of Potatoes Late Blight Using Multi-Source Time-Series Data and Google Earth Engine
by Zelong Chi, Hong Chen, Sheng Chang, Zhao-Liang Li, Lingling Ma, Tongle Hu, Kaipeng Xu and Zhenjie Zhao
Remote Sens. 2025, 17(6), 978; https://doi.org/10.3390/rs17060978 (registering DOI) - 11 Mar 2025
Abstract
Effective monitoring and management of potato late blight (PLB) is essential for sustainable agriculture. This study describes a methodology to improve PLB identification on a large scale. The method combines unsupervised and supervised machine learning algorithms. To improve the monitoring accuracy of the [...] Read more.
Effective monitoring and management of potato late blight (PLB) is essential for sustainable agriculture. This study describes a methodology to improve PLB identification on a large scale. The method combines unsupervised and supervised machine learning algorithms. To improve the monitoring accuracy of the PLB regression model, the study used the K-Means algorithm in conjunction with morphological operations to identify potato growth areas. Input data consisted of monthly NDVI from Sentinel-2 and VH bands from Sentinel-1 (covering the year 2021). The identification results were validated on 221 field survey samples with an F1 score of 0.95. To monitor disease severity, we compared seven machine learning models: CART decision trees (CART), Gradient Tree Boosting (GTB), Random Forest (RF), single optical data Random Forest Time series model (TS–RF), single radar data Random Forest Time series model (STS–RF), multi-source data Gradient Tree Boosting Time series model (MSTS–GTB), and multi-source data Random Forest Time series model (MSTS–RF). The MSTS–RF model was the best performer, with a validation RMSE of 20.50 and an R² of 0.71. The input data for the MSTS–RF model consisted of spectral indices (NDVI, NDWI, NDBI, etc.), radar features (VH-band and VV-band), texture features, and Sentinel-2 bands synthesized as a monthly time series from May to September 2021. The feature importance analysis highlights key features for disease identification: the NIR band (B8) for Sentinel-2, DVI, SAVI, and the VH band for Sentinel-1. Notably, the blue band data (458–523 nm) were critical during the month of May. These features are related to vegetation health and soil moisture are critical for early detection. This study presents for the first time a large-scale map of PLB distribution in China with an accuracy of 10 m and an RMSE of 26.52. The map provides valuable decision support for agricultural disease management, demonstrating the effectiveness and practical potential of the proposed method for large-scale monitoring. Full article
(This article belongs to the Special Issue Plant Disease Detection and Recognition Using Remotely Sensed Data)
Show Figures

Figure 1

16 pages, 1385 KiB  
Article
Development of a miRNA-Based Model for Lung Cancer Detection
by Kai Chin Poh, Toh Ming Ren, Goh Liuh Ling, John S Y Goh, Sarrah Rose, Alexa Wong, Sanhita S. Mehta, Amelia Goh, Pei-Yu Chong, Sim Wey Cheng, Samuel Sherng Young Wang, Seyed Ehsan Saffari, Darren Wan-Teck Lim and Na-Yu Chia
Cancers 2025, 17(6), 942; https://doi.org/10.3390/cancers17060942 (registering DOI) - 10 Mar 2025
Abstract
Background: Lung cancer is the leading cause of cancer-related mortality globally, with late-stage diagnoses contributing to poor survival rates. While lung cancer screening with low-dose computed tomography (LDCT) has proven effective in reducing mortality among heavy smokers, its limitations, including high false-positive rates [...] Read more.
Background: Lung cancer is the leading cause of cancer-related mortality globally, with late-stage diagnoses contributing to poor survival rates. While lung cancer screening with low-dose computed tomography (LDCT) has proven effective in reducing mortality among heavy smokers, its limitations, including high false-positive rates and resource intensiveness, restrict widespread use. Liquid biopsy, particularly using microRNA (miRNA) biomarkers, offers a promising adjunct to current screening strategies. This study aimed to evaluate the predictive power of a panel of serum miRNA biomarkers for lung cancer detection. Patients and Methods: A case-control study was conducted at two tertiary hospitals, enrolling 82 lung cancer cases and 123 controls. We performed an extensive literature review to shortlist 25 candidate miRNAs, of which 16 showed a significant two-fold increase in expression compared to the controls. Machine learning techniques, including Random Forest, K-Nearest Neighbors, Neural Networks, and Support Vector Machines, were employed to identify the top six miRNAs. We then evaluated predictive models, incorporating these biomarkers with lung nodule characteristics on LDCT. Results: A prediction model utilising six miRNA biomarkers (mir-196a, mir-1268, mir-130b, mir-1290, mir-106b and mir-1246) alone achieved area under the curve (AUC) values ranging from 0.78 to 0.86, with sensitivities of 70–78% and specificities of 73–85%. Incorporating lung nodule size significantly improved model performance, yielding AUC values between 0.96 and 0.99, with sensitivities of 92–98% and specificities of 93–98%. Conclusions: A prediction model combining serum miRNA biomarkers and nodule size showed high predictive power for lung cancer. Integration of the prediction model into current lung cancer screening protocols may improve patient outcomes. Full article
(This article belongs to the Special Issue Predictive Biomarkers for Lung Cancer)
Show Figures

Figure 1

23 pages, 8242 KiB  
Article
Study of Factors Influencing Thermal Comfort at Tram Stations in Guangzhou Based on Machine Learning
by Xin Chen, Huanchen Zhao, Beini Wang and Bo Xia
Buildings 2025, 15(6), 865; https://doi.org/10.3390/buildings15060865 (registering DOI) - 10 Mar 2025
Abstract
As global climate change intensifies, the frequency and severity of extreme weather events continue to rise. However, research on semi-outdoor and transitional spaces remains limited, and transportation stations are typically not fully enclosed. Therefore, it is crucial to gain a deeper understanding of [...] Read more.
As global climate change intensifies, the frequency and severity of extreme weather events continue to rise. However, research on semi-outdoor and transitional spaces remains limited, and transportation stations are typically not fully enclosed. Therefore, it is crucial to gain a deeper understanding of the environmental needs of users in these spaces. This study employs machine learning (ML) algorithms and the SHAP (SHapley Additive exPlanations) methodology to identify and rank the critical factors influencing outdoor thermal comfort at tram stations. We collected microclimatic data from tram stations in Guangzhou, along with passenger comfort feedback, to construct a comprehensive dataset encompassing environmental parameters, individual perceptions, and design characteristics. A variety of ML models, including Extreme Gradient Boosting (XGB), Light Gradient Boosting Machine (LightGBM), Categorical Boosting (CatBoost), Random Forest (RF), and K-Nearest Neighbors (KNNs), were trained and validated, with SHAP analysis facilitating the ranking of significant factors. The results indicate that the LightGBM and CatBoost models performed exceptionally well, identifying key determinants such as relative humidity (RH), outdoor air temperature (Ta), mean radiant temperature (Tmrt), clothing insulation (Clo), gender, age, body mass index (BMI), and the location of the space occupied in the past 20 min prior to waiting (SOP20). Notably, the significance of physical parameters surpassed that of physiological and behavioral factors. This research provides clear strategic guidance for urban planners, public transport managers, and designers to enhance thermal comfort at tram stations while offering a data-driven approach to optimizing outdoor spaces and promoting sustainable urban development. Full article
(This article belongs to the Section Building Energy, Physics, Environment, and Systems)
Show Figures

Figure 1

20 pages, 5974 KiB  
Article
Improving the Accuracy of Tree Species Mapping by Sentinel-2 Images Using Auxiliary Data—A Case Study of Slyudyanskoye Forestry Area near Lake Baikal
by Anastasia Popova
Forests 2025, 16(3), 487; https://doi.org/10.3390/f16030487 (registering DOI) - 10 Mar 2025
Abstract
Timely and accurate information on forest composition is crucial for ecosystem conservation and management tasks. Information regarding the distribution and extent of forested areas can be derived through the classification of satellite imagery. However, optical data alone are often insufficient to achieve the [...] Read more.
Timely and accurate information on forest composition is crucial for ecosystem conservation and management tasks. Information regarding the distribution and extent of forested areas can be derived through the classification of satellite imagery. However, optical data alone are often insufficient to achieve the required accuracy due to the similarity in spectral characteristics among tree species, particularly in mountainous regions. One approach to improving the accuracy of forest classification is the integration of auxiliary environmental data. This paper presents the results of research conducted in the Slyudyanskoye Forestry area in the Irkutsk Region. A dataset comprising 101 variables was collected, including Sentinel-2 bands, vegetation indices, and climatic, soil, and topographic data, as well as forest canopy height. The classification was performed using the Random Forest machine learning method. The results demonstrated that auxiliary environmental data significantly improved the performance of the tree species classification model, with the overall accuracy increasing from 49.59% (using only Sentinel-2 bands) to 80.69% (combining spectral data with auxiliary variables). The most significant improvement in accuracy was achieved through the incorporation of climatic and soil features. The most important variables were the shortwave infrared band B11, forest canopy height, the length of the growing season, and the number of days with snow cover. Full article
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)
Show Figures

Figure 1

26 pages, 682 KiB  
Review
The Development of a River Quality Prediction Model That Is Based on the Water Quality Index via Machine Learning: A Review
by Hassan Shaheed, Mohd Hafiz Zawawi and Gasim Hayder
Processes 2025, 13(3), 810; https://doi.org/10.3390/pr13030810 - 10 Mar 2025
Abstract
This review, “The Development of a River Quality Prediction Model That Is Based on the Water Quality Index using Machine Learning: A Review”, discusses and evaluates research articles and attempts to incorporate ML algorithms into the water quality index (WQI) to improve the [...] Read more.
This review, “The Development of a River Quality Prediction Model That Is Based on the Water Quality Index using Machine Learning: A Review”, discusses and evaluates research articles and attempts to incorporate ML algorithms into the water quality index (WQI) to improve the prediction of river water quality. This original study confirms how new methodologies like LSTM, CNNs, and random forest perform better than previous methods, as they offer real-time predictions, operational cost saving, and opportunities for handling big data. This review finds that, in addition to good case studies and real-life applications, there is a need to expand in the following areas: impacts of climate change, ways of enhancing data representation, and concerns to do with ethics as well as data privacy. Furthermore, this review outlines issues, such as data scarcity, model explainability, and computational overhead in real-world ML applications, as well as strategies to preemptively address these issues in order to improve the versatility of data-driven models in various domains. Moving to the analysis of the review specifically to discuss the propositions, the identified key points focus on the use of complex approaches and interdisciplinarity and the involvement of stakeholders. Due to the added specificity and depth in a number of comparisons and specific technical and policy discussions, this sweeping review offers a broad view of how to proceed in enhancing the usefulness of the predictive technologies that will be central to environmental forecasting. Full article
(This article belongs to the Section Advanced Digital and Other Processes)
30 pages, 4981 KiB  
Article
A Machine Learning Framework for Student Retention Policy Development: A Case Study
by Sidika Hoca and Nazife Dimililer
Appl. Sci. 2025, 15(6), 2989; https://doi.org/10.3390/app15062989 - 10 Mar 2025
Abstract
Student attrition at tertiary institutions is a global challenge with significant personal and social consequences. Early identification of students at risk of dropout is crucial for proactive and preventive intervention. This study presents a machine learning framework for predicting and visualizing students at [...] Read more.
Student attrition at tertiary institutions is a global challenge with significant personal and social consequences. Early identification of students at risk of dropout is crucial for proactive and preventive intervention. This study presents a machine learning framework for predicting and visualizing students at risk of dropping out. While most previous work relies on wide-ranging data from numerous sources such as surveys, enrolment, and learning management systems, making the process complex and time-consuming, the current study uses minimal data that are readily available in any registration system. The use of minimal data simplifies the process and ensures broad applicability. Unlike most similar research, the proposed framework provides a comprehensive system that not only identifies students at risk of dropout but also groups them into meaningful clusters, enabling tailored policy generation for each cluster through digital technologies. The proposed framework comprises two stages where the first stage identifies at-risk students using a machine learning classifier, and the second stage uses interpretable AI techniques to cluster and visualize similar students for policy-making purposes. For the case study, various machine learning algorithms—including Support Vector Classifier, K-Nearest Neighbors, Logistic Regression, Naïve Bayes, Artificial Neural Network, Random Forest, Classification and Regression Trees, and Categorical Boosting—were trained for dropout prediction using data available at the end of the students’ second semester. The experimental results indicated that Categorical Boosting with an F1-score of 82% is the most effective classifier for the dataset. The students identified as at risk of dropout were then clustered and a decision tree was used to visualize each cluster, enabling tailored policy-making. Full article
(This article belongs to the Special Issue ICT in Education, 2nd Edition)
Show Figures

Figure 1

20 pages, 21648 KiB  
Article
Spatial–Temporal Heterogeneity of Wetlands in the Alpine Mountains of the Shule River Basin on the Northeastern Edge of the Qinghai–Tibet Plateau
by Shuya Tai, Donghui Shangguan, Jinkui Wu, Rongjun Wang and Da Li
Remote Sens. 2025, 17(6), 976; https://doi.org/10.3390/rs17060976 - 10 Mar 2025
Abstract
Alpine wetland ecosystems, as important carbon sinks and water conservation areas, possess unique ecological functions. Driven by climate change and human activities, the spatial distribution changes in alpine wetlands directly affect the ecosystems and water resource management within a basin. To further refine [...] Read more.
Alpine wetland ecosystems, as important carbon sinks and water conservation areas, possess unique ecological functions. Driven by climate change and human activities, the spatial distribution changes in alpine wetlands directly affect the ecosystems and water resource management within a basin. To further refine the evolution processes of different types of alpine wetlands in different zones of a basin, this study combined multiple field surveys, unmanned aerial vehicle (UAV) flights, and high-resolution images. Based on the Google Earth Engine (GEE) cloud platform, we constructed a Random Forest model to identify and extract alpine wetlands in the Shule River Basin over a long-term period from 1987 to 2021. The results indicated that the accuracy of the extraction based on this method exceeded 90%; the main wetland types are marsh, swamp meadow, and river and lake water bodies; and the spatial–temporal distribution of each wetland type has obvious heterogeneity. In total, 90% of the swamp meadows areas were mainly scattered throughout the study area’s section 3700 to 4300 m above sea level (a.s.l.), and 80% of the marshes areas were concentrated in the Dang River source 3200 m above sea level. From 1987 to 2021, the alpine wetland in the study area showed an overall expansion trend. The total area of the wetland increased by 51,451.8 ha and the area increased by 53.5%. However, this expansion mainly occurred in the elevation zone below 4000 m after 2004, and low-altitude marsh wetland primarily dominated the expansion. The analysis of the spatial–temporal heterogeneity of alpine wetlands can provide a scientific basis for the attribution analysis of the change in alpine wetlands in inland water conservation areas, as well as for protection and rational development and utilization, and promote the healthy development of ecological environments in nature reserves. Full article
Show Figures

Figure 1

25 pages, 9167 KiB  
Article
Spatiotemporal Dynamics and Potential Distribution Prediction of Spartina alterniflora Invasion in Bohai Bay Based on Sentinel Time-Series Data and MaxEnt Modeling
by Qi Wang, Guoli Cui, Haojie Liu, Xiao Huang, Xiangming Xiao, Ming Wang, Mingming Jia, Dehua Mao, Xiaoyan Li, Yihua Xiao and Huiying Li
Remote Sens. 2025, 17(6), 975; https://doi.org/10.3390/rs17060975 - 10 Mar 2025
Viewed by 90
Abstract
The northward expansion of Spartina alterniflora (S. alterniflora) poses a profound ecological threat to coastal ecosystems and biodiversity along China’s coastline. This invasive species exhibits strong adaptability to colder climates, facilitating its potential spread into northern regions and underscoring the urgent [...] Read more.
The northward expansion of Spartina alterniflora (S. alterniflora) poses a profound ecological threat to coastal ecosystems and biodiversity along China’s coastline. This invasive species exhibits strong adaptability to colder climates, facilitating its potential spread into northern regions and underscoring the urgent need for a nuanced understanding of its spatial distribution and invasion risks to inform evidence-based ecosystem management strategies. This study employed multi-temporal Sentinel-1/2 imagery (2016–2022) to map and predict the spread of S. alterniflora in Bohai Bay. An object-based random forest classification achieved an overall accuracy above 92% (κ = 0.978). Over the six-year period, the S. alterniflora distribution decreased from 46.60 km2 in 2016 to 12.56 km2 in 2022, reflecting an annual reduction of approximately 5.67 km2. This decline primarily resulted from targeted eradication efforts, including physical removal, chemical treatments, and biological competition strategies. Despite this local reduction, MaxEnt modeling suggests that climate trends and habitat suitability continue to support potential northward expansion, particularly in high-risk areas such as the Binhai New District, the Shandong Yellow River Delta, and the Laizhou Bay tributary estuary. Key environmental drivers of S. alterniflora distribution include the maximum temperature of the warmest month, mean temperature of the wettest quarter, isothermality, sea surface temperature, mean temperature of the warmest quarter, and soil type. High-risk invasion zones, covering about 95.65 km2. These findings illuminate the spatial dynamics of S. alterniflora and offer scientific guidance for evidence-based restoration and management strategies, ensuring the protection of coastal ecosystems and fostering sustainable development. Full article
(This article belongs to the Special Issue GIS and Remote Sensing in Ocean and Coastal Ecology)
Show Figures

Figure 1

20 pages, 4952 KiB  
Article
Construction and Application of Feature Recommendation Model for Remote Sensing Interpretation of Rock Strata Based on Knowledge Graph
by Liufeng Tao, Qirui Wu, Miao Tian, Zhong Xie, Jianguo Chen, Yueyu Wu and Qinjun Qiu
Remote Sens. 2025, 17(6), 973; https://doi.org/10.3390/rs17060973 - 10 Mar 2025
Viewed by 37
Abstract
The enhancement of remote sensing interpretation accuracy for rock strata in complex terrain areas has long been limited by challenges in field validation and the insufficient integration of geological knowledge in traditional spectral–spatial feature selection methods. This study proposes a geological remote sensing [...] Read more.
The enhancement of remote sensing interpretation accuracy for rock strata in complex terrain areas has long been limited by challenges in field validation and the insufficient integration of geological knowledge in traditional spectral–spatial feature selection methods. This study proposes a geological remote sensing interpretation framework that integrates textual geological data, which enhances lithological identification accuracy by systematically combining multi-source geological knowledge with machine learning algorithms. Using a dataset of 2591 geological survey reports and scientific literature, a remote sensing interpretation ontology model was established, featuring four core entities (rock type, stratigraphic unit, spectral feature, and geomorphological indicator). A hybrid information extraction process combining rule-based parsing and a fine-tuned Universal Information Extraction (UIE) model was employed to extract knowledge from unstructured texts. A knowledge graph constructed using the TransE algorithm consists of 766 entity nodes and 1008 relationships, enabling a quantitative evaluation of feature correlations based on semantic similarity. When combined with Landsat multispectral data and digital elevation model (DEM)-derived terrain parameters, the knowledge-enhanced Random Forest (81.79%) and Support Vector Machine (75.76%) models demonstrated excellent performance in identifying rock-stratigraphic assemblages in the study area. While reducing subjective biases in manual interpretation, the method still has limitations. These include limited use of cross-modal data (e.g., geochemical tables, outcrop images) and a reliance on static knowledge representations. Future research will introduce dynamic graph updating mechanisms and multi-modal fusion architectures to improve adaptability across diverse geological lithological and structural environments. Full article
Show Figures

Figure 1

23 pages, 26510 KiB  
Article
Improving the Individual Tree Parameters Estimation of a Complex Mixed Conifer—Broadleaf Forest Using a Combination of Structural, Textural, and Spectral Metrics Derived from Unmanned Aerial Vehicle RGB and Multispectral Imagery
by Jeyavanan Karthigesu, Toshiaki Owari, Satoshi Tsuyuki and Takuya Hiroshima
Geomatics 2025, 5(1), 12; https://doi.org/10.3390/geomatics5010012 - 10 Mar 2025
Viewed by 36
Abstract
Individual tree parameters are essential for forestry decision-making, supporting economic valuation, harvesting, and silvicultural operations. While extensive research exists on uniform and simply structured forests, studies addressing complex, dense, and mixed forests with highly overlapping, clustered, and multiple tree crowns remain limited. This [...] Read more.
Individual tree parameters are essential for forestry decision-making, supporting economic valuation, harvesting, and silvicultural operations. While extensive research exists on uniform and simply structured forests, studies addressing complex, dense, and mixed forests with highly overlapping, clustered, and multiple tree crowns remain limited. This study bridges this gap by combining structural, textural, and spectral metrics derived from unmanned aerial vehicle (UAV) Red–Green–Blue (RGB) and multispectral (MS) imagery to estimate individual tree parameters using a random forest regression model in a complex mixed conifer–broadleaf forest. Data from 255 individual trees (115 conifers, 67 Japanese oak, and 73 other broadleaf species (OBL)) were analyzed. High-resolution UAV orthomosaic enabled effective tree crown delineation and canopy height models. Combining structural, textural, and spectral metrics improved the accuracy of tree height, diameter at breast height, stem volume, basal area, and carbon stock estimates. Conifers showed high accuracy (R2 = 0.70–0.89) for all individual parameters, with a high estimate of tree height (R2 = 0.89, RMSE = 0.85 m). The accuracy of oak (R2 = 0.11–0.49) and OBL (R2 = 0.38–0.57) was improved, with OBL species achieving relatively high accuracy for basal area (R2 = 0.57, RMSE = 0.08 m2 tree−1) and volume (R2 = 0.51, RMSE = 0.27 m3 tree−1). These findings highlight the potential of UAV metrics in accurately estimating individual tree parameters in a complex mixed conifer–broadleaf forest. Full article
Show Figures

Figure 1

13 pages, 649 KiB  
Article
Robustness of Machine Learning Predictions for Determining Whether Deep Inspiration Breath-Hold Is Required in Breast Cancer Radiation Therapy
by Wlla E. Al-Hammad, Masahiro Kuroda, Ghaida Al Jamal, Mamiko Fujikura, Ryo Kamizaki, Kazuhiro Kuroda, Suzuka Yoshida, Yoshihide Nakamura, Masataka Oita, Yoshinori Tanabe, Kohei Sugimoto, Irfan Sugianto, Majd Barham, Nouha Tekiki, Miki Hisatomi and Junichi Asaumi
Diagnostics 2025, 15(6), 668; https://doi.org/10.3390/diagnostics15060668 - 10 Mar 2025
Viewed by 121
Abstract
Background/Objectives: Deep inspiration breath-hold (DIBH) is a commonly used technique to reduce the mean heart dose (MHD), which is critical for minimizing late cardiac side effects in breast cancer patients undergoing radiation therapy (RT). Although previous studies have explored the potential of machine [...] Read more.
Background/Objectives: Deep inspiration breath-hold (DIBH) is a commonly used technique to reduce the mean heart dose (MHD), which is critical for minimizing late cardiac side effects in breast cancer patients undergoing radiation therapy (RT). Although previous studies have explored the potential of machine learning (ML) to predict which patients might benefit from DIBH, none have rigorously assessed ML model performance across various MHD thresholds and parameter settings. This study aims to evaluate the robustness of ML models in predicting the need for DIBH across different clinical scenarios. Methods: Using data from 207 breast cancer patients treated with RT, we developed and tested ML models at three MHD cut-off values (240, 270, and 300 cGy), considering variations in the number of independent variables (three vs. six) and folds in the cross-validation (three, four, and five). Robustness was defined as achieving high F2 scores and low instability in predictive performance. Results: Our findings indicate that the decision tree (DT) model demonstrated consistently high robustness at 240 and 270 cGy, while the random forest model performed optimally at 300 cGy. At 240 cGy, a threshold critical to minimize late cardiac risks, the DT model exhibited stable predictive power, reducing the risk of overestimating DIBH necessity. Conclusions: These results suggest that the DT model, particularly at lower MHD thresholds, may be the most reliable for clinical applications. By providing a tool for targeted DIBH implementation, this model has the potential to enhance patient-specific treatment planning and improve clinical outcomes in RT. Full article
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)
Show Figures

Figure 1

23 pages, 28011 KiB  
Article
Predicting the Spatial Distribution of Geological Hazards in Southern Sichuan, China, Using Machine Learning and ArcGIS
by Ruizhi Zhang, Dayong Zhang, Bo Shu and Yang Chen
Land 2025, 14(3), 577; https://doi.org/10.3390/land14030577 - 10 Mar 2025
Viewed by 120
Abstract
Geological hazards in Southern Sichuan have become increasingly frequent, posing severe risks to local communities and infrastructure. This study aims to predict the spatial distribution of potential geological hazards using machine learning models and ArcGIS-based spatial analysis. A dataset comprising 2700 known geological [...] Read more.
Geological hazards in Southern Sichuan have become increasingly frequent, posing severe risks to local communities and infrastructure. This study aims to predict the spatial distribution of potential geological hazards using machine learning models and ArcGIS-based spatial analysis. A dataset comprising 2700 known geological hazard locations in Yibin City was analyzed to extract key environmental and topographic features influencing hazard susceptibility. Several machine learning models were evaluated, including random forest, XGBoost, and CatBoost, with model optimization performed using the Sparrow Search Algorithm (SSA) to enhance prediction accuracy. This study produced high-resolution susceptibility maps identifying high-risk zones, revealing a distinct spatial pattern characterized by a concentration of hazards in mountainous areas such as Pingshan County, Junlian County, and Gong County, while plains exhibited a relatively lower risk. Among different hazard types, landslides were found to be the most prevalent. The results further indicate a strong spatial overlap between predicted high-risk zones and existing rural settlements, highlighting the challenges of hazard resilience in these areas. This research provides a refined methodological framework for integrating machine learning and geospatial analysis in hazard prediction. The findings offer valuable insights for rural land use planning and hazard mitigation strategies, emphasizing the necessity of adopting a “small aggregations and multi-point placement” approach to settlement planning in Southern Sichuan’s mountainous regions. Full article
(This article belongs to the Topic Remote Sensing and Geological Disasters)
Show Figures

Figure 1

24 pages, 4616 KiB  
Article
Assessing the Potential Risk of Invasion of the Neophyte Pluchea ovalis (Pers.) DC. (Asteraceae) in the Canarian Archipelago Using an Ensemble of Species Distribution Modelling
by Juan José García-Alvarado, Miguel Pestano-González, Cristina González-Montelongo, Agustín Naranjo-Cigala and José Ramón Arévalo
Diversity 2025, 17(3), 195; https://doi.org/10.3390/d17030195 - 10 Mar 2025
Viewed by 139
Abstract
Invasive species represent a significant threat to biodiversity and ecosystem conservation, with their impacts often amplified in island ecosystems. Species distribution models (SDMs) can infer the potential habitat throughout the life of an organism and are considered a valuable tool for predicting the [...] Read more.
Invasive species represent a significant threat to biodiversity and ecosystem conservation, with their impacts often amplified in island ecosystems. Species distribution models (SDMs) can infer the potential habitat throughout the life of an organism and are considered a valuable tool for predicting the risk of expansion of invasive plants and animals. In our approach, we used an ensemble of four presence–absence models (GLM, GAM, Random Forest, and BART) calibrated only with data collected in Tenerife, the island with the best representation of the species, to infer the habitat suitability for Pluchea ovalis (Pers.) DC. (Asteraceae). Subsequently, we transferred the ensembled model to the rest of the Canarian Island archipelago. Our results show that under near-present conditions, the suitable areas are in the coastal and mid-elevations of the south slope sectors of Tenerife and Gran Canarian Islands, as well as a vast portion of the westernmost and drier islands, always coinciding with ravines and highly disturbed ecosystems. In addition, we forecasted the potential distribution of Pluchea ovalis under different climate change conditions (SSP126, SSP370, and SSP585), showing how its habitability would increase in the worst scenarios. Both contexts favor areas gained by the species in places where they are currently not present, revealing new suitable sectors in the westernmost islands. Full article
(This article belongs to the Section Biodiversity Loss & Dynamics)
Show Figures

Figure 1

24 pages, 3801 KiB  
Article
Risk-Adjusted Performance of Random Forest Models in High-Frequency Trading
by Akash Deep, Abootaleb Shirvani, Chris Monico, Svetlozar Rachev and Frank Fabozzi
J. Risk Financial Manag. 2025, 18(3), 142; https://doi.org/10.3390/jrfm18030142 - 9 Mar 2025
Viewed by 262
Abstract
Because of the theoretical challenges posed by the Efficient Market Hypothesis with respect to technical analysis, the effectiveness of technical indicators in high-frequency trading remains inadequately explored, particularly at the minute-level frequency, where the effects of the microstructure of the market dominate. This [...] Read more.
Because of the theoretical challenges posed by the Efficient Market Hypothesis with respect to technical analysis, the effectiveness of technical indicators in high-frequency trading remains inadequately explored, particularly at the minute-level frequency, where the effects of the microstructure of the market dominate. This study evaluates the integration of traditional technical indicators with Random Forest regression models using minute-level SPY data, analyzing 13 distinct model configurations. Our empirical results reveal a stark contrast between in-sample and out-of-sample performance, with R2 values deteriorating from 0.749–0.812 during training to negative values in testing. A feature importance analysis demonstrates that primary price-based features dominate the predictions made by the model, accounting for over 60% of the importance, while established technical indicators, such as RSI and Bollinger Bands, account for only 14–15%. Although the indicator-enhanced models achieved superior risk-adjusted metrics, with Rachev ratios between 0.919 and 0.961, they consistently underperformed a simple buy-and-hold strategy, generating returns ranging from −2.4% to −3.9%. These findings challenge conventional assumptions about the usefulness of technical indicators in algorithmic trading, suggesting that in high-frequency contexts, they may be more relevant to risk management rather than to predicting returns. For practitioners and researchers, our findings indicate that successful high-frequency trading strategies should focus on adaptive feature selection and regime-specific modeling rather than relying on traditional technical indicators, as well as indicating the critical importance of robust out-of-sample testing in the development of a model. Full article
(This article belongs to the Special Issue Machine Learning Applications in Finance, 2nd Edition)
Show Figures

Figure 1

Back to TopTop