Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (96)

Search Parameters:
Keywords = Box–Cox transformation

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
19 pages, 1121 KB  
Article
Comparing ARIMA, Holt–Winters and TimeGPT Models for Municipal Water Consumption Forecasting: Evidence from Vouzela, Portugal
by Júlio Rocha, Salviano Soares, António Valente and Filipe Cabral Pinto
Mathematics 2026, 14(10), 1740; https://doi.org/10.3390/math14101740 - 19 May 2026
Viewed by 132
Abstract
This study presents a methodology for forecasting municipal water consumption to support efficient resource management. Using monthly data from 2018 to 2022 for the municipality of Vouzela, Portugal, three forecasting approaches were evaluated: SARIMA, Holt–Winters, and TimeGPT. Data preparation included logarithmic transformation and [...] Read more.
This study presents a methodology for forecasting municipal water consumption to support efficient resource management. Using monthly data from 2018 to 2022 for the municipality of Vouzela, Portugal, three forecasting approaches were evaluated: SARIMA, Holt–Winters, and TimeGPT. Data preparation included logarithmic transformation and stationarity assessment using the KPSS test, ensuring appropriate conditions for statistical modelling. The SARIMA model was selected automatically based on the Akaike Information Criterion (AIC), while the Holt–Winters method was fitted with additive components and a Box–Cox transformation. In addition, TimeGPT was employed as a state-of-the-art foundation model for time series forecasting. The three methods were used to predict water consumption for the 12 months of 2023, and their performance was assessed using MAE, MSE, RMSE and MAPE. Results indicate that although all methods perform adequately, Holt–Winters and TimeGPT better capture recent consumption dynamics, providing more accurate forecasts in several periods. Overall, this study shows that combining classical statistical models with advanced forecasting techniques offers local authorities reliable and computationally accessible tools to support water supply planning and sustainability. Full article
(This article belongs to the Special Issue Advanced Machine Learning Analysis and Application in Data Science)
Show Figures

Graphical abstract

20 pages, 1722 KB  
Article
Fully Automated Serum LC-MS/MS Platform and Pediatric Reference Intervals for Organic Acids, Amino Acids, and Acylcarnitines in Children (Ages 0–6 Years): Toward Quantitative Diagnosis of Inborn Errors of Metabolism
by Yasushi Ueyanagi, Daiki Setoyama, Tsuyoshi Nakanishi, Yuichi Mushimoto, Vlad Tocan, Hironori Kobayashi, Miki Matsui, Shinya Matsumoto, Akiyoshi Fujishima, Taeko Hotta, Ayumi Sakata and Yuya Kunisaki
Diagnostics 2026, 16(6), 911; https://doi.org/10.3390/diagnostics16060911 - 19 Mar 2026
Viewed by 859
Abstract
Background/Objectives: Conventional diagnosis of inborn errors of metabolism (IEMs) requires multiple specimen types—urine organic acids, plasma amino acids, and serum acylcarnitines—analyzed on distinct analytical platforms. This multi-assay approach is labor-intensive and limits timely clinical decision making. We aimed to develop a fully automated [...] Read more.
Background/Objectives: Conventional diagnosis of inborn errors of metabolism (IEMs) requires multiple specimen types—urine organic acids, plasma amino acids, and serum acylcarnitines—analyzed on distinct analytical platforms. This multi-assay approach is labor-intensive and limits timely clinical decision making. We aimed to develop a fully automated serum-based LC–MS/MS platform for integrated quantitative metabolite profiling and to establish pediatric reference intervals (RIs) to support diagnostic interpretation. Methods: A fully automated LC–MS/MS system integrated with the CLAM-2030 automated pretreatment module was developed to enable simultaneous quantification of 25 organic acids, 8 amino acids, and 21 acylcarnitines. Analytical performance was assessed for linearity, limits of detection and quantification, precision and accuracy. Serum samples from 296 non-IEM children aged 0–6 years were analyzed to establish pediatric RIs using Box–Cox transformation and Gaussian modeling. Clinical utility was evaluated in sera from 89 patients diagnosed with IEM using z-score-based logistic regression models. Results: The method demonstrated excellent performance, with linearity (r2 > 0.99) across calibration ranges, limits of detection and quantification defined by signal-to-noise ratios > 3 and >10, and intra- and inter-assay precision < 15% CV for all 54 analytes. Twenty-one analytes met the acceptance criterion of ±20% accuracy at all quality-control levels. Pediatric RIs provided a quantitative framework for interpreting the metabolic abnormalities. In IEM patients, disease-specific metabolites were consistently outside the established ranges, and z-score-based logistic regression models successfully distinguished major IEM categories, including organic acidemias and long-chain fatty acid oxidation disorders. Conclusions: This fully automated, serum-based LC–MS/MS platform provides a clinically practical and quantitative framework for integrated metabolic profiling using pediatric RIs, supporting diagnosis and monitoring of IEMs in pediatric settings. Full article
(This article belongs to the Section Clinical Laboratory Medicine)
Show Figures

Figure 1

44 pages, 3365 KB  
Article
A Moment-Targeting Normality Transformation Based on Simultaneous Optimization of Tukey g–h Distribution Parameters
by Zeynel Cebeci, Figen Ceritoglu, Melis Celik Guney and Adnan Unalan
Symmetry 2026, 18(3), 458; https://doi.org/10.3390/sym18030458 - 6 Mar 2026
Viewed by 682
Abstract
This study proposes Optimized Skewness and Kurtosis Transformation (OSKT), a novel moment-targeting normality transformation that corrects asymmetry and peakedness in non-normal data. OSKT employs a transformation function derived from the Tukey g–h distribution, incorporating skewness and kurtosis parameters, and is optimized by minimizing [...] Read more.
This study proposes Optimized Skewness and Kurtosis Transformation (OSKT), a novel moment-targeting normality transformation that corrects asymmetry and peakedness in non-normal data. OSKT employs a transformation function derived from the Tukey g–h distribution, incorporating skewness and kurtosis parameters, and is optimized by minimizing a single objective function based on the Anderson–Darling test statistic. The optimization process uses L-BFGS-B to tune the transformation parameters to find the best fit for the standard normal distribution. OSKT ensures a balance between symmetry and tail behavior by minimizing deviations from theoretical normality. It has highly competitive performance compared to the alternative, Box–Cox, Yeo–Johnson transformations, including their robust variants and moment-matching Lambert W method, for normalizing complex distributions. According to our analysis, OSKT also achieves superior normalization for highly non-Gaussian data, successfully transforming highly resistant distributions, including approximately symmetric bimodal datasets, where other methods fail. Full article
(This article belongs to the Section Mathematics)
Show Figures

Figure 1

13 pages, 372 KB  
Article
Unifying Models of Trophic Exploitation: A Mathematical Framework for Understanding the Paradox of Enrichment
by Lindomar Soares dos Santos, Brenno Caetano Troca Cabella and Alexandre Souto Martinez
Math. Comput. Appl. 2026, 31(1), 29; https://doi.org/10.3390/mca31010029 - 14 Feb 2026
Viewed by 512
Abstract
The rapid increase in the world’s human population has largely been attributed to efforts aimed at enhancing primary productivity and enriching food resources. However, an intriguing proposition of M. Rosenzweig, known as the paradox of enrichment, challenged the notion that such enrichment schemes [...] Read more.
The rapid increase in the world’s human population has largely been attributed to efforts aimed at enhancing primary productivity and enriching food resources. However, an intriguing proposition of M. Rosenzweig, known as the paradox of enrichment, challenged the notion that such enrichment schemes always lead to sustained population growth. Instead, they can disrupt the delicate equilibrium of predator–prey systems, potentially driving one or both species to extinction. In this study, we develop a comprehensive mathematical framework that unifies Rosenzweig’s six analytical models of trophic exploitation through the Richards growth model, which can be viewed as a Box–Cox transformation of one species’ abundance relative to carrying capacity. Our analysis not only elucidates the connections and similarities between each model but also presents a generalized framework that unveils the underlying relationships between the proposed functions. Using the generalized logarithm and exponential functions of nonextensive statistical mechanics, we offer a fresh perspective and highlight the importance of a cautious approach when enriching ecosystems. This unification clarifies how the parameters that govern growth dynamics and predator–prey interactions determine system stability in diverse ecological contexts. Through numerical simulations and isoclinic analysis, we demonstrate that our generalized model accurately reproduces the classic paradox of enrichment while providing new insights into the mechanisms driving population fluctuations after environmental enrichment. This mathematical synthesis advances both theoretical ecology and practical conservation efforts by enabling a more accurate assessment of enrichment risks in managed ecosystems. Full article
Show Figures

Figure 1

12 pages, 1148 KB  
Data Descriptor
Psoriatic Arthritis (PsA) Clinical Lipidomics Dataset with Hidden Laboratory Workflow Artifacts: A Benchmark Dataset for Data Processing Quality Control in Lipidomics
by Jörn Lötsch, Robert Gurke, Lisa Hahnefeld, Frank Behrens and Gerd Geisslinger
Data 2026, 11(2), 32; https://doi.org/10.3390/data11020032 - 3 Feb 2026
Cited by 1 | Viewed by 649
Abstract
This dataset presents a real-world lipidomics resource for developing and benchmarking quality control methods, batch effect detection algorithms, and data validation workflows. The data originates from a cross-sectional clinical study of psoriatic arthritis (PsA) patients (n = 81) and healthy controls (n = [...] Read more.
This dataset presents a real-world lipidomics resource for developing and benchmarking quality control methods, batch effect detection algorithms, and data validation workflows. The data originates from a cross-sectional clinical study of psoriatic arthritis (PsA) patients (n = 81) and healthy controls (n = 26), matched for age, sex, and body mass index, which was collected at a tertiary university rheumatology center. Subtle laboratory irregularities were detected only through advanced unsupervised analysis, after passing conventional quality control and standard analytical methods. Blood samples were processed using standardized protocols and analyzed using high-resolution and tandem mass spectrometry platforms. Both targeted and untargeted lipid assays captured lipids of several classes (including carnitines, ceramides, glycerophospholipids, sphingolipids, glycerolipids, fatty acids, sterols and esters, endocannabinoids). The dataset is organized into four comma-separated value (CSV) files: (1) Box–Cox-transformed and imputed lipidomics values; (2) outlier-cleaned and imputed values on the original scale; (3) metadata including clinical classifications, biological sex, and batch information for all assay types and control sample processing dates; and (4) a variable-level description file (readme.csv). The 292 lipid variables are named according to LIPID MAPS classification and standardized nomenclature. Complete batch documentation and FAIR-compliant data structure make this dataset valuable for testing the robustness of analytical pipelines and quality control in lipidomics and related omics fields. This unique dataset does not compete with larger lipidomics quality control datasets for comparisons of results but provides a unique, real-life lipidomics dataset displaying traces of the laboratory sample processing schedule, which can be used to challenge quality control frameworks. Full article
Show Figures

Figure 1

24 pages, 8057 KB  
Article
Retrieval of Mangrove Leaf Area Index Using Multispectral Vegetation Indices and Machine Learning Regression Algorithms
by Liangchao Deng, Xuyang Chen, Li Xu, Bolin Fu, Yongze Xing, Shuo Yu, Tengfang Deng, Yuzhou Huang and Qianguang Liu
Forests 2026, 17(2), 180; https://doi.org/10.3390/f17020180 - 29 Jan 2026
Viewed by 1567
Abstract
Leaf Area Index (LAI) is the total leaf area per unit of land surface area and is a crucial parameter for assessing vegetation growth and productivity. Machine learning regression algorithms are widely applied for LAI estimation. Due to spectral response variations among sensors [...] Read more.
Leaf Area Index (LAI) is the total leaf area per unit of land surface area and is a crucial parameter for assessing vegetation growth and productivity. Machine learning regression algorithms are widely applied for LAI estimation. Due to spectral response variations among sensors and susceptibility of mangrove-derived variables to environmental noise suppression, obtaining sensitivity indices and optimal machine learning regression models is essential for retrieving mangrove LAI at the population scale. This study proposes a novel approach to processing and retrieving mangrove LAI data by integrating multispectral indices with machine learning methods. Box–Cox transformation and CatBoost-based feature selection were employed to obtain the optimal dataset. Random Forest (RF), Gradient Boosting Regression Trees (GBRT), and Categorical Boosting (CatBoost) algorithms were used to evaluate the accuracy of LAI retrieval from Unmanned Aerial Vehicle (UAV) and Gaofen-6 (GF-6) data. Results indicate that when LAI > 3, LAI does not immediately saturate as CVI, MTVI 2, and other indices increase, demonstrating higher sensitivity. UAV data outperformed GF-6 data in retrieving LAI for diverse mangrove populations; during model training, RF proved more suitable for small-sample datasets, while CatBoost effectively suppressed environmental noise. Both RF and CatBoost demonstrated higher robustness in estimating Avicennia marina (AM) (RF: R2 = 0.704) and Aegiceras corniculatum (AC) (R2 = 0.766), respectively. Spatial distribution analysis of LAI indicates that healthy AM and AC cover 85.36% and 96.67% of the area, respectively. Spartina alterniflora and aquaculture wastewater may be among the factors affecting the health of mangrove forests in the study area. LAI retrieval holds significant importance for mangrove health monitoring and risk early warning. Full article
Show Figures

Figure 1

11 pages, 1286 KB  
Article
Establishment and Validation of Serum Ferritin Reference Intervals Based on Real-World Big Data and Multi-Strategy Partitioning Algorithms
by Yixin Xu, Xiaojuan Wu, Junlong Zhang, Qian Niu, Bei Cai and Qiang Miao
J. Clin. Med. 2026, 15(3), 976; https://doi.org/10.3390/jcm15030976 - 26 Jan 2026
Cited by 1 | Viewed by 613
Abstract
Background/Objectives: We aimed to establish and validate population-based reference intervals (RIs) for serum ferritin (SF) using an indirect, date-driven approach based on real-world laboratory data and to optimize partitioning strategies. Methods: SF results from 29,723 apparently healthy individuals who underwent health examinations at [...] Read more.
Background/Objectives: We aimed to establish and validate population-based reference intervals (RIs) for serum ferritin (SF) using an indirect, date-driven approach based on real-world laboratory data and to optimize partitioning strategies. Methods: SF results from 29,723 apparently healthy individuals who underwent health examinations at West China Hospital between 2020 and 2024 were retrospectively analyzed. SF was measured on a Roche Cobas e801 electrochemiluminescence immunoassay platform. After Box–Cox transformation, outliers were removed using an iterative Tukey method. Potential partitioning factors were evaluated, and data-driven age cut-points were explored using decision tree regression and verified with the Harris–Boyd criteria. RIs were estimated using nonparametric percentile methods and validated in an independent cohort of 2494 individuals. Results: SF concentrations were significantly higher in males than in females (p < 0.001). In females, SF showed a significant positive association with age (r = 0.466, p < 0.001), whereas no such association was observed in males. Decision tree analysis identified 50 years as the optimal age cut-off for females (R2 = 0.2467). The final study-derived RIs were 98.02–997.78 µg/L for males, 10.30–299.55 µg/L for females ≤ 50 years, and 36.61–507.00 µg/L for females > 50 years. In the validation cohort, the study-derived RIs achieved pass rates of 93.83–94.72%, which were significantly higher than the manufacturer-provided RIs (37.12–73.97%, all p < 0.001). Conclusions: Using a large health examination database and a multi-step partitioning strategy, we established robust sex- and age-specific SF RIs on the Roche Cobas e801 platform for the local population. This work provides a reproducible, generalizable framework for indirect RI determination of other biomarkers. Full article
(This article belongs to the Section Clinical Laboratory Medicine)
Show Figures

Figure 1

20 pages, 2825 KB  
Article
Effects of Biochar–Fertilizer Combinations on Photosynthetic and Transpiration Functions of Paddy Rice Using Box–Cox Transformation
by Yuanshu Jing, Zhaodong Zheng, Zhiyun Xu, Shuyun Yang and Zhaozhong Feng
Agronomy 2026, 16(2), 160; https://doi.org/10.3390/agronomy16020160 - 8 Jan 2026
Viewed by 792
Abstract
Biochar is recognized for its ability to improve the chemical, physical, and biological properties of soil, thereby enhancing crop productivity. However, the effects of biochar on photosynthetic and transpiration traits in rice crop–soil systems, particularly through the lens of on-site data subjected to [...] Read more.
Biochar is recognized for its ability to improve the chemical, physical, and biological properties of soil, thereby enhancing crop productivity. However, the effects of biochar on photosynthetic and transpiration traits in rice crop–soil systems, particularly through the lens of on-site data subjected to Box–Cox transformation, remain insufficiently explored. To address this, a two-factor randomized block design experiment was conducted using the rice cultivar Nangeng 9108 at the Agricultural Meteorology Experimental Station of Nanjing University of Information Science and Technology over the 2022–2023 principle phenophases. This study investigated changes in leaf stomatal conductance, photosynthetic, transpiration, and water-use efficiency (WUE) parameters under combined applications of biochar (0, 15, and 30 t/ha) and nitrogen fertilizer (0, 180, 225, and 300 kg/ha). Application of the Box–Cox transformation substantially improved data normality and variance homogeneity, enabling the development of a robust predictive model linking net photosynthetic rate to environmental factors. A two-way ANOVA further revealed that both the high nitrogen (300 kg/ha) with high biochar (30 t/ha) treatment and the conventional nitrogen (225 kg/ha) with moderate biochar (15 t/ha) treatment significantly enhanced rice photosynthetic and transpiration performance. Of particular note, the N225B15 treatment, which showed a net photosynthetic rate increase from 9.52% to 19.01%, and transpiration rate increase from 11.49% to 28.43%, is recommended as an optimal fertilization strategy for sustainable rice production. These results underscore the synergistic role of moderate biochar and nitrogen inputs in improving key physiological traits of rice, supporting higher crop yields. Full article
(This article belongs to the Section Water Use and Irrigation)
Show Figures

Figure 1

22 pages, 4060 KB  
Article
High-Performance Concrete Strength Regression Based on Machine Learning with Feature Contribution Visualization
by Lei Zhen, Chang Qu, Man-Lai Tang and Junping Yin
Mathematics 2025, 13(24), 3965; https://doi.org/10.3390/math13243965 - 12 Dec 2025
Viewed by 1055
Abstract
Concrete compressive strength is a fundamental indicator of the mechanical properties of High-Performance Concrete (HPC) with multiple components. Traditionally, it is measured through laboratory tests, which are time-consuming and resource-intensive. Therefore, this study develops a machine learning-based regression framework to predict compressive strength, [...] Read more.
Concrete compressive strength is a fundamental indicator of the mechanical properties of High-Performance Concrete (HPC) with multiple components. Traditionally, it is measured through laboratory tests, which are time-consuming and resource-intensive. Therefore, this study develops a machine learning-based regression framework to predict compressive strength, aiming to reduce experimental costs and resource usage. Under three different data preprocessing strategies—raw data, standard score, and Box–Cox transformation—a selected set of high-performance ensemble models demonstrates excellent predictive capacity, with both the coefficient of determination (R2) and explained variance score (EVS) exceeding 90% across all datasets, indicating high accuracy in compressive strength prediction. In particular, stacking ensemble (R2-0.920, EVS-0.920), XGBoost regression (R2-0.920, EVS-0.920), and HistGradientBoosting regression (R2-0.913, EVS-0.914) based on Box–Cox transformation data show strong generalization capability and stability. Additionally, tree-based and boosting methods demonstrate high effectiveness in capturing complex feature interactions. Furthermore, this study presents an analytical workflow that enhances feature interpretability through visualization techniques—including Partial Dependence Plots (PDP), Individual Conditional Expectation (ICE), and SHapley Additive exPlanations (SHAP). These methods clarify the contribution of each feature and quantify the direction and magnitude of its impact on predictions. Overall, this approach supports automated concrete quality control, optimized mixture proportioning, and more sustainable construction practices. Full article
(This article belongs to the Special Issue Advanced Computational Mechanics)
Show Figures

Figure 1

13 pages, 1045 KB  
Article
Development of a Nomogram for Predicting Lymphovascular Invasion at Initial Transurethral Resection of Bladder Tumors
by Takatoshi Somoto, Takanobu Utsumi, Rino Ikeda, Naoki Ishitsuka, Takahide Noro, Yuta Suzuki, Shota Iijima, Yuka Sugizaki, Ryo Oka, Takumi Endo, Naoto Kamiya, Nobuyuki Hiruta and Hiroyoshi Suzuki
Appl. Sci. 2025, 15(24), 12979; https://doi.org/10.3390/app152412979 - 9 Dec 2025
Viewed by 490
Abstract
Lymphovascular invasion (LVI) is a potent yet underutilized prognostic marker in bladder cancer, particularly in non–muscle-invasive disease (NMIBC). We aimed to develop and internally validate a predictive nomogram to estimate the probability of LVI at initial transurethral resection of bladder tumors (TURBT), utilizing [...] Read more.
Lymphovascular invasion (LVI) is a potent yet underutilized prognostic marker in bladder cancer, particularly in non–muscle-invasive disease (NMIBC). We aimed to develop and internally validate a predictive nomogram to estimate the probability of LVI at initial transurethral resection of bladder tumors (TURBT), utilizing preoperative clinical parameters. In this retrospective cohort study, 413 patients with histologically confirmed urothelial carcinoma who underwent initial TURBT were included. LVI was identified histologically in 9.2% of cases. Univariate and multivariate logistic regression, in conjunction with the least absolute shrinkage and selection operator modeling, revealed eight significant predictors: papillary architecture, Box–Cox–transformed tumor size, urinary cytology classification, age ≥ 75 years, pedunculated morphology, gender, hydronephrosis, and tumor multiplicity. The resulting nomogram demonstrated excellent discriminative performance, with an AUC of 0.888 in the training cohort and 0.827 in the validation cohort, and exhibited good calibration based on weighted plots. This model facilitates individualized prediction of LVI using routinely available clinical data. Early detection of LVI may inform risk-adapted management strategies, including repeat resection, or intensified surveillance in patients with bladder cancer. The model complements existing predictive frameworks and can contribute to more personalized and effective bladder cancer care. Full article
Show Figures

Figure 1

35 pages, 2126 KB  
Review
Techniques and Developments in Stochastic Streamflow Synthesis—A Comprehensive Review
by Shirin Studnicka and Umed S. Panu
Encyclopedia 2025, 5(4), 198; https://doi.org/10.3390/encyclopedia5040198 - 21 Nov 2025
Cited by 2 | Viewed by 1361
Abstract
Stochastic streamflow synthesis has long been the cornerstone of water resource planning, enabling the generation of extended hydrological sequences that reflect natural variability beyond the limitations of observed records. This paper presents a comprehensive review of the theoretical foundations, methodological advancements, and evolving [...] Read more.
Stochastic streamflow synthesis has long been the cornerstone of water resource planning, enabling the generation of extended hydrological sequences that reflect natural variability beyond the limitations of observed records. This paper presents a comprehensive review of the theoretical foundations, methodological advancements, and evolving trends in synthetic streamflow generation. Historical progression is explored through three distinct eras: the pre-modern formulation era (pre-1960), the era dominated by autoregressive models (1960–2000), and the recent period marked by the rise of data-driven AI/ML approaches. Various modelling paradigms, parametric versus non-parametric, traditional versus AI-based, and single- versus multi-scale approaches, are critically assessed and compared with a focus on their applicability across temporal resolutions and hydrological regimes. This study also categorizes evaluation criteria into four dimensions: preservation of stochastic characteristics, distributional consistency, error-based metrics, and operational performance. In addition, the use and impact of transformation techniques (e.g., log or Box-Cox) employed to normalize streamflow distributions for improved model fidelity are examined. A bibliometric analysis of over 200 studies highlights the global research footprint, showing that the United States leads with 70 studies, followed by Canada with 15, reflecting the growing international engagement in the field. The analysis also identifies the most active journals publishing streamflow synthesis research: Water Resources Research (50 publications, since 1967), Journal of Hydrology (25 publications, since 1963), and Journal of the American Water Resources Association (9 publications, since 1974). This review not only synthesizes past and current practices but also outlines key challenges and future research directions to advance stochastic hydrology in an era of climatic uncertainty and data complexity. Full article
(This article belongs to the Section Earth Sciences)
Show Figures

Figure 1

15 pages, 1251 KB  
Article
Application of a Box-Cox Transformed LSTAR-GARCH Model for Point and Interval Forecasting of Monthly Rainfall in Hainan, China
by Xiaoxuan Zhang, Yu Liu and Jun Li
Water 2025, 17(22), 3274; https://doi.org/10.3390/w17223274 - 16 Nov 2025
Viewed by 833
Abstract
To improve the accuracy of monthly rainfall forecasting and reasonably quantify its uncertainty, this study developed a hybrid LSTAR-GARCH model incorporating a Box–Cox transformation. Using monthly rainfall data from 1999 to 2019 from four meteorological stations in Hainan Province (Haikou, Dongfang, Danzhou, and [...] Read more.
To improve the accuracy of monthly rainfall forecasting and reasonably quantify its uncertainty, this study developed a hybrid LSTAR-GARCH model incorporating a Box–Cox transformation. Using monthly rainfall data from 1999 to 2019 from four meteorological stations in Hainan Province (Haikou, Dongfang, Danzhou, and Qiongzhong), the non-stationarity and nonlinearity of the series were first verified using KPSS and BDS tests, and the Box–Cox transformation was applied to reduce skewness. A Logistic Smooth Transition Autoregressive (LSTAR) model was then established to capture nonlinear dynamics, followed by a GARCH(1,1) model to address heteroskedasticity in the residuals. The results indicate that: (1) The LSTAR model effectively captured the nonlinear characteristics of monthly rainfall, with Nash-Sutcliffe efficiency (NSE) values ranging from 0.565 to 0.802, though some bias remained in predicting extreme values; (2) While the GARCH component did not improve point forecast accuracy, it significantly enhanced interval forecasting performance. At the 95% confidence level, the average interval width (RIW) of the LSTAR-GARCH model was reduced to 0.065–0.130, substantially narrower than that of the LSTAR-ARCH model (RIW: 4.548–8.240), while maintaining high coverage rates (CR) between 93.8% and 97.9%; (3) The LSTAR-GARCH model effectively characterizes both the nonlinear mean process and time-varying volatility in rainfall series, proving to be an efficient and reliable tool for interval rainfall forecasting, particularly in tropical monsoon regions with high rainfall variability. This study provides a scientific basis for regional water resource management and climate change adaptation. Full article
(This article belongs to the Section Water and Climate Change)
Show Figures

Figure 1

25 pages, 5257 KB  
Article
A Reduced Stochastic Data-Driven Approach to Modelling and Generating Vertical Ground Reaction Forces During Running
by Guillermo Fernández, José María García-Terán, Álvaro Iglesias-Pordomingo, César Peláez-Rodríguez, Antolin Lorenzana and Alvaro Magdaleno
Modelling 2025, 6(4), 144; https://doi.org/10.3390/modelling6040144 - 6 Nov 2025
Viewed by 943
Abstract
This work presents a time-domain approach for characterizing the Ground Reaction Forces (GRFs) exerted by a pedestrian during running. It is focused on the vertical component, but the methodology is adaptable to other components or activities. The approach is developed from a statistical [...] Read more.
This work presents a time-domain approach for characterizing the Ground Reaction Forces (GRFs) exerted by a pedestrian during running. It is focused on the vertical component, but the methodology is adaptable to other components or activities. The approach is developed from a statistical perspective. It relies on experimentally measured force-time series obtained from a healthy male pedestrian at eight step frequencies ranging from 130 to 200 steps/min. These data are subsequently used to build a stochastic data-driven model. The model is composed of multivariate normal distributions which represent the step patterns of each foot independently, capturing potential disparities between them. Additional univariate normal distributions represent the step scaling and the aerial phase, the latter with both feet off the ground. A dimensionality reduction procedure is also implemented to retain the essential geometric features of the steps using a sufficient set of random variables. This approach accounts for the intrinsic variability of running gait by assuming normality in the variables, validated through state-of-the-art statistical tests (Henze-Zirkler and Shapiro-Wilk) and the Box-Cox transformation. It enables the generation of virtual GRFs using pseudo-random numbers from the normal distributions. Results demonstrate strong agreement between virtual and experimental data. The virtual time signals reproduce the stochastic behavior, and their frequency content is also captured with deviations below 4.5%, most of them below 2%. This confirms that the method effectively models the inherent stochastic nature of running human gait. Full article
Show Figures

Figure 1

13 pages, 815 KB  
Article
A Bayesian Geostatistical Approach to Analyzing Groundwater Depth in Mining Areas
by Maria Chrysanthi, Andrew Pavlides and Emmanouil A Varouchakis
Geosciences 2025, 15(11), 410; https://doi.org/10.3390/geosciences15110410 - 25 Oct 2025
Cited by 1 | Viewed by 918
Abstract
This study addresses the spatial variability of groundwater levels within a mining basin in Greece. The objective is to develop an accurate spatial model of groundwater levels in the area to support an integrated groundwater management plan. Hydraulic heads were measured in 72 [...] Read more.
This study addresses the spatial variability of groundwater levels within a mining basin in Greece. The objective is to develop an accurate spatial model of groundwater levels in the area to support an integrated groundwater management plan. Hydraulic heads were measured in 72 observation wells, which are irregularly distributed, primarily in mining zones. Multiple geostatistical approaches are evaluated to identify an optimal model based on cross-validation metrics. We introduce a novel trend model that includes the surface elevation gradient, as well as the proximity of wells to the riverbed, utilizing a modified Box–Cox transformation to normalize residuals. The results indicate that Regression Kriging with a non-differentiable Matérn variogram outperforms Ordinary Kriging in cross-validation accuracy. The study provides maps of the piezometric head and kriging variance within a Bayesian framework, being among the first to quantify and incorporate river-distance effects within regression kriging for groundwater. Full article
(This article belongs to the Section Hydrogeology)
Show Figures

Figure 1

15 pages, 1977 KB  
Article
Robustness of the Trinormal ROC Surface Model: Formal Assessment via Goodness-of-Fit Testing
by Christos Nakas
Stats 2025, 8(4), 101; https://doi.org/10.3390/stats8040101 - 17 Oct 2025
Viewed by 1170
Abstract
Receiver operating characteristic (ROC) surfaces provide a natural extension of ROC curves to three-class diagnostic problems. A key summary index is the volume under the surface (VUS), representing the probability that a randomly chosen observation from each of the three ordered groups is [...] Read more.
Receiver operating characteristic (ROC) surfaces provide a natural extension of ROC curves to three-class diagnostic problems. A key summary index is the volume under the surface (VUS), representing the probability that a randomly chosen observation from each of the three ordered groups is correctly classified. A parametric estimation of VUS typically assumes trinormality of the class distributions. However, a formal method for the verification of this composite assumption has not appeared in the literature. Our approach generalizes the two-class AUC-based GOF test of Zou et al. to the three-class setting by exploiting the parallel structure between empirical and trinormal VUS estimators. We propose a global goodness-of-fit (GOF) test for trinormal ROC models based on the difference between empirical and trinormal parametric estimates of the VUS. To improve stability, a probit transformation is applied and a bootstrap procedure is used to estimate the variance of the difference. The resulting test provides a formal diagnostic for assessing the adequacy of trinormal ROC modeling. Simulation studies illustrate the robustness of the assumption via the empirical size and power of the test under various distributional settings, including skewed and multimodal alternatives. The method’s application to COVID-19 antibody level data demonstrates the practical utility of it. Our findings suggest that the proposed GOF test is simple to implement, computationally feasible for moderate sample sizes, and a useful complement to existing ROC surface methodology. Full article
(This article belongs to the Section Biostatistics)
Show Figures

Figure 1

Back to TopTop