Next Article in Journal
Research on Auxetic Lattice Structure for Impact Absorption in Machines and Mechanisms
Previous Article in Journal
Jacobi Stability Analysis of Liu System: Detecting Chaos
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Reparameterized Scale Mixture of Rayleigh Distribution Regression Models with Varying Precision

by
Pilar A. Rivera
1,
Diego I. Gallardo
2,
Osvaldo Venegas
3,*,
Emilio Gómez-Déniz
4 and
Héctor W. Gómez
1
1
Departamento de Estadística y Ciencias de Datos, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile
2
Departamento de Estadística, Facultad de Ciencias, Universidad del Bío-Bío, Concepción 4081112, Chile
3
Departamento de Ciencias Matemáticas y Físicas, Facultad de Ingeniería, Universidad Católica de Temuco, Temuco 4780000, Chile
4
Department of Quantitative Methods in Economics and TIDES Institute, University of Las Palmas de Gran Canaria, 35017 Las Palmas de Gran Canaria, Spain
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(13), 1982; https://doi.org/10.3390/math12131982
Submission received: 29 May 2024 / Revised: 19 June 2024 / Accepted: 23 June 2024 / Published: 27 June 2024
(This article belongs to the Section Probability and Statistics)

Abstract

:
In this paper, we introduce a new parameterization for the scale mixture of the Rayleigh distribution, which uses a mean linear regression model indexed by mean and precision parameters to model asymmetric positive real data. To test the goodness of fit, we introduce two residuals for the new model. A Monte Carlo simulation study is performed to evaluate the parameter estimation of the proposed model. We compare our proposed model with existing alternatives and illustrate its advantages and usefulness using Gilgais data in R software version 4.2.3 with the gamlss package.

1. Introduction

Generalized linear models (GLM) were introduced by Nelder and Wedderburn [1], and are widely used in statistical modeling. For example, Lange et al. [2] proposed robust statistical modeling using the Student’s-t distribution as an alternative to the normal distribution. Smith [3] reviewed the regression models based on the Weibull distribution and gives various applications involving reliability data. Ferrari and Cribari-Neto [4] introduced the beta regression model for modeling situations where the variable of interest is continuous and restricted to the interval (0, 1), and is related to other variables through a regression structure. Rodríguez-Avi et al. [5] developed a regression model for count data based on the generalized Waring distribution. A methodology based on a reparameterized Birnbaum-Saunders regression model with variable accuracy was suggested by Santos-Neto et al. [6]. This methodology enhances the existing literature on the topic. Bourguignon and Gallardo [7] presented a mean linear regression model in which the response variable follows an inverse gamma distribution. This was achieved using a reparameterization of the distribution, indexed by average and precision parameters. Gallardo et al. [8] put forward a regression model where the response variable follows a reparameterized slashed Rayleigh distribution, indexed by mean and precision parameters. Mota et al. [9] proposed a regression model based on a weighted Lindley distribution, reparameterized in terms of mean and precision parameters, and Gómez et al. [10] discuss several aspects of the slashed half-normal distribution, reparameterizing the model based on the mean and comparing it with the known regression models for positive data. The models discussed show the versatility of generalized linear models in statistical analysis. Additionally, in the context of scale mixture models, Guzmán et al. [11] proposed a linear regression model with censored responses based on skew scale mixtures of normal distributions. Ferreira et al. [12] constructed a linear mixed model that relies on skew scale combinations of normal distributions, highlighting the skew Student-t normal, skew–slash, and skew–contaminated normal distributions. A class of additive partially linear models based on scale mixtures of skew-normal under the centralized parameterization was suggested by De Freitas et al. [13], and Benites et al. [14] proposed a novel censored linear regression model where the random errors follow a finite mixture of scale mixtures of normal distribution. From count data to continuous variables constrained within specific intervals, various distributions and reparameterizations offer tailored solutions for different data types. The development of methodologies like the Reparameterized regression model with variable precision, demonstrates the advance of specialized techniques to address specific needs in statistical modeling, such as accommodating varying precision in the data; they allow for a more intuitive understanding of the models’ parameters and can improve model performance. The ongoing development of regression models reflects the continuous evolution and refinement of statistical techniques to meet the evolving needs of data analysis.
In this paper we propose a reparametrized model based on the response variable. The model used is a Scale mixture of Rayleigh (SMR) distribution, proposed by Rivera et al. [15]. This model is based on the ratio of two independent random variables (RVs), the distributed Rayleigh and the Generalized Gamma distribution introduced by Stacy [16]. Let T S M R ( σ , q ) , the probability density function (pdf) of T is given by
f T ( t ; σ , q ) = q t 2 σ t 2 2 σ + 1 q 2 + 1 , t > 0 ,
with σ > 0 and q > 0 . The SMR distribution is known for its ability to model variability in data that may exhibit asymmetry and heavy tails, making it suitable for a wide range of applications in science, engineering, and other disciplines. In this context, we propose a reparametrization of the original model to enhance its interpretability and applicability across different data analysis scenarios. Our reparametrization focuses on facilitating the interpretation of model parameters and improving the computational efficiency of their estimation. Under this parameterization, we propose a reparameterized scale mixture of Rayleigh (RSMR) distribution to be used in a regression model framework, considering a regression structure for the mean. This reparametrized approach has the potential to enhance researchers’ ability to comprehend and model data distributed according to the SMR, which could lead to significant advances in the analysis and interpretation of a wide variety of phenomena observed in practice.
The rest of the paper proceeds as follows. In Section 2, we introduce a new parameterization of the SMR distribution indexed by the mean and precision parameters. The RSMR regression model is analyzed in Section 3. In Section 4, a simulation analysis is carried out to test the performance of the estimates derived from our model. This evaluation of the performance serves to validate the robustness and reliability of our proposed approach under various scenarios and conditions. In Section 5, utilizing the gamlss function ([17]) from the corresponding package as documented by [18], we specifically focus on the Gilgais dataset. Through this empirical application, we aim to demonstrate the efficacy and practical utility of our proposed methodology in real data settings. We compare the new model with the reparameterized Birnbaum-Saunders distribution, the reparameterized Gamma distribution, and the reparameterized Weibull distribution, using the Akaike Information Criterion (AIC) Akaike [19] and the Bayesian Information Criterion (BIC), introduced in Schwarz [20]. Section 6 concludes the paper.

2. An SMR Distribution Parameterized by Its Mean and Precision Parameters

In this section, we consider an alternative parameterization of the SMR model based on mean and precision parameters. For Y S M R ( σ , q ) , the mean and variance are given by
E [ Y ] = π σ 2 Γ ϕ + 1 2 Γ ϕ + 2 2 , V [ Y ] = μ 2 4 Γ ϕ 2 Γ ϕ + 2 2 π Γ ϕ + 1 2 2 1 .
We use the notation Y R S M R ( μ ; ϕ ) to indicate that Y is a random variable following a reparameterized SMR distribution proposed in (1) with mean μ > 0 and precision parameter ϕ > 0 .
Using the proposed parameterization, the RSMR density in (2) can be written as
f Y ( y ; μ , ϕ ) = ( ϕ + 2 ) π Γ ϕ + 1 2 2 y 4 μ 2 Γ ϕ + 2 2 2 π Γ ϕ + 1 2 2 · y 2 4 μ 2 Γ ϕ + 2 2 2 + 1 ϕ 2 + 2 , y > 0 ,
and its cumulative distribution function can be written as
F Y ( y ; μ , ϕ ) = 1 π Γ ϕ + 1 2 2 · y 2 4 μ 2 Γ ϕ + 2 2 2 + 1 ϕ + 2 2 .
Then, the quantile function of the RSMR distribution is given by
Q ( p ) = 2 μ Γ ϕ + 2 2 Γ ϕ + 1 2 1 π ( 1 p ) 2 ϕ + 2 1 .
The expression of the noncentral moments μ 1 , μ 2 , μ 3 , and μ 4 are represented by
μ 1 = σ 2 B 1 2 , ϕ + 1 2 , μ 2 = 2 μ 2 ϕ π 2 B 2 1 2 , ϕ 2 ,
μ 3 = 24 μ ϕ 2 π B 1 2 , ϕ 2 6 B 3 2 , ϕ 1 2 , ϕ > 1 ,
μ 4 = 2 μ ϕ π B 1 2 , ϕ 2 4 B 2 , ϕ 2 2 , ϕ > 2 ,
where B is the beta function.
The coefficient of variation (cv) of the RSMR distribution is given by
cv = 2 ϕ π 2 B 2 1 2 , ϕ 2 1 1 / 2 ,
where B is the beta function (note that cv does not depend on the μ parameter).
The skewness and kurtosis coefficients of the RSMR distribution are given by
β 1 = 24 μ ϕ 2 π B 1 2 , ϕ 2 6 B 3 2 , ϕ 1 2 24 ϕ μ ϕ 2 π B 1 2 , ϕ 2 2 + 2 μ 3 μ 2 2 ϕ π 2 B 2 1 2 , ϕ 2 1 3 / 2 = 24 μ 6 a 6 b 31 24 μ 3 ϕ a 2 + 2 μ 3 μ 3 8 ϕ a 2 1 3 / 2 = 2 μ 3 ( 12 μ 3 ϕ a 6 b 31 12 a 2 + ϕ ) ϕ μ 3 1 ϕ 3 / 2 8 a 2 ϕ 3 / 2 = 2 ϕ ( 12 μ 3 ϕ a 6 b 31 12 a 2 + ϕ ) 8 a 2 ϕ 3 / 2 ,
and
β 2 = 2 μ ϕ π B 1 2 , ϕ 2 4 B 2 , ϕ 2 2 ( 4 μ ) 24 μ ϕ 2 π B 1 2 , ϕ 2 6 B 3 2 , ϕ 1 2 + ( 6 μ 2 ) 2 μ 2 ϕ π 2 B 2 1 2 , ϕ 2 3 μ 4 μ 2 2 ϕ π 2 B 2 1 2 , ϕ 2 1 2 = 32 μ 4 a 4 b 42 96 μ 7 a 6 b 31 + 48 μ 4 a 2 ϕ 3 μ 4 μ 4 1 ϕ 2 8 a 2 ϕ 2 = μ 4 ϕ 32 ϕ a 4 b 42 96 μ 4 ϕ a 6 b 31 + 48 a 2 3 ϕ μ 4 1 ϕ 2 8 a 2 ϕ 2 = ϕ 32 ϕ a 4 b 42 96 μ 4 ϕ a 6 b 31 + 48 a 2 3 ϕ 8 a 2 ϕ 2 ,
where a k = ϕ 2 π B 1 2 , ϕ 2 k and b i j = B i 2 , ϕ j 2 .

3. RSMR Regression Model

Let Y 1 , , Y n be n independent random variables, where each Y i , i = 1 , , n , follows the RMSR model with pdf provided in Equation (3) with mean μ i and precision parameter ϕ . Suppose the mean of Y i satisfies the following functional relation
g 1 ( μ i ) = η 1 i = x i β and g 2 ( ϕ i ) = η 2 i = z i ν
where β = ( β 1 , , β p ) is a vector of unknown regression coefficients, β R p , η 1 i are linear predictors, x i = ( x i 1 , , x i p ) are observations on p known covariates, for i = 1 , , n . We assume the matrix X = ( x 1 , , x n ) has rank p. The link function g 1 : R R + ; μ i = g 1 1 ( x i β ^ ) , with g 1 1 ( · ) being the inverse functions of g 1 ( · ) .
The log-likelihood function has the form
( θ ) = i = 1 n ( μ i , ϕ i ) ,
with
( μ i , ϕ i ) = log ( ϕ + 2 ) + 2 log Γ ϕ + 1 2 log ( μ ) log Γ ϕ + 2 2 ϕ 2 + 2 log Z ( θ ) ,
where Z ( μ i , ϕ i ) = π Γ ϕ + 1 2 2 · t 2 4 μ 2 Γ ϕ + 2 2 2 + 1 .

3.1. Residuals

To check the validity of the RSMR regression model, in this subsection we discuss two kind of residuals for this model based on the randomized quantile (Dunn and Smyth [21]) and deviance residuals.

Randomized Quantile Residuals

The randomized quantile residual for the RSMR model are defined as
r i Q = Φ 1 1 π Γ ϕ ^ i + 1 2 2 · y i 2 4 μ ^ i 2 Γ ϕ ^ i + 2 2 2 + 1 ϕ ^ i + 2 2 ,
where i = 1 , , n , Φ 1 ( · ) denotes the inverse of the cdf for the standard normal model, μ ^ i = g 1 1 ( x i β ^ ) , and ϕ ^ i = g 2 1 ( z i ν ^ ) .
If the model is correctly specified, r 1 Q , r 2 Q , , r n Q is a random sample from the standard normal distribution. The latter expression is used to sketch the QQ-plots in the applications section.

3.2. Deviance Residuals

For the RSMR regression model, the deviance residual is given by
r i D = sign 1 + log S ( y i ; μ ^ i , ϕ ^ i ) 2 1 + log S ( y i ; μ ^ i , ϕ ^ i ) + log log S ( y i ; μ ^ i , ϕ ^ i ) 1 / 2
where i = 1 , , n , sign ( x ) is the sign function, and S ( y ; μ , ϕ ) corresponds to the survival function of the RSMR model, given by S ( y ; μ , ϕ ) = π Γ ϕ + 1 2 2 · y 2 4 μ 2 Γ ϕ + 2 2 2 + 1 ϕ + 2 2 .

4. Simulation Study

In this section, a Monte Carlo simulation study were conducted to assess the performance of the estimators for the RSMR regression model. All simulations were performed using the R programming language [18]. Specifically, to perform parameter estimation for the RSMR model, we used the gamlss package. The initial values used for optimization are ( β , β 1 , β 2 ) and log ( ϕ ) given by ( 0.3 , 0.5 , 0.6 ) , ( 0.3 , 0.25 , 0.6 ) , ( 0.3 , 0 , 0.6 ) , ( 0.3 , 0.5 , 0.3 ) , ( 0.3 , 0.5 , 0 ) , ( 0.3 , 0.25 , 0.3 ) , ( 0.3 , 0 , 0 ) with log ( ϕ ) = 0.7 , and ( 0.2 , 0.5 , 0.8 ) , ( 0.2 , 0.25 , 0.8 ) , ( 0.2 , 0 , 0.8 ) , ( 0.2 , 0.5 , 0.3 ) , ( 0.2 , 0.5 , 0 ) , ( 0.2 , 0.25 , 0.3 ) , ( 0.2 , 0 , 0 ) with log ( ϕ ) = 1.3 . This process was repeated 1000 times with sample size n = 50 , 100, and 200. We calculated the average of the estimations, average of the errors (se), root of the mean squared error (RMSE), and the coverage (Cov) percentages, as shown in Table 1 below. The averages are close to the initial values, and the bias and RMSEs decrease as the sample size increases. The results are found in Table 1 and in the tables in Appendix A in this paper.

5. Real Data Application

We consider the Gilgais (*R) dataset, collected on a line transect survey in Gilgai territory in New South Wales, Australia. The main aim of this study is to explain the electrical conductivity at depth 30–40 cm ( e c 30 ) in terms of pH at depth 30–40 cm ( p H 30 ) and electrical conductivity at depth 80–90 cm ( e c 80 ). To use the dataset, the MASS package is required (see Venables and Ripley [22]).
Table 2 shows a summary for the variable e c 30 . We compared the RSMR regression model with the RGA, reparameterized Birnbaum–Saunders (RBS), and reparameterized Weibull (RWE) regression models. The computational implementation to model mean and dispersion parameters with a set of covariates linked to both components in RGA and RWE models is implemented in the gamlss.dist package in R version 4.2.3 (see Rigby and Stasinopoulos [23] and Rigby and Stasinopoulos [24]), while the RBS model is discussed in Santos-Neto et al. [6].
Table 3 shows the estimated parameters, standard errors (se), AIC (Akaike information criterion), presented in Akaike [19] and BIC (Bayesian information criterion), presented in Schwarz [20] for the models considered. Based on those criteria, the model which is preferable for this data set is the RSMR regression model.
Figure 1 presents the QQ-plot for the randomized quantile residuals for the models considered. If the respective model is appropriate for the dataset, such residuals represent a random sample from the standard normal distribution. The model that provides the best fit for this dataset is the RSMR regression model.

5.1. MASS Package

This is a functions and datasets package to support Venables and Ripley [22]. To fit this model, we use the Gilgais dataset from this package. The dataset was collected on a line transect survey in Gilgai territory in New South Wales, Australia.
The data collection was stimulated by the following question: are these patterns reflected in soil properties? These data have 9 columns and 365 observations for each of the following:
  • pH00, pH30, and pH80: pH at depth 0–10, 30–40, and 80–90 cm, respectively;
  • e00, e30, and e80: electrical conductivity in mS/cm (0–10 cm), (30–40 cm), and (80–90 cm), respectively;
  • c00, c30, and c80: chloride content in ppm (0–10 cm), (30–40 cm), and (80–90 cm), respectively.
Samples were taken at each of the 365 sampling locations on a linear grid with 4 meter spacing, at depths of 0 to 10 cm, 30 to 40 cm, and 80 to 90 cm below the surface.

5.2. Gamlss Package: Fit

Functions for fitting the Generalized Additive Models for Location Scale and Shape are introduced in [23]. In this model, we use the gamlss function in the same package to fit the RSMR regression model, as follows:   
fit <- gamlss(y∼y1+y2,sigma.fo=∼1,i.control = glim.control(cyc = 1000),
                      control = gamlss.control(n.cyc = 1000),
                      family=MRS(mu.link="log",sigma.link="identity"))
   To analyze the summary of the adjustment that we made, we use summary(fit), by which we obtain Estimate, Std.Error, Global Deviance, AIC, BIC, and other values of interest for parameter adjustment. The plot(fit) function displays four plots: residuals against values fitted, residuals against an index or a specific explanatory variable, a density plot of the residuals, and a Normal QQ plot of the residuals, as shown in Figure 2.

6. Conclusions

In this article, we introduce a novel SMR distribution characterized by mean and precision parameters. Following this characterization, we have put forth a unique regression model designed for modeling positively skewed real data. We assess the advantages and practicality of our model by comparing it to various alternative models also reparameterized in its mean. For the estimation of model parameters, we have employed maximum likelihood inference and evaluated its effective performance through Monte Carlo simulations. We also have carried out statistical modeling with actual data using our newly proposed model. The results from this application reveal the impact of covariates pH and ec80 on the mean and precision of observations. Our aspiration is that this innovative model will encourage broader applications in the field of regression analysis, as well other related context such as random effects models and error-in-variables models.

Author Contributions

Conceptualization, P.A.R. and H.W.G.; methodology, H.W.G. and D.I.G.; software, D.I.G. and E.G.-D.; validation, P.A.R. and E.G.-D.; formal analysis, O.V. and D.I.G.; investigation, P.A.R. and O.V.; writing—original draft preparation, P.A.R. and H.W.G.; writing—review and editing, D.I.G. and O.V.; funding acquisition, H.W.G. and O.V. All authors have read and agreed to the published version of the manuscript.

Funding

The research of P.A. Rivera was supported during 2023 as a research assistant by Universidad de Antofagasta, and H.W. Gómez was supported by Semillero UA-2024.

Data Availability Statement

The MASS package version 7.3.60 is required to use this dataset (see Venables and Ripley [22]).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Simulated Results

Table A1. Simulation study for the RSMR model.
Table A1. Simulation study for the RSMR model.
β β 1 β 2 log ( ϕ ) seRMSE% Cov.
n = 50
−0.30.500.7 β ^ −0.33800.09270.08620.91
β ^ 1 0.47970.09160.07620.93
β ^ 2 −0.00090.09420.08400.91
log ( ϕ ^ ) 1.70511.11721.01191.00
n = 100
β ^ −0.32450.06940.06110.92
β ^ 1 0.49350.06020.04890.94
β ^ 2 0.00250.06600.05600.92
log ( ϕ ^ ) 1.26400.65730.57810.99
n = 200
β ^ −0.31250.05170.04110.93
β ^ 1 0.49990.04680.03760.94
β ^ 2 −0.00130.04830.04020.94
log ( ϕ ^ ) 0.96550.43060.33030.94
n = 50
−0.30.25−0.30.7 β ^ −0.34190.09790.09170.89
β ^ 1 0.25430.10820.09930.90
β ^ 2 −0.29690.09330.08160.92
log ( ϕ ^ ) 1.67721.11810.98401.00
n = 100
β ^ −0.32800.07020.06240.91
β ^ 1 0.24780.07180.05890.94
β ^ 2 −0.29650.05690.04750.94
log ( ϕ ^ ) 1.26240.65870.57600.99
n = 200
β ^ −0.31340.05200.04360.94
β ^ 1 0.25450.05360.04240.95
β ^ 2 −0.29750.04850.03910.95
log ( ϕ ^ ) 0.96100.43250.32560.94
n = 50
−0.3000.7 β ^ −0.33660.09220.08750.90
β ^ 1 0.00160.08770.07560.93
β ^ 2 0.00470.08680.07740.92
log ( ϕ ^ ) 1.66211.08910.96891.00
n = 100
β ^ −0.32350.06960.06010.91
β ^ 1 0.00020.06860.05910.93
β ^ 2 −0.00090.06410.05460.93
log ( ϕ ^ ) 1.22290.64470.53670.99
n = 200
β ^ −0.31380.05180.04220.94
β ^ 1 −0.00060.04760.03790.94
β ^ 2 −0.00450.05230.04140.93
log ( ϕ ^ ) 0.95910.43160.32410.95
Table A2. Simulation study for the RSMR model.
Table A2. Simulation study for the RSMR model.
β β 1 β 2 log ( ϕ ) seRMSE% Cov.
n = 50
−0.20.50.81.3 β ^ −0.21330.08780.07780.92
β ^ 1 0.49290.09210.07600.93
β ^ 2 0.77430.09360.07450.94
log ( ϕ ^ ) 2.08171.35480.86301.00
n = 100
β ^ −0.19260.06670.05520.94
β ^ 1 0.46280.07630.06220.93
β ^ 2 0.73340.06080.07370.81
log ( ϕ ^ ) 1.76330.90830.79280.99
n = 200
β ^ −0.19670.04890.03960.95
β ^ 1 0.48580.04590.03190.95
β ^ 2 0.77090.04640.04260.91
log ( ϕ ^ ) 1.30950.50660.45130.99
n = 50
−0.20.250.81.3 β ^ −0.22950.08750.07960.91
β ^ 1 0.22390.09650.08080.94
β ^ 2 0.75650.09830.08440.93
log ( ϕ ^ ) 2.03891.30510.82431.00
n = 100
β ^ −0.20550.06570.05320.95
β ^ 1 0.24990.06420.05120.96
β ^ 2 0.77750.06250.05030.95
log ( ϕ ^ ) 1.60550.75590.55371.00
n = 200
β ^ −0.19970.04740.03750.95
β ^ 1 0.25010.04970.03990.95
β ^ 2 0.78790.05050.04130.95
log ( ϕ ^ ) 1.37120.45050.31760.99
n = 50
−0.200.81.3 β ^ −0.21730.08930.07560.94
β ^ 1 −0.00450.09530.08190.94
β ^ 2 0.79570.09670.07880.94
log ( ϕ ^ ) 1.90841.21110.69871.00
n = 100
β ^ −0.20690.06530.05490.93
β ^ 1 0.00490.06960.05610.95
β ^ 2 0.78790.06570.05310.94
log ( ϕ ^ ) 1.64710.76010.56231.00
n = 200
β ^ −0.19770.04840.03760.95
β ^ 1 0.00190.04740.03680.96
β ^ 2 0.77940.04250.03650.94
log ( ϕ ^ ) 1.37640.47070.39180.99
n = 50
−0.20.5−0.31.3 β ^ −0.21550.08950.07590.93
β ^ 1 0.48680.08690.06920.95
β ^ 2 −0.29790.09370.08000.92
log ( ϕ ^ ) 1.92691.23130.72851.00
n = 100
β ^ −0.21050.06480.05460.94
β ^ 1 0.49680.06410.05260.95
β ^ 2 −0.29690.05780.04680.95
log ( ϕ ^ ) 1.65870.75870.55271.00
n = 200
β ^ −0.20180.04730.03820.93
β ^ 1 0.50030.04940.04040.94
β ^ 2 −0.30090.04630.03750.94
log ( ϕ ^ ) 1.42770.46200.34560.99
n = 50
−0.20.501.3 β ^ −0.21960.09260.07820.93
β ^ 1 0.49870.09390.07840.95
β ^ 2 0.00480.08340.06850.94
log ( ϕ ^ ) 1.84301.16690.64031.00
n = 100
β ^ −0.20710.06490.05250.94
β ^ 1 0.49430.06820.05630.94
β ^ 2 −0.00480.05780.04990.93
log ( ϕ ^ ) 1.63300.75880.53191.00
n = 200
β ^ −0.20170.04710.03870.94
β ^ 1 0.50050.04710.03840.94
β ^ 2 −0.00140.04640.03530.95
log ( ϕ ^ ) 1.44350.46330.37580.99
n = 50
−0.20.25−0.31.3 β ^ −0.21350.08880.07460.93
β ^ 1 0.24590.10390.09030.92
β ^ 2 −0.29730.09080.07170.94
log ( ϕ ^ ) 1.85191.18390.66391.00
n = 100
β ^ −0.20470.06570.05440.95
β ^ 1 0.25090.06610.05430.94
β ^ 2 −0.30080.06090.05090.93
log ( ϕ ^ ) 1.58630.72450.50051.00
n = 200
β ^ −0.20510.04730.03850.95
β ^ 1 0.25020.04570.03670.95
β ^ 2 -0.30020.04490.03510.94
log ( ϕ ^ ) 1.43180.46230.36120.99
Table A3. Simulation study for the RSMR model.
Table A3. Simulation study for the RSMR model.
β β 1 β 2 log ( ϕ ) seRMSE% Cov.
n = 50
−0.2001.3 β ^ −0.21580.08890.07410.94
β ^ 1 0.00710.09490.07860.94
β ^ 2 0.00410.08360.06880.94
log ( ϕ ^ ) 1.84561.17930.63921.00
n = 100
β ^ −0.20910.06510.05470.93
β ^ 1 0.00170.06460.05500.93
β ^ 2 0.00180.06670.05490.95
log ( ϕ ^ ) 1.61530.74920.51291.00
n = 200
β ^ −0.20370.04730.03820.95
β ^ 1 0.00060.04700.04010.94
β ^ 2 0.00010.04610.03800.94
log ( ϕ ^ ) 1.43760.46220.35490.99
n = 50
0.1−0.90.42.3 β ^ 0.08160.08090.07030.92
β ^ 1 −0.85720.08290.07050.94
β ^ 2 0.39090.07710.06130.95
log ( ϕ ^ ) 3.10472.12691.17780.99
n = 100
β ^ 0.09280.05610.04770.93
β ^ 1 −0.87270.05810.04850.94
β ^ 2 0.39890.05360.04480.93
log ( ϕ ^ ) 3.32791.90311.27720.98
n = 200
β ^ 0.09690.03890.03180.94
β ^ 1 −0.87890.04110.03840.92
β ^ 2 0.38550.04150.03630.92
log ( ϕ ^ ) 3.37141.50501.20990.99
Table A4. Simulation study for the RMRS model.
Table A4. Simulation study for the RMRS model.
β β 1 β 2 log ( ϕ ) seRMSE% Cov.
n = 50
0.10.250.42.3 β ^ 0.08890.08400.06720.95
β ^ 1 0.24550.07620.06230.93
β ^ 2 0.39590.07080.05660.96
log ( ϕ ^ ) 2.34431.48100.63621.00
n = 100
β ^ 0.09310.05940.04650.96
β ^ 1 0.25130.05590.04520.95
β ^ 2 0.40400.06190.04890.95
log ( ϕ ^ ) 2.39641.11670.59630.98
n = 200
β ^ 0.09960.04090.03390.94
β ^ 1 0.25090.04040.03260.95
β ^ 2 0.39780.03750.02980.96
log ( ϕ ^ ) 2.46700.79390.47020.98
n = 50
0.100.42.3 β ^ 0.09110.08490.06680.95
β ^ 1 0.00050.08320.06380.96
β ^ 2 0.40350.08830.07030.95
log ( ϕ ^ ) 2.31431.46150.60921.00
n = 100
β ^ 0.09440.05940.04850.94
β ^ 1 −0.00040.06270.05030.94
β ^ 2 0.39930.06330.05090.95
log ( ϕ ^ ) 2.32851.04080.54540.98
n = 200
β ^ 0.09800.04150.03290.95
β ^ 1 −0.00140.03980.03230.94
β ^ 2 0.39720.03950.03230.95
log ( ϕ ^ ) 2.39690.76380.47080.98
n = 50
0.1−0.9−0.32.3 β ^ 0.08740.08540.06500.96
β ^ 1 −0.88480.10550.08090.96
β ^ 2 −0.29980.08890.07120.96
log ( ϕ ^ ) 2.47231.58770.74071.00
n = 100
β ^ 0.09350.05590.04650.94
β ^ 1 −0.86740.05760.05250.91
β ^ 2 −0.28890.05200.04220.94
log ( ϕ ^ ) 3.26661.84511.22950.98
n = 200
β ^ 0.09450.03960.03250.94
β ^ 1 −0.88330.04350.03670.94
β ^ 2 −0.29560.03910.03220.95
log ( ϕ ^ ) 2.96121.13310.85590.98
Table A5. Simulation study for the RMRS model.
Table A5. Simulation study for the RMRS model.
β β 1 β 2 log ( ϕ ) seRMSE% Cov.
n = 50
0.1 0.9 02.3 β ^ 0.08720.08360.06640.96
β ^ 1 −0.90200.09290.07090.95
β ^ 2 0.00500.09270.07010.97
log ( ϕ ^ ) 2.33021.44520.61930.99
n = 100
β ^ 0.09520.05730.04510.95
β ^ 1 −0.88970.06050.04890.94
β ^ 2 0.00540.06730.05260.96
log ( ϕ ^ ) 2.72861.32070.78110.99
n = 200
β ^ 0.09550.03950.03300.93
β ^ 1 −0.86590.04220.04460.88
β ^ 2 −0.00240.04560.03380.94
log ( ϕ ^ ) 3.27241.38481.15470.98
n = 50
0.10.25 0.3 2.3 β ^ 0.08760.08520.06730.95
β ^ 1 0.25360.07960.06320.96
β ^ 2 −0.29930.09810.07630.95
log ( ϕ ^ ) 2.27521.44730.64380.99
n = 100
β ^ 0.09390.05960.04750.95
β ^ 1 0.25190.06150.04690.95
β ^ 2 −0.29700.06610.05180.95
log ( ϕ ^ ) 2.34961.05590.55680.98
n = 200
β ^ 0.09620.04110.03270.96
β ^ 1 0.25090.04160.03290.95
β ^ 2 −0.30010.04140.03340.94
log ( ϕ ^ ) 2.40670.76950.47520.98
n = 50
0.1002.3 β ^ 0.08750.08490.06340.95
β ^ 1 0.00030.07380.05880.95
β ^ 2 0.00280.08380.06550.96
log ( ϕ ^ ) 2.24551.42840.62790.99
n = 100
β ^ 0.09590.05870.04610.95
β ^ 1 0.00090.06540.05270.94
β ^ 2 0.00110.06130.04780.95
log ( ϕ ^ ) 2.39221.12440.58900.98
n = 200
β ^ 0.09890.04110.03300.95
β ^ 1 −0.00250.04090.03190.96
β ^ 2 −0.00110.03950.03120.95
log ( ϕ ^ ) 2.47990.82040.50180.99

References

  1. Nelder, J.A.; Wedderburn, R.W.M. Generalized Linear Models. J. R. Stat. Soc. Ser. A 1972, 135, 370–384. [Google Scholar] [CrossRef]
  2. Lange, K.L.; Little, R.J.; Taylor, J.M.G. Robust statistical modeling using the t-distribution. J. Am. Stat. Assoc. 1989, 84, 881–896. [Google Scholar] [CrossRef]
  3. Smith, R.L. Weibull regression models for reliability data. Reliab. Eng. Syst. Saf. 1991, 34, 55–76. [Google Scholar] [CrossRef]
  4. Ferrari, S.; Cribari-Neto, F. Beta Regression for Modelling Rates and Proportions. J. Appl. Stat. 2004, 31, 799–815. [Google Scholar] [CrossRef]
  5. Rodríguez-Avi, J.; Conde-Sánchez, A.; Sáez-Castillo, A.J.; Olmo-Jiménez, M.J.; Martínez-Rodríguez, A.M. A generalized Waring regression model for count data. Comput. Stat. Data Anal. 2009, 53, 3717–3725. [Google Scholar] [CrossRef]
  6. Santos-Neto, M.; Cysneiros, F.J.A.; Leiva, V.; Barros, M. Reparameterized Birnbaum-Saunders regression models with varying precision. Electron. J. Statist. 2016, 10, 2825–2855. [Google Scholar] [CrossRef]
  7. Bourguignon, M.; Gallardo, D.I. Reparameterized inverse Gamma regression models with varying precision. Stat. Neerl. 2020, 74, 611–627. [Google Scholar] [CrossRef]
  8. Gallardo, D.I.; Gómez-Déniz, E.; Leão, J.; Gómez, H.W. Estimation and diagnostic tools in reparameterized slashed Rayleigh regression model. An application to chemical data. Chemometr. Intell. Lab. Syst. 2007, 207, 104189. [Google Scholar] [CrossRef]
  9. Mota, A.L.; Santos-Neto, M.; Miranda Neto, M.; Leão, J.; Tomazella, V.L.D.; Louzada, F. Weighted Lindley regression model with varying precision: Estimation, modeling and its diagnostics. Commun. Stat. Simul. Comput. 2024, 53, 1690–1710. [Google Scholar] [CrossRef]
  10. Gómez, Y.M.; Gallardo, D.I.; De Castro, M. A regression model for positive data based on the slashed half-normal distribution. REVSTAT–Stat. J. 2021, 19, 553–573. [Google Scholar]
  11. Guzmán, D.C.F.; Ferreira, C.S.; Zeller, C.B. Linear censored regression models with skew scale mixtures of normal distributions. J. Appl. Stat. 2020, 48, 3060–3085. [Google Scholar] [CrossRef] [PubMed]
  12. Ferreira, C.S.; Bolfarine, H.; Lachos, V.H. Linear mixed models based on skew scale mixtures of normal distributions. Commun. Stat. Simul. Comput. 2020, 51, 7194–7214. [Google Scholar] [CrossRef]
  13. de Freitas, J.V.B.; Azevedo, C.L.N.; Nobre, J.S. A stochastic approximation ECME algorithm to semi-parametric scale mixtures of centred skew normal regression models. Stat. Comput. 2023, 33, 51. [Google Scholar] [CrossRef]
  14. Benites, L.; Zeller, C.B.; Bolfarine, H.; Lachos, V.H. Regression modeling of censored data based on compound scale mixtures of normal distributions. Braz. J. Probab. Stat. 2023, 37, 282–312. [Google Scholar] [CrossRef]
  15. Rivera, P.A.; Barranco-Chamorro, I.; Gallardo, D.I.; Gómez, H.W. Scale Mixture of Rayleigh Distribution. Mathematics 2020, 8, 1842. [Google Scholar] [CrossRef]
  16. Stacy, E.W. A generalization of the gamma distribution. Ann. Math. Stat. 1962, 33, 1187–1192. [Google Scholar] [CrossRef]
  17. Stasinopoulos, D.M.; Rigby, R.A. Generalized Additive Models for Location Scale and Shape (GAMLSS) in R. J. Stat. Softw. 2007, 23, 1–46. [Google Scholar] [CrossRef]
  18. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023; Available online: https://www.R-project.org/ (accessed on 8 June 2024).
  19. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  20. Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  21. Dunn, P.K.; Smyth, G.K. Randomized Quantile Residuals. J. Comput. Graph. Stat. 1996, 5, 236–244. [Google Scholar] [CrossRef]
  22. Venables, W.N.; Ripley, B.D. Modern Applied Statistics with S, 4th ed.; Springer: New York, NY, USA, 2002. [Google Scholar]
  23. Rigby, R.A.; Stasinopoulos, D.M. Generalized Additive Models for Location, Scale and Shape (with discussion). J. R. Stat. Soc. Ser. C Appl. Stat. 2005, 54, 507–554. [Google Scholar] [CrossRef]
  24. Rigby, R.A.; Stasinopoulos, D.M. gamlss.dist: Distributions for Generalized Additive Models for Location Scale and Shape. R Package Version 5.1-3, 2019. Available online: https://cran.r-project.org/package=gamlss.dist (accessed on 8 June 2024).
Figure 1. QQ-plot of the randomized quantile residuals of the (a) RGA, (b) RBS, (c) RWE, and (d) RSMR distributions for the dataset.
Figure 1. QQ-plot of the randomized quantile residuals of the (a) RGA, (b) RBS, (c) RWE, and (d) RSMR distributions for the dataset.
Mathematics 12 01982 g001
Figure 2. Plot by plot(fit) for fitting the dataset.
Figure 2. Plot by plot(fit) for fitting the dataset.
Mathematics 12 01982 g002
Table 1. Simulation study for RSMR model.
Table 1. Simulation study for RSMR model.
β β 1 β 2 log ( ϕ ) seRMSE% Cov.
n = 50
0.3 0.5 0.6 0.7 β ^ 0.3114 0.08260.06830.94
β ^ 1 0.59100.09080.07920.92
β ^ 2 0.4984 0.08180.07180.92
log ( ϕ ^ ) 1.72741.11881.03431.00
n = 100
β ^ 0.3254 0.06980.06090.91
β ^ 1 0.48620.08090.06470.94
β ^ 2 0.5809 0.06890.05730.95
log ( ϕ ^ ) 1.23980.65690.55430.99
n = 200
β ^ 0.3124 0.05220.04340.94
β ^ 1 0.48890.05560.04210.94
β ^ 2 0.5957 0.05040.04060.96
log ( ϕ ^ ) 0.92250.42500.28270.97
n = 50
0.3 0.25 0.6 0.7 β ^ 0.3409 0.09110.08890.88
β ^ 1 0.24810.10610.09210.92
β ^ 2 0.5913 0.08650.07900.92
log ( ϕ ^ ) 1.69891.10651.00571.00
n = 100
β ^ 0.3229 0.06910.06090.92
β ^ 1 0.24370.07320.06210.93
β ^ 2 0.5909 0.07400.06170.93
log ( ϕ ^ ) 1.24210.65220.55420.99
n = 200
β ^ 0.3132 0.05190.04190.94
β ^ 1 0.24640.04990.04290.93
β ^ 2 0.5974 0.05160.04180.94
log ( ϕ ^ ) 0.96070.42810.32640.95
n = 50
0.3 0 0.6 0.7 β ^ 0.3429 0.09090.08840.88
β ^ 1 0.0038 0.11310.10060.93
β ^ 2 0.5787 0.09130.07870.92
log ( ϕ ^ ) 1.69211.10780.99891.00
n = 100
β ^ 0.3232 0.06890.06090.92
β ^ 1 0.00060.06610.05520.94
β ^ 2 0.5938 0.06480.05440.94
log ( ϕ ^ ) 1.27920.64980.59040.99
n = 200
β ^ 0.3125 0.05160.04390.94
β ^ 1 0.00020.04550.03780.94
β ^ 2 0.5954 0.04640.03750.96
log ( ϕ ^ ) 0.97850.43070.34260.94
n = 50
−0.30.5−0.30.7 β ^ −0.33230.09230.08580.90
β ^ 1 0.49930.10920.09800.90
β ^ 2 −0.29860.09750.08230.94
log ( ϕ ^ ) 1.64891.08710.95571.00
n = 100
β ^ −0.31960.06970.05920.93
β ^ 1 0.49910.07750.06520.93
β ^ 2 −0.30450.06260.05220.94
log ( ϕ ^ ) 1.25750.65060.57070.99
n = 200
β ^ −0.31430.05190.04240.94
β ^ 1 0.49830.05090.03950.95
β ^ 2 −0.29890.04970.04010.95
log ( ϕ ^ ) 0.97390.43160.33350.94
Table 2. Descriptive statistics for e c 30 .
Table 2. Descriptive statistics for e c 30 .
nMeanMediansdcvSkewnessKurtosis
365 95.770 54 97.953 102.279 1.642 5.483
Table 3. Estimated parameters and standard error (se) for RGA, RBS, RWE and RSMR regression models in Gilgais data set.
Table 3. Estimated parameters and standard error (se) for RGA, RBS, RWE and RSMR regression models in Gilgais data set.
RGARBSRWERSMR
ParameterEstimateseEstimateseEstimateseEstimatese
β I n t e r c e p t 8.0826 0.9266 4.9413 0.7576 10.1701 0.9637 7.7041 1.0611
β p H 30 0.5651 0.1014 0.2012 0.0859 0.7919 0.1046 0.5269 0.1163
β e c 80 0.0063 0.0004 0.0061 0.0003 0.0058 0.0004 0.0065 0.0004
log ( ϕ ) 0.4639 0.0349 1.3385 0.0740 0.5149 0.0396 1.4591 0.3473
AIC 3726.30 3742.23 3736.92 3723.66
BIC 3741.90 3757.83 3752.52 3739.26
CVM 0.0451 <0.0001 0.0916 0.1376
AD 0.0855 <0.0001 0.1389 0.1761
SW 0.1403 <0.0001 0.0031 0.4559
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Rivera, P.A.; Gallardo, D.I.; Venegas, O.; Gómez-Déniz, E.; Gómez, H.W. Reparameterized Scale Mixture of Rayleigh Distribution Regression Models with Varying Precision. Mathematics 2024, 12, 1982. https://doi.org/10.3390/math12131982

AMA Style

Rivera PA, Gallardo DI, Venegas O, Gómez-Déniz E, Gómez HW. Reparameterized Scale Mixture of Rayleigh Distribution Regression Models with Varying Precision. Mathematics. 2024; 12(13):1982. https://doi.org/10.3390/math12131982

Chicago/Turabian Style

Rivera, Pilar A., Diego I. Gallardo, Osvaldo Venegas, Emilio Gómez-Déniz, and Héctor W. Gómez. 2024. "Reparameterized Scale Mixture of Rayleigh Distribution Regression Models with Varying Precision" Mathematics 12, no. 13: 1982. https://doi.org/10.3390/math12131982

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop