Modeling Mortality Based on Pollution and Temperature Using a New Birnbaum–Saunders Autoregressive Moving Average Structure with Regressors and Related-Sensors Data

Saulo, Helton; Souza, Rubens; Vila, Roberto; Leiva, Víctor; Aykroyd, Robert G.

doi:10.3390/s21196518

Open AccessArticle

Modeling Mortality Based on Pollution and Temperature Using a New Birnbaum–Saunders Autoregressive Moving Average Structure with Regressors and Related-Sensors Data

¹

Department of Statistics, Universidade de Brasília, Brasília 70910-90, Brazil

²

School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso 2362807, Chile

³

Department of Statistics, University of Leeds, Leeds, West Yorkshire LS2 9JT, UK

^*

Author to whom correspondence should be addressed.

Sensors 2021, 21(19), 6518; https://doi.org/10.3390/s21196518

Submission received: 17 August 2021 / Revised: 22 September 2021 / Accepted: 25 September 2021 / Published: 29 September 2021

(This article belongs to the Special Issue Modeling COVID-19 with Artificial Intelligence and Machine/Statistical Learning Techniques from Sensor Data and other Potential Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Environmental agencies are interested in relating mortality to pollutants and possible environmental contributors such as temperature. The Gaussianity assumption is often violated when modeling this relationship due to asymmetry and then other regression models should be considered. The class of Birnbaum–Saunders models, especially their regression formulations, has received considerable attention in the statistical literature. These models have been applied successfully in different areas with an emphasis on engineering, environment, and medicine. A common simplification of these models is that statistical dependence is often not considered. In this paper, we propose and derive a time-dependent model based on a reparameterized Birnbaum–Saunders (RBS) asymmetric distribution that allows us to analyze data in terms of a time-varying conditional mean. In particular, it is a dynamic class of autoregressive moving average (ARMA) models with regressors and a conditional RBS distribution (RBSARMAX). By means of a Monte Carlo simulation study, the statistical performance of the new methodology is assessed, showing good results. The asymmetric RBSARMAX structure is applied to the modeling of mortality as a function of pollution and temperature over time with sensor-related data. This modeling provides strong evidence that the new ARMA formulation is a good alternative for dealing with temporal data, particularly related to mortality with regressors of environmental temperature and pollution.

Keywords:

ARMA models; Birnbaum–Saunders distribution; data dependent over time; maximum likelihood methods; model selection; Monte Carlo simulation; R software; residuals; sensing and data extraction

1. Introduction

Environmental agencies charged with establishing health-based air pollution standards are interested in determining significant relationships between pollution levels and human mortality [1]. These agencies must choose the admissible levels of these standards to protect the population including sensitive groups, such as children and the elderly, against adverse effects on their health [2]. In general, a relevant question to answer is related to the degree of association between pollutants and mortality considering possible environmental contributors, such as climate, linked mainly to temperature [3,4].

Variables associated with mortality, pollutants and temperature are often statistically related, but also their data are dependent over time. Then, a simple multiple regression is not enough to model this relationship, since a time-series structure should be considered [5]. This type of modeling is frequently conducted under the Gaussianity/normality assumption. However, such an assumption is often violated in environmental phenomena due to asymmetry and then diverse practitioners employ logarithmic transformations to reach Gaussianity. Nevertheless, data transformation brings difficulties of interpretation and power loss in statistical tests. Consequently, asymmetric models with suitable mathematical arguments for describing mortality in terms of pollution and temperature can be used. One distribution that holds with asymmetry and possesses such arguments is the Birnbaum–Saunders (BS) distribution as demonstrated in [6].

The BS distribution is a lifetime model that, in recent decades, has been widely applied in different fields of science. This distribution is continuous and unimodal, has positive asymmetry, and is supported on the set of positive real numbers. It is indexed by two parameters corresponding to its shape and scale. Proposed in [7], the BS distribution had its origins in physical problems related to a specific type of fatigue in materials under repeated stress and tension. It describes the total time until the cumulative damage caused by the development and growth of a dominant crack reaches a threshold and failure occurs. Subsequently, some assumptions made in [7] were relaxed in [8], reinforcing the physical justification for the BS model by presenting a more general derivation. For more details on the BS distribution with respect to its properties, see [9,10].

Since its first use and numerous applications in the areas of engineering and material reliability, the BS distribution family has been considered in different fields of knowledge, including environmental sciences [11,12,13,14,15,16,17,18,19]. The wide interest in this distribution is due to its theoretical arguments, its good properties, and its close relationship with the normal distribution. Several works have been performed focussing on aspects of estimation, inference, generalizations, extensions, modeling, and diagnostics in BS models. A summary of the main studies of the BS distribution can be found in [20].

In BS regressions, some forms of modeling were proposed by the authors of [21], who were the pioneers in this type of modeling. They introduced a log-linear structure for the BS distribution and developed methods for estimating parameters, hypothesis testing, and calculating confidence intervals. Later, other investigations were carried out on BS regression models such as shown and summarized in [22]. Additionally, statistical diagnostic methods were presented in [23,24] for BS models. In the same vein, diagnostic methods were formulated in [25] for BS regression models with censored observations. BS quantile regression, boundary, and bimodality have been modeled in a number of works [26,27,28,29]. A generalization of the BS distribution was derived based on elliptically contoured distributions, called the generalized BS distribution, which has been applied widely as well as its mixture [30,31]. In all of these models, the original response must be first transformed onto a logarithmic scale. This leads to a problem of interpretation of the results and to a reduction in the power of the study. In addition, although the mean

ς = log (λ)

is being modeled on the logarithmic scale,

λ = \exp (ς)

is being modeled on the original scale, which, in the case of the BS distribution, corresponds to the median.

A way of dealing with the problem of logarithmic transformation usually applied in BS regression models is through reparametrization. In this sense, several reparametrizations of the BS distribution were introduced in [32], one of which, called the reparameterized BS distribution (RBS), indexes the BS distribution by its mean and precision parameters. Such a reparametrization allows the direct modeling of the mean without the need for a transformation, in a similar way to generalized linear models (GLM). Considering this mean-based RBS distribution, a GLM type regression model was introduced in [33]. In this model, the mean response is related to a linear predictor by one of the several possible link functions, and encompasses all the parameters to be estimated. Unlike all existing BS regression models, the RBS regression approach proposed in [33] allows data to be described at their original scale with ample flexibility.

Despite the growing interest in the BS distribution and the development of a considerable amount of investigation, little has been proposed for data involving a serial correlation structure. In the context of BS models, initial efforts considering a dependence structure are attributed to [34,35,36,37,38,39], and recently to [40]. As mentioned earlier, data on mortality, pollutants and temperature are often statistically related, and temporal dependence may be present. Hence, the main objective of our work is to derive a novel time-series model based on the RBS distribution, which fills a gap in a little-studied area. We derive an RBS autoregressive moving average with regressors (RBSARMAX) time-series model, which is specified in terms of a conditional mean varying over time and extends the RBS regression proposed in [33], where temporal dependence was not considered. Our approach is similar to that studied in [5,41,42]. The secondary objective is to apply the RBSARMAX structure for modeling mortality as a function of pollution and temperature with data that are related to sensors as detailed in the section on application.

The rest of this article is organized as follows. Section 2 presents the RBS distribution, some of its properties, and the RBS regression model proposed in [33]. In Section 3, the new RBSARMAX model is formulated, conditional maximum likelihood (CML) estimators of the model parameters are derived, and residual analysis is considered for this model. In Section 4, we conduct Monte Carlo simulations to evaluate the performance of the proposed methodology. Section 5 applies the RBSARMAX modeling approach to sensor-related time-series data to show its potential. The results are compared with an approach based on a Gaussian ARMA model. Finally, Section 6 provides a summary and some concluding observations, limitations, and ideas for the future of the present work.

2. An RBS Regression Model

2.1. The RBS Distribution

The RBS distribution [32], as one of the various forms of parameterization of the BS distribution, was introduced using a new parametrization of the latter as a function of its mean. The RBS distribution allows several characteristics of data modeling to be considered [32,43].

To start, if a random variable T follows a BS distribution, usually denoted by

T \sim BS (α, λ)

, then its cumulative distribution function (CDF) is given by:

F_{T} (t; α, λ) = Φ [\frac{1}{α} (\sqrt{t / λ} - \sqrt{λ / t})], t > 0, α > 0, λ > 0,

(1)

where

Φ

is the standard normal CDF,

α

is a shape parameter, and

λ

is a scale parameter, as well as the distribution median. Then, by considering the parameters of the BS distribution with CDF defined in (1) as

α = \sqrt{2 / δ}

and

λ = μ δ / (δ + 1)

, the new parameters of the form reparametrized of the BS distribution are expressed as

μ = λ (1 + α^{2} / 2)

and

δ = 2 / α^{2}

, where

μ > 0

is the mean of the distribution and also a scale parameter, whereas

δ > 0

is a shape and precision parameter. In this case, we use the notation

Y \sim RBS (μ, δ)

.

The CDF of

Y \sim RBS (μ, δ)

is stated as:

F (y; μ, δ) = Φ \{\sqrt{\frac{δ}{2}} [\sqrt{\frac{(δ + 1) y}{μ δ}} - \sqrt{\frac{μ δ}{(δ + 1) y}}]\}, y > 0,

(2)

whereas the probability density function (PDF) of Y is obtained by differentiating the expression established in (2) with respect to y formulated as:

f (y; μ, δ) = \frac{exp (δ / 2) \sqrt{δ + 1}}{4 \sqrt{π μ}} y^{- 3 / 2} [y + \frac{μ δ}{(δ + 1)}] exp \{- \frac{δ}{4} [\frac{(δ + 1) y}{μ δ} + \frac{μ δ}{(δ + 1) y}]\}, y > 0 .

(3)

Figure 1 shows some shapes of the RBS PDF. From Figure 1a, note that

δ

, in addition to being a precision parameter, is also a shape parameter. Observe that, as

δ

increases, the PDF is more concentrated around the mean

μ = 1

and therefore the variability decreases. In Figure 1b, note that the distribution mean

μ

also behaves as a scale parameter. Hence, as it increases, there is an increase in the variance and an increased flatness in the PDF.

Due to the relationship of the BS distribution in its original version to the normal distribution, the RBS distribution has the following relationship with the normal distribution:

Y = \frac{μ δ}{δ + 1} {[\frac{Z}{\sqrt{2 δ}} + \sqrt{{(\frac{Z}{\sqrt{2 δ}})}^{2} + 1}]}^{2},

(4)

wherein, from (4), we obtain

Z = {(\frac{δ}{2})}^{\frac{1}{2}} \{{[\frac{(δ + 1) Y}{μ δ}]}^{\frac{1}{2}} - {[\frac{μ δ}{(δ + 1) Y}]}^{\frac{1}{2}}\} \sim N (0, 1) .

(5)

Consequently, from (4) and (5), the quantile function for the RBS distribution is expressed as:

y (q; μ, δ) = F^{- 1} (q; μ, δ) = \frac{μ δ}{δ + 1} {\{\frac{z (q)}{\sqrt{2 δ}} + \sqrt{{[\frac{z (q)}{\sqrt{2 δ}}]}^{2} + 1}\}}^{2}, 0 < q < 1,

(6)

where

z (q)

defined in (6) is the q-th quantile of the standard normal distribution and

F_{Y}^{- 1}

is the inverse of the CDF of Y applied to q. The expressions for the mean and variance of the RBS distribution are stated, respectively, as:

E (Y) = μ, Var (Y) = μ^{2} {[CV (Y)]}^{2},

(7)

where the notation CV defined in (7) is formulated as

CV (Y) = \sqrt{2 δ + 5} / (δ + 1) \in (0, \sqrt{5})

and corresponds to the coefficient of variation of Y. As mentioned,

δ

can be interpreted as a precision parameter, that is, for fixed values of

μ

, when

δ \to \infty

, the variance of Y tends to zero. In addition, for fixed values of

μ

, if

δ \to 0

, then

Var (Y) = 5 μ^{2}

. The median of Y is

δ μ / (δ + 1)

and hence is proportional to the mean. Note that, for

μ

fixed, we have that

δ μ / (δ + 1) \to μ

when

δ \to \infty

.

2.2. Formulation

Based on the RBS distribution, a new approach to the regression modeling of the BS distribution was proposed in [33]. In this approach, the construction of the regression model is similar to the GLM, in which the mean is directly described without the need for a transformation of the dependent variable to the logarithmic scale. Formally, consider

Y = {(Y_{1}, \dots, Y_{n})}^{⊤}

, which is a sample of independent random variables, where each

Y_{t} \sim RBS (μ_{t}, δ)

, for

t \in {1, \dots, n}

, and their respective observations are

y = {(y_{1}, \dots, y_{n})}^{⊤}

. Then, a regression model based on (3) is defined by a systematic component expressed as:

g (μ_{t}) = α_{t} = x_{t}^{⊤} β, t \in {1, \dots, n},

(8)

where

x_{t} = {(x_{t 1}, \dots, x_{t r})}^{⊤}

is a vector of known values for r regressors, with

t \in {1, \dots, n}

and

r < n

,

β = {(β_{1}, \dots, β_{r})}^{⊤}

is a vector of unknown regression coefficients to be estimated, and

α_{t}

is the linear predictor. Here, we have a link function

g : R \to R^{+}

which is strictly monotonic, always positive, and at least twice differentiable. Hence, the mean of the response variable is given by

μ_{t} = g^{- 1} (x_{t}^{⊤} β)

, with

g^{- 1}

being the inverse function of g.

2.3. Estimation

The logarithm of the likelihood function of the RBS regression model for the parameter vector

γ = {(β^{⊤}, δ)}^{⊤}

has the form:

ℓ (γ) = \sum_{t = 1}^{n} ℓ_{t} (y_{t}; μ_{t}, δ),

(9)

where

ℓ_{t} (y_{t}; μ_{t}, δ)

defined in (9) is given by:

ℓ_{t} (y_{t}; μ_{t}, δ) = \frac{δ}{2} - \frac{log (16 π)}{2} - \frac{1}{2} log [\frac{(δ + 1) y_{t}^{3} μ_{t}}{{(δ y_{t} + y_{t} + δ μ_{t})}^{2}}] - \frac{(δ + 1) y_{t}}{4 μ_{t}} - \frac{δ^{2} μ_{t}}{4 (δ + 1) y_{t}} .

The maximum likelihood estimate of

γ

is stated through solution of the system of equations

U_{β_{j}} (γ) = 0

, for

j \in {1, \dots, k}

, and

U_{δ} (γ) = 0

, where

U_{β_{j}} (γ) = \partial ℓ (γ) / \partial β_{j}

, and

U_{δ} (γ) = \partial ℓ (γ) / \partial δ

. In this case, it is not possible to find an analytical solution so that the maximum likelihood estimates must be obtained numerically using an appropriate iterative method for nonlinear optimization problems, such as the Broyden–Fletcher–Goldfarb–Shanno (BFGS) quasi-Newton method, which is implemented in the R software (https://www.r-project.org, accessed on 22 September 2021) [44,45] by a command named optim.

3. RBSARMAX Model

3.1. Formulation

Let

{Y_{t}}

, for

t \in {1, \dots, n}

, be random variables such that the conditional distribution of

Y_{t}

, given the past,

F_{t - 1} = {Y_{t - 1}, \dots, Y_{1}, μ_{t - 1}, \dots, μ_{1}},

follows an RBS distribution, denoted by

Y_{t} | F_{t - 1} \sim RBS (μ_{t}, δ)

. Then, its PDF is given by:

f (y_{t}; μ_{t}, δ | F_{t - 1}) = \frac{exp (\frac{δ}{2}) \sqrt{δ + 1}}{4 \sqrt{π μ_{t}}} y_{t}^{- 3 / 2} [y_{t} + \frac{δ μ_{t}}{(δ + 1)}] exp \{- \frac{δ}{4} [\frac{(δ + 1) y_{t}}{δ μ_{t}} + \frac{δ μ_{t}}{(δ + 1) y_{t}}]\}, y_{t} > 0,

(10)

where

δ > 0

and

μ_{t} = E [Y_{t} | F_{t - 1}]

are the precision parameter and the conditional mean of

Y_{t}

, respectively. Based on the RBS regression presented in (8), we postulate the RBSARMAX

(p, q, r)

model accommodating an additional dynamic component with an ARMA structure and regressors formulated as:

τ_{t} = η + \sum_{i = 1}^{p} ϕ_{i} [g (y_{t - i}) - x_{t - i}^{⊤} β] + \sum_{j = 1}^{q} θ_{j} [g (y_{t - j}) - α_{t - j}],

(11)

such that now g defined in (11) is

g (μ_{t}) = α_{t} = x_{t}^{⊤} β + τ_{t}

, for

t \in {1, \dots, n}

, wherein g,

x_{t}

, and

β = {(β_{1}, \dots, β_{r})}^{⊤} \in R^{r}

are defined as in (8),

ϕ = {(ϕ_{1}, \dots, ϕ_{p})}^{⊤} \in R^{p}

,

θ = {(θ_{1}, \dots, θ_{q})}^{⊤} \in R^{q}

, and

p, q, r \in N

are the ARMAX parameters and their orders, respectively; whereas

η \in R

is a constant.

Therefore, we have that

g (μ_{t}) = α_{t} = η + (x_{t}^{⊤} β - \sum_{i = 1}^{p} ϕ_{i} x_{t - i}^{⊤} β) + \sum_{i = 1}^{p} ϕ_{i} g (y_{t - i}) + \sum_{j = 1}^{q} θ_{j} [g (y_{t - j}) - α_{t - j}] .

(12)

The RBSARMAX model is stated by

Y_{t} | F_{t - 1} \sim RBS (μ_{t}, δ)

, whose PDF is defined in (10), and by the component given in (12). Note that the RBSARMAX model follows the same structure as the GARMA models [41]. For the RBSARMAX structure, the link function chosen is the identity.

3.2. Estimation

Parameter estimation in the RBSARMAX model is performed with the CML method or the first m observations, in which

m = \max {p, q}

and

n > m

. From the expression stated in (10), we have that the log-likelihood function for

γ = {(δ, η, β^{⊤}, ϕ^{⊤}, θ^{⊤})}^{⊤}

conditional on m observations is given by

ℓ (γ) = ℓ = \sum_{t = m + 1}^{n} ℓ_{t} (δ, β, η, ϕ, θ)

, wherein

ℓ_{t} (δ, β, η, ϕ, θ) = ℓ_{t} = log [f (y_{t}; μ_{t}, δ | F_{t - 1})]

is defined by

ℓ_{t} = \frac{δ}{2} + log [\frac{log (16 π)}{2}] - \frac{1}{2} log \{\frac{(δ + 1) Y_{t}^{3} μ_{t}}{{[(δ + 1) Y_{t} + δ μ_{t}]}^{2}}\} - \frac{Y_{t} (δ + 1)}{4 μ_{t}} - \frac{δ^{2} μ_{t}}{4 (δ + 1) Y_{t}} .

(13)

The CML estimate of

γ

can be obtained by maximizing the log-likelihood function defined in (13), matching the score vector

U (γ) = \partial ℓ / \partial γ

to zero. Thus, the CML estimates are obtained numerically using the BFGS method. The methodology proposed in this work can be easily used by a practitioner through the R software. In particular, by employing the function garmaFit of a package named gamlss.util and some functions of the RBS package, which can be downloaded from GitHub via remotes:: install_github(“santosneto/RBS”). Note that the computational cost and complexity are relatively low. In Appendix A, we present mathematical results associated with the Fisher information matrix.

3.3. Residual Analysis

Residuals play a key role in the validation of any statistical model and permit us to detect the existence of outliers. In particular, two types of residuals are proposed in this study. The first is a generalized Cox–Snell (GCS) residual given by:

r_{t}^{GCS} = - log [\hat{S} (y_{t} | F_{t - 1})],

(14)

wherein

\hat{S} (y_{t} | F_{t - 1})

is the estimated survival function for the fitted model, defined as:

\hat{S} (y_{t}; μ_{t}, δ) = Φ [- {(\frac{δ}{2})}^{\frac{1}{2}} \{{[\frac{(δ + 1) y_{t}}{μ_{t} δ}]}^{\frac{1}{2}} - {[\frac{μ_{t} δ}{(δ + 1) y_{t}}]}^{\frac{1}{2}}\}], y_{t} > 0 .

(15)

The GCS residuals follow a unit exponential distribution, EXP (1) in short, when the model is specified correctly, and a plot of the theoretical quantiles versus empirical quantiles (QQ) of

r_{t}^{GCS}

, defined in (14), can be used to assess the fit of the model to the data.

The randomized quantile (RQ) residual is also proposed, which is expressed as:

r_{t}^{GS} = Φ^{- 1} [\hat{S} (y_{t} | F_{t - 1})],

(16)

where

Φ^{- 1}

is the inverse function of the CDF of the standard normal distribution and

\hat{S} (y_{t} | F_{t - 1})

is the estimated survival function, adjusted as in (15). The RQ residual follows a standard normal distribution when the model is specified correctly. Hence, a QQ plot of the residuals defined as in (16) may be utilized to assess the fit of the model to the data.

4. Numerical Simulations

4.1. Definitions and Simulation Model

The simulations are performed using the RBSARMAX(1,1,1) model and are based on samples of size

n \in {100, 200, 500}

, considering two cases. In Case 1, simulations are performed with the values

δ \in {8, 15, 25, 50}

,

β = 0.7

,

η = 1.0

,

ϕ = 0.7

, and

θ = 0.5

. For Case 2, the autoregressive (

ϕ

) and moving average (

θ

) parameters take the values of

0.3

,

0.5

, and

0.7

, with

δ = 8

,

β = 0.7

, and

η = 1.0

. These simulations evaluate the performance of the CML estimators of the RBSARMAX(1,1,1) model parameters. The simulation study is based on 1000 Monte Carlo replicates for each n. The proposed sample sizes aim to verify whether there are improvements in the parameter estimation as the sample size increases. The criteria used to evaluate performance for CML estimators of

ϕ

,

θ

, and

δ

are the empirical mean, bias, variance and mean square error (MSE) given, respectively, by:

\bar{\hat{φ}} = \frac{1}{N_{r}} \sum_{r = 1}^{N_{r}} {\hat{φ}}_{r}, Bias (\hat{φ}) = \bar{\hat{φ}} - φ, \hat{Var} (\hat{φ}) = \frac{1}{N_{r}} \sum_{r = 1}^{N_{r}} {({\hat{φ}}_{r} - \bar{\hat{φ}})}^{2}, \hat{MSE} (\hat{φ}) = \frac{1}{N_{r}} \sum_{r = 1}^{N_{r}} {({\hat{φ}}_{r} - φ)}^{2},

(17)

where

{\hat{φ}}_{r}

is the estimate obtained from the r-th replicate of the corresponding parameter,

φ

represents the true value of the parameter and

N_{r}

is the number of Monte Carlo replicates. With the exception of the mean, for all other calculated statistics, as the value is smaller, the estimator has a better statistical performance. Note that the bias has this characteristic when analyzed in terms of its absolute value. All simulation and estimation routines were developed employing the R software.

4.2. RBSARMAX(1,1,1) Model

Table 1 and Table 2 report the empirical mean, bias, variance, and MSE calculated as in (17) of the estimators for the shape and precision parameter (

δ

), autoregressive parameters (

ϕ

), and moving average parameters (

θ

), respectively. Table 1 shows the estimates for the parameter

δ

, fixed according to Case 1. Note that the performance of the estimator of

δ

is related to the sample size. For example, when the sample size increases from

n = 100

to

n = 500

, the empirical bias in absolute value of the estimator of

δ = 8

, on average, decreases considerably, from

0.4705

to

0.0720

. Consequently, the mean of the estimator of

δ

tends to the true parameter value. In all considered scenarios, the parameter

δ

is, on average, overestimated, that is, the estimate

\hat{δ}

provided by the CML estimator for

δ

is greater than the true value of the parameter. The results of Table 1 are also shown in Figure 2 to simplify the interpretation of the calculated statistics in relation to the sample size and the true values of

δ

. Note in Figure 2a that, as

n \to \infty

, the bias of the estimator in absolute value is smaller.

The results in Table 1 and Figure 2 allow us to conclude that, in general, the performance of the estimator of

δ

is directly related to the sample size. That is, as

n \to \infty

, the values of the statistics are smaller and, consequently, the statistical performance of the estimator is better. Such behavior is expected, because as the sample size is greater, more information is available to estimate the parameters.

Table 2 presents summary statistics for the estimates of the parameters

ϕ

and

θ

, fixed according to the settings described for Case 2. Note that the estimators of

ϕ

and

θ

are very accurate for large sample sizes. This makes the results obtained for the MSE very close to the variance. For example, for a sample size of

n = 500

,

ϕ = 0.5

, and

θ = 0.3

, the estimates are very close to the true value of the parameters, that is,

\hat{ϕ} = 0.4922

and

\hat{θ} =

0.3034. On average, absolute biases in estimated values of

ϕ

or

θ

are always less than

0.0336

. The maximum values of the MSE are observed for

ϕ = 0.3

and

θ = 0.3

with a sample size equal to 100. Considering a fixed sample size, there is a slight reduction in the variance and MSE of the estimators of

ϕ

and

θ

as both of these parameters increase. Observe that the estimated values for

ϕ

and

θ

are, on average, underestimated. That is, the estimates

\hat{ϕ}

and

\hat{θ}

are less than the true parameter, in most of the considered scenarios.

4.3. Performance Measures and Model Selection

Performance measures are used to assess the accuracy of forecasts and compare models. These measures are a function of the observed and predicted values of the time series. Here, we consider two scenarios with respect to the data generating model: (Scenario 1) the model is correctly specified, that is, simulated values from the RBSARMAX model are generated and the RBSARMAX and Gaussian ARMA models are fitted; and (Scenario 2) the model is incorrectly specified, that is, simulated values from an ARMA model based on the Weibull distribution [46] are generated and the RBSARMAX and Gaussian ARMA models are fitted. The Weibull model was chosen because it is an asymmetrical distribution that often is considered as a competing model of the BS distribution. Then, the performance and goodness of fit of the models are compared. To evaluate the predictive ability of the models, the mean absolute percentage error (MAPE) is employed, which is given by:

MAPE = \frac{1}{n} \sum_{t = 1}^{n} | \frac{(y_{t} - {\hat{y}}_{t})}{y_{t}} | \times 100,

(18)

where n is the number of observations in the time series,

y_{t}

is the observed value at time t, and

{\hat{y}}_{t}

is the predicted value of

y_{t}

. To select the best model, we use the Akaike information criterion (AIC) and Bayesian information criterion (BIC), which are stated as:

\begin{matrix} AIC & = & - 2 \log (L) + 2 k, \\ BIC & = & - 2 \log (L) + 2 k \log (n), \end{matrix}

(19)

where L is the maximized likelihood for the estimated model, n is the number of observations, and k is the number of parameters. The AIC relies on the likelihood penalized by the number of model parameters, while the BIC in addition weights the number of parameters using the sample size. Smaller AIC and/or BIC values indicate better models [47].

4.3.1. Scenario 1

Table 3 reports the results for sample sizes

n \in {100, 200, 500}

of the RBSARMAX(1,1,1) model, with

η = 1.0

,

β = 0.7

,

δ = 8

and

ϕ, θ \in {0.3, 0.5, 0.7}

. In the simulation, 1000 replicates are utilized for each combination of parameters. The Gaussian ARMA(1,1) model is also considered. Comparing the RBSARMAX and Gaussian ARMA estructures based on the statistics described in Table 3, note that the values of AIC and BIC highlight the fact that the RBSARMAX model fits the data better than the Gaussian ARMA model, with AIC and BIC being calculated as in (19). Considering the forecasting performance, the RBSARMAX model also provides smaller MAPE values, indicating a better forecasting capacity, with the MAPE being calculated as in (18). To measure the effects of the parameter

δ

on the performance of the model, Table 4 shows the summary results of 1000 Monte Carlo replicates with

η = 1.0

,

β = 0.7

,

ϕ = 0.7

,

θ = 0.5

and

δ \in {8, 15, 25, 50}

. In this case, the RBSARMAX model provides smaller values of AIC, BIC and MAPE, indicating better goodness-of-fit and forecasting ability.

4.3.2. Scenario 2

Table 5 reports results for the RBSARMAX and Gaussian ARMA models. The simulated values are generated from a Weibull ARMA model with

η = 1.0

,

β = 0.7

and

δ = 8

(shape parameter of the Weibull distribution) and

ϕ, θ \in {0.3, 0.5, 0.7}

in the case of Table 5, and from a Weibull ARMA model with

η = 1.0

,

β = 0.7

,

ϕ = 0.5

,

θ = 0.3

and

δ \in {2.5, 5, 8, 15, 25, 50}

in the case of Table 6. In general, the results of both tables show that the RBSARMAX model outperforms the ARMA model in terms of forecasting ability based on the MAPE and root mean squared error (RMSE), with

RMSE = \sqrt{(1 / n) \sum_{t = 1}^{n} {(y_{t} - {\hat{y}}_{t})}^{2}}

, where n,

y_{t}

and

{\hat{y}}_{t}

are as stated in (18). However, the selection criteria (AIC and BIC) indicate an advantage of the latter model. Since usually in time series, one is interested in forecasting, the RBSARMAX model is a better choice.

5. Application to Real-World Data Related to Sensors

5.1. Sensor-Related Data and Definition of the Variables

Next, we deal with an illustration and evaluation of the performance of the RBSARMAX model applied to a real environmental process composed of three time series related to mortality, pollutants, and temperature. Note that the pollutant data are often available from monitoring stations which are associated with sensors [48] and similarly with the temperature. On the one hand, the monitoring stations extract air from the environment for time intervals and then measure the amount of transmitted light. The measurement method is considered to be quite sensitive to particles small enough to penetrate deep into the human lung. On the other hand, the temperature sensors are electrical and electronic components that, as sensors, allow temperature to be measured using a specific electrical signal. This signal can be sent directly or by changing the resistance. They are also called heat sensors or thermosensors.

The analyzed data are available in the R software through the astsa package. These data correspond to 508 observations of weekly averages of cardiovascular mortality in Los Angeles County, CA, USA, from 1970 to 1979, associated with effects of temperature variation and levels of particulate matter (PM), which are very fine particles of solids or liquids suspended in the air [2]. The variables under analysis are mortality (

M_{t}

), temperature (

X_{1 t}

) and PM (

X_{2 t}

). A study similar to this was carried out in [4], which used the same dataset for regression models in the context of a time series.

5.2. Exploratory Data Analysis

The behavior of the variables

M_{t}

,

X_{1 t}

, and

X_{2 t}

over time are shown in Figure 3. Note that all series have a notorious seasonality. In addition, Figure 3a shows a downward trend in mortality over the period under study. Table 7 provides some descriptive measures for each variable, which include: sample size (n), minimum and maximum values, median, standard deviation (SD), CV, and coefficients of symmetry (CS) and kurtosis (CK). Figure 4 displays summaries of

M_{t}

,

X_{1 t}

, and

X_{2 t}

. Histograms are shown along the diagonal; below the diagonal are scatterplots and above the diagonal are the Pearson correlation coefficients (

ρ

). These graphical plots allow us to identify that mortality

M_{t}

and temperature

X_{1 t}

have a clear relationship, with lower temperatures giving higher mortality, and that the mortality is the highest at lower temperatures. Here,

\hat{ρ} = - 0.44

indicates a moderate negative correlation which is statistically different from zero at 1% significance. Similarly, mortality

M_{t}

and PM levels

X_{2 t}

have a linear relationship and a moderate positive correlation (

\hat{ρ} = 0.44

, which is also statistically different from zero at 1% significance), indicating that higher levels of PM are associated with higher levels of mortality. However, temperature

X_{1 t}

and PM

X_{2 t}

have practically no correlation (

\hat{ρ} = - 0.02

). The histograms confirm the summaries in Table 7 show that mortality

M_{t}

and PM levels

X_{2 t}

have positive skewed behavior, whereas temperature

X_{1 t}

is more symmetric. This behavior is confirmed by the box-plots shown in Figure 5. Additionally, in this plot, the presence of outliers for mortality

M_{t}

and PM levels

X_{2 t}

is evident.

5.3. Time-Series Modeling

Based on the analysis of Figure 4, which shows the relationship between the variables

M_{t}

,

X_{1 t}

, and

X_{2 t}

, in addition to considering

M_{t}

as the response variable, these relationships can be modeled over time t using the corresponding observed values

x_{1 t}

and

x_{2 t}

as:

M_{t} = η + β_{1} t + β_{2} (x_{1 t} - {\bar{x}}_{1}) + β_{3} {(x_{1 t} - {\bar{x}}_{1})}^{2} + β_{4} x_{2 t} + ε_{t},

(20)

where the first two terms define a linear trend in t, as seen in Figure 3a; the next two terms describe a quadratic relationship with temperature and

{\bar{x}}_{1}

being the average temperature included to avoid collinearity; the next is a linear term in PM levels; and then

ε_{t}

is a random error or a noise process. In [4], the error consists of independent and identically distributed variables with zero mean and variance

σ_{ε}^{2}

, whereas an alternative approach is taken here.

Figure 6 shows the plots of the autocorrelation function –ACF– (a) and the partial autocorrelation function –PACF– (b) of the residuals fitted with the least squares method for the model stated in (20). Consideration of the ACF and PACF plots suggests the characteristic of a stationary AR(p) model of order

p = 2

for the residuals. Thus, the correlated error model defined in (20) is expressed as:

ε_{t} = ϕ_{1} ε_{t - 1} + ϕ_{2} ε_{t - 2} + u_{t},

where

ε_{t}

is an AR(2) model and

u_{t}

is a white noise. The results for this model are obtained using the garmaFit function of a package named gamlss.util (http://www.gamlss.org, accessed on 22 September 2021). Now, consider an analysis with the RBSARMAX model defined by (10) and (12). Table 8 reports the CML estimates as well as the MAPE and AIC/BIC values. From this table, note that the RBSARMAX(2,0,2) model provides a better fit than the ARMA(2,0) model based on the AIC/BIC values. Moreover, the RBSARMAX(2,0,2) model has less MAPE, indicating better forecasting capacity. We emphasize that, in addition to the advantage of these results, the RBSARMAX(2,0,2) model is more appropriate due to the skewed and kurtosis features in the data empirical distribution.

The QQ plots of the GCS and RQ residuals, with simulation envelopes, are presented in Figure 7a,b, respectively, which indicate better agreement with the EXP(1) distribution in the RBSARMA model. However, for the same analysis referring to the ARMA model based on Figure 7, note that the plots of GCS and RQ residuals, with simulation envelopes, produce points that are located far from the diagonal line and outside the envelope. In the ACF and PACF charts, observe that both models produce non-autocorrelated errors; see Figure 7c,d. The time-series forecasts using the fitted RBSARMAX and ARMA models are presented together with the observed time-series data in Figure 8.

6. Conclusions, Limitations, and Future Research

In this work, a new mean-based autoregressive moving average model using the Birnbaum–Saunders distribution, called RBSARMAX, was studied and formulated mathematically. We have estimated the model parameters with the maximum likelihood method and used information criteria for model selection to assess the adequacy of the new Birnbaum–Saunders autoregressive moving average structure.

We have conducted Monte Carlo simulations to evaluate in practice the statistical performance of the conditional maximum likelihood estimators for the parameters of the new model, showing a good performance. Additionally, in this simulation, several performance measures were used to assess the level of accuracy of forecasts and to compare different models, obtaining similarly reasonable and good results.

In the application, when modeling mortality as a function of pollution and temperature with data related to sensors, the RBSARMAX model presented a superior result to that of the Gaussian ARMA model, providing strong evidence that the Birnbaum–Saunders distribution is a good alternative for dealing with temporal data. Consequently, the results have suggested that the RBSARMAX model can become a valuable tool for analyzing positive and asymmetric time-series data in environmental sciences and other fields of knowledge.

The new methodology is an addition to the tools of applied statisticians, data scientists, and diverse users interested in the modeling of time-series data. From the application presented in this study, we have generated helpful information that may be employed by practitioners and users of statistics.

Some limitations of our proposal are described next. Since the BS distribution is related to the normal distribution, parameter estimation in RBSARMAX models may be affected by outliers and potentially influential cases. To obtain robust estimation, the BS-Student-t distribution could be considered instead [30,49]. Besides fixed effects considered by regression parameters in the RBSARMAX model, random effects may be formulated. A multivariate version of the RBSARMAX model might also be of interest [12,50], and local influence diagnostics could be derived, allowing the detection of potentially influential cases [16]. Other aspects for future study using this new model are associated with quantile, spatial, partial least squares, principal components, and sampling structures [51,52,53,54,55,56].

The authors are working on these and other aspects related to the study reported in this paper, and their findings will be presented in future articles.

Author Contributions

Data curation, H.S., R.S.; investigation, H.S., R.S., V.L.; formal analysis and methodology, H.S., R.S., R.V., V.L., R.G.A.; writing—original draft, H.S., R.S., R.V.; writing—review and editing, V.L., H.S., R.G.A. All authors have read and agreed to this version of the manuscript.

Funding

The research of V.L. was partially supported by FONDECYT, project grant number 1200525, from the National Agency for Research and Development (ANID) of the Chilean government under the Ministry of Science, Technology, Knowledge and Innovation.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data analyzed are available on request.

Acknowledgments

The authors would also like to thank the editor and reviewers for their constructive comments which led to improving the presentation of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. The Observed Fisher Information Matrix

Letting

a (y) = a (y; μ, δ) = \sqrt{δ / 2} (\sqrt{(δ + 1) y / (μ δ)} - \sqrt{μ δ / (δ + 1) y})

, a simple calculation gives

\begin{matrix} \frac{\partial a (y)}{\partial y} = \frac{\sqrt{δ}}{2 \sqrt{2} y} (\sqrt{\frac{(δ + 1) y}{μ δ}} + \sqrt{\frac{μ δ}{(δ + 1) y}}) \\ \frac{\partial a (y)}{\partial μ} = - \frac{\sqrt{δ}}{2 \sqrt{2} μ} (\sqrt{\frac{(δ + 1) y}{μ δ}} + \sqrt{\frac{μ δ}{(δ + 1) y}}), & \frac{\partial a (y)}{\partial δ} = - \frac{1}{2 \sqrt{2} (δ + 1) \sqrt{δ}} [δ (\sqrt{\frac{μ δ}{(δ + 1) y}} - \sqrt{\frac{(δ + 1) y}{μ δ}}) + 2 \sqrt{\frac{μ δ}{(δ + 1) y}}], \\ \frac{\partial^{2} a (y)}{\partial μ \partial y} = \frac{\sqrt{δ}}{4 \sqrt{2} μ y} (\sqrt{\frac{μ δ}{(δ + 1) y}} - \sqrt{\frac{(δ + 1) y}{μ δ}}), & \frac{\partial^{2} a (y)}{\partial δ \partial y} = \frac{1}{2 \sqrt{2} (δ + 1) \sqrt{δ} y} [δ (\sqrt{\frac{μ δ}{(δ + 1) y}} + \sqrt{\frac{(δ + 1) y}{μ δ}}) + 2 \sqrt{\frac{μ δ}{(δ + 1) y}}], \\ \frac{\partial^{2} a (y)}{\partial μ^{2}} = \frac{\sqrt{δ}}{4 \sqrt{2} μ^{2}} (3 \sqrt{\frac{(δ + 1) y}{μ δ}} + \sqrt{\frac{μ δ}{(δ + 1) y}}), & \frac{\partial^{3} a (y)}{\partial μ^{2} \partial y} = - \frac{\sqrt{δ}}{8 \sqrt{2} μ^{2} y} (\sqrt{\frac{μ δ}{(δ + 1) y}} - 3 \sqrt{\frac{(δ + 1) y}{μ δ}}), \\ \frac{\partial^{2} a (y)}{\partial δ^{2}} = \frac{1}{4 \sqrt{2} {(δ + 1)}^{2} \sqrt{δ}} [δ (\sqrt{\frac{μ δ}{(δ + 1) y}} - \sqrt{\frac{(δ + 1) y}{μ δ}}) + 4 \sqrt{\frac{μ δ}{(δ + 1) y}}], & \frac{\partial^{3} a (y)}{\partial δ^{2} \partial y} = - \frac{1}{8 \sqrt{2} {(δ + 1)}^{2} \sqrt{δ} y} [(δ + 4) \sqrt{\frac{μ δ}{(δ + 1) y}} + δ \sqrt{\frac{(δ + 1) y}{μ δ}}], \\ \frac{\partial^{2} a (y)}{\partial δ \partial μ} = - \frac{1}{4 \sqrt{2} (δ + 1) \sqrt{δ} μ} [δ (\sqrt{\frac{(δ + 1) y}{μ δ}} - \sqrt{\frac{μ δ}{(δ + 1) y}}) + 2 \sqrt{\frac{μ δ}{(δ + 1) y}}], & \frac{\partial^{3} a (y)}{\partial δ \partial μ \partial y} = \frac{1}{8 \sqrt{2} (δ + 1) \sqrt{δ} μ y} [(δ + 2) \sqrt{\frac{μ δ}{(δ + 1) y}} - δ \sqrt{\frac{(δ + 1) y}{μ δ}}] . \end{matrix}

(A1)

Moreover,

f (y_{t}; μ_{t}, δ | F_{t - 1}) = ϕ [a (y_{t}; μ_{t}, δ)] \partial a (y_{t}; μ_{t}, δ) / \partial y_{t}

and

ℓ_{t} = ℓ_{t} (δ, β, η, ϕ, θ) = log f (y_{t}; μ_{t}, δ | F_{t - 1}) = log \{ϕ [a (y_{t}; μ_{t}, δ)]\} + log [\frac{\partial a (y_{t}; μ_{t}, δ)}{\partial y_{t}}] .

The first-order partial derivatives of

ℓ_{t}

with respect to parameters are given by:

\frac{\partial ℓ_{t}}{\partial δ} = - a (y_{t}) \frac{\partial a (y_{t})}{\partial δ} + [\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 1} \frac{\partial^{2} a (y_{t})}{\partial δ \partial y_{t}}, \frac{\partial ℓ_{t}}{\partial γ} = \{- a (y_{t}) \frac{\partial a (y_{t})}{\partial μ_{t}} +[\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 1} \frac{\partial^{2} a (y_{t})}{\partial μ_{t} \partial y_{t}}} \frac{\partial μ_{t}}{\partial γ},

where

γ \in {β_{k}, η, ϕ_{k}, θ_{l}}

,

k \in {1, \dots, p}

, and

l \in {1, \dots, q}

. The observed Fisher information matrix is defined as:

\hat{J} (δ, β, η, ϕ, θ) = - \partial^{2} ℓ_{t} / \partial u \partial v,

where second derivatives of

ℓ_{t}

with respect to parameters are stated as:

\begin{matrix} \frac{\partial^{2} ℓ_{t}}{\partial δ^{2}} = - {[\frac{\partial a (y_{t})}{\partial δ}]}^{2} - a (y_{t}) \frac{\partial^{2} a (y_{t})}{\partial δ^{2}} - [\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 2} {[\frac{\partial^{2} a (y_{t})}{\partial δ \partial y_{t}}]}^{2} + [\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 1} \frac{\partial^{3} a (y_{t})}{\partial δ^{2} \partial y_{t}}, \\ \frac{\partial^{2} ℓ_{t}}{\partial γ^{2}} = \{- a (y_{t}) \frac{\partial a (y_{t})}{\partial μ_{t}} +[\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 1} \frac{\partial^{2} a (y_{t})}{\partial μ_{t} \partial y_{t}}} \frac{\partial^{2} μ_{t}}{\partial γ^{2}} \\ + \{- {[\frac{\partial a (y_{t})}{\partial μ_{t}}]}^{2} - a (y_{t}) \frac{\partial^{2} a (y_{t})}{\partial μ_{t}^{2}} -[\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 2} {[\frac{\partial^{2} a (y_{t})}{\partial μ_{t} \partial y_{t}}]}^{2} + [\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 1} \frac{\partial^{3} a (y_{t})}{\partial μ_{t}^{2} \partial y_{t}}} {(\frac{\partial μ_{t}}{\partial γ})}^{2}, \\ \frac{\partial^{2} ℓ_{t}}{\partial δ \partial γ} = \{- \frac{\partial a (y_{t})}{\partial δ} \frac{\partial a (y_{t})}{\partial μ_{t}} - a (y_{t}) \frac{\partial^{2} a (y_{t})}{\partial δ \partial μ_{t}} -[\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 2} \frac{\partial^{2} a (y_{t})}{\partial δ \partial y_{t}} \frac{\partial^{2} a (y_{t})}{\partial μ_{t} \partial y_{t}} + [\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 1} \frac{\partial^{3} a (y_{t})}{\partial μ_{t} \partial y_{t}}} \frac{\partial μ_{t}}{\partial γ}, \\ \frac{\partial^{2} ℓ_{t}}{\partial γ \partial γ^{'}} = \{- a (y_{t}) \frac{\partial a (y_{t})}{\partial μ_{t}} +[\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 1} \frac{\partial^{2} a (y_{t})}{\partial μ_{t} \partial y_{t}}} \frac{\partial^{2} μ_{t}}{\partial γ \partial γ^{'}} \\ + \{- {[\frac{\partial a (y_{t})}{\partial μ_{t}}]}^{2} - a (y_{t}) \frac{\partial^{2} a (y_{t})}{\partial μ_{t}^{2}} -[\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 2} \frac{\partial^{2} a (y_{t})}{\partial μ_{t} \partial y_{t}} \frac{\partial^{2} a (y_{t})}{\partial γ^{'} \partial y_{t}} + [\frac{\partial a (y_{t})}{\partial y_{t}}]^{- 1} \frac{\partial^{3} a (y_{t})}{\partial μ_{t}^{2} \partial y_{t}}} \frac{\partial μ_{t}}{\partial γ} \frac{\partial μ_{t}}{\partial γ^{'}}, \end{matrix}

where

γ \in {β_{k}, η, ϕ_{k}, θ_{l}}

and

γ^{'} \in {β_{r}, η, ϕ_{r}, θ_{m}}

, for

k, r \in {1, \dots, p}

and

l, m \in {1, \dots, q}

. Here, the partial derivatives of

a (y_{t})

are presented in (A1). By using (12), the partial derivatives of

μ_{t}

with respect to parameters are expressed, for

k \in {1, \dots, p}

and

l \in {1, \dots, q}

, as:

\begin{matrix} \frac{\partial μ_{t}}{\partial η} = \frac{1}{g^{'} (μ_{t})}, & \frac{\partial μ_{t}}{\partial β_{k}} = \frac{1}{g^{'} (μ_{t})} \{x_{t k} + \sum_{i = 1}^{p} ϕ_{i} [g (y_{t - i}) - x_{(t - i) k}]\}, \\ \frac{\partial μ_{t}}{\partial ϕ_{k}} = \frac{1}{g^{'} (μ_{t})} [g (y_{t - k}) - x_{t - k}^{⊤} β], & \frac{\partial μ_{t}}{\partial θ_{l}} = \frac{1}{g^{'} (μ_{t})} [g (y_{t - l}) - α_{t - l}], \\ \frac{\partial^{2} μ_{t}}{\partial γ \partial η} = - \frac{g^{''} (μ_{t})}{{[g^{'} (μ_{t})]}^{2}} \frac{\partial μ_{t}}{\partial γ}, γ \in {β_{k}, η, ϕ_{k}, θ_{l}}, & \frac{\partial^{2} μ_{t}}{\partial ξ^{2}} = - \frac{g^{''} (μ_{t})}{g^{'} (μ_{t})} {(\frac{\partial μ_{t}}{\partial ξ})}^{2}, ξ \in {β_{k}, ϕ_{k}, θ_{l}}, \\ \frac{\partial^{2} μ_{t}}{\partial θ_{l} \partial ζ} = - \frac{g^{''} (μ_{t})}{g^{'} (μ_{t})} \frac{\partial μ_{t}}{\partial θ_{l}} \frac{\partial μ_{t}}{\partial ζ}, ζ \in {β_{k}, ϕ_{k}}, & \frac{\partial^{2} μ_{t}}{\partial ϕ_{k} \partial β_{k}} = - \frac{g^{''} (μ_{t})}{g^{'} (μ_{t})} \frac{\partial μ_{t}}{\partial ϕ_{k}} \frac{\partial μ_{t}}{\partial β_{k}} + \frac{1}{g^{'} (μ_{t})} [g (y_{t - k} - x_{(t - k) k})] . \end{matrix}

References

Marchant, C.; Leiva, V.; Cavieres, M.F.; Sanhueza, A. Air contaminant statistical distributions with application to PM10 in Santiago, Chile. Rev. Environ. Contam. Toxicol. 2013, 223, 1–31. [Google Scholar] [PubMed]
Cavieres, M.F.; Leiva, V.; Marchant, C.; Rojas, F. A methodology for data-driven decision making in the monitoring of particulate matter environmental contamination in Santiago of Chile. Rev. Environ. Contam. Toxicol. 2020, 250, 45–67. [Google Scholar] [PubMed]
Shumway, R.H.; Azari, A.S.; Pawitan, Y. Modeling mortality fluctuations in Los Angeles as functions of pollution and weather effects. Environ. Res. 1988, 45, 224–241. [Google Scholar] [CrossRef]
Shumway, R.H.; Stoffer, D.S. Time Series Analysis and Its Applications: With R Examples; Springer: New York, NY, USA, 2017. [Google Scholar]
Maior, V.Q.S.; Cysneiros, F.J.A. SYMARMA: A new dynamic model for temporal data on conditional symmetric distribution. Stat. Pap. 2016, 59, 75–97. [Google Scholar] [CrossRef]
Leiva, V.; Marchant, C.; Ruggeri, F.; Saulo, H. A criterion for environmental assessment using Birnbaum–Saunders attribute control charts. Environmetrics 2015, 26, 463–476. [Google Scholar] [CrossRef]
Birnbaum, Z.W.; Saunders, S.C. A new family of life distributions. J. Appl. Probab. 1969, 6, 319–327. [Google Scholar] [CrossRef]
Desmond, A. Stochastic models of failure in random environments. Can. J. Stat. 1985, 13, 171–183. [Google Scholar] [CrossRef]
Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions; Wiley: New York, NY, USA, 1995; pp. 651–663. [Google Scholar]
Leiva, V. The Birnbaum–Saunders Distribution; Academic Press: New York, NY, USA, 2016. [Google Scholar]
Ferreira, M.; Gomes, M.I.; Leiva, V. On an extreme value version of the Birnbaum–Saunders distribution. REVSTAT 2012, 10, 181–210. [Google Scholar]
Marchant, C.; Leiva, V.; Cysneiros, F.J.A.; Liu, S. Robust multivariate control charts based on Birnbaum–Saunders distributions. J. Stat. Comput. Simul. 2018, 88, 182–202. [Google Scholar] [CrossRef]
Marchant, C.; Leiva, V.; Christakos, G.; Cavieres, M.F. Monitoring urban environmental pollution by bivariate control charts: New methodology and case study in Santiago, Chile. Environmetrics 2019, 30, e2551. [Google Scholar] [CrossRef]
Puentes, R.; Marchant, C.; Leiva, V.; Figueroa-Zúñiga, J.I.; Ruggeri, F. Predicting PM2.5 and PM10 levels during critical episodes management in Santiago, Chile, with a bivariate Birnbaum–Saunders log-linear model. Mathematics 2021, 9, 645. [Google Scholar] [CrossRef]
Garcia-Papani, F.; Leiva, V.; Ruggeri, F.; Uribe-Opazo, M.A. Kriging with external drift in a Birnbaum-Saunders geostatistical model. Stoch. Environ. Res. Risk Assess. 2018, 32, 1517–1530. [Google Scholar] [CrossRef]
Garcia-Papani, F.; Leiva, V.; Uribe-Opazo, M.A.; Aykroyd, R.G. Birnbaum–Saunders spatial regression models: Diagnostics and application to chemical data. Chemom. Intell. Lab. Syst. 2018, 177, 114–128. [Google Scholar] [CrossRef] [Green Version]
Leiva, V.; Ferreira, M.; Gomes, M.I.; Lillo, C. Extreme value Birnbaum–Saunders regression models applied to environmental data. Stoch. Environ. Res. Risk Assess. 2016, 30, 1045–1058. [Google Scholar] [CrossRef]
Lillo, C.; Leiva, V.; Nicolis, O.; Aykroyd, R.G. L-moments of the Birnbaum–Saunders distribution and its extreme value version: Estimation, goodness of fit and application to earthquake data. J. Appl. Stat. 2018, 45, 187–209. [Google Scholar] [CrossRef]
Saulo, H.; Leiva, V.; Ziegelmann, F.A.; Marchant, C. A nonparametric method for estimating asymmetric densities based on skewed Birnbaum–Saunders distributions applied to environmental data. Stoch. Environ. Res. Risk Assess. 2013, 27, 1479–1491. [Google Scholar] [CrossRef]
Balakrishnan, N.; Kundu, D. Birnbaum–Saunders distribution: A review of models, analysis, and applications. Appl. Stoch. Model. Bus. Ind. 2019, 35, 4–49. [Google Scholar] [CrossRef] [Green Version]
Rieck, J.R.; Nedelman, J.R. A log-linear model for the Birnbaum–Saunders distribution. Technometrics 1991, 3, 51–60. [Google Scholar]
Dasilva, A.; Dias, R.; Leiva, V.; Marchant, C.; Saulo, H. Birnbaum–Saunders regression models: A comparative evaluation of three approaches. J. Stat. Comput. Simul. 2020, 90, 2552–2570. [Google Scholar] [CrossRef]
Fonseca, R.V.; Nobre, J.S.; Farias, R.B.A. Comparative inference and diagnostic in a reparametrized Birnbaum–Saunders regression model. Chilean J. Stat. 2016, 7, 17–30. [Google Scholar]
Leão, J.; Leiva, V.; Saulo, H.; Tomazella, V. Incorporation of frailties into a cure rate regression model and its diagnostics and application to melanoma data. Stat. Med. 2018, 37, 4421–4440. [Google Scholar] [CrossRef] [PubMed]
Desousa, M.; Saulo, H.; Leiva, V.; Santos-Neto, M. On a new mixture-based regression model: Simulation and application to data with high censoring. J. Stat. Comput. Simul. 2020, 90, 2861–2877. [Google Scholar] [CrossRef]
Mazucheli, M.; Leiva, V.; Alves, B.; Menezes, A.F.B. A new quantile regression for modeling bounded data under a unit Birnbaum–Saunders distribution with applications in medicine and politics. Symmetry 2021, 13, 682. [Google Scholar] [CrossRef]
Mazucheli, J.; Menezes, A.F.B.; Dey, S. The unit-Birnbaum–Saunders distribution with applications. Chilean J. Stat. 2018, 9, 47–57. [Google Scholar]
Reyes, J.; Arrue, J.; Leiva, V.; Martin-Barreiro, C. A new Birnbaum–Saunders distribution and its mathematical features applied to bimodal real-world data from environment and medicine. Mathematics 2021, 9, 1891. [Google Scholar] [CrossRef]
Sanchez, L.; Leiva, V.; Galea, M.; Saulo, H. Birnbaum–Saunders quantile regression and its diagnostics with application to economic data. Appl. Stoch. Model. Bus. Ind. 2021, 37, 53–73. [Google Scholar] [CrossRef]
Athayde, E.; Azevedo, A.; Barros, M.; Leiva, V. Failure rate of Birnbaum–Saunders distributions: Shape, change-point, estimation and robustness. Braz. J. Probab. Stat. 2019, 33, 301–328. [Google Scholar] [CrossRef] [Green Version]
Balakrishnan, N.; Gupta, R.; Kundu, D.; Leiva, V.; Sanhueza, A. On some mixture models based on the Birnbaum–Saunders distribution and associated inference. J. Stat. Plan. Inference 2011, 141, 2175–2190. [Google Scholar] [CrossRef]
Santos-Neto, M.; Cysneiros, F.J.A.; Leiva, V.; Barros, M. On new parameterizations of the Birnbaum–Saunders distribution and its moments, estimation and application. REVSTAT 2014, 12, 247–272. [Google Scholar]
Leiva, V.; Santos-Neto, M.; Cysneiros, F.J.A.; Barros, M. Birnbaum–Saunders statistical modeling: A new approach. Stat. Model. 2014, 14, 21–48. [Google Scholar] [CrossRef]
Bhatti, C. The Birnbaum–Saunders autoregressive conditional duration model. Math. Comput. Simul. 2010, 80, 2062–2078. [Google Scholar] [CrossRef]
Fonseca, R.V.; Cribari-Neto, F. Bimodal Birnbaum–Saunders generalized autoregressive score model. J. Appl. Stat. 2014, 45, 2585–2606. [Google Scholar] [CrossRef]
Leiva, V.; Saulo, H.; Leão, J.; Marchant, C. A family of autoregressive conditional duration models applied to financial data. Comput. Stat. Data Anal. 2014, 79, 175–191. [Google Scholar] [CrossRef]
Rahul, T.; Balakrishnan, N.; Balakrishna, N. Time series with Birnbaum–Saunders marginal distributions. Appl. Stoch. Model. Bus. Ind. 2018, 34, 562–581. [Google Scholar] [CrossRef]
Saulo, H.; Leão, J.; Leiva, V.; Aykroyd, R.G. Birnbaum–Saunders autoregressive conditional duration models applied to high-frequency financial data. Stat. Pap. 2019, 46, 1021–1042. [Google Scholar] [CrossRef] [Green Version]
Saulo, H.; Leão, J.; Santos-Neto, M. Discussion of “Birnbaum–Saunders distribution: A review of models, analysis, and applications” by N. Balakrishnan and Debasis Kundu. Appl. Stoch. Model. Bus. Ind. 2019, 35, 118–121. [Google Scholar] [CrossRef] [Green Version]
Leiva, V.; Saulo, H.; Souza, R.; Aykroyd, R.G.; Vila, R. A new BISARMA time series model for forecasting mortality using weather and particulate matter data. J. Forecast. 2021, 40, 346–364. [Google Scholar] [CrossRef]
Benjamin, M.A.; Rigby, R.A.; Stasinopoulos, D.M. Generalized autoregressive moving average models. J. Am. Stat. Assoc. 2003, 98, 214–223. [Google Scholar] [CrossRef]
Rocha, A.V.; Cribari-Neto, F. Beta autoregressive moving average models. TEST 2009, 18, 529–545. [Google Scholar] [CrossRef]
Santos-Neto, M.; Cysneiros, F.J.A.; Leiva, V.; Barros, M. Reparameterized Birnbaum–Saunders regression models with varying precision. Electron. J. Stat. 2016, 2, 2825–2855. [Google Scholar] [CrossRef]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021. [Google Scholar]
Stasinopoulos, D.; Rigby, R. Generalized additive models for location, scale and shape (GAMLSS). J. Stat. Softw. 2007, 23, 1–46. [Google Scholar] [CrossRef] [Green Version]
Rinne, H. The Weibull Distribution; Chapman and Hall: London, UK, 2009. [Google Scholar]
Ventura, M.; Saulo, H.; Leiva, V.; Monsueto, S. Log-symmetric regression models: Information criteria, application to movie business and industry data with economic implications. Appl. Stoch. Model. Bus. Ind. 2019, 35, 963–977. [Google Scholar] [CrossRef]
Sales-Lérida, D.; Bello, A.J.; Sánchez-Alzola, A.; Martínez-Jiménez, P.M. An approximation for metal-oxide sensor calibration for air quality monitoring using multivariable statistical analysis. Sensors 2021, 21, 4781. [Google Scholar] [CrossRef]
Velasco, H.; Laniado, H.; Toro, M.; Leiva, V.; Lio, Y. Robust three-step regression based on comedian and its performance in cell-wise and case-wise outliers. Mathematics 2021, 8, 1259. [Google Scholar]
Aykroyd, R.G.; Leiva, V.; Marchant, C. Multivariate Birnbaum-Saunders distributions: Modelling and applications. Risks 2018, 6, 21. [Google Scholar] [CrossRef] [Green Version]
Saulo, H.; Dasilva, A.; Leiva, V.; Sanchez, L.; de la Fuente-Mella, H. Log-symmetric quantile regression models. Stat. Neerl. 2022, in press. [Google Scholar] [CrossRef]
Huerta, M.; Leiva, V.; Liu, S.; Rodriguez, M.; Villegas, D. On a partial least squares regression model for asymmetric data with a chemical application in mining. Chemom. Intell. Lab. Syst. 2019, 190, 55–68. [Google Scholar] [CrossRef]
Rodriguez, M.; Leiva, V.; Huerta, M.; Lillo, M.; Tapia, A.; Ruggeri, F. An asymmetric area model-based approach for small area estimation applied to survey data. REVSTAT 2021, 19, 399–420. [Google Scholar]
Costa, E.; Santos-Neto, M.; Leiva, V. Optimal sample size for the Birnbaum–Saunders distribution under decision theory with symmetric and asymmetric loss functions. Symmetry 2021, 13, 926. [Google Scholar] [CrossRef]
Martin-Barreiro, C.; Ramirez-Figueroa, J.A.; Nieto, A.B.; Leiva, V.; Martin-Casado, A.; Galindo-Villardón, M.P. A new algorithm for computing disjoint orthogonal components in the three-way Tucker model. Mathematics 2021, 9, 203. [Google Scholar]
Martin-Barreiro, C.; Ramirez-Figueroa, J.A.; Cabezas, X.; Leiva, V.; Galindo-Villardón, M.P. Disjoint and functional principal component analysis for infected cases and deaths due to COVID-19 in South American countries with sensor-related data. Sensors 2021, 21, 4094. [Google Scholar] [CrossRef] [PubMed]

Figure 1.

RBS (μ, δ)

PDFs for

μ = 1

fixed (a) and for

δ = 50

fixed (b).

Figure 1.

RBS (μ, δ)

PDFs for

μ = 1

fixed (a) and for

δ = 50

fixed (b).

Figure 2. Empirical bias (a), variance (b) and MSE (c) of

\hat{δ}

with simulated data from the RBSARMAX(1,1,1) model.

Figure 2. Empirical bias (a), variance (b) and MSE (c) of

\hat{δ}

with simulated data from the RBSARMAX(1,1,1) model.

Figure 3. Mortality (a), temperature (b), and PM (c) times series over 1970–1979 in Los Angeles, CA, USA.

Figure 4. Histograms, scatterplots, and correlation coefficients of the variables: Mortality (

M_{t}

), temperature (

X_{1 t}

), and PM levels (

X_{2 t}

) for data over 1970–1979 in Los Angeles, CA, USA. Note that “***” indicates that such a correlation is statistically significant at 1%.

Figure 4. Histograms, scatterplots, and correlation coefficients of the variables: Mortality (

M_{t}

), temperature (

X_{1 t}

), and PM levels (

X_{2 t}

) for data over 1970–1979 in Los Angeles, CA, USA. Note that “***” indicates that such a correlation is statistically significant at 1%.

Figure 5. Boxplots for the variables mortality

M_{t}

(a), temperature

X_{1 t}

(b), and PM levels

X_{2 t}

(c) for data over 1970–1979 in Los Angeles, CA, USA.

Figure 5. Boxplots for the variables mortality

M_{t}

(a), temperature

X_{1 t}

(b), and PM levels

X_{2 t}

(c) for data over 1970–1979 in Los Angeles, CA, USA.

Figure 6. Charts of the ACF (a) and PACF (b) for the regression residuals with time-series data over the 10-year period (1970–1979) in Los Angeles County, CA, USA.

Figure 7. Plots of envelopes of GCS (a) and RQ residuals (b); and charts of ACF (c) and PACF (d) for the RBSARMAX (left) and ARMA (right) models.

Figure 8. Cardiovascular mortality series for LA, USA (gray) with fitted RBSARMAX (green) and ARMA (red) models.

Table 1. CML estimates for indicated

δ

, based on Monte Carlo simulation of the RBSARMAX(1,1,1) model.

Table 1. CML estimates for indicated

δ

, based on Monte Carlo simulation of the RBSARMAX(1,1,1) model.

n	$δ$	$\hat{δ}$
n	$δ$	Mean	Bias	Variance	MSE
100	8	8.4705	0.4705	1.5462	0.2334
	15	15.8429	0.8429	5.3862	0.7171
	25	26.3139	1.3139	14.8842	1.7304
	50	52.2676	2.2676	59.0348	5.1441
200	8	8.2414	0.2414	0.7216	0.0640
	15	15.4353	0.4353	2.5264	0.1926
	25	25.6816	0.6816	7.0042	0.4665
	50	51.1722	1.1722	27.9215	1.3750
500	8	8.0720	0.0720	0.2464	0.0072
	15	15.1332	0.1332	0.8625	0.0189
	25	25.2044	0.2044	2.3963	0.0425
	50	50.3276	0.3276	9.5864	0.1077

Table 2. CML estimates for indicated

ϕ, θ

based on Monte Carlo simulation of the RBSARMAX(1,1,1) model.

Table 2. CML estimates for indicated

ϕ, θ

based on Monte Carlo simulation of the RBSARMAX(1,1,1) model.

n	$ϕ$	$θ$	$\hat{ϕ}$				$\hat{θ}$
n	$ϕ$	$θ$	Mean	Bias	Variance	MSE	Mean	Bias	Variance	MSE
100	0.3	0.3	0.2957	−0.0043	0.0258	0.0258	0.2953	−0.0047	0.0285	0.0286
		0.5	0.3087	0.0087	0.0190	0.0191	0.4791	−0.0209	0.0207	0.0211
		0.7	0.3098	0.0098	0.0139	0.0140	0.6681	−0.0319	0.0142	0.0152
	0.5	0.3	0.4761	−0.0239	0.0149	0.0155	0.3114	0.0114	0.0188	0.0189
		0.5	0.4863	−0.0137	0.0127	0.0129	0.4946	−0.0054	0.0151	0.0152
		0.7	0.4855	−0.0145	0.0110	0.0112	0.6755	−0.0245	0.0122	0.0128
	0.7	0.3	0.6664	−0.0336	0.0084	0.0096	0.3171	0.0171	0.0136	0.0139
		0.5	0.6733	−0.0267	0.0080	0.0087	0.5005	0.0005	0.0119	0.0119
		0.7	0.6725	−0.0275	0.0074	0.0081	0.6717	−0.0283	0.0109	0.0117
200	0.3	0.3	0.2954	−0.0046	0.0143	0.0143	0.2980	−0.0020	0.0151	0.0151
		0.5	0.3004	0.0004	0.0083	0.0083	0.4913	−0.0087	0.0078	0.0079
		0.7	0.3046	0.0046	0.0066	0.0066	0.6818	−0.0182	0.0052	0.0055
	0.5	0.3	0.4853	−0.0147	0.0079	0.0082	0.3066	0.0066	0.0096	0.0096
		0.5	0.4897	−0.0103	0.0054	0.0055	0.4983	−0.0017	0.0060	0.0060
		0.7	0.4920	−0.0080	0.0048	0.0049	0.6852	−0.0148	0.0045	0.0048
	0.7	0.3	0.6811	−0.0189	0.0041	0.0045	0.3093	0.0093	0.0067	0.0068
		0.5	0.6841	−0.0159	0.0033	0.0036	0.5006	0.0006	0.0050	0.0050
		0.7	0.6868	−0.0132	0.0033	0.0034	0.6817	−0.0183	0.0044	0.0047
500	0.3	0.3	0.2958	−0.0042	0.0058	0.0058	0.3002	0.0002	0.0063	0.0063
		0.5	0.2978	−0.0022	0.0036	0.0036	0.4984	−0.0016	0.0032	0.0032
		0.7	0.3018	0.0018	0.0025	0.0025	0.6922	−0.0078	0.0017	0.0018
	0.5	0.3	0.4922	−0.0078	0.0030	0.0030	0.3034	0.0034	0.0040	0.0040
		0.5	0.4936	−0.0064	0.0023	0.0024	0.5011	0.0011	0.0024	0.0024
		0.7	0.4966	−0.0034	0.0018	0.0019	0.6936	−0.0064	0.0015	0.0016
	0.7	0.3	0.6916	−0.0084	0.0014	0.0015	0.3037	0.0037	0.0029	0.0029
		0.5	0.6927	−0.0073	0.0013	0.0014	0.5013	0.0013	0.0020	0.0020
		0.7	0.6965	−0.0035	0.0013	0.0013	0.6905	−0.0095	0.0015	0.0015

Table 3. Forecasting comparison statistics for indicated

ϕ, θ