Inference and Local Influence Assessment in a Multifactor Skew-Normal Linear Mixed Model

Najafi, Zeinolabedin; Zare, Karim; Mahmoudi, Mohammad Reza; Shokri, Soheil; Mosavi, Amir

doi:10.3390/math10152820

Open AccessArticle

Inference and Local Influence Assessment in a Multifactor Skew-Normal Linear Mixed Model

by

Zeinolabedin Najafi

¹,

Karim Zare

^1,*,†

,

Mohammad Reza Mahmoudi

^2,†

,

Soheil Shokri

³

and

Amir Mosavi

^4,5,6,7,*

¹

Department of Statistics, Marvdasht Branch, Islamic Azad University, Marvdasht 73711-13119, Iran

²

Department of Statistics, Faculty of Science, Fasa University, Fasa 74616-86131, Iran

³

Department of Statistics, Lahijan Branch, Islamic Azad University, Lahijan 44169-39515, Iran

⁴

Faculty of Civil Engineering, Technische Universität Dresden, 01069 Dresden, Germany

⁵

John von Neumann Faculty of Informatics, Obuda University, 1034 Budapest, Hungary

⁶

Institute of Information Society, University of Public Service, 1083 Budapest, Hungary

⁷

Institute of Information Engineering, Automation and Mathematics, Slovak University of Technology in Bratislava, 81243 Bratislava, Slovakia

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics 2022, 10(15), 2820; https://doi.org/10.3390/math10152820

Submission received: 6 July 2022 / Revised: 26 July 2022 / Accepted: 4 August 2022 / Published: 8 August 2022

(This article belongs to the Special Issue Mathematical and Computational Statistics and Their Applications)

Download

Browse Figures

Versions Notes

Abstract

:

This work considers a multifactor linear mixed model under heteroscedasticity in random-effect factors and the skew-normal errors for modeling the correlated datasets. We implement an expectation–maximization (EM) algorithm to achieve the maximum likelihood estimates using conditional distributions of the skew-normal distribution. The EM algorithm is also implemented to extend the local influence approach under three model perturbation schemes in this model. Furthermore, a Monte Carlo simulation is conducted to evaluate the efficiency of the estimators. Finally, a real data set is used to make an illustrative comparison among the following four scenarios: normal/skew-normal errors and heteroscedasticity/homoscedasticity in random-effect factors. The empirical studies show our methodology can improve the estimates when the model errors follow from a skew-normal distribution. In addition, the local influence analysis indicates that our model can decrease the effects of anomalous observations in comparison to normal ones.

Keywords:

EM algorithm; expectation–maximization algorithm; heteroscedasticity; Monte Carlo simulation; random effects; skew-normal; variance components; applied mathematics; Linear mixed models

MSC:

62F10; 62J05

1. Introduction

Linear mixed models (LMMs) are useful for the statistical analysis of correlated datasets such as longitudinal data. For simplicity, it is usually assumed that both random effects and random errors follow a normal distribution. Under these restrictions, there are several proposals for estimating LMM parameters in the literature; among these, one can refer to Harvill [1], Fellner [2], Khuri et al. [3] and Wu et al. [4]. For example, Harvill [1] and Fellner [2] obtained the maximum likelihood (ML) estimates of parameters in a multifactor normal LMM under heteroscedasticity of random-effect factors. They showed the estimates, in addition to being consistent, were asymptotically normally distributed. However, as pointed out by Zhong and Davidian [5], using these estimation methods may cause invalid statistical inferences when the data are asymmetric. Therefore, many authors have criticized the common use of the normality assumption (see, e.g., [6,7,8,9,10]).

From a practical perspective, the most frequently used method to achieve normality is to apply a transformation on the variables. Although such methods may provide suitable empirical results, they should not be used if a more reasonable theoretical model is available [11]. Thus, it is of great interest to develop estimation methods in statistical models with flexible distribution assumptions. In this sense, one of the simplest and applicable distributions that provides skewness and contains a normal distribution is the skew-normal distribution introduced by Azzalini [12]. Although it is old, it is still used in statistical models due to its flexibility and simplicity, especially in complex models where it is difficult to use new generalized skewed distributions. Some new works in this research area are [13,14]. In the literature, many authors have studied inferences on parameters in LMMs (with only one random-effect factor) where the random effects or the model random errors follow asymmetric and non-normal distributions (see, e.g., [5,7,8,9,15,16,17,18,19,20,21,22]). Verbeke and Lesaffre [15], through an extended simulation when the random-effect distribution was misspecified, showed that the standard errors of the parameters needed to be corrected. Arellano-Valle et al. [18] considered an LMM when both the random-effect factor and the random errors follow the SN distribution. Due to complexity, they derived marginal distributions and implemented an EM algorithm to obtain the ML estimates. They indicated that the estimates had more efficiency than normal estimates when the normality assumption was violated. Lachos et al. [19] presented an LMM when the random effects followed a multivariate SN independent distribution. They derived the ML estimates of the parameters based on an efficient EM algorithm. They also investigated a technique to predict the response variable. Kheradmandi and Rasekh [20] followed [18] and obtained the ML estimates in the LMM when the fixed effects were measured with non-negligible errors. In this work, a multifactor SN–LMM was considered with different variances for the random-effect factors to show how heteroscedasticity, as a freer assumption in random-effect factors, can improve our statistical results. Here, a SN–LMM is an LMM with SN distribution in the model random errors.

A diagnostic analysis is a necessary step in statistical analysis after parameter estimation. The local influence approach, a pioneering work of Cook [23], is one of the most important diagnostic tools for assessing the stability of the estimation parameters. Due to the complicated calculations of Cook’s local influence approach in statistical models with incomplete data, Zhu and Lee [24] developed Cook’s approach to these models based on the conditional expectation of a complete-data log likelihood at the E-step of the EM algorithm. To see some applications of Zhu and Lee’s approach, one can refer to [25,26,27,28]. Local influence analysis for LMMs based on SN distribution had been studied by Bolfarine [29], Montenegro et al. [30], and Zeller et al. [31]. All these works considered local influence diagnostics for an LMM based on SN distribution in the random-effect factor. Furthermore, all perturbation schemes were considered the same. In this work, besides parameter estimation, we developed Zhu and Lee’s local influence diagnostic measures for the LMM under different assumptions on random effects and random errors that were mentioned before. A different perturbation scheme was also considered concerning the previous works. The rest of the paper is structured as follows. In Section 2, we present the model and obtain distributional facts about the variables that will help us use the EM algorithm. In Section 3, parameter estimation and random effects prediction are derived via the EM algorithm. In Section 4, the local influence diagnostic measures for the LMM are extended based on the methodology proposed by Zhu and Lee [24]. The basic building blocks of three perturbation schemes are also derived. In Section 5, a simulation study is performed to compare the normal LMM and the SN-LMM, and then a real dataset is analyzed to perform an illustrative comparison. Discussion and conclusions of this paper are given in Section 6.

2. The Model Definition

Consider the following LMM:

Y = X β + U b + ε = X β + \sum_{i = 1}^{m} U_{i} b_{i} + ε,

(1)

where

β

is a

p \times 1

vector of parameters, which are fixed effects;

X

and

U = [U_{1} |U_{2}| \dots | U_{m}]

are

n \times p

and

n \times q

known design matrices, respectively, where

U_{i}

is an

n \times q_{i}

design matrix of the random-effect factor

i

;

b = {(b_{1}^{⊤}, b_{2}^{⊤}, \dots, b_{m}^{⊤})}^{⊤}

, where

b_{i}

is a

q_{i} \times 1

vector of unobservable random effects from

N_{q} (0, σ_{i}^{2} I_{q_{i}})

,

i = 1, \dots, m

; ε is an

n \times 1

vector of unobservable random errors from

S N_{n} (0, σ^{2} I_{n}, λ_{ε})

, that is

n

-dimensional SN distribution with skewness vector

λ_{ε} \in ℝ^{n}

. The variances

σ^{2} and σ_{i}^{2}, i = 1, \dots, m

are named variance components. We assume that

b_{i}

,

i = 1, \dots, m

, and

ε

are mutually independent. One may also write

b \sim N_{q} (0, σ^{2} Σ)

, where

Σ

is a block diagonal matrix with the

i

th block being

γ_{i} I_{q_{i}}

, for

γ_{i} = σ_{i}^{2} / σ^{2}

, that are called the ratio of variance components.

The above assumptions conclude that

Y | b \sim S N_{n} (X β + U b, σ^{2} I_{n}, λ_{ε})

and so the joint distribution of the vectors

Y

and

b

is obtained as follows:

f (y, b) = 2 ϕ_{q} (b | 0, σ^{2} Σ) ϕ_{n} (y | X β + U b, σ^{2} I_{n})

\times Φ (λ_{ε}^{⊤} (y - X β - U b) / σ)

(2)

where

ϕ_{n} (| μ, Σ)

stands for the

n

-variate normal density function with mean

μ

and covariance matrix

Σ

and

Φ (\cdot)

represents the cumulative distribution function of

N (0, 1)

.

From (2), the marginal density of

Y

would be as follows:

f (y) = \int_{ℝ^{q}}^{} 2 ϕ_{q} (b | Σ U^{⊤} V^{- 1} (y - X β), σ^{2} Σ T) ϕ_{n} (y | X β, σ^{2} V)

\times Φ (λ_{ε}^{⊤} (y - X β - U b) / σ) d b

= 2 ϕ_{n} (y | X β, σ^{2} V) E (Φ (λ_{ε}^{⊤} (y - X β - U B) / σ))

= 2 ϕ_{n} (y | X β, σ^{2} V) Φ (\frac{λ_{ε}^{⊤} V^{- 1} (y - X β)}{σ δ}); y \in ℝ^{n}

where

V = I_{n} + U Σ U^{⊤} = I_{n} + \sum_{i = 1}^{m} γ_{i} U_{i} U_{i}^{⊤}

,

T = {(I_{q} + U^{⊤} U Σ)}^{- 1}

so that

Σ T

is a symmetric, non-singular matrix;

B ~ N_{q} (Σ U^{⊤} V^{- 1} (y - X β), σ^{2} Σ T)

; and where

δ^{2} = 1 + λ_{ε}^{⊤} (I_{n} - V^{- 1}) λ_{ε}

. Therefore,

Y

has a generalized SN distribution.

Based on the distribution of

Y

, the log-likelihood function of

θ = {(β^{⊤}, σ^{2}, γ^{⊤}, λ_{ε}^{⊤})}^{⊤}

is given by

l (θ; X, y) \propto - \frac{n}{2} \log (σ^{2}) - \frac{1}{2} \log (|V|) - \frac{1}{2 σ^{2}} [{(y - X β)}^{⊤} V^{- 1} (y - X β)] + \log Φ (\frac{λ_{ε}^{⊤} V^{- 1} (y - X β)}{σ δ}),

(3)

where

γ = {(γ_{1}, \dots, γ_{m})}^{⊤}

.

As can be seen, there is no obvious solution for the direct maximization of Equation (3), and the likelihood function has to be maximized numerically. Using numerical approaches, besides the high computational costs and lack of robustness to the starting values, causes some problems for the maximization due to the term

\log (Φ (\cdot))

. Therefore, corresponding to previous studies in the field of using the skew family in modeling (see, e.g., [8,9,18,19,20,22,27]), an EM algorithm was applied to reduce computation complexity with high efficiency.

The EM algorithm, introduced by Dempster et al. [32], is a famous iterative algorithm for ML estimation in incomplete data models. One of the major reasons for its popularity is the M-step that includes maximization of a likelihood function based on complete data, which is often computationally simple. It is also not very sensitive to the starting parameter values.

Let

T N (ξ, η^{2}; a, b)

represent the truncated normal distribution with parameters

ξ

and

η^{2}

and truncation range

(a, b)

. If the distribution of

y

is written as

f (y) = 2 ϕ_{n} (y | X β, σ^{2} V) \int_{0}^{+ \infty} ϕ (z | λ_{ε}^{⊤} V^{- 1} (y - X β), σ^{2} δ^{2}) d z; y \in ℝ^{n},

the joint distribution of

Y

and the missing variable

Z

will be

f (y, z) = 2 ϕ_{n} (y | X β, σ^{2} V) ϕ (z | λ_{ε}^{⊤} V^{- 1} (y - X β), σ^{2} δ^{2}); y \in ℝ^{n}, z > 0 .

Based on the above joint distribution, the conditional distribution of

Z| y

is obtained as

f (z| y) = \frac{ϕ (z | λ_{ε}^{⊤} V^{- 1} (y - X β), σ^{2} δ^{2})}{\int_{0}^{+ \infty} ϕ (z | λ_{ε}^{⊤} V^{- 1} (y - X β), σ^{2} δ^{2}) d z}; z > 0 .

Hence,

Z| y \sim T N (λ_{ε}^{⊤} V^{- 1} (y - X β), σ^{2} δ^{2}; 0, + \infty)

. Now, with the help of the properties of the truncated normal distribution [33], we have

E (Z| y) = λ_{ε}^{⊤} V^{- 1} (y - X β) + σ δ W (\frac{λ_{ε}^{⊤} V^{- 1} (y - X β)}{σ δ}),

(4)

and

E (Z^{2}| y) = {(λ_{ε}^{⊤} V^{- 1} (y - X β))}^{2} + σ^{2} δ^{2}

+ σ δ λ_{ε}^{⊤} V^{- 1} (y - X β) W (\frac{λ_{ε}^{⊤} V^{- 1} (y - X β)}{σ δ}),

(5)

To predict the random effects, we first need the conditional distribution of

b| y

given by

\begin{array}{l} f (b| y) & = \frac{f (y, b)}{f (y)} \\ = \frac{1}{α} ϕ_{q} (b | Σ U^{⊤} V^{- 1} (y - X β), σ^{2} Σ T) Φ (λ_{ε}^{⊤} (y - X β - U b) / σ), \end{array}

where

α = Φ (\frac{λ_{ε}^{⊤} V^{- 1} (y - X β)}{σ δ}) .

Now, the conditional log-likelihood function of

b

––given

θ

,

X

and

y

––is obtained as

\begin{array}{l} l^{*} (b; θ, X, y) & \propto - \frac{q}{2} \log (σ^{2}) - \frac{1}{2} \log (|Σ T|) \\ - \frac{1}{2 σ^{2}} {(b - Σ U^{⊤} V^{- 1} (y - X β))}^{⊤} {(Σ T)}^{- 1} (b - Σ U^{'} V^{- 1} (y - X β)) \\ + \log (Φ (λ_{ε}^{⊤} (y - X β - U b) / σ)) - \log (α) . \end{array}

As seen again,

l^{*}

like

l

has the term

\log (Φ (\cdot))

and so, to predict

b

, based on the ML method, we use the EM algorithm.

To do this, we first rewrote the conditional distribution of

b| y

as

\begin{array}{l} f (b| y) & = \frac{1}{α} ϕ_{q} (b | Σ U^{⊤} V^{- 1} (y - X β), σ^{2} Σ T) \\ \times \int_{0}^{+ \infty} ϕ (z^{*} | λ_{ε}^{⊤} (y - X β - U b), σ^{2}) d z^{*}; b \in ℝ^{q}, y \in ℝ^{n} . \end{array}

So, the conditional distribution of

b

and the missing variable

Z^{*}

given

y

is equal to

\begin{array}{l} f (b, z^{*}| y) \\ = \frac{1}{α} ϕ_{q} (b | Σ U^{⊤} V^{- 1} (y - X β), σ^{2} Σ T) \\ \times ϕ (z^{*} | λ_{ε}^{⊤} (y - X β - U b), σ^{2}); b \in ℝ^{q}, y \in ℝ^{n}, z^{*} > 0 . \end{array}

(6)

The above equation concludes that the conditional distribution of the missing variable

Z^{*}

given

b

and

y

is equal to

f (z^{*}| b, y) = \frac{ϕ (z^{*} | λ_{ε}^{⊤} (y - X β - U b), σ^{2})}{\int_{0}^{+ \infty} ϕ (z^{*} | λ_{ε}^{⊤} (y - X β - U b), σ^{2}) d z^{*}}, z^{*} > 0 .

As seen,

Z^{*}| (b, y) \sim T N (λ_{ε}^{⊤} (y - X β - U b), σ^{2}; 0, + \infty)

and hence,

E (Z^{*}| (b, y)) = λ_{ε}^{⊤} (y - X β - U b) + σ W (λ_{ε}^{⊤} (y - X β - U b) / σ),

(7)

and

\begin{array}{l} E (Z^{*}^{2}| (b, y)) & = {(λ_{ε}^{⊤} (y - X β - U b))}^{2} + σ^{2} \\ + σ λ_{ε}^{⊤} (y - X β - U b) W (λ_{ε}^{⊤} (y - X β - U b) / σ) . \end{array}

(8)

3. Parameter Estimation via the EM Algorithm

Now, let

y = {(y_{1}, \dots, y_{n})}^{'}

be a vector of observed responses and

z

be a missing observation. Then, the complete log-likelihood function associated with

y_{c} = {(y^{'}, z)}^{’}

will be obtained as

\begin{array}{l} l_{c} (θ; X, y_{c}) & = C - \frac{n}{2} \log (σ^{2}) - \frac{1}{2} \log (|V|) \\ - \frac{1}{2 σ^{2}} [{(y - X β)}^{⊤} V^{- 1} (y - X β)] \\ - \frac{1}{2} \log (σ^{2} δ^{2}) - \frac{1}{2 σ^{2} δ^{2}} {(z - λ_{ε}^{⊤} V^{- 1} (y - X β))}^{2} \end{array}

where

C

does not depend on unknown parameters.

If

{\hat{θ}}^{(r)} = {({\hat{β}}^{⊤ (r)}, {\hat{σ^{2}}}^{(r)}, {\hat{γ}}^{⊤ (r)}, {\hat{λ_{ε}}}^{⊤ (r)})}^{⊤}

is the estimate in the

r

th iteration, then the expected complete log-likelihood function would be

\begin{array}{l} Q (θ| {\hat{θ}}^{(r)}) \\ = E [l_{c} (θ; X, y_{c})| {\hat{θ}}^{(r)}] \propto - \frac{n}{2} \log (σ^{2}) - \frac{1}{2} \log (|V|) \\ - \frac{1}{2 σ^{2}} [{(y - X β)}^{⊤} V^{- 1} (y - X β)] - \frac{1}{2} \log (σ^{2} δ^{2}) \\ - \frac{1}{2 σ^{2} δ^{2}} ({\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} λ_{ε}^{⊤} V^{- 1} (y - X β) + {(λ_{ε}^{⊤} V^{- 1} (y - X β))}^{2}) \end{array}

(9)

where

{\hat{z}}^{(r)} = E [Z| y; {\hat{θ}}^{(r)}]

and

{\hat{z^{2}}}^{(r)} = E [Z^{2}| y; {\hat{θ}}^{(r)}]

are calculated by substituting

{\hat{θ}}^{(r)}

in Equations (4) and (5), respectively.

To obtain a new estimate

{\hat{θ}}^{(r + 1)}

, the M-step maximizes

Q (θ| {\hat{θ}}^{(r)})

with respect to

θ

. This was obtained as a solution of the following equations:

\frac{\partial Q (θ| {\hat{θ}}^{(r)})}{\partial β} = 0,

\frac{\partial Q (θ| {\hat{θ}}^{(r)})}{\partial σ^{2}} = 0,

and

\frac{\partial Q (θ| {\hat{θ}}^{(r)})}{\partial γ_{i}} = 0; i = 1, \dots, m .

If we define

h = V^{- 1} λ_{ε}

, then from the Equation (9), the estimation of

β

, is given by

\begin{array}{l} {\hat{β}}^{(r + 1)} & = {(X^{⊤} ({\hat{V}}^{(r)}^{- 1} + {\hat{h}}^{(r)} {\hat{h}}^{⊤ (r)} / {\hat{δ^{2}}}^{(r)}) X)}^{- 1} \\ \times (X^{⊤} ({\hat{V}}^{(r)}^{- 1} + {\hat{h}}^{(r)} {\hat{h}}^{⊤ (r)} / {\hat{δ^{2}}}^{(r)}) y - {\hat{z}}^{(r)} X^{⊤} {\hat{h}}^{(r)} / {\hat{δ^{2}}}^{(r)}) \end{array}

Before the estimation of variance components, we presented a realized value of the random effects. In a similar way for fixed effects (See Appendix A.1 for more details.), the ML prediction of

b

was given by

{\hat{b}}^{(r + 1)} = {\hat{Σ}}^{(r)} U^{⊤} {\hat{V}}^{(r)}^{- 1} [(I_{n} + \frac{1}{{\hat{δ^{2}}}^{(r)}} {\hat{λ_{ε}}}^{(r)} {\hat{λ_{ε}}}^{⊤ (r)} {\hat{V}}^{(r)}^{- 1}) (y - X {\hat{β}}^{(r)}) - \frac{1}{{\hat{δ^{2}}}^{(r)}} {\hat{z^{*}}}^{(r)} {\hat{λ_{ε}}}^{(r)}] .

where

{\hat{z^{*}}}^{(r)} = E [Z^{*}| ({\hat{b}}^{(r)}, y); {\hat{θ}}^{(r)}]

and

{\hat{z^{* 2}}}^{(r)} = E [Z^{*}^{2}| ({\hat{b}}^{(r)}, y); {\hat{θ}}^{(r)}]

were calculated by substituting

{\hat{θ}}^{(r)}

and

{\hat{b}}^{(r)}

into Equations (7) and (8), respectively.

Then, for the

i

th random-effect factor, we have

{\hat{b_{i}}}^{(r + 1)} = {\hat{γ_{i}}}^{(r)} U_{i}^{⊤} {\hat{V}}^{(r)}^{- 1} [(I_{n} + \frac{1}{{\hat{δ^{2}}}^{(r)}} {\hat{λ_{ε}}}^{(r)} {\hat{λ_{ε}}}^{⊤ (r)} {\hat{V}}^{(r)}^{- 1}) (y - X {\hat{β}}^{(r)}) - \frac{1}{{\hat{δ^{2}}}^{(r)}} {\hat{z^{*}}}^{(r)} {\hat{λ_{ε}}}^{(r)}] .

From Equation (9), the estimates of variance components were derived as

{\hat{σ^{2}}}^{(r + 1)} = \frac{1}{n + 1} [{(y - X {\hat{β}}^{(r)})}^{⊤} ({\hat{V}}^{(r)}^{- 1} + {\hat{h}}^{(r)} {\hat{h}}^{⊤ (r)} / {\hat{δ^{2}}}^{(r)}) (y - X {\hat{β}}^{(r)}) + ({\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} {\hat{h}}^{⊤ (r)} (y - X {\hat{β}}^{(r)})) / {\hat{δ^{2}}}^{(r)}] .

and

\begin{array}{l} {\hat{σ_{i}^{2}}}^{(r + 1)} = \frac{{\hat{γ_{i}}}^{(r)}^{2}}{q_{i} - tr (T_{i i})} & {{(y - X {\hat{β}}^{(r)})}^{⊤} {\hat{V}}^{(r)}^{- 1} U_{i} U_{i}^{⊤} {\hat{V}}^{(r)}^{- 1} (y - X {\hat{β}}^{(r)}) \\ - \frac{{\hat{σ^{2}}}^{(r)}}{{\hat{δ^{2}}}^{(r)}} {\hat{h}}^{⊤ (r)} U_{i} U_{i}^{⊤} h \\ + \frac{1}{{\hat{δ^{2}}}^{(r)}^{2}} [{\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} {\hat{h}}^{⊤ (r)} (y - X {\hat{β}}^{(r)}) \\ + {({\hat{h}}^{⊤ (r)} (y - X {\hat{β}}^{(r)}))}^{2}] {\hat{h}}^{(r)}^{’} U_{i} U_{i}^{’} {\hat{h}}^{(r)} \\ + \frac{2}{{\hat{δ^{2}}}^{(r)}} [{(y - X {\hat{β}}^{(r)})}^{⊤} {\hat{h}}^{(r)} {\hat{h}}^{⊤ (r)} U_{i} U_{i}^{⊤} {\hat{V}}^{(r)}^{- 1} (y - X {\hat{β}}^{(r)}) \\ - {\hat{z}}^{(r)} {\hat{h}}^{⊤ (r)} U_{i} U_{i}^{⊤} {\hat{V}}^{(r)}^{- 1} (y - X {\hat{β}}^{(r)})]} \end{array}

where

T_{i i}

is

i

th diagonal block of the matrix

T

(See Appendix A.2 for more details.). By substituting

{\hat{z}}^{(r)}

and

{\hat{z^{2}}}^{(r)}

with

{\hat{z^{*}}}^{(r)}

and

{\hat{z^{*}}}^{(r)}^{2}

, respectively, in the above equation, one can obtain another estimator for

σ_{i}^{2}

based on

{\hat{b_{i}}}^{(r + 1)}

similar to its corresponding estimator in the normal LMM. This estimator would be as follows:

{\tilde{σ_{i}^{2}}}^{(r + 1)} = \frac{1}{q_{i} - tr (T_{i i})} ({\hat{b_{i}}}^{(r + 1)} {\hat{b_{i}}}^{⊤ (r + 1)} - \frac{{\tilde{σ^{2}}}^{(r)}}{{\hat{δ}}^{(r)}} d_{i}^{⊤} d_{i}),

where

d_{i} = U_{i}^{⊤} {\hat{h}}^{(r)}

.

Finally, the ML estimates of skewness parameters would be

{\hat{λ_{ε}}}^{(r + 1)} = {argmax}_{λ_{ε}} (Q ({\hat{β}}^{(r + 1)}, {\hat{σ^{2}}}^{(r + 1)}, {\hat{γ}}^{(r + 1)}, λ_{ε}| {\hat{θ}}^{(r)})),

where

Q ({\hat{β}}^{(r + 1)}, {\hat{σ^{2}}}^{(r + 1)}, {\hat{γ}}^{(r + 1)}, λ_{ε}| {\hat{θ}}^{(r)})

is

Q (θ| {\hat{θ}}^{(r)})

evaluated at updated

{\hat{β}}^{(r + 1)}

,

{\hat{σ^{2}}}^{(r + 1)}

and

{\hat{γ}}^{(r + 1)}

.

The above algorithm stops when a reasonable convergence rule is satisfied (e.g.,

{\hat{θ}}^{(r + 1)} - {\hat{θ}}^{(r)} < 10^{- 6}

). A set of adequate starting values can be obtained by solving the normal LMM for

β

,

σ^{2}

and

σ_{i}^{2}

,

i = 1, \dots, m

and the sample skewness coefficient of the residuals or zero values for

λ_{ε}

. But, as recommended in the literature, the EM algorithm should be run several times with different starting values.

4. Local Influence Analysis

Cook’s local influence method was used to evaluate the influence of various minor model perturbations on the parameter estimates. Inspired by the general idea of the EM algorithm, Zhu and Lee [24] generalized the local influence diagnostic method to general statistical models with incomplete data based on a Q-function. Here, we briefly studied a natural extension of this procedure to SN–LMM. In this section, we assumed that the

γ_{i}

’s were known. If the

γ_{i}

’s were unknown, the ML estimates would have been placed back into

Σ

, so the vector

θ

would have been

θ = {(β^{⊤}, σ^{2}, λ_{ε}^{⊤})}^{⊤}

.

If we let

ω = (ω_{1}, ω_{2}, \dots, ω_{n})

be a n-dimensional vector of perturbations varying in open region

Ω \in ℝ^{n}

, the perturbed complete-data log-likelihood function would be denoted by

l_{c} (θ, ω; X, y_{c})

. It is assumed that there exists

w_{0}

, a vector of no perturbation, such that

l_{c} (θ, ω_{0}; X, y_{c}) = l_{c} (θ; X, y_{c})

for all

θ

. Let

Q (\hat{θ} (ω)| {\hat{θ}}^{(r)})

be the maximum value of the

Q (θ, ω| {\hat{θ}}^{(r)}) = E (l_{c} (θ, ω; X, y_{c}))

, where

\hat{θ} (ω)

denotes the ML estimate under

Q (θ, ω| {\hat{θ}}^{(r)})

. To evaluate the influence of minor perturbations on the ML estimate

\hat{θ}

, one may regard the Q-displacement function, defined as follows:

Q D (ω) = 2 \{Q (\hat{θ}| {\hat{θ}}^{(r)}) - Q (\hat{θ} (ω)| {\hat{θ}}^{(r)})\}

Zhu and Lee [24] suggested studying the local behavior of

Q D (ω)

around

ω_{0}

. Corresponding to their proposal, the normal curvature in the direction of some unit vector

d

, given by

C_{Q D, d} = - 2 d^{⊤} {\ddot{Q}}_{ω_{0}} d

, can be employed to summarize the local behavior of the Q-displacement function, where

- {\ddot{Q}}_{ω_{0}} = Δ_{ω_{0}}^{⊤} {\{- {\ddot{Q}}_{θ} (\hat{θ})\}}^{- 1} Δ_{ω_{0}}

in which

{\ddot{Q}}_{θ} (\hat{θ}) = {\frac{\partial^{2} Q (θ| {\hat{θ}}^{(r)})}{\partial θ \partial θ^{⊤}}|}_{θ = \hat{θ}}

and

Δ_{ω} = {\frac{\partial^{2} Q (θ, ω| {\hat{θ}}^{(r)})}{\partial θ \partial ω^{⊤}}|}_{θ = \hat{θ} (ω)} .

Since most influence measures proposed in the statistical literature are closely related to a spectral decomposition of

- 2 {\ddot{Q}}_{ω_{0}}

, we used this expression to detect influential observations. Let

\sum_{k = 1}^{n} ξ_{k} e_{k} e_{k}^{⊤}

be the spectral decomposition of

- 2 {\ddot{Q}}_{ω_{0}}

where

\{(ξ_{i}, e_{i}) : i = 1, \dots, n\}

are the eigenvalue–eigenvector pairs of the matrix

- 2 {\ddot{Q}}_{ω_{0}}

with

ξ_{1} \geq \dots \geq ξ_{q}

,

ξ_{q + 1} = \dots = ξ_{n} = 0

and

\{e_{k} = (e_{k 1}, \dots, e_{k q}) : k = 1, \dots, n\}

is the associated orthonormal basis.

Following Zhu and Lee [24] and Lu and Song [34], the assessment of influential observations was based on

M {(0)}_{i} = \sum_{k = 1}^{q} {\tilde{ξ}}_{k} e_{k i}^{2}, i = 1, \dots, n

where

{\tilde{ξ}}_{k} = ξ_{k} / (ξ_{1} + \dots + ξ_{q})

. The influence measure

M {(0)}_{i}

may be obtained through

B_{Q D, d_{i}} = \frac{- 2 d_{i}^{⊤} {\ddot{Q}}_{ω_{0}} d_{i}}{tr (- 2 {\ddot{Q}}_{ω_{0}})}

where

d_{i}

is an

n \times 1

vector with the ith element equal to one and all other elements equal to zero. Moreover, corresponding to Lee and Xu [35], we used the cut-off point

1 / n + c^{*} S M (0)

to consider the ith observation as influential, where

c^{*}

is a constant, chosen according to the real application, and

S M (0)

denotes the standard deviation of

\{M {(0)}_{i} : i = 1, \dots, n\}

.

4.1. The Hessian Matrix

To achieve the local influence diagnostic measures for a particular perturbation scheme, we needed to compute

\frac{\partial^{2} Q (θ| {\hat{θ}}^{(r)})}{\partial θ \partial θ^{⊤}}

. It follows from (9) that the Hessian matrix has elements given by

\frac{\partial^{2} Q (θ| {\hat{θ}}^{(r)})}{\partial β \partial β^{⊤}} = - \frac{1}{σ^{2}} X^{⊤} (V^{- 1} + \frac{1}{δ^{2}} h h^{⊤}) X,

\frac{\partial^{2} Q (θ| {\hat{θ}}^{(r)})}{\partial β \partial σ^{2}} = - \frac{1}{σ^{4}} X^{⊤} \{(V^{- 1} + \frac{1}{δ^{2}} h h^{⊤}) (y - X β) - \frac{{\hat{z}}^{(r)}}{δ^{2}} h\},

\begin{array}{l} \frac{\partial^{2} Q (θ| {\hat{θ}}^{(r)})}{\partial β \partial λ_{ε}^{⊤}} & = \frac{2}{σ^{2} δ^{4}} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} X^{⊤} h {(λ_{ε} - h)}^{⊤} \\ - \frac{1}{σ^{2} δ^{2}} X^{⊤} V^{- 1} [\{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} I_{n} - λ_{ε} {(y - X β)}^{⊤} V^{- 1}], \end{array}

\frac{\partial^{2} Q (θ| {\hat{θ}}^{(r)})}{\partial σ^{2} \partial σ^{2}} = \frac{n + 1}{2 σ^{4}} - \frac{1}{σ^{6}} {(y - X β)}^{⊤} V^{- 1} (y - X β) - \frac{1}{σ^{6} δ^{2}} [{\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} h^{⊤} (y - X β) + {\{h^{⊤} (y - X β)\}}^{2}],

\begin{array}{l} \frac{\partial^{2} Q (θ| {\hat{θ}}^{(r)})}{\partial σ^{2} \partial λ_{ε}^{⊤}} = & - \frac{1}{σ^{4} δ^{4}} [{\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} h^{⊤} (y - X β) + {\{h^{⊤} (y - X β)\}}^{2}] {(λ_{ε} - h)}^{⊤} \\ - \frac{1}{σ^{4} δ^{2}} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} {(y - X β)}^{⊤} V^{- 1}, \end{array}

\begin{array}{l} \frac{\partial^{2} Q (θ| {\hat{θ}}^{(r)})}{\partial λ_{ε} \partial λ_{ε}^{⊤}} & = \frac{2}{δ^{4}} (λ_{ε} - h) {(λ_{ε} - h)}^{⊤} - \frac{1}{δ^{2}} (I_{n} - V^{- 1}) \\ - \frac{1}{σ^{2} δ^{4}} [{\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} h^{⊤} (y - X β) + {\{h^{⊤} (y - X β)\}}^{2}] [\frac{4}{δ^{2}} (λ_{ε} - h) {(λ_{ε} - h)}^{⊤} - (I_{n} - V^{- 1})] \\ - \frac{2}{σ^{2} δ^{4}} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} (λ_{ε} - h) {(y - X β)}^{⊤} V^{- 1} \\ - \frac{1}{σ^{2} δ^{2}} V^{- 1} (y - X β) [\frac{2}{δ^{2}} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} {(λ_{ε} - h)}^{⊤} + {(y - X β)}^{⊤} V^{- 1}] . \end{array}

4.2. Perturbation Schemes

In this section, we present three distinct perturbation schemes for the model defined in (1).

4.2.1. Perturbation of the Response Variable

A perturbation of the response variable

y

is defined as

y_{ω} = y + s_{y} ω

, where

s_{y}

is the standard deviation of

y

. In this case,

ω_{0} = 0

and

\begin{array}{l} Q (θ, ω| {\hat{θ}}^{(r)}) \\ \propto - \frac{n}{2} \log (σ^{2}) - \frac{1}{2} \log (|V|) \\ - \frac{1}{2 σ^{2}} [{(y_{ω} - X β)}^{⊤} V^{- 1} (y_{ω} - X β)] - \frac{1}{2} \log (σ^{2} δ^{2}) \\ - \frac{1}{2 σ^{2} δ^{2}} [{\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} h^{⊤} (y_{ω} - X β) + {\{h^{⊤} (y_{ω} - X β)\}}^{2}] . \end{array}

(10)

From (10), the matrix

Δ_{ω_{0}} = {\frac{\partial^{2} Q (θ, ω| {\hat{θ}}^{(r)})}{\partial θ \partial ω^{⊤}}|}_{ω = ω_{0}}

has the following elements.

Δ_{β} = \frac{s_{y}}{σ^{2}} X^{⊤} (V^{- 1} + \frac{1}{δ^{2}} h h^{⊤})

Δ_{σ^{2}} = \frac{s_{y}}{σ^{4}} {(y - X β)}^{⊤} V^{- 1} - \frac{s_{y}}{σ^{4} δ^{2}} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} h^{⊤}

\begin{array}{l} Δ_{λ_{ε}} = - \frac{2 s_{y}}{σ^{2} δ^{4}} & \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} (λ_{ε} - h) h^{⊤} \\ + \frac{s_{y}}{σ^{2} δ^{2}} [\{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} V^{- 1} - V^{- 1} (y - X β) h^{⊤}] \end{array}

4.2.2. Perturbation of the kth Column of the Matrix $X$

We considered altering the kth column matrix

X

, i.e.,

x_{k}

, by taking

x_{k ω} = x_{k} + s_{k} ω

where

s_{k}

is the standard deviation of

x_{k}

and

ω_{0} = 0

represents no perturbation. In this case, the perturbed

Q

-function took the form

\begin{array}{l} Q (θ, ω| {\hat{θ}}^{(r)}) \\ \propto - \frac{n}{2} \log (σ^{2}) - \frac{1}{2} \log (|V|) \\ - \frac{1}{2 σ^{2}} [{(y - X_{ω} β)}^{⊤} V^{- 1} (y - X_{ω} β)] - \frac{1}{2} \log (σ^{2} δ^{2}) \\ - \frac{1}{2 σ^{2} δ^{2}} [{\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} h^{⊤} (y - X_{ω} β) + {\{h^{⊤} (y - X_{ω} β)\}}^{2}], \end{array}

(11)

where

X_{ω} = (x_{1 ω}, \dots, x_{p ω})

. It follows from (11), that the elements of the matrix

Δ_{ω_{0}}

were given by

Δ_{β} = \frac{s_{t}}{σ^{2}} \{u_{t} {(y - X β)}^{⊤} V^{- 1} - β_{t} X^{⊤} V^{- 1}\} - \frac{s_{t}}{σ^{2} δ^{2}} [\{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} u_{t} h^{⊤} + β_{t} X^{⊤} h h^{⊤}]

Δ_{σ^{2}} = - \frac{s_{t} β_{t}}{σ^{4}} {(y - X β)}^{⊤} V^{- 1} + \frac{s_{t} β_{t}}{σ^{4} δ^{2}} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} h^{⊤}

Δ_{λ_{ε}} = \frac{s_{t} β_{t}}{σ^{2}} [\{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} \{\frac{2}{δ^{4}} (λ_{ε} - h) h^{⊤} - \frac{1}{δ^{2}} V^{- 1}\} + \frac{1}{δ^{2}} V^{- 1} (y - X β) h^{⊤}]

4.2.3. Perturbation of the Dispersion Matrix of the Errors

We modified the dispersion matrix of errors, i.e.,

σ^{2} I_{n}

, to

σ^{2} D (ω)

where

D (ω)

is a diagonal matrix with diagonal elements

ω = {(ω_{1}, ω_{2}, \dots, ω_{n})}^{⊤}

. The point representing no perturbation is

ω_{0} = {(1, 1, \dots, 1)}^{⊤}

. In this case, the perturbed

Q

-function was obtained as

\begin{array}{l} Q (θ, ω| {\hat{θ}}^{(r)}) \\ \propto - \frac{n}{2} \log (σ^{2}) - \frac{1}{2} \log (|V (ω)|) \\ - \frac{1}{2 σ^{2}} [{(y - X β)}^{⊤} V {(ω)}^{- 1} (y - X β)] - \frac{1}{2} \log (σ^{2} δ^{2} (ω)) \\ - \frac{1}{2 σ^{2} δ^{2} (ω)} [{\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} h {(ω)}^{⊤} (y - X β) + {\{h {(ω)}^{⊤} (y - X β)\}}^{2}], \end{array}

(12)

where

V (ω) = D (ω) + U Σ U^{⊤}

,

h (ω) = V^{- 1} (ω) λ_{ε}

, and

δ^{2} (ω) = 1 + λ_{ε}^{⊤} (D (ω) - D (ω) V^{- 1} (ω) D (ω)) λ_{ε}

. From (12), we obtained the elements of

Δ_{ω_{0}}

.

The kth column of the matrix

Δ_{β}

was given by

\begin{array}{l} - \frac{1}{σ^{2}} X^{⊤} c_{k} c_{k}^{⊤} (y - X β) & + \frac{1}{σ^{2} δ^{4}} {\{d_{k}^{⊤} (λ_{ε} - h)\}}^{2} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} X^{⊤} h \\ - \frac{1}{σ^{2} δ^{2}} [λ_{ε}^{⊤} c_{k} c_{k}^{⊤} (y - X β) X^{⊤} h \\ - \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} X^{⊤} c_{k} c_{k}^{⊤} λ_{ε}] \end{array}

where

c_{k}

is the kth column of matrix

V^{- 1}

. Also, the kth element of the vector

Δ_{σ^{2}}

was obtained by

- \frac{1}{2 σ^{2}} {(y - X β)}^{⊤} c_{k} c_{k}^{⊤} (y - X β) - \frac{1}{2 σ^{4} δ^{4}} {\{d_{k}^{⊤} (λ_{ε} - h)\}}^{2} [{\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} h^{⊤} (y - X β) + {\{h^{⊤} (y - X β)\}}^{2}] + \frac{1}{σ^{4} δ^{2}} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} λ_{ε}^{⊤} c_{k} c_{k}^{⊤} (y - X β)

Finally, the kth column of the matrix

Δ_{λ_{ε}}

was achieved by

\begin{array}{l} r_{k} - \frac{1}{σ^{2} δ^{2}} [[{\hat{z^{2}}}^{(r)} & - 2 {\hat{z}}^{(r)} h^{⊤} (y - X β) + {\{h^{⊤} (y - X β)\}}^{2}] (2 r_{k} + \frac{1}{δ^{2}} H_{k} λ_{ε}) \\ + \frac{1}{δ^{2}} {\{d_{k}^{⊤} (λ_{ε} - h)\}}^{2} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} V^{- 1} (y - X β) \\ - λ_{ε}^{⊤} c_{k} c_{k}^{⊤} (y - X β) [\frac{2}{δ^{2}} \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} (I_{n} - V^{- 1}) λ_{ε} \\ + V^{- 1} (y - X β)] + \{{\hat{z}}^{(r)} - h^{⊤} (y - X β)\} c_{k} c_{k}^{⊤} (y - X β)] \end{array}

where

H_{k} = (I_{n} - V^{- 1}) D (d_{k}) (I_{n} - V^{- 1}),

and

r_{k} = \frac{1}{δ^{4}} {\{d_{k}^{⊤} (λ_{ε} - h)\}}^{2} (I_{n} - V^{- 1}) λ_{ε} - \frac{1}{δ^{2}} H_{k} λ_{ε} .

5. Empirical Studies

In this section, we presented a simulation study and a real data example. The R software (version 4.1.2) Vienna, Austria, [36] was used to conduct all programs.

5.1. Simulation Study

Here, we considered a Monte Carlo simulation to evaluate the performance of the ML estimates in finite sample sizes. The model was taken as follows:

Y_{i j} = x_{i j}^{(1)} β_{1} + x_{i j}^{(2)} β_{2} + b_{1 j} + ε_{i j}; i = 1, \dots, s; j = 1, \dots, q,

where

q

, according to [37], is the number of independent clusters, and

s

is the cluster size in a longitudinal study; hence, the total sample size was

n = s q

. If this model were represented in a matrix form as in (1), we would have

m = 1

,

b = b_{1} = {(b_{11}, \dots, b_{1 q})}^{⊤}

;

Y = {(Y_{11}, \dots, Y_{1 q}, Y_{21}, \dots, Y_{2 q}, \dots, Y_{s 1}, \dots, Y_{s q})}^{⊤}

; and

X = {(x^{⊤ (1)}, x^{⊤ (2)})}^{⊤}

, where

x^{(k)} = {(x_{11}^{(k)}, \dots, x_{1 q}^{(k)}, x_{21}^{(k)}, \dots, x_{2 q}^{(k)}, \dots, x_{s 1}^{(k)}, \dots, x_{s q}^{(k)})}^{⊤}

and

ε

has the same structure as

Y

. We considered the following combinations for simulation:

q = 50

or

100

and

s = 3

, which are usual sample sizes in longitudinal studies,

β_{1} = - 1

,

β_{2} = 2

,

x^{(k)} ~ N_{n} (0, I_{n})

,

b_{1} ~ N_{q} (0, σ_{1}^{2} I_{q})

,

ε ~ S N_{n} (0, σ^{2} I_{n}, λ_{ε} = {(λ_{1} = - 3, λ_{2} = 0, λ_{3} = 3)}^{⊤} \otimes 1_{q} / n)

, where

1_{q}

is a

q \times 1

vector of 1s. To see the effect of variability in data on the estimation of the parameters, we took

σ_{1}^{2} = {0.5}^{2}

and

σ^{2} = {0.4}^{2}

as a low dispersion and

σ_{1}^{2} = {0.9}^{2}

and

σ^{2} = {0.8}^{2}

as a high dispersion in the response data. We also considered two cases for obtaining our estimates: (I) without taking into account skewness in the model and using the ML estimates under normal errors, which is called here the Normal estimates; and (II), using the estimates in this work named by the SN estimates. For each combination of the parameters, 1000 iterations were performed. We applied the mvtnorm and sn packages to generate random samples from the multivariate normal and SN distributions, respectively.

In each iteration, we obtained parameter estimates based on both scenarios, and then the mean and the standard deviation (SD) of the Normal and SN estimators were calculated. The summary results are presented in Table 1, Table 2, Table 3 and Table 4.

It can be seen from Table 1, Table 2, Table 3 and Table 4, that the ML estimates of fixed effects were unbiased for both cases. However, the SN estimates had a lower dispersion compared to the Normal estimates. Both cases showed better results by increasing the sample size, but the SN estimates performed better. Increasing the values of variance components did not have much effect on the biases, but their SDs increased, and the precision of the estimates decreased. In addition, we observed that the Normal estimates of variance components were biased and hade larger SDs. In contrast, the SN estimates were unbiased and had smaller SDs. As was seen, increasing the values of variance components had a much worse effect on the biases and precision of the Normal estimates of the variance components. However, for the SN estimates, there were some rises in SDs, which was completely natural, but we still had unbiasedness. Furthermore, in both aspects (bias and SD), the SN estimates had better performance concerning the Normal estimates. For these, increasing the sample size did not affect demolishing biases; however, it decreased the SD to some extent. Nevertheless, it decreased bias and SD for the SN estimates. The simulation results in Table 1, Table 2, Table 3 and Table 4 showed that the skewness estimates were unbiased. These estimates were made worse by increasing variability in the data and made better by increasing the sample size (see Figure 1 and Figure 2). Furthermore, the histograms in Figure 1 and Figure 2 indicate that the distribution of the estimators, even for small sample sizes, was approximately normal.

5.2. An Example

The metallic oxide data given by Fellner [2] was used to illustrate the usefulness of our method in applications. Following Fellner [2], a nested model was fitted with raw material for fixed effects, while the random effects were lot, sample, chemist, and analysis. We considered normal and SN distributions for the random errors in this model for comparative purposes, and both assumptions (heteroscedasticity and homoscedasticity) in the random-effect factors were considered. Therefore, we had four scenarios:

(a): Normal random errors and homoscedasticity in random-effect factors.
(b): SN random errors and homoscedasticity in random-effect factors.
(c): Normal random errors and heteroscedasticity in random-effect factors.
(d): SN random errors and heteroscedasticity in random-effect factors.

The estimates of the parameters and some model selection criteria, including the log-likelihood values (

l (θ)

), AIC, and BIC, are given in Table 5. In all cases, the Type 1 mean was approximately the same. The Type 2 mean was the same in the Normal cases (homoscedasticity and heteroscedasticity). This was approximately true in the SN cases as well. Under both assumptions in random-effect factors, the Type 2 mean in the SN case was greater than for the Normal case. In addition, the variance of the error term (analysis variance) and the lot variance were reduced in the SN case compared to the Normal case. As we expected, the model selection criteria showed that the fitted model under heteroscedasticity and SN random errors had the best fit.

We continued by conducting the local influence approach to detect influential observations based on

M (0)

for this dataset. Figure 3, Figure 4, Figure 5 and Figure 6 depict, respectively, the index plots of

M (0)

for all scenarios under perturbations of the response variable, the dispersion matrix of the errors, and the columns of matrix

X

. In all the perturbation schemes, we used

c^{*} = 3

to construct the benchmarks. Furthermore, the influential observations are numbered in all figures.

Figure 3 indicates that observation #192 stood out as the most influential of the four scenarios under response variable perturbation. There were some suspicious points in the three first scenarios that were regular points in scenario 4. It could be claimed that the effects of these points in this scenario were controlled as the most flexible model in this work. Under the perturbation of the dispersion matrix of the errors in Figure 4, observation #192 again appeared as the most influential. As can be seen, the effects of other observations in scenario 4 were lower when compared with other scenarios. For the perturbation of the columns of matrix

X

, as depicted in Figure 5 and Figure 6, the results were approximately the same. In these schemes, observation #192 seemed to be the most influential again. Furthermore, for the two first scenarios (the models under homoscedasticity), observation #191 was also influential. Except for the two last scenarios (the models under heteroscedasticity), observation #234 was an influential case. Generally, in all figures, suspicious points in the less flexible models were more numerous than in the more flexible models. In other words, when we used a more flexible model, we decreased the effects of anomalous observations in our statistical results.

6. Conclusions

The study of a multifactor normal LMM under heteroscedasticity was done by authors such as [1,2]. They showed that the ML method for estimation of the parameters performed well and the estimates had good properties such as consistency and asymptotic normal distribution. When the normality test of model errors or random effects was rejected, the derived estimates did not lead to satisfactory results. Therefore, LMMs (only with a random-effect factor) based on skewed distributions were presented by several authors. When the normality assumption did not hold for the model errors or the random-effect factor, these models performed well in comparison to those under the normality assumption. Additionally, diagnostic analyses showed these models decreased the effect of outliers. Recent research showed that new generalized skewed distributions usually have a better fit than simpler skewed distributions (see e.g., [38,39]). Clearly, using these distributions in LMMs can be also considered (see e.g., [40]), but, as mentioned before, the complexity of calculations in complicated models makes a case for using the simple but flexible SN distribution (See using SN distribution in some new complicated models [13,14]). Therefore, we considered SN distribution for the model errors in the multifactor LMM under heteroscedasticity in random-effect factors. Our main goals involved parameter estimation and the local influence method for the multifactor SN–LMM under heteroscedasticity in random-effect factors. At first, we expanded an EM-based algorithm, as many in the literature proposed, to estimate the model parameters. We also obtained a closed form to estimate variance components using this method. Then, we applied Zhu and Lee’s approach to extend the local influence method to this model. Empirical studies, a simulation study and a real data example, were carried out to see the behavior of our estimators. Our findings followed previous results in this field. The simulation results––consistency, low dispersion, and asymptotic normal distribution––showed that the estimators performed well, even for finite sample sizes. It was also observed that ignorance of skewness when the error model followed from a skewed distribution like SN made unsuitable outcomes. Finally, through a real example, it was shown that taking into account both heteroscedasticity in random-effect factors and election a skewed distribution for random errors in the fitted model improved statistical results in comparison to other works that considered at most one of them. Additionally, in this case, we observed the robustness of the ML estimators through the local influence method. Finally, any skewed distributions contained symmetrical distributions in special cases. When the assumption of symmetry held for random variables in a sensitivity analysis of any model, skewed distributions were not recommended due to additional parameter costs. Extending this work when both random-effect factors and model errors have SN distribution is theoretically and computationally hard, but that will be our goal in a subsequent work. Moreover, generalized skewed distributions for the model errors or the random-effect factors in the multifactor LMM, along with diagnostic measures, are proposed for future work.

Author Contributions

Conceptualization, Z.N. and M.R.M.; Formal analysis, Z.N. and K.Z.; Investigation, Z.N., M.R.M., S.S. and A.M.; Methodology, Z.N., K.Z. and M.R.M.; Project administration, K.Z., M.R.M. and A.M.; Resources, M.R.M.; Software, M.R.M. and A.M.; Supervision, K.Z., M.R.M. and A.M.; Validation, S.S.; Visualization, A.M.; Writing—original draft, Z.N., K.Z., M.R.M. and S.S.; Writing—review & editing, A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data can be available from the authors upon request.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

EM	Expectation Maximization
LMM	Linear Mixed Model
SN	Skew-Normal
ML	Maximum Likelihood
SD	Standard Deviation
AIC	The Akaike Information Criterion
BIC	The Bayesian Information Criterion

Appendix A

Appendix A.1. ML Prediction of Random Effects

Let

y = {(y_{1}, \dots, y_{n})}^{⊤}

and

b = {(b_{1}^{⊤}, b_{2}^{⊤}, \dots, b_{m}^{⊤})}^{⊤}

be vectors of observed responses and the random effects, respectively, and

z^{*}

be a missing observation. It follows from (6) that the conditional complete log-likelihood function of

b

given

θ

,

X

and

y_{c}^{*} = {(y^{⊤}, z^{*})}^{⊤}

may be expressed as

\begin{array}{l} l_{c}^{*} (b; θ, X, y_{c}^{*}) \\ = C - \frac{q + 1}{2} \log (σ^{2}) - \frac{1}{2} \log (|Σ T|) \\ - \frac{1}{2 σ^{2}} {(b - Σ U^{⊤} V^{- 1} (y - X β))}^{⊤} {(Σ T)}^{- 1} (b - Σ U^{⊤} V^{- 1} (y - X β)) \\ - \frac{1}{2 σ^{2}} {(z^{*} - λ_{ε}^{⊤} (y - X β - U b))}^{2} - \log (α), \end{array}

where

C

does not depend on unknown parameters. If

{\hat{θ}}^{(r)}

and

{\hat{b}}^{(r)}

are the estimates in the

r

th iteration, the expected conditional complete log-likelihood function would be

\begin{array}{l} Q^{*} (b| {\hat{θ}}^{(r)}, {\hat{b}}^{(r)}) \\ = E [l_{c}^{*} (b; θ, X, y_{c}^{*})| & {\hat{θ}}^{(r)}, {\hat{b}}^{(r)}] \\ \propto - \frac{q + 1}{2} \log (σ^{2}) & - \frac{1}{2} \log (|Σ T|) \\ - \frac{1}{2 σ^{2}} {b^{⊤} [{(Σ T)}^{- 1} & + U^{⊤} λ_{ε} λ_{ε}^{⊤} U b] \\ - 2 b^{⊤} [U^{⊤} (y - X β) - {\hat{z^{*}}}^{(r)} U^{⊤} λ_{ε} + U^{⊤} λ_{ε} λ_{ε}^{⊤} (y - X β)] \\ + {(y - X β)}^{’} (I_{n} - V^{- 1} + λ_{ε} λ_{ε}^{⊤}) (y - X β) - 2 {\hat{z^{*}}}^{(r)} λ_{ε}^{⊤} (y - X β) + \\ {\hat{z^{*}^{2}}}^{(r)}} \end{array}

Then, the M-step maximizes

Q^{*} (b| {\hat{θ}}^{(r)}, {\hat{b}}^{(r)})

with respect to

b

. So, we solve the following equation:

\begin{array}{l} \frac{\partial Q^{*} (b| {\hat{θ}}^{(r)}, {\hat{b}}^{(r)})}{\partial b} \\ = - \frac{1}{σ^{2}} [({(Σ T)}^{- 1} & + U^{⊤} λ_{ε} λ_{ε}^{⊤} U) b \\ - (U^{'} (y - X β) - {\hat{z^{*}}}^{(r)} U^{'} λ_{ε} + U^{⊤} λ_{ε} λ_{ε}^{⊤} (y - X β))] = 0 . \end{array}

Appendix A.2. ML Estimation of Variances of Random Effects

To find a solution for

γ_{i}

and hence for

σ_{i}^{2}

,

i = 1, \dots, m

, in Equation (9) using the relations

|V| = |I_{q} + U^{⊤} U Σ| = {|T|}^{- 1}

,

\partial V^{- 1} / \partial γ_{i} = - V^{- 1} U_{i} U_{i}^{⊤} V^{- 1}

,

\partial Σ / \partial γ_{i} = diag (0, \dots, 0, I_{q_{i}}, 0, \dots, 0)

and

\partial \log |I_{q} + U^{⊤} U Σ| / \partial γ_{i} = tr (T U^{⊤} U \frac{\partial Σ}{\partial γ_{i}}) = γ_{i}^{- 1} (q_{i} - tr (T_{i i})),

we have

\begin{array}{l} \frac{\partial Q (θ| {\hat{θ}}^{(r)})}{\partial γ_{i}} \\ = - \frac{1}{2} γ_{i}^{- 1} (q_{i} - tr (T_{i i})) \\ + \frac{1}{2 σ^{2} δ^{2}} [{\hat{z^{2}}}^{(r)} - 2 {\hat{z}}^{(r)} h^{⊤} (y - X β) + {(h^{'} (y - X β))}^{2}] h^{⊤} U_{i} U_{i}^{⊤} h \\ + \frac{1}{2 σ^{2}} {(y - X β)}^{⊤} V^{- 1} U_{i} U_{i}^{⊤} V^{- 1} (y - X β) - \frac{1}{2 δ} h^{⊤} U_{i} U_{i}^{⊤} h \\ + \frac{1}{σ^{2} δ} [{(y - X β)}^{⊤} h h^{⊤} U_{i} U_{i}^{⊤} V^{- 1} (y - X β) - {\hat{z}}^{(r)} h^{⊤} U_{i} U_{i}^{⊤} V^{- 1} (y - X β)] \end{array}

Then, by solving the equations

\frac{\partial Q (θ| {\hat{θ}}^{(r)})}{\partial γ_{i}} = 0, i = 1, \dots, m

the ML estimates of

γ_{i}

’s are obtained. Thus, the ML estimate of

σ_{i}^{2}

is taken to satisfy

\hat{σ_{i}^{2}} = \hat{σ^{2}} \hat{γ_{i}}, i = 1, \dots, m .

References

Harvill, D.A. Maximum likelihood approaches to variance component estimation and related problems (with discussion). J. Ame. Stat. Assoc. 1977, 72, 320–340. [Google Scholar] [CrossRef]
Fellner, W.H. Robust estimation of variance components. Technometrics 1986, 28, 51–60. [Google Scholar] [CrossRef]
Khuri, A.I.; Methew, T.; Sinha, B.K. Statistical Tests for Mixed Linear Models; John Wiley: New York, NY, USA, 1998. [Google Scholar]
Wu, M.X.; Yu, K.F.; Liu, A.Y.; Ma, T.F. Simultaneous optimal estimation in linear mixed models. Metrika 2012, 75, 471–489. [Google Scholar] [CrossRef]
Zhang, D.; Davidian, M. Linear mixed models with flexible distributions of random effects for longitudinal data. Biometrics 2001, 57, 795–802. [Google Scholar] [CrossRef] [PubMed]
Pinheiro, J.C.; Liu, C.H.; Wu, Y.N. Efficient algorithms for robust estimation in linear mixed-effects models using a multivariate t-distribution. J. Comput. Gr. Stat. 2001, 10, 249–276. [Google Scholar] [CrossRef]
Ghidey, W.; Lesaffre, E.; Eilers, P. Smooth random effects distribution in a linear mixed model. Biometrics 2004, 60, 945–953. [Google Scholar] [CrossRef]
Lin, T.I.; Lee, J.C. Estimation and prediction in linear mixed models with skew-normal random effects for longitudinal data. Stat. Med. 2008, 27, 1490–1507. [Google Scholar] [CrossRef]
Lachos, V.H.; Dey, D.K.; Cancho, V.G. Robust linear mixed models with skew-normal independent distributions from a Bayesian perspective. J. Stat. Plan. Inference 2009, 139, 4098–4110. [Google Scholar] [CrossRef]
Ye, R.D.; Wang, T.; Gupta, A.K. Distribution of matrix quadratic forms under skew-normal settings. J. Multivar. Anal. 2014, 131, 229–239. [Google Scholar] [CrossRef]
Azzalini, A.; Capitanio, A. Statistical applications of the multivariate skew normal distributions. J. Roy. Stat. Soc. Ser. B 1999, 61, 579–602. [Google Scholar] [CrossRef]
Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Statist. 1985, 12, 171–178. [Google Scholar]
Hosseini, F.; Karimi, O. Approximate pairwise likelihood inference in SGLM models with skew normal latent variables. J. Comput. Appl. Math. 2021, 398, 113692. [Google Scholar] [CrossRef]
Ju, Y.; Yang, Y.; Hu, M.; Dai, L.; Wu, L. Bayesian Influence Analysis of the Skew-Normal Spatial Autoregression Models. Mathematics 2022, 10, 1306. [Google Scholar] [CrossRef]
Verbeke, G.; Lesaffre, E. The effect of misspecifying the random effects distribution in linear mixed models for longitudinal data. Comput. Stat. Data. Anal. 1997, 23, 541–556. [Google Scholar] [CrossRef]
Tao, H.; Palta, M.; Yandell, B.S.; Newton, M.A. An estimation method for the semiparametric mixed effects model. Biometrics 1999, 55, 102–110. [Google Scholar] [CrossRef]
Ma, Y.; Genton, M.G.; Davidian, M. Linear mixed models with flexible generalized skew-elliptical random effects. In SKEW-Elliptical Distributions and Their Applications: A Journey Beyond Normality; Genton, M.G., Ed.; Chapman & Hall/CRC: Boca Raton, FL, USA, 2004; pp. 339–358. [Google Scholar]
Arellano-Valle, R.B.; Bolfarine, H.; Lachos, V.H. Skew-normal linear mixed models. J. Data Sci. 2005, 3, 415–438. [Google Scholar] [CrossRef]
Lachos, V.H.; Ghosh, P.; Arellano-Valle, R.B. Likelihood based inference for skew-normal independent linear mixed models. Stat. Sin. 2010, 20, 303–322. [Google Scholar]
Kheradmandi, A.; Rasekh, A. Estimation in skew-normal linear mixed measurement error models. J. Multivar. Anal. 2015, 136, 1–11. [Google Scholar] [CrossRef]
Maleki, M.; Wraith, D.; Arellano-Valle, R.B. A flexible class of parametric distributions for Bayesian linear mixed models. Test 2019, 28, 543–564. [Google Scholar] [CrossRef]
Ferreira, C.S.; Bolfarine, H.; Lachos, V.H. Linear mixed models based on skew scale mixtures of normal distributions. Commun. Stat. Simul. Comput. 2020, 1–21. [Google Scholar] [CrossRef]
Cook, R.D. Assessment of local influence (with discussion). J. Roy. Stat. Soc. Ser. B 1986, 48, 133–169. [Google Scholar] [CrossRef]
Zhu, H.; Lee, S. Local influence for incomplete-data models. J. Roy. Stat. Soc. Ser. B 2001, 63, 111–126. [Google Scholar] [CrossRef]
Montenegro, L.C.; Bolfarine, H.; Lachos, V.H. Influence diagnostics for a skew extension of the Grubbs measurement error model. Comm. Stat. Simul. Comput. 2009, 38, 667–681. [Google Scholar] [CrossRef]
Zeller, C.B.; Lachos, V.H.; Vilca, F.V. Influence diagnostics for Grubbs’s model with asymmetric heavy-tailed distributions. Stat. Pap. 2014, 55, 671–690. [Google Scholar] [CrossRef]
Ferreira, C.S.; Lachos, V.H.; Bolfarine, H. Inference and diagnostics in skew scale mixtures of normal regression models. J. Stat. Comput. Simul. 2015, 85, 517–537. [Google Scholar] [CrossRef]
Massuia, M.B.; Cabral, C.R.B.; Matos, L.A.; Lachos, V.H. Inference diagnostics for student-t censored linear regression models. Statistics 2015, 49, 1074–1094. [Google Scholar] [CrossRef]
Bolfarine, H.; Montenegro, L.C.; Lachos, V.H. Influence diagnostics for skew-normal linear mixed models. Sankhyā Indian J. Stat. 2007, 69, 648–670. [Google Scholar]
Montenegro, L.C.; Lachos, V.H.; Bolfarine, H. Local influence analysis for skew-normal linear mixed models. Commun. Stat. Theory Methods 2009, 38, 484–496. [Google Scholar] [CrossRef]
Zeller, C.B.; Labra, F.V.; Lachos, V.H.; Balakrishnan, N. Influence analyses of skew-normal/independent linear mixed models. Comput. Stat. Data Anal. 2010, 54, 1266–1280. [Google Scholar] [CrossRef]
Dempster, A.; Laird, N.; Rubin, D. Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. Ser. B 1977, 39, 1–22. [Google Scholar] [CrossRef]
Johnson, N.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; Wiley: New York, NY, USA, 1994. [Google Scholar]
Lu, B.; Song, X. Local influence of multivariate probit latent variable models. J. Multivar. Anal. 2006, 97, 1783–1798. [Google Scholar] [CrossRef] [Green Version]
Lee, S.Y.; Xu, L. Influence analysis of nonlinear mixed-effects models. Comput. Stat. Data Anal. 2004, 45, 321–341. [Google Scholar] [CrossRef]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021; Available online: https://www.R-project.org/ (accessed on 1 November 2021).
Wang, N.; Lin, X.; Gutierrez, R.G.; Carroll, R.J. Bias analysis and SIMEX approach in generalized linear mixed measurement error models. J. Amer. Statist. Assoc. 1998, 93, 249–261. [Google Scholar] [CrossRef]
Labra, F.V.; Garay, A.M.; Lachos, V.H.; Ortega, E.M.M. Estimation and diagnostics for heteroscedastic nonlinear regression models based on scale mixtures of skew-normal distributions. J. Stat. Plan. Inference 2012, 142, 2149–2165. [Google Scholar] [CrossRef]
Schumacher, F.L.; Dey, D.K.; Lachos, V.H. Approximate Inferences for Nonlinear Mixed Effects Models with Scale Mixtures of Skew-Normal Distributions. J. Stat. Theory Pract. 2021, 15, 60. [Google Scholar] [CrossRef]
Schumacher, F.L.; Lachos, V.H.; Matos, L.A. Scale mixture of skew-normal linear mixed models with within-subject serial dependence. Stat. Med. 2021, 40, 1790–1810. [Google Scholar] [CrossRef] [PubMed]

Figure 1. The histograms of the estimators of

λ_{1}

,

λ_{2}

and

λ_{3}

for

σ_{1}^{2} = {0.5}^{2}

and

σ^{2} = {0.4}^{2}

. The left histograms are for

n = 150

and the right histograms are for

n = 300

.

Figure 1. The histograms of the estimators of

λ_{1}

,

λ_{2}

and

λ_{3}

for

σ_{1}^{2} = {0.5}^{2}

and

σ^{2} = {0.4}^{2}

. The left histograms are for

n = 150

and the right histograms are for

n = 300

.

Figure 2. The histograms of the estimators of

λ_{1}

,

λ_{2}

and

λ_{3}

for

σ_{1}^{2} = {0.8}^{2}

and

σ^{2} = {0.9}^{2}

. The left histograms are for

n = 150

and the right histograms are for

n = 300

.

Figure 2. The histograms of the estimators of

λ_{1}

,

λ_{2}

and

λ_{3}

for

σ_{1}^{2} = {0.8}^{2}

and

σ^{2} = {0.9}^{2}

. The left histograms are for

n = 150

and the right histograms are for

n = 300

.

Figure 3. Index plots of M(0) for the perturbation of the response variable for the four scenarios. Dotted lines show the yardstick for M(0) with

c^{*} = 3

.

Figure 3. Index plots of M(0) for the perturbation of the response variable for the four scenarios. Dotted lines show the yardstick for M(0) with

c^{*} = 3

.

Figure 4. Index plots of M(0) for the perturbation of the dispersion matrix of the errors for the four scenarios. Dotted lines show the yardstick for M(0) with

c^{*} = 3

.

Figure 4. Index plots of M(0) for the perturbation of the dispersion matrix of the errors for the four scenarios. Dotted lines show the yardstick for M(0) with

c^{*} = 3

.

Figure 5. Index plots of M(0) for the perturbation of the first column of

X

for the four scenarios. Dotted lines show the yardstick for M(0) with

c^{*} = 3

.

Figure 5. Index plots of M(0) for the perturbation of the first column of

X

for the four scenarios. Dotted lines show the yardstick for M(0) with

c^{*} = 3

.

Figure 6. Index plots of M(0) for the perturbation of the second column of

X

for the four scenarios. Dotted lines show the yardstick for M(0) with

c^{*} = 3

.

Figure 6. Index plots of M(0) for the perturbation of the second column of

X

for the four scenarios. Dotted lines show the yardstick for M(0) with

c^{*} = 3

.

Table 1. Mean and SD of each estimator for both cases with

n = 150

,

σ^{2} = {0.4}^{2}

and

σ_{1}^{2} = {0.5}^{2}

.

Table 1. Mean and SD of each estimator for both cases with

n = 150

,

σ^{2} = {0.4}^{2}

and

σ_{1}^{2} = {0.5}^{2}

.

		Normal		SN
Parameter	True Value	Mean	SD	Mean	SD
$β_{1}$	−1	−1.0031	0.0409	−0.9990	0.0385
$β_{2}$	2	1.9992	0.0411	2.0019	0.0406
$σ^{2}$	0.16	0.1871	0.0233	0.1580	0.0222
$σ_{1}^{2}$	0.25	0.2198	0.0611	0.2455	0.0575
$λ_{1}$	−3	-	-	−3.1239	0.3321
$λ_{2}$	0	-	-	−0.0083	0.0963
$λ_{3}$	3	-	-	2.9146	0.3275

Table 2. Mean and SD of each estimator for both cases with

n = 300

,

σ^{2} = {0.4}^{2}

and

σ_{1}^{2} = {0.5}^{2}

.

Table 2. Mean and SD of each estimator for both cases with

n = 300

,

σ^{2} = {0.4}^{2}

and

σ_{1}^{2} = {0.5}^{2}

.

		Normal		SN
Parameter	True Value	Mean	SD	Mean	SD
$β_{1}$	−1	−0.9989	0.0298	−1.0012	0.0268
$β_{2}$	2	1.9978	0.0297	2.0001	0.0274
$σ^{2}$	0.16	0.1893	0.0167	0.1587	0.0160
$σ_{1}^{2}$	0.25	0.2212	0.0389	0.2505	0.0433
$λ_{1}$	−3	-	-	−2.9564	0.2249
$λ_{2}$	0	-	-	0.0015	0.0635
$λ_{3}$	3	-	-	3.0650	0.2268

Table 3. Mean and SD of each estimator for both cases with

n = 150

,

σ^{2} = {0.8}^{2}

and

σ_{1}^{2} = {0.9}^{2}

.

Table 3. Mean and SD of each estimator for both cases with

n = 150

,

σ^{2} = {0.8}^{2}

and

σ_{1}^{2} = {0.9}^{2}

.

		Normal		SN
Parameter	True Value	Mean	SD	Mean	SD
$β_{1}$	−1	−0.9995	0.0837	0.9988	0.0800
$β_{2}$	2	1.9989	0.0820	1.9980	0.0778
$σ^{2}$	0.64	0.7462	0.0942	0.6204	0.0868
$σ_{1}^{2}$	0.81	0.6872	0.2089	0.8138	0.1884
$λ_{1}$	−3	-	-	−2.9000	0.6506
$λ_{2}$	0	-	-	0.0056	0.2007
$λ_{3}$	3	-	-	2.9032	0.6644

Table 4. Mean and SD of each estimator for both cases with

n = 300

,

σ^{2} = {0.8}^{2}

and

σ_{1}^{2} = {0.9}^{2}

.

Table 4. Mean and SD of each estimator for both cases with

n = 300

,

σ^{2} = {0.8}^{2}

and

σ_{1}^{2} = {0.9}^{2}

.

		Normal		SN
Parameter	True Value	Mean	SD	Mean	SD
$β_{1}$	−1	−0.9970	0.0576	−0.9982	0.0545
$β_{2}$	2	2.0019	0.0579	1.9981	0.0547
$σ^{2}$	0.64	0.7580	0.0670	0.6330	0.0644
$σ_{1}^{2}$	0.81	0.6883	0.1336	0.8059	0.1432
$λ_{1}$	−3	-	-	−3.0679	0.2716
$λ_{2}$	0	-	-	−0.0049	0.1386
$λ_{3}$	3	-	-	2.9507	0.2694

Table 5. Metallic oxide summary results for four scenarios.

Parameter	(a)	(b)	(c)	(d)
Type 1 mean	$3.865$	$3.858$	$3.863$	$3.856$
Type 2 mean	$3.064$	$3.107$	$3.064$	$3.134$
Lot variance	$0.175$	$0.123$	$0.607$	$0.565$
Sample variance			$0.043$	$0.044$
Chemist variance			$0.032$	$0.033$
Analysis variance	$0.045$	$0.041$	$0.043$	$0.03$ 7
Skewness parameter	-	$- 4.396$	-	$- 3.674$
$l (θ)$	$- 122.762$	$- 118.602$	$- 91.322$	$- 88.176$
AIC	$253.524$	$247.204$	$194.644$	$190.352$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Najafi, Z.; Zare, K.; Mahmoudi, M.R.; Shokri, S.; Mosavi, A. Inference and Local Influence Assessment in a Multifactor Skew-Normal Linear Mixed Model. Mathematics 2022, 10, 2820. https://doi.org/10.3390/math10152820

AMA Style

Najafi Z, Zare K, Mahmoudi MR, Shokri S, Mosavi A. Inference and Local Influence Assessment in a Multifactor Skew-Normal Linear Mixed Model. Mathematics. 2022; 10(15):2820. https://doi.org/10.3390/math10152820

Chicago/Turabian Style

Najafi, Zeinolabedin, Karim Zare, Mohammad Reza Mahmoudi, Soheil Shokri, and Amir Mosavi. 2022. "Inference and Local Influence Assessment in a Multifactor Skew-Normal Linear Mixed Model" Mathematics 10, no. 15: 2820. https://doi.org/10.3390/math10152820

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Inference and Local Influence Assessment in a Multifactor Skew-Normal Linear Mixed Model

Abstract

1. Introduction

2. The Model Definition

3. Parameter Estimation via the EM Algorithm

4. Local Influence Analysis

4.1. The Hessian Matrix

4.2. Perturbation Schemes

4.2.1. Perturbation of the Response Variable

4.2.2. Perturbation of the kth Column of the Matrix $X$

4.2.3. Perturbation of the Dispersion Matrix of the Errors

5. Empirical Studies

5.1. Simulation Study

5.2. An Example

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

Appendix A

Appendix A.1. ML Prediction of Random Effects

Appendix A.2. ML Estimation of Variances of Random Effects

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Inference and Local Influence Assessment in a Multifactor Skew-Normal Linear Mixed Model

Abstract

1. Introduction

2. The Model Definition

3. Parameter Estimation via the EM Algorithm

4. Local Influence Analysis

4.1. The Hessian Matrix

4.2. Perturbation Schemes

4.2.1. Perturbation of the Response Variable

4.2.2. Perturbation of the kth Column of the Matrix X

4.2.3. Perturbation of the Dispersion Matrix of the Errors

5. Empirical Studies

5.1. Simulation Study

5.2. An Example

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

Appendix A

Appendix A.1. ML Prediction of Random Effects

Appendix A.2. ML Estimation of Variances of Random Effects

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.2.2. Perturbation of the kth Column of the Matrix $X$