Article

Estimating the Variance of Estimator of the Latent Factor Linear Mixed Model Using Supplemented Expectation-Maximization Algorithm

1 Department of Statistics, IPB University, Bogor 16680, Indonesia
2 Faculty of Spatial Sciences, University of Groningen, 9747 Groningen, The Netherlands
3 Department of Statistics, University of Padjadjaran, Bandung 16426, Indonesia
* Authors to whom correspondence should be addressed.
Symmetry 2021, 13(7), 1286; https://doi.org/10.3390/sym13071286
Submission received: 3 June 2021 / Revised: 1 July 2021 / Accepted: 15 July 2021 / Published: 17 July 2021
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)

Abstract

This paper deals with symmetrical data that can be modelled on the basis of a Gaussian distribution, such as linear mixed models for longitudinal data. The latent factor linear mixed model (LFLMM) is a method commonly used for analysing changes in high-dimensional longitudinal data. Model estimation is usually based on the expectation-maximization (EM) algorithm, but unfortunately the algorithm does not produce the standard errors of the regression coefficients, which hampers testing procedures. To fill this gap, this paper proposes the Supplemented EM (SEM) algorithm for the case of fixed variables. The computational aspects of the SEM algorithm are investigated by means of simulation. We also calculate the variance matrix of beta using the second moment as a benchmark for the asymptotic variance matrix of beta produced by SEM. Both the second moment and SEM produce symmetrical results: the variance estimates of beta become smaller as the number of subjects in the simulation increases. In addition, the practical usefulness of this work is illustrated using real data on political attitudes and behaviour in Flanders, Belgium.

1. Introduction

The latent factor linear mixed model (LFLMM), proposed by [1], combines factor analysis (FA) and the linear mixed model (LMM). The model aims to analyze longitudinal data sets with large numbers of multivariate responses, i.e., high-dimensional longitudinal data. The authors proposed estimating the LFLMM by means of the EM algorithm, whose update equations have closed-form solutions. They showed by simulation that EM estimation of the LFLMM provides accurate parameter estimates and handles the inclusion of covariates other than time more efficiently than alternatives such as the structural equation model. As shown by [2,3], the combination of fixed and random effects and the interaction of covariates with time can be handled in a straightforward way by the LFLMM estimated by EM.
The LFLMM assumes that the responses are continuous and that the number of latent variables is known [1]. Moreover, convergence of the EM algorithm is sometimes slow. The main disadvantage, however, is that the EM algorithm does not produce standard errors of the estimator of the regression coefficients because it does not calculate the derivatives of the likelihood function, which are often complicated and tedious to derive [4,5]. Thus, it is difficult to study the effects of different covariates or fixed variables for different latent factors simultaneously.
The general Supplemented EM algorithm was proposed by [6] to obtain standard errors by calculating the complete information matrix as the basis of the variance-covariance matrix of the estimator. The Supplemented EM algorithm has been applied to various kinds of models, notably item response models [6,7,8,9]. However, its suitability and performance when applied to the LFLMM have not yet been investigated. In this study, we extend the work of [1] by employing the Supplemented EM algorithm as a by-product of the EM estimator for the case of fixed variables. We use simulation studies to investigate the computational aspects of the Supplemented EM algorithm and a real data example to illustrate the practical usefulness of this work.
The remainder of this study is organized as follows. In Section 2, we specify the LFLMM and summarize the EM algorithm to estimate it. Section 3 presents the Supplemented EM algorithm. Section 4 and Section 5 present the results of the simulation and real data example. Conclusions follow in Section 6.

2. The LFLMM and the EM Algorithm

Following [1], the LFLMM is composed of two parts. The first is the factor analysis model, which represents the relationships between the observed and latent variables. This is similar to the structural equation model, in which the relationship between the latent variables and the measurement indicators is specified through factor analysis [10]. This part can be written as:
$Y_{it} = \Lambda \eta_{it} + \epsilon_{it}$  (1)
Specifically, for the i-th of N individuals, we observe j = 1, …, J responses that characterize d latent factors $\eta_{it} = (\eta_{it1}, \ldots, \eta_{itd})'$, d < J, at time t, t = 1, …, $T_i$, where $T_i$ is the number of time periods for subject i. $\Lambda$ is the matrix of factor loadings and $\epsilon_{it} = (\epsilon_{it1}, \ldots, \epsilon_{itJ})'$ the vector of measurement errors for subject i at time t. It is assumed that $\epsilon_{itj} \sim N(0, \tau_j^2)$ and that $\epsilon_{itj} \perp \epsilon_{ith}$ for $j \neq h$. In matrix notation, Equation (1) reads:
$Y_i = (I_{T_i} \otimes \Lambda)\, \eta_i + \epsilon_i$  (2)
where
$Y_i = (y_{i1}', \ldots, y_{iT_i}')' \;[J \times T_i, 1], \qquad \eta_i = (\eta_{i1}', \ldots, \eta_{iT_i}')' \;[d \times T_i, 1], \qquad \Lambda \;[J \times d] = (\lambda_1, \ldots, \lambda_J)'$
The second part of the LFLMM is a multivariate linear mixed model containing the fixed and random effects for each latent variable $\eta_{itl}$. For individual i, i = 1, 2, …, N, at time t, t = 1, 2, …, $T_i$, and latent variable l, l = 1, 2, …, d, we thus have:
$\eta_{itl} = x_{itl}' \beta_l + z_{itl}' a_{il} + \varepsilon_{itl}$  (3)
where $x_{itl}$ and $z_{itl}$ are the rows of the design matrices of the p fixed variables and the q random effects, respectively, $\beta_l$ is an unknown coefficient vector, and $a_{il} = (a_{i1l}, \ldots, a_{iql})'$ and $\varepsilon_{il} = (\varepsilon_{i1l}, \ldots, \varepsilon_{iT_il})'$, l = 1, 2, …, d, are the random effects and errors for subject i and factor l, respectively. The random effects are assumed to be normally distributed with mean 0 and variance-covariance matrix $V(a) = \Sigma_a$. It is assumed that $\Sigma_a$ captures the changes among the latent variables [1]. For example, a positive covariance between the random effects of latent variables 1 and 2 means that if, for a given individual i, latent variable 1 increases over time, latent variable 2 also increases for that individual. Note that in this setting, the covariates are included in the multivariate linear mixed model (MLMM) of Equation (3) but not in the factor analysis model of Equation (1).
In matrix notation, Equation (3) reads:
$\eta_i = X_i \beta + Z_i a_i + \varepsilon_i$  (4)
where
$X_i = (x_{i1}', \ldots, x_{iT_i}')' \;[d \times T_i, p \times d], \qquad x_{it} = \begin{pmatrix} x_{it1}' & 0 & \cdots & 0 \\ 0 & x_{it2}' & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & x_{itd}' \end{pmatrix} \;[d, p \times d]$
$Z_i = (z_{i1}', \ldots, z_{iT_i}')' \;[d \times T_i, q \times d], \qquad z_{it} = \begin{pmatrix} z_{it1}' & 0 & \cdots & 0 \\ 0 & z_{it2}' & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & z_{itd}' \end{pmatrix} \;[d, q \times d]$
$\beta = (\beta_1', \ldots, \beta_d')' \;[p \times d, 1]$
$a_i = (a_{i1}', \ldots, a_{id}')' \;[q \times d, 1] \sim N(0, \Sigma_a)$
$\varepsilon_i = (\varepsilon_{i1}', \ldots, \varepsilon_{iT_i}')' \;[d \times T_i, 1], \qquad \varepsilon_{it} \sim N(0, \Sigma_\varepsilon)$
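To make the stacked notation concrete, the following sketch (with made-up dimensions and random design rows, all illustrative assumptions rather than the authors' code) builds the block-diagonal matrices $x_{it}$ and stacks them into $X_i$; $Z_i$ is constructed in exactly the same way from the $z_{itl}$.

```python
import numpy as np
from scipy.linalg import block_diag

d, p, T_i = 2, 3, 4                        # latent factors, fixed variables, time points (illustrative)
rng = np.random.default_rng(0)

X_blocks = []
for t in range(T_i):
    # x_itl: the 1 x p design row for latent factor l at time t
    x_rows = [rng.normal(size=(1, p)) for _ in range(d)]
    X_blocks.append(block_diag(*x_rows))   # x_it has dimension d x (p*d)
X_i = np.vstack(X_blocks)                  # X_i has dimension (d*T_i) x (p*d)

beta = rng.normal(size=p * d)              # beta = (beta_1', ..., beta_d')'
print(X_i.shape, (X_i @ beta).shape)       # (8, 6) and (8,)
```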
The marginal distribution of Y i is assumed multivariate normal with mean:
$E(Y_i) = (I_{T_i} \otimes \Lambda)\, X_i \beta$
and variance-covariance matrix
$V(Y_i) = (I_{T_i} \otimes \Lambda)\, V(\eta_i)\, (I_{T_i} \otimes \Lambda)' + I_{T_i} \otimes \mathrm{diag}(\tau_1^2, \ldots, \tau_J^2)$
The first term in $V(Y_i)$ captures the variances and covariances of the latent factors and the second term the variances of the measurement errors $\epsilon_{it}$. The mean and variance-covariance matrix of $\eta_i$ are $E(\eta_i) = X_i \beta$ and $V(\eta_i) = Z_i \Sigma_a Z_i' + I_{T_i} \otimes \Sigma_\varepsilon$, respectively.
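The marginal moments above translate directly into Kronecker-product computations. The sketch below is a minimal illustration, assuming the stacked $X_i$, $Z_i$, and the parameter arrays are available as NumPy arrays; it is not the authors' implementation.

```python
import numpy as np

def marginal_moments(Lambda, X_i, Z_i, beta, Sigma_a, Sigma_eps, tau2, T_i):
    """E(Y_i) and V(Y_i) for one subject, following the formulas above."""
    I_T = np.eye(T_i)
    L_big = np.kron(I_T, Lambda)                                   # (J*T_i) x (d*T_i)
    V_eta = Z_i @ Sigma_a @ Z_i.T + np.kron(I_T, Sigma_eps)        # V(eta_i)
    mean_Y = L_big @ X_i @ beta                                    # E(Y_i)
    var_Y = L_big @ V_eta @ L_big.T + np.kron(I_T, np.diag(tau2))  # V(Y_i)
    return mean_Y, var_Y
```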
To estimate the LFLMM by EM, we summarize the algorithm proposed by [1] below. Before going into detail, we note that $\{\eta_i, a_i\}$ is treated as missing data. Hence, the complete data set is $\{Y_i, X_i, Z_i, \eta_i, a_i\}$, whereas the observed data are $\{Y_i, X_i, Z_i\}$. It follows that the complete data likelihood is:
$L = \prod_{i=1}^{N} P(Y_i \mid \eta_i, \Lambda, \tau^2)\, P(\eta_i \mid X_i, Z_i, a_i, \beta, \Sigma_\varepsilon)\, P(a_i \mid \Sigma_a)$
The corresponding complete data loglikelihood is:
$\log L = \sum_{i=1}^{N} \left[ \log P(Y_i \mid \eta_i, \Lambda, \tau^2) + \log P(\eta_i \mid X_i, Z_i, a_i, \beta, \Sigma_\varepsilon) + \log P(a_i \mid \Sigma_a) \right]$
where
$\sum_{i=1}^{N} \log P(Y_i \mid \eta_i, \Lambda, \tau^2) = \sum_{i=1}^{N} \sum_{j=1}^{J} \log P(Y_{ij} \mid \eta_i, \Lambda_j, \tau_j^2) = \sum_{i=1}^{N} \sum_{j=1}^{J} \left[ -\frac{n_i}{2} \log \tau_j^2 - \frac{1}{2\tau_j^2} (Y_{ij} - \eta_i \Lambda_j)'(Y_{ij} - \eta_i \Lambda_j) \right]$
$\sum_{i=1}^{N} \log P(\eta_i \mid X_i, Z_i, \beta, a_i, \Sigma_\varepsilon) = \sum_{i=1}^{N} \sum_{t=1}^{T_i} \log P(\eta_{it} \mid X_i, Z_i, a_i, \beta, \Sigma_\varepsilon) = \sum_{i=1}^{N} \sum_{t=1}^{T_i} \left[ -\frac{1}{2} \log |\Sigma_\varepsilon| - \frac{1}{2} (\eta_{it} - X_{it}\beta - Z_{it}a_i)' \Sigma_\varepsilon^{-1} (\eta_{it} - X_{it}\beta - Z_{it}a_i) \right]$
$\sum_{i=1}^{N} \log P(a_i \mid \Sigma_a) = -\frac{1}{2} \sum_{i=1}^{N} \left[ \log |\Sigma_a| + a_i' \Sigma_a^{-1} a_i \right]$
Let $\theta$ denote the parameter vector $(\Lambda, \tau^2, \beta, \Sigma_\varepsilon)$, let $\theta^{(w)}$ be the ML estimate of $\theta$ at the w-th iteration, w = 0, 1, …, and let $Q(\theta \mid \theta^{(w)})$ be the expectation of the joint loglikelihood of the complete data $\{Y_i, X_i, Z_i, \eta_i, a_i\}$ conditional on the observed data $\{Y_i, X_i, Z_i\}$:
$Q(\theta \mid \theta^{(w)}) = E\{\log L(\theta \mid Y_i, X_i, Z_i, \eta_i, a_i) \mid Y_i, X_i, Z_i, \theta^{(w)}\}$
Then the (w + 1)-th iteration of the EM algorithm consists of (i) the E-step, in which the expectation of the joint loglikelihood is computed according to (10), and (ii) the M-step, in which $Q(\theta \mid \theta^{(w)})$ is maximized to yield $\theta^{(w+1)}$. Further details on EM estimation of the LFLMM can be found in [1].
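The overall iteration can be summarized by the following schematic sketch. The e_step and m_step callables stand in for the closed-form expressions derived in [1], which are not reproduced here; the interface and stopping rule are illustrative assumptions, not the authors' code.

```python
import numpy as np

def em_lflmm(data, theta_init, e_step, m_step, max_iter=1000, tol=1e-6):
    """Iterate E- and M-steps until the largest parameter change falls below tol."""
    theta = theta_init  # tuple of parameter arrays: (Lambda, tau2, beta, Sigma_eps, ...)
    for w in range(max_iter):
        moments = e_step(data, theta)      # conditional moments of {eta_i, a_i} given observed data
        theta_new = m_step(data, moments)  # maximizer of Q(theta | theta^(w))
        change = max(np.max(np.abs(np.asarray(new) - np.asarray(old)))
                     for new, old in zip(theta_new, theta))
        theta = theta_new
        if change < tol:
            break
    return theta, w + 1
```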

3. The Supplemented EM

Below we discuss the Supplemented EM algorithm, denoted as SEM. Before going into detail, we observe that the main purpose of this study is to estimate the standard errors of the fixed effects, β .
Consider the mapping M defined by iteration w of the EM algorithm:
$\beta^{(w+1)} = M(\beta^{(w)}), \quad w = 0, 1, 2, \ldots$
When the parameter vector converges to $\beta^*$, we obtain $\beta^* = M(\beta^*)$. For $M(\beta)$ continuous, a Taylor expansion in the neighbourhood of $\beta^*$ gives:
$\beta^{(w+1)} = M(\beta^{(w)}) \approx M(\beta^*) + DM\,(\beta^{(w)} - \beta^*) = \beta^* + DM\,(\beta^{(w)} - \beta^*)$
where
$DM = \left. \left( \frac{\partial M_h(\beta)}{\partial \beta_g} \right) \right|_{\beta = \beta^*}$
with g = 1, 2, …, k and h = 1, 2, …, k, is the k × k Jacobian matrix of $M(\beta) = (M_1(\beta), \ldots, M_k(\beta))'$ evaluated at the ML estimate $\beta^*$, with k = p × d. DM is known as the rate matrix. To obtain the loglikelihood of $\beta$, we consider the complete data density of the LFLMM:
$f(\{Y_i, X_i, Z_i, \eta_i, a_i\} \mid \theta) = f(\{Y_i, X_i, Z_i\} \mid \theta)\, f(\{\eta_i, a_i\} \mid \{Y_i, X_i, Z_i\}, \theta)$
where f ( { Y i ,   X i ,   Z i } | θ ) is the density of the observed data and f ( { η i ,   a i } | { Y i ,   X i ,   Z i } , θ ) the density of missing data, given the observed data. Thus, the loglikelihood of β given the complete data is:
$\log L(\beta \mid \{Y_i, X_i, Z_i, \eta_i, a_i\}) = \log L(\beta \mid \{Y_i, X_i, Z_i\}) + \log f(\{\eta_i, a_i\} \mid \{Y_i, X_i, Z_i\}, \beta)$
where log L ( β | { Y i ,   X i ,   Z i } ) is the observed-data loglikelihood and log L ( β | { Y i ,   X i ,   Z i ,   η i ,   a i } ) is the complete data loglikelihood.
The asymptotic variance-covariance matrix of $\beta^*$, $V(\beta^*)$, is the inverse of the observed information matrix $I_o$. In the case of the LFLMM, the observed data are $\{Y_i, X_i, Z_i\}$, so that $V(\beta^*)$ is:
$V(\beta^*) = I_o^{-1}(\beta^* \mid \{Y_i, X_i, Z_i\})$
where I o ( β | { Y i ,   X i ,   Z i } )   is the information matrix of the observed data loglikelihood (which is assumed to exist). That is [6,11]:
$I_o(\beta \mid \{Y_i, X_i, Z_i\}) = E\left[ -\frac{\partial^2 \log L(\beta \mid \{Y_i, X_i, Z_i\})}{\partial \beta\, \partial \beta'} \right]$
Equation (15) is difficult to evaluate directly using the EM algorithm [6,11]. As a way out, [7] suggested evaluating the complete data information matrix:
$I_o(\beta \mid \{Y_i, X_i, Z_i, \eta_i, a_i\}) = E\left[ -\frac{\partial^2 \log L(\beta \mid \{Y_i, X_i, Z_i, \eta_i, a_i\})}{\partial \beta\, \partial \beta'} \right]$
The expectation of the complete data information, conditional on the observed data and evaluated at $\beta = \beta^*$, is:
$I_{oc} = E\left[ I_o(\beta \mid \{Y_i, X_i, Z_i, \eta_i, a_i\}) \mid \{Y_i, X_i, Z_i\}, \beta^* \right]$
After taking second derivatives, averaging over $f(\{\eta_i, a_i\} \mid \{Y_i, X_i, Z_i\}, \beta^*)$, and evaluating at $\beta = \beta^*$, Equation (13) implies:
$I_o(\beta^* \mid \{Y_i, X_i, Z_i\}) = I_{oc} - I_{om}$
where the missing information matrix $I_{om}$ is
$I_{om} = E\left[ -\frac{\partial^2 \log f(\{\eta_i, a_i\} \mid \{Y_i, X_i, Z_i\}, \beta)}{\partial \beta\, \partial \beta'} \right]_{\beta = \beta^*}$
Reference [12] interpreted Equation (18) as
observed information = complete information − missing information
and called it the “missing information principle”. Equation (18) can be written as:
$I_o(\beta^* \mid \{Y_i, X_i, Z_i\}) = (I - I_{om} I_{oc}^{-1})\, I_{oc},$
where I is the k × k identity matrix and $I_{om} I_{oc}^{-1}$ is the matrix of the fraction of missing information [7,11]. According to [13], the rate of convergence of the EM algorithm is determined by the fraction of missing information in the neighbourhood of $\beta^*$:
$DM = I_{om} I_{oc}^{-1}$
Substituting $DM = I_{om} I_{oc}^{-1}$ into Equation (20) and inverting, the asymptotic variance-covariance matrix of $\beta^*$, $V(\beta^*)$, is:
$V(\beta^*) = I_{oc}^{-1} (I - DM)^{-1}$
From the identity $(I - P)^{-1} = (I - P + P)(I - P)^{-1} = I + P(I - P)^{-1}$ it follows that:
$V(\beta^*) = I_{oc}^{-1} \{ I + DM (I - DM)^{-1} \} = I_{oc}^{-1} + I_{oc}^{-1} DM (I - DM)^{-1}$
or
$V(\beta^*) = I_{oc}^{-1} + \Delta V(\beta^*)$
where $\Delta V(\beta^*)$ is the increase in the diagonal elements of $V(\beta^*)$ due to the missing information.
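As a numerical check of the identities above, the following minimal sketch (with arbitrary illustrative matrices, not LFLMM output) verifies that $I_{oc}^{-1}(I - DM)^{-1}$ equals $I_{oc}^{-1} + \Delta V(\beta^*)$.

```python
import numpy as np

I_oc = np.array([[8.0, 1.0],
                 [1.0, 6.0]])            # complete-data information (k = 2, illustrative)
DM   = np.array([[0.30, 0.05],
                 [0.02, 0.25]])          # rate matrix (fraction of missing information)

k = I_oc.shape[0]
I_oc_inv = np.linalg.inv(I_oc)
V = I_oc_inv @ np.linalg.inv(np.eye(k) - DM)               # V(beta*) = I_oc^{-1}(I - DM)^{-1}
Delta_V = I_oc_inv @ DM @ np.linalg.inv(np.eye(k) - DM)    # increase due to missing information
assert np.allclose(V, I_oc_inv + Delta_V)                  # the decomposition above holds
```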
Calculation of the DM matrix can be done using the code and output of the original EM algorithm as follows [6,7]. The DM matrix is the differential of the parameter mapping defined by the EM algorithm; hence, each element of DM represents the component-wise rate of convergence per iteration of the EM algorithm. Let $r_{gh}$ be the (g, h)-th element of the DM matrix. From Equation (13), we have:
$r_{gh} = \frac{\partial M_h(\beta^*)}{\partial \beta_g} = \lim_{\beta_g \to \beta_g^*} \frac{M_h(\beta_1^*, \ldots, \beta_{g-1}^*, \beta_g, \beta_{g+1}^*, \ldots, \beta_k^*) - M_h(\beta^*)}{\beta_g - \beta_g^*} = \lim_{w \to \infty} \frac{M_h(\beta^{(w)}(g)) - M_h(\beta^*)}{\beta_g^{(w)} - \beta_g^*} \equiv \lim_{w \to \infty} r_{gh}^{(w)}$
for g = 1, 2, …, k and h = 1, 2, …, k,
where $\beta^{(w)}(g)$ is called the semi-active parameter set,
$\beta^{(w)}(g) = (\beta_1^*, \ldots, \beta_{g-1}^*, \beta_g^{(w)}, \beta_{g+1}^*, \ldots, \beta_k^*), \quad w = 1, 2, \ldots$
in which $\beta_g^{(w)}$ converges to $\beta_g^*$. Note that only the g-th component of $\beta^{(w)}(g)$ takes a value different from its maximum likelihood estimate.
To calculate $r_{gh}$, the Supplemented EM algorithm requires $\theta^* = \{\Lambda^*, \tau^{2*}, \beta^*, \Sigma_\varepsilon^*\}$ and $\theta^{(w)} = \{\Lambda^{(w)}, \tau^{2(w)}, \beta^{(w)}, \Sigma_\varepsilon^{(w)}\}$, w = 1, 2, …, as input. $\theta^*$ can be obtained by the EM algorithm using a set of arbitrarily chosen initial parameters $\theta_{init}$; this run also provides $\theta^{(w)}$ for w = 1, i.e., $\theta^{(1)}$. The starting point $\theta^{(1)}$ may, but need not, be close to $\theta^*$. The algorithm below closely follows [14,15].
  • Select input: $\theta^{(w)}$ and $\theta^*$.
  • Set $\theta^{(w)}$ for w = 1, 2, …. Then take the E-step and M-step of the LFLMM EM algorithm to produce $\theta^{(w+1)}$.
  • For rows g = 1, 2, …, k:
    (i) Set $\tilde{\beta}^{(w)}(g)$ equal to $\beta^*$, except for the g-th element: $\tilde{\beta}^{(w)}(g) = (\beta_1^*, \ldots, \beta_{g-1}^*, \beta_g^{(w)}, \beta_{g+1}^*, \ldots, \beta_k^*)$.
    (ii) Run the LFLMM EM algorithm with $\tilde{\beta}^{(w)}(g)$ as the current estimate of $\beta$ to obtain $\tilde{\beta}^{(w+1)}(g)$.
    (iii) Calculate the g-th row of $r^{(w)}$ as
    $r_{gh}^{(w)} = \frac{\tilde{\beta}_h^{(w+1)}(g) - \beta_h^*}{\beta_g^{(w)} - \beta_g^*}, \quad h = 1, 2, \ldots, k$
The output of a single run of the Supplemented EM algorithm (Steps 1 and 2) is $\beta^{(w+1)}$ and $r_{gh}^{(w)}$, g = 1, 2, …, k and h = 1, 2, …, k. Based on the final estimate of DM, $V(\beta^*)$ is calculated using (24). The diagonal elements of $V(\beta^*)$ are the variances of $\beta^*$.
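A hedged sketch of one sweep of the loop above is given below; `em_update_beta` is an assumed interface standing in for one E-step plus M-step of the LFLMM EM algorithm applied to $\beta$ (the mapping M), and is not code from [1]. Repeating the sweep over w until the ratios stabilize yields the final DM that is plugged into (24).

```python
import numpy as np

def sem_rate_matrix(em_update_beta, beta_star, beta_w):
    """One SEM sweep: the k x k matrix of ratios r_gh^(w) defined above."""
    k = beta_star.size
    r_w = np.zeros((k, k))
    for g in range(k):
        beta_tilde = beta_star.copy()
        beta_tilde[g] = beta_w[g]                 # semi-active set: only component g is moved
        beta_next = em_update_beta(beta_tilde)    # one EM iteration started from beta_tilde
        r_w[g, :] = (beta_next - beta_star) / (beta_w[g] - beta_star[g])
    return r_w
```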

4. Simulation

To evaluate the statistical properties and computational aspects of the SEM algorithm, we set up a simulation study. The number of subjects N is set at 500, 1000, and 1500, with six time periods. The number of simulations S is set at 50 and 250. The remaining set-up of the simulations is adopted from [1]. In particular, we use the same initial values of the parameters of the LFLMM model (12 items, 2 latent factors, and a simple structure to model the relationship between the items and the latent factors). This is done to check whether the bias resulting from the Supplemented EM algorithm for the LFLMM is in line with the results presented in [1]. Table 1 presents the absolute differences between the true parameters and the averages of the SEM estimates, calculated as a measure of performance.
Table 1 shows that the absolute differences range from 0 to 0.0444, the maximum occurring for $\sigma_{a,11}$ with N = 500 and S = 250. The results are in line with [1]: the parameters of the measurement model (factor loadings and error variances) are estimated more precisely than those of the latent mixed regression model. Overall, these results indicate that as the number of subjects in the simulation increases, the absolute difference between the true parameters and the SEM averages becomes smaller. This means that SEM estimates the model parameters very well.
Although the results in Table 1 indicate that the accuracy of the estimates in the latent mixed regression model (No. 23–43) is not as good as in the measurement model part (No. 1–22), Figure 1a–c shows that the medians of the boxplots, which are generally close to the true parameters, are all at the same level. This means that the parameters of the latent mixed regression models ($\beta$, $\sigma_a$, $\sigma_\varepsilon$) are also estimated precisely, especially for S = 250 simulations. Furthermore, the boxplots become narrower as the number of subjects in the simulation increases, for both numbers of simulations.
The results of the simulations of the Supplemented EM algorithm for estimating the asymptotic variance matrix of beta are summarized as standard deviations of beta in Table 2. We also calculate the standard deviation of beta using the 2nd moment, $V(\hat{\beta}) = \frac{1}{S-1} \sum_{s=1}^{S} (\hat{\beta}_s - \bar{\hat{\beta}})^2$, as a benchmark for the standard deviation of beta obtained by SEM. Both the 2nd moment and SEM produce symmetrical results: the estimated standard deviation of beta becomes smaller as the number of subjects in the simulation increases. Overall, it can be concluded that with SEM, the estimated standard deviations of beta do not differ much across the numbers of subjects (Figure 2). Therefore, the simulation results suggest that the asymptotic variance of beta from the Supplemented EM algorithm can be used to estimate the asymptotic variance of beta in real data analysis.
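For reference, the 2nd-moment benchmark in Table 2 can be computed with a few lines; the sketch below assumes an S × k array `beta_hats` of estimates collected over the simulation replicates (an illustrative interface, not the simulation code itself).

```python
import numpy as np

def second_moment_sd(beta_hats):
    """Element-wise Monte Carlo standard deviation of beta over S replicates."""
    S = beta_hats.shape[0]
    beta_bar = beta_hats.mean(axis=0)
    var = ((beta_hats - beta_bar) ** 2).sum(axis=0) / (S - 1)
    return np.sqrt(var)  # equivalent to np.std(beta_hats, axis=0, ddof=1)
```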

5. Real Data Example

The real data set that we use to illustrate the Supplemented EM algorithm concerns the political attitudes and behaviour of the Flemish. The data were designed to include a representative sample of the Belgian electorate. The Flemish data set (Flemish respondents and Dutch-speaking respondents from the Brussels Capital Region) consists of 1274 respondents, who were interviewed three times (1991, 1995, and 1999) [16,17,18]. Four latent factors measuring political attitudes of the Flemish are used, i.e., Individualism, Nationalism, Ethnocentrism, and Authoritarianism. These data have been analyzed using various methods by several authors, including [19,20,21,22,23]. Three questions are of interest in this real data case: how Individualism, Nationalism, Ethnocentrism, and Authoritarianism of the Flemish develop over time; whether there is an association between these four developments; and whether the gender of the respondent affects the change patterns of the latent developments.
I, N, E, and A in Table 3 correspond to Individualism, Nationalism, Ethnocentrism, and Authoritarianism, respectively. $a_{11}$ and $a_{12}$ are the random intercept and random slope for Individualism, $a_{21}$ and $a_{22}$ for Nationalism, $a_{31}$ and $a_{32}$ for Ethnocentrism, and $a_{41}$ and $a_{42}$ for Authoritarianism. The positive correlations of the random intercepts between $a_{11}$ and $a_{21}$, $a_{11}$ and $a_{31}$, and $a_{11}$ and $a_{41}$ suggest that the development of Individualism is highly related to that of the other political attitudes, with the highest correlation for Ethnocentrism. The results indicate that those with a stronger sense of Individualism tend to have a stronger sense of Nationalism, Ethnocentrism, and Authoritarianism. We also find positive correlations of the random intercepts between $a_{21}$ and $a_{31}$ and between $a_{21}$ and $a_{41}$, suggesting that those with a stronger sense of Nationalism tend to have a stronger sense of Ethnocentrism and Authoritarianism, and that those with a stronger sense of Ethnocentrism tend to have a stronger sense of Authoritarianism. There is also a positive correlation of the random slopes between $a_{12}$ and $a_{22}$: if a subject's Individualism decreases over time, it is reasonable to expect that his or her Nationalism will also decrease over time, and vice versa. The same holds between Individualism and Ethnocentrism and between Individualism and Authoritarianism. The positive correlation of the random slopes between $a_{22}$ and $a_{32}$ means that if a subject's Nationalism decreases over time, it is reasonable to expect that his or her Ethnocentrism will also decrease over time. The correlation matrix of the random effects confirms that all latent factors are positively correlated over time.
The significance of the parameter estimates of $\beta$ is assessed via z-values. Using the Supplemented EM algorithm, the standard errors of $\beta$ can be calculated for all parameters; they are listed in Table 4. Based on 95 percent confidence intervals for $\beta$, almost all intervals exclude zero, the exception being the slope of Male on Authoritarianism. Hence, these parameter estimates of $\beta$ are statistically significant. In other words, all latent factors of the Flemish decrease over time, with Ethnocentrism showing the highest rate of decline (−0.252) and Nationalism the lowest (−0.177). On average, the Individualism and Nationalism of male respondents are higher than those of female respondents, whereas the Ethnocentrism of male respondents is lower than that of female respondents.
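As an illustration of how the z-values and confidence intervals follow from the SEM standard errors, the short sketch below uses the Individualism time slope from Table 4; the normal approximation for the p-value is an assumption of this illustration.

```python
import math

beta_hat, se = -0.195, 0.004                 # time slope for Individualism and its SEM standard error (Table 4)
z = beta_hat / se
lower, upper = beta_hat - 1.96 * se, beta_hat + 1.96 * se
p_value = math.erfc(abs(z) / math.sqrt(2))   # two-sided p-value under the normal approximation
print(f"z = {z:.1f}, 95% CI = ({lower:.3f}, {upper:.3f}), p = {p_value:.3g}")
```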

6. Conclusions

This paper proposed the Supplemented EM algorithm for the LFLMM to estimate the asymptotic variance-covariance matrix as a by-product of the EM estimator for the case of fixed variables in the model. Results from the simulation studies suggest that the Supplemented EM algorithm produces estimates that are very close to the true parameter values.
Because it is built on the EM algorithm for the LFLMM, the Supplemented EM algorithm is very slow to converge, as stated by [1], especially when the number of simulations is 250 with 1500 subjects. For this reason, further research is needed on techniques to accelerate the algorithm. Several approaches to speeding up the EM algorithm have been proposed and can be found in [24,25,26] (the ECM algorithm), [27] (the ECME algorithm), and [28] (the Parameter-Expanded EM algorithm).

Author Contributions

Conceptualization, Y.A.; methodology, Y.A.; software, Y.A.; validation, K.A.N. and H.F.; formal analysis, Y.A.; data curation, Y.A. and T.T.; writing—original draft preparation, Y.A.; writing—review and editing, K.A.N., H.F., A.S. and T.T.; visualization, T.T.; supervision, K.A.N., H.F. and A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by RUG and IPB University.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available from ISPO and were used under license; they are therefore not publicly available. The data are, however, available from the authors upon reasonable request and with the permission of ISPO.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. An, X.; Yang, Q.; Bentler, P.M. A latent factor linear mixed model for high-dimensional longitudinal data analysis. Stat. Med. 2013, 32, 4229–4239.
  2. Kondaurova, M.V.; Bergeson, R.R.; Xu, H.; Kitamura, C. Affective Properties of Mothers' Speech to Infants with Hearing Impairment and Cochlear Implants. J. Speech Lang. Hear. Res. 2015, 58, 590–600.
  3. Wang, J.; Luo, S. Multidimensional latent trait linear mixed model: An application in clinical studies with multivariate longitudinal outcomes. Stat. Med. 2017, 36, 3244–3256.
  4. Ng, S.K.; Krishnan, T.; McLachlan, G. The EM algorithm. In Handbook of Computational Statistics; Springer: Berlin, Germany, 2004; pp. 137–168.
  5. McLachlan, G.J.; Krishnan, T. The EM Algorithm and Extensions, 2nd ed.; Wiley: New York, NY, USA, 2007.
  6. Meng, X.L.; Rubin, D.B. Using EM to Obtain Asymptotic Variance-Covariance Matrices: The SEM Algorithm. J. Am. Stat. Assoc. 1991, 86, 899–909.
  7. Cai, L. SEM of another flavour: Two new applications of the supplemented EM algorithm. Br. J. Math. Stat. Psychol. 2008, 61, 309–329.
  8. Cai, L.; Lee, T. Covariance Structure Model Fit Testing Under Missing Data: An Application of the Supplemented EM Algorithm. Multivar. Behav. Res. 2009, 44, 281–304.
  9. Tian, W.; Cai, L.; Thissen, D.; Xin, T. Numerical Differentiation Methods for Computing Error Covariance Matrices in Item Response Theory Modeling: An Evaluation and a New Proposal. Educ. Psychol. Meas. 2012, 73, 412–439.
  10. Caraka, R.E.; Noh, M.; Chen, R.C.; Lee, Y.; Gio, P.U.; Pardamean, B. Connecting climate and communicable disease to penta helix using hierarchical likelihood structural equation modelling. Symmetry 2021, 13, 657.
  11. Pritikin, J.N. A comparison of parameter covariance estimation methods for item response models in an expectation-maximization framework. Cogent Psychol. 2017, 4, 1–11.
  12. Orchard, T.; Woodbury, M. A Missing Information Principle: Theory and Applications. In Theory of Statistics; University of California Press: Berkeley, CA, USA, 1972; Volume 1, pp. 697–715. Available online: https://projecteuclid.org/download/pdf_1/euclid.bsmsp/1200514117 (accessed on 1 June 2021).
  13. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum Likelihood from Incomplete Data via the EM Algorithm. J. R. Stat. Soc. Ser. B 1977, 39, 1–38.
  14. Little, R.J.A.; Rubin, D.B. Statistical Analysis with Missing Data; Wiley: New York, NY, USA, 2002.
  15. Abel, G.J. International Migration Flow Table Estimation. Ph.D. Thesis, University of Southampton, Southampton, UK, 2009.
  16. Interuniversitair Steunpunt Politieke-Opinieonderzoek. General Election Study: Codebook and Questionnaire; ISPO: Leuven, Belgium, 1991.
  17. Interuniversitair Steunpunt Politieke-Opinieonderzoek. General Election Study: Codebook and Questionnaire; ISPO: Leuven, Belgium, 1995.
  18. Interuniversitair Steunpunt Politieke-Opinieonderzoek. General Election Study: Codebook and Questionnaire; ISPO: Leuven, Belgium, 1999.
  19. Billiet, J. Church Involvement, Individualism, and Ethnic Prejudice among Flemish Roman Catholics: New Evidence of a Moderating Effect. J. Sci. Study Relig. 1995, 34, 224–233.
  20. Billiet, J.; Coffe, H.; Maddens, B. Een Vlaams-nationale identiteit en de houding tegenover allochtonen in een longitudinaal perspectief. In Proceedings of the Marktdag Sociologie; Universitaire Pers Leuven: Leuven, Belgium, 2005.
  21. Toharudin, T.; Oud, J.H.L.; Billiet, J.B. Assessing the relationships between Nationalism, Ethnocentrism, and Individualism in Flanders using Bergstrom's approximate discrete model. Stat. Neerl. 2008, 62, 83–103.
  22. Toharudin, T.; Oud, J.H.L.; Billiet, J.; Folmer, H. Measuring Authoritarianism with Different Sets of Items in a Longitudinal Study. In Methods, Theories, and Empirical Applications in the Social Sciences; Salzborn, S., Davidov, E., Reinecke, J., Eds.; Springer: Heidelberg, Germany, 2012; pp. 193–200.
  23. Angraini, Y.; Toharudin, T.; Folmer, H.; Oud, J.H.L. The Relationships between Individualism, Nationalism, Ethnocentrism, and Authoritarianism in Flanders: A Continuous Time-Structural Equation Modeling Approach. Multivar. Behav. Res. 2014, 49, 41–53.
  24. Meng, X.L.; Rubin, D.B. Maximum likelihood estimation via the ECM algorithm: A general framework. Biometrika 1993, 80, 267–278.
  25. Van Dyk, D.A.; Meng, X.L.; Rubin, D.B. Maximum Likelihood Estimation via the ECM Algorithm: Computing the Asymptotic Variance. Stat. Sin. 1995, 5, 55–75.
  26. Li, H.; Tian, W. Slashed Lomax distribution and regression model. Symmetry 2020, 12, 1877.
  27. Liu, C.; Rubin, D.B. The ECME algorithm: A simple extension of EM and ECM with faster monotone convergence. Biometrika 1994, 81, 633–648.
  28. Liu, C.; Rubin, D.B.; Wu, Y.N. Parameter Expansion to Accelerate EM: The PX-EM Algorithm. Biometrika 1998, 85, 755–770.
Figure 1. (a) Boxplots of the parameter estimates of the latent mixed regression models ($\beta$). (b) Boxplots of the parameter estimates of the latent mixed regression models ($\sigma_a$). (c) Boxplots of the parameter estimates of the latent mixed regression models ($\sigma_\varepsilon$).
Figure 2. Line plot for the standard deviation of beta.
Table 1. The absolute difference between the true parameters and the SEM averages.
| No | Para | True | N = 500, S = 50 | N = 500, S = 250 | N = 1000, S = 50 | N = 1000, S = 250 | N = 1500, S = 50 | N = 1500, S = 250 |
|---|---|---|---|---|---|---|---|---|
| 1 | $\lambda_{2,1}$ | 1 | 0.0013 | 0.0012 | 0.0010 | 0.0009 | 0.0009 | 0.0009 |
| 2 | $\lambda_{3,1}$ | 1 | 0.0019 | 0.0015 | 0.0013 | 0.0009 | 0.0012 | 0.0009 |
| 3 | $\lambda_{4,1}$ | 1 | 0.0015 | 0.0009 | 0.0005 | 0.0009 | 0.0012 | 0.0010 |
| 4 | $\lambda_{5,1}$ | 1 | 0.0014 | 0.0016 | 0.0007 | 0.0009 | 0.0007 | 0.0009 |
| 5 | $\lambda_{6,1}$ | 1 | 0.0016 | 0.0014 | 0.0011 | 0.0009 | 0.0013 | 0.0010 |
| 6 | $\lambda_{7,2}$ | 1 | 0.0021 | 0.0018 | 0.0013 | 0.0014 | 0.0016 | 0.0013 |
| 7 | $\lambda_{8,2}$ | 1 | 0.0015 | 0.0016 | 0.0017 | 0.0014 | 0.0010 | 0.0013 |
| 8 | $\lambda_{9,2}$ | 1 | 0.0020 | 0.0015 | 0.0015 | 0.0014 | 0.0019 | 0.0013 |
| 9 | $\lambda_{10,2}$ | 1 | 0.0015 | 0.0015 | 0.0015 | 0.0014 | 0.0022 | 0.0017 |
| 10 | $\lambda_{11,2}$ | 1 | 0.0019 | 0.0012 | 0.0017 | 0.0017 | 0.0016 | 0.0016 |
| 11 | $\tau_1^2$ | 0.5 | 0.0007 | 0.0001 | 0.0006 | 0.0004 | 0.0009 | 0.0004 |
| 12 | $\tau_2^2$ | 0.5 | 0.0003 | 0.0006 | 0.0000 | 0.0011 | 0.0019 | 0.0006 |
| 13 | $\tau_3^2$ | 0.5 | 0.0027 | 0.0001 | 0.0021 | 0.0001 | 0.0005 | 0.0007 |
| 14 | $\tau_4^2$ | 0.5 | 0.0007 | 0.0009 | 0.0002 | 0.0002 | 0.0005 | 0.0008 |
| 15 | $\tau_5^2$ | 0.5 | 0.0002 | 0.0006 | 0.0010 | 0.0013 | 0.0010 | 0.0006 |
| 16 | $\tau_6^2$ | 0.5 | 0.0044 | 0.0010 | 0.0017 | 0.0005 | 0.0004 | 0.0004 |
| 17 | $\tau_7^2$ | 0.5 | 0.0005 | 0.0008 | 0.0016 | 0.0021 | 0.0017 | 0.0004 |
| 18 | $\tau_8^2$ | 0.5 | 0.0027 | 0.0012 | 0.0006 | 0.0006 | 0.0011 | 0.0009 |
| 19 | $\tau_9^2$ | 0.5 | 0.0016 | 0.0010 | 0.0004 | 0.0009 | 0.0001 | 0.0004 |
| 20 | $\tau_{10}^2$ | 0.5 | 0.0009 | 0.0012 | 0.0012 | 0.0005 | 0.0003 | 0.0008 |
| 21 | $\tau_{11}^2$ | 0.5 | 0.0001 | 0.0001 | 0.0013 | 0.0000 | 0.0012 | 0.0001 |
| 22 | $\tau_{12}^2$ | 0.5 | 0.0011 | 0.0012 | 0.0001 | 0.0002 | 0.0011 | 0.0002 |
| 23 | $\beta_1^1$ | 1 | 0.0040 | 0.0036 | 0.0032 | 0.0004 | 0.0018 | 0.0007 |
| 24 | $\beta_1^2$ | −1 | 0.0111 | 0.0079 | 0.0008 | 0.0008 | 0.0034 | 0.0017 |
| 25 | $\beta_2^1$ | 0 | 0.0038 | 0.0018 | 0.0012 | 0.0007 | 0.0016 | 0.0003 |
| 26 | $\beta_2^2$ | 0 | 0.0031 | 0.0039 | 0.0014 | 0.0021 | 0.0044 | 0.0006 |
| 27 | $\beta_3^1$ | 1 | 0.0012 | 0.0016 | 0.0008 | 0.0003 | 0.0007 | 0.0004 |
| 28 | $\beta_3^2$ | 1 | 0.0016 | 0.0010 | 0.0002 | 0.0004 | 0.0000 | 0.0005 |
| 29 | $\beta_4^1$ | 1 | 0.0025 | 0.0027 | 0.0002 | 0.0001 | 0.0007 | 0.0002 |
| 30 | $\beta_4^2$ | −1 | 0.0018 | 0.0025 | 0.0004 | 0.0001 | 0.0003 | 0.0001 |
| 31 | $\sigma_{a,11}$ | 3 | 0.0332 | 0.0444 | 0.0152 | 0.0154 | 0.0039 | 0.0160 |
| 32 | $\sigma_{a,12}$ | 1 | 0.0410 | 0.0120 | 0.0024 | 0.0057 | 0.0265 | 0.0085 |
| 33 | $\sigma_{a,13}$ | 1.5 | 0.0289 | 0.0200 | 0.0218 | 0.0141 | 0.0229 | 0.0133 |
| 34 | $\sigma_{a,14}$ | 1 | 0.0286 | 0.0131 | 0.0077 | 0.0083 | 0.0169 | 0.0094 |
| 35 | $\sigma_{a,22}$ | 3 | 0.0233 | 0.0105 | 0.0247 | 0.0017 | 0.0276 | 0.0150 |
| 36 | $\sigma_{a,23}$ | 1 | 0.0003 | 0.0045 | 0.0158 | 0.0042 | 0.0124 | 0.0051 |
| 37 | $\sigma_{a,24}$ | 2 | 0.0073 | 0.0020 | 0.0057 | 0.0004 | 0.0126 | 0.0089 |
| 38 | $\sigma_{a,33}$ | 3 | 0.0014 | 0.0125 | 0.0383 | 0.0260 | 0.0387 | 0.0274 |
| 39 | $\sigma_{a,34}$ | 1 | 0.0080 | 0.0075 | 0.0236 | 0.0069 | 0.0043 | 0.0022 |
| 40 | $\sigma_{a,44}$ | 3 | 0.0021 | 0.0015 | 0.0215 | 0.0020 | 0.0148 | 0.0094 |
| 41 | $\sigma_{\varepsilon,11}$ | 0.5 | 0.0020 | 0.0013 | 0.0028 | 0.0005 | 0.0015 | 0.0013 |
| 42 | $\sigma_{\varepsilon,12}$ | 0.2 | 0.0017 | 0.0006 | 0.0006 | 0.0003 | 0.0006 | 0.0001 |
| 43 | $\sigma_{\varepsilon,22}$ | 0.5 | 0.0021 | 0.0033 | 0.0027 | 0.0029 | 0.0028 | 0.0023 |
Table 2. The parameter estimates for V ( β ^ ) .
| Number of Subjects | Parameter | 2nd Moment, S = 50 | 2nd Moment, S = 250 | SEM, S = 50 | SEM, S = 250 |
|---|---|---|---|---|---|
| 500 | $\beta_1^1$ | 0.0512 | 0.0528 | 0.0249 | 0.0251 |
| | $\beta_1^2$ | 0.0587 | 0.0530 | 0.0187 | 0.0195 |
| | $\beta_2^1$ | 0.0378 | 0.0392 | 0.0110 | 0.0126 |
| | $\beta_2^2$ | 0.0391 | 0.0379 | 0.0164 | 0.0158 |
| | $\beta_3^1$ | 0.0184 | 0.0190 | 0.0245 | 0.0268 |
| | $\beta_3^2$ | 0.0184 | 0.0190 | 0.0212 | 0.0200 |
| | $\beta_4^1$ | 0.0283 | 0.0283 | 0.0105 | 0.0105 |
| | $\beta_4^2$ | 0.0288 | 0.0308 | 0.0145 | 0.0145 |
| 1000 | $\beta_1^1$ | 0.0338 | 0.0327 | 0.0179 | 0.0184 |
| | $\beta_1^2$ | 0.0342 | 0.0339 | 0.0138 | 0.0134 |
| | $\beta_2^1$ | 0.0232 | 0.0232 | 0.0071 | 0.0077 |
| | $\beta_2^2$ | 0.0253 | 0.0232 | 0.0105 | 0.0100 |
| | $\beta_3^1$ | 0.0095 | 0.0100 | 0.0176 | 0.0184 |
| | $\beta_3^2$ | 0.0089 | 0.0095 | 0.0130 | 0.0130 |
| | $\beta_4^1$ | 0.0141 | 0.0130 | 0.0077 | 0.0077 |
| | $\beta_4^2$ | 0.0130 | 0.0130 | 0.0110 | 0.0105 |
| 1500 | $\beta_1^1$ | 0.0276 | 0.0253 | 0.0152 | 0.0152 |
| | $\beta_1^2$ | 0.0270 | 0.0265 | 0.0105 | 0.0110 |
| | $\beta_2^1$ | 0.0187 | 0.0182 | 0.0063 | 0.0063 |
| | $\beta_2^2$ | 0.0164 | 0.0192 | 0.0084 | 0.0084 |
| | $\beta_3^1$ | 0.0071 | 0.0071 | 0.0145 | 0.0152 |
| | $\beta_3^2$ | 0.0063 | 0.0071 | 0.0105 | 0.0105 |
| | $\beta_4^1$ | 0.0110 | 0.0105 | 0.0063 | 0.0063 |
| | $\beta_4^2$ | 0.0110 | 0.0105 | 0.0084 | 0.0084 |
Table 3. Correlation matrix of random effects.
| Latent Factor | Random Effect | $a_{11}$ | $a_{12}$ | $a_{21}$ | $a_{22}$ | $a_{31}$ | $a_{32}$ | $a_{41}$ | $a_{42}$ |
|---|---|---|---|---|---|---|---|---|---|
| I | $a_{11}$ | 1 | 0.892 | 0.832 | 0.903 | 0.966 | 0.930 | 0.956 | 0.880 |
| | $a_{12}$ | 0.892 | 1 | 0.854 | 0.878 | 0.937 | 0.931 | 0.913 | 0.895 |
| N | $a_{21}$ | 0.832 | 0.854 | 1 | 0.311 | 0.864 | 0.883 | 0.856 | 0.835 |
| | $a_{22}$ | 0.903 | 0.878 | 0.311 | 1 | 0.918 | 0.893 | 0.899 | 0.861 |
| E | $a_{31}$ | 0.966 | 0.937 | 0.864 | 0.918 | 1 | 0.946 | 0.973 | 0.919 |
| | $a_{32}$ | 0.930 | 0.931 | 0.883 | 0.893 | 0.946 | 1 | 0.946 | 0.918 |
| A | $a_{41}$ | 0.956 | 0.913 | 0.856 | 0.899 | 0.973 | 0.946 | 1 | 0.873 |
| | $a_{42}$ | 0.880 | 0.895 | 0.835 | 0.861 | 0.919 | 0.918 | 0.873 | 1 |
Table 4. Parameter estimates of β .
| Parameter | Estimate | SE($\hat{\beta}$) | Lower | Upper |
|---|---|---|---|---|
| Individualism | −0.195 | 0.004 | −0.203 | −0.187 |
| Nationalism | −0.177 | 0.028 | −0.231 | −0.123 |
| Ethnocentrism | −0.252 | 0.011 | −0.273 | −0.231 |
| Authoritarianism | −0.186 | 0.002 | −0.190 | −0.182 |
| Slope of Male on Individualism | 0.110 | 0.010 | 0.091 | 0.129 |
| Slope of Male on Nationalism | 0.219 | 0.017 | 0.185 | 0.253 |
| Slope of Male on Ethnocentrism | −0.038 | 0.003 | −0.045 | −0.031 |
| Slope of Male on Authoritarianism | 0.022 | 0.017 | −0.011 | 0.055 |