Modeling Model Misspecification in Structural Equation Models

Alexander Robitzsch

doi:10.3390/stats6020044

Abstract

Structural equation models constrain mean vectors and covariance matrices and are frequently applied in the social sciences. Frequently, the structural equation model is misspecified to some extent. In many cases, researchers nevertheless intend to work with a misspecified target model of interest. In this article, a simultaneous statistical inference for sampling errors and model misspecification errors is discussed. A modified formula for the variance matrix of the parameter estimate is obtained by imposing a stochastic model for model errors and applying M-estimation theory. The presence of model errors is quantified in increased standard errors in parameter estimates. The proposed inference is illustrated with several analytical examples and an empirical application.

Keywords:

model misspecification; model error; structural equation modeling; M-estimation

1. Introduction

Confirmatory factor analysis (CFA) and structural equation models (SEM) are statistical approaches to analyzing multivariate data in the social sciences [1,2,3,4,5,6,7]. These models relate a multivariate vector

X = (X_{1}, \dots, X_{I})

of observed (i.e., manifest) I variables (also referred to as indicators or items) to a vector of latent variables (i.e., factors)

η

. SEMs impose constraints on the mean vector

μ

and the covariance matrix

Σ

of the random variable

X

as a function of an unknown parameter vector

θ

. In particular, the mean vector is represented as

μ (θ)

, and the covariance matrix is represented by

Σ (θ)

.

SEM, and CFA as a particular case, define a measurement model that relates the observed variables

X

to latent variables

η

X = ν + Λ η + ϵ .

(1)

In addition, we denote the covariance matrix

Var (ϵ) = Ψ

, and

η

and

ϵ

are multivariate normally distributed random vectors. The random vectors

η

and

ϵ

are assumed to be uncorrelated. In CFA, these vectors follow a multivariate normal (MVN) distribution as

η \sim MVN (α, Φ)

and

ϵ \sim MVN (0, Ψ)

. Hence, we can write the mean and the covariance matrix in CFA as

μ (θ) = ν + Λ α and Σ (θ) = Λ Φ Λ^{⊤} + Ψ .

(2)

In SEM, relationships among the latent variables can be specified as regression models or path models

η = B η + ξ with E (ξ) = α and Var (ξ) = Φ,

(3)

where

B

denotes a matrix of regression coefficients. Hence, the mean vector and the covariance matrix are represented in SEM as

μ (θ) = ν + Λ {(I - B)}^{- 1} α and Σ (θ) = Λ {(I - B)}^{- 1} Φ {[{(I - B)}^{- 1}]}^{⊤} Λ^{⊤} + Ψ,

(4)

where

I

denotes the identity matrix.

In practice, SEM parsimoniously parametrizes the mean vector and the covariance matrix using a parameter vector

θ

as a statistical summary. Such restrictions are unlikely to hold in practice, and model assumptions in SEM are only an approximation of a true data-generating model. In SEM, model deviations (e.g., model errors, model misspecification) in covariances emerge as a difference between a population covariance matrix

Σ

and a model-implied covariance matrix

Σ (θ)

(see Refs. [8,9]). Furthermore, there can be differences in the population mean vector

μ

and the model-implied mean vector

μ (θ)

.

This article addresses how to include model misspecification in statistical inference for parameter estimates. Wu and Browne [9,10] proposed an estimation approach that simultaneously models sampling errors and model errors. They do so by modifying the estimation function in the SEM and estimating the model with the maximum likelihood approach. Uanhoro [11] builds on the approach of Wu and Browne [9] but employs Bayesian (i.e., Markov chain Monte Carlo) estimation. Both approaches have in common that the presence of model errors is quantified in increased standard errors in parameter estimates. In this article, the estimation function in the SEM remains unchanged. We derive a simultaneous statistical inference regarding sampling and model errors based on M-estimation theory [12]. As a consequence, this article only addresses an alternative method of estimating standard errors in SEMs. The estimates of model parameters in the SEM are left unchanged.

The remainder of the article is organized as follows. Different estimation methods and standard error estimates with respect to sampling error are reviewed in Section 2. Section 3 presents a stochastic model for model errors and derives the extended variance formula that simultaneously addresses sampling errors and model errors. In Section 4, three analytical illustrative examples show how model errors are reflected in the variance of parameter estimates. Section 5 presents a numerical example using a survey dataset in which the proposed approach is applied. Finally, the article closes with a discussion in Section 6.

2. Estimating Structural Equation Models

In this section, we review different estimation methods for multiple-group SEMs. Note that some identification constraints at the population level must be imposed to estimate the SEM [2,13,14]. When modeling multivariate normally distributed data without missing data, the empirical mean vector

\bar{x}

and the empirical covariance matrix

S

are sufficient statistics for an unknown mean vector

μ

and covariance matrix

Σ

. Hence, the statistics

\bar{x}

and

S

are also sufficient for the parameter vector

θ

of the SEM.

Assume that there are G groups with sample sizes

N_{g}

and empirical means

{\bar{x}}_{g}

and covariance matrices

S_{g}

for groups

g = 1, \dots, G

. The population mean vectors are denoted by

μ_{g}

, and the population covariance matrices are denoted by

Σ_{g}

(

g = 1, \dots, G

). The model-implied mean vectors are denoted by

μ_{g} (θ)

and the model-implied covariance matrix by

Σ_{g} (θ)

. The parameter vector

θ

can have common parameters across groups and parameters that are group-specific. For example, in a CFA, equal factor loadings and item intercepts across groups are imposed (i.e., measurement invariance holds; [15,16]) by assuming the same loading matrix

Λ

and the same intercept vector

ν

across groups, while mean vectors and covariance matrices are allowed to differ across groups.

The maximum likelihood (ML) function for the parameter

θ

in the SEM is given by the following (see Refs. [2,4]):

F_{ML} (θ; {{\bar{x}}_{g}}, {S_{g}}) = - \sum_{g = 1}^{G} \frac{N_{g}}{2} (- I log (2 π) + \log | Σ_{g} (θ) | + tr (S_{g} Σ_{g} {(θ)}^{- 1}) + {({\bar{x}}_{g} - μ_{g} (θ))}^{⊤} Σ_{g} {(θ)}^{- 1} ({\bar{x}}_{g} - μ_{g} (θ))),

(5)

where

{{\bar{x}}_{g}}

and

{S_{g}}

denote the sets of the empirical mean vectors and empirical covariance matrices for groups

g = 1, \dots, G

, respectively. In practice, the model-implied covariance matrix can be misspecified [17,18,19,20], and

θ

is a pseudo-true parameter defined as the maximizer of the fitting function

F_{ML}

in (5). Importantly,

θ

does not refer to a parameter of the data-generating model in this case, but it should be interpreted as a summary of the data that is of central interest to the researcher.

A more general class of fitting functions in SEMs is weighted least squares (WLS) estimation [3,4,21]. The parameter vector

θ

is determined as the minimizer of

F_{WLS} (θ; {{\bar{x}}_{g}}, {S_{g}}) = \sum_{g = 1}^{G} {({\bar{x}}_{g} - μ_{g} (θ))}^{⊤} W_{1 g} ({\bar{x}}_{g} - μ_{g} (θ)) + \sum_{g = 1}^{G} {(s_{g} - σ_{g} (θ))}^{⊤} W_{2 g} (s_{g} - σ_{g} (θ)),

(6)

where matrices

Σ

and

S

have been replaced by vectors

σ

and

s

that collect all nonduplicated elements of the matrices in vectors. Formally, we denote the vech operator for this transformation that defines

σ_{g} = vech (Σ_{g})

and

s_{g} = vech (S_{g})

. The weight matrices

W_{1 g}

and

W_{2 g}

(

g = 1, \dots, G

) can also depend on parameters that must be estimated prior to solving the estimation problem (6). Diagonally weighted least squares (DWLS) estimation results by choosing diagonal weight matrices

W_{g 1}

and

W_{g 2}

. In this case, the fitting function can be written as

F_{DWLS} (θ; {{\bar{x}}_{g}}, {S_{g}}) = \sum_{g = 1}^{G} \sum_{i = 1}^{I} w_{1 g i} {({\bar{x}}_{g i} - μ_{g i} (θ))}^{2} + \sum_{g = 1}^{G} \sum_{i = 1}^{I} \sum_{j = i}^{I} w_{2 g i j} {(s_{g i j} - σ_{g i j} (θ))}^{2},

(7)

where

w_{1 g i}

and

w_{2 g i j}

are appropriate elements in

W_{1 g}

and

W_{2 g}

, respectively. Unweighted least squares (ULS) estimation is obtained by setting all weights

w_{1 g i}

and

w_{2 g i j}

equal to one.

Interestingly, the minimization of

F_{DWLS}

in (6) with respect to the parameter

θ

can be viewed as a nonlinear least squares estimation problem with sufficient statistics

{{\bar{x}}_{g}}

and

{S_{g}}

as input data [22]. It has been shown that ML estimation can be approximately written as DWLS estimation [23] with particular weight matrices. The weights are approximately determined by

w_{1 g i} = 1 / u_{g i}^{2}

and

w_{2 g i j} = 1 / (u_{g i}^{2} u_{g j}^{2})

, where

u_{g i}^{2}

are sample unique standardized variances with

u_{g i}^{2} = ψ_{g i i} / σ_{g i i}

(see Ref. [23]).

The fitting functions can be slightly more generally formulated as a sum of group-specific fitting functions

F (θ, \hat{ξ}) = \sum_{g = 1}^{G} F_{g} (θ, {\hat{ξ}}_{g}),

(8)

where

{\hat{ξ}}_{g} = ({\bar{x}}_{g}, s_{g})

denote the vectors of group-specific sufficient statistics (

g = 1, \dots, G

). The parameter estimate

\hat{θ}

is obtained as the root of the partial derivative of F with respect to

θ

defined in (8):

F_{θ} (θ, \hat{ξ}) = \sum_{g = 1}^{G} F_{g, θ} (θ, {\hat{ξ}}_{g}) = 0,

(9)

where

F_{θ}

and

F_{g, θ}

denote the partial derivatives of F and

F_{g}

with respect to

θ

, respectively. The parameter estimate

\hat{θ}

is a nonlinear function of the input vector of sufficient statistics

\hat{ξ}

. Hence, the distribution of

\hat{θ}

can be expressed as a function of the distribution of

\hat{ξ}

by applying the multivariate delta method [17,24] (see also [25]).

The asymptotic distribution of the vector of sufficient statistics

{\hat{ξ}}_{g}

is given as

{\hat{ξ}}_{g} - ξ = (\begin{matrix} {\bar{x}}_{g} \\ s_{g} \end{matrix}) - (\begin{matrix} μ_{g} \\ σ_{g} \end{matrix}) \sim MVN (0, V_{g}) .

(10)

The covariance matrix

V_{g}

is determined by

V_{g} = N_{g}^{- 1} (\begin{matrix} Σ_{g} & 0 \\ 0 & K (Σ_{g} \otimes Σ_{g}) K^{⊤} \end{matrix}),

(11)

where ⊗ denotes the Kronecker product and

K

is a matrix containing entries 0, 0.5, and 1 such that

σ_{g} = vech (Σ_{g}) = K vec (Σ_{g})

, where the

vec

operator stacks all elements of a matrix into a vector. The covariance matrix

V_{g}

in (11) can be estimated by substituting the population covariance matrix

Σ_{g}

with the empirical covariance matrix

S_{g}

:

{\hat{V}}_{g} = N_{g}^{- 1} (\begin{matrix} S_{g} & 0 \\ 0 & K (S_{g} \otimes S_{g}) K^{⊤} \end{matrix}) .

(12)

The covariance matrix

V = Var (\hat{ξ})

is given as a block-diagonal matrix of covariance matrices

V_{g}

for

g = 1, \dots, G

:

V = (\begin{matrix} V_{1} & 0 & \dots & 0 \\ 0 & V_{2} & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & V_{G} \end{matrix}) .

(13)

A corresponding estimate

\hat{V}

is obtained by using estimates

{\hat{V}}_{g}

(

g = 1, \dots, G

) for group-specific covariance matrices.

Assume that the population sufficient statistics are denoted by

ξ_{0}

and there exists a pseudo-true parameter

θ_{0}

such that

F_{θ} (θ, ξ_{0}) = 0

. Hence, we can write

ξ_{0} = ξ (θ_{0})

. Note again that the parameter

ξ_{0}

does not refer to a data-generating parameter, but it is defined as a summary of the data by choosing a particular function F. Different pseudo-true parameters

θ_{0}

will be obtained for different choices of fitting functions F in misspecified SEMs; specifically,

θ_{0}

is a function of

ξ_{0}

and F (i.e.,

θ_{0} = g (ξ_{0}, F)

for some function g).

We now derive the covariance matrix of

\hat{θ}

by utilizing a Taylor expansion of

F_{θ}

around

(θ_{0}, ξ_{0})

. Denote by

F_{θ θ}

and

F_{θ ξ}

the matrices of second-order partial derivatives of

F_{θ}

with respect to

θ

and

ξ

, respectively. Then, we obtain

F_{θ} (\hat{θ}, \hat{ξ}) = F_{θ} (θ_{0}, ξ_{0}) + F_{θ θ} (θ_{0}, ξ_{0}) (\hat{θ} - θ_{0}) + F_{θ ξ} (θ_{0}, ξ_{0}) (\hat{ξ} - ξ_{0}) .

(14)

As the parameter estimate

\hat{θ}

is a nonlinear function of

\hat{ξ}

, the Taylor expansion (14) provides the approximation

\hat{θ} - θ_{0} = - F_{θ θ} {(θ_{0}, ξ_{0})}^{- 1} F_{θ ξ} (θ_{0}, ξ_{0}) (\hat{ξ} - ξ_{0}) .

(15)

By defining

A = F_{θ θ} (θ_{0}, ξ_{0})

and

B = F_{θ ξ} (θ_{0}, ξ_{0})

, we obtain the multivariate delta formula [17,26]

Var (\hat{θ}) = A^{- 1} B V B^{⊤} {(A^{- 1})}^{⊤} .

(16)

The matrices

A

and

B

can be estimated by

\hat{A} = F_{θ θ} (\hat{θ}, \hat{ξ})

and

\hat{B} = F_{θ ξ} (\hat{θ}, \hat{ξ})

. The estimated covariance matrix

Var (\hat{θ})

in (16) can be used for statistical inference, such as the computation of standard errors or the application of Wald tests.

The standard error (SE) of the lth entry

{\hat{θ}}_{l}

in

\hat{θ}

is given by

SE ({\hat{θ}}_{l}) = \sqrt{{(Var (\hat{θ}))}_{l l}} = \sqrt{{(A^{- 1} B V B^{⊤} {(A^{- 1})}^{⊤})}_{l l}} .

(17)

In this section, we computed the covariance matrix of the parameter estimate

\hat{ξ}

with respect to sampling errors. In particular, we assumed a sampling scheme of identically and independently distributed observations that led to variability in sufficient statistics

\hat{ξ}

, which, in turn, resulted in variability in estimated model parameters

\hat{θ}

across repeated sampling. In the next section, we additionally address the extent of model misspecification errors in the covariance matrix. The presence of model misspecification should be quantified in increased standard errors. We do so by imposing a stochastic model on model specification errors.

3. Modeling Model Misspecification

In this section, we impose a stochastic model on model misspecification in the SEM. At the population level, the population mean vector

μ

can differ from the model-implied mean vector

μ (θ_{0})

, and the (vectorized) population covariance matrix

σ

can differ from the model-implied covariance matrix

σ (θ_{0})

. As in Section 2, we define the vector

ξ = (μ, σ)

, where

ξ

contains all group-specific means and covariances.

3.1. Stochastic Model for Model Misspecification

Assume that there exists a

θ_{0}

such that

ξ = ξ (θ_{0}) + e,

(18)

where

e

constitutes the model specification error. The vector

e

contains deviations (i.e., model misspecification) in all means and pairwise covariances in all groups. Formally, we define

e_{μ} = {(e_{μ, g, i})}_{g = 1, \dots, G; i = 1, \dots, I}

,

e_{σ} = {(e_{σ, g, i})}_{g = 1, \dots, G; i, j = 1, \dots, I for i < j}

and

e = (e_{μ, g}, e_{σ, g})

. Assume that

E (e) = 0

.

To model misspecification in the mean structure, assume that

e_{μ, g, i}

are normally distributed variables with zero mean and variance

τ_{μ}

(i.e.,

E (e_{μ, g, i}) = 0

and

Var (e_{μ, g, i}) = τ_{μ}

). All variables

e_{μ, g, i}

contained in the vector

e_{μ}

are independently and identically distributed.

To model misspecification in the covariance structure, we assume an effect decomposition of the error in the modeled covariance of items

i \neq j

in group g as

e_{σ, g, i j} = u_{g, i} + u_{g, j} + v_{g, i j},

(19)

where

u_{g, i}

and

v_{g, i j}

are uncorrelated random effects for all i and all pairs

i \neq j

, respectively. The model (19) is a cross-classified two-level model [27]. The stochastic model in (19) fundamentally differs from the approach in [11] that assumes independent

e_{σ, g, i j}

effects (i.e., there are no item effects

u_{g, i}

for model errors in covariances). Because an item appears in several item pairs referring to different covariances, we find the inclusion of item effects

u_{g, i}

in (19) more plausible. The appearance of

u_{g, i}

and

u_{g, j}

in (19) might be motivated by the fact that (intentionally) misspecified factor loadings of item i enter all residuals

e_{σ, g, i j}

with

j \neq i

(see Appendix A). For multidimensional factor models, the stochastic model (19) might be made more general (see (A7) in Appendix A). Note that we set

e_{σ, g, i i} = 0

as in [11]. Thus, diagonal entries in the covariance matrix are assumed to be correctly specified at the population level. We can compute for

i \neq j

and

k \neq h

E (e_{σ, g, i j} e_{σ, g, k h}) = \{\begin{matrix} 0 & if card ({i, j} \cap {k, h}) = 0 \\ τ_{σ, 2} & if card ({i, j} \cap {k, h}) = 1 \\ 2 τ_{σ, 2} + τ_{σ, 1} & if card ({i, j} \cap {k, h}) = 2 \end{matrix},

(20)

where

card (A)

denotes the cardinality of a set A. The condition

if card ({i, j} \cap {k, h}) = 2

means that

i = k

and

j = h

; that is,

E (e_{σ, g, i j}^{2}) = 2 τ_{σ, 2} + τ_{σ, 1}

.

3.2. Estimating the Variance Components in the Stochastic Model

The model errors

e_{μ, g, i}

and

e_{σ, g, i j}

are not directly observable. For statistical inference regarding the stochastic model for model misspecification, the variance components in (20) must be estimated. Instead of computing

e = ξ - ξ (θ_{0})

, we compute empirical residuals

\hat{e}

that are defined as

\hat{e} = \hat{ξ} - ξ (\hat{θ})

. Note that these residuals are included in the standard output of widespread SEM software [28,29,30].

One can compute quantities

{\hat{e}}_{μ, g, i}^{2}

and

{\hat{e}}_{σ, g, i j} {\hat{e}}_{σ, g, k h}

for

(i, j, k, h)

as estimates of

e_{μ, g, i}

and

e_{σ, g, i j} e_{σ, g, h k}

and equate them with expected values. This approach is referred to as the method of moments [31]. Define the vector

τ = (τ_{μ}, τ_{σ, 2}, τ_{σ, 1})

. The vector of empirical variances and covariances that contain the product quantities is denoted by

\hat{z}

. According to (20), the expected values of cross-products are linear in

τ

. Hence, the method of moments maps the empirical (co)variances defined in

\hat{z}

to the vector of unknown variance components

τ

using the linear model

\hat{z} = H τ + ε,

(21)

where

H

is an appropriate known design matrix that contains entries 0, 1, or 2. The linear model (21) can be solved by

\hat{τ} = \tilde{H} \hat{z} with \tilde{H} = {(H^{⊤} H)}^{- 1} H .

(22)

Negatively estimated variances can be set to zero.

In the case of our defined variance component model for the mean and the covariance structure, simple formulas for the variance estimates can be derived. The variance

τ_{μ}

can be estimated by

{\hat{τ}}_{μ} = \frac{1}{G I} \sum_{g = 1}^{G} \sum_{i = 1}^{I} {\hat{e}}_{μ, g, i}^{2} .

(23)

Let

M_{a}

(

a = 0, 1, 2

) denote the set of cross-products of residuals

{\hat{e}}_{σ, g, i j}

and

{\hat{e}}_{σ, g, k h}

with

card ({i, j} \cap {k, h}) = a

. We define

Y_{a}

as the average of the products from the set

M_{a}

. Then, we use the estimate

{\hat{τ}}_{σ, 2} = max (0, Y_{1})

. Finally, we compute

{\hat{τ}}_{σ, 1} = max (0, Y_{2} - 2 {\hat{τ}}_{σ, 2})

.

However,

\hat{z}

is affected by sampling error. We can write

\hat{e} - e \sim MVN (0, V_{2})

. The vector

\hat{z}

contains products of normally distributed variables. Hence, the bias

B

in

\hat{z}

due to sampling errors can be estimated by computing

E (\hat{z}) = B + z

(24)

Then, we obtain from (22) and (24)

\hat{τ} = \tilde{H} (\hat{z} - B)

(25)

The bias

B

can also be determined by resampling techniques. The parameter estimates

{\hat{τ}}_{μ}

,

{\hat{τ}}_{σ, 2}

, and

{\hat{τ}}_{σ, 1}

can be repeatedly computed from bootstrap samples of subjects. Then, a bootstrap bias of variance components can be determined [32]. As a result, bias-corrected variance component estimates can be computed. Again, negatively estimated variances are set to zero.

3.3. Error in Model Parameters Due to Model Misspecification

Now, the variance component

τ

referring to model misspecification has been determined. In the next step, we compute the variance in the SEM parameter estimate

\hat{θ}

due to model misspecification error. As in Section 2, we apply a Taylor expansion around

(θ_{0}, ξ_{0})

with

ξ = ξ (θ_{0})

and obtain

F_{θ} (\hat{θ}, ξ) = F_{θ} (θ_{0}, ξ_{0}) + F_{θ θ} (θ_{0}, ξ_{0}) (\hat{θ} - θ_{0}) + \sum_{g = 1}^{G} F_{g, θ ξ} (θ_{0}, ξ_{g, 0}) (ξ_{g} - ξ_{g, 0})

(26)

Using again the abbreviation

A = F_{θ θ} (θ_{0}, ξ_{0})

, we obtain, by solving for

\hat{θ}

in (26),

\hat{θ} - θ_{0} = A^{- 1} \sum_{g = 1}^{G} F_{g, θ ξ} (θ_{0}, ξ_{g, 0}) (ξ_{g} - ξ_{g, 0})

(27)

We now simplify (27) regarding the distributional assumptions of model misspecification. Denote by

M_{g, i}

and

C_{g, i j}

the corresponding second-order derivatives with respect to appropriate entries in

ξ_{g}

in the function

F_{g, θ, ξ}

. Moreover, we set

C_{g, i i} = 0

and note that

C_{g, i j} = C_{g, j i}

. We then obtain, for the variance contribution of group g,

\begin{matrix} F_{g, θ ξ} (θ_{0}, ξ_{g, 0}) (ξ_{g} - ξ_{g, 0}) & = & \sum_{i = 1}^{I} M_{g, i} e_{μ, g, i} + \sum_{i = 1}^{I - 1} \sum_{j = i + 1}^{I} C_{g, i j} e_{σ, g, i j} \\ = & \sum_{i = 1}^{I} M_{g, i} e_{μ, g, i} + \sum_{i = 1}^{I - 1} \sum_{j = i + 1}^{I} C_{g, i j} (u_{g, i} + u_{g, j} + v_{g, i j}) \\ = & \sum_{i = 1}^{I} M_{g, i} e_{μ, g, i} + \sum_{i = 1}^{I} u_{g, i} \sum_{j = 1}^{I} C_{g, i j} + \sum_{i = 1}^{I - 1} \sum_{j = i + 1}^{I} C_{g, i j} v_{g, i j} \end{matrix}

(28)

We can now derive the variance

V_{g} = Var (F_{g, θ ξ} (θ_{0}, ξ_{g, 0}) (ξ_{g} - ξ_{g, 0}))

:

V_{g} = τ_{μ} (\sum_{i = 1}^{I} M_{g, i} M_{g, i}^{⊤}) + τ_{σ, 1} (\sum_{i = 1}^{I} (\sum_{j = 1}^{I} C_{g, i j}) {(\sum_{j = 1}^{I} C_{g, i j})}^{⊤}) + τ_{σ, 2} (\sum_{i = 1}^{I - 1} \sum_{j = i + 1}^{I} C_{g, i j} C_{g, i j}^{⊤}) .

(29)

By using the abbreviation

V = \sum_{g = 1}^{G} V_{g}

, we obtain by using (27)

Var (\hat{θ}) = A^{- 1} V {(A^{- 1})}^{⊤} .

(30)

The misspecification error (ME) for th lth entry

{\hat{θ}}_{l}

in

\hat{θ}

is given by

ME ({\hat{θ}}_{l}) = \sqrt{{(Var (\hat{θ}))}_{l l}} = \sqrt{{(A^{- 1} V {(A^{- 1})}^{⊤})}_{l l}} .

(31)

As an alternative to the proposed analytical solution in this subsection, the uncertainty in parameter estimates due to model misspecification can be assessed by parametric bootstrapping. This procedure is based on the stochastic model (18). If variance components for the stochastic model for residuals

e

in (18) are estimated, a random draw of new residuals

e^{*}

can be conducted for each bootstrap sample, which subsequently provides a draw from the vector of sufficient statistics

ξ^{*} = ξ (\hat{θ}) + e^{*}

. A parameter estimate

\hat{θ}

for each bootstrap sample is obtained by solving

F_{θ} (θ, ξ^{*}) = 0

. By drawing a large number of bootstrap samples, the distribution of

\hat{θ}

with respect to misspecification error can be determined (see [33,34] for a similar approach).

3.4. Computing the Total Error

The variance in (30) is due to the imposed stochastic model on model errors. In addition, there exists a sampling error in the parameter estimate

\hat{θ}

that has been derived in Section 2. By adding the variance matrices computed in (16) (i.e., sampling error) and (30) (i.e., model error), we finally obtain the total variance matrix:

Var (\hat{θ}) = A^{- 1} (B V B^{⊤} + V) {(A^{- 1})}^{⊤} .

(32)

Hence, the total error (TE) of the lth entry

{\hat{θ}}_{l}

in

\hat{θ}

is determined by

TE ({\hat{θ}}_{l}) = \sqrt{SE {({\hat{θ}}_{l})}^{2} + ME {({\hat{θ}}_{l})}^{2}} .

(33)

To summarize, the steps described in the previous and this section resulted in the variance formula (32) that integrates sampling error and model error in a simultaneous inference without changing the estimation equation. The steps described here should be sufficient for the practical implementation of the proposed standard errors.

In the next section, we present illustrative examples of the computation of the model error component for the parameter estimate

\hat{θ}

.

4. Analytical Illustrative Examples

In this section, three illustrative examples are presented in which the variance due to model errors is quantified. In the examples, we assume infinite sample sizes. Specifically, sampling errors are ignored. Furthermore, we only consider ULS estimation. However, we believe that despite the simplified assumptions, the properties of the modeled specification error can be grasped more easily.

4.1. Example 1: Misspecified Error Structure in Unidimensional Factor Analysis

In Example 1, we consider a unidimensional CFA in a single group. We assume equal loadings of one and estimate the factor variance

ϕ

. The residual variances are allowed to vary across items. For I items

X_{1}, \dots, X_{I}

, the covariance is defined as

σ_{i j} = Cov (X_{i}, X_{j})

. The assumed stochastic model for model errors is defined as

σ_{i j} = ϕ + e_{σ, i j},

(34)

where the error is decomposed into (see (19))

e_{σ, i j} = u_{i} + u_{j} + v_{i j}

(35)

with

E (u_{i}) = E (v_{i j}) = 0

,

Var (u_{i}) = τ_{σ, 2}

, and

Var (v_{i j}) = τ_{σ, 1}

.

The estimating equation for

ϕ

in ULS estimation is given by

F_{θ} (θ; ξ) = \sum_{i = 1}^{I - 1} \sum_{j = i + 1}^{I} (σ_{i j} - ϕ) = 0 .

(36)

Hence, the second-order derivative is given by

F_{θ θ} (θ; ξ) = - \frac{I (I - 1)}{2} .

(37)

By inserting the data-generating model for model errors in (36), we obtain

0 = - \frac{I (I - 1)}{2} (\hat{ϕ} - ϕ) + \sum_{i = 1}^{I - 1} \sum_{j = i + 1}^{I} e_{σ, i j}

(38)

Solving this equation with respect to

\hat{ϕ}

, we arrive at

\hat{ϕ} = ϕ + \frac{2}{I} \sum_{i = 1}^{I} u_{i} + \frac{2}{I (I - 1)} \sum_{i = 1}^{I - 1} \sum_{j = i + 1}^{I} v_{i j} .

(39)

Hence, the variance in

\hat{ϕ}

due to model errors can be calculated as

Var (\hat{ϕ}) = \frac{4}{I} τ_{σ, 2} + \frac{2}{I (I - 1)} τ_{σ, 1} .

(40)

4.2. Example 2: Misspecified Error Structure in Confirmatory Factor Analysis

In Example 2, we consider a two-dimensional CFA. The items

X_{1}, \dots, X_{I_{1}}

load on the first factor, while items

X_{I_{1} + 1}, \dots, X_{I_{1} + I_{2}}

load on the second factor. There are no modeled cross-loadings in the analysis model. However, by imposing a stochastic model for model errors, the effect of unmodeled cross-loadings is quantified in increased standard errors in the model parameter estimates.

The data-generating model for covariances

σ_{i j} = Cov (X_{i}, X_{j})

for

i \neq j

is given by

σ_{i j} = \{\begin{matrix} ϕ_{11} + e_{σ, i j} & if i \leq I_{1} and j \leq I_{1} \\ ϕ_{12} + e_{σ, i j} & if i \leq I_{1} and j \geq I_{1} + 1 \\ ϕ_{22} + e_{σ, i j} & if i \geq I_{1} + 1 and j \geq I_{1} + 1 \end{matrix}

(41)

As in Example 1, we impose the stochastic model for misspecified covariances as

e_{σ, i j} = u_{i} + u_{j} + v_{i j} .

(42)

The estimating equation for the factor covariance

ϕ_{12}

based on ULS is given by (see [8])

F_{θ} (θ; ξ) = \sum_{i = 1}^{I_{1}} \sum_{j = I_{1} + 1}^{I_{2}} (σ_{i j} - ϕ_{12}) = 0 .

(43)

Hence, we obtain for the estimate

{\hat{ϕ}}_{12}

{\hat{ϕ}}_{12} = ϕ_{12} + \frac{1}{I_{1} I_{2}} \sum_{i = 1}^{I_{1}} \sum_{j = I_{1} + 1}^{I_{2}} e_{σ, i j} .

(44)

Therefore, we obtain the variance for

{\hat{ϕ}}_{12}

due to model misspecification

Var ({\hat{ϕ}}_{12}) = (\frac{1}{I_{1}} + \frac{1}{I_{2}}) τ_{σ, 2} + \frac{1}{I_{1} I_{2}} τ_{σ, 1} .

(45)

Interestingly, the variance in

{\hat{ϕ}}_{12}

is determined by the smaller number of items per factor (i.e.,

min (I_{1}, I_{2})

) if there exists variance in the random item factor (i.e.,

τ_{σ, 2} > 0

).

4.3. Example 3: Measurement Noninvariance in Multiple-Group SEM

Finally, in Example 3, we quantify the extent of measurement noninvariance in increased standard errors in parameter estimates. It has been argued that misspecified SEM can result if violations of measurement invariance are intentionally ignored in model estimation [35,36,37,38].

We assume a multiple-group unidimensional CFA. We assume model deviations in the mean structure of group g that refer to measurement noninvariance

μ_{g, i} = ν_{i} + λ_{i} α_{g} + e_{μ, g, i} .

(46)

The estimating equation for the group mean

α_{g}

for group g is given by

F_{θ} (θ; ξ) = \sum_{i = 1}^{I} (μ_{g, i} - ν_{i} - λ_{i} α_{g}) = 0,

(47)

where parameters

ν_{i}

and

λ_{i}

are assumed to be already estimated for

i = 1, \dots, I

for ease of presentation. Then, we obtain the estimate

{\hat{α}}_{g}

as

{\hat{α}}_{g} = \frac{\sum_{i = 1}^{I} (μ_{g, i} - ν_{i})}{\sum_{i = 1}^{I} λ_{i}} = α_{g} + \frac{\sum_{i = 1}^{I} e_{μ, g, i}}{\sum_{i = 1}^{I} λ_{i}} .

(48)

Therefore, the variance of

{\hat{α}}_{g}

can be computed as

Var ({\hat{α}}_{g}) = \frac{I}{{(\sum_{i = 1}^{I} λ_{i})}^{2}} τ_{μ}

(49)

If all loadings are set to one (i.e.,

λ_{i} = 1

for all

i = 1, \dots, I

), we obtain from (49)

Var ({\hat{α}}_{g}) = \frac{1}{I} τ_{μ}

(50)

The quantity in (50) corresponds to the well-known linking error of the one-parameter logistic item response model [39,40,41,42]. Hence, the quantification of model misspecification can be seen as an alternative to assessing uncertainty in model parameters regarding the selected items. To some extent, one can argue that modeling model misspecification is conceptually equivalent to assessing linking errors, although linking errors are mainly considered in multiple-group settings.

5. Numerical Illustrative Example: ESS 2005 Data

5.1. Method

In this empirical example, we use a dataset that was also analyzed in [25,43,44,45]. The data came from the European Social Survey (ESS) conducted in the year 2005 (ESS 2005) that included subjects from 26 countries. The latent factor variable of tradition and conformity was assessed by four items presented in portrait format, where the scale of the items is such that a high value represents a low level of tradition conformity. The wording of the four items was as follows (see [45]): “It is important for him to be humble and modest. He tries not to draw attention to himself”. (item TR9); “Tradition is important to him. He tries to follow the customs handed down by his religion or family” (item TR20); “He believes that people should do what they’re told. He thinks people should follow rules at all times, even when no one is watching” (item CO7); and “It is important for him to always behave properly. He wants to avoid doing anything people would say is wrong” (item CO16). The full dataset used in [45] was downloaded from https://www.statmodel.com/Alignment.shtml (accessed on 9 May 2023).

In this application, we used ten selected countries C01, C05, C08, C10, C13, C15, C16, C17, C21, C25 using the country labels from [25]. This resulted in a subsample of

N = 19,916

persons. The sample sizes per country ranged between 1450 and 2622, with an average of 1991.6 (

S D = 375.4

). We only included participants in the sample that had no missing values on all four items.

A multiple-group one-dimensional factor model with 10 groups (i.e., 10 countries) was specified, assuming invariant item intercepts

ν_{i}

and factor loadings

λ_{i}

(

i = 1, \dots, 4

) across countries. The residual variances were allowed to vary across countries. The CFA model was identified by fixing the factor mean of the first group to 0 and the factor variance in the first group to 1.

To compute standard errors with respect to the sampling of persons, nonparametric bootstrapping of persons was conducted. In total,

R = 100

bootstrap samples were drawn. We used the nonparametric bootstrap samples to obtain bias-corrected estimates of the variance components for the stochastic model for misspecification errors (see Section 3.2). Misspecification error in parameter estimates was determined by parametric bootstrapping (see Section 3.3) using 200 bootstrap samples. The total error for all parameter estimates that comprised standard error and misspecification error was computed using Equation (33).

Because jackknifing items were suggested to investigate the stability in parameter estimates due to changes in the model [40,46,47], we computed the misspecification error with a jackknife-based variability measure. The dataset included four items such that a jackknife sample of items included three items. An alternative misspecification error based on jackknife (JKME) was obtained by applying the jackknife standard error formula [32].

The obtained country means and country standard deviations of the factor variable were linearly transformed for all different estimators such that the total population comprising all persons from all 10 countries had a mean of 0 and a standard deviation of 1. Hence, the factor variable was standardized in the total population that comprised all 10 countries. The multiple-group SEM was estimated with ULS.

The analysis was conducted using the sirt:::mgsem() function from the R [48] package sirt [49]. The dataset used in this analysis can be found at https://osf.io/hj3k9/ (accessed on 9 May 2023).

5.2. Results

The variance

τ_{μ}

for residuals in the mean structure was estimated as 0.0332 with the raw estimation method. The bias-corrected estimate used for subsequent statistical inference was slightly smaller, with 0.0328. The estimate of

τ_{σ, 2}

was negative and set to 0. The raw estimate of

τ_{σ, 1}

was 0.0050, while the bias-corrected estimate was 0.0042. From these results, it can be concluded that misspecification was more severe in the mean structure than in the covariance structure.

In Table 1, parameter estimates and their standard errors, misspecification errors, and total errors are displayed. The misspecification errors (ME; estimated by parametric bootstrapping) were substantially larger than the standard error. The error ratio (ER) defined as the quotient of ME and SE was on average 4.69 (

S D = 1.15

) and provided evidence that inferences regarding the factor means are much more affected by the choice of items than the sampling of persons. For factor means, the jackknife misspecification error (JKME) had an average of 0.220 and was very similar to the average of 0.214 of the ME values. Notably, the total error (TE) was mainly determined by the ME. Total errors were on average 480% larger than standard errors.

Table 1. Empirical example: estimated model parameters with their standard errors, misspecification errors, and total errors.

The situation was quite different for factor variances. The error ratio had an average of 1.14, indicating that sampling and modeling errors had a similar impact regarding the uncertainty in factor variances. Total errors were on average 52% larger than standard errors. Notably, the JKME (

M = 0.326

) was much larger than the ME (

M = 0.130

). We suppose that the standard error computation for jackknife does not reflect the stochastic model for residuals in covariances, which might explain this large difference.

In Table 2, factor means and factor variances and their error estimates are presented after standardizing the factor variable in the total population comprising all 10 countries. The ME estimates for factor means had an average of 0.121 (

S D = 0.010

). While the estimates based on jackknifing items (i.e., JKME) had a similar average of 0.133, the variability across countries (

S D = 0.058

) was much larger. This fact reflects the possibility that the extent of model misspecification error is allowed to vary across countries but is assumed as homogeneous across countries when estimating the ME. Moreover, note that the ME estimates for factor means in Table 2 when using the population standardization were much smaller than the factor means in the estimated model that used the first country as the reference (see Table 1). Setting scaling issues aside, this finding can be simply explained by the fact that is represented in the parameter estimate. When using population standardization, a country is compared with an average across countries. In the analysis model that uses a reference country, a respective country is compared with a reference country. In the latter approach, uncertainty within averages across countries is minor so that only the ME for one country is taken into account. However, in the former approach, the ME for a country reflects the misspecification of the corresponding country and the reference country because a comparison is conducted. Hence, the difference is in full alignment with what can be expected when using different identification constraints when estimating factor means and factor variances. In line with the results from Table 1, jackknife-based estimates of ME (i.e., JKME) were substantially larger than the ME estimates.

Table 2. Empricial example: estimated factor means and factor variances after population standardization (i.e., mean of 0 and standard deviation of 1 in the total population) with their standard errors, misspecification errors, and total errors.

6. Discussion

In this article, we present a simultaneous statistical inference regarding sampling errors and model errors in single-group and multiple-group SEMs. Our framework closely follows that of Wu and Browne [9] but differs in the fact that we use the same estimation function (e.g., maximum likelihood or diagonally weighted least squares).

The procedure can be summarized as follows. Let

\hat{ξ}

contain estimated (group) means and (group) covariances. An SEM estimates a parameter

\hat{θ}

that summarizes the mean and covariance structure. Thus, we expect that the model-implied means and covariances approximate or predict the observed means and covariances somehow. We can write

\hat{ξ} \approx ξ (\hat{θ})

. In samples,

\hat{θ}

is an estimate of the population means and covariances

ξ

, and we can write

\hat{ξ} = ξ + ε

with a sampling error

ε

. Typically, the SEM will be misspecified. In other words, there exists some parameter

θ_{0}

that fits the population means and covariances best with respect to a chosen fitting function F. Model specification error exists if there exists a vector of residuals

e

such that

ξ = ξ (θ_{0}) + e

. The vector

e

is also referred to as a model error. Hence, we observe that there simultaneously appear sampling errors

ε

and model errors

e

, and the estimated means and covariances are represented as

\hat{ξ} = ξ (θ_{0}) + e + ε

. In this paper, a stochastic model is imposed on model errors

e

that allows statistical inference for

\hat{θ}

, which is a function of the parameter

θ_{0}

and the two errors

e

and

ε

. Ordinary statistical inference only reflects sampling error

ε

in standard errors in parameter estimates, while the proposed method additionally includes model errors in the standard errors.

Although the illustrations in Section 4 and the empirical example in Section 5 utilized ULS estimation, the derivations apply to any differentiable fitting function for SEM, such as the ML fitting function. In our stochastic model for the modeling of misspecification, we assume that there is no residual error in the diagonal matrix of residuals. Specifically, at the population level, the model-implied variances and the total variances of all items coincide. This is likely fulfilled with ULS estimation if residual variances have group-specific estimates resulting in zero residuals. However, even if the residual variances are group-specific, residuals for variances are typically different from zero in ML estimation. Hence, we suppose that the stochastic model for misspecification must be slightly adapted.

In this article, analytical illustrations and a numerical example are provided. In future studies, it would be interesting to investigate the performance of our approach in simulation studies. Nevertheless, we believe that the proposed method has clear asymptotic foundations. We suspect that the number of items by the number of groups is critical for the reliable estimation of the variance components of the stochastic model for model misspecification.

Because our approach only relies on modeling misspecification in the mean and the covariance structure, it can be directly applied to SEMs with ordinal data by substituting the mean vector with a vector of thresholds and the covariance matrix with a polychoric correlation matrix. In this case, a closer correspondence to linking errors that are mainly discussed in item response models can be investigated.

We would like to note that the simultaneous assessment of sampling errors and model specification errors has similarities to generalizability theory [50,51,52], domain sampling theory [53,54,55], or linking errors [56,57]. Notably, resampling techniques regarding items could also approximate statistical inference with respect to model misspecification [57].

In our approach, we opt for a particular stochastic model to model specification errors. There is always ambiguity in choosing such a stochastic model. For example, model errors in the same item or same item pair might be correlated across groups. Such an extension can also be addressed in our estimation approach with slight changes in the variance formula.

Our derivations show that the misspecification error reduces if the number of items increases. This result is a consequence of assuming independently distributed model errors across items. Sometimes, it might be more plausible to use a two-level model to model misspecification in the mean structure such that items are nested within item groups (or item clusters). In this case, model residuals in the mean structure might be positively correlated within an item group.

In general, one could argue that the stochastic model for model misspecification is of no relevance because it does not refer to an actual sampling model of items or the effects of model discrepancies in data generation (or data collection). We do not believe that this would be a viable objection. A statistical model is always a model in which an investigator defines randomness by the means of random variables, which must not have any connection to a sampling procedure. The model residuals are merely modeled by a random variable and the variability is quantified by a single variance for the mean structure and two variances for the covariance structure. Independence assumptions of residuals should be compared with random sampling assumptions across persons. In a concrete sample, there is no test to determine whether the independence assumption across persons is fulfilled. It is simply a definition that can be useful in applications for statistical inference. The same holds true for model residuals. They are simply assumed to be independent according to a stochastic model. This assumption cannot (be fully) tested.

If model misspecification is present, one can speculate as to whether ML estimation should be the preferred estimation method. ML achieves the most efficient parameter estimation if the analysis model is correctly specified. However, ML can produce more variable estimates for DWLS or ULS estimation in the presence of model misspecification. Hence, choosing between ML and DWLS (or ULS) is a decision regarding whether input data should be more reflected regarding sampling errors (i.e., preferring ML) or model residuals (i.e., preferring DWLS or ULS). We tend to prefer DWLS in many, if not almost all, applications because correct model specification is generally not guaranteed.

There can always be arguments that researchers should not interpret parameter estimates from misspecified models [58]. However, we disagree with such a view. With a data-driven modification of a target model of interest, the meaning of the primary model parameters changes. Hence, researchers implicitly change the meaning of the latent variables and their relationships in an SEM. We do not see why statistics (or psychometrics as a special branch of it) should redefine the target estimand of interest in a data-driven way. In contrast, researchers intentionally use models because they describe some phenomenon of interest. In our view, model misspecification should not lead to model modification but should be reflected as a type of error that can be reported. We believe that including model misspecification as an increase in standard errors in parameter estimates is a viable concept to quantify model errors. We hope that it can be applied to standard research practices that utilize SEMs.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The dataset used in Section 5 can be found at https://osf.io/hj3k9/ (accessed on 9 May 2023).

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CFA	confirmatory factor analysis
DWLS	diagonally weighted least squares
ESS	European Social Survey
ME	misspecification error
ML	maximum likelihood
MVN	multivariate normal
SE	standard error
SEM	structural equation model
TE	total error
WLS	weighted least squares

Appendix A. Motivation of the Stochastic Model (19) for Model Misspecification in Covariances

We now present the motivation for the stochastic model in (19). Assume that the data-generating model is a one-dimensional factor model

X_{i} = ν_{i} + λ_{i} η_{1} + ε_{i}

(A1)

for items

i = 1, \dots, I

. The factor variable

η_{1}

has a mean of 0 and a variance of 1. Moreover, there exist residual covariances

ψ_{i j}

between items i and j. The analysis model assumes equal factor loadings, and residual covariances are unmodeled. We assume that factor loadings can be decomposed into

λ_{i} = λ_{0} + ω_{i},

(A2)

where loading residuals

ω_{i}

have zero means (i.e., the average loading of I items is

λ_{0}

).

The covariance between observed variables

X_{i}

and

X_{j}

is given as

Cov (X_{i}, X_{j}) = λ_{i} λ_{j} = λ_{0}^{2} + λ_{0} ω_{i} + λ_{0} ω_{j} + ω_{i} ω_{j} + ψ_{i j} .

(A3)

The model-implied covariance

σ_{i j} (θ)

is

λ_{0}^{2}

. Hence, the residuals in covariances are computed as

e_{σ, i j} = λ_{0} ω_{i} + λ_{0} ω_{j} + ω_{i} ω_{j} + ψ_{i j} .

(A4)

By defining

u_{i} = λ_{0} ω_{i}

,

v_{i j} = ω_{i} ω_{j} + ψ_{i j}

, we obtain the same stochastic model as in (19)

e_{σ, i j} = u_{i} + u_{j} + v_{i j} .

(A5)

The independence of variables

u_{i}

is assured if the residual loadings

ω_{i}

are assumed to be independent and

E (ω_{i}^{2}) = κ_{ω}

. Now, we additionally assume that

ω_{i}

is normally distributed. Furthermore, we obtain for

i \neq j

and

k \neq h

due to

E (ω_{i} ω_{j}) = 0

the covariance

Cov (ω_{i} ω_{j}, ω_{k} ω_{h}) = \{\begin{matrix} 0 & if card ({i, j} \cap {k, h}) = 0 \\ 0 & if card ({i, j} \cap {k, h}) = 1 \\ κ_{ω}^{2} & if card ({i, j} \cap {k, h}) = 2 \end{matrix} .

(A6)

Due to (A6), the variables

v_{i j}

in (A5) are uncorrelated.

The stochastic model might be made more complicated if the factor model involves multiple latent variables. Assume that each item i loads on a dimension

d [i]

. Assume that products

ω_{i} ω_{j}

are approximately zero. Furthermore, let

ϕ_{d e}

be the covariance between latent factors

η_{d}

and

η_{e}

. Then, a modified stochastic model of (A5) for

d [i] \neq d [j]

is given as

e_{σ, i j} = ϕ_{d [i] d [j]} u_{i} + ϕ_{d [i] d [j]} u_{j} + v_{i j} .

(A7)

The covariances

ϕ_{d e}

can be estimated when fitting the CFA model. Hence, the variances of

u_{i}

and

v_{i j}

in (A7) can be estimated as a cross-classified two-level model with random slopes.

References

Bartholomew, D.J.; Knott, M.; Moustaki, I. Latent Variable Models and Factor Analysis: A Unified Approach; Wiley: New York, NY, USA, 2011. [Google Scholar] [CrossRef]
Bollen, K.A. Structural Equations with Latent Variables; Wiley: New York, NY, USA, 1989. [Google Scholar] [CrossRef]
Browne, M.W.; Arminger, G. Specification and Estimation of Mean-and Covariance-Structure Models. In Handbook of Statistical Modeling for the Social and Behavioral Sciences; Arminger, G., Clogg, C.C., Sobel, M.E., Eds.; Springer: Boston, MA, USA, 1995; pp. 185–249. [Google Scholar] [CrossRef]
Jöreskog, K.G.; Olsson, U.H.; Wallentin, F.Y. Multivariate Analysis with LISREL; Springer: Basel, Switzerland, 2016. [Google Scholar] [CrossRef]
Mulaik, S.A. Foundations of Factor Analysis; CRC Press: Boca Raton, FL, USA, 2009. [Google Scholar]
Shapiro, A. Statistical Inference of Covariance Structures. In Current Topics in the Theory and Application of Latent Variable Models; Edwards, M.C., MacCallum, R.C., Eds.; Routledge: Abingdon-on-Thames, UK, 2012; pp. 222–240. [Google Scholar] [CrossRef]
Yuan, K.H.; Bentler, P.M. Structural Equation Modeling. In Handbook of Statistics; Psychometrics; Rao, C.R., Sinharay, S., Eds.; Elsevier: Amsterdam, The Netherlands, 2007; Volume 26, pp. 297–358. [Google Scholar] [CrossRef]
Robitzsch, A. Comparing the robustness of the structural after measurement (SAM) approach to structural equation modeling (SEM) against local model misspecifications with alternative estimation approaches. Stats 2022, 5, 631–672. [Google Scholar] [CrossRef]
Wu, H.; Browne, M.W. Quantifying adventitious error in a covariance structure as a random effect. Psychometrika 2015, 80, 571–600. [Google Scholar] [CrossRef]
Wu, H. An Empirical Bayesian Approach to Misspecified Covariance Structures. Unpublished Thesis, Ohio State University, Columbus, OH, USA, 2010. Available online: https://bit.ly/3HGuLFT (accessed on 9 May 2023).
Uanhoro, J.O. Modeling misspecification as a parameter in Bayesian structural equation models. Educ. Psychol. Meas. 2023. [Google Scholar] [CrossRef]
Stefanski, L.A.; Boos, D.D. The calculus of M-estimation. Am. Stat. 2002, 56, 29–38. [Google Scholar] [CrossRef]
Bollen, K.A.; Davis, W.R. Two rules of identification for structural equation models. Struct. Equ. Model. 2009, 16, 523–536. [Google Scholar] [CrossRef]
Drton, M.; Foygel, R.; Sullivant, S. Global identifiability of linear structural equation models. Ann. Stat. 2011, 39, 865–886. [Google Scholar] [CrossRef]
Meredith, W. Measurement invariance, factor analysis and factorial invariance. Psychometrika 1993, 58, 525–543. [Google Scholar] [CrossRef]
Putnick, D.L.; Bornstein, M.H. Measurement invariance conventions and reporting: The state of the art and future directions for psychological research. Dev. Rev. 2016, 41, 71–90. [Google Scholar] [CrossRef]
Boos, D.D.; Stefanski, L.A. Essential Statistical Inference; Springer: New York, NY, USA, 2013. [Google Scholar] [CrossRef]
Gourieroux, C.; Monfort, A.; Trognon, A. Pseudo maximum likelihood methods: Theory. Econometrica 1984, 52, 681–700. [Google Scholar] [CrossRef]
Kolenikov, S. Biases of parameter estimates in misspecified structural equation models. Sociol. Methodol. 2011, 41, 119–157. [Google Scholar] [CrossRef]
White, H. Maximum likelihood estimation of misspecified models. Econometrica 1982, 50, 1–25. [Google Scholar] [CrossRef]
Browne, M.W. Generalized least squares estimators in the analysis of covariance structures. S. Afr. Stat. J. 1974, 8, 1–24. Available online: https://bit.ly/3yviejm (accessed on 9 May 2023). [CrossRef]
Savalei, V. Understanding robust corrections in structural equation modeling. Struct. Equ. Model. 2014, 21, 149–160. [Google Scholar] [CrossRef]
MacCallum, R.C.; Browne, M.W.; Cai, L. Factor Analysis Models as Approximations. In Factor Analysis at 100; Cudeck, R., MacCallum, R.C., Eds.; Lawrence Erlbaum: Hillsdale, NJ, USA, 2007; pp. 153–175. [Google Scholar] [CrossRef]
Held, L.; Sabanés Bové, D. Applied Statistical Inference; Springer: Berlin/Heidelberg, Germany, 2014. [Google Scholar] [CrossRef]
Robitzsch, A. Model-robust estimation of multiple-group structural equation models. Algorithms 2023, 16, 210. [Google Scholar] [CrossRef]
Ver Hoef, J.M. Who invented the delta method? Am. Stat. 2012, 66, 124–127. [Google Scholar] [CrossRef]
Gelman, A.; Hill, J. Data Analysis Using Regression and Multilevel/Hierarchical Models; Cambridge University Press: Cambridge, UK, 2006. [Google Scholar] [CrossRef]
Boker, S.; Neale, M.; Maes, H.; Wilde, M.; Spiegel, M.; Brick, T.; Spies, J.; Estabrook, R.; Kenny, S.; Bates, T.; et al. OpenMx: An open source extended structural equation modeling framework. Psychometrika 2011, 76, 306–317. [Google Scholar] [CrossRef] [PubMed]
Fox, J. Teacher’s corner: Structural equation modeling with the sem package in R. Struct. Equ. Model. 2006, 13, 465–486. [Google Scholar] [CrossRef]
Rosseel, Y. lavaan: An R package for structural equation modeling. J. Stat. Softw. 2012, 48, 1–36. [Google Scholar] [CrossRef]
Searle, S.R.; Casella, G.; McCulloch, C.E. Variance Components; Wiley: New York, NY, USA, 1992. [Google Scholar] [CrossRef]
Efron, B.; Tibshirani, R.J. An Introduction to the Bootstrap; CRC Press: Boca Raton, FL, USA, 1994. [Google Scholar] [CrossRef]
Chen, Y.; Li, C.; Xu, G. DIF statistical inference and detection without knowing anchoring items. arXiv 2021, arXiv:2110.11112. [Google Scholar] [CrossRef]
Wang, W.; Liu, Y.; Liu, H. Testing differential item functioning without predefined anchor items using robust regression. J. Educ. Behav. Stat. 2022, 47, 666–692. [Google Scholar] [CrossRef]
Funder, D.C.; Gardiner, G. MIsgivings about measurement invariance. PsyArXiv 2023. [Google Scholar] [CrossRef]
Robitzsch, A. Estimation methods of the multiple-group one-dimensional factor model: Implied identification constraints in the violation of measurement invariance. Axioms 2022, 11, 119. [Google Scholar] [CrossRef]
Robitzsch, A.; Lüdtke, O. Why full, partial, or approximate measurement invariance are not a prerequisite for meaningful and valid group comparisons. Struct. Equ. Model. 2023, 1–12. [Google Scholar] [CrossRef]
Welzel, C.; Inglehart, R.F. Misconceptions of measurement equivalence: Time for a paradigm shift. Comp. Political Stud. 2016, 49, 1068–1094. [Google Scholar] [CrossRef]
Monseur, C.; Berezner, A. The computation of equating errors in international surveys in education. J. Appl. Meas. 2007, 8, 323–335. [Google Scholar]
Monseur, C.; Sibberns, H.; Hastedt, D. Linking errors in trend estimation for international surveys in education. IERI Monogr. Ser. 2008, 1, 113–122. [Google Scholar]
Robitzsch, A.; Lüdtke, O. Linking errors in international large-scale assessments: Calculation of standard errors for trend estimation. Assess. Educ. 2019, 26, 444–465. [Google Scholar] [CrossRef]
Robitzsch, A. Linking error in the 2PL model. J 2023, 6, 58–84. [Google Scholar] [CrossRef]
Knoppen, D.; Saris, W. Do we have to combine values in the Schwartz’ human values scale? A comment on the Davidov studies. Surv. Res. Methods 2009, 3, 91–103. [Google Scholar] [CrossRef]
Beierlein, C.; Davidov, E.; Schmidt, P.; Schwartz, S.H.; Rammstedt, B. Testing the discriminant validity of Schwartz’ portrait value questionnaire items—A replication and extension of Knoppen and Saris (2009). Surv. Res. Methods 2012, 6, 25–36. [Google Scholar] [CrossRef]
Asparouhov, T.; Muthén, B. Multiple-group factor analysis alignment. Struct. Equ. Model. 2014, 21, 495–508. [Google Scholar] [CrossRef]
Gifi, A. Nonlinear Multivariate Analysis; Wiley: New York, NY, USA, 1990. [Google Scholar]
Oberski, D.L. Evaluating sensitivity of parameters of interest to measurement invariance in latent variable models. Polit. Anal. 2014, 22, 45–60. [Google Scholar] [CrossRef]
R Core Team. R: A Language and Environment for Statistical Computing; The R Foundation for Statistical Computing: Vienna, Austria, 2023; Available online: https://www.R-project.org/ (accessed on 15 March 2023).
Robitzsch, A. sirt: Supplementary Item Response Theory Models; The R Foundation for Statistical Computing: Vienna, Austria, 2023; R package version 3.13-162; Available online: https://github.com/alexanderrobitzsch/sirt (accessed on 9 May 2023).
Brennan, R.L. Generalizabilty Theory; Springer: New York, NY, USA, 2001. [Google Scholar] [CrossRef]
Cronbach, L.J.; Gleser, G.C.; Nanda, H.; Rajaratnam, N. The Dependability of Behavioral Measurements: Theory of Generalizability for Scores and Profiles; Wiley: New York, NY, USA, 1972. [Google Scholar]
Husek, T.R.; Sirotnik, K. Item Sampling in Educational Research; CSEIP Occasional Report No. 2; University of California: Los Angeles, CA, USA, 1967; Available online: https://bit.ly/3k47t1s (accessed on 8 May 2023).
Hunter, J.E. Probabilistic foundations for coefficients of generalizability. Psychometrika 1968, 33, 1–18. [Google Scholar] [CrossRef] [PubMed]
McDonald, R.P. Generalizability in factorable domains: “Domain validity and generalizability”. Educ. Psychol. Meas. 1978, 38, 75–79. [Google Scholar] [CrossRef]
McDonald, R.P. Behavior domains in theory and in practice. Alta. J. Educ. Res. 2003, 49, 212–230. [Google Scholar]
Robitzsch, A. L_p loss functions in invariance alignment and Haberman linking with few or many groups. Stats 2020, 3, 246–283. [Google Scholar] [CrossRef]
Robitzsch, A. Robust and nonrobust linking of two groups for the Rasch model with balanced and unbalanced random DIF: A comparative simulation study and the simultaneous assessment of standard errors and linking errors with resampling techniques. Symmetry 2021, 13, 2198. [Google Scholar] [CrossRef]
Steyer, R.; Sengewald, E.; Hahn, S. Some comments on Wu and Browne. Psychometrika 2015, 80, 608–610. [Google Scholar] [CrossRef]

Table 1. Empirical example: estimated model parameters with their standard errors, misspecification errors, and total errors.

Par	Est	SE	ME	JKME	TE
$ν_{1}$	3.070	0.021	0.097	-	0.100
$ν_{2}$	2.698	0.017	0.094	-	0.095
$ν_{3}$	2.602	0.022	0.126	-	0.127
$ν_{4}$	2.678	0.019	0.101	-	0.103
$λ_{1}$	0.591	0.021	0.029	-	0.036
$λ_{2}$	0.567	0.021	0.029	-	0.036
$λ_{3}$	0.685	0.024	0.032	-	0.040
$λ_{4}$	0.556	0.018	0.029	-	0.034
$α_{1}$	0 $^{‡}$	-	-	-	-
$α_{2}$	0.062	0.046	0.226	0.329	0.231
$α_{3}$	0.186	0.061	0.187	0.154	0.197
$α_{4}$	0.326	0.052	0.205	0.339	0.212
$α_{5}$	−0.421	0.031	0.220	0.206	0.223
$α_{6}$	−0.109	0.055	0.214	0.188	0.221
$α_{7}$	−0.012	0.049	0.215	0.141	0.220
$α_{8}$	−0.504	0.042	0.239	0.236	0.243
$α_{9}$	0.232	0.041	0.206	0.174	0.210
$α_{10}$	−0.544	0.050	0.213	0.211	0.218
$ϕ_{1}$	1 $^{‡}$	-	-	-	-
$ϕ_{2}$	1.329	0.141	0.135	0.333	0.195
$ϕ_{3}$	1.132	0.097	0.118	0.183	0.153
$ϕ_{4}$	1.534	0.149	0.152	0.190	0.213
$ϕ_{5}$	1.363	0.100	0.123	0.307	0.158
$ϕ_{6}$	1.423	0.100	0.131	0.477	0.164
$ϕ_{7}$	2.164	0.172	0.154	0.434	0.231
$ϕ_{8}$	1.370	0.116	0.135	0.284	0.178
$ϕ_{9}$	1.142	0.086	0.105	0.308	0.136
$ϕ_{10}$	1.148	0.092	0.116	0.420	0.148

Note. Par = model parameter; Est = parameter estimate; SE = standard error; ME = misspecification error; JKME = misspecification error estimated by jackknifing items; TE = total error based on (33);

ν_{i}

= item intercept;

λ_{i}

= factor loading (

i = 1, \dots, 4

);

α_{g}

= factor mean;

ϕ_{g}

= factor variance (

g = 1, \dots, 10

);

^{‡}

= factor mean

α_{1}

and factor variance

ϕ_{1}

for first country were fixed to 0 and 1, respectively.

Table 2. Empricial example: estimated factor means and factor variances after population standardization (i.e., mean of 0 and standard deviation of 1 in the total population) with their standard errors, misspecification errors, and total errors.

Country	Est	SE	ME	JKME	TE
	Factor Means
1	0.065	0.024	0.124	0.102	0.127
2	0.117	0.023	0.134	0.166	0.136
3	0.220	0.030	0.113	0.074	0.117
4	0.336	0.029	0.116	0.156	0.119
5	−0.285	0.025	0.106	0.215	0.109
6	−0.026	0.031	0.118	0.209	0.122
7	0.056	0.030	0.110	0.026	0.114
8	−0.354	0.025	0.127	0.135	0.130
9	0.258	0.022	0.135	0.136	0.137
10	−0.387	0.023	0.130	0.108	0.132
	Factor Variances
1	0.831	0.027	0.032	0.089	0.042
2	0.958	0.024	0.024	0.034	0.034
3	0.884	0.022	0.027	0.044	0.035
4	1.029	0.024	0.027	0.106	0.036
5	0.970	0.019	0.023	0.033	0.030
6	0.991	0.027	0.028	0.072	0.039
7	1.223	0.019	0.023	0.015	0.029
8	0.973	0.029	0.027	0.049	0.039
9	0.888	0.017	0.030	0.046	0.035
10	0.890	0.020	0.031	0.081	0.037

Note. Est = parameter estimate; SE = standard error; ME = misspecification error; JKME = misspecification error estimated by jackknifing items; TE = total error based on (33).

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.