Parameter Estimation in Spatial Autoregressive Models with Missing Data and Measurement Errors

Li, Tengjun; Zhang, Zhikang; Song, Yunquan

doi:10.3390/axioms13050315

Open AccessArticle

Parameter Estimation in Spatial Autoregressive Models with Missing Data and Measurement Errors

by

Tengjun Li

,

Zhikang Zhang

and

Yunquan Song

^*

College of Science, China University of Petroleum, Qingdao 266580, China

^*

Author to whom correspondence should be addressed.

Axioms 2024, 13(5), 315; https://doi.org/10.3390/axioms13050315

Submission received: 23 January 2024 / Revised: 15 April 2024 / Accepted: 24 April 2024 / Published: 10 May 2024

(This article belongs to the Special Issue Mathematical and Statistical Methods and Their Applications)

Download Review Reports Versions Notes

Abstract

:

This study addresses the problem of parameter estimation in spatial autoregressive models with missing data and measurement errors in covariates. Specifically, a corrected likelihood estimation approach is employed to rectify the bias in the log-maximum likelihood function induced by measurement errors. Additionally, a combination of inverse probability weighting (IPW) and mean imputation is utilized to mitigate the bias caused by missing data. Under several mild conditions, it is demonstrated that the proposed estimators are consistent and possess oracle properties. The efficacy of the proposed parameter estimation process is assessed through Monte Carlo simulation studies. Finally, the applicability of the proposed method is further substantiated using the Boston Housing Dataset.

Keywords:

spatial autoregression; measurement error; corrected likelihood estimation; missing data; inverse probability weighting

MSC:

62F12; 62G08; 62G20; 62J07T07

1. Introduction

Both classic linear regression models and spatial autoregressive models are used to study the linear relationship between a response variable and multiple explanatory variables. The former usually assumes that the observed values of the explained variable are independent of each other. However, in fields such as economics, biology, and meteorology, the collected data often exhibit certain spatial dependencies. Ignoring these dependencies in statistical inference can lead to significantly biased results (see Luo [1]). The latter model, in contrast, considers spatial dependencies, positing that a region’s response variable is not only related to its explanatory variables but also associated with those of neighboring regions (see Chen [2]).

Both classic linear regression models and spatial autoregressive models typically assume that (i) the values of explanatory variables are always observable or measurable (assuming no incomplete observation), and (ii) the observations or measurements are error-free (assuming no measurement errors in explanatory variables) (see Bai et al. [3]). However, these assumptions may be violated in many scientific studies and practical applications. It is well known in statistical analysis that ignoring measurement errors and missing observations in explanatory variables can lead to serious biases in estimation and large standard errors, resulting in incorrect inference on the estimated regression coefficients. Therefore, studying parameter estimation methods for spatial autoregressive models with measurement errors and missing data in explanatory variables is of great importance.

This paper mainly studies the parameter estimation issues in spatial autoregressive models with measurement errors and missing data in explanatory variables.

Many economic datasets are related to spatial locations, such as studies on Gross Domestic Product, tourism, and research and development across various provinces nationwide (see Li [4]). Spatial data introduce spatial location information (or mutual distances) to cross-sectional or panel data. Spatial data are generally recognized to have locational attributes (see Anselin [5]), assuming that variables with closer distances are more closely related. Tobler’s First Law of Geography states that everything is related to everything else, but nearby things are more related than distant things (see Tobler [6]).

Spatial econometrics was first proposed by Jean Paelinck in May 1974 at the Netherlands Statistical Conference, aiming to provide methodological foundations for econometric models of urban and regional economics. Spatial econometrics primarily deals with addressing spatial interactions (spatial autocorrelation) and spatial structures (spatial heterogeneity) in cross-sectional and panel data regression models (see Anselin [7]). The issues studied in spatial econometrics include (1) model setting; (2) parameter estimation; (3) model setting testing; and (4) spatial prediction (see Anselin [8]). Spatial autoregressive models, which incorporate spatial effects into classic regression models, are important for studying spatial autocorrelation in data.

Research on spatial autoregressive models began early. In the 1970s, Cliff and Ord [9] introduced spatial effects into traditional linear regression models, constructing spatial autoregressive models. However, due to the endogeneity caused by spatial dependence, ordinary least-squares parameter estimates are biased and inconsistent. To address this, researchers have proposed other estimation methods to reduce or eliminate bias caused by spatial effects, obtaining consistent parameter estimates. Anselin [5] first applied the concentrated likelihood function method to provide the maximum likelihood estimation (MLE) of the model. However, MLE requires solving complex likelihood functions and is computationally expensive. Kelejian and Prucha [10] proposed the Generalized Method of Moments for spatial autoregressive models, which is relatively simpler than MLE regardless of sample size, and also provided the asymptotic properties of this estimator for large and small samples. Lesage [11] used Bayesian methods based on Gibbs sampling to address the parameter estimation issue in spatial autoregressive models with heteroskedasticity. Lee [12] employed the GMM and 2SLS for spatial autoregressive models, deriving the optimal GMM and proving its consistency and asymptotic normality. In a normal distribution, the optimal GMM and ML estimates have the same limiting distribution. The fundamental idea behind the Generalized Method of Moments (GMM) is to estimate model parameters using the moment conditions within the model. Meanwhile, Two-Stage Least Squares (2SLS) is a commonly employed method to address issues of endogeneity, particularly when instrumental variables are encountered in regression analysis. Lee and Liu [13] extended the GMM in mixed spatial autoregressive models to higher-order mixed spatial autoregressive models. Their research showed that the GMM has computational advantages over the usual ML, and the proposed GMM estimates were proven to be consistent and asymptotically normal. Wei et al. [14] proposed a semi-parametric partially linear varying coefficient spatial autoregressive model and introduced a quasi-maximum likelihood method based on local linear methods to estimate model parameters.

The development of spatial econometrics in China started late, focusing primarily on empirical studies. For example, Ma and Zhang [15] used provincial panel data in China to analyze the impact of economic development and energy structure on haze pollution, finding a positive spatial correlation in inter-provincial haze pollution, with provinces having a higher proportion of coal consumption in their energy structure experiencing more severe haze pollution. Wang et al. [16] used Chinese city panel data to analyze the impact of high-speed rail on economic growth, finding that the opening of high-speed rail strengthens the positive spatial dependence among Chinese cities’ GDPs and has a positive spatial spillover effect on economic growth. Cheng and Dong [17] used panel data from countries along the “Belt and Road” to construct a spatial econometric model to analyze the spatial effects of trade facilitation on China’s industrial goods exports, finding a positive spatial spillover effect. In 2010, China saw its first textbook on spatial econometrics, authored by Shen [18], but this book focused more on modeling and simulation, with less emphasis on theoretical derivation and proof. Overall, domestic research on spatial autoregressive models has achieved certain results but generally exhibits a situation with more empirical studies and fewer theoretical methods.

Spatial data with measurement errors and missing data are commonly found in scientific research and practical applications. Although various techniques can reduce errors and missing data, the measurement errors and missing data sometimes reach a level that cannot be ignored. Therefore, studying spatial autoregressive models with measurement errors and missing data in explanatory variables becomes very important.

Due to spatial dependencies, the assumption of independence among explanatory variables in spatial data no longer holds. Under such circumstances, if one ignores the measurement errors and spatial dependencies present in the data and still employs traditional estimation methods (such as the least-squares method), this can lead to significant biases in the estimation results (see Li [4]). In their study of spatial data linear mixed models with measurement errors, Yi et al. [19] found that ignoring measurement errors leads to reduced regression coefficients and increased variance. To address this issue, they proposed a structural modeling method that obtains model parameters through maximum likelihood estimation while considering measurement errors and uses the EM algorithm for iterative optimization of parameters. They proved that the proposed method’s parameter estimates have good asymptotic properties, i.e., the maximum likelihood estimates are consistent and satisfy asymptotic normality. Huque et al. [20] explored the sensitivity of parameter estimation in spatial regression models when explanatory variables have measurement errors. When errors exist, parameter estimates of the model exhibit attenuation bias. They proved the bias expression of the estimator when ignoring measurement errors, showing that the bias is related to the degree of spatial correlation between explanatory variables and residuals. They also proposed two strategies for obtaining consistent parameter estimates: (1) using an estimated attenuation factor for subsequent correction and (2) linearly transforming the error-prone explanatory variables. Through simulation studies, they assessed the finite sample performance of these two methods. The results showed that both methods can provide consistent parameter estimates, but the transformation method performs better. They also illustrated this method using ischemic heart disease data. Zhang and Zhu [21] proved that for spatial autoregressive models, when the explained variables have measurement errors, whether or not these errors are related to the model’s disturbance terms, the commonly used maximum likelihood estimation is inconsistent. When the null hypothesis is rejected, using 2SLS can yield consistent parameter estimates. He and Hu [22] introduced measurement errors of independent variables into the classic spatial autoregressive model, establishing a univariate spatial autoregressive measurement error (USARME) model, and proposed a parameter estimation method for this model. Their research showed that if one does not consider the measurement errors of independent variables and directly uses the ordinary spatial autoregressive model, the estimated parameters exhibit significant bias. As measurement errors increase, the parameter estimation performance of the ordinary spatial autoregressive model becomes very poor, while the USARME model still achieves good estimation results. The feasibility and reliability of the proposed parameter estimation method were verified through numerical simulations. Luo [1] proposed a three-stage least-squares (3SLS) estimation method that simultaneously uses Berkson and classical types of instrumental variables. Under mild conditions, they derived the asymptotic normality of the estimators proposed for each type of instrumental variable.

In practical applications, for various reasons, some observations in datasets may be missing. For example, in pharmacological studies, due to the side effects of some drugs, some patients are unable to continue treatment and drop out mid-course, leading to missing data. Simply ignoring these missing values not only reduces the efficiency of the study but may also introduce systematic bias (see You [23]).

Types of missing data, classified according to the missing mechanism, can roughly be divided into three categories: Missing Completely at Random (MCAR), Missing at Random (MAR), and Missing Not at Random (MNAR). Among these, randomly missing data are characterized by the distribution of missing data not depending on unobserved data but only on observed data (see Cheng [24]). This article mainly studies this type of missing data, that is, whether the missing data depend only on observable (exogenous) explanatory variables.

Typical methods for dealing with missing data include imputation and inverse probability weighting. Yang et al. [25] proposed a missing data imputation method based on spatial clustering and spatial autoregressive models. This method first uses the DBSCAN algorithm to cluster the dataset, then establishes a spatial autoregressive model within each cluster to impute missing data. DBSCAN (Density-Based Spatial Clustering of Applications with Noise) is a density-based spatial clustering algorithm designed to organize points in a dataset into clusters separated by areas of varying density, with the capability to identify outliers or noise points. Experimental results with meteorological data showed that compared to cluster kernel function-based imputation and K-nearest neighbors (KNN) imputation, this method achieves more accurate imputation results. Wang and Lee [26] considered the situation of randomly missing explanatory variables in spatial autoregressive models. They proposed nonlinear least-squares estimation and the Generalized Method of Moments for the model. Additionally, they proposed an inferential two-stage least-squares estimation method. These estimation methods were analyzed and compared, revealing that the generalized nonlinear least-squares, best-generalized two-stage least-squares, and optimal moment estimation methods have the same asymptotic variance. Monte Carlo simulations showed that these methods provide consistent estimates even in the presence of unknown heteroskedastic disturbances and are more robust compared to the EM algorithm. Luo [1] studied the situation of missing response variable data in spatial autoregressive models. They proposed a two-stage least-squares estimation method based on IPW and missing data imputation (2II-2SLS). They proved the consistency and asymptotic normality of this estimator and studied its finite sample performance through simulations. The results showed that the performance of this estimator is superior to that of the EM estimator and the maximum likelihood estimator and that the choice of initial values has little impact on its performance.

The above-mentioned papers have greatly enhanced our understanding of parameter estimation in spatial autoregressive models with missing data and measurement errors. However, current research mainly focuses on dealing with single types of data issues in spatial autoregressive models, such as considering only missing data or only measurement errors. In contrast, there is a relative lack of research on the more complex situation where both measurement errors and missing data occur simultaneously in spatial autoregressive models. This paper investigates the parameter estimation problem in spatial autoregressive models with both measurement errors and missing data, proposing a method for parameter estimation. The main contributions of this article are as follows:

(1): We establish a parameter estimation method for spatial autoregressive models with missing data and measurement errors, which uses a combination of corrected likelihood estimation and IPW with mean imputation to eliminate biases caused by missing data and measurement errors.
(2): We apply the proposed method to revise and optimize traditional spatial autoregressive models. Based on this, the log-likelihood function of the modified model is presented, and explicit mathematical expressions and analyses are provided for some key parameters, offering deeper insights into the theoretical foundation and practical application of the model.
(3): Under some mild conditions, we prove that the proposed estimates have consistency and oracle properties. Additionally, we conduct extensive numerical studies, proving that our method is superior to others in terms of parameter estimation.

The rest of this paper is organized as follows. In Section 2, the parameter estimation of spatial autoregression with missing data and measurement errors is considered, presenting the theoretical properties of the oracle estimator and proving its consistency and asymptotic normality. Section 3 conducts numerical comparisons and simulation studies. Section 4 illustrates the application of the proposed method through real data analysis. The proof of the technical results is provided in Appendix A.

2. Parameter Estimation in Spatial Autoregressive Models with Measurement Errors and Missing Data

2.1. Spatial Autoregressive Models

We consider the following spatial autoregressive model:

Y_{n} = X_{n} β + λ W_{n} Y_{n} + ε_{n}

(1)

where

Y_{n}

is an

n \times 1

vector of the observed values of the dependent variable;

X_{n}

is an

n \times k

matrix of the observed values of k exogenous covariates;

λ

is a scalar spatial autoregressive coefficient with

|λ| < 1

;

W_{n}

is a known

n \times n

spatial weight matrix;

ε_{n}

is an n-dimensional vector of regression disturbances, independently and identically distributed with mean 0 and finite variance

σ^{2}

; and

β

is a k-dimensional vector of the regression coefficients. Let

θ_{0} = {(σ_{0}^{2}, λ_{0}, β_{0}^{T})}^{T} = {(θ_{1, 0}, θ_{2, 0}, \dots, θ_{k + 2, 0})}^{T}

be the true parameter point. Let

S_{n} (λ) = I_{n} - λ W_{n}

and

ε_{n} (δ) = Y_{n} - X_{n} β - λ W_{n} Y_{n}

, where

δ = {(λ, β^{T})}^{T}

. Then, following the approach of Lee [27], the log-likelihood function of the model is given by:

\begin{matrix} ln L_{n} (θ) & = - \frac{n}{2} ln (2 π) - \frac{n}{2} ln σ^{2} + ln |S_{n} (λ)| - \frac{1}{2 σ^{2}} ε_{n}^{T} (δ) ε_{n} (δ) \\ = - \frac{n}{2} ln (2 π) - \frac{n}{2} ln σ^{2} + ln |S_{n} (λ)| - \frac{1}{2 σ^{2}} [{(S_{n} (λ) Y_{n} - X_{n} β)}^{T} (S_{n} (λ) Y_{n} - X_{n} β)] \end{matrix}

(2)

where

θ = {(σ^{2}, λ, β^{T})}^{T} = {(θ_{1}, θ_{2}, \dots, θ_{k + 2})}^{T}

and

ln L_{n} (θ)

is the log-likelihood function of model (1). Let

ε_{n} = {(e_{1}, e_{2}, \dots, e_{n})}^{T}

,

S_{n} = S_{n} (λ_{0})

, and

G_{n} = W_{n} S_{n}^{- 1}

. Additionally, to ensure the large-sample properties of QMLE, some basic assumptions are listed as follows:

Assumption 1.

In

ε_{n}

,

e_{i}

,

i = 1, \dots, n

are independently and identically distributed, with mean

E (e_{i}) = 0

and variance

Var (e_{i}) = σ^{2}

. For

γ > 0

, the moment

E ({|e_{i}|}^{4 + γ})

exists.

Assumption 2.

For all

i, j

, the elements

w_{n, i j}

of

W_{n}

are at most of the order of

h_{n}^{- 1}

, denoted as

O (1 / h_{n})

, where the rate sequence

h_{n}

can be bounded or divergent. As a normalization,

w_{n, i i} = 0

for all i.

Assumption 3.

As

n \to \infty

,

n^{- 1} h_{n} \to 0

.

Assumption 4.

The matrix

S_{n}

is non-singular.

Assumption 5.

The matrix sequences

W_{n}

and

S_{n}^{- 1}

are uniformly bounded in terms of row and column sums.

Assumption 6.

For all n, elements

X_{n}

are uniformly bounded constants. When

{lim}_{n \to \infty}

,

n^{- 1} X_{n}^{T} X_{n}

exists and is non-singular.

Assumption 7.

For all λ in the compact parameter space Λ, which is a compact set of the parameter space Λ,

S_{n}^{- 1} (λ)

is uniformly bounded in terms of row and column sums. The true

λ_{0}

is inside Λ.

Assumption 8.

{lim}_{n \to \infty} n^{- 1} {(X_{n}, G_{n} X_{n} β_{0})}^{T} (X_{n}, G_{n} X_{n} β_{0})

exists and is a non-singular matrix.

Assumption 9.

{lim}_{n \to \infty} E (n^{- 1} \frac{\partial^{2} ln L_{n} (θ_{0})}{\partial θ \partial θ^{T}})

exists.

Assumption 10.

For all θ in the open set H containing the true parameter point

θ_{0}

, the third-order derivative

\frac{\partial^{3} ln L_{n} (θ)}{\partial θ_{j} \partial θ_{l} \partial θ_{m}}

exists. Additionally, there exists a function

M_{j l m}

such that for all

θ \in H

,

|n^{- 1} \frac{\partial^{3} ln L_{n} (θ)}{\partial θ_{j} \partial θ_{l} \partial θ_{m}}| \leq M_{j l m}

, where

E (M_{j l m}) < \infty

for all

j, l, m

.

Assumptions 1–9 are similar to those provided by Lee [27], which are sufficient conditions for the correctness of the global identification and the consistency and asymptotic normality of QMLE for model (1). Assumption 1 is needed to apply the central limit theorem by Kelejian and Prucha [28]. Assumptions 2 and 3 characterize the weight matrix for sample size n. If

h_{n}

is a bounded sequence, then Assumptions 2 and 3 are satisfied. In Case’s model, Assumptions 2 and 3 still hold, although

h_{n}

may diverge (see Case [29]). Assumption 4 is used to ensure the existence of the means and variances of the independent variables. Assumption 5 implies that when n tends to infinity, the variance of

Y_{n}

is bounded (refer to Kelejian, Prucha [28], and Lee [27]). Assumption 6 excludes multicollinearity among the regressors in

X_{n}

. For convenience, we assume the regressors are uniformly bounded. If not, they can be replaced with random regressors under certain finite moment conditions (see Lee [27]). Assumption 7 is meaningful for handling the nonlinearity of the log-likelihood function

ln |S_{n} (λ)|

. Assumptions 8 and 9 are applicable to the asymptotic normality of QMLE. Assumption 10 is similar to condition (C) in Fan and Li [30] and plays an important role in the Taylor expansion of related functions.

2.2. Spatial Autoregressive Model with Missing Data and Measurement Errors

When a subset of covariates has missing values, we consider model (3). Let

X_{i}^{(o)} \in R^{s}

be the vector of covariates that are always observed and

X_{i}^{(m)} \in R^{k}

be the vector of covariates that may contain some missing components. For each observation, the indicator

Q_{i}

denotes whether

X_{i}^{(m)}

is fully observed, i.e., if

X_{i}^{(m)}

is fully observed, then

Q_{i} = 1

; otherwise,

Q_{i} = 0

. Let

v_{i} = {(X_{i}^{(o)})}^{T} \in R^{s}

, and as mentioned earlier, the data in this study are randomly missing but not endogenous. This means that the probability of missing observations may depend on variables that are always observed rather than on variables that may have missing data. Formally:

π_{i_{0}} = Pr (Q_{i} = 1 ∣ X_{i}) = Pr (Q_{i} = 1 ∣ X_{i}^{(o)}) = Pr (Q_{i} = 1 ∣ v_{i})

(3)

In the covariate data, we assume that the dimension of the covariates

X_{i}^{(m)} \in R^{k}

with missing data and the dimension of the covariates involved in the missing model

X_{i}^{(o)} \in R^{s}

are fixed.

For the issue of missing data in covariates, the IPW method can be used to address this. The idea of IPW is to offset potential biases due to missing data by assigning different weights to complete observations. This approach helps mitigate biases in results estimation due to missing covariate data, thereby achieving more accurate statistical inferences. The probability

π_{i 0}

is usually parameterized and modeled through logistic or probit regression. Here, we assume it is generated by the following logistic regression model:

{\tilde{π}}_{i} = \frac{exp (ξ_{0} + v_{i}^{T} ξ_{1})}{1 + exp (ξ_{0} + v_{i}^{T} ξ_{1})}

(4)

For simplicity,

π_{i 0}

denotes the true probability of observation i having complete data, and

{\tilde{π}}_{i}

is the probability calculated based on the logistic function. Let

Q = diag (\frac{Q_{1}}{{\tilde{π}}_{1}}, \frac{Q_{2}}{{\tilde{π}}_{2}}, \dots, \frac{Q_{n}}{{\tilde{π}}_{n}})

. The weighted spatial autoregressive log-likelihood function is defined as:

\begin{matrix} ln {\tilde{L}}_{n} (θ) = - \frac{n}{2} ln (2 π) - \frac{n}{2} ln σ^{2} + ln |S_{n} (λ)| - \frac{1}{2 σ^{2}} [{(S_{n} (λ) Y_{n} - X_{n} β)}^{T} Q (S_{n} (λ) Y_{n} - X_{n} β)] \end{matrix}

(5)

When

X_{n}

has measurement errors, consider the classic additive measurement error model:

Z_{n} = X_{n} + U_{n}

(6)

In Equation

(6)

,

X_{n}

cannot be directly observed, but

Z_{n}

can be, where

Z_{n}

is an

n \times k

matrix,

U_{n}

is the error term with

U_{n} \sim N (0, Σ)

, and

ε_{n}

is independent of

U_{n}

. Additionally, we assume that

U_{n}

is independent of the covariate

X_{n}

.

Li [4] proposed a corrected likelihood estimation to solve the spatial autoregressive model with measurement errors. In this case, IPW and corrected likelihood estimation are applied to spatial autoregressive models with measurement errors and missing data.

The corrected likelihood method was initially proposed by Nakamura [31] to address the impact of measurement errors on parameter estimation, enabling parameter estimation without additional assumptions. The specific method is as follows:

For the classic linear regression model with measurement errors:

\{\begin{matrix} Y & = X β + ε, \\ Z & = X + U \end{matrix}

(7)

where Y is the dependent variable, X is the unobservable value of the explanatory variable,

β

is the parameter vector,

ε

is the residual vector, Z is the observable vector, and U is the measurement error.

Let

L (β, X, Y)

,

U (β, X, Y)

,

J (β, X, Y)

, and

I (β, X, Y)

denote, respectively, the log-likelihood function, score function, observed information, and Fisher information of model (7) given Z and Y, with

E^{+}

being the mathematical expectation regarding the respective variable Y. Without measurement errors in variables, the following equations hold:

E^{+} U (β, X, Y) = 0

(8)

E^{+} J (β, X, Y) = I (β, X, Y)

(9)

With measurement errors, if Z values are simply substituted for X, Equations (8) and (9) do not always hold.

Thus, Nakamura’s corrected likelihood method is used to handle this model, setting the corrected log-likelihood function

L^{*} (β, Z, Y)

to satisfy:

E^{*} {L^{*} (β, Z, Y)} = L (β, X, Y)

(10)

where

E^{*}

is the conditional expectation of Z given

Y, X

. Let

U^{*} (β, Z, Y) = \frac{\partial L^{*} (β, Z, Y)}{\partial β}

and

J^{*} (β, Z, Y) = - \frac{\partial U^{*} (β, Z, Y)}{\partial β}

represent the corrected score function and corrected observed information, respectively. If

E^{*}

and

\partial β

are interchangeable, then:

E^{*} {U^{*} (β, Z, Y)} = U (β, X, Y)

(11)

E^{*} {J^{*} (β, Z, Y)} = I (β, X, Y)

(12)

If the estimation of

β

satisfies

U^{*} (β^{*}, Z, Y) = 0

, then

β^{*}

is called a corrected likelihood estimate. Let

E = E^{+} E^{*}

. Then,

E {U^{*} (β, Z, Y)} = E^{+} E^{*} {U^{*} (β, Z, Y)} = E^{+} {U (β, X, Y)} = 0

(13)

This shows that the corrected score function is unbiased.

Property 1.

Let F be an open convex subset of the parameter space containing β. If

L^{*} (β, Z, Y)

and

L (β, X, Y)

are differentiable on F,

\sum k^{- 2} var {L^{*} (β, Z_{k}, Y_{k})} < \infty

, β is identifiable, Y is mutually identifiable, and then

U^{*} (β, X, Y) =

0 has a root that is consistent with probability one as

n \to \infty

converges in probability to 0.

Property 2.

If

β^{*}

and

β_{X}

are consistent roots of

U^{*} (β, X, Y) = 0

and

U (β, Z, Y) = 0

, and if

U^{*} (β, X, Y)

and

U (β, Z, Y)

meet some regularity conditions, then under the given conditions of X and Y,

(β^{*} - β_{X})

follows an asymptotic normal distribution with mean 0 and variance

I^{+} {(β, X)}^{- 1} E^{+} [var {U^{*} (β, Z, Y)}] I^{+} {(β, X)}^{- 1}

, as

(n \to \infty)

.

For model (7), the log-likelihood function is

L (β, X, Y) = - \frac{1}{2} ln (2 π) - \frac{1}{2} ln σ^{2} - \frac{1}{2 σ^{2}} {(Y_{n} - X_{n} β)}^{T} (Y_{n} - X_{n} β)

(14)

Define

L^{*} (β, Z, Y) = - \frac{1}{2} ln (2 π) - \frac{1}{2} ln σ^{2} - \frac{1}{2 σ^{2}} [{(Y_{n} - X_{n} β)}^{T} (Y_{n} - X_{n} β) - β^{T} Σ β]

(15)

Then,

E^{*} {L^{*} (β, Z, Y)} = L (β, X, Y)

(16)

Thus,

L^{*} (β, Z, Y)

is the corrected likelihood function for model (7).

By applying Nakamura’s corrected likelihood method, the corrected likelihood function for the spatial autoregressive model with missing data and measurement errors can be obtained (here, denote

ln {\hat{L}}_{n} (θ) = L^{*} (λ, β, Z, Y)

):

\begin{matrix} ln {\hat{L}}_{n} (θ) = - \frac{n}{2} ln (2 π) - \frac{n}{2} ln σ^{2} + ln | S_{n} (λ) | - \frac{1}{2 σ^{2}} {[{(S_{n} (λ) Y_{n} - Z_{n} β)}^{T} Q (S_{n} (λ) Y_{n} - Z_{n} β)] - β^{T} Σ β} \end{matrix}

(17)

Next, we solve for the parameters. Differentiating both sides of the previous equation with respect to

β

and

σ^{2}

, we obtain the following set of equations:

\{\begin{matrix} \frac{\partial ln {\hat{L}}_{n} (θ)}{\partial β} & = 0 \\ \frac{\partial ln {\hat{L}}_{n} (θ)}{\partial σ^{2}} & = 0 \end{matrix}

(18)

That is,

\{\begin{matrix} - \frac{1}{2 σ^{2}} [- 2 Z_{n}^{T} Q (S_{n} (λ) Y_{n} - Z_{n} β) - 2 Σ β] & = 0 \\ - \frac{n}{2 σ^{2}} + \frac{1}{2 σ^{4}} [{(S_{n} (λ) Y_{n} - X_{n} β)}^{T} Q (S_{n} (λ) Y_{n} - X_{n} β) - β^{T} Σ β] & = 0 \end{matrix}

(19)

Given

λ

, the corrected likelihood estimate for

β

is:

\hat{β} (λ) = {(Z_{n}^{T} Q Z_{n} - Σ)}^{- 1} Z_{n}^{T} Q S_{n} (λ) Y_{n}

(20)

Similarly, the corrected likelihood estimate for

σ^{2}

is:

{\hat{σ}}^{2} (λ) = \frac{1}{n} {[{(S_{n} (λ) Y_{n} - Z_{n} β)}^{T} Q (S_{n} (λ) Y_{n} - Z_{n} β)] - β^{T} Σ β} = \frac{1}{n} Y_{n}^{T} S_{n}^{T} (λ) M_{n} S_{n} (λ) Y_{n}

(21)

where

M_{n} = P_{z} - Q Z_{n} {({(Z_{n}^{T} Q Z_{n} - Σ)}^{- 1})}^{T} Σ {(Z_{n}^{T} Q Z_{n} - Σ)}^{- 1} Z_{n}^{T} Q

,

P_{z} = Q - Q Z_{n} (Z_{n}^{T} Q Z_{n}

{- Σ)}^{- 1} Z_{n}^{T} Q

. Substituting Equations (20) and (21) into Equation (17), the concentrated likelihood function for

λ

is:

ln {\hat{L}}_{n} (λ) = - \frac{n}{2} (ln (2 π) + 1) - \frac{n}{2} ln {\hat{σ}}^{2} (λ) + ln | S_{n} (λ) |

(22)

The corrected likelihood estimate of

λ

is then found by maximizing Equation (22).

Theorem 1 (Oracle Properties).

Suppose the regularity conditions in Assumptions A1–A5 in Appendix A hold. It is apparent that Assumptions A1–A5 are largely consistent with the earlier Assumptions 1–8. If Assumptions A1–A5 hold, then

{\hat{θ}}_{n}

is globally identified, a consistent estimator, and has an asymptotic distribution.

1.: $θ_{0}$ is globally identifiable, and ${\hat{θ}}_{n}$ is a consistent estimate of $θ_{0}$ .
2.: $\sqrt{n} (\hat{θ} - θ_{0}) \overset{D}{\to} N (0, Σ_{θ}^{- 1})$ , where $Σ_{θ}^{- 1} = - {lim}_{n \to \infty} E (\frac{1}{n} \frac{\partial^{2} ln {\hat{L}}_{n} (θ)}{\partial θ \partial θ^{T}})$ .

3. Simulation

Through Monte Carlo simulation, the performance and efficiency of the proposed method were compared. We simulated 500 datasets.

The sample sizes n for each dataset were set to 100, 150, 200, and 250, respectively. The threshold m of the weight matrix represents the number of non-zero elements in each row of the matrix. The threshold m for the weight matrix was set to 10, 15, 20, and 25, respectively. The spatial autoregressive coefficients

λ

were set to 0.5 and −0.5. Covariates and random errors were generated as follows: Covariates

X_{n} = (X_{1}, X_{2}, \dots, X_{p})

were generated from a p-dimensional normal distribution with mean 0 and variance 1. In the simulation, we set

β = (1, 0, 5, 0, 3)

. The generation mechanism for

Y_{n}

is

Y_{n} = {(I_{n} - λ W_{n})}^{- 1} (X_{n} β + ε_{i})

(23)

We assumed that the error

ε

in the spatial autoregressive model followed a normal distribution with variances

e^{2}

of

{0.5}^{2}

,

1^{2}

, and

{1.5}^{2}

. In the simulation, we assumed that

X_{2}

and

X_{4}

might have missing values. The missingness model considered is

Logit {P r (R_{i} = 1)} = 0.5 + 0.5 X_{i_{1}} - 1.5 X_{i_{3}} + 0.5 X_{i_{5}}

(24)

The measurement error model was set as

Z = X + U

, where the measurement error U follows a multivariate normal distribution ∑ with mean 0 and variance

1^{2}

.

(I): Both measurement errors and missing data are considered;
(II): Measurement errors are ignored;
(III): Missing data are ignored (By dropping observations with missing covariates.);
(IV): Both measurement errors and missing data are ignored.

Table 1, Table 2 and Table 3 present the median square errors (MeSE) of the estimates for

λ

,

β

, and

σ^{2}

as

{(\hat{λ} - λ)}^{2}

,

{(\hat{β} - β)}^{2}

, and

{({\hat{σ}}^{2} - σ^{2})}^{2}

, respectively. As observed from Table 1, Table 2 and Table 3, the estimates provided by the proposed method are overall significantly superior to those obtained by directly ignoring missing data or both missing data and measurement errors, with smaller squared errors. Compared to ignoring missing data, the proposed method’s correction effect on estimating

β

and

λ

is not very pronounced, but the correction for

σ^{2}

is relatively significant. Overall, the proposed method more significantly corrects the biases caused by missing data. However, when the values of n and e are relatively low, the correction effect for measurement errors is not particularly satisfactory. Moreover, ignoring both missing data and measurement errors leads to larger squared errors due to the severe loss of information. In summary, correcting for missing data and measurement errors is both necessary and effective.

4. Real Data Example

In this section, we present a real example to illustrate the performance of the parameter estimation procedure for spatial autoregressive models with missing data and measurement errors proposed in this paper.

We consider the famous 1970 Boston Housing Dataset, which contains information on 506 different houses in different locations in the Boston Standard Metropolitan Statistical Area. This dataset has been used by many authors and can be found in the spdep library in R. It was first analyzed by Harrison and Rubinfeld [32]. Sun et al. [33] and Du et al. [34] explored the spatial dependence of these data through partially linear varying coefficient autoregressive and partially linear additive autoregressive models, respectively. Liu [35] used the Moran I statistic to test the spatial dependence of the dataset. Therefore, the data serve our analysis purposes well. Table 4 provides specific descriptions of the variables in the dataset.

In the actual data analysis, following the practice of Harrison and Rubinfeld [32], we consider the logarithm of the median value of owner-occupied homes (MEDV) in census tracts as the dependent variable and the other variables as the independent variables. Among these, the weighted distance to five Boston employment centers (DIS), the index of accessibility to radial highways (RAD), the percentage of the population classified as lower income (LSTAT), and the average number of rooms per dwelling (RM) are log-transformed, while the nitrogen oxide concentration (NOX) is squared. For ease of analysis, all variables are mean-centered to have a sample mean of zero.

Spatial weight matrices generally consist of two types of information. One is determined using latitude and longitude coordinates, and the other is determined using the relative locations of regions (see Liu [35]).

Our approach is similar to that of Pace and Gilley [36]. We first define an initial matrix W and the weight between two houses i and j as:

W_{i j} = max (1 - \frac{d_{i j}}{d_{0}}, 0)

(25)

where

d_{i j}

is the Euclidean distance calculated based on the latitude and longitude coordinates of the two houses. We set the threshold distance

d_{0}

to 0.05. Additionally, in practice, the spatial weight matrix is row-normalized.

For the above dataset, we consider the following model:

Y = λ W Y + X_{1} β_{1} + X_{2} β_{2} + X_{3} β_{3} + X_{4} β_{4} + X_{5} β_{5} + ε_{i}

(26)

where the response variable Y is the median value of house prices (MEDV),

X_{1}

is the weighted distance (DIS),

X_{2}

is the index of accessibility to radial highways (RAD),

X_{3}

is the percentage of the population classified as lower income (LSTAT),

X_{4}

is the nitrogen oxide concentration (NOX), and

X_{5}

is the average number of rooms per dwelling (RM).

In the simulation, we assume that

X_{2}

and

X_{4}

may have missing values. The missingness model considered is:

Logit {P r (R_{i}) = 1} = 1 + 2 X_{i 1} - 3 X_{i 3} + X_{i 5}

(27)

The measurement error model is set as

Z = X + U

, where the measurement error U follows a multivariate normal distribution ∑ with mean 0 and variance equal to the sample variance

σ_{i}^{2}

of each variable.

Table 5 displays the median squared errors (MeSEs) of the estimators

λ

,

β

, and

σ^{2}

, specifically

{||\hat{λ} - λ||}^{2}

,

{||{\hat{β}}_{i} - β_{i}||}^{2}

, and

{||{\hat{σ}}^{2} - σ^{2}||}^{2}

. It can be observed that the performance of the proposed method for the Boston Housing Dataset is largely consistent with the results of the numerical simulations. Although the correction effect for the partial

β

is not particularly pronounced, the overall correction effect is still relatively ideal, and in particular, the corrections for

σ^{2}

and

λ

are very effective.

5. Conclusions

We have developed a robust method for simultaneously handling missing data and measurement errors in covariates of spatial autoregressive models. Clearly, traditional statistical methods may lead to biased estimates when covariates have missing data and measurement errors. Our method uses IPW and corrected likelihood methods to address this issue. We have studied the theoretical properties of the proposed method and investigated its performance in parameter estimation through Monte Carlo simulations, comparing it to scenarios where measurement errors, missing data, or both are ignored. The simulation studies demonstrate that our method outperforms traditional direct extensions for parameter estimation in spatial autoregressive models.

Author Contributions

Software, Validation, Investigation, Resources, Data Curation, Writing—Review & Editing, and Funding Acquisition were handled by T.L. He was mainly responsible for implementing the concepts, programming and running simulations, testing the code, collecting and curating data, as well as editing the manuscript and securing funding. Conceptualization, Methodology, Formal Analysis, Writing—Original Draft Preparation, Writing—Review & Editing were performed by Z.Z. He was primarily responsible for the initial concept and design, theoretical derivation of formulas, and writing and editing the theoretical parts of the manuscript. Supervision and Project Administration were carried out by the corresponding author Y.S., who was in charge of overseeing the entire project and managing its administration. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Fundamental Research Funds for the Central Universities (No. 23CX03012A) and the National Key Research and Development Program (2021YFA1000102) of China.

Data Availability Statement

All datasets used in this study, including those analyzed and those generated during the research process, are publicly available and can be accessed through the following link: https://drive.google.com/drive/folders/1egSNneFuDZe-iUbn69OPkWlDAxEzpP6q (accessed on 25 April 2024).

Conflicts of Interest

The authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Appendix A

Using a combination of IPW and mean interpolation, consider an ideal “pseudo-complete dataset”

{\tilde{Z}}_{n}

for covariates, satisfying

{(S_{n} (λ) Y_{n} - Z_{n} β)}^{T} Q (S_{n} (λ) Y_{n} - Z_{n} β) = {(S_{n} (λ) Y_{n} - {\tilde{Z}}_{n} β)}^{T} (S_{n} (λ) Y_{n} - {\tilde{Z}}_{n} β)

(A1)

From Equations (1), (4) and (A1), we obtain

Y_{n} = λ W_{n} Y_{n} + {\tilde{Z}}_{n} β + ε_{n} - U_{n} β ≜ λ W_{n} Y_{n} + {\tilde{Z}}_{n} β + ε_{n}^{*}

(A2)

where

ε_{n}^{*} = ε_{n} - U_{n} β

.

Thus, Equation (16) can be simplified as:

Y_{n} = S_{n}^{- 1} ({\tilde{Z}}_{n} β_{0} + ε_{n}^{*})

(A3)

where

S_{n} = S_{n} (λ_{0})

.

Let

G_{n} = W_{n} S_{n}^{- 1}

. Then,

I_{n} + λ_{0} G_{n} = S_{n}^{- 1}

, and Equation (A1) can be expressed as:

Y_{n} = λ_{0} G_{n} {\tilde{Z}}_{n} β_{0} + {\tilde{Z}}_{n} β_{0} + S_{n}^{- 1} ε_{n}^{*}

(A4)

To establish the asymptotic properties of the estimator, the following regularity conditions are required:

Assumption A1.

In

ε_{n} = {e_{i}}

and

U_{n} = {u_{i}}

, elements

e_{i}

,

u_{i}

,

i = 1, \dots, n

are independently and identically distributed with mean

E (e_{i}) = E (u_{i}) = 0

, variance

Var (e_{i}) = σ^{2}

,

Var (u_{i}) = Σ

. For

γ > 0

, moments

E ({| e_{i} |^{4 + γ}})

and

E ({| u_{i} |^{4 + γ}})

exist.

Assumption A2.

For all

i, j

, elements

w_{n, i j}

of

W_{n}

are at most of the order of

h_{n}^{- 1}

, denoted as

O (1 / h_{n})

, where the rate sequence

{h_{n}}

can be bounded or divergent. For normalization, all i have

w_{n, i i} = 0

.

Assumption A3.

The matrix sequences

{W_{n}}

and

{S_{n}^{- 1}}

are uniformly bounded in terms of row and column sums.

Assumption A4.

For all n, elements

{\tilde{Z}}_{n}

are uniformly bounded constants. When

{lim}_{n \to \infty}

,

n^{- 1} {\tilde{Z}}_{n}^{T} {\tilde{Z}}_{n}

exists and is non-singular.

Assumption A5.

For all λ in the compact parameter space Λ,

{S_{n}^{- 1} (λ)}

are uniformly bounded in terms of row and column sums. The true

λ_{0}

is inside Λ.

Assumption A6.

{lim}_{n \to \infty} n^{- 1} {({\tilde{Z}}_{n} G_{n} {\tilde{Z}}_{n} β_{0})}^{T} ({\tilde{Z}}_{n} G_{n} {\tilde{Z}}_{n} β_{0})

exists and is a non-singular matrix.

Theorem A1.

Under Assumptions 1–8,

θ_{0}

is globally identifiable, and

{\hat{θ}}_{0}

is a consistent estimator of

θ_{0}

.

Define

P_{n} (λ) = {max}_{β, σ^{2}} E (ln {\hat{L}}_{n} (θ))

. In this maximization problem, the optimal solutions for

β

and

σ^{2}

are

\begin{matrix} β^{*} (λ) & = {({\tilde{Z}}_{n}^{T} {\tilde{Z}}_{n} - Σ)}^{- 1} {\tilde{Z}}_{n}^{T} S_{n} (λ) Y_{n} \\ σ^{* 2} (λ) & = \frac{1}{n} E [{(S_{n} (λ) Y_{n} - {\tilde{Z}}_{n} β)}^{T} (S_{n} (λ) Y_{n} - {\tilde{Z}}_{n} β) - β^{T} Σ β] \\ = \frac{1}{n} [{(λ_{0} - λ)}^{2} {(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} M_{n} (G_{n} {\tilde{Z}}_{n} β_{0}) + σ_{0}^{2} tr [{(S_{n}^{T})}^{- 1} S_{n}^{T} (λ) S_{n} (λ) S_{n}^{- 1}]] \end{matrix}

Then,

P_{n} (λ) = - \frac{n}{2} (l n (2 π) + 1) - \frac{n}{2} l n {\hat{σ}}^{* 2} (λ) + l n | S_{n} (λ) |

The value of

λ_{0}

can be obtained by maximizing

{\frac{P_{n} (λ)}{n}}

.

To prove that

{\hat{θ}}_{0}

is a consistent estimator of

θ_{0}

, it suffices to show the following:

1.: $\frac{\hat{ln L} (θ) - P_{n} (λ)}{n}$ converges uniformly to 0 in $Λ$ ;
2.: For $\forall ω > 0, {lim}_{n \to \infty} {max}_{λ \in {\bar{N}}_{ω} (λ_{0})} \frac{P_{n} (λ) - P_{n} (λ_{0})}{n}$ is a complement set of a neighborhood with a diameter.

(a): Since

$\frac{ln \hat{L} (λ) - P_{n} (λ)}{n} = - \frac{1}{2} (ln {\hat{σ}}^{2} (λ) - ln σ^{* 2} (λ))$

$σ^{* 2} (λ)$ and ${\hat{σ}}^{2} (λ)$ can be written as

$\begin{matrix} σ^{* 2} (λ) & = \frac{{(λ_{0} - λ)}^{2} {(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} M_{n} (G_{n} {\tilde{Z}}_{n} β_{0})}{n} + σ^{2} (λ), \\ {\hat{σ}}^{2} (λ) & = \frac{1}{n} Y_{n}^{T} S_{n}^{T} (λ) M_{n} S_{n} (λ) Y_{n} \\ = \frac{1}{n} {(λ_{0} - λ)}^{2} {(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} M_{n} (G_{n} {\tilde{Z}}_{n} β_{0}) + 2 (λ_{0} - λ) H_{1 n} (λ) + H_{2 n} (λ) . \end{matrix}$

where

$\begin{matrix} σ^{2} (λ) & = \frac{σ_{0}^{2}}{n} tr [{(S_{n}^{T})}^{- 1} S_{n}^{T} (λ) S_{n} (λ) S_{n}^{- 1}], \\ H_{1 n} (λ) & = \frac{1}{n} {(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} M_{n} S_{n} (λ) S_{n}^{- 1} ε_{n}^{*}, \\ H_{2 n} (λ) & = \frac{1}{n} ε_{n}^{* T} {(S_{n}^{- 1})}^{T} S_{n}^{T} (λ) M_{n} S_{n} (λ) S_{n}^{- 1} ε_{n}^{*} . \end{matrix}$

It can be shown that on $Λ$ ,

$\begin{matrix} H_{1 n} (λ) & = O_{p} (1), \\ H_{2 n} (λ) - σ^{2} (λ) & = O_{p} (1) . \end{matrix}$

Therefore,

${\hat{σ}}^{2} (λ) - σ^{* 2} (λ) = O_{p} (1) .$

Hence,

$sup_{λ \in Λ} |\frac{ln {\hat{L}}_{n} (θ) - P_{n} (λ)}{n}| = O_{p} (1) .$
(b): $\begin{matrix} \frac{1}{n} [P_{n} (λ) - P_{n} (λ_{0})] & = \frac{1}{n} [P_{p, n} (λ) - P_{p, n} (λ_{0})] = \frac{1}{2} [ln σ^{* 2} (λ) - ln σ^{2} (λ)], \\ where P_{p, n} (λ) & = - \frac{n}{2} (ln (2 π) + 1) - \frac{n}{2} ln σ^{2} (λ) + ln | S_{n} (λ) | . \end{matrix}$

{\frac{P_{n} (λ)}{n}}

is uniformly continuous on

Λ

. By Jensen’s inequality, for all

λ

,

\frac{P_{p, n} (λ) - P_{p, n} (λ_{0})}{n} \leq 0

and, therefore,

σ^{* 2} (λ) \geq σ^{2} (λ)

If the assumption does not hold, then there exists a sequence

{λ_{n}} \in Λ

,

{lim}_{n \to \infty} λ_{n} = λ_{+} \neq λ_{0}

such that

lim_{n \to \infty} \frac{P_{n} (λ_{n}) - P_{n} (λ_{0})}{n} = 0

This can only occur if

lim_{n \to \infty} (σ^{* 2} (λ_{+}) - σ^{2} (λ_{+})) = 0

and

lim_{n \to \infty} \frac{P_{p, n} (λ_{+}) - P_{p, n} (λ_{0})}{n} = 0

simultaneously hold, but the latter equation contradicts

lim_{n \to \infty} \frac{{(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} M_{n} (G_{n} {\tilde{Z}}_{n} β_{0})}{n} \neq 0

Hence, consistency is proven.

Theorem A2.

The asymptotic distribution of CMLE

{\hat{θ}}_{n}

can be derived from the Taylor expansion of

\frac{\partial ln \hat{L_{n}} ({\hat{θ}}_{n})}{\partial θ} = 0

at

θ_{n}

, where the first-order derivative of the log-likelihood function at

θ_{0}

is

\begin{matrix} \frac{1}{\sqrt{n}} \frac{\partial ln {\hat{L}}_{n} (θ_{0})}{\partial β} & = \frac{1}{\sqrt{n} σ_{0}^{2}} ({\tilde{Z}}_{n}^{T} Q ε_{n}^{*} + n Σ β_{0}) \\ \frac{1}{\sqrt{n}} \frac{\partial ln {\hat{L}}_{n} (θ_{0})}{\partial σ^{2}} & = \frac{1}{2 \sqrt{n} σ_{0}^{4}} (Q ε_{n}^{* T} ε_{n}^{*} - n β_{0}^{T} Σ β_{0} - n σ_{0}^{2}) \\ \frac{1}{\sqrt{n}} \frac{\partial ln {\hat{L}}_{n} (θ_{0})}{\partial λ} & = \frac{1}{\sqrt{n} σ_{0}^{2}} {(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} ε_{n}^{*} + \frac{1}{\sqrt{n} σ_{0}^{2}} (ε_{n}^{* T} G_{n} ε_{n}^{*} - σ_{0}^{2} tr (G_{n})) \end{matrix}

The asymptotic distribution of these expressions can be obtained through the central limit theorem. If

{h_{n}}

is a bounded sequence, Kelejian and Prucha’s central limit theorem can be used. If

{h_{n}}

is unbounded, i.e.,

{lim}_{n \to \infty} h_{n} = \infty

, then under Assumption A5,

\frac{1}{\sqrt{n} σ_{0}^{2}} {(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} ε_{n}^{*}

will significantly influence

\frac{1}{\sqrt{n}} \frac{\partial ln \hat{L}}{(} θ_{0}) \partial λ

. This is because

\begin{matrix} var (\frac{1}{\sqrt{n}} ε_{n}^{* T} G_{n} ε_{n}^{*}) & = O (\frac{1}{h_{n}}) \\ \frac{1}{\sqrt{n}} (ε_{n}^{* T} G_{n} ε_{n}^{*} - σ_{0}^{2} tr (G_{n})) & = O_{p} (1) \end{matrix}

However,

\frac{1}{\sqrt{n}} {(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} ε_{n}^{*} = O_{p} (1)

In this case, Kolmogorov’s central limit theorem can be used.

\frac{1}{\sqrt{n}} \frac{\partial ln \hat{L_{n}} (θ_{0})}{\partial θ} ’ s variance matrix is E (\frac{1}{\sqrt{n}} \frac{\partial ln \hat{L} (θ_{0})}{\partial θ} \frac{1}{\sqrt{n}} \frac{\partial ln \hat{L} (θ_{0})}{\partial θ}) = - E (\frac{1}{n} \frac{\partial^{2} ln \hat{L} (θ_{0})}{\partial θ \partial θ^{T}})

where

- E (\frac{1}{n} \frac{\partial^{2} ln \hat{L} (θ_{0})}{\partial θ \partial θ^{T}}) = (\begin{matrix} \frac{1}{n σ_{0}^{2}} ({\tilde{Z}}_{n}^{T} {\tilde{Z}}_{n} - 2 n Σ) & \frac{1}{n σ_{0}^{2}} {\tilde{Z}}_{n}^{T} (G_{n} {\tilde{Z}}_{n} β_{0}) & \frac{1}{σ_{0}^{4}} Σ β_{0} \\ \frac{1}{n σ_{0}^{2}} {(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} {\tilde{Z}}_{n} & \frac{1}{n σ_{0}^{2}} {(G_{n} {\tilde{Z}}_{n} β_{0})}^{T} (G_{n} {\tilde{Z}}_{n} β_{0}) + \frac{1}{n} tr (G_{n}^{T} G_{n}) & \frac{1}{n σ_{0}^{2}} (G_{n}) \\ \frac{1}{σ_{0}^{4}} Σ β_{0} & \frac{1}{n σ_{0}^{2}} tr (G_{n}) & \frac{1}{2 σ_{0}^{4}} \end{matrix})

In

\frac{\partial^{2} ln \hat{L} (θ_{0})}{\partial θ \partial θ^{T}}

,

λ

,

β

, and

\frac{1}{σ_{0}^{2}}

appear either as linear or quadratic moments, and

\frac{\partial^{2} ln \hat{L} (θ)}{\partial λ^{2}} = - tr ({[W_{n}^{T} S_{n}^{- 1} (λ)]}^{2}) - \frac{Y_{n}^{T} W_{n}^{T} W_{n} Y_{n}}{σ^{2}}

Let

G_{n} (λ) = W_{n} S_{n}^{- 1} (λ)

, and by the mean value theorem,

tr (G_{n}^{2} ({\tilde{λ}}_{n})) = tr (G_{n}^{2}) + 2 tr (G_{n}^{3} (\tilde{λ})) (\tilde{λ} - λ_{0})

Assumption A3 ensures that within a neighborhood of

λ_{0}

,

G_{n} ({\tilde{λ}}_{n})

’s row and column sums are uniformly bounded. Since

tr (G_{n}^{3} ({\tilde{λ}}_{n})) = O (\frac{n}{h_{n}})

,

Y_{n}^{T} W_{n}^{T} W_{n} Y_{n} = O_{p} (n h_{n})

and, therefore,

\frac{1}{n} [\frac{\partial^{2} ln \hat{L} ({\tilde{θ}}_{n})}{\partial λ^{2}} - \frac{\partial^{2} ln \hat{L} (θ_{0})}{\partial λ^{2}}] = - 2 \frac{tr (G_{n}^{3} ({\tilde{λ}}_{n}))}{n} ({\tilde{λ}}_{n} - λ_{0}) + [\frac{1}{σ_{0}^{2}} - \frac{1}{{\tilde{σ}}^{2}}] \frac{Y_{n}^{T} W_{n}^{T} W_{n} Y_{n}}{n} = O_{p} (1)

The second-order derivatives of other terms can be similarly deduced. Since both

\frac{{\tilde{Z}}_{n}^{T} G_{n} ε_{n}^{*}}{n}

and

\frac{1}{n} ({ε_{n}^{*}}^{T} G_{n} ε_{n}^{*} - (σ_{0}^{2} + β_{0}^{T} Σ β_{0}) tr (G_{n}))

converge to

O_{p} (1)

, it follows that

\frac{1}{n} [\frac{\partial^{2} ln \hat{L} ({\tilde{θ}}_{n})}{\partial θ \partial θ^{T}} - \frac{\partial^{2} ln \hat{L} (θ_{0})}{\partial θ \partial θ^{T}}] \overset{P}{\to} 0

Since

\frac{\partial ln L_{n} (θ_{0})}{\partial θ}

is either a linear or quadratic function of

ε_{n}^{*}

, and the higher-order moments of

ε_{n}^{*}

exist according to Assumption A1, the central limit theorem by Kelejian and Prucha implies

\frac{1}{\sqrt{n}} \frac{\partial ln \hat{L} (θ_{0})}{\partial θ} \overset{D}{\to} N (0, Σ_{θ})

Assumption A5 ensures that

Σ_{θ}

is non-singular, and the asymptotic distribution of

{\hat{θ}}_{n}

can be written as

\sqrt{n} ({\hat{θ}}_{n} - θ_{0}) = - {(\frac{1}{n} \frac{\partial^{2} l n {\hat{L}}_{n} ({\tilde{θ}}_{n})}{\partial θ \partial θ^{T}})}^{- 1} \frac{1}{\sqrt{n}} \frac{\partial l n {\hat{L}}_{n} (θ_{0})}{\partial θ}

where

{\hat{θ}}_{n}

converges in probability to

θ_{0}

.

References

Luo, G. Statistical Inference in Spatial Autoregressive Models with Complex Data; Beijing University of Technology: Beijing, China, 2023. [Google Scholar] [CrossRef]
Chen, Q. Advanced Econometrics and Stata Application; Higher Education Press: Beijing, China, 2010. [Google Scholar]
Bai, Y.; Tian, M.; Tang, M.-L.; Lee, W.-Y. Variable selection for ultra-high dimensional quantile regression with missing data and measurement error. Stat. Methods Med. Res. 2021, 30, 129–150. [Google Scholar] [CrossRef] [PubMed]
Li, W. Parameter Estimation of Spatial Autoregressive Models with Measurement Error; Yunnan University: Kunming, China, 2020. [Google Scholar]
Anselin, L. SpaceStat Tutorial: A Workbook for Using SpaceStat in the Analysis of Spatial Data; West Virginia University: Urbana, IL, USA, 1992. [Google Scholar]
Tobler, R.W. A Computer Movie Simulating Urban Growth in the Detroit Region. Econ. Geogr. 2016, 46, 234–240. [Google Scholar] [CrossRef]
Anselin, L. Spatial Econometrics: Methods and Models. J. Am. Stat. Assoc. 1990, 85, 160. [Google Scholar] [CrossRef]
Anselin, L. Thirty Years of Spatial Econometrics. Pap. Reg. Sci. 2010, 89, 3–25. [Google Scholar] [CrossRef]
Cox, T.F. Spatial Processes: Models and Applications. J. R. Stat. Soc. Ser. A 1984, 147, 515. [Google Scholar] [CrossRef]
Prucha, K.I.R. A Generalized Moments Estimator for the Autoregressive Parameter in a Spatial Model. Int. Econ. Rev. 2010, 40, 509–533. [Google Scholar] [CrossRef]
Lesage, J.P. Bayesian Estimation of Spatial Autoregressive Models. Int. Reg. Sci. Rev. 1997, 20, 113–129. [Google Scholar] [CrossRef]
Lee, L.F. GMM and 2SLS estimation of mixed regressive, spatial autoregressive models. J. Econ. 2007, 137, 489–514. [Google Scholar] [CrossRef]
Lee, L.F.; Liu, X. Efficient GMM estimation of high order spatial autoregressive models. Econ. Theory 2010, 26, 187–230. [Google Scholar] [CrossRef]
Wei, C.; Guo, S.; Zhai, S. Statistical inference of partially linear varying coefficient spatial autoregressive models. Econ. Model. 2017, 64, 553–559. [Google Scholar] [CrossRef]
Ma, M.; Zhang, X. Spatial Effects of China’s Haze Pollution and the Impact of Economy and Energy Structure. China Ind. Econ. 2014, 19–31. [Google Scholar] [CrossRef]
Wang, Y.F.; Ni, P.F. Economic Growth Spillovers and Regional Spatial Optimization Under the Influence of High-Speed Rail. China Ind. Econ. 2016, 21–36. [Google Scholar] [CrossRef]
Cheng, Y.; Dong, C. Study on the Spatial Effects of Trade Facilitation on China’s Industrial Manufactured Goods Export. Quant. Econ. Tech. Econ. Res. 2021, 38, 98–115. [Google Scholar] [CrossRef]
Shen, T.; Feng, D.; Sun, T. Spatial Econometrics; Peking University Press: Beijing, China, 2010. [Google Scholar]
Yi, L.; Tang, H.; Lin, X. Spatial Linear Mixed Models with Covariate Measurement Errors. Stat. Sin. 2009, 19, 1077. [Google Scholar]
Huque, M.H.; Bondell, H.D.; Ryan, L. On the impact of covariate measurement error on spatial regression modelling. Environmetrics 2015, 25, 560–570. [Google Scholar] [CrossRef]
Zhang, Z.; Zhu, P. Estimation and Testing of Spatial Autoregressive Models with Measurement Error. Stat. Res. 2010, 27, 103–109. [Google Scholar] [CrossRef]
He, X.; Hu, X. Parameter Estimation of Univariate Spatial Autoregressive Measurement Error Models. Sci. China Math. 2020, 50, 613–628. [Google Scholar] [CrossRef]
You, P. Estimation of Semiparametric Spatial Autoregressive Models with Random Missing; Yunnan University: Kunming, China, 2018. [Google Scholar]
Cheng, D. Modeling and Analysis of Residents’ Travel Behavior Based on Multiple Differences; Dalian Jiaotong University: Dalian, China, 2019. [Google Scholar] [CrossRef]
Yang, Z.; Yu, J.; Chen, J. A Missing Data Imputation Method Based on Clustering and Spatial Autoregressive Model. Intelligent Information Technology Application Association. In Proceedings of the 2011 International Conference on Ecological Protection of Lakes-Wetlands-Watershed and Application of 3S Technology (EPLWW3S 2011 V2), Nanchang, China, 25–26 June 2011; pp. 554–557. [Google Scholar]
Wang, W.; Lee, L. Estimation of spatial autoregressive models with randomly missing data in the dependent variable. Econ. J. 2013, 16, 73–102. [Google Scholar] [CrossRef]
Lee, L.F. Asymptotic Distributions of Quasi-Maximum Likelihood Estimators for Spatial Autoregressive Models. Econometrica 2004, 72, 1899–1925. [Google Scholar] [CrossRef]
Kelejian, H.H.; Prucha, I.R. On the Asymptotic Distribution of the Moran I Test Statistic with Applications. J. Econom. 2001. [Google Scholar] [CrossRef]
Case, A.C. Spatial Patterns in Household Demand. Econometrica 1991, 59, 953–965. [Google Scholar] [CrossRef]
Li, F.R. Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties. Publ. Am. Stat. Assoc. 2001, 96, 1348–1360. [Google Scholar] [CrossRef]
Nakamura, T. Corrected Score Function for Errors-in-Variables Models: Methodology and Application to Generalized Linear Models. Biometrika 1990, 77, 127–137. [Google Scholar] [CrossRef]
David, H., Jr. Hedonic housing prices and the demand for clean air. J. Environ. Econ. Manag. 1978. [Google Scholar] [CrossRef]
Sun, Y.; Yan, H.; Zhang, W. A semiparametric spatial dynamic model. Ann. Stats. 2014, 42, 700–727. [Google Scholar] [CrossRef]
Du, J.; Sun, X.; Cao, R. Statistical inference for partially linear additive spatial autoregressive models. Spat. Stat. 2018, 52–67. [Google Scholar] [CrossRef]
Liu, X.; Chen, J.; Cheng, S. A Penalized Quasi-Maximum Likelihood Method for Variable Selection in the Spatial Autoregressive Model. Spat. Stat. 2018, 25, 86–104. [Google Scholar] [CrossRef]
Pace, R.K.; Gilley, O.W. Using the Spatial Configuration of the Data to Improve Estimation. J. Real Estate Financ. Econ. 1997, 14, 333–340. [Google Scholar] [CrossRef]

Table 1. MeSEs for

λ

.

Table 1. MeSEs for

λ

.

		n = 100			n = 150			n = 200			n = 250
e	$λ$	m = 10	m = 15	m = 20	m = 10	m = 15	m = 20	m = 15	m = 20	m = 25	m = 15	m = 20	m = 25
Incorporating measurement errors and missing data
0.5	0.5	1.627E+01	9.324E-01	5.839E+01	4.618E-02	4.189E-02	4.618E-02	3.850E-02	3.492E-02	3.850E-02	2.462E-02	3.809E-02	2.610E-02
0.5	−0.5	8.240E+00	2.289E+00	3.887E+00	4.440E-02	4.288E-02	4.440E-02	3.938E-02	2.922E-02	3.938E-02	2.435E-02	3.501E-02	2.692E-02
1	0.5	2.184E-01	2.354E-01	2.102E-01	1.552E-01	1.353E-01	1.555E-01	1.487E-01	1.248E-01	1.185E-01	1.078E-01	1.304E-01	1.073E-01
1	−0.5	2.081E-01	2.065E-01	2.159E-01	1.480E-01	1.437E-01	1.562E-01	1.524E-01	1.312E-01	1.207E-01	1.138E-01	1.281E-01	1.112E-01
1.5	0.5	4.731E-01	4.785E-01	4.959E-01	3.910E-01	3.290E-01	3.192E-01	3.803E-01	2.695E-01	2.539E-01	2.395E-01	2.561E-01	2.375E-01
1.5	−0.5	4.632E-01	5.174E-01	4.847E-01	3.844E-01	3.290E-01	3.267E-01	3.723E-01	2.694E-01	2.819E-01	2.261E-01	2.668E-01	2.542E-01
Ignoring missing data
0.5	0.5	1.320E+02	1.603E+02	1.368E+03	9.288E+02	2.599E+02	8.833E+01	8.905E+02	8.535E+01	9.025E+01	7.097E+01	6.560E+01	1.963E+02
0.5	−0.5	1.403E+02	1.288E+02	6.390E+02	1.014E+03	2.685E+02	8.620E+01	9.785E+02	1.797E+02	1.074E+02	7.247E+01	6.491E+01	1.774E+02
1	0.5	3.652E+03	6.698E+01	1.661E+03	7.315E+02	2.568E+02	9.291E+01	6.350E+02	1.839E+02	1.098E+02	7.515E+01	7.648E+01	1.835E+02
1	−0.5	3.840E+03	7.060E+01	1.649E+03	1.035E+03	3.026E+02	8.620E+01	5.711E+02	1.797E+02	1.081E+02	7.477E+01	7.326E+01	1.934E+02
1.5	0.5	3.776E+03	6.636E+01	2.115E+03	9.394E+02	2.887E+02	9.215E+01	7.766E+02	1.727E+02	1.091E+02	7.751E+01	7.263E+01	1.949E+02
1.5	−0.5	3.692E+03	6.373E+01	1.862E+03	8.851E+02	2.952E+02	8.631E+01	7.435E+02	1.987E+02	1.167E+02	7.406E+01	6.734E+01	1.884E+02
Ignoring measurement errors
0.5	0.5	5.054E-02	5.442E-02	5.727E-02	3.850E-02	3.523E-02	3.850E-02	3.140E-02	3.492E-02	2.867E-02	2.482E-02	3.501E-02	2.432E-02
0.5	−0.5	5.584E-02	4.883E-02	4.941E-02	3.938E-02	3.660E-02	3.938E-02	2.713E-02	2.922E-02	2.754E-02	2.403E-02	3.501E-02	2.692E-02
1	0.5	2.101E-01	2.141E-01	2.100E-01	1.334E-01	1.487E-01	1.332E-01	1.199E-01	1.248E-01	1.185E-01	1.074E-01	1.314E-01	1.095E-01
1	−0.5	2.021E-01	2.038E-01	2.149E-01	1.428E-01	1.524E-01	1.490E-01	1.064E-01	1.312E-01	1.207E-01	1.109E-01	1.254E-01	1.102E-01
1.5	0.5	4.449E-01	4.604E-01	4.668E-01	3.803E-01	3.180E-01	3.142E-01	2.530E-01	2.695E-01	2.539E-01	2.441E-01	2.509E-01	2.375E-01
1.5	−0.5	4.664E-01	4.983E-01	4.969E-01	3.723E-01	3.233E-01	3.121E-01	2.527E-01	2.694E-01	2.819E-01	2.309E-01	2.637E-01	2.568E-01
Ignoring measurement errors and missing data
0.5	0.5	1.227E+02	1.543E+02	1.312E+03	8.905E+02	2.516E+02	8.535E+01	6.129E+02	1.699E+02	9.582E+01	6.998E+01	6.491E+01	1.921E+02
0.5	−0.5	1.299E+02	1.220E+02	6.024E+02	9.785E+02	2.573E+02	8.620E+01	6.237E+02	1.942E+02	1.035E+02	7.140E+01	6.491E+01	1.746E+02
1	0.5	3.468E+03	6.335E+01	1.557E+03	7.074E+02	2.468E+02	8.876E+01	6.189E+02	1.827E+02	1.078E+02	7.358E+01	7.555E+01	1.799E+02
1	−0.5	3.493E+03	6.724E+01	1.557E+03	1.008E+03	2.891E+02	8.326E+01	5.566E+02	1.772E+02	1.062E+02	7.408E+01	7.166E+01	1.914E+02
1.5	0.5	3.524E+03	6.336E+01	2.019E+03	8.924E+02	2.822E+02	8.807E+01	7.539E+02	1.677E+02	1.078E+02	7.674E+01	7.112E+01	1.918E+02
1.5	−0.5	3.375E+03	6.215E+01	1.734E+03	8.151E+02	2.867E+02	8.371E+01	7.322E+02	1.935E+02	1.143E+02	7.346E+01	6.638E+01	1.847E+02

Table 2. MeSEs for

β

.

Table 2. MeSEs for

β

.

		n = 100			n = 150			n = 200			n = 250
e	$λ$	m = 10	m = 15	m = 20	m = 10	m = 15	m = 20	m = 15	m = 20	m = 25	m = 15	m = 20	m = 25
Incorporating measurement errors and missing data
0.5	0.5	1.801E-02	8.973E-04	6.942E-03	1.063E-07	2.664E-07	1.269E-06	5.714E-08	2.531E-07	6.536E-07	4.296E-07	2.436E-06	2.780E-07
0.5	−0.5	2.281E-02	2.045E-03	1.019E-03	1.119E-07	3.246E-07	1.563E-06	5.643E-08	1.796E-07	5.257E-07	4.646E-07	2.419E-06	2.300E-07
1	0.5	1.784E-07	1.216E-05	5.359E-07	2.187E-07	9.488E-07	3.077E-06	1.683E-07	7.737E-07	2.334E-06	1.877E-06	3.835E-06	9.066E-07
1	−0.5	1.822E-07	1.164E-05	4.690E-07	2.171E-07	9.527E-07	3.141E-06	1.917E-07	8.920E-07	1.713E-06	1.825E-06	4.568E-06	1.024E-06
1.5	0.5	4.202E-07	2.680E-05	1.040E-06	3.780E-07	1.599E-06	3.857E-06	4.070E-07	1.672E-06	4.307E-06	4.402E-06	5.968E-06	2.078E-06
1.5	−0.5	4.165E-07	2.693E-05	8.912E-07	3.953E-07	1.725E-06	6.557E-06	4.159E-07	1.475E-06	4.581E-06	4.567E-07	5.814E-06	2.216E-06
Ignoring missing data
0.5	0.5	1.027E-06	1.165E-06	1.946E-07	6.254E-08	1.896E-07	9.387E-07	5.386E-08	2.548E-07	6.431E-07	4.499E-07	2.257E-06	2.740E-07
0.5	−0.5	1.329E-06	1.328E-06	2.800E-07	6.503E-08	2.004E-07	9.951E-07	5.913E-08	1.693E-07	5.375E-07	4.490E-07	2.253E-06	2.388E-07
1	0.5	1.679E-07	1.181E-05	5.124E-07	2.171E-07	9.079E-07	3.141E-06	1.828E-07	7.743E-07	2.231E-06	2.041E-06	3.937E-06	9.629E-07
1	−0.5	1.866E-07	1.037E-05	4.678E-07	2.425E-07	9.527E-07	3.857E-06	1.939E-07	8.920E-07	1.685E-06	1.822E-06	3.700E-06	1.017E-06
1.5	0.5	4.402E-07	2.404E-05	1.753E-06	3.953E-07	1.725E-06	6.790E-06	4.070E-07	1.654E-06	3.690E-06	4.567E-07	5.917E-06	1.985E-06
1.5	−0.5	4.567E-07	2.572E-05	9.034E-07	4.730E-07	1.725E-06	6.706E-06	4.159E-07	1.475E-06	4.581E-06	4.402E-07	5.814E-06	2.232E-06
Ignoring measurement errors
0.5	0.5	1.801E-02	8.973E-04	6.942E-03	1.063E-07	2.664E-07	1.269E-06	5.714E-08	2.531E-07	6.536E-07	4.296E-07	2.436E-06	2.780E-07
0.5	−0.5	2.281E-02	2.045E-03	1.019E-03	1.119E-07	3.246E-07	1.563E-06	5.643E-08	1.796E-07	5.257E-07	4.646E-07	2.419E-06	2.300E-07
1	0.5	1.784E-07	1.216E-05	5.359E-07	2.187E-07	9.488E-07	3.077E-06	1.683E-07	7.737E-07	2.334E-06	1.877E-06	3.835E-06	9.066E-07
1	−0.5	1.822E-07	1.164E-05	4.690E-07	2.171E-07	9.527E-07	3.141E-06	1.917E-07	8.920E-07	1.713E-06	1.825E-06	4.568E-06	1.024E-06
1.5	0.5	4.202E-07	2.680E-05	1.040E-06	3.780E-07	1.599E-06	3.857E-06	4.070E-07	1.672E-06	4.307E-06	4.402E-06	5.968E-06	2.078E-06
1.5	−0.5	4.165E-07	2.693E-05	8.912E-07	3.953E-07	1.725E-06	6.557E-06	4.159E-07	1.475E-06	4.581E-06	4.567E-07	5.814E-06	2.216E-06
Ignoring measurement errors and missing data
0.5	0.5	1.801E-02	8.973E-04	6.942E-03	1.063E-07	2.664E-07	1.269E-06	5.714E-08	2.531E-07	6.536E-07	4.296E-07	2.436E-06	2.780E-07
0.5	−0.5	2.281E-02	2.045E-03	1.019E-03	1.119E-07	3.246E-07	1.563E-06	5.643E-08	1.796E-07	5.257E-07	4.646E-07	2.419E-06	2.300E-07
1	0.5	1.784E-07	1.216E-05	5.359E-07	2.187E-07	9.488E-07	3.077E-06	1.683E-07	7.737E-07	2.334E-06	1.877E-06	3.835E-06	9.066E-07
1	−0.5	1.822E-07	1.164E-05	4.690E-07	2.171E-07	9.527E-07	3.141E-06	1.917E-07	8.920E-07	1.713E-06	1.825E-06	4.568E-06	1.024E-06
1.5	0.5	4.202E-07	2.680E-05	1.040E-06	3.780E-07	1.599E-06	3.857E-06	4.070E-07	1.672E-06	4.307E-06	4.402E-06	5.968E-06	2.078E-06
1.5	−0.5	4.165E-07	2.693E-05	8.912E-07	3.953E-07	1.725E-06	6.557E-06	4.159E-07	1.475E-06	4.581E-06	4.567E-07	5.814E-06	2.216E-06

Table 3. MeSEs for

σ^{2}

.

Table 3. MeSEs for

σ^{2}

.

		n = 100			n = 150			n = 200			n = 250
e	$λ$	m = 10	m = 15	m = 20	m = 10	m = 15	m = 20	m = 15	m = 20	m = 25	m = 15	m = 20	m = 25
Incorporating measurement errors and missing data
0.5	0.5	8.838E+03	6.371E+00	1.498E+05	2.544E-02	2.725E-02	2.938E-02	1.402E-02	1.326E-02	1.413E-02	1.255E-02	5.831E-02	1.436E-02
0.5	−0.5	1.971E+03	3.881E+01	4.972E+02	2.515E-02	2.724E-02	2.161E-02	1.504E-02	1.285E-02	1.285E-02	1.087E-02	4.399E-02	1.340E-02
1	0.5	2.497E-01	2.111E-01	2.422E-01	2.286E-01	2.451E-01	2.327E-01	2.655E-01	3.116E-01	2.646E-01	5.731E-01	7.831E-01	5.634E-01
1	−0.5	2.428E-01	2.424E-01	2.191E-01	2.618E-01	2.378E-01	2.618E-01	2.571E-01	3.228E-01	2.817E-01	5.984E-01	1.015E+00	5.455E-01
1.5	0.5	1.039E+00	8.811E-01	8.929E-01	1.870E+00	1.237E+00	1.388E+00	1.751E+00	1.950E+00	1.752E+00	3.281E+00	4.377E+00	2.882E+00
1.5	−0.5	7.428E-01	1.214E+00	1.008E+00	2.276E+00	1.156E+00	1.197E+00	1.409E+00	2.525E+00	2.000E+00	3.183E+00	5.228E+00	3.731E+00
Ignoring missing data
0.5	0.5	6.195E+05	7.860E+05	9.643E+07	9.921E+07	1.059E+07	6.249E+05	1.097E+08	6.918E+06	1.673E+06	1.590E+06	1.887E+06	9.845E+06
0.5	−0.5	7.591E+05	6.743E+05	1.409E+07	1.534E+08	1.149E+07	8.153E+05	8.139E+06	1.796E+06	8.070E+06	1.241E+06	1.750E+06	8.070E+06
1	0.5	5.165E+08	8.849E+04	1.396E+08	8.536E+07	1.048E+07	8.269E+05	1.222E+08	7.143E+06	1.647E+06	1.489E+06	1.885E+06	8.864E+06
1	−0.5	8.202E+08	8.025E+04	1.175E+08	1.300E+08	9.951E+06	7.452E+05	1.033E+08	7.292E+06	1.689E+06	1.381E+06	1.962E+06	1.062E+07
1.5	0.5	5.896E+08	9.098E+04	1.807E+08	1.364E+08	1.323E+07	7.982E+05	1.827E+08	8.376E+06	1.992E+06	1.701E+06	2.093E+06	9.215E+06
1.5	−0.5	5.829E+08	8.085E+04	1.362E+08	1.450E+08	1.148E+07	7.943E+05	1.349E+08	8.539E+06	1.874E+06	1.478E+06	2.036E+06	1.101E+07
Ignoring measurement errors
0.5	0.5	1.552E-02	1.409E-02	1.635E-02	2.452E-02	1.870E-02	1.839E-02	3.074E-02	3.737E-02	3.452E-02	3.967E-02	1.438E-01	5.327E-02
0.5	−0.5	1.556E-02	1.611E-02	1.420E-02	1.966E-02	1.673E-02	1.157E-01	3.432E-02	3.214E-02	5.016E-02	4.095E-02	1.157E-01	5.016E-02
1	0.5	1.575E-01	1.784E-01	1.942E-01	3.701E-01	4.468E-01	3.903E-01	4.643E-01	5.033E-01	4.763E-01	7.663E-01	1.166E+00	7.863E-01
1	−0.5	2.203E-01	1.996E-01	2.201E-01	4.989E-01	3.800E-01	3.196E-01	4.460E-01	5.350E-01	5.115E-01	8.282E-01	1.174E+00	7.716E-01
1.5	0.5	1.071E+00	9.059E-01	1.040E+00	2.490E+00	1.723E+00	1.843E+00	2.196E+00	2.450E+00	2.661E+00	3.956E+00	4.853E+00	3.374E+00
1.5	−0.5	8.711E-01	1.130E+00	1.075E+00	2.974E+00	1.652E+00	1.563E+00	1.849E+00	3.028E+00	2.583E+00	3.761E+00	5.851E+00	4.455E+00
Ignoring measurement errors and missing data
0.5	0.5	6.249E+05	7.933E+05	9.505E+07	1.007E+08	1.088E+07	6.285E+05	1.143E+08	8.101E+06	1.670E+06	1.592E+06	1.891E+06	1.008E+07
0.5	−0.5	8.039E+05	6.779E+05	1.376E+07	1.524E+08	1.148E+07	8.325E+05	8.101E+06	1.797E+06	8.078E+06	1.262E+06	1.754E+06	8.078E+06
1	0.5	5.243E+08	8.745E+04	1.387E+08	8.553E+07	1.051E+07	8.309E+05	1.223E+08	7.152E+06	1.655E+06	1.490E+06	1.887E+06	8.866E+06
1	−0.5	8.338E+08	8.011E+04	1.187E+08	1.306E+08	1.015E+07	7.443E+05	1.036E+08	7.452E+06	1.691E+06	1.382E+06	1.965E+06	1.069E+07
1.5	0.5	5.856E+08	8.961E+04	1.826E+08	1.371E+08	1.389E+07	7.982E+05	1.828E+08	8.344E+06	1.997E+06	1.726E+06	2.096E+06	9.126E+06
1.5	−0.5	5.829E+08	8.209E+04	1.369E+08	1.456E+08	1.156E+07	7.943E+05	1.357E+08	8.559E+06	1.843E+06	1.474E+06	2.039E+06	1.089E+07

Table 4. Descriptions of the variables in the Boston Housing Dataset.

Attribute	Explanation	Remarks
CRIM	Per capita crime rate by town
ZN	Proportion of residential land zoned for lots over 25,000 sq. ft.	Residential land proportion
INDUS	Proportion of non-retail business acres per town	Non-retail business proportion
CHAS	Charles River dummy variable	Charles River variable for regression analysis
NOX	Nitrogen oxide concentration (ppm)	Environmental indicator
RM	Average number of rooms per dwelling	Number of rooms in residential units
AGE	Proportion of owner-occupied units built prior to 1940	Pre-1940s-constructed units proportion
DIS	Weighted distances to five Boston employment centers	Distance to employment hubs
RAD	Index of accessibility to radial highways	Highway accessibility index
TAX	Full-value property tax rate per 10,000	Property tax rate
PRATO	Pupil–teacher ratio by town	Pupil-teacher ratio
B	$1000 {(B k - 0.63)}^{2}$ , where Bk is the proportion of blacks by town	Proportion of black population
LSTAT	Percentage of the population classified as lower income	Lower-income class proportion
MEDV	Median value of owner-occupied homes	Typically, the target variable in an analysis

Table 5. MeSEs for

λ

,

β

, and

σ^{2}

.

Table 5. MeSEs for

λ

,

β

, and

σ^{2}

.

	$λ$	$β_{1}$	$β_{2}$	$β_{3}$	$β_{4}$	$β_{5}$	$σ^{2}$
Real parameters	4.446E-01	−1.770E-01	−3.425E-02	−4.261E-01	−1.249E-01	4.102E-01	8.168E-03
Incorporating measurement errors and missing data	8.870E-03	1.878E-03	8.669E-04	9.953E-03	1.716E-03	2.848E-01	9.182E-07
Ignoring missing data	3.536E-02	2.045E-03	8.998E-04	9.691E-03	1.805E-03	1.746E-01	8.075E-06
Ignoring measurement errors	1.043E-02	1.919E-03	8.840E-04	6.065E-03	2.132E-03	2.457E-01	8.077E-07
Ignoring measurement errors and missing data	4.528E-02	2.094E-03	8.565E-04	3.985E-03	1.629E-03	1.208E-01	1.125E-05

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, T.; Zhang, Z.; Song, Y. Parameter Estimation in Spatial Autoregressive Models with Missing Data and Measurement Errors. Axioms 2024, 13, 315. https://doi.org/10.3390/axioms13050315

AMA Style

Li T, Zhang Z, Song Y. Parameter Estimation in Spatial Autoregressive Models with Missing Data and Measurement Errors. Axioms. 2024; 13(5):315. https://doi.org/10.3390/axioms13050315

Chicago/Turabian Style

Li, Tengjun, Zhikang Zhang, and Yunquan Song. 2024. "Parameter Estimation in Spatial Autoregressive Models with Missing Data and Measurement Errors" Axioms 13, no. 5: 315. https://doi.org/10.3390/axioms13050315

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Parameter Estimation in Spatial Autoregressive Models with Missing Data and Measurement Errors

Abstract

1. Introduction

2. Parameter Estimation in Spatial Autoregressive Models with Measurement Errors and Missing Data

2.1. Spatial Autoregressive Models

2.2. Spatial Autoregressive Model with Missing Data and Measurement Errors

3. Simulation

4. Real Data Example

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI