Next Article in Journal
Statistical Inference of Two Gamma Distributions under the Joint Type-II Censoring Scheme
Previous Article in Journal
Lipschitz Stability in Terms of Two Measures for Kurzweil Equations and Applications
Previous Article in Special Issue
K-L Estimator: Dealing with Multicollinearity in the Logistic Regression Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Wild Bootstrap-Based Bias Correction for Spatial Quantile Panel Data Models with Varying Coefficients

1
School of Statistics and Mathematics, Shanghai Lixin University of Accounting and Finance, Shanghai 201209, China
2
Interdisciplinary Research Institute of Data Science, Shanghai Lixin University of Accounting and Finance, Shanghai 201209, China
3
School of Finance, Shanghai Lixin University of Accounting and Finance, Shanghai 201209, China
4
School of Statistics, Renmin University of China, Beijing 100872, China
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(9), 2005; https://doi.org/10.3390/math11092005
Submission received: 13 March 2023 / Revised: 17 April 2023 / Accepted: 20 April 2023 / Published: 23 April 2023
(This article belongs to the Special Issue Statistical Theory and Application)

Abstract

:
This paper studies quantile regression for spatial panel data models with varying coefficients, taking the time and location effects of the impacts of the covariates into account, i.e., the implications of covariates may change over time and location. Smoothing methods are employed for approximating varying coefficients, including B-spline and local polynomial approximation. A fixed-effects quantile regression (FEQR) estimator is typically biased in the presence of the spatial lag variable. The wild bootstrap method is employed to attenuate the estimation bias. Simulations are conducted to study the performance of the proposed method and show that the proposed methods are stable and efficient. Further, the estimators based on the B-spline method perform much better than those of the local polynomial approximation method, especially for location-varying coefficients. Real data about economic development in China are also analyzed to illustrate application of the proposed procedure.

1. Introduction

Panel data analysis has received much attention during the last two decades due to applications in many disciplines, such as Economics, Finance, Biology, Medicine, and Social Sciences. Panel data involve two dimensions: a cross-sectional dimension and a time-series dimension. Such two-dimensional information enables researchers to extend modeling possibilities compared to time-series or cross-sectional data. However, if the data are collected with reference to location, spatial dependence may arise from underlying regional interactions. This means that the outlying magnitude of one observation may be influenced by its neighborhood. Spatial dependence violates the independent assumption used in traditional panel data analysis. This gives rise to the need for alternative estimation approaches. Thus, spatial panel data models combining the spatial auto-regressive structure with a conventional regression model have received much attention, as they enable researchers to consider the cross-section dependence from contemporaneous or lagged cross-section interactions.
Recently, the literature on spatial panel data models has been growing (see [1,2,3,4,5,6,7,8,9,10,11,12,13,14]). Spatial panel data models have been successfully applied to various fields. For example, Ref. [15] employed spatial econometric methods via a spatial weights matrix to a panel municipal water consumption dataset; Ref. [16] used the spatial panel data model to examine the factors affecting COVID-19 and to analyze the spatial relationship between confirmed cases of COVID-19; Ref. [17] evaluated the impact of purchase restrictions on housing prices based on spatial dynamic models with short panels. Ref. [18] considered panel data models with cross-sectional dependence arising from spatial autocorrelation and unobserved common factors for analyzing the spatial correlation of US house prices.
Most of these works were developed via the homogeneous slope assumption. However, the homogeneous slope assumption is often unrealistic and too strict in applications and cannot be flexible enough to capture the underlying complex dependence structure [19]. Additionally, misspecification of the data-generation mechanism by a linear model can lead to excessive modeling biases and erroneous conclusions. Thus, [20] proposed the geographically weighted regression (GWR) model, which relaxes the homogeneous slope assumption and can investigate location effects of the impacts of covariates. However, the GWR model also ignores the spatial dependence between cross-sections. Further, some researchers investigated semi-parametric varying coefficient spatial panel data models [21,22,23], which took spatial dependence into account. However, in most of these works, varying coefficients were correlated with a one-dimensional smoothing variable, ignoring the time and location effects of the regression coefficients.
Moreover, most existing estimation methods are sensitive to outliers and influential observations in the datasets. However, the outliers in the dataset may provide some useful information behind the dataset. Simply throwing them out of the dataset loses important information [24]. Thus, this paper proposes a spatial panel data model with time-varying or location-varying coefficients, which allows us to investigate how the effects of covariates vary over time and location. The varying coefficients are approximated using basis function approximations, including B-spline and local polynomial approximation. The quantile regression (QR) method is employed for estimation since it is not sensitive to outliers and influential observations. Further, the QR method relaxes the normality assumption used in traditional regression modeling and can deal with data from different error distributions.
The major contributions of this paper can be outlined as follows. (1) The spatial autoregressive panel data models and the GWR model are two major tools for analyzing spatial panel data, which ignore the heterogeneity of slope and spatial dependence, respectively. This paper combines the spatial auto-regressive structure with the GWR and temporal weighted regression (TWR) models. It proposes a spatial autoregressive panel data model with time-varying and location-varying coefficients, which can investigate the time and location effects of the impacts of the covariates and simultaneously analyze spatial correlation. (2) Most current works focus on mean regression and assume homoscedastic regression errors. However, heteroscedasticity is often seen in applications, which may cause the covariates to have different impacts at different locations of the response distribution. The QR method used in this paper can automatically capture such heteroscedasticity and provide a comprehensive picture of the relationship between the response and explanatory variables. Further, the QR method is robust and relaxes the normality assumption and thus can be adopted to deal with data characterized by different error distributions. (3) The local linear polynomial approximation method is a major tool for approximating the location-varying coefficients. This paper proposes the B-spline surface fitting method to approximate the location-varying coefficients. Monte Carlo simulations show that the B-spline surface fitting method is more stable and provides better approximations than the local linear polynomial approximation method. (4) Endogeneity brought by the spatial lag variable can cause biased estimation. One commonly used approach is to employ the instrumental variable (IV) method to attenuate endogenous bias [7,25,26,27]. However, the effectiveness of the IV-based method depends on the selection of IV. Thus, this paper circumvents the issue of IV selection and considers an alternative approach to reduce the endogenous bias, i.e., the bootstrap-based bias correction method. However, traditional bootstrap re-sampling methods may be inappropriate for spatial panel data since the spatial dependence structure changes if one region is drawn more than once. Thus, this paper employs the wild residual-based bootstrap method [28,29] for bias correction.
The article is organized as follows. Section 2 presents the model. Section 3 provides the estimation procedure. Section 5 investigates the finite sample performance of the methods. An empirical illustration is considered in Section 6. Concluding remarks are presented in Section 7.

2. The Models

Consider the following spatial panel data model with varying coefficients:
y i t = ρ j = 1 n w i j y j t + X i t β ( U i t ) + η i + ε i t ,
where i = 1 , , N , t = 1 , , T , y i t is the dependent variable for subject i at time t, X i t is a p × 1 vector of nonstochastic explanatory variables, w i j is the ( i , j ) -th element of the spatial weight matrix W reflecting spatial dependence on y i t among cross-sectional units, and ε i t is independent and identically distributed across i and t. Interaction effects are reflected in the spatiotemporal lag variable j i w i j y j t (and associated scalar parameter ρ ); β ( U i t ) is a p × 1 vector of varying coefficients that vary with U i t ; U i t is a d × 1 smoothing variable; η i , i = 1 , , N are fixed effects for the regions.
Denote d i t = j = 1 n w i j y j t ; then, the following conditional quantile relationship can be defined:
Q τ ( y i t | d i t , X i t , Z i , U i t ) = ρ ( τ ) d i t + X i t β ( τ , U i t ) + Z i η ( τ ) ,
where Q τ ( y i t | d i t , X i t , Z i , U i t ) is the conditional τ -quantile of y i t given d i t , X i t , Z i , and U i t ; Z i = Z h i is an indicator variable for the individual effect η i ; h i is an N T × 1 vector with the i-th element equal to 1 and the rest equal to 0; Z = 1 T I N is an N T × N matrix; 1 J is the J × 1 vector with all the elements being 1; Q τ ( ε i t | d i t , X i t , Z i , U i t ) = 0 .
In model (1), the following two cases can be considered:
  • Case 1 (Time-varying coefficient): Let the smoothing variable U i t be the time t. Thus, we can investigate whether and how the impacts of covariates X i t vary over time.
  • Case 2 (Location-varying coefficient): Let the smoothing variable U i t be the location S i of the i-th observation, which is a two-dimensional vector consisting of latitude and longitude. Thus, we can investigate the location effects of the impacts of the covariates, i.e., the impacts of some covariates may vary over the location.

3. Estimation Procedure

This section employs the smoothing method to approximate the varying coefficient β ( U i t ) . For expositional simplicity, we focus on the case where U i t takes a value in a compact set U R d . Let B ( U i t ) = ( b 1 ( U i t ) , , b K ( U i t ) ) denote a set of basis functions (e.g., spline, polynomial, wavelet, etc.). Then each β l ( τ , U i t ) ( l = 1 , , p ) can be approximated by a linear combination of the basis functions
β l ( τ , U i t ) s = 1 K b s ( U i t ) θ l , s ( τ ) = B ( U i t ) θ l ( τ ) ,
where θ l ( τ ) = ( θ l , 1 ( τ ) , , θ l , K ( τ ) ) is the coefficient vector and β l ( τ , U i t ) is the l-th element of β ( U i t ) .
Thus, model (2) can be approximated by
Q τ ( y i t | d i t , X i t , U i t , Z i ) ρ ( τ ) d i t + l = 1 p X i t , l B ( U i t ) θ l ( τ ) + Z i η ( τ ) = ρ ( τ ) d i t + Π i t θ ( τ ) + Z i η ( τ ) ,
where θ ( τ ) = ( θ 1 ( τ ) , , θ p ( τ ) ) , Π i t = ( X i t , 1 B ( U i t ) , , X i t , p B ( U i t ) ) = X i t B ( U i t ) R p K , and X i t , l is the l-th element of X i t .
Then the following objective function can be defined:
R ( τ , ρ , θ , η ) = i = 1 N t = 1 T ρ τ ( y i t ρ d i t Π i t θ Z i η ) .
where ρ τ ( ε ) = ε ( τ I ( ε 0 ) ) . The FEQR estimation of ρ , θ and η can be obtained by
( ρ ^ ( τ ) , θ ^ ( τ ) , η ^ ( τ ) ) = arg min ρ , θ , η R ( τ , ρ , θ , η ) .
Finally, the estimation of β l ( τ , U i t ) can be obtained, which is β ^ l ( τ , U i t ) = B ( U i t ) θ ^ l ( τ ) .
Remark 1.
If the coefficient  β ( · )  is time-varying, then the smoothing variable  U i t  equals t. As an example, two different basis functions are provided for approximation:
1. 
The B-spline method is employed for approximation. For each U i t U , normalized B-splines of order h + 1 are employed to approximate the β ( τ , U i t ) . We consider an extended partition of U by m quasi-uniform internal knots. Then the basis function B ( U i t ) is a set of B-spline basis functions. In this case, K = m + h + 1 .
2. 
The local polynomial fitting scheme is employed for approximation. Under smoothness condition of each varying coefficient β l ( τ , U i t ) , it has ( m + 1 ) -th continuous derivative ( m 0 ) for any given grid point u 0 U . When U i t is in a neighborhood of u 0 , the local polynomial approximation of β l ( τ , U i t ) takes the form:
β l ( τ , U i t ) s = 1 m + 1 β l ( s 1 ) ( τ , u 0 ) ( U i t u 0 ) s 1 / ( s 1 ) ! = s = 1 m + 1 b s ( U i t ) θ l s ( τ ) = B ( U i t ) θ l ( τ ) ,
where B ( U i t ) = ( b 1 ( U i t ) , , b m + 1 ( U i t ) ) , b s ( U i t ) = ( U i t u 0 ) s 1 / ( s 1 ) ! , θ l ( τ ) = ( θ l , 1 ( τ ) , , θ l , m + 1 ( τ ) ) , θ l s ( τ ) = β l ( s 1 ) ( τ , u 0 ) , and β l ( s 1 ) ( τ , u 0 ) is the s 1 -th-order derivative of β l ( τ , u ) at u = u 0 . In this case, K = m + 1 .
In addition, when the smoothing variable U i t is some covariate Z i t (i.e., the impacts of covariates X i t may vary with some other covariate Z i t ), the approximation procedure above is still applicable.
Remark 2.
If the coefficient  β ( · )  is location-varying, then the smoothing variable  U i t  is equal to the location  S i = ( Lat i , Lon i ) S = ( S 1 , S 2 )  of the i-th region, where  Lat i S 1  and  Lon i S 2  are, respectively, the latitude and longitude of the i-th region. Similarly, two different basis functions are provided for approximation:
1. 
The B-spline surface fitting method is employed for approximation. For each S i S , normalized B-splines of order h + 1 are employed to approximate the β ( τ , S i ) . We consider an extended partition of S 1 by m quasi-uniform internal knots and an extended partition of S 2 by n quasi-uniform internal knots. Let B m ( Lat i ) = ( b 1 ( Lat i ) , , b m + h + 1 ( Lat i ) ) and B n ( Lon i ) = ( b 1 ( Lon i ) , , b n + h + 1 ( Lon i ) ) , respectively, denote a set of B-spline basis functions. For each β l ( τ , S i ) ( l = 1 , , p ) , the B-spline surface approximation can be defined as follows:
β l ( τ , S i ) j = 1 m + h + 1 k = 1 n + h + 1 b j ( Lat i ) b k ( Lon i ) θ l j k ( τ ) = B ( S i ) θ l ( τ ) ,
where B ( S i ) = B m ( Lat i ) B n ( Lon i ) , θ l ( τ ) = ( θ l , 1 , 1 ( τ ) , , θ l , 1 , n + h + 1 ( τ ) , , θ l , m , 1 ( τ ) , , θ l , m , n + h + 1 ( τ ) ) is the spline coefficient vector. In this case, K = ( m + h + 1 ) × ( n + h + 1 ) .
2. 
It is difficult to perform higher-order Taylor expansions on binary functions. For simplicity, the local linear approximation is employed for estimation. For any s = ( u , v ) , denote ( β l ( s ) / u , β l ( s ) / v ) by β ˙ l ( s ) . For any given s 0 S , if S i is in a neighborhood of s 0 , then we have
β l ( τ , S i ) β l ( τ , s 0 ) + ( S i s 0 ) β ˙ l ( s 0 ) = B ( S i ) θ l ( τ ) ,
where B ( S i ) = ( 1 , ( S i s 0 ) ) , θ l ( τ ) = ( β l ( τ , s 0 ) , β ˙ l ( s 0 ) ) . In this case, K = 3 .
Remark 3.
For the objective function (4), the knots for the B-spline and the order for the local polynomial can be chosen as the minimizer to the following Schwarz-type Information Criterion [25,30,31,32]
S I C = log i = 1 n ρ τ ( y i t ρ ( τ ) d i t Π i t θ ( τ ) Z i η ( τ ) ) } + log N T 2 N T K ,
where K is the number of unknown parameters. Additionally, in the B-spline approximation, the cubic spline is used with h = 3 .

4. Wild Bootstrap-Based Bias Correction

However, due to the presence of endogenous variable d i t , the FEQR estimation is biased as in the OLS case, especially for the spatial correlation coefficient ρ . Thus, the bootstrap method is employed for bias correction. When bootstrapping samples from a panel dataset, the bootstrap method should preserve the correlation structure of the data. As in [33], several bootstrap re-sampling methods can be employed in the panel data, such as cross-section re-sampling, temporal re-sampling, and cross-section and temporal re-sampling.
However, bootstrap re-sampling methods may be inappropriate for spatial panel data. For instance, when the coefficient β ( t ) is time-varying, a natural idea is to employ the cross-section sampling method in which different regions are sampled with and replaced with probability 1 / N . However, the spatial dependence structure is also changed if one region is drawn more than once.
Thus, this section employs the wild residual-based bootstrap method [28,29] to obtain the bias-corrected FEQR estimation. Denote Θ = ( ρ , θ , η ) . The wild bootstrap-based bias correction procedure is described as follows:
  • Calculate the residual ε ^ i t ( τ ) = y i t ρ ^ ( τ ) d i t Π i t θ ^ ( τ ) η ^ i ( τ ) .
  • Let ε ^ i t * ( τ ) = r i t | ε ^ i t ( τ ) | , where r i t is generated from an appropriate distribution that satisfies Conditions 3–5 in [28].
  • Generate the bootstrapped sample as y * = ( I N T ρ ^ ( τ ) W * ) 1 ( Π θ ^ ( τ ) + Z η ^ ( τ ) + ε ^ * ( τ ) ) , where W * = I N W and I a represents an a × a identity matrix. Then the corresponding spatial lag of y * is W * y * .
  • Repeat Steps 2–3 B times and obtain the bootstrap estimates based on the generated samples y * .
  • Then, the bootstrap estimated bias is computed with the formula
    bias ^ ( Θ ^ ( τ ) ) = 1 B b = 1 B Θ ^ b ( τ ) Θ ^ ( τ ) ,
    where Θ ^ ( τ ) is the FEQR estimator, which is defined in the Section 3, and Θ ^ b ( τ ) is the bootstrap estimate based on the b-th bootstrap sample. Therefore, the bias-corrected FEQR estimate Θ ^ c ( τ ) is of the form
    Θ ^ c ( τ ) = Θ ^ ( τ ) bias ^ ( Θ ^ ( τ ) ) .
Accordingly, the bias-corrected FEQR estimates of β ( τ , U i t ) can be obtained in the same way as in Section 3.

5. Monte Carlo Simulations

In this section, Monte Carlo simulations are conducted to examine the finite-sample performances of the proposed estimation procedures. The Monte Carlo simulations are repeated 1000 times. The number of bootstrap replicates B is set to B = 100 . We also try larger values of B (200 and 400) and found similar results. We consider several sample sizes and quantiles, where N { 50 , 100 } , T { 20 , 40 } , and τ { 0.25 , 0.5 , 0.75 } .
Example 1 (Time-varying coefficient).
In this example, the data are generated as follows:
y i t = ρ j = 1 n w i j y j t + X 1 , i t β 1 ( t ) + X 2 , i t β 2 ( t ) + η i + ε i t ( τ ) ,
where ρ = 0.5 , β 1 ( t ) = 1 0.5 t , β 2 ( t ) = 1 + sin ( 2 π t ) , and X 1 , i t , X 2 , i t , and η i , respectively, follow the N ( 0 , 1 ) , U [ 0 , 2 ] and N ( 0 , 1 ) distribution.
Example 2 (Location-varying coefficient).
In this example, the data-generating process is given by:
y i t = ρ j = 1 n w i j y j t + X 1 , i t β 1 ( S i ) + X 2 , i t β 2 ( S i ) + η i + ε i t ( τ ) ,
where ρ = 0.3 , β 1 ( S i ) = 1 + sin ( S i 2 π ) , β 2 ( S i ) = 1 + exp ( S i 2 ) , and X 1 , i t , X 2 , i t , and η i are independently drawn from the N ( 0 , 1 ) , N ( 1 , 1 ) , and U [ 2 , 2 ] distribution, respectively.
In all simulated examples, ε i t ( τ ) = ε i t F 1 ( τ ) , where F is the common CDF of ε i t , F = N ( 0 , 1 ) or t ( 1 ) . Therefore, the random errors ε i t are centered to have a zero τ -th quantile. Further, the spatial weight matrix W = ( w i j ) is generated according to the design considered by [34]:
w i j = exp ( s i s j ) / k i exp ( s i s k ) ,
where s i is drawn from U [ 0 , 1 ] 2 distribution.
In the experiments, three estimators are considered. The first one is the estimator in the mean regression framework, such as MLE ([35]), which requires the normality assumption. The second one is the FEQR estimator, which ignores the endogenous bias caused by the spatial lag. The third estimator is the wild bootstrap-based bias-corrected estimator (abbreviated as “FEQR-BC”) in which the random bootstrap weight r i t is generated from the two-point distribution with probability 0.5 at 1 and 1 . To assess the performance of finite samples, we compute the bias and root mean squared error (RMSE) for ρ and the mean absolute deviation error (MADE) for the varying coefficients β 1 ( · ) and β 2 ( · ) .
The QR and MLE estimators’ performances are compared under different distributions. Additionally, the proposed FEQR-BC estimator is compared with the FEQR estimator to assess the effectiveness of the proposed bias-correction method. To see whether the B-spline method provides better approximations, especially for the location-varying coefficients, the performances of the three estimators based on the B-spline approximation method are also compared with those based on the local polynomial approximation method. The B-spline knots and the local polynomial approximation order are chosen via the SIC Criterion. The results are summarized in the following Table 1, Table 2, Table 3 and Table 4.
Table 1 reports the comparison results for the time-varying coefficient SAR panel data model with Gaussian errors, which shows that, in the Gaussian case, the MLE estimator of the spatial autoregressive coefficient ρ is approximately unbiased. However, in the presence of a spatial lag variable, ρ ^ is biased upward for the FEQR estimator. Additionally, the FEQR-BC estimator can eliminate the bias essentially. For the time-varying coefficients β ^ 1 ( t ) and β ^ 2 ( t ) , the MADEs of the FEQR-BC estimator are smaller than those of the MLE estimator. Notably, the performance of the MLE estimator based on the local polynomial approximation becomes much worse when T is large. Likewise, when T is large, the performances of the QR estimators based on the local polynomial approximation are also slightly worse.
Table 2 collects the comparison results for the time-varying coefficient SAR panel data model in the t ( 1 ) case and shows that FEQR-BC can eliminate the bias of ρ , especially at τ = 0.5 . Additionally, the performances of the three estimators based on the local polynomial approximation all become worse as T increases. Moreover, our proposed FEQR-BC estimator outperforms the MLE estimator, as it does not impose any finite moment assumption on the disturbance errors.
Table 3 collects the comparison results for the location-varying coefficient SAR panel data model with Gaussian errors. The results show that, in the Gaussian case, the MLE estimator of the spatial autoregressive coefficient ρ is approximately unbiased and performs better than the FEQR and FEQR-BC estimators. Additionally, the FEQR-BC estimator has a much smaller bias than the FEQR estimator. For the location-varying coefficients β 1 ( s ) and β 2 ( s ) , the FEQR-BC estimator has smaller MADEs compared to the MLE estimator. Further, the three estimators based on the B-spline surface fitting scheme perform better than those of the local linear approximation method.
Table 4 presents the comparison results for the location-varying coefficient SAR panel data model in the t ( 1 ) case and shows that FEQR-BC can eliminate the bias of ρ and is approximately unbiased. The FEQR-BC and FEQR estimators outperform the MLE estimator. Additionally, the performances of the FEQR and FEQR-BC estimators based on the local linear approximation worsen as N increases. Overall, the FEQR and FEQR-BC estimators based on the B-spline surface fitting scheme perform better than those of the local linear approximation method.
In Example 1, the confidence intervals of the varying coefficients β 1 ( t ) and β 2 ( t ) are also considered. The results are reported in Figure 1. The x-axis presents the time t, and the y-axis presents the estimations of the varying coefficients at quantile 0.5 and sample size ( N , T ) = ( 100 , 20 ) (red lines) and their corresponding confidence intervals (blue lines) at significance level 0.05. Figure 1a,b gives the confidence intervals of β 1 ( t ) , β 2 ( t ) based on the B-spline curve fitting method, and Figure 1c,d gives the confidence intervals of β 1 ( t ) , β 2 ( t ) based on the local polynomial fitting method. All these show our estimation procedure works very well.
Figure 2 shows that the estimated surface based on the B-spline surface fitting method is close to the true surface. The estimated surface based on local linear approximation is a plane. Thus, the B-spline surface fitting method is superior to the local linear approximation method.

6. Illustration

Economic development is an important research area in economics. The data used in this paper were collected from 31 provinces in China from 2005–2019 and can be obtained from the EPSDATA Databases (https://www.epsnet.com.cn/index.html#/Index, accessed on 12 March 2023). The dependent variable is the economic development level. Our analysis aims to model the effects of the industrial structure (IS), urbanization (Urban), openness (Open), higher education (Edu), and infrastructure (Infra) on different quantiles of the economic development level (GDP). The detailed variable definitions are presented in Table 5.

6.1. Time-Varying Effects of the Covariates

Firstly, we try to find which covariates have time-varying effects on economic development and which covariates do not. We consider a set of quantiles with τ = { 0.1 , 0.2 , , 0.9 } . At each quantile τ , the SIC criterion is applied for model selection. The results are summarized in Table 6. The obtained result shows the coefficients of Edu ( X 4 ) and Infra ( X 5 ) are constant, and the covariates IS ( X 1 ), Urban ( X 2 ), and Openness ( X 3 ) have time-varying effects on the economic development. Additionally, the following QR model is considered:
Q τ ( y i t | d i t , X i t ) = ρ ( τ ) d i t + X 1 , i t β 1 ( t , τ ) + X 2 , i t β 2 ( t , τ ) + X 3 , i t β 3 ( t , τ ) + X 4 , i t β 4 ( τ ) + X 5 , i t β 5 ( τ ) + η i ( τ ) ,
where d i t = j = 1 n w i j y j t , w i j is the ( i , j ) -th element of W , and W is the spatial weight matrix generated by the longitude and latitude of the 31 regions [36]. We employ the B-spline curve fitting scheme to approximate the varying coefficients β 1 ( t ) , β 2 ( t ) , and β 3 ( t ) .
The MLE and bias-corrected FEQR estimates of ρ , β 4 , and β 5 at quantiles τ = 0.25 , 0.5 , 0.75 are reported in the top half of Table 7. The results show that the spatial correlation coefficient ρ is positive, which means that the economic growth rates in a neighborhood affect each other and the impact is positive. Additionally, Edu ( X 4 ) and Infra ( X 5 ) positively affect economic development at quantiles τ = 0.25 , 0.5 , 0.75 .
Figure 3a–c plot the surfaces of the estimated β 1 ( τ , t ) , β 2 ( τ , t ) , and β 3 ( τ , t ) . The x-axis represents the years, the y-axis represents the quantiles, and the z-axis represents the varying coefficients’ estimations. Figure 3 indicates that β 1 ( τ , t ) , β 2 ( τ , t ) , and β 3 ( τ , t ) all vary over time. The impact of industry structure (IS) is generally negative and gradually declines over time. The effect of urbanization (Urban) is positive and increases first then decreases. The impact of openness (Open) is initially negative and gradually becomes positive as time goes on.

6.2. Location-Varying Effects of the Covariates

Following, we investigate the location effects of the impacts of the covariates and try to find which covariates have location-varying effects on the urban–rural income gap and which covariates do not. We consider a set of quantiles with τ = { 0.1 , 0.2 , , 0.9 } . At each quantile τ , the SIC criterion is applied for model selection. The results are summarized in Table 8. The obtained results shows the effects of Edu ( X 4 ) and Infra ( X 5 ) are constant, and the coefficients of the rest of the covariates all have location-varying effects on the economic development level. The QR model is given by:
Q τ ( y i t | d i t , X i t ) = ρ ( τ ) d i t + X 1 , i t β 1 ( S i , τ ) + X 2 , i t β 2 ( S i , τ ) + X 3 , i t β 3 ( S i , τ ) + X 4 , i t β 4 ( τ ) + X 5 , i t β 5 ( τ ) + η i ( τ ) ,
where d i t = j = 1 n w i j y j t . The B-spline surface fitting scheme is employed to approximate the varying coefficients β 1 ( s ) , β 2 ( s ) , and β 3 ( s ) .
The MLE and bias-corrected FEQR estimates of ρ , β 4 , and β 5 at quantiles τ = 0.25 , 0.5 , 0.75 are reported in the latter part of Table 7, which shows that the spatial correlation coefficient ρ is positive, which means that the economic development levels in a neighborhood affect each other and the impact is positive. Additionally, Edu ( X 4 ) and Infra ( X 5 ) positively affect economic development at quantiles τ = 0.25 , 0.5 , 0.75 .
Figure 4a–c plot the surface of the estimated β 1 ( s ) , β 2 ( s ) , and β 3 ( s ) at quantile τ = 0.5 . The x-axis represents the latitude, the y-axis represents the longitude, and the z-axis represents the estimation of the varying coefficients. Figure 4a shows that the effect of industry structure (IS) is positive in low latitudes and gradually transforms to negative as the latitude increases. In general, the industry structure (IS) effect is more significant in Eastern China. Figure 4b indicates that the impact of openness (Open) is generally negative and gradually transforms to positive as the latitude increases. Figure 4c shows that openness (Open) generally hurts economic development and the negative impact becomes weaker as longitude and latitude increase.

7. Conclusions

This paper investigates a spatial quantile panel data model with varying coefficients. Both the time and location effects of the impacts of the covariates are taken into account, which allows us to investigate how the effects of covariates vary over time and location. The B-spline curve fitting scheme and the local polynomial approximation are employed to approximate the varying coefficient for the time-varying coefficient model. For the location-varying coefficient model, the local linear approximation is used for estimation since performing higher-order Taylor expansions on binary functions is difficult. Considering that the local linear estimator is too rough, the B-spline surface fitting method is introduced for approximation.
Moreover, the FEQR estimation is biased in the presence of the spatial lag variable. Thus, the wild bootstrap method is employed for bias correction. The proposed model relaxes the homogeneous slope assumption and can accommodate heterogeneity in the regression function and thus balances better between model flexibility and parsimony. Further, the estimation approach relaxes the normality assumption and therefore does not require any specification of the error distribution.
Important and interesting further studies can be conducted by considering three-dimensional spatiotemporal-varying coefficients. This paper applies the SIC criterion for model structure identification, i.e., identification of varying and constant coefficients. In future work, we hope to propose a penalty method to simultaneously achieve parameter estimation, variable selection, and model structure identification, which is more convenient and efficient in practical applications.

Author Contributions

Conceptualization, X.D., L.J. and M.T.; Methodology, X.D.; Investigation, X.D.; Simulation studies, X.D., S.H. and L.J.; Visualization, X.D.; Formal analysis, X.D.; Funding acquisition, L.J.; Writing—original draft preparation, X.D.; Writing—review and editing, X.D., S.H., L.J. and M.T.; Project administration, X.D.; Supervision, L.J. and M.T. All authors have read and agreed to the published version of the manuscript.

Funding

The work was funded by the National Natural Science Foundation of China (No. 11801370).

Data Availability Statement

The economic development data can be obtained from the EPSDATA Databases. https://www.epsnet.com.cn/index.html#/Index (accessed on 12 March 2023).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Anselin, L.; Sheri, H. Spatial econometrics in practice, a review of software options. Reg. Sci. Urban Econ. 1992, 22, 509–536. [Google Scholar] [CrossRef]
  2. Anselin, L. Spatial econometrics. In A Companion to Theoretical Econometrics; Baltagi, B.H., Ed.; Blackwell Publishing Ltd.: Malden, MA, USA, 2003; pp. 310–330. [Google Scholar]
  3. Baltagi, B.; Song, S.H.; Koh, W. Testing panel data regression models with spatial error correlation. J. Econom. 2003, 117, 123–150. [Google Scholar] [CrossRef]
  4. Baltagi, B.H.; Song, S.; Jung, B.; Koh, W. Testing for serial correlation, spatial autocorrelation and random effects using panel data. J. Econom. 2007, 140, 5–51. [Google Scholar] [CrossRef]
  5. Baltagi, B.H.; Kao, C.; Liu, L. Asymptotic properties of estimators for the linear panel regression model with random individual effects and serially correlated errors: The case of stationary and non-stationary regressors and residuals. J. Econom. 2008, 11, 554–572. [Google Scholar] [CrossRef]
  6. Baltagi, B.; Egger, P.; Pfaffermayr, M. A generalized spatial panel data model with random effects. Econom. Rev. 2013, 32, 650–685. [Google Scholar] [CrossRef]
  7. Dai, X.; Yan, Z.; Tian, M.; Tang, M. Quantile regression for general spatial panel data models with fixed effects. J. Appl. Stat. 2020, 47, 45–60. [Google Scholar] [CrossRef]
  8. Elhorst, J. Paul Dynamic models in space and time. Geogr. Anal. 2001, 33, 119–140. [Google Scholar] [CrossRef]
  9. Elhorst, J.P. Specification and estimation of spatial panel data models. Int. Reg. Sci. Rev. 2003, 26, 244–268. [Google Scholar] [CrossRef]
  10. Elhorst, J.P. Unconditional maximum likelihood estimation of linear and log-linear dynamic models for spatial panels. Geogr. Anal. 2005, 37, 85–106. [Google Scholar] [CrossRef]
  11. Kapoor, M.; Kelejian, H.H.; Prucha, I.R. Panel data models with spatially correlated error components. J. Econom. 2007, 140, 97–130. [Google Scholar] [CrossRef]
  12. Lee, L.F.; Yu, J.H. Estimation of spatial autoregressive panel data models with fixed effects. J. Econom. 2010, 154, 165–185. [Google Scholar] [CrossRef]
  13. Yu, J.; de Jong, R.; Lee, L.F. Quasi-maximum likelihood estimators for spatial dynamic panel data with fixed effects when both n and T are large. J. Econom. 2008, 146, 118–134. [Google Scholar] [CrossRef]
  14. Yu, J.; Lee, L.F. Estimation of unit root spatial dynamic panel data models. Econom. Theory 2010, 26, 1332–1362. [Google Scholar] [CrossRef]
  15. Michael, O.; Robert, P.B. Spatial analysis of municipal water demand: A panel data approach. Appl. Econ. Lett. 2018, 25, 1157–1160. [Google Scholar]
  16. Guliyev, H. Determining the spatial effects of covid-19 using the spatial panel data model. Spat. Stat. 2020, 38, 100443. [Google Scholar] [CrossRef] [PubMed]
  17. Huang, N.; Yang, Z. Spatial dynamic models with short panels: Evaluating the impact of purchase restrictions on housing prices. Econ. Model. 2021, 103, 105597. [Google Scholar] [CrossRef]
  18. Yang, C.F. Common factors and spatial dependence: An application to US house prices. Econom. Rev. 2021, 40, 14–50. [Google Scholar] [CrossRef]
  19. Lee, K.; Pesaran, M.H.; Smith, R. Growth and convergence in a multi-country empirical stochastic Solow model. J. Appl. Econom. 1997, 12, 357–392. [Google Scholar] [CrossRef]
  20. Brunsdon, C.; Fotheringham, S.; Charlton, M. Geographically Weighted Regression. J. R. Stat. Soc. Ser. D (Stat.) 1998, 47, 431–443. [Google Scholar] [CrossRef]
  21. Zhang, Y.; Shen, D. Estimation of semi-parametric varying-coefficient spatial panel models with random-effects. J. Stat. Plan. Inferences 2015, 159, 64–80. [Google Scholar] [CrossRef]
  22. Sun, Y.; Malikov, E. Estimation and inference in functional-coefficient spatial autoregressive panel data models with fixed effects. J. Econom. 2018, 203, 359–378. [Google Scholar] [CrossRef]
  23. Mínguez, R.; Basile, R.; Durbán, M. An alternative semiparametric model for spatial panel data. Stat. Methods Appl. 2020, 29, 669–708. [Google Scholar] [CrossRef]
  24. Dai, X.; Jin, L.; Shi, A.; Shi, L. Outlier Detection and Accommodation in General Spatial Models. Stat. Methods Appl. 2016, 25, 453–475. [Google Scholar] [CrossRef]
  25. Dai, X.; Li, S.; Jin, L.; Tian, M. Quantile regression for partially linear varying coefficient spatial autoregressive models. Commun. Stat. Simul. Comput. 2022. [Google Scholar] [CrossRef]
  26. Dai, X.; Jin, L. Minimum distance quantile regression for spatial autoregressive panel data models with fixed effects. PLoS ONE 2021, 16, e0261144. [Google Scholar] [CrossRef] [PubMed]
  27. Xu, X.; Lee, L.F. A spatial autoregressive model with a nonlinear transformation of the dependent variable. J. Econom. 2015, 186, 209–232. [Google Scholar] [CrossRef]
  28. Feng, X.; He, X.; Hu, J. Wild bootstrap for quantile regression. Biometrika 2011, 98, 995–999. [Google Scholar] [CrossRef]
  29. Wang, L.; Keilegom, I.; Maidman, A. Wild residual bootstrap inference for penalized quantile regression with heteroscedastic errors. Biometrika 2018, 105, 859–872. [Google Scholar] [CrossRef]
  30. Kim, M.O. Quantile regression with varying coefficients. Ann. Stat. 2007, 35, 92–108. [Google Scholar] [CrossRef]
  31. Lu, Y.Q. Estimation for the Power-transformed Varying-coefficient Quantile Regression Model. Commun. Stat. Theory Methods 2013, 42, 2617–2633. [Google Scholar] [CrossRef]
  32. Wang, H.X.; Zhu, Z.Y.; Zhou, J.H. Quantile regression in partially linear varying coefficient models. Ann. Statist. 2009, 37, 3841–3866. [Google Scholar] [CrossRef]
  33. Galvao, A.F.; Montes-Rojas, G. On Bootstrap Inference for Quantile Regression Panel Data: A Monte Carlo Study. Econometrics 2015, 3, 654–666. [Google Scholar] [CrossRef]
  34. Sun, Y.; Yan, H.J.; Zhang, W.Y.; Lu, Z.D. A Semiparametric Spatial dynamic model. Ann. Stat. 2014, 42, 700–727. [Google Scholar] [CrossRef]
  35. Elhorst, J.P. Spatial Panal Data Models. In Handbook of Applied Spatial Analysis; Fischer, M.M., Getis, A., Eds.; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 2010; pp. 377–407. [Google Scholar]
  36. LeSage, J.P. The Theory and Practice of Spatial Econometrics; University of Toledo: Toledo, OH, USA, 1999. [Google Scholar]
Figure 1. Confidence intervals of β 1 ( t ) and β 2 ( t ) in Example 1 at τ = 0.5 and ( N , T ) = ( 100 , 20 ) with Gaussian errors. (a,b) The varying coefficients are approximated via the B-spline basis. (c,d) The varying coefficients are approximated via the local polynomial fitting scheme. The areas represent 95% point-wise confidence intervals.
Figure 1. Confidence intervals of β 1 ( t ) and β 2 ( t ) in Example 1 at τ = 0.5 and ( N , T ) = ( 100 , 20 ) with Gaussian errors. (a,b) The varying coefficients are approximated via the B-spline basis. (c,d) The varying coefficients are approximated via the local polynomial fitting scheme. The areas represent 95% point-wise confidence intervals.
Mathematics 11 02005 g001
Figure 2. The estimated β 1 ( s ) and β 2 ( s ) (multicolored surface) superimposed with the true β 1 ( s ) and β 2 ( s ) (grey surface) in Example 2 at τ = 0.5 and ( N , T ) = ( 100 , 20 ) with Gaussian errors. (a,b) The varying coefficients are approximated via the B-spline surface fitting method. (c,d) The varying coefficients are approximated via local linear approximation.
Figure 2. The estimated β 1 ( s ) and β 2 ( s ) (multicolored surface) superimposed with the true β 1 ( s ) and β 2 ( s ) (grey surface) in Example 2 at τ = 0.5 and ( N , T ) = ( 100 , 20 ) with Gaussian errors. (a,b) The varying coefficients are approximated via the B-spline surface fitting method. (c,d) The varying coefficients are approximated via local linear approximation.
Mathematics 11 02005 g002
Figure 3. The estimated surfaces of (a) β 1 ( τ , t ) , (b) β 2 ( τ , t ) , and (c) β 3 ( τ , t ) .
Figure 3. The estimated surfaces of (a) β 1 ( τ , t ) , (b) β 2 ( τ , t ) , and (c) β 3 ( τ , t ) .
Mathematics 11 02005 g003
Figure 4. The estimated surfaces of (a) β 1 ( s ) , (b) β 2 ( s ) , and (c) β 3 ( s ) at τ = 0.5 .
Figure 4. The estimated surfaces of (a) β 1 ( s ) , (b) β 2 ( s ) , and (c) β 3 ( s ) at τ = 0.5 .
Mathematics 11 02005 g004
Table 1. Monte Carlo results of Example 1 when ε i t N ( 0 , 1 ) . The table shows the mean of the bias and RMSE (in parentheses) for ρ and the mean of the MADE [in brackets] for β 1 ( t ) , β 2 ( t ) .
Table 1. Monte Carlo results of Example 1 when ε i t N ( 0 , 1 ) . The table shows the mean of the bias and RMSE (in parentheses) for ρ and the mean of the MADE [in brackets] for β 1 ( t ) , β 2 ( t ) .
NTBasisPara.FEQR-BCFEQRMLE
τ = 0.25 τ = 0.5 τ = 0.75 τ = 0.25 τ = 0.5 τ = 0.75
5020B-spline ρ 0.0040.0030.0040.0260.0230.0250.006
(0.040)(0.041)(0.040)(0.052)(0.048)(0.052)(0.040)
β 1 ( t ) [0.139][0.144][0.140][0.159][0.145][0.158][0.151]
β 2 ( t ) [0.138][0.142][0.138][0.157][0.143][0.158][0.151]
m15151515151515
Polynomial ρ 0.0040.0030.0050.0240.0230.023−0.003
(0.038)(0.040)(0.038)(0.049)(0.046)(0.048)(0.042)
β 1 ( t ) [0.134][0.141][0.134][0.154][0.140][0.153][0.154]
β 2 ( t ) [0.132][0.138][0.132][0.151][0.138][0.150][0.131]
m17171717171717
40B-spline ρ 0.0020.0020.0020.0080.0070.0080.004
(0.013)(0.014)(0.013)(0.017)(0.016)(0.017)(0.017)
β 1 ( t ) [0.114][0.116][0.113][0.126][0.117][0.128][0.131]
β 2 ( t ) [0.119][0.122][0.118][0.130][0.121][0.130][0.130]
m22222222222222
Polynomial ρ 0.0060.0070.0060.0140.0140.0130.089
(0.019)(0.020)(0.020)(0.024)(0.023)(0.023)(0.119)
β 1 ( t ) [0.134][0.126][0.133][0.141][0.128][0.141][0.172]
β 2 ( t ) [0.155][0.153][0.155][0.161][0.153][0.162][0.524]
m31313131313131
10020B-spline ρ −0.0000.0050.0010.0150.0150.0150.006
(0.032)(0.034)(0.033)(0.039)(0.050)(0.039)(0.034)
β 1 ( t ) [0.101][0.105][0.100][0.114][0.104][0.115][0.109]
β 2 ( t ) [0.097][0.101][0.097][0.110][0.101][0.111][0.111]
m16161616161616
Polynomial ρ 0.0010.0040.0040.0160.0150.014−0.002
(0.032)(0.032)(0.032)(0.039)(0.036)(0.038)(0.034)
β 1 ( t ) [0.099][0.101][0.097][0.111][0.102][0.112][0.111]
β 2 ( t ) [0.095][0.099][0.096][0.108][0.099][0.107][0.095]
m18181818181818
40B-spline ρ 0.0030.0050.0030.0090.0080.0090.004
(0.013)(0.015)(0.014)(0.017)(0.016)(0.017)(0.017)
β 1 ( t ) [0.079][0.082][0.079][0.089][0.082][0.090][0.091]
β 2 ( t ) [0.093][0.095][0.092][0.100][0.095][0.100][0.098]
m22222222222222
Polynomial ρ 0.0250.0250.0250.0140.0140.0130.131
(0.053)(0.050)(0.052)(0.025)(0.023)(0.024)(0.158)
β 1 ( t ) [0.090][0.090][0.092][0.137][0.126][0.138][0.135]
β 2 ( t ) [0.167][0.156][0.162][0.161][0.153][0.161][0.553]
m32323232323232
Table 2. Monte Carlo results of Example 1 when ε i t t ( 1 ) . The table shows the mean of the bias and RMSE (in parentheses) for ρ and the mean of the MADE [in brackets] for β 1 ( t ) , β 2 ( t ) .
Table 2. Monte Carlo results of Example 1 when ε i t t ( 1 ) . The table shows the mean of the bias and RMSE (in parentheses) for ρ and the mean of the MADE [in brackets] for β 1 ( t ) , β 2 ( t ) .
NTBasisPara.FEQR-BCFEQRMLE
τ = 0.25 τ = 0.5 τ = 0.75 τ = 0.25 τ = 0.5 τ = 0.75
5020B-spline ρ 0.0130.0040.0120.0190.0130.0180.072
(0.032)(0.020)(0.032)(0.038)(0.028)(0.035)(0.113)
β 1 ( t ) [0.220][0.186][0.221][0.283][0.184][0.285][7.360]
β 2 ( t ) [0.309][0.278][0.305][0.356][0.280][0.354][5.704]
m10101010101010
Polynomial ρ 0.0130.0010.0100.0190.0120.0180.023
(0.032)(0.018)(0.026)(0.035)(0.025)(0.034)(0.124)
β 1 ( t ) [0.238][0.194][0.239][0.295][0.192][0.298][5.608]
β 2 ( t ) [0.271][0.227][0.268][0.318][0.219][0.317][5.399]
m15151515151518
40B-spline ρ 0.0070.0030.0060.0100.0060.0100.060
(0.016)(0.010)(0.014)(0.018)(0.012)(0.018)(0.109)
β 1 ( t ) [0.216][0.180][0.214][0.283][0.179][0.280][6.137]
β 2 ( t ) [0.284][0.260][0.286][0.335][0.258][0.337][6.184]
m25252525252525
Polynomial ρ 0.0140.0090.0130.0190.0130.0170.109
(0.027)(0.020)(0.025)(0.031)(0.024)(0.029)(0.130)
β 1 ( t ) [0.193][0.171][0.191][0.247][0.169][0.247][9.304]
β 2 ( t ) [0.548][0.520][0.548][0.591][0.518][0.589][7.823]
m20202020202020
10020B-spline ρ 0.0060.0010.0060.0110.0060.0100.025
(0.020)(0.011)(0.019)(0.023)(0.015)(0.022)(0.124)
β 1 ( t ) [0.168][0.139][0.166][0.219][0.140][0.219][11.363]
β 2 ( t ) [0.176][0.149][0.177][0.223][0.148][0.227][5.429]
m14141414141414
Polynomial ρ 0.0060.0010.0060.0100.0060.0110.035
(0.019)(0.012)(0.018)(0.021)(0.015)(0.024)(0.119)
β 1 ( t ) [0.163][0.135][0.161][0.209][0.134][0.213][7.553]
β 2 ( t ) [0.183][0.156][0.180][0.215][0.151][0.220][8.412]
m16161616161616
40B-spline ρ 0.002−0.0000.0020.0060.0020.0050.018
(0.008)(0.004)(0.008)(0.011)(0.006)(0.011)(0.123)
β 1 ( t ) [0.159][0.132][0.159][0.213][0.131][0.215][8.169]
β 2 ( t ) [0.151][0.124][0.152][0.201][0.122][0.200][5.696]
m32323232323232
Polynomial ρ 0.0110.0090.0120.0140.0120.0130.100
(0.025)(0.020)(0.024)(0.027)(0.022)(0.024)(0.127)
β 1 ( t ) [0.134][0.120][0.134][0.177][0.121][0.176][10.647]
β 2 ( t ) [0.520][0.500][0.521][0.553][0.499][0.551][5.283]
m21212121212121
Table 3. Monte Carlo results of Example 2 when ε i t N ( 0 , 1 ) . The table shows the mean of the bias and RMSE (in parentheses) for ρ and the mean of the MADE [in brackets] for β 1 ( t ) , β 2 ( t ) . The knots for B-spline surface fitting are ( m , n ) = ( 2 , 2 ) .
Table 3. Monte Carlo results of Example 2 when ε i t N ( 0 , 1 ) . The table shows the mean of the bias and RMSE (in parentheses) for ρ and the mean of the MADE [in brackets] for β 1 ( t ) , β 2 ( t ) . The knots for B-spline surface fitting are ( m , n ) = ( 2 , 2 ) .
NTBasisPara.FEQR-BCFEQRMLE
τ = 0.25 τ = 0.5 τ = 0.75 τ = 0.25 τ = 0.5 τ = 0.75
5020B-spline ρ 0.0360.0380.0320.1030.0970.0940.001
(0.068)(0.069)(0.064)(0.121)(0.113)(0.112)(0.025)
β 1 ( t ) [0.128][0.130][0.128][0.142][0.130][0.145][0.132]
β 2 ( t ) [0.214][0.222][0.215][0.243][0.220][0.244][0.329]
Polynomial ρ 0.0470.0440.0410.1150.1150.1100.002
(0.079)(0.077)(0.075)(0.134)(0.132)(0.127)(0.027)
β 1 ( t ) [0.337][0.337][0.338][0.339][0.338][0.339][0.451]
β 2 ( t ) [0.239][0.240][0.238][0.245][0.241][0.246][0.285]
40B-spline ρ 0.0320.0330.0350.0970.0970.0960.001
(0.050)(0.050)(0.052)(0.104)(0.104)(0.105)(0.021)
β 1 ( t ) [0.090][0.092][0.089][0.101][0.093][0.100][0.100]
β 2 ( t ) [0.148][0.152][0.147][0.167][0.154][0.165][0.326]
Polynomial ρ 0.0430.0370.0440.1080.1120.1100.002
(0.060)(0.058)(0.060)(0.116)(0.118)(0.123)(0.024)
β 1 ( t ) [0.334][0.334][0.335][0.335][0.335][0.335][0.362]
β 2 ( t ) [0.228][0.228][0.227][0.232][0.230][0.232][0.287]
10020B-spline ρ 0.0390.0390.0380.1010.1000.0960.002
(0.076)(0.072)(0.068)(0.118)(0.115)(0.112)(0.019)
β 1 ( t ) [0.093][0.096][0.094][0.104][0.096][0.104][0.098]
β 2 ( t ) [0.148][0.153][0.149][0.167][0.151][0.167][0.226]
Polynomial ρ 0.0530.0490.0570.1190.1240.1200.000
(0.086)(0.087)(0.089)(0.138)(0.138)(0.140)(0.019)
β 1 ( t ) [0.399][0.399][0.399][0.399][0.399][0.399][0.354]
β 2 ( t ) [0.275][0.276][0.276][0.278][0.276][0.278][0.251]
40B-spline ρ 0.0360.0310.0370.0950.0970.0980.000
(0.053)(0.051)(0.052)(0.104)(0.104)(0.107)(0.017)
β 1 ( t ) [0.070][0.072][0.070][0.076][0.071][0.076][0.074]
β 2 ( t ) [0.102][0.106][0.105][0.116][0.106][0.116][0.224]
Polynomial ρ 0.0480.0470.0490.1180.1200.1210.002
(0.066)(0.065)(0.065)(0.127)(0.127)(0.129)(0.018)
β 1 ( t ) [0.398][0.397][0.398][0.398][0.398][0.398][0.382]
β 2 ( t ) [0.270][0.270][0.270][0.271][0.270][0.270][0.275]
Table 4. Monte Carlo results of Example 2 when ε i t t ( 1 ) . The table shows the mean of the bias and RMSE (in parentheses) for ρ and the mean of the MADE [in brackets] for β 1 ( t ) , β 2 ( t ) . The knots for B-spline surface fitting are ( m , n ) = ( 2 , 2 ) .
Table 4. Monte Carlo results of Example 2 when ε i t t ( 1 ) . The table shows the mean of the bias and RMSE (in parentheses) for ρ and the mean of the MADE [in brackets] for β 1 ( t ) , β 2 ( t ) . The knots for B-spline surface fitting are ( m , n ) = ( 2 , 2 ) .
NTBasisPara.FEQR-BCFEQRMLE
τ = 0 . 25 τ = 0 . 5 τ = 0 . 75 τ = 0 . 25 τ = 0 . 5 τ = 0 . 75
5020B-spline ρ 0.0070.0000.0060.0110.0070.0140.150
(0.018)(0.008)(0.017)(0.021)(0.013)(0.026)(0.156)
β 1 ( t ) [0.242][0.192][0.242][0.306][0.194][0.308][15.095]
β 2 ( t ) [0.410][0.320][0.403][0.522][0.321][0.535][19.806]
Polynomial ρ 0.0080.0000.0060.0120.0070.0140.143
(0.018)(0.008)(0.017)(0.022)(0.013)(0.025)(0.150)
β 1 ( t ) [0.347][0.344][0.349][0.357][0.344][0.357][6.057]
β 2 ( t ) [0.274][0.258][0.271][0.305][0.262][0.311][6.016]
40B-spline ρ 0.0050.0000.0040.0080.0040.0080.160
(0.010)(0.004)(0.010)(0.014)(0.007)(0.013)(0.163)
β 1 ( t ) [0.150][0.125][0.151][0.200][0.124][0.199][6.368]
β 2 ( t ) [0.254][0.210][0.250][0.336][0.210][0.342][6.419]
Polynomial ρ 0.0040.0000.0040.0080.0040.0080.161
(0.010)(0.004)(0.009)(0.014)(0.008)(0.013)(0.165)
β 1 ( t ) [0.339][0.336][0.340][0.344][0.337][0.345][17.140]
β 2 ( t ) [0.243][0.237][0.243][0.262][0.238][0.262][35.104]
10020B-spline ρ 00030.0000.0030.0050.0040.0070.150
(0.011)(0.006)(0.011)(0.011)(0.008)(0.014)(0.155)
β 1 ( t ) [0.158][0.132][0.158][0.207][0.131][0.204][20.927]
β 2 ( t ) [0.263][0.217][0.262][0.346][0.217][0.355][14.256]
Polynomial ρ 0.0030.0000.0030.0060.0040.0080.148
(0.011)(0.006)(0.011)(0.012)(0.008)(0.016)(0.154)
β 1 ( t ) [0.404][0.401][0.403][0.408][0.401][0.407][5.220]
β 2 ( t ) [0.295][0.290][0.293][0.312][0.287][0.313][4.749]
40B-spline ρ 0.0020.0000.0010.0040.0020.0040.155
(0.005)(0.003)(0.005)(0.008)(0.004)(0.007)(0.160)
β 1 ( t ) [0.106][0.090][0.107][0.142][0.089][0.140][7.054]
β 2 ( t ) [0.171][0.144][0.170][0.234][0.140][0.234][7.691]
Polynomial ρ 0.0020.0000.0010.0040.0020.0040.156
(0.005)(0.003)(0.005)(0.008)(0.004)(0.007)(0.160)
β 1 ( t ) [0.400][0.398][0.400][0.402][0.398][0.402][12.685]
β 2 ( t ) [0.278][0.275][0.279][0.289][0.276][0.289][10.924]
Table 5. Variable definitions and descriptions.
Table 5. Variable definitions and descriptions.
VariableNotationDefinition
GDPyLogarithm of GDP
IS X 1 The ratio of tertiary industries to secondary industries (%)
Urban X 2 The proportion of urban population (%)
Open X 3 The ratio of total imports and exports to GDP
Edu X 4 Logarithm of average number of students in colleges and universities per 100,000 population
Infra X 5 Mileage of roads, railways, and inland waterways per square kilometer
Table 6. SIC values and corresponding models. Only the first five smallest SIC values in descending order are listed, where Ω represents all the covariates that have time-varying effects, represents all the constant coefficients, and { i 1 , , i k } stands for X i 1 , ⋯, X i k and indicates which models have time-varying effects.
Table 6. SIC values and corresponding models. Only the first five smallest SIC values in descending order are listed, where Ω represents all the covariates that have time-varying effects, represents all the constant coefficients, and { i 1 , , i k } stands for X i 1 , ⋯, X i k and indicates which models have time-varying effects.
τ = 0.1 τ = 0.2 τ = 0.3 τ = 0.4 τ = 0.5
ModelSICModelSICModelSICModelSICModelSIC
{ 1 , 2 , 3 , 5 } 1.827 { 1 , 2 , 3 , 5 } 2.306 { 1 , 2 , 3 , 5 } 2.491 { 1 , 2 , 3 , 5 } 2.578 { 1 , 2 , 3 } 2.584
{ 1 , 2 , 4 , 5 } 1.829 { 1 , 2 , 3 , 4 } 2.316 Ω 2.510 { 1 , 2 , 3 } 2.585 { 1 , 2 , 3 , 5 } 2.589
{ 1 , 2 , 5 } 1.835 { 1 , 2 , 3 } 2.322 { 1 , 2 , 3 } 2.519 Ω 2.597 { 1 , 2 , 5 } 2.598
{ 1 , 4 , 5 } 1.853 Ω 2.327 { 1 , 4 , 5 } 2.525 { 1 , 2 , 5 } 2.599 { 1 , 2 , 3 , 4 } 2.605
{ 1 , 2 , 3 } 1.855 { 1 , 2 , 4 , 5 } 2.329 { 1 , 2 , 3 , 4 } 2.533 { 1 , 2 , 3 , 4 } 2.606 Ω 2.605
τ = 0 . 6 τ = 0 . 7 τ = 0 . 8 τ = 0 . 9
ModelSICModelSICModelSICModelSIC
{ 1 , 2 , 3 } 2.540 { 1 , 2 , 3 } 2.422 { 1 , 2 , 5 } 2.228 { 1 , 2 , 3 } 1.737
{ 1 , 2 , 5 } 2.542 { 1 , 2 } 2.425 { 1 , 2 , 3 } 2.245 { 1 , 2 } 1.754
{ 1 , 2 , 3 , 5 } 2.546 { 1 , 2 , 5 } 2.428 { 1 , 4 } 2.245 { 1 , 2 , 4 } 1.774
{ 1 , 2 } 2.551 { 1 , 2 , 3 , 5 } 2.433 { 1 , 2 , 3 , 4 } 2.247 { 1 , 2 , 5 } 1.777
{ 1 , 2 , 3 , 4 } 2.552 { 1 , 2 , 4 } 2.433 { 1 , 2 , 3 , 5 } 2.250 { 1 , 4 } 1.777
Table 7. Estimates of ρ , β 4 and β 5 at quantiles τ = 0.25 , 0.5 , 0.75 .
Table 7. Estimates of ρ , β 4 and β 5 at quantiles τ = 0.25 , 0.5 , 0.75 .
ModelPara.FEQR-BCMLE
τ = 0 . 25 τ = 0 . 5 τ = 0 . 75
Model (6) ρ 0.5380.5100.5250.461
β 4 0.0110.0100.0310.087
β 5 0.0440.0460.0080.043
Model (7) ρ 0.7760.7690.7490.696
β 4 0.0770.0930.0710.156
β 5 0.0630.0860.0610.060
Table 8. SIC values and corresponding models. Only the first five smallest SIC values in descending order are listed, where Ω represents all the covariates that have location-varying effects, ∅ represents the coefficient of all the covariates that are constant, and { i 1 , , i k } stands for X i 1 , ⋯, X i k and indicates coefficients that have varying location effect.
Table 8. SIC values and corresponding models. Only the first five smallest SIC values in descending order are listed, where Ω represents all the covariates that have location-varying effects, ∅ represents the coefficient of all the covariates that are constant, and { i 1 , , i k } stands for X i 1 , ⋯, X i k and indicates coefficients that have varying location effect.
τ = 0.1 τ = 0.2 τ = 0.3 τ = 0.4 τ = 0.5
ModelSICModelSICModelSICModelSICModelSIC
{ 1 , 2 , 3 } 1.708 { 1 , 2 , 3 } 2.156 { 1 , 2 , 3 } 2.344 { 1 , 2 , 3 } 2.429 Ω 2.454
{ 1 , 2 , 3 , 4 } 1.709 { 1 , 2 , 3 , 4 } 2.165 { 1 , 2 , 3 , 4 } 2.349 Ω 2.438 { 1 , 2 , 3 } 2.463
{ 1 , 2 , 5 } 1.710 { 1 , 2 , 4 , 5 } 2.181 Ω 2.352 { 1 , 2 , 3 , 4 } 2.439 { 1 , 2 , 3 , 4 } 2.464
{ 1 , 2 , 4 , 5 } 1.711 Ω 2.185 { 1 , 2 , 4 , 5 } 2.368 { 1 , 2 , 3 , 5 } 2.454 { 1 , 2 , 4 , 5 } 2.473
Ω 1.722 { 1 , 2 , 5 } 2.193 { 1 , 2 , 3 , 5 } 2.374 { 1 , 2 , 4 , 5 } 2.457 { 1 , 2 , 3 , 5 } 2.479
τ = 0 . 6 τ = 0 . 7 τ = 0 . 8 τ = 0 . 9
ModelSICModelSICModelSICModelSIC
{ 1 , 2 , 3 } 2.446 { 1 , 2 , 3 } 2.381 { 1 , 2 , 3 } 2.208 { 1 , 2 , 3 } 1.733
Ω 2.454 Ω 2.394 Ω 2.229 { 1 , 2 } 1.775
{ 1 , 2 , 3 , 4 } 2.456 { 1 , 2 , 3 , 4 } 2.411 { 1 , 2 , 3 , 5 } 2.243 { 1 , 2 , 3 , 5 } 1.776
{ 1 , 2 , 3 , 5 } 2.474 { 1 , 2 , 3 , 5 } 2.413 { 1 , 2 , 3 , 4 } 2.255 { 1 , 2 , 4 } 1.779
{ 1 , 2 , 4 , 5 } 2.480 { 1 , 2 } 2.425 { 1 , 2 , 4 , 5 } 2.265 Ω 1.785
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dai, X.; Huang, S.; Jin, L.; Tian, M. Wild Bootstrap-Based Bias Correction for Spatial Quantile Panel Data Models with Varying Coefficients. Mathematics 2023, 11, 2005. https://doi.org/10.3390/math11092005

AMA Style

Dai X, Huang S, Jin L, Tian M. Wild Bootstrap-Based Bias Correction for Spatial Quantile Panel Data Models with Varying Coefficients. Mathematics. 2023; 11(9):2005. https://doi.org/10.3390/math11092005

Chicago/Turabian Style

Dai, Xiaowen, Shidan Huang, Libin Jin, and Maozai Tian. 2023. "Wild Bootstrap-Based Bias Correction for Spatial Quantile Panel Data Models with Varying Coefficients" Mathematics 11, no. 9: 2005. https://doi.org/10.3390/math11092005

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop