1. Introduction
Recently, the varying coefficient model has attracted much attention among econometricians and statisticians. One attractive feature of this model is its ability to capture the nonlinearity of the data without suffering from the "curse of dimensionality". In general, it is of the form
$$Y_i = \mathbf{X}_i^{\top}\mathbf{a}(U_i) + \varepsilon_i, \quad i = 1, \ldots, n, \qquad (1.1)$$
where $Y_i$ are responses; $U_i$ and $\mathbf{X}_i = (X_{i1}, \ldots, X_{ip})^{\top}$ are associated covariates; $\mathbf{a}(\cdot) = (a_1(\cdot), \ldots, a_p(\cdot))^{\top}$ is a $p$-dimensional vector of unknown functions; $\varepsilon_i$ are independent and identically distributed random errors with $E(\varepsilon_i) = 0$ and $\mathrm{Var}(\varepsilon_i) = \sigma^2$.
Due to its flexibility, the varying coefficient model has been studied in many different contexts and has been successfully applied to nonlinear time series analysis, longitudinal and functional data analysis, panel data analysis, spatial data analysis, and time-varying models in finance. See, for example, the work of Cai et al. [1], Cai [2], Cai and Li [3], Cai et al. [4], Fan and Zhang [5], Fan et al. [6], Fotheringham et al. [7], Hoover et al. [8], Li et al. [9] and Xiao [10], among others.
In the above works, the varying coefficient model is generally estimated by the local linear approach, and the errors are usually assumed to be i.i.d. at the outset. In applications, however, heteroscedasticity is often found in the residuals from both cross-sectional and time series modelling. In the context of the linear regression model, it is well known that if the errors are heteroscedastic, then the generalized least-squares (GLS) estimator is more efficient than the ordinary least-squares (OLS) estimator. To the best of our knowledge, there has been no work on designing an efficient estimation method for varying coefficient models with heteroscedastic errors. In this paper, we propose an efficient estimator of the varying coefficients based on the local linear approach.
The paper is structured as follows. We introduce an efficient estimator in Section 2, and its asymptotic properties are given in Section 3. We report the results of some Monte Carlo simulations in Section 4.
2. Efficient Estimation
Without considering heteroscedasticity, we apply a local linear regression technique to estimate the varying coefficient functions. For each given $u$, the local linear estimator $\hat{\mathbf{a}}(u)$ of $\mathbf{a}(u)$ is the part corresponding to $\boldsymbol{\beta}_0$ of the minimizer of
$$\sum_{i=1}^{n}\left\{Y_i - \mathbf{X}_i^{\top}\boldsymbol{\beta}_0 - (U_i - u)\mathbf{X}_i^{\top}\boldsymbol{\beta}_1\right\}^2 K_h(U_i - u), \qquad (2.1)$$
where $K$ is a kernel function, $h$ is a bandwidth and $K_h(\cdot) = K(\cdot/h)/h$. Then we have
$$\hat{\mathbf{a}}(u) = (\mathbf{I}_p, \mathbf{0}_p)\left(\mathbf{D}^{\top}\mathbf{W}\mathbf{D}\right)^{-1}\mathbf{D}^{\top}\mathbf{W}\mathbf{Y}, \qquad (2.2)$$
where $\mathbf{Y} = (Y_1, \ldots, Y_n)^{\top}$, $\mathbf{W} = \mathrm{diag}\{K_h(U_1 - u), \ldots, K_h(U_n - u)\}$ and $\mathbf{D}$ is the $n \times 2p$ matrix whose $i$-th row is $(\mathbf{X}_i^{\top}, (U_i - u)\mathbf{X}_i^{\top})$.
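As a concrete illustration of the estimator in (2.2), the following is a minimal sketch (not the authors' code) of the local linear fit; the choice of the Gaussian kernel is an assumption made for the example.

```python
import numpy as np

def local_linear_vc(u, Y, X, U, h):
    """Local linear estimate of a(u) in Y_i = X_i' a(U_i) + eps_i.

    Y: (n,) responses, X: (n, p) covariates, U: (n,) index variable,
    h: bandwidth. Returns the (p,) estimate of a(u)."""
    n, p = X.shape
    # Gaussian kernel weights K_h(U_i - u)
    w = np.exp(-0.5 * ((U - u) / h) ** 2) / (h * np.sqrt(2.0 * np.pi))
    # Design matrix D: i-th row is (X_i', (U_i - u) X_i')
    D = np.hstack([X, (U - u)[:, None] * X])
    # Weighted least squares: (D'WD)^{-1} D'WY
    beta = np.linalg.solve(D.T @ (w[:, None] * D), D.T @ (w * Y))
    return beta[:p]  # the part corresponding to beta_0, i.e. a(u)
```

With an intercept-only design ($p = 1$, $X_i \equiv 1$), this reduces to the usual local linear smoother of $Y$ on $U$.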
This estimator ignores the information contained in the variance structure of the errors, so it is inefficient under heteroscedasticity. To overcome this, we propose a class of efficient estimators in the following.
Denote $\sigma_i^2 = \mathrm{Var}(\varepsilon_i \mid \mathbf{X}_i, U_i)$, where for the moment we assume that $\sigma_i$ is known. Multiplying both sides of model (1.1) by $\sigma_i^{-1}$, we have the following homoscedastic varying coefficient model
$$Y_i^{*} = \mathbf{X}_i^{*\top}\mathbf{a}(U_i) + \varepsilon_i^{*}, \qquad (2.3)$$
where $Y_i^{*} = \sigma_i^{-1}Y_i$ and $\mathbf{X}_i^{*} = \sigma_i^{-1}\mathbf{X}_i$, with $\varepsilon_i^{*} = \sigma_i^{-1}\varepsilon_i$ satisfying $\mathrm{Var}(\varepsilon_i^{*} \mid \mathbf{X}_i, U_i) = 1$.
Applying the local linear approach to model (2.3), the efficient estimator of $\mathbf{a}(u)$ is given as follows:
$$\hat{\mathbf{a}}_E(u) = (\mathbf{I}_p, \mathbf{0}_p)\left(\mathbf{D}^{*\top}\mathbf{W}\mathbf{D}^{*}\right)^{-1}\mathbf{D}^{*\top}\mathbf{W}\mathbf{Y}^{*}, \qquad (2.4)$$
where $\mathbf{Y}^{*}$ and $\mathbf{D}^{*}$ are defined as $\mathbf{Y}$ and $\mathbf{D}$ with $(Y_i, \mathbf{X}_i)$ replaced by $(Y_i^{*}, \mathbf{X}_i^{*})$.
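The corresponding efficient fit can be sketched as follows. Rescaling $(Y_i, \mathbf{X}_i)$ by $\sigma_i^{-1}$ as in (2.3) is algebraically equivalent to multiplying each kernel weight by $\sigma_i^{-2}$, which is how the code implements it; this is a minimal sketch, not the authors' implementation, and the Gaussian kernel is again an assumption.

```python
import numpy as np

def efficient_vc(u, Y, X, U, h, sigma2):
    """Efficient local linear estimate of a(u) under heteroscedasticity.

    Rescaling the model by 1/sigma_i (the GLS idea behind (2.3)) is
    equivalent to using kernel weights K_h(U_i - u) / sigma_i^2.
    sigma2: (n,) error variances, assumed known or pre-estimated."""
    n, p = X.shape
    k = np.exp(-0.5 * ((U - u) / h) ** 2) / (h * np.sqrt(2.0 * np.pi))
    w = k / sigma2  # kernel weight times 1/sigma_i^2
    D = np.hstack([X, (U - u)[:, None] * X])
    beta = np.linalg.solve(D.T @ (w[:, None] * D), D.T @ (w * Y))
    return beta[:p]
```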
3. Asymptotic Properties
First, we make the following assumptions. Let $\mu_j = \int u^j K(u)\,du$ and $\nu_j = \int u^j K^2(u)\,du$, $j = 0, 1, 2$.
Assumption 1. The errors $\varepsilon_i$ are independent with $E(\varepsilon_i \mid \mathbf{X}_i, U_i) = 0$ and $\mathrm{Var}(\varepsilon_i \mid \mathbf{X}_i, U_i) = \sigma_i^2$.
Assumption 2. The random variable $U$ has a bounded support $\Pi$. Its density function $f(\cdot)$ is Lipschitz continuous and bounded away from 0 on its support.
Assumption 3. The matrices $\Omega(u) = E(\mathbf{X}_i\mathbf{X}_i^{\top} \mid U_i = u)$ and $\Omega^{*}(u) = E(\sigma_i^{-2}\mathbf{X}_i\mathbf{X}_i^{\top} \mid U_i = u)$ are non-singular for each $u \in \Pi$.
Assumption 4. There is an $s > 2$ such that $E\|\mathbf{X}_i\|^{2s} < \infty$ and $E|\varepsilon_i|^{2s} < \infty$, and there is some $\delta < 2 - s^{-1}$ such that $n^{2\delta - 1}h \to \infty$ as $n \to \infty$.
Assumption 5. The coefficient functions $\{a_j(\cdot)\}_{j=1}^{p}$ have continuous second derivatives in a neighborhood of $u$.
Assumption 6. The function $K(\cdot)$ is a symmetric density function with compact support, and the bandwidth $h$ satisfies $h \to 0$ and $nh \to \infty$ as $n \to \infty$.
For the estimator $\hat{\mathbf{a}}(u)$, Cai et al. [1] proved the following result:
Theorem 1. Under Assumptions 1–6, the estimator $\hat{\mathbf{a}}(u)$ is asymptotically normal, namely,
$$\sqrt{nh}\left\{\hat{\mathbf{a}}(u) - \mathbf{a}(u) - \frac{h^2}{2}\mu_2\mathbf{a}''(u)\right\} \xrightarrow{d} N\left(\mathbf{0}, \Sigma_1(u)\right),$$
where $\tilde{\Omega}(u) = E(\sigma_i^2\mathbf{X}_i\mathbf{X}_i^{\top} \mid U_i = u)$, $\Sigma_1(u) = \frac{\nu_0}{f(u)}\Omega^{-1}(u)\tilde{\Omega}(u)\Omega^{-1}(u)$.
For the estimator $\hat{\mathbf{a}}_E(u)$, we obtain the following result from Theorem 1 directly.
Theorem 2. Under Assumptions 1–6, the estimator $\hat{\mathbf{a}}_E(u)$ is asymptotically normal, namely,
$$\sqrt{nh}\left\{\hat{\mathbf{a}}_E(u) - \mathbf{a}(u) - \frac{h^2}{2}\mu_2\mathbf{a}''(u)\right\} \xrightarrow{d} N\left(\mathbf{0}, \Sigma_2(u)\right),$$
where $\Sigma_2(u) = \frac{\nu_0}{f(u)}\Omega^{*-1}(u)$.
Denote $\Delta(u) = \Sigma_1(u) - \Sigma_2(u)$. By the proof of Theorem 1 in Cai et al. [1], we have
$$\Sigma_1(u) = \frac{\nu_0}{f(u)}\Omega^{-1}(u)\tilde{\Omega}(u)\Omega^{-1}(u) \quad \text{and} \quad \Sigma_2(u) = \frac{\nu_0}{f(u)}\Omega^{*-1}(u).$$
Since, by the generalized Cauchy–Schwarz inequality, $\tilde{\Omega}(u) \ge \Omega(u)\Omega^{*-1}(u)\Omega(u)$, then we have $\Delta(u) \ge 0$. This implies that $\hat{\mathbf{a}}_E(u)$ is asymptotically more efficient than $\hat{\mathbf{a}}(u)$ in terms of asymptotic covariance matrix.
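The matrix inequality underlying this comparison, a generalized Cauchy–Schwarz bound $E(\sigma^2\mathbf{X}\mathbf{X}^{\top}) \ge E(\mathbf{X}\mathbf{X}^{\top})\{E(\sigma^{-2}\mathbf{X}\mathbf{X}^{\top})\}^{-1}E(\mathbf{X}\mathbf{X}^{\top})$, can be checked numerically. The distribution and variance function below are arbitrary choices made for the check, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 100000, 3
X = rng.standard_normal((n, p))
sigma2 = 0.5 + X[:, 0] ** 2  # an arbitrary heteroscedastic variance

Omega = (X.T @ X) / n                            # sample E[X X']
Omega_tilde = (X.T @ (sigma2[:, None] * X)) / n  # sample E[sigma^2 X X']
Omega_star = (X.T @ (X / sigma2[:, None])) / n   # sample E[sigma^-2 X X']

# Difference of the two asymptotic covariance kernels (up to nu_0 / f(u)):
Oi = np.linalg.inv(Omega)
Delta = Oi @ Omega_tilde @ Oi - np.linalg.inv(Omega_star)
eigenvalues = np.linalg.eigvalsh(Delta)  # all should be non-negative
```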
Remark 1. Since $\hat{\mathbf{a}}_E(u)$ depends on the unknown variances $\sigma_i^2$, it is infeasible. To provide a feasible efficient estimator of $\mathbf{a}(u)$, we need to estimate $\sigma_i^2$ consistently. It is not difficult to show that the resultant feasible estimator has the same asymptotic property as $\hat{\mathbf{a}}_E(u)$.
Remark 2. To obtain a consistent estimator of the variance function $\sigma^2(\cdot)$, it is important to model $\sigma^2(\cdot)$ appropriately. Several kinds of variance function have been proposed. Discussion of parametric variance functions can be found in Carroll and Ruppert [11]. Muller and Stadtmuller [12], Chiou and Muller [13] and Ruppert et al. [14] studied nonparametric variance estimation. Muller and Zhao [15] proposed a general semiparametric variance function model in a fixed design regression setting. Keilegom and Wang [16] considered a general class of mean-variance regression models, in which both the mean function and the variance function are semiparametrically modeled. Zhu et al. [17] considered a single-index structure to study heteroscedasticity in a single-index regression model with high-dimensional predictors.
4. Simulation Studies
In this section we compare the behavior of the conventional estimator $\hat{\mathbf{a}}(u)$ with that of the new estimator $\hat{\mathbf{a}}_E(u)$, given in (2.2) and (2.4), respectively, when the sample size is finite. The data are generated from the varying coefficient model (4.1). Firstly, we consider four known variance functions, labeled A–D in Table 1.
Secondly, we consider the case in which the variance function is unknown (case E in Table 1). For simplicity, the variance function is assumed to have a parametric structure with parameters $\theta_0$ and $\theta_1$, from which we can build the linear regression model (4.2) for the squared errors $\varepsilon_i^2$. In practice, $\varepsilon_i$ is not available, but it may be estimated by the residuals $\hat{\varepsilon}_i = Y_i - \mathbf{X}_i^{\top}\hat{\mathbf{a}}(U_i)$, where $\hat{\mathbf{a}}(\cdot)$ are the local linear estimates of model (4.1) obtained without considering the heteroscedasticity structure. Applying the least squares approach to the linear model (4.2) with $\varepsilon_i$ replaced by $\hat{\varepsilon}_i$, we obtain the estimators of $\theta_0$ and $\theta_1$, denoted by $\hat{\theta}_0$ and $\hat{\theta}_1$ respectively. Accordingly, we get the estimator $\hat{\sigma}_i^2$ of $\sigma_i^2$ by plugging $\hat{\theta}_0$ and $\hat{\theta}_1$ into the parametric variance structure.
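The two-step feasible procedure described above can be sketched as follows; the linear-in-$Z_i$ working variance model, the function name and the clipping constant are illustrative assumptions, since the paper's exact parametric structure is not reproduced here.

```python
import numpy as np

def fit_variance_model(resid, Z):
    """Least-squares fit of a working variance model
    sigma_i^2 = theta0 + theta1 * Z_i (illustrative specification)
    to the squared residuals resid_i^2; fitted values are clipped
    away from zero so they can safely be used as weights."""
    A = np.column_stack([np.ones_like(Z), Z])
    theta, *_ = np.linalg.lstsq(A, resid ** 2, rcond=None)
    sigma2_hat = np.clip(A @ theta, 1e-3, None)
    return sigma2_hat, theta
```

The fitted $\hat{\sigma}_i^2$ are then plugged into (2.4) in place of the unknown $\sigma_i^2$.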
To study the effect of the error distribution on our method, we consider three different types of error distribution, labeled (1)–(3) in Table 1. The Gaussian kernel function is used in our simulation studies.
We compare the proposed efficient estimator $\hat{\mathbf{a}}_E(u)$ with the ordinary local linear estimator $\hat{\mathbf{a}}(u)$ by using the estimated mean average squared error (MASE),
$$\mathrm{MASE} = \frac{1}{M}\sum_{k=1}^{M}\frac{1}{np}\sum_{j=1}^{p}\sum_{i=1}^{n}\left\{\hat{a}_j^{(k)}(U_i) - a_j(U_i)\right\}^2,$$
where $\hat{a}_j^{(k)}(\cdot)$ is the estimate of the coefficient $a_j(\cdot)$ in the $k$-th of the $M$ replications. The simulation results are presented in Table 1; in nearly all of the scenarios we studied, the proposed efficient estimators outperform the ordinary local linear estimators.
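The MASE criterion can be computed as below; the array layout (replications × evaluation points × coefficients) is an assumption of this sketch.

```python
import numpy as np

def mase(estimates, truth):
    """Mean average squared error over M replications.

    estimates: (M, n, p) estimated coefficient curves on a grid,
    truth: (n, p) true coefficient functions on the same grid."""
    return float(np.mean((estimates - truth[None, :, :]) ** 2))
```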
Table 1.
Mean average squared error (MASE) index for the estimators of varying coefficients.
(In each cell, the first value is the MASE of the conventional estimator $\hat{\mathbf{a}}$ and the second that of the efficient estimator $\hat{\mathbf{a}}_E$, for error distributions (1)–(3).)

| Variance Function | n | Error (1) | Error (2) | Error (3) |
|---|---|---|---|---|
| A | 30 | 0.4873 / 0.1354 | 0.5125 / 0.1277 | 0.5109 / 0.1374 |
| A | 50 | 0.4063 / 0.0805 | 0.3820 / 0.0804 | 0.3686 / 0.0805 |
| A | 80 | 0.3146 / 0.0539 | 0.3054 / 0.0523 | 0.2910 / 0.0525 |
| B | 30 | 0.2018 / 0.1936 | 0.1872 / 0.1791 | 0.1835 / 0.1752 |
| B | 50 | 0.1129 / 0.1077 | 0.1174 / 0.1126 | 0.1114 / 0.1072 |
| B | 80 | 0.0728 / 0.0696 | 0.0734 / 0.0706 | 0.0733 / 0.0706 |
| C | 30 | 0.1469 / 0.1302 | 0.1502 / 0.1269 | 0.1571 / 0.1304 |
| C | 50 | 0.1024 / 0.0920 | 0.1039 / 0.0893 | 0.1015 / 0.0884 |
| C | 80 | 0.0670 / 0.0679 | 0.0731 / 0.0660 | 0.0734 / 0.0645 |
| D | 30 | 0.1446 / 0.1422 | 0.1390 / 0.1368 | 0.1487 / 0.1474 |
| D | 50 | 0.0841 / 0.0831 | 0.0844 / 0.0832 | 0.0819 / 0.0807 |
| D | 80 | 0.0612 / 0.0606 | 0.0556 / 0.0549 | 0.0595 / 0.0589 |
| E | 30 | 0.3950 / 0.3936 | 0.4227 / 0.4133 | 0.3996 / 0.3913 |
| E | 50 | 0.3094 / 0.3039 | 0.3137 / 0.3068 | 0.3077 / 0.3015 |
| E | 80 | 0.2549 / 0.2508 | 0.2513 / 0.2508 | 0.2544 / 0.2520 |
5. Conclusions
In this paper, we focus on the estimation problem of the varying coefficient model with heteroscedastic errors. Based on the local linear method, we develop a simple approach to estimate the nonparametric coefficient functions by taking the estimated error heteroscedasticity into account. The resulting estimators are shown to have smaller asymptotic variances than the conventional local-linear estimators. The asymptotic normality of the proposed estimator is established. Furthermore, some simulation experiments are performed to evaluate the finite sample behaviors of the proposed estimators.