Empirical Likelihood for Composite Quantile Regression Models with Missing Response Data

Shuanghua Luo; Yu Zheng; Cheng-yi Zhang

doi:10.3390/sym16101314

Abstract

Under the assumption of missing response data, empirical likelihood inference is studied via composite quantile regression. Firstly, three empirical likelihood ratios of composite quantile regression are given and proved to be asymptotically

χ^{2}

. Secondly, without an estimation of the asymptotic covariance, confidence intervals are constructed for the regression coefficients. Thirdly, three estimators are presented for the regression parameters to obtain its asymptotic distribution. The finite sample performance is assessed through simulation studies, and the symmetry confidence intervals of the parametric are constructed. Finally, the effectiveness of the proposed methods is illustrated by analyzing a real-world data set.

Keywords:

empirical likelihood; composite quantile regression; missing response data; confidence interval

1. Introduction

It is a common occurrence in opinion polls, biostatistics, and a multitude of scientific experiments for data to be missing. Consequently, numerous papers on the statistical analysis of missing data have been published (see [1,2,3,4,5,6]), and some different kinds of methods have been proposed, including imputation methods [7], complete-case (CC) analysis methods [8], likelihood-based methods [9], and inverse probability weighted methods (IPW) [10], to handle the missing data problem. The imputation method under MAR is the most popular and effective of these methods for dealing with missing data. This method uses an appropriate value to replace the missing data point. For missing response data, there are some famous imputation methods such as semi-parametric imputation [11], kernel regression imputation [12], linear regression imputation [13], and so on.

On the other hand, as an indispensable and adaptable instrument in the domain of statistical investigation, quantile regression (QR) has not only elegant mathematical properties and promising performance but also the ability to directly estimate the effects of the covariates at different quantiles, in addition to the center of the distribution, which is a limitation of traditional least squares regression (LSR) methods. Therefore, QR is less sensitive and more robust to outliers. However, since the estimation efficiency of quantile regression is easily affected by specific quantile values, a new quantile regression called composite quantile regression (CQR) was introduced by Zou and Yuan [14] for the estimation of the unknown parameters of a linear model. Recently, a linear model with missing covariates was proposed by Yang and Liu [15]; it uses the CQR method and IPW method. Penalized weighted composite quantile regression was considered by Jin [16] for partially linear varying coefficient models with missing covariates. Zou [17] discussed heteroscedastic partially linear varying-coefficient models with missing censoring indicators using the CQR method.

Furthermore, the empirical likelihood (EL) method, as outlined by Owen [18], was employed for the purpose of constructing confidence intervals. This method offers a number of advantages over traditional normal approximation techniques. For instance, the confidence intervals or regions, which are defined by their shape and orientation, are produced by the empirical likelihood method. The obtained confidence intervals or regions have many attractive features such as a range-preserving property, the circumvention of asymptotic variance estimation, a flexible shape, and so on. Recently, many papers have studied the empirical likelihood inferences for quantile regression models. For instance, Whang [19] proposed a smoothed empirical likelihood method and provided estimates of the parameters of quantile regression models, constructing confidence regions for the model parameters. More analogous works can be found in [3,20,21,22].

As mentioned above, although [15,16] discussed the estimation of composite quantile regression with missing covariates, the authors considered neither the empirical likelihood of the composite quantile regression nor response data missing at random. Ref. [6] studied smoothed empirical likelihood for quantile regression models with response data missing at random, but it did not consider the composite quantile regression model. In addition, in spite of [17] having investigated composite quantile regression for heteroscedastic partially linear varying-coefficient models with missing censoring indicators, the authors considered censoring indicators missing rather than the response data missing at random; furthermore, they also does not study the empirical likelihood of the composite quantile regression model. Thus, it has seldom been considered in the literature on empirical likelihood for composite quantile regression models with missing response data so far. In this paper, we focus on empirical likelihood inferences of composite quantile regression models with missing response data and establish some theoretical results.

The highlights of this article are as follows: Firstly, three empirical likelihood ratios of composite quantile regression are given and proved to be asymptotically distributed according to the chi-squared law. Secondly, without the estimation of the asymptotic covariance, the confidence intervals for the regression coefficients are constructed. Thirdly, a class of estimators for the regression parameters is presented to derive their asymptotic distribution.

The rest of this paper is organized as follows. In Section 2, we will construct three EL ratios and estimators for the regression parameters of a QR model with missing response data. In Section 3, we will derive some asymptotic properties of the proposed procedure. In Section 4, we will undertake a series of simulation studies to evaluate the performance of the proposed method. An application to a real-world data set is used to illustrate the effectiveness of our approach in Section 5. The discussion and conclusions are presented in Section 6. The proofs of the asymptotic results are provided in Appendix A.

2. Empirical Likelihood for Composite Quantile Regression with Missing Response Data

Consider the classic linear model

Y_{i} = X_{i}^{T} β + ε_{i}, i = 1, 2, \dots, n,

(1)

where

Y_{i}

is the response,

β

is a

d \times 1

vector of unknown regression coefficients,

X_{i}

is a

d \times 1

vector of covariates, and the error

ε_{i}

is a random error satisfying

P (ε_{i} < b_{k} | X_{i}) = τ_{k},

where

b_{k}

is the

τ_{k}

of

ε_{i}

,

τ_{k} = \frac{k}{K + 1}

(

k = 1, 2, \dots, K

), and K is the number of quantiles. For the above model, we focus on the case where X is observed completely in a sample of size n and some Y values may be missing. In other words, we obtain an incomplete sample

{X_{i}; Y_{i}; δ_{i}}, 1 \leq i \leq n

from model (1), where

δ_{i} = 0

if

Y_{i}

is missing, and

δ_{i} = 1

otherwise, and all

X_{i}

are observed. Throughout this paper, we assume that Y is missing at random (MAR). The MAR assumption implies that

δ

and Y are conditionally independent given X. That is,

P (δ = 1 | X; Y) = P (δ = 1 | X)

. MAR is reasonable in many practical situations and is a common assumption for statistical analysis with missing data [8].

2.1. Complete-Case Linear Composite Quantile Regression Empirical Likelihood

By model (1), the complete data CQR estimator

{\hat{β}}_{C Q}

of

β

solves

({\hat{b}}_{1}, \dots, {\hat{b}}_{K}, {\hat{β}}_{C Q}) = arg min_{b_{1}, \dots, b_{K}, β} \sum_{k = 1}^{K} \sum_{i = 1}^{n} ρ_{τ} (Y_{i} - X_{i}^{T} β - b_{k}) δ_{i},

(2)

where

ρ_{τ} (u) = u (τ - I_{(u < 0)})

is the quantile loss function. It is easy to show that

E \{δ_{i} (τ_{k} - I_{(Y_{i} - X_{i}^{T} β \leq b_{k})}) | X_{i}\} = 0,

(3)

where

I (\cdot)

is the function of the indicator. By Equation (3) and the EL method idea, the auxiliary random vector is defined as follows:

η_{i c} (β) = \sum_{k = 1}^{K} δ_{i} X_{i} [τ_{k} - I_{(Y_{i} - X_{i}^{T} β \leq b_{k})}],

(4)

when

β

is the true value of the parameter. It can be shown that

E {η_{i c} (β)} = 0

by Equation (3). An empirical log-likelihood ratio function for

β

can be defined based on

η_{i c} (β)

. However, because the quantiles

b_{k}

are unknown,

η_{i c} (β)

cannot be directly used to make inferences for

β

. Therefore, we replace

b_{k}, k = 1, \dots, K

, with their estimators

{\hat{b}}_{k}

from (2); thus,

{\hat{η}}_{i c} (β) = \sum_{k = 1}^{K} δ_{i} X_{i} [τ_{k} - I_{(Y_{i} - X_{i}^{T} β \leq {\hat{b}}_{k})}],

(5)

by the simple calculation of

{\hat{η}}_{i c} (β) = η_{i c} (β) + \sum_{k = 1}^{K} δ_{i} X_{i} [I_{(ε_{i} \leq b_{k})} - I_{(ε_{i} \leq {\hat{b}}_{k})}] .

Similar to the proof of Theorem 1 in [14], we have

{\hat{b}}_{k} - b_{k} = O_{p} (n^{- 1 / 2})

. Hence, it is easy to show that

E \{I_{(ε_{i} \leq b_{k})} - I_{(ε_{i} \leq {\hat{b}}_{k})}\} = P (ε_{i} \leq b_{k}) - P (ε_{i} \leq {\hat{b}}_{k}) = o (1) .

Then, by

E {η_{i c} (β)} = 0

, we can obtain

E {{\hat{η}}_{i c} (β)} = o (1)

, i.e., the auxiliary random vector is asymptotically unbiased; therefore, the complete-case composite quantile empirical log-likelihood (CCQEL) for

β

is defined as

{\hat{R}}_{C c} (β) = - 2 max \{\sum_{i = 1}^{n} log (n p_{i}) : p_{i} \geq 0, \sum_{i = 1}^{n} p_{i} = 1, \sum_{i}^{n} p_{i} {\hat{η}}_{i c} (β) = 0\} .

Then, the empirical likelihood estimation of the linear composite quantile under complete data can be defined as

{\hat{β}}_{C C Q E L}

:

{\hat{β}}_{C C Q E L} = arg min_{β} {\hat{R}}_{C c} (β) .

2.2. Weighted Composite Quantile Empirical Likelihood

In accordance with the method in Section 2.1, a weighted composite quantile empirical log-likelihood ratio function for

β

can be defined as follows:

{\hat{R}}_{C w}^{*} (β) = - 2 max \{\sum_{i = 1}^{n} log (n p_{i}) : p_{i} \geq 0, \sum_{i = 1}^{n} p_{i} = 1, \sum_{i}^{n} p_{i} {\hat{η}}_{i w}^{*} (β) = 0\},

where

{\hat{η}}_{i w}^{*} (β) = \sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} [τ_{k} - I_{(Y_{i} - X_{i}^{T} β \leq {\hat{b}}_{k})}],

(6)

and

p (x) = P (δ = 1 | X = x)

is referred to as the selection probability function. It should be noted that the selection probability

p (x)

in (6) is assumed to be known. A kernel smoothing method can be used to estimate the selection probability if it is unknown. The estimate for

p (x)

can be defined as follows:

\hat{p} (x) = \frac{\sum_{i = 0}^{n} δ_{i} K ((X_{i} - x) / h_{n})}{\sum_{i = 0}^{n} K ((X_{i} - x) / h_{n})},

(7)

where

K (\cdot)

represents a kernel function, and

h = h_{n}

denotes a sequence of positive numbers that tend to zero, which controls the amount of smoothing used in the estimations. Therefore, a weighted composite quantile empirical log-likelihood, say

{\hat{R}}_{C w} (β)

, can be obtained by replacing

p (X_{i})

with its estimator

\hat{p} (X_{i})

. That is,

{\hat{R}}_{C w} (β) = - 2 max \{\sum_{i = 1}^{n} log (n p_{i}) : p_{i} \geq 0, \sum_{i = 1}^{n} p_{i} = 1, \sum_{i}^{n} p_{i} {\hat{η}}_{i w} (β) = 0\},

where

{\hat{η}}_{i w} (β) = \sum_{k = 1}^{K} \frac{δ_{i}}{\hat{p} (X_{i})} X_{i} [τ_{k} - I_{(Y_{i} - X_{i}^{T} β \leq {\hat{b}}_{k})}] .

Then, the empirical likelihood estimation of the linear weighted composite quantile can be defined as

{\hat{β}}_{W C Q E L}

:

{\hat{β}}_{W C Q E L} = arg min_{β} {\hat{R}}_{C w} (β) .

2.3. Imputation Composite Quantile Empirical Likelihood

For estimations of the CCQEL and the WCQEL, the information contained in sample data is not fully utilized because only the information of the observed data is used in constructing the EL ratio; the coverage accuracy of the confidence region is reduced when there are many missing values. To solve the problem,

X_{i}^{T} {\hat{β}}_{C Q}

is imputed if

Y_{i}

is missing. The following introduces auxiliary random data:

{\hat{η}}_{i I} (β) = \sum_{k = 1}^{K} X_{i} [τ_{k} - I_{({\hat{Y}}_{i} - X_{i}^{T} β \leq {\hat{b}}_{k})}],

where

{\hat{Y}}_{i} = \frac{δ_{i} Y_{i}}{\hat{p} (X_{i})} + (1 - \frac{δ_{i}}{\hat{p} (X_{i})}) X_{i}^{T} {\hat{β}}_{C Q}

. Thus, an imputation composite quantile empirical log-likelihood (ICQEL) ratio is defined as

{\hat{R}}_{C I} (β) = - 2 max \{\sum_{i = 1}^{n} log (n p_{i}) : p_{i} \geq 0, \sum_{i = 1}^{n} p_{i} = 1, \sum_{i = 1}^{n} p_{i} {\hat{η}}_{i I} (β) = 0\},

The ratio is more appropriate than the quantile weighted empirical likelihood ratio, as it makes optimal use of the information contained in the data. In addition, the empirical likelihood estimation of the linear imputation composite quantile of

β

can be defined as

{\hat{β}}_{I C Q E L}

:

{\hat{β}}_{I C Q E L} = arg min_{β} {\hat{R}}_{C I} (β) .

3. Asymptotic Properties

Let

r \geq 2

be an integer. Denote as

f (\cdot | x)

and

F (\cdot | x)

, for the conditional density and conditional distribution functions of

ε

on conditional

X_{i} = x

, we denote

g (x)

as the density function of X. Let c be a positive constant that is not dependent on n and may assume a different value in each instance. The following conditions are necessary for the results to be valid:

(C1)

{(Y_{i}, X_{i})

:

i = 1, 2, \dots, n}

are independent and identically distributed random vectors.

(C2) Both

p (x)

and

g (x)

have bounded partial derivatives up to order r almost surely, and

{inf}_{x} p (x) > 0 .

(C3) This condition is made up of the following two aspects:

(a)

K (\cdot)

is bounded; it is compactly supported on

[- 1, 1]

.

(b)

K (\cdot)

is a kernel function of order r, and there are positive constants, denoted by

C_{1}

and

C_{2}

, and a positive real number, denoted by

ρ

, such that the following inequality holds:

C_{1} I_{[| | u | | \leq ρ]} \leq K (u) \leq C_{2} I_{[| | u | | \leq ρ]} .

(C4)

P (| | X | | > M_{n} = o (n^{- 1 / 2})

, where

M_{n} > 0

, and when

n \to \infty

,

M_{n} \to \infty

.

(C5) The positive bandwidth parameter h satisfies

n h^{2 r} \to 0

.

(C6)

X_{i}

has a bounded support, and matrices A and B are nonsingular, where B and A are defined in Theorem 2.

(C7) The conditional distribution of Y given

X = x

is absolutely continuous with a density function

f (\cdot)

strictly bounded away from zero and infinity at the

τ_{k}

conditional quantiles,

k = 1, 2, \dots, K

.

In this section, the asymptotic distributions of the CQEL ratios and the estimators proposed in Section 2.1, Section 2.2 and Section 2.3 will be considered. Firstly, the asymptotic distributions are established for

{\hat{R}}_{C c} (β)

,

{\hat{R}}_{C w} (β)

, and

{\hat{R}}_{C I} (β)

.

Theorem 1.

Suppose that Conditions

C 1 - C 7

hold. If β is the true parameter, then

\hat{R} (β) \overset{L}{⟶} χ_{d}^{2},

where

\hat{R} (β)

is desirable as

{\hat{R}}_{C c} (β)

,

{\hat{R}}_{C w} (β)

, or

{\hat{R}}_{C I} (β)

;

χ_{d}^{2}

is the chi square distribution with degrees of freedom d; and

\overset{L}{⟶}

indicates the convergence in distribution.

Let

χ_{d}^{2} (1 - α)

be the

1 - α

quantile of the

χ_{d}^{2}

for

0 < α < 1

. Using Theorem 1, we are able to derive an approximate

1 - α

confidence region for β, which is defined by

R_{α} (\tilde{β}) = {\tilde{β} | \hat{R} (\tilde{β}) \leq χ_{d}^{2} (1 - α)} .

Theorem 1 can also be used to test the hypothesis

H_{0}

:

β = β_{0}

. One could reject

H_{0}

at level α if

\hat{R} (β_{0}) > χ_{d}^{2} (1 - α)

.

In order to compare the EL method with the asymptotic normal method, the following theorem gives the asymptotic normality of

{\hat{β}}_{C Q}

and

{\hat{β}}_{C Q E L}

, where

{\hat{β}}_{C Q E L}

is desirable as

{\hat{β}}_{C C Q E L}

,

{\hat{β}}_{C W Q E L}

, or

{\hat{β}}_{I C Q E L}

.

Theorem 2.

Suppose that Conditions

C 1 - C 7

hold. Then,

\sqrt{n} ({\hat{β}}_{C Q E L} - {\hat{β}}_{C Q}) = o_{p} (1),

\sqrt{n} (\hat{β} - β) \overset{L}{⟶} N (0, D),

where

A = E {π (X) f (0 | X) X X^{T}}

,

B = \sum_{k = 1}^{K} \sum_{k^{'} = 1}^{K} (min (τ_{k}, τ_{k^{'}}) - τ_{k} τ_{k^{'}}) E {π (X) X X^{T}}

,

D = A^{- 1} {B A}^{- 1}

,

f (\cdot | x)

is the conditional density of ε when

X = x

, and

\hat{β}

is desirable as

{\hat{β}}_{C Q}

,

{\hat{β}}_{C C Q E L}

,

{\hat{β}}_{C W Q E L}

, or

{\hat{β}}_{I C Q E L}

; it has

π (x) = p (x)

when

\hat{β}

is desirable as

{\hat{β}}_{C Q}

and

{\hat{β}}_{C C Q E L}

; it has

π (x) = 1 / p (x)

when

\hat{β}

is desirable as

{\hat{β}}_{C W Q E L}

; and it has

π (x) = 1

when

\hat{β} = {\hat{β}}_{I C Q E L}

.

To construct the confidence region for

\hat{β}

, it is necessary to estimate the asymptotic covariance matrix

D

using

\hat{D} = {\hat{A}}^{- 1} \hat{B} {\hat{A}}^{- 1}

, where

\hat{A} = \frac{1}{n h} \sum_{i = 1}^{n} δ_{i} K_{h} (Y_{i} - X_{i}^{T} \hat{β}) X_{i} X_{i}^{T}

, and

\hat{B} = \frac{τ (1 - τ)}{n} \sum_{i = 1}^{n} δ_{i} X_{i} X_{i}^{T}

. We can prove that

\hat{D}

is a consistent estimator of

D

. Thus, by Theorem 2, we have

\sqrt{n} {\hat{D}}^{- 1 / 2} (\hat{β} - β) \overset{L}{⟶} N (0, I_{d}),

So, there is

n {(\hat{β} - β)}^{T} {\hat{D}}^{- 1} (\hat{β} - β) \overset{L}{⟶} χ_{d}^{2} .

(8)

Accordingly, the confidence regions of

\hat{β}

can be constructed using (8).

4. Simulation Study

In order to study the finite sample performance of the proposed method, we performed some simulations. The following two models are considered:

1. Homoscedastic model:

Y_{i} = X_{i} β + ε_{i}, i = 1, 2, \dots, n;

2. Heteroscedastic model:

Y_{i} = X_{i} β + ξ X_{i} ε_{i}, i = 1, 2, \dots, n .

Here, the variable X was simulated from the

N (0, 1)

, and

ε_{i} \overset{i i d}{\sim} N (0, 0.5)

,

i = 1, 2, \dots, n .

We consider the case where

ξ = 0.5

and

β = 1.5

. In the simulation study, for the convenience of calculation, the composite level was chosen

K = 9

, so the quantiles were taken as

(τ_{1}, τ_{2}, \dots, τ_{9}) = (0.1, 0.2, \dots, 0.9)

. Consider the following three selection probability functions:

(a)

p_{1} (x) = \{\begin{matrix} 0.8 + 0.2 | x - 1 |, & if | x - 1 | \leq 1, \\ 0.95, & others, \end{matrix}

(b)

p_{2} (x) = \{\begin{matrix} 0.9 - 0.2 | x - 1 |, & if | x - 1 | \leq 4.5, \\ 0.10, & others, \end{matrix}

(c)

p_{3} (x) = 0.6, x \in R .

Approximately,

0.09,

0.26, and

0.40

are the average missing rates corresponding to the three cases.

The kernel function was taken to be

K (x) = \{\begin{matrix} 0.75 (1 - x^{2}), & i f | x | \leq 1, \\ 0, & others, \end{matrix}

In total, 2000 Monte Carlo random samples of size

n = 100, 150,

and 200 were generated, and the cross-validation method was used to select the optimal bandwidths

h_{o p t}

. Consider the confidence intervals of

β

on model 1 and model 2. Then, five method, including imputed quantile empirical likelihood (IQEL) in [6], composite QR empirical likelihood without missing data (NCQEL), imputed composite quantile empirical likelihood (ICQEL), and the normal approximation in Theorem 2, were used. In the following, for convenience of expression, the normal approximation confidence intervals for

{\hat{β}}_{I C Q E L}

and

{\hat{β}}_{C Q}

are denoted as NA

({\hat{β}}_{I C Q E L})

and NA

({\hat{β}}_{C Q})

, respectively. Then, the corresponding empirical coverage probabilities and their average lengths of confidence intervals were computed with a nominal level

1 - α = 0.95

and

K = 9

. Table 1, Table 2, Table 3 and Table 4 show the results.

Table 1. Average lengths of the confidence intervals for

β

in model 1, calculated for different forms of the selection probability function

p (x)

and different values of sample size n, with a nominal level of

0.95

.

Table 2. Emprical coverage probabilities of the intervals for

β

in model 1, calculated for different forms of the selection probability function

p (x)

and different values of the sample sizes n, with a nominal level of 0.95.

Table 3. Average lengths of the confidence intervals for

β

in model 2 for different forms of the selection probability function

p (x)

and different values of sample size n under the nominal level of

0.95

.

Table 4. Emprical coverage probabilities of the intervals for

β

in model 1 for different forms of the selection probability function

p (x)

and different values of the sample sizes n under the nominal level of 0.95.

The results in Table 1, Table 2, Table 3 and Table 4 show that (1) for case 1, when

n = 200

, the average length of the confidence intervals for

β

in model 1 obtained by the ICQEL method was 0.9389, while the average length of the confidence intervals for

β

in model 1 obtained by the IQEL method was 0.9346. So, the ICQEL method gives higher coverage probabilities but slightly longer intervals compared to the other two methods. For cases 2 and 3, in the sense that the confidence intervals of ICQEL have higher coverage probabilities and uniformly shorter average lengths, ICQEL performs better than the other two methods. This indicates that when the missing rate is large, CQR imputation is necessary. (2) Both IQEL and ICQEL have slightly longer interval lengths but higher coverage probabilities than NA (

{\hat{β}}_{I C Q E L}

) and NA (

{\hat{β}}_{C Q}

). In addition, the confidence intervals obtained by NA (

{\hat{β}}_{I C Q E L}

) and NA (

{\hat{β}}_{C Q}

) have nearly equal lengths and coverage probabilities in the same case. (3) For every given missing rate, as the sample size n increases, all the interval lengths decrease and the empirical coverage probabilities increase. Observably, the missing rate also affects the interval length and coverage probability. Generally, for every fixed sample size, the coverage probability decreases and the interval length increases as the missing rate increases. However, for the ICQEL and IQEL methods, the two values do not exhibit a significant change because the QR imputation is used in the two methods. Moreover, it is evident that the other methods for the heteroscedastic model, ICQEL, continue to demonstrate superior performance.

5. A Real-World Example

The data originally obtained from Engel [23] are analyzed in this section in order to verify the results attained in this paper, using a real example of a declining share of personal income concerning food expenditure. The data set comprises 235 budget surveys of 19th-century European working-class households and has no missing data. Engel data can be accessed directly from the R package. In order to illustrate our method using the data set, we deleted some of the response values at random to create artificial missing data. Assume that in these data,

20 %

of the response values are missing. The following linear QR model

Y_{i} = β_{0} (τ) + β_{1} (τ) X_{i} + ε_{i}, i = 1, 2, \dots, 235,

was considered, where X is the centered annual household income in Belgian francs, and Y is the household’s centered annual food expenditure.

Now, based on the proposed ICQLE and NCQEL methods, the

95 %

confidence intervals and the estimators of

β

are presented, and the quantiles are taken as

(τ_{1}, τ_{2}, τ_{3}) = (0.25, 0.5, 0.75)

with

K = 3

. The quantile in the IQEL method is taken as

τ = 0.5

. The results are presented in Table 5. From an examination of Table 5, it can be observed that the confidence interval obtained by the IQEL method has a longer confidence interval than that obtained by the ICQEL method. The confidence intervals obtained by ICQEL and NCQEL are basically close to each other. The results are in good agreement with the simulation results.

Table 5. The confidence intervals and estimators of

β

based on IQEL, ICQEL, and NCQEL in the Engel data analysis.

6. Conclusions and Discussions

In this paper, a CQEL method is proposed for analysis for a QR model with missing response data. Three empirical likelihood ratios of CQR, including the CCQEL ratio, WCQEL ratio, and ICQEL ratio, for the regression parameter were proposed, and it was proved that they are asymptotically

χ^{2}

distributed. Also, three CQEL estimators for the regression parameter were constructed such that the three estimators were asymptotically normal. The benefits of the CQEL method were demonstrated through a simulation study and an analysis of a real-world data set.

While this paper focuses on the empirical likelihood estimation of composite quantile regression, other areas of empirical likelihood estimation could also be explored, such as modal composite quantile regression or expectation quantile regression with missing data. Furthermore, missing data are frequently not missing at random, and the composite quantile under missing at non-random can be considered at a later stage.

Author Contributions

Methodology, S.L.; software, Y.Z.; formal analysis, S.L.; investigation, S.L. and Y.Z.; writing—original draft, S.L.; writing—review and editing, C.-y.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundations of China (No. 12271420), the Natural Science Foundation of Shaanxi Province of China (2024JC-YBMS-007), and the Planning Project of Yulin Science and Technology Bureau of Shaanxi Province of China (CXY-2021-117).

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors are grateful to all the reviewers for their constructive comments and suggestions that led to significant improvements to the original manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

The following Lemmas are instrumental in proving the theorems given in Section 3.

Lemma A1.

Assume that Conditions

C 1

–

C 7

hold. Then,

{\hat{b}}_{k} - b_{k} = O_{p} (n^{- 1 / 2}),

where

i = 1, 2, \dots, n

.

Proof.

Similar to the proof of the theorem in [14], this lemma can be easily derived. □

Lemma A2.

Suppose that the regularity Conditions

C 1

–

C 7

hold. If β is the true parameter in (1), we have that

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i} (β) \overset{L}{⟶} N (0, B),

(A1)

and

E (\partial {\hat{η}}_{i} (β) / \partial β) = A + o (1),

(A2)

which are all true where

{\hat{η}}_{i} (β)

takes

{\hat{η}}_{i c} (β)

,

{\hat{η}}_{i w} (β)

, or

{\hat{η}}_{i I} (β)

;

A = E {π (X) f (0 | X) X X^{T}}

; and

B = \sum_{k = 1}^{K} \sum_{k^{'} = 1}^{K} (min (τ_{k}, τ_{k^{'}}) - τ_{k} τ_{k^{'}}) E {π (X) X X^{T}}

; and when

π (x) = p (x)

,

{\hat{η}}_{i} (β)

takes

{\hat{η}}_{i c} (β)

; when

π (x) = 1 / p (x)

,

{\hat{η}}_{i} (β)

takes

{\hat{η}}_{i w} (β)

; and when

π (x) = 1

,

{\hat{η}}_{i} (β) = {\hat{η}}_{i I} (β)

.

Proof.

First is the proof when

{\hat{η}}_{i} (β) = {\hat{η}}_{i c} (β)

and equalities (A1) and (A2) both hold,

i = 1, 2, \dots, n

. Let

M_{i k} = I_{(ε_{i} < b_{k})} - I_{(ε_{i} < {\hat{b}}_{k})}

. The following formula can be obtained by simple calculation:

\begin{matrix} \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i c} (β) & = & \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \sum_{k = 1}^{K} δ_{i} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})}) + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \sum_{k = 1}^{K} M_{i k} δ_{i} X_{i} \\ = & \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} A_{i 1} + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} A_{i 2}, \end{matrix}

(A3)

where

A_{i 1} = \sum_{k = 1}^{K} δ_{i} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})})

, and

A_{i 2} = \sum_{k = 1}^{K} M_{i k} δ_{i} X_{i} .

A simple calculation is available as follows:

E (A_{i 1}) = E \{\sum_{k = 1}^{K} δ_{i} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})})\} = 0 .

We have

\begin{matrix} V a r (A_{i 1}) & = & V a r \{\sum_{k = 1}^{K} δ_{i} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})})\} \\ = & E \{(\sum_{k = 1}^{K} δ_{i} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})})) {(\sum_{k = 1}^{K} δ_{i} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})}))}^{T}\} \\ = & E \{\sum_{k = 1}^{K} \sum_{k^{'} = 1}^{K} δ_{i} X_{i} X_{i}^{T} (τ_{k} - I_{(ε_{i} < b_{k})}) (τ_{k^{'}} - I_{(ε_{i} < b_{k}^{'})})\} . \end{matrix}

(A4)

In addition, there are

E (I_{(ε_{i} < b_{k})}) = τ_{k}

,

E (I_{(ε_{i} < b_{k}^{'})}) = τ_{k}^{'}

, and

E (I_{(ε_{i} < b_{k})} I_{(ε_{i} < b_{k}^{'}}) = m i n {τ_{k}, τ_{k}^{'}} .

Hence, we obtain that

\begin{matrix} E (τ_{k} - I_{(ε_{i} < b_{k})}) (τ_{k}^{'} - I_{(ε_{i} < b_{k}^{'})}) & = & E \{τ_{k} τ_{k}^{'} - τ_{k} I_{(ε_{i} < b_{k}^{'})} - I_{(ε_{i} < b_{k})} τ_{k}^{'} + I_{(ε_{i} < b_{k})} I_{(ε_{i} < b_{k}^{'})}\} \\ = & min (τ_{k}, τ_{k^{'}}) - τ_{k} τ_{k^{'}}, \end{matrix}

We have

V a r (A_{i 1}) = \sum_{k = 1}^{K} \sum_{k^{'} = 1}^{K} (min (τ_{k}, τ_{k^{'}}) - τ_{k} τ_{k^{'}}) E (p (x) {XX}^{T}) .

From the central limit theorem,

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} A_{i 1} \overset{L}{⟶} N (0, B) .

(A5)

Next, we prove

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} A_{i 2} = o_{p} (1),

Let

\sum_{k = 1}^{K} M_{i k} = ξ_{i}

,

A_{i 2, j}

be the jth component of

A_{i 2}

, and

X_{i j}

be the jth component of

X_{i}

; by Lemma A.2 of [24], we have that

max_{\leq s \leq n} | \sum_{i = 1}^{s} X_{i j} | = O_{p} (n^{- 1 / 2}) .

(A6)

Furthermore, in accordance with Lemma A1, we have that

M_{i k} = I_{(ε_{i} < b_{k})} - I_{(ε_{i} < b_{k}^{'})} = O_{p} (n^{- 1 / 2}) .

(A7)

According to Abel’s inequality, using formulas (A6) and (A7), we have that

| \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} A_{i 2, j} | = \frac{1}{\sqrt{n}} | \sum_{i = 1}^{n} ξ_{i} δ_{i} X_{i j} | \leq max_{1 \leq i \leq n} | ξ_{i} | max_{1 \leq i \leq s} | \sum_{i = 1}^{n} δ_{i} X_{i j} | = o_{p} (1) .

Hence, we have

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} A_{i 2} = o_{p} (1) .

(A8)

According to formulas (A4), (A5), and (A8), it is proved that Equation (A1) holds. On the other hand, using the above idea of the proof, we can prove that Equation (A2) holds. Secondly, it is proved that Equations (A1) and (A2) hold when

{\hat{η}}_{i} (β) = {\hat{η}}_{i w} (β)

,

i = 1, 2, \dots, n

. Note that

ψ_{i k} (Y_{i}, X_{i}, β) = τ_{k} - I_{(Y_{i} - X_{i}^{T} β \leq {\hat{b}}_{k})} .

because

\begin{matrix} \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i w} (β) & = & \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \sum_{k = 1}^{K} \frac{δ_{i}}{\hat{p} (X_{i})} X_{i} [τ_{k} - I_{(Y_{i} - X_{i}^{T} β \leq {\hat{b}}_{k})}] \\ = & \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} ψ_{i k} (Y_{i}, X_{i}, β) \\ + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \sum_{k = 1}^{K} \frac{δ_{i} (p (X_{i}) - \hat{p} (X_{i}))}{\hat{p} (X_{i}) p (X_{i})} X_{i} ψ_{i k} (Y_{i}, X_{i}, β) \\ = & \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})}) + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \sum_{k = 1}^{K} M_{i k} \frac{δ_{i}}{p (X_{i})} X_{i} \\ + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \sum_{k = 1}^{K} \frac{δ_{i} (p (X_{i}) - \hat{p} (X_{i}))}{\hat{p} (X_{i}) p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})}) \\ + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \sum_{k = 1}^{K} \frac{δ_{i} (p (X_{i}) - \hat{p} (X_{i}))}{\hat{p} (X_{i}) p (X_{i})} X_{i} M_{i k} \\ = & \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} B_{i 1} + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} B_{i 2} + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} B_{i 3} + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} B_{i 4}, \end{matrix}

(A9)

where

\begin{matrix} B_{i 1} & = & \sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})}), \\ B_{i 2} & = & \sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} M_{i k}, \\ B_{i 3} & = & \sum_{k = 1}^{K} \frac{δ_{i} (p (X_{i}) - \hat{p} (X_{i}))}{\hat{p} (X_{i}) p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})}), \\ B_{i 4} & = & \sum_{k = 1}^{K} \frac{δ_{i} (p (X_{i}) - \hat{p} (X_{i}))}{\hat{p} (X_{i}) p (X_{i})} X_{i} M_{i k} . \end{matrix}

A simple calculation is available as follows:

E (B_{i 1}) = E \{\sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})})\} = 0,

We have that

\begin{matrix} V a r (B_{i 1}) & = & V a r \{\sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})})\} \\ = & E \{(\sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})})) {(\sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})}))}^{T}\} \\ = & E \{\sum_{k = 1}^{K} \sum_{k^{'} = 1}^{K} \frac{δ_{i}}{p^{2} (X_{i})} X_{i} X_{i}^{T} (τ_{k} - I_{(ε_{i} < b_{k})}) (τ_{k^{'}} - I_{(ε_{i} < b_{k}^{'})})\} . \end{matrix}

(A10)

In addition, there are

E (I_{(ε_{i} < b_{k})}) = τ_{k}

,

E (I_{(ε_{i} < b_{k}^{'})}) = τ_{k}^{'}

, and

E (I_{(ε_{i} < b_{k})} I_{(ε_{i} < b_{k}^{'}}) = m i n {τ_{k}, τ_{k}^{'}}

. Hence, we have

\begin{matrix} E (τ_{k} - I_{(ε_{i} < b_{k})}) (τ_{k}^{'} - I_{(ε_{i} < b_{k}^{'})}) & = & E \{τ_{k} τ_{k}^{'} - τ_{k} I_{(ε_{i} < b_{k}^{'})} - I_{(ε_{i} < b_{k})} τ_{k}^{'} + I_{(ε_{i} < b_{k})} I_{(ε_{i} < b_{k}^{'})}\} \\ = & min (τ_{k}, τ_{k^{'}}) - τ_{k} τ_{k^{'}} . \end{matrix}

Hence, We obtain the following formula:

V a r (B_{i 1}) = \sum_{k = 1}^{K} \sum_{k^{'} = 1}^{K} (min (τ_{k}, τ_{k^{'}}) - τ_{k} τ_{k^{'}}) E (\frac{1}{p (x)} X X^{T}) .

From the central limit theorem,

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} B_{i 1} \overset{L}{⟶} N (0, B) .

(A11)

It is not difficult to prove

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} B_{i 2} = o_{p} (1)

from the above idea of

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} A_{i 2} = o_{p} (1)

. By referencing the proof of Theorem 3 of [25], and by Conditions

C 2, C 3

, and

C 5

, we have

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} \frac{δ_{i}}{\hat{p} (X_{i}) p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})}) = O_{p} (1) .

(A12)

Because

s u p_{x} | \hat{p} (x) - p (x) | = o_{p} (1)

, by (A12), we have

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} B_{i 3} = o_{p} (1)

, which is similar to the proof of

\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} B_{i 4} = o_{p} (1)

. So, we can prove that Equation (A1) holds by (A10) and (A11). In addition, using the above proof idea, we can prove that (A2) holds.

Last, it is proved that Equations (A1) and (A2) all hold when

{\hat{η}}_{i} (β) = {\hat{η}}_{i I} (β)

,

i = 1, 2, \dots, n

. Calculations yield

X_{i}^{T} β - {\hat{Y}}_{i} = \frac{δ_{i}}{\hat{p} (X_{i})} (X_{i}^{T} β - Y_{i}) + (1 - \frac{δ_{i}}{\hat{p} (X_{i})}) X_{i}^{T} (β - {\hat{β}}_{Q}) .

It is easy to prove that we have

∥ \frac{1}{n} \sum_{i = 1}^{n} (1 - \frac{δ_{i}}{\hat{p} (X_{i})}) X_{i} ∥ = o_{p} (1),

and

{\hat{β}}_{Q} - β = A^{- 1} \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i} (β) + o_{p} (n^{- 1 / 2}) = O_{p} (n^{- 1 / 2}) .

Therefore, it is possible to obtain

X_{i} (I_{(X_{i}^{T} β - {\hat{Y}}_{i} > 0)} - τ) = X_{i} (I_{(X_{i}^{T} β - Y_{i} > 0)} - τ) = X_{i} ψ_{i} (Y_{i}, X_{i}, β) .

(A13)

Then, we can prove that (A1) and (A2) all hold using the above proof idea, ending the lemma proof. □

Lemma A3.

Suppose that regularity Conditions

C 1

–

C 7

hold. If β is the true parameter in (1), we have that

\frac{1}{n} \sum_{i = 1}^{n} {\hat{η}}_{i} (β) {\hat{η}}_{i}^{T} (β) \overset{P}{⟶} B,

(A14)

where

{\hat{η}}_{i} (β)

takes

{\hat{η}}_{i c} (β)

,

{\hat{η}}_{i w} (β)

, or

{\hat{η}}_{i I} (β)

, and

B = \sum_{k = 1}^{K} \sum_{k^{'} = 1}^{K} (min (τ_{k}, τ_{k^{'}}) - τ_{k} τ_{k^{'}}) E \{π (X) X X^{T}\},

and when

π (x) = p (x)

,

{\hat{η}}_{i} (β)

takes

{\hat{η}}_{i c} (β)

; when

π (x) = 1 / p (x)

,

{\hat{η}}_{i} (β)

takes

{\hat{η}}_{i w} (β)

; and when

π (x) = 1

,

{\hat{η}}_{i} (β) = Z_{i I} (β)

.

Proof.

(a) This proves the conclusion when

{\hat{η}}_{i} (β) = {\hat{η}}_{i c} (β)

, where

i = 1, 2, \dots, n

. A simple calculation is available as follows:

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} {\hat{η}}_{i c} (β) {\hat{η}}_{i c}^{T} (β) & = & \frac{1}{n} \sum_{i = 1}^{n} A_{i 1} A_{i 1}^{T} + \frac{1}{n} \sum_{i = 1}^{n} A_{i 1} A_{i 2}^{T} + \frac{1}{n} \sum_{i = 1}^{n} A_{i 2} A_{i 1}^{T} + \frac{1}{n} \sum_{i = 1}^{n} A_{i 2} A_{i 2}^{T} \\ = & B_{1} + B_{2} + B_{3} + B_{4}, \end{matrix}

It can be obtained from the law of large numbers that

B_{1} \overset{P}{⟶} B

. We prove

B_{2} \overset{P}{⟶} 0

. Let us define the matrix

B_{2, k s}

as the

(k, s)

component of the matrix

B_{2}

. Similarly, let us define the matrix

A_{i j, r}

as the rth component of the matrix

A_{i j}

, where

j = 1, 2

. Subsequently, the Cauchy–Schwarz inequality is employed to derive the following result:

| B_{2, k s} | \leq {(\frac{1}{n} \sum_{i = 1}^{n} A_{i 1, k}^{2})}^{1 / 2} {(\frac{1}{n} \sum_{i = 1}^{n} A_{i 2, r}^{2})}^{1 / 2} .

From Lemmas A1 and A2, we can see that

n^{- 1} \sum_{i = 1}^{n} A_{i 1, k}^{2} = O_{p} (1)

and

n^{- 1} \sum_{i = 1}^{n} A_{i 2, r}^{2} = o_{p} (1)

. Hence,

B_{2} \overset{P}{⟶} 0

. Using a similar argument, we can prove

B_{i} \overset{P}{⟶} 0

,

i = 3, 4

. So, we prove that Equation (A14) holds.

(b) This proves the conclusion when

{\hat{η}}_{i} (β) = {\hat{η}}_{i w} (β)

, where

i = 1, 2, \dots, n

. This is because

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} {\hat{η}}_{i w} (β) {\hat{η}}_{i w}^{T} (β) & = & \frac{1}{n} \sum_{i = 1}^{n} B_{i 1} B_{i 1}^{T} + o_{p} (1) \\ = & C_{1} + o_{p} (1), \end{matrix}

(A15)

where

B_{i 1} = \sum_{k = 1}^{K} \frac{δ_{i}}{p (X_{i})} X_{i} (τ_{k} - I_{(ε_{i} < b_{k})}) .

In accordance with the law of large numbers, it can be inferred that

B_{1} \overset{P}{⟶} B .

(c) By (A13) and using the methods in (a) and (b), we can prove Equation (A14) when

{\hat{η}}_{i} (β) = {\hat{η}}_{i I} (β)

,

i = 1, 2, \dots, n

. □

Proof the Theorem 1.

The Lagrange multiplier method allows us to represent

\hat{R} (β)

in the following way:

\hat{R} (β) = 2 \sum_{i = 1}^{n} log (1 + λ^{T} (β) {\hat{η}}_{i} (β)),

(A16)

where

λ (β)

is a

d \times 1

vector and satisfies the solution of the following equation:

\sum_{i = 1}^{n} \frac{{\hat{η}}_{i} (β)}{1 + λ^{T} (β) {\hat{η}}_{i} (β))} = 0 .

(A17)

Using the method of proving (A17) from Lemma A2 and reference [18], we can obtain

λ (β) = {({\hat{η}}_{i} (β) {\hat{η}}_{i}^{T} (β))}^{- 1} \sum_{i = 1}^{n} {\hat{η}}_{i} (β) + o_{p} (n^{- 1 / 2}) .

(A18)

Then, Taylor expansion is applied to (A16), and by Lemmas A2 and (A18), we have

\hat{R} (β) = 2 \sum_{i = 1}^{n} [λ^{T} (β) {\hat{η}}_{i} (β) - {(λ^{T} (β) {\hat{η}}_{i} (β))}^{2} / 2] + o_{p} (1) .

(A19)

and by Equation (A17), we can obtain

0 = \sum_{i = 1}^{n} \frac{{\hat{η}}_{i} (β)}{1 + λ^{T} (β) {\hat{η}}_{i} (β)} = \sum_{i = 1}^{n} {\hat{η}}_{i} (β) - \sum_{i = 1}^{n} {\hat{η}}_{i} (β) {\hat{η}}_{i}^{T} (β) λ (β) + \sum_{i = 1}^{n} \frac{{\hat{η}}_{i} (β) {(λ^{T} (β) {\hat{η}}_{i} (β))}^{2}}{1 + λ^{T} (β) {\hat{η}}_{i} (β)} .

By Lemmas A2 and (A18), we can obtain

\sum_{i = 1}^{n} {(λ^{T} (β) {\hat{η}}_{i} (β))}^{2} = \sum_{i = 1}^{n} λ^{T} (β) {\hat{η}}_{i} (β) + o_{p} (1) .

Then, by (A19), we have

\hat{R} (β) = (\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i}^{T} (β)) {(\frac{1}{n} \sum_{i = 1}^{n} {\hat{η}}_{i} (β) {\hat{η}}_{i}^{T} (β))}^{- 1} (\frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i} (β)) + o_{p} (1) .

The proof of Theorem 1 is derived from the combination of Lemmas A2 and A3. □

Proof the Theorem 2.

First, by the Taylor expansion, we have

\begin{matrix} \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i} (\hat{β}) & = & \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i} (β) + \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i}^{'} (β) (\hat{β} - β) + o_{p} (n^{- 1 / 2}) \\ = & D_{n} + A (\hat{β} - β) + o_{p} (n^{- 1 / 2}), \end{matrix}

(A20)

where

D_{n} = \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} {\hat{η}}_{i} (β) .

Second, using an argument similar to that in [26] of (28)–(30) and Lemma A2,

{\hat{β}}_{C Q E L} - \hat{β} = o_{p} (n^{- 1 / 2})

. Lastly, this, together with Lemmas A2, (A19), and (A20), proves Theorem 2. □

References

Liu, H.; Yang, H.; Peng, C. Weighted composite quantile regression for single index model with missing covariates at random. Comput. Stat. 2019, 34, 1711–1740. [Google Scholar] [CrossRef]
Chen, X.; Alan, T.K.; Zhou, Y. Efficient quantile regression analysis with missing observations. J. Am. Stat. 2015, 110, 723–741. [Google Scholar] [CrossRef]
Luo, S.H.; Yan, Y.X.; Zhang, C.Y. Two-Stage estimation of partially linear varying coefffcient quantile regression model with missing data. Mathematics 2024, 12, 578. [Google Scholar] [CrossRef]
Xue, L.G.; Zhu, L.X. Empirical likelihood in a partially linear single-index model with censored response data. Comput. Stat. Data Anal. 2024, 193, 107912. [Google Scholar] [CrossRef]
Luo, S.H.; Zhang, C.Y.; Wang, M.H. Composite quantile regression for varying coefficient models with response data missing at random. Symmetry 2019, 11, 1065. [Google Scholar] [CrossRef]
Luo, S.H.; Mei, C.L.; Zhang, C.Y. Smoothed empirical likelihood for quantile regression models with response data missing at random. AStA-Adv. Stat. Anal. 2017, 15, 95–116. [Google Scholar] [CrossRef]
Aerts, M.; Claeskens, G.; Hens, N.; Molenberghs, G. Local multiple imputation. Biometrika 2002, 89, 375–388. [Google Scholar] [CrossRef]
Little, R.J.A.; Rubin, D.B. Statistical Analysis with Missing Data; John Wiley & Sons: Hoboken, NJ, USA, 2014. [Google Scholar]
Schafer, J.; Graham, J. Missing data: Our view of the state of the art. Psychol. Methods 2002, 2, 147–177. [Google Scholar] [CrossRef]
Robins, J.M.; Rotnitzky, A.; Zhao, L.P. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. J. Am. Stat. Assoc. 1995, 90, 106–121. [Google Scholar] [CrossRef]
Wang, Q.; Linton, O.; Ärdle, W.H. Semiparametric regression analysis with missing response at random. J. Am. Stat. Assoc. 2004, 99, 334–345. [Google Scholar] [CrossRef]
Wang, Q.; Rao, N.K. Empirical likelihood-based inference under imputation for missing response data. Ann. Stat. 2002, 30, 896–924. [Google Scholar]
Xue, L.G. Empirical likelihood for linear models with missing responses. J. Multivar. Anal. 2009, 100, 1353–1366. [Google Scholar] [CrossRef]
Zou, H.; Yuan, M. Composite quantile regression and the oracle model selection theory. Ann. Stat. 2008, 36, 1108–1126. [Google Scholar] [CrossRef]
Yang, H.; Liu, H.L. Penalized weighted composite quantile estimators with missing covariates. Stat. Pap. 2016, 57, 69–88. [Google Scholar] [CrossRef]
Jin, J.; Ma, T.; Dai, J.; Liu, S. Penalized weighted composite quantile regression for partially linear varying coefficient models with missing covariates. Comput. Stat. 2020, 36, 541–575. [Google Scholar] [CrossRef]
Zou, Y.Y.; Fan, G.L.; Zhang, R.Q. Composite quantile regression for heteroscedastic partially linear varying-coefficient models with missing censoring indicators. J. Stat. Comput. Simul. 2023, 93, 341–365. [Google Scholar] [CrossRef]
Owen, A.B. Empirical likelihood ratio confidence regions. Ann. Stat. 1990, 18, 90–120. [Google Scholar] [CrossRef]
Whang, Y.J. Smoothed empirical likelihood methods for quantile regression models. Econom. Theory 2006, 22, 173–205. [Google Scholar] [CrossRef]
Zhao, P.X.; Lin, X.S.; Lin, L. Empirical likelihood for composite quantile regression modeling. J. Appl. Math. Comput. 2015, 48, 321–333. [Google Scholar] [CrossRef]
Wang, J.F.; Jiang, W.J.; Xu, F.Y.; Fu, W.X. Weighted composite quantile regression with censoring indicators missing at random. Commun. Stat.-Theory Methods 2021, 50, 2900–2917. [Google Scholar] [CrossRef]
Sun, J.; Ma, Y.Y. Empirical likelihood weighted composite quantile regression with partially missing covariates. J. Nonparametric Stat. 2017, 29, 137–150. [Google Scholar] [CrossRef]
Engel, E. Die productions and consumtionsver haltnisse des konigreichs sachsen. Stat. Burdes 1857, 8, 1–54. [Google Scholar]
Zhao, P.X.; Xue, L.G. Empirical likelihood inferences for semiparametric varying coefficient partially linear models with longitudinal data. Commun. Stat.-Theory Methods 2010, 39, 1898–1914. [Google Scholar] [CrossRef]
Wong, H.; Guo, S.; Chen, M. On locally weighted estimation and hypothesis testing of varying-coefficient models with missing covariates. J. Stat. Plan. Inference 2009, 139, 2933–2951. [Google Scholar] [CrossRef]
Otsu, T. Conditional empirical likelihood estimation and inference for quantile regression models. J. Econom. 2008, 142, 508–538. [Google Scholar] [CrossRef]

Table 1. Average lengths of the confidence intervals for

β

in model 1, calculated for different forms of the selection probability function

p (x)

and different values of sample size n, with a nominal level of

0.95

.

Table 1. Average lengths of the confidence intervals for

β

in model 1, calculated for different forms of the selection probability function

p (x)

and different values of sample size n, with a nominal level of

0.95

.

		QEL			NA
$p (x)$	$n$	IQEL	ICQEL	NA ( ${\hat{β}}_{ICQEL}$ )	NA ( ${\hat{β}}_{CQ}$ )	NCQEL
$p_{1} (x)$	100	0.9171	0.9199	0.9182	0.9012	0.9201
	150	0.9285	0.9298	0.9281	0.9121	0.9299
	200	0.9346	0.9379	0.9354	0.9243	0.9379
$p_{2} (x)$	100	0.9025	0.9046	0.9024	0.8998	0.9047
	150	0.9142	0.9158	0.9136	0.9032	0.9157
	200	0.9238	0.9298	0.9236	0.9198	0.9299
$p_{3} (x)$	100	0.9138	0.9198	0.9128	0.9016	0.9196
	150	0.9279	0.9312	0.9268	0.9189	0.9314
	200	0.9354	0.9416	0.9339	0.9296	0.9418

Table 2. Emprical coverage probabilities of the intervals for

β

in model 1, calculated for different forms of the selection probability function

p (x)

and different values of the sample sizes n, with a nominal level of 0.95.

Table 2. Emprical coverage probabilities of the intervals for

β

in model 1, calculated for different forms of the selection probability function

p (x)

and different values of the sample sizes n, with a nominal level of 0.95.

		QEL			NA
$p (x)$	$n$	IQEL	ICQEL	NA ( ${\hat{β}}_{ICQEL}$ )	NA ( ${\hat{β}}_{CQ}$ )	NCQEL
$p_{1} (x)$	100	0.2891	0.2942	0.2812	0.2898	0.2940
	150	0.2678	0.2698	0.2659	0.2645	0.2699
	200	0.2264	0.2314	0.2245	0.2214	0.2315
$p_{2} (x)$	100	0.3452	0.3246	0.3358	0.3389	0.3244
	150	0.3345	0.3187	0.3298	0.3301	0.3186
	200	0.3254	0.2978	0.3187	0.3198	0.2975
$p_{3} (x)$	100	0.3165	0.3056	0.3127	0.3157	0.3056
	150	0.3106	0.2986	0.3097	0.3102	0.2985
	200	0.2997	0.2856	0.2898	0.2984	0.2855

Table 3. Average lengths of the confidence intervals for

β

in model 2 for different forms of the selection probability function

p (x)

and different values of sample size n under the nominal level of

0.95

.

Table 3. Average lengths of the confidence intervals for

β

in model 2 for different forms of the selection probability function

p (x)

and different values of sample size n under the nominal level of

0.95

.

		QEL			NA
$p (x)$	$n$	IQEL	ICQEL	NA ( ${\hat{β}}_{ICQEL}$ )	NA ( ${\hat{β}}_{CQ}$ )	NCQEL
$p_{1} (x)$	100	0.9113	0.9251	0.9103	0.9098	0.9252
	150	0.9241	0.9312	0.9298	0.9214	0.9312
	200	0.9302	0.9389	0.9315	0.9299	0.9388
$p_{2} (x)$	100	0.9122	0.9298	0.9288	0.9119	0.9299
	150	0.9214	0.9384	0.9381	0.9207	0.9385
	200	0.9306	0.9476	0.9451	0.9302	0.9477
$p_{3} (x)$	100	0.9244	0.9285	0.9242	0.9231	0.9288
	150	0.9349	0.9364	0.9348	0.9316	0.9364
	200	0.9403	0.9478	0.9405	0.9399	0.9479

Table 4. Emprical coverage probabilities of the intervals for

β

in model 1 for different forms of the selection probability function

p (x)

and different values of the sample sizes n under the nominal level of 0.95.

Table 4. Emprical coverage probabilities of the intervals for

β

in model 1 for different forms of the selection probability function

p (x)

and different values of the sample sizes n under the nominal level of 0.95.

		QEL			NA
$p (x)$	$n$	IQEL	ICQEL	NA ( ${\hat{β}}_{ICQEL}$ )	NA ( ${\hat{β}}_{CQ}$ )	NCQEL
$p_{1} (x)$	100	0.2879	0.2912	0.2876	0.2896	0.2911
	150	0.2798	0.2822	0.2788	0.2799	0.2822
	200	0.2614	0.2698	0.2612	0.2616	0.2697
$p_{2} (x)$	100	0.3124	0.3056	0.3087	0.3134	0.3055
	150	0.3045	0.3002	0.3012	0.3055	0.3002
	200	0.2978	0.2945	0.2968	0.2998	0.2944
$p_{3} (x)$	100	0.3015	0.3002	0.3011	0.3014	0.3003
	150	0.2978	0.2968	0.2971	0.2979	0.2967
	200	0.2868	0.2854	0.2861	0.2867	0.2853

Table 5. The confidence intervals and estimators of

β

based on IQEL, ICQEL, and NCQEL in the Engel data analysis.

Table 5. The confidence intervals and estimators of

β

based on IQEL, ICQEL, and NCQEL in the Engel data analysis.

		Estimators		Confidence Intervals
$β$	IQEL	ICQEL	NCQEL	IQEL	ICQEL	NCQEL
$β_{0}$	101.01	101.02	101.04	(74.60, 112.21)	(76.25, 106.46)	(76.46, 106.51)
$β_{1}$	0.4993	0.4996	0.4995	(0.4656, 0.5875)	(0.4724, 0.5648)	(0.4727, 0.5649)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.