Next Article in Journal
Using Reinforcement Learning in the Path Planning of Swarms of UAVs for the Photographic Capture of Terrains
Previous Article in Journal
Numerical Simulations and Modal Analysis to Investigate the Defects in a Coating Process
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Bootstrap Selector for the Smoothing Parameter of Beran’s Estimator †

by
Rebeca Peláez Suárez
1,*,
Ricardo Cao Abad
2 and
Juan M. Vilar Fernández
2
1
Research Group MODES, Department of Mathematics, CITIC, University of A Coruña, 15071 A Coruña, Spain
2
Research Group MODES, Department of Mathematics, CITIC, University of A Coruña and ITMATI, 15071 A Coruña, Spain
*
Author to whom correspondence should be addressed.
Presented at the 4th XoveTIC Conference, A Coruña, Spain, 7–8 October 2021.
Eng. Proc. 2021, 7(1), 28; https://doi.org/10.3390/engproc2021007028
Published: 14 October 2021
(This article belongs to the Proceedings of The 4th XoveTIC Conference)

Abstract

:
This work proposes a resampling technique to approximate the smoothing parameter of Beran’s estimator. It is based on resampling by the smoothed bootstrap and minimising the bootstrap approximation of the mean integrated squared error to find the bootstrap bandwidth. The behaviour of this method has been tested by simulation on several models. Bootstrap confidence intervals are also addressed in this research and their performance is analysed in the simulation study.

1. Introduction

Let { ( X i , Z i , δ i ) } i = 1 n be a simple random sample of ( X , Z , δ ) with X being the covariate, Z = min { T , C } the observed variable and δ = I T C the uncensoring indicator. Usually, T is the time until the occurrence of an event and C is the censoring time. The generalised product-limit estimator of the conditional survival function proposed in [1] is given by
S ^ h B ( t | x ) = i = 1 n 1 I { Z i t , δ i = 1 } w n , i ( x ) 1 j = 1 n I { Z j < Z i } w n , j ( x )
where
w n , i ( x ) = K ( x X i ) / h j = 1 n K ( x X j ) / h
with i = 1 , , n and h = h n is the bandwidth for the covariable. This estimator depends on a smoothing parameter which is, in practice, unknown. Therefore, finding a method for automatic selection of this bandwidth is truly interesting and very helpful in the analysis of real data subject to censoring. Bootstrap confidence intervals of S ( t | x ) are also proposed.

2. Bandwidth Selector

Let r I h R be an appropriate pilot bandwidth. The bootstrap resampling algorithm consists of generating U i U ( 0 , 1 ) and V i K and obtaining
X i * = X [ n U i ] + 1 + r V i ,
Z i * = Z [ n U i ] + 1 ,
δ i * = δ [ n U i ] + 1 ,
for each i = 1 , , n . The bootstrap sample is formed as { ( X i * , Z i * , δ i * ) } i = 1 n .
The optimal smoothing parameter is the bandwidth that minimizes the mean integrated squared error given by:
M I S E x ( h ) = E S ^ h ( t | x ) S ( t | x ) 2 d t .
Then, the bootstrap bandwidth is obtained by minimizing the Monte Carlo approximation of the bootstrap MISE defined as follows
M I S E x * ( h ) 1 B j = 1 B S ^ h * ( j ) ( t | x ) S ^ r ( t | x ) 2 d t ,
where S ^ r ( t | x ) is the Beran survival estimation with pilot bandwidth r using the original sample { ( X i , Z i , δ i ) } i = 1 n , S ^ h * ( j ) ( t | x ) is the Beran survival estimation with bandwidth h using the bootstrap resample { ( X i * ( j ) , Z i * ( j ) , δ i * ( j ) ) } i = 1 n , and B the number of bootstrap resamples.

3. Bootstrap Confidence Intervals

Let h I h R be an appropriate smoothing parameter and fixed values ( t , x ) [ a , b ] × I , the bootstrap confidence interval for a confidence level of 1 α is given by
S ^ r ( t | x ) ρ 1 α / 2 n h , S ^ r ( t | x ) ρ α / 2 n h ,
where S ^ r ( t | x ) is the Beran estimation with the pilot bandwidth r that is used in the bootstrap resampling, and ρ α / 2 and ρ 1 α / 2 are the 100 α / 2 and 100 ( 1 α / 2 ) percentiles of the resampling distribution of n h S ^ h * ( t | x ) S ^ r ( t | x ) , being S ^ h * ( t | x ) the Beran survival estimation of the bootstrap resample.

4. Simulation Study

A simulation study is carried out to analyse the behaviour of the bootstrap algorithm previously described. Several models with different conditional probabilities of censoring were considered. Figure 1 shows the bootstrap estimations of the conditional survival function in two of these scenarios: Model 1 considers the Weibull distribution for life and censoring times and Model 2 considers exponential life and censoring times. Both models have a conditional probability of censoring equal to 0.5. Figure 2 shows the bootstrap confidence intervals in one sample from Models 1 and 2.

5. Conclusions

The results of the simulations show that this bootstrap algorithm provides adequate smoothing parameters to estimate the survival function in this context. The bootstrap bandwidths obtained are similar to the optimal ones and the estimation errors of both are quite similar. Bootstrap confidence intervals have a reasonable behaviour.
Future lines of work focus on developing a method for choosing the bidimensional smoothing parameters involved in the doubly smoothed Beran estimator presented in [2]. In addition, we deal with the construction of confidence intervals for the conditional survival function based on the doubly smoothed Beran estimator.

Funding

This research has been supported by MINECO Grant MTM2017-82724-R, and by the Xunta de Galicia (Grupos de Referencia Competitiva ED431C-2016-015 and Centro Singular de Investigación de Galicia ED431G/01), all of them through the ERDF.

References

  1. Beran, R. Nonparametric Regression with Randomly Censored Survival Data; Technical Report; University of California: Los Angeles, CA, USA, 1981. [Google Scholar]
  2. Peláez Suárez, R.; Cao Abad, R.; Vilar Fernández, J.M. Nonparametric Estimation of the Conditional Survival Function with Double Smoothing. SORT 2021, 45. [Google Scholar]
Figure 1. Theoretical survival function S ( t | x ) (solid line), Beran’s estimation with optimal bandwidth (dotted line) and Beran’s estimation with bootstrap bandwidth (dashed line) for Model 1 (left) and Model 2 (right).
Figure 1. Theoretical survival function S ( t | x ) (solid line), Beran’s estimation with optimal bandwidth (dotted line) and Beran’s estimation with bootstrap bandwidth (dashed line) for Model 1 (left) and Model 2 (right).
Engproc 07 00028 g001
Figure 2. Theoretical survival function (solid line), Beran’s estimator with bootstrap bandwidth (dashed line) and the bootstrap confidence intervals (dotted line) for each t in a grid of size n t = 100 in Model 1 (left) and Model 2 (right).
Figure 2. Theoretical survival function (solid line), Beran’s estimator with bootstrap bandwidth (dashed line) and the bootstrap confidence intervals (dotted line) for each t in a grid of size n t = 100 in Model 1 (left) and Model 2 (right).
Engproc 07 00028 g002
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Suárez, R.P.; Abad, R.C.; Fernández, J.M.V. Bootstrap Selector for the Smoothing Parameter of Beran’s Estimator. Eng. Proc. 2021, 7, 28. https://doi.org/10.3390/engproc2021007028

AMA Style

Suárez RP, Abad RC, Fernández JMV. Bootstrap Selector for the Smoothing Parameter of Beran’s Estimator. Engineering Proceedings. 2021; 7(1):28. https://doi.org/10.3390/engproc2021007028

Chicago/Turabian Style

Suárez, Rebeca Peláez, Ricardo Cao Abad, and Juan M. Vilar Fernández. 2021. "Bootstrap Selector for the Smoothing Parameter of Beran’s Estimator" Engineering Proceedings 7, no. 1: 28. https://doi.org/10.3390/engproc2021007028

Article Metrics

Back to TopTop