Abstract
This work examines the asymptotic characteristics of a conditional set-indexed empirical process composed of functional ergodic random variables with missing at random (MAR). This paper’s findings enlarge the previous advancements in functional data analysis through the use of empirical process methodologies. These results are shown under specific structural hypotheses regarding entropy and under appealing situations regarding the model. The regression operator’s asymptotic -confidence interval is provided for as an application. Additionally, we offer a classification example to demonstrate the practical importance of the methodology.
Keywords:
conditional distribution; small ball probability; missing at random; empirical process; ergodic functional data; semi-metric space; covering number MSC:
62G20; 62G05; 62G32; 62G08; 62G35; 62G07; 62E20
1. Introduction
There are several strategies for solving problems in statistics, among which empirical process techniques are considered the best. Historically, many limit theorems for the empirical process have been established in finite dimension frameworks (see, e.g., Refs. [1,2,3] for exhaustive, self-contained texts with a variety of statistical applications) together under mixing conditions and independent identically distributed framework, in the setting of independents variables [4] characterized modulo measurability, the classes of sets for which the Glivenko–Cantelli theorem holds, we may also cite Refs. [5,6,7,8,9,10,11,12,13,14,15]. Under various mixing conditions, empirical processes based on dependent data have been investigated; for instance, the authors of Ref. [16] established the asymptotic normality of sequences undergoing -mixing. Regarding these areas of investigation concerning an alternative form of mixing, it is possible to refer to Refs. [17,18,19,20]. Nevertheless, the author of [21] identified a bracketing condition that could occur due to vigorous mixing. The function-indexed empirical procedure for beta-mixing sequences was investigated by Ref. [22]. Uniform convergence and asymptotic normality of a set-indexed conditional empirical process within a strictly stationary and strong mixing framework have been established by Ref. [23]. Over the past few decades, there has been a growing interest in the statistical literature regarding matters concerning functional random variables, which are variables with values that exist in an infinite-dimensional space. As is the case, for example, in meteorology, medicine, satellite imagery, and numerous other scientific disciplines, the proliferation of data collected on an ever-increasingly precise temporal and spatial grid has inspired the development of this research topic. Numerous complex theoretical and numerical inquiries were thus engendered by the statistical modeling of these data, which were perceived as stochastic functions. The monographs of Refs. [24,25] provide comprehensive surveys of functional data analysis, encompassing both theoretical and practical aspects. These monographs discuss linear models for random variables that take values in a Hilbert space, scalar-on-function and function-on-function linear models, parametric discriminant analysis, and functional principle component analysis, respectively. To access the most recent findings on FDA and related subjects, we may consult the bibliographic reviews provided by sources such as Refs. [26,27,28,29,30,31], among others. For scalar-on-function nonlinear regression models, the authors of [32] emphasized nonparametric techniques, particularly kernel-type estimation. Such tools were subsequently expanded to include discrimination and classification analysis. An intriguing statistical concept that was extended to the functional data framework was examined by Ref. [33]. These concepts included the portmanteau test, change detection, and goodness-of-fit tests. Good overviews of this literature can be found in Refs. [20,34,35,36,37,38,39,40,41], and, more recently, Ref. [42] gave the first results of the conditional set-indexed empirical process in functional data. Considerable effort has been devoted to developing a convergence theory for empirical processes involving functional random variables, although these topics are well beyond the purview of the paper discussed in Ref. [23]. A theoretical framework of this nature is imperative for contemporary statistical analysis. For over six decades, functional data analysis has been acknowledged in the statistical literature and has since become the focus of numerous works. We observe the extreme limitedness of the outcomes produced by empirical processes utilizing functional frameworks. We may refer for recent references to Refs, [43,44,45,46,47], who achieved numerous valuable outcomes regarding set-indexed conditional empirical processes inside the functional setting of the ergodic framework. One should avoid overlooking the possibility that some pairings of observations may be incomplete in numerous practical applications, including sampling surveys, pharmaceutical tracing tests, and reliability tests. Such instances are commonly referred to as “missing data”. Others in the fields of data science and analytics will attest to the fact that missing data is a common issue. MAR (Missing At Random) indicates that while there may be systematic differences between the missing and observed values, these discrepancies can be fully accounted for by other observed variables. The situation changes significantly when predictors are present; for instance, the authors of [48,49,50,51,52,53,54,55,56,57,58] provide some examples of this in finite dimensionality, as recent references to Refs. [59,60]. In a recent study, the authors of [61] examined the linear quantile regression model in the presence of missing response data that occur randomly. The study utilized the inverse probability weight method. The authors developed a mathematical equation for estimating unknown parameters using quantile regression. They also introduced a standard estimator for quantile regression. Simultaneously, they formulated the empirical likelihood (EL) ratio function for the unknown parameter and established a maximum EL estimator for the unknown parameter. There is a scarcity of work that examines the statistical characteristics of functional nonparametric models for missing data. The kernel estimator of the conditional quantile was introduced by Ref. [62] under the assumptions of ergodicity and random censorship. The author also demonstrated strong consistency (with rate) and defined the asymptotic distribution of the estimator. Additionally, they applied the estimator to forecast the peak electricity demand interval using smart meter data, details of which have been omitted. In their study, the authors of [63] developed a type of estimator for the regression operator in the context of functional stationary ergodic data with missing at random (MAR) responses. They also established the asymptotic properties of the estimator, including its convergence rate in probability and asymptotic normality. For further references, we suggest consulting Refs. [64,65].
Our findings extend upon a prior study [44] by establishing more precise limits under less stringent limitations. This offers a new perspective of the empirical processes theory for random variables with general dependencies. This work addresses a problem that has not been thoroughly examined thus far. The framework of ergodic functional data was introduced by Ref. [66], who established consistencies with rates along with the asymptotic normality of the regression function estimate and provided some examples. For recent papers on the subject, we refer to Ref. [43], where the authors extended Ref. [66] to a more general framework. Some motivations to consider ergodic dependence structure in the data rather than a mixing one are discussed in Refs. [67,68].
The objective of this study is to enhance the development of a practical methodology for addressing MAR samples in functional nonparametric situations. We want to examine the estimation of conditional set-indexed empirical processes in the presence of both missing at random (MAR) data and ergodicity.
The structure of this paper is outlined as follows. In Section 2, we introduce the notation and definitions, along with the conditional empirical process. Our main results are presented in Section 3. Section 3.1 is dedicated to discussing the procedure for selecting the bandwidth. In Section 4, we apply our main result to classification. Concluding remarks and potential future developments are discussed in Section 5. To maintain a smooth presentation flow, all proofs are consolidated in Appendix A.
2. The Set Indexed Conditional Empirical Process
To enhance clarity, let us delve into the definition of the ergodic property for processes. Consider a measurable space , and denote by the space of all functions . If represents the value of the function s at , define as the j-th coordinate map, i.e., . Now, consider for ; a random process can be viewed as a random variable defined on the probability space , taking values in . For any , a set is termed invariant if there exists a set such that holds for every . The process Z is then considered ergodic when, for any invariant set B, we have or . As per the ergodic theorem, it is well-known that for a stationary ergodic process Z, the following convergence holds almost surely:
Therefore, the ergodic property in our setting is formulated based on the statement (1). We consider a sample of random elements , each drawn from the joint distribution of , where X takes values in a space and Y in . The functional space is endowed with a semi-metric . Our goal is to investigate the relationships between X and Y by estimating functional operators associated with the conditional distribution of Y given X. One such operator is the regression operator for a measurable set C in a class of sets :
To address this, we employ a Nadaraya–Watson-type conditional empirical distribution, as proposed by Refs. [42,44,69,70]. We introduce the term MAR (Missing mechanism with MAR) for the response variable. In an available incomplete sample of size from , denoted as , is fully observed, if is observed, and otherwise. The Bernoulli random variable satisfies:
where is a function operator, termed the conditional probability of observing the response given the predictor, often unknown. This mechanism implies that and Y are conditionally independent given X, akin to the finite-dimensionality case in Ref. [48].
The Nadaraya–Watson-type conditional empirical distribution function is given by:
where is a real-valued kernel function from into , is a smoothing parameter satisfying as , C is a measurable set, and . When choosing , where , it reduces to the conditional empirical distribution function , as referenced in Refs. [71,72,73]. However, the corresponding class is defined as . Regarding the semi-metric topology on , we introduce the notation
which denotes the ball in with center x and radius t. This concept is commonly referred to as the small ball probability function in the literature, especially when t tends to zero. The significance of this notion is both theoretically and practically profound, as the concept of a ball is intricately connected with the semi-metric . The selection of this semi-metric becomes pivotal when dealing with data in infinite-dimensional spaces.
In many cases, the probability function for the small ball can be roughly represented as the multiplication of two independent functions with respect to variables x and h. This insight is illustrated in several examples found in Proposition 1 of [74]:
- for some with ;
- for some and with is the Dirac’s function;
- with the indicator function in .
Define the following -fields: and Let
where be the -filed generated by and that generated by . Let be a ball centered at with radius u. Let so that is a nonnegative real-valued random variable. Operating within the probability space , consider
and to be the distribution function and the conditional distribution function, respectively, given the -field of . Here, denotes the ball in the space centered at x with radius u. Let represent a real random function such that converges to zero almost surely as . In a similar vein, define as a real random function such that is almost surely bounded. In what follows, we implicitly assume the ergodicity of the sequence of random elements .
2.1. Assumptions and Notation
In this paper, the variable x is a constant element within the functional space . We present the metric entropy with inclusion as a means to quantify the richness or complexity of the set class . For any given , the covering number is defined as:
The term is referred to as the metric entropy with inclusion of with respect to . For numerous classes, estimates for these covering numbers are well-documented; refer, for instance, to Ref. [75]. Below, we frequently make the assumption that either or exhibit behaviors reminiscent of powers of . We affirm that condition () is satisfied when
where
for some constants . As emphasized in Ref. [23], it is notable that the condition (3), where , is fulfilled by intervals, rectangles, balls, ellipsoids, and by classes derived from these through finite set operations of union, intersection, and complement. The class of convex sets in () satisfies the condition (3) with . Various other sets that satisfy (3) with are elaborated upon in Ref. [75]. We give now further notation. For , set
In this section, we establish the weak convergence of the process as defined by
In the course of our analysis, we will rely on the following assumptions.
- (H1)
- For every , there exists a sequence of nonnegative bounded random functionals , a sequence of random functions , a deterministic nonnegative bounded functional , and a nonnegative real function such that as , as , such that
- (i)
- (ii)
- For any with as almost surely bounded and
- (iii)
- almost surely as , for
- (iv)
- There exists a nondecreasing bounded function that uniformly holds for all .as and , .
- (H2)
- There exist positive constants and such that for all , a neighborhood of x, the following holds
- (H3)
- (i)
- The conditional mean of given the -field depends solely on , meaning that for any , almost surely. The conditional mean of given the -field also depends only on , i.e., for any ,almost surely.
- (ii)
- Furthermore, the functions and are continuous in a neighborhood of x, namely,
- (iii)
- such that we letbe continuous in a neighborhood of x for
- (H4)
- For any and positive constants and , the following holds for the conditional density of Y given :
- (H5)
- The kernel function has support within the interval and possesses a continuous first derivative on . It satisfies the condition for all . Moreover,
- (H6)
- Suppose that the set class adheres to condition (3);
- (H7)
- The smoothing parameter () fulfills the following criterion: and as .
2.2. Comments on the Assumptions
The significance of condition (H1) extends to both the ergodic and functional aspects addressed in this paper. The condition utilized here shares similarities with that employed in Ref. [66]. The functions and play roles analogous to the conditional and unconditional densities in the finite-dimensional scenario. In the meantime, describes the influence of the radius u on the small ball probability as u tends to zero, as illustrated in Ref. [66]. Conditions (H2)(i) are standard in nonparametric regression estimation. (H3)(i) is essential for establishing consistency, reflecting the Markovian nature of the functionally stationary ergodic data. This condition aligns with that used in Ref. [63]. (H3)(ii,iii) serve as continuous local conditions, necessary for the main results and for conciseness in this paper. Condition (H4) on the density conforms to a classical Lipschitz-type nonparametric functional model. Assumption (H5) relates to the choice of the kernel , a common practice in nonparametric functional estimation. It is worth noting that the Parzen symmetric kernel is unsuitable in this context due to the positivity of the random process . Hence, we consider with support , a natural generalization of the assumption usually made in the multivariate case, where is expected to be a spherically symmetric density function. The conditions and ensure that for all limit functions . The condition is necessary for defining the moments , which, in this case, are determined by the value . (H7) provides a condition on the bandwidths, acknowledging that consistency cannot be guaranteed without it.
3. Main Results
Below, we note when the random variable Z is distributed according to a normal distribution with mean and variance . The symbol represents convergence in distribution, while indicates convergence in probability.
Theorem 1
(Uniform Consistency). Assume that the conditions (H1)–(H7) are satisfied. Consider a class of measurable sets for which
for any . Moreover, assume that for every
If and as , then
Note that the proof of Theorem 1 follows directly from the decomposition
where
and
Let . We have
Henceforth, for , let us denote
and
here, represents the conditional expectation of the random variable X given the -field .
To establish asymptotic normality, define the “bias” term as
The subsequent result presents the weak convergence. It is important to note that is specified in (H1).
Theorem 2
(Asymptotic normality). Assuming (H1)–(H7), as , for and , we have
where and
whenever and
To obtain the density of the process, it is essential to introduce the following function, which provides insights into the asymptotic behavior of the modulus of continuity:
Theorem 3.
Assume that (H1)–(H7) are satisfied. For every , consider as a class of measurable sets with
and suppose that fulfils (3) with . Additionally, we assume that and as , such that
and as we have
Furthermore, we assume that . For and , the latter has to be replaced by . Under the conditions of Theorem 2, the process converges in law to a Gaussian process , which possesses a version with uniformly bounded and uniformly continuous paths with respect to the norm. The covariance is given by as specified in Theorem 2.
Remark 1.
The distance of two measures , in the Prokhorov metric is defined as (see, e.g., Refs. [76,77,78,79])
Here , where is the distance of x to B, i.e., . The distance of two random variables , in the Ky Fan metric is defined as [80]
It is worthwhile to establish an adequate link of our findings to these distances in the conditional setting.
Remark 2.
Central limit theorems are frequently utilized to establish confidence intervals for the target being estimated. In the realm of non-parametric estimation, the asymptotic variance in the central limit depends on certain unknown functions. Consequently, in practical scenarios, only approximate confidence intervals can be derived, even when is functionally specified. Notably, according to Theorem 2, the limiting variance incorporates the unknown function and the normalization is contingent on the function , which is not explicitly identifiable in practice. Furthermore, the quantities and need to be estimated. The corollary below, a slight modification of Theorem 2, permits a practical form of the results to be used, as typically the conditional variance is estimated similarly to what is obtained by Ref. [63].
Let
Let us introduce the following estimation
By employing the decomposition of in (H1)(i) and (H1)(i,iv), one can estimate as
Subsequently, for a given kernel and the quantities and can be estimated as follows
Finally, the estimator of is denoted by
Corollary 1.
Suppose that conditions (H1)–(H7) are satisfied, where and are integrable functions. Additionally, assume that and as . Then, for any such that , we have
Using Corollary (1) the asymptotic confidence band given by
where is the upper quantile of the Normal distribution
3.1. The Bandwidth Selection Criterion
Several approaches have been devised and refined to formulate asymptotically optimal bandwidth selection rules for nonparametric kernel estimators, particularly for the Nadaraya–Watson regression estimator. Some noteworthy contributions include [81,82,83,84,85,86,87]. Choosing this parameter appropriately is essential, whether in the conventional finite-dimensional case or within the infinite-dimensional framework, to guarantee favorable practical performance. Let us define the leave-out- estimator for the regression function
To minimize the quadratic loss function, we introduce the following criterion, where we have a (known) nonnegative weight function
Building upon the concepts developed by Ref. [83], a natural approach for selecting the bandwidth is to minimize the preceding criterion. Thus, let us choose , as the minimizer over h:
One can replace (6) by
In practice, one takes, for , the uniform global weights , and the local weights
For brevity, we have concentrated on the most popular method, namely, the cross-validated selected bandwidth. This approach can be extended to any other bandwidth selector, such as the bandwidth based on Bayesian ideas [88].
4. Applications to Classification with Partially Labeled Data
In this section, we apply the results developed in the previous sections to the problem of statistical classification. We consider a sample of random elements drawn from the joint distribution of , where X takes values in a space and Y in . In classification, the objective is to predict the integer-valued label Y based on the covariate vector X. More formally, we aim to find a function (classifier) for which the probability of misclassification error (incorrect prediction), i.e., , is minimized. Let
Demonstrating that the optimal classifier, i.e., the one with the minimum probability of error, is given by
i.e., the best classifier satisfies
As is unknown, the data is utilized to construct estimates of . Specifically, let represent a random sample from the distribution of , where each is fully observable. Let be any sample-based classifier. In other words, is the predicted value of Y, based on and X. Let
be the conditional probability of error of the sample-based classifier . Then is said to be consistent if as , for . Let be any sample-based estimators of and define the classification rule by
In other words, satisfies
to show it is sufficient to show that by posing , we have
Theorem 4.
Under the conditions of Theorem 3, we have the convergence
5. Concluding Remarks
In this investigation, we have examined the asymptotic properties of the conditional set-indexed empirical process involving ergodic functional data that are missing at random (MAR). Our findings are obtained under assumptions pertaining to the richness of the index class of sets in terms of metric entropy with bracketing. Our contribution is two-fold: first, we have developed a functional methodology for addressing MAR samples in non-parametric problems, and second, we have extended our non-parametric conditional methodology by incorporating the ergodicity concepts introduced in Ref. [44]. Several challenging open questions remain in this context, including potential extensions to other types of non-parametric predictors such as functional local linear predictors, functional kNN predictors, and others. Furthermore, exploring extensions to problems beyond prediction, such as the estimation of variance error, is an interesting avenue for future research. Another direction for future exploration is the consideration of reducing the predictor’s dimensionality by employing a Single Functional Index Model (SFIM) to estimate the regression, as discussed in Refs. [89,90]. SFIM has shown its effectiveness in improving the consistency of the regression operator estimator.
Author Contributions
Conceptualization, S.B.; methodology, S.B.; validation, S.B., Y.S. and F.M.; formal analysis, S.B. and Y.S.; investigation, S.B. and Y.S.; original draft preparation, S.B. and Y.S.; writing—review and editing, S.B. and Y.S.; supervision, S.B. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
Data is contained within the article.
Acknowledgments
The authors would like to thank the Editor-in-Chief, an Associate-Editor, and the three referees for their extremely helpful remarks, which resulted in a substantial improvement of the original form of the work and a presentation that was more sharply focused.
Conflicts of Interest
The authors declare no conflicts of interest.
Appendix A
The proofs of our results are presented in this section. The notation introduced earlier is also utilized in the subsequent sections.
Lemma A1.
Assume that conditions (H1(i))–(H1(ii))–(H1(iv))–(H5) hold true for any real numbers and with . As , we have:
- (i)
- ;
- (ii)
- ;
- (iii)
- .
Proof of Lemma A1.
For the proof of Lemma A1, the reader is directed to Ref. [66]. □
Lemma A2.
Assume that the hypotheses (H1) and (H5), along with condition (H7), are satisfied. As , for every fixed neighborhood of x in the functional space , we have:
Proof of Lemma A2.
We shall prove that
We employ the identical proof as presented in Ref. [63]. See that.
where
First, we need to establish under the assumption (H1)(i–iii) and (H3)(i) and for as , we have
as Using the properties of conditional expectation and the missing at random (MAR) mechanism, and combining assumptions (H1)(ii,iii) and (H3)(i) with the continuity property of along with Lemma A1, we derive:
Second, we will prove that as
On the one hand, we define for . Thus, forms a triangular array of martingale differences with respect to the -field and
By combining Burkholder’s inequality [91] and Jensen’s inequality, we establish that for any , there exists a constant such that:
as where we use the results from lemma (A1). Since as we then conclude that
Thus, the proof is complete. □
We will utilize arguments akin to those employed in the work of Ref. [63] to establish the asymptotic normality of the process defined as:
Lemma A3.
Assuming that the hypotheses (H1)–(H7) are fulfilled, we can state that for any such that , we have:
where
whenever
Proof of Lemma A3.
Let us introduce some notation. We put
and define . It is easily seen that
Here, for any fixed , the terms in (A4) form a triangular array of stationary martingale differences with respect to the -field . This allows us to apply the central limit theorem for discrete-time arrays of real-valued martingales (refer to Ref. [92], page 23) to establish the asymptotic normality of . This can be accomplished by verifying the following statements:
- (a)
- (b)
- holds for any (Lindeberg condition).
Proof of Part (a).
Observe first that
Making use of the condition (H2) and Lemma A1, one has
Thus, by (H1)(ii,iii), we have
The statement (a) follows then if we show that
To prove (A6), observe that
where
and
Hence, leveraging the properties of conditional expectation, we derive:
Likewise, with the assumptions (H2)(ii,iii) and (H4)(i), along with the aid of Lemma A1 once more, it follows that, as :
Again, combining Lemma A1 with conditions (H1)(ii), and (H3)(ii,iii), it is evident that:
almost surely, whenever . Consider now the term . Utilizing conditions (H1)(ii,iii) and (H2)(i) alongside Lemma A1, we can express, as :
whenever , this completes the proof of Part (a).
Proof of Part (b).
The Lindeberg condition results from Corollary 9.5.2 in Ref. ([93]), which implies that
Let and such that . Applying Hölder and Markov inequalities, one can express, for all :
where is a positive constant and . Utilizing from the condition (H3)(iii) of conditional moments, we obtain:
where the last equality follows from Lemma A1. This concludes the proof of part (b) as when . Thus, the proof is complete. □
Proof of Theorem 1.
By Lemma A3 it follows that
Thus, by Lemma A2 the proof is valid. □
Proof of Theorem 3.
Let us recall some facts. Let and . Given random measures on , we define
Say that a class of functions has uniformly integrable entropy with respect to -norm if
where
If the class possesses uniformly integrable entropy, is totally bounded for any measure . Let be an envelope of , i.e., is a measurable function mapping to such that:
Let be the set of all measures on with
and
Given random measures on , we define
Let us introduce the uniform entropy integral
We say that has uniformly integrable entropy with respect to -norm if
If the class possesses uniformly integrable entropy, is totally bounded for any measure . Let be a Gaussian process whose sample paths are contained in
Let denote the law of •. Notice that obtaining a uniform CLT essentially means that we show the following convergence
where the processes are indexed by and considered as random elements of the bounded real-valued functions on defined by
which is a Banach space equipped with the sup norm. In the following, we employ the weak convergence in the sense of Ref. [94], which we recap in the following definition. Throughout the paper, denotes the upper expectation with respect to the outer probability ; for further details and discussion, refer to Ref. [1] (p. 6) and Ref. [95] (§6.2, p. 88). □
Definition A1.
A sequence of -valued random functions converges in law to a -valued Borel measurable random function T whose law concentrates on a separable subset of , denoted , if,
where is the set of all bounded -continuous functions from into .
We set
with , and define in a similar way. Let
Let us define
To establish Theorem 3, we can rely on Theorem 2 of [96] (see also Refs. [10,13,15]). It is sufficient to demonstrate that, for all constant , as n tends to infinity:
which is implied by the following,
where we recall
In the rest of the proof, denote by and
Therefore, we have the following
We first evaluate . We have
Using the fact that (as indicated in Lemma A1), and taking into account that the class of functions has a constant envelope and is both bounded and bounded away from zero, one can obtain the following upper bound for the last equation, where C is a positive constant:
Making use of similar arguments, we infer that
We readily obtain that,
By employing arguments akin to those utilized in the proof of the previous statement, we can establish that
Using the Lindeberg conditions from the preceding proof and (A11), along with Theorem 1 of [96], we deduce that for a given and , there exists , such that:
Now, the proof of the theorem is completed by combining this last equation with Theorem 3.
References
- van der Vaart, A.W.; Wellner, J.A. Weak Convergence and Empirical Processes; Springer Series in Statistics; With applications to statistics; Springer: New York, NY, USA, 1996; pp. xvi+508. [Google Scholar] [CrossRef]
- Shorack, G.R.; Wellner, J.A. Empirical Processes with Applications to Statistics; Classics in Applied Mathematics; Society for Industrial and Applied Mathematics (SIAM): Philadelphia, PA, USA, 2009; Volume 59, pp. xli+956. [Google Scholar] [CrossRef]
- Dudley, R.M. Uniform Central Limit Theorems; Cambridge Studies in Advanced Mathematics; Cambridge University Press: Cambridge, UK, 1999; Volume 63, pp. xiv+436. [Google Scholar] [CrossRef]
- Vapnik, V.N.; Červonenkis, A.J. The uniform convergence of frequencies of the appearance of events to their probabilities. Teor. Verojatnost. i Primenen. 1971, 16, 264–279. [Google Scholar]
- Dudley, R.M. Central limit theorems for empirical measures. Ann. Probab. 1978, 6, 899–929. [Google Scholar] [CrossRef]
- Giné, E.; Zinn, J. Some limit theorems for empirical processes. Ann. Probab. 1984, 12, 929–998. [Google Scholar] [CrossRef]
- Le Cam, L. A remark on empirical measures. In A Festschrift for Erich Lehmann in Honor of His Sixty-Fifth Birthday; Wadsworth Statist./Probab. Ser.; UC Berkeley Statistics: Wadsworth, OH, USA; Belmont, CA, USA, 1983; pp. 305–327. [Google Scholar]
- Pollard, D. A central limit theorem for empirical processes. J. Aust. Math. Soc. Ser. A 1982, 33, 235–248. [Google Scholar] [CrossRef]
- Bass, R.F.; Pyke, R. A strong law of large numbers for partial-sum processes indexed by sets. Ann. Probab. 1984, 12, 268–271. [Google Scholar] [CrossRef]
- Bouzebda, S.; Soukarieh, I. Renewal type bootstrap for U-process Markov chains. Markov Process. Relat. Fields 2022, 28, 673–735. [Google Scholar]
- Alvarez-Andrade, S.; Bouzebda, S.; Lachal, A. Strong approximations for the p-fold integrated empirical process with applications to statistical tests. Test 2018, 27, 826–849. [Google Scholar] [CrossRef]
- Bouzebda, S. Some applications of the strong approximation of the integrated empirical copula processes. Math. Methods Stat. 2016, 25, 281–303. [Google Scholar] [CrossRef]
- Soukarieh, I.; Bouzebda, S. Renewal type bootstrap for increasing degree U-process of a Markov chain. J. Multivar. Anal. 2023, 195, 105143. [Google Scholar] [CrossRef]
- Bouzebda, S.; Soukarieh, I. Limit theorems for a class of processes generalizing the U-empirical process. Stochastics 2024, 1–36. [Google Scholar]
- Soukarieh, I.; Bouzebda, S. Exchangeably Weighted Bootstraps of General Markov U-Process. Mathematics 2022, 10, 3745. [Google Scholar] [CrossRef]
- Yoshihara, K.I. Conditional empirical processes defined by ϕ-mixing sequences. Comput. Math. Appl. 1990, 19, 149–158. [Google Scholar] [CrossRef]
- Eberlein, E. Weak convergence of partial sums of absolutely regular sequences. Stat. Probab. Lett. 1984, 2, 291–293. [Google Scholar] [CrossRef]
- Nobel, A.; Dembo, A. A note on uniform laws of averages for dependent processes. Stat. Probab. Lett. 1993, 17, 169–172. [Google Scholar] [CrossRef]
- Yu, B. Rates of convergence for empirical processes of stationary mixing sequences. Ann. Probab. 1994, 22, 94–116. [Google Scholar] [CrossRef]
- Bouzebda, S.; Nemouchi, B. Central limit theorems for conditional empirical and conditional U-processes of stationary mixing sequences. Math. Methods Stat. 2019, 28, 169–207. [Google Scholar] [CrossRef]
- Andrews, D.W.K.; Pollard, D. An Introduction to Functional Central Limit Theorems for Dependent Stochastic Processes. Int. Stat. Rev. Rev. Int. Stat. 1994, 62, 119–132. [Google Scholar] [CrossRef]
- Doukhan, P.; Massart, P.; Rio, E. Invariance principles for absolutely regular empirical processes. Ann. Inst. H. Poincaré Probab. Stat. 1995, 31, 393–427. [Google Scholar]
- Polonik, W.; Yao, Q. Set-indexed conditional empirical and quantile processes based on dependent data. J. Multivar. Anal. 2002, 80, 234–255. [Google Scholar] [CrossRef]
- Bosq, D. Linear Processes in Function Spaces; Lecture Notes in Statistics; Theory and Applications; Springer: New York, NY, USA, 2000; Volume 149, pp. xiv+283. [Google Scholar] [CrossRef]
- Ramsay, J.O.; Silverman, B.W. Functional Data Analysis, 2nd ed.; Springer Series in Statistics; Springer: New York, NY, USA, 2005; pp. xx+426. [Google Scholar]
- Cuevas, A. A partial overview of the theory of statistics with functional data. J. Stat. Plan. Inference 2014, 147, 1–23. [Google Scholar] [CrossRef]
- Goia, A.; Vieu, P. An introduction to recent advances in high/infinite dimensional statistics [Editorial]. J. Multivar. Anal. 2016, 146, 1–6. [Google Scholar] [CrossRef]
- Aneiros, G.; Cao, R.; Fraiman, R.; Genest, C.; Vieu, P. Recent advances in functional data analysis and high-dimensional statistics. J. Multivar. Anal. 2019, 170, 3–9. [Google Scholar] [CrossRef]
- Ling, N.; Vieu, P. Nonparametric modelling for functional data: Selected survey and tracks for future. Statistics 2018, 52, 934–949. [Google Scholar] [CrossRef]
- Chowdhury, J.; Chaudhuri, P. Multi-sample comparison using spatial signs for infinite dimensional data. Electron. J. Stat. 2022, 16, 4636–4678. [Google Scholar] [CrossRef]
- Chowdhury, J.; Chaudhuri, P. Convergence rates for kernel regression in infinite-dimensional spaces. Ann. Inst. Stat. Math. 2020, 72, 471–509. [Google Scholar] [CrossRef]
- Ferraty, F.; Vieu, P. Nonparametric Functional Data Analysis; Springer Series in Statistics; Theory and Practice; Springer: New York, NY, USA, 2006; pp. xx+258. [Google Scholar]
- Horváth, L.; Kokoszka, P. Inference for Functional Data with Applications; Springer Series in Statistics; Springer: New York, NY, USA, 2012; pp. xiv+422. [Google Scholar] [CrossRef]
- Bosq, D.; Blanke, D. Inference and Prediction in Large Dimensions; Wiley Series in Probability and Statistics; John Wiley & Sons, Ltd.: Chichester, UK; Dunod, Scotland; Paris, France, 2007; pp. x+316. [Google Scholar] [CrossRef]
- Shi, J.Q.; Choi, T. Gaussian Process Regression Analysis for Functional Data; CRC Press: Boca Raton, FL, USA, 2011; pp. xx+196. [Google Scholar]
- Zhang, J.T. Analysis of Variance for Functional Data; Monographs on Statistics and Applied Probability; CRC Press: Boca Raton, FL, USA, 2014; Volume 127, pp. xxiv+386. [Google Scholar]
- Bongiorno, E.G.; Goia, A.; Salinelli, E.; Vieu, P. An overview of IWFOS’2014. In Contributions in Infinite-Dimensional Statistics and Related Topics; Esculapio: Bologna, Italy, 2014; pp. 1–5. [Google Scholar]
- Hsing, T.; Eubank, R. Theoretical Foundations of Functional Data Analysis, with an Introduction to Linear Operators; Wiley Series in Probability and Statistics; John Wiley & Sons, Ltd.: Chichester, UK, 2015; pp. xiv+334. [Google Scholar] [CrossRef]
- Aneiros, G., Bongiorno, E.G., Cao, R., Vieu, P., Eds.; Functional statistics and related fields. In Proceedings of the 4th International Workshop on Functional and Operational Statistics, IWFOS, Corunna, Spain, 15–17 June 2017; Springer: Cham, Switzerland, 2017; pp. xxiv+288. [Google Scholar]
- Berrahou, N.; Bouzebda, S.; Douge, L. Functional uniform-in-bandwidth moderate deviation principle for the local empirical processes involving functional data. Math. Methods Stat. 2024, 33, 1–43. [Google Scholar]
- Poryvaĭ, D.V. An invariance principle for conditional empirical processes formed by dependent random variables. Izv. Ross. Akad. Nauk Ser. Mat. 2005, 69, 129–148. [Google Scholar] [CrossRef]
- Bouzebda, S.; Madani, F.; Souddi, Y. Some Asymptotic Properties of the Conditional Set-Indexed Empirical Process Based on Dependent Functional Data. Int. J. Math. Stat. 2022, 22, 77–105. [Google Scholar]
- Bouzebda, S.; Chaouch, M. Uniform limit theorems for a class of conditional Z-estimators when covariates are functions. J. Multivar. Anal. 2022, 189, 104872. [Google Scholar] [CrossRef]
- Souddi, Y.; Madani, F.; Bouzebda, S. Some characteristics of the conditional set-indexed empirical process involving functional ergodic data. Bull. Inst. Math. Acad. Sin. (New Ser.) 2021, 16, 367–399. [Google Scholar] [CrossRef]
- Bouzebda, S.; Soukarieh, I. Nonparametric conditional U-processes for locally stationary functional random fields under stochastic sampling design. Mathematics 2022, 10, 16. [Google Scholar] [CrossRef]
- Soukarieh, I.; Bouzebda, S. Weak Convergence of the Conditional U-statistics for Locally Stationary Functional Time Series. Stat. Inference Stoch. Process 2024, 16, 1–78. [Google Scholar] [CrossRef]
- Bouzebda, S.; Nezzal, A. Uniform in number of neighbors consistency and weak convergence of kNN empirical conditional processes and kNN conditional U-processes involving functional mixing data. AIMS Math. 2024, 9, 4427–4550. [Google Scholar] [CrossRef]
- Cheng, P.E. Nonparametric estimation of mean functionals with data missing at random. J. Am. Stat. Assoc. 1994, 89, 81–87. [Google Scholar] [CrossRef]
- Cheng, P.E.; Chu, C.K. Kernel estimation of distribution functions and quantiles with missing data. Stat. Sin. 1996, 6, 63–78. [Google Scholar]
- Little, R.J.A.; Rubin, D.B. Statistical Analysis with Missing Data; Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics; John Wiley & Sons, Inc.: New York, NY, USA, 1987; pp. xvi+278. [Google Scholar]
- Nittner, T. Missing at random (MAR) in nonparametric regression—A simulation experiment. Stat. Methods Appl. 2003, 12, 195–210. [Google Scholar] [CrossRef]
- Tsiatis, A.A. Semiparametric Theory and Missing Data; Springer Series in Statistics; Springer: New York, NY, USA, 2006; pp. xvi+383. [Google Scholar]
- Wang, Q.; Sun, Z. Estimation in partially linear models with missing responses at random. J. Multivar. Anal. 2007, 98, 1470–1493. [Google Scholar] [CrossRef]
- Wang, Q. Probability density estimation with data missing at random when covariables are present. J. Stat. Plan. Inference 2008, 138, 568–587. [Google Scholar] [CrossRef]
- Liang, H.; Wang, S.; Carroll, R.J. Partially linear models with missing response variables and error-prone covariates. Biometrika 2007, 94, 185–198. [Google Scholar] [CrossRef]
- Efromovich, S. Nonparametric regression with responses missing at random. J. Stat. Plan. Inference 2011, 141, 3744–3752. [Google Scholar] [CrossRef]
- Efromovich, S. Nonparametric regression with predictors missing at random. J. Am. Stat. Assoc. 2011, 106, 306–319. [Google Scholar] [CrossRef]
- Tang, N.; Zhao, P.; Zhu, H. Empirical likelihood for estimating equations with nonignorably missing data. Stat. Sin. 2014, 24, 723–747. [Google Scholar] [CrossRef]
- Müller, U.U.; Schick, A. Efficiency transfer for regression models with responses missing at random. Bernoulli 2017, 23, 2693–2719. [Google Scholar] [CrossRef][Green Version]
- Müller, U.U.; Schick, A. Efficiency for heteroscedastic regression with responses missing at random. J. Stat. Plan. Inference 2018, 196, 132–143. [Google Scholar] [CrossRef]
- Shen, Y.; Liang, H.Y. Quantile regression and its empirical likelihood with missing response at random. Stat. Pap. 2018, 59, 685–707. [Google Scholar] [CrossRef]
- Ferraty, F.; Sued, M.; Vieu, P. Mean estimation with data missing at random for functional covariables. Statistics 2013, 47, 688–706. [Google Scholar] [CrossRef]
- Ling, N.; Liang, L.; Vieu, P. Nonparametric regression estimation for functional stationary ergodic data with missing at random. J. Stat. Plan. Inference 2015, 162, 75–87. [Google Scholar] [CrossRef]
- Ling, N.; Liu, Y.; Vieu, P. Conditional mode estimation for functional stationary ergodic data with responses missing at random. Statistics 2016, 50, 991–1013. [Google Scholar] [CrossRef]
- Wang, L.; Cao, R.; Du, J.; Zhang, Z. A nonparametric inverse probability weighted estimation for functional data with missing response data at random. J. Korean Stat. Soc. 2019, 48, 537–546. [Google Scholar] [CrossRef]
- Laib, N.; Louani, D. Nonparametric kernel regression estimation for functional stationary ergodic data: Asymptotic properties. J. Multivar. Anal. 2010, 101, 2266–2281. [Google Scholar] [CrossRef]
- Didi, S.; Bouzebda, S. Wavelet Density and Regression Estimators for Continuous Time Functional Stationary and Ergodic Processes. Mathematics 2022, 10, 4356. [Google Scholar] [CrossRef]
- Didi, S.; Al Harby, A.; Bouzebda, S. Wavelet Density and Regression Estimators for Functional Stationary and Ergodic Data: Discrete Time. Mathematics 2022, 10, 3433. [Google Scholar] [CrossRef]
- Nadaraja, E.A. On a regression estimate. Teor. Verojatnost. i Primen. 1964, 9, 157–159. [Google Scholar]
- Watson, G.S. Smooth regression analysis. Sankhyā Ser. A 1964, 26, 359–372. [Google Scholar]
- Stute, W. Conditional empirical processes. Ann. Stat. 1986, 14, 638–647. [Google Scholar] [CrossRef]
- Stute, W. On almost sure convergence of conditional empirical distribution functions. Ann. Probab. 1986, 14, 891–901. [Google Scholar] [CrossRef]
- Horváth, L.; Yandell, B.S. Asymptotics of conditional empirical processes. J. Multivar. Anal. 1988, 26, 184–206. [Google Scholar] [CrossRef]
- Ferraty, F.; Mas, A.; Vieu, P. Nonparametric regression on functional data: Inference and practical aspects. Aust. N. Z. J. Stat. 2007, 49, 267–286. [Google Scholar] [CrossRef]
- Dudley, R.M. A course on empirical processes. In École d’été de Probabilités de Saint-Flour, XII—1982; Lecture Notes in Mathematics; Springer: Berlin, Germany, 1984; Volume 1097, pp. 1–142. [Google Scholar] [CrossRef]
- Billingsley, P. Convergence of Probability Measures, 2nd ed.; Wiley Series in Probability and Statistics: Probability and Statistics; A Wiley-Interscience Publication; John Wiley & Sons, Inc.: New York, NY, USA, 1999; pp. x+277. [Google Scholar] [CrossRef]
- Huber, P.J. Robust Statistics; Wiley Series in Probability and Mathematical Statistics; John Wiley & Sons, Inc.: New York, NY, USA, 1981; pp. ix+308. [Google Scholar]
- Parthasarathy, K.R. Probability Measures on Metric Spaces; Reprint of the 1967 original; AMS Chelsea Publishing: Providence, RI, USA, 2005; pp. xii+276. [Google Scholar] [CrossRef]
- Hofinger, A. The metrics of Prokhorov and Ky Fan for assessing uncertainty in inverse problems. Österreich. Akad. Wiss. Math.-Natur. Kl. Sitzungsber. II 2006, 215, 107–125. [Google Scholar] [CrossRef]
- Fan, K. Entfernung zweier zufälligen Grössen und die Konvergenz nach Wahrscheinlichkeit. Math. Z. 1944, 49, 681–683. [Google Scholar] [CrossRef]
- Bouzebda, S.; Nemouchi, B. Uniform consistency and uniform in bandwidth consistency for nonparametric regression estimates and conditional U-statistics involving functional data. J. Nonparametr. Stat. 2020, 32, 452–509. [Google Scholar] [CrossRef]
- Hall, P. Asymptotic properties of integrated square error and cross-validation for kernel estimation of a regression function. Z. Wahrsch. Verw. Geb. 1984, 67, 175–196. [Google Scholar] [CrossRef]
- Rachdi, M.; Vieu, P. Nonparametric regression for functional data: Automatic smoothing parameter selection. J. Stat. Plan. Inference 2007, 137, 2784–2801. [Google Scholar] [CrossRef]
- Dony, J.; Mason, D.M. Uniform in bandwidth consistency of conditional U-statistics. Bernoulli 2008, 14, 1108–1133. [Google Scholar] [CrossRef]
- Bouzebda, S. On the weak convergence and the uniform-in-bandwidth consistency of the general conditional U-processes based on the copula representation: Multivariate setting. Hacet. J. Math. Stat. 2023, 52, 1303–1348. [Google Scholar] [CrossRef]
- Bouzebda, S.; Taachouche, N. On the variable bandwidth kernel estimation of conditional U-statistics at optimal rates in sup-norm. Phys. A 2023, 625, 129000. [Google Scholar] [CrossRef]
- Bouzebda, S. General tests of conditional independence based on empirical processes indexed by functions. Jpn. J. Stat. Data Sci. 2023, 6, 115–177. [Google Scholar] [CrossRef]
- Shang, H.L. Bayesian bandwidth estimation for a functional nonparametric regression model with mixed types of regressors and unknown error density. J. Nonparametr. Stat. 2014, 26, 599–615. [Google Scholar] [CrossRef]
- Bouzebda, S.; Laksaci, A.; Mohammedi, M. The k-nearest neighbors method in single index regression model for functional quasi-associated time series data. Rev. Mat. Complut. 2023, 36, 361–391. [Google Scholar] [CrossRef]
- Bouzebda, S.; Laksaci, A.; Mohammedi, M. Single index regression model for functional quasi-associated time series data. Revstat 2022, 20, 605–631. [Google Scholar]
- Hall, P.; Heyde, C.C. Martingale Limit Theory and Its Application; Probability and Mathematical Statistics; Academic Press, Inc. [Harcourt Brace Jovanovich, Publishers]: New York, NY, USA; London, UK, 1980; pp. xii+308. [Google Scholar]
- Györfi, L.; Morvai, G.; Yakowitz, S.J. Limits to consistent on-line forecasting for ergodic time series. IEEE Trans. Inf. Theory 1998, 44, 886–892. [Google Scholar] [CrossRef]
- Chow, Y.S.; Teicher, H. Probability Theory, 3rd ed.; Springer Texts in Statistics; Independence, interchangeability, martingales; Springer: New York, NY, USA, 1997; pp. xxii+488. [Google Scholar] [CrossRef]
- Hoffmann-Jørgensen, J. Stochastic Processes on Polish Spaces; Various Publications Series (Aarhus); Aarhus Universitet, Matematisk Institut: Aarhus, Denmark, 1991; Volume 39, pp. ii+278. [Google Scholar]
- Kosorok, M.R. Introduction to Empirical Processes and Semiparametric Inference; Springer Series in Statistics; Springer: New York, NY, USA, 2008; pp. xiv+483. [Google Scholar] [CrossRef]
- Bae, J.; Jun, D.; Levental, S. The uniform CLT for martingale difference arrays under the uniformly integrable entropy. Bull. Korean Math. Soc. 2010, 47, 39–51. [Google Scholar] [CrossRef][Green Version]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).