Next Article in Journal
Optimum, Suboptimal and Solar Sailing Orbital Maneuvers for a Spacecraft Orbiting the Earth
Next Article in Special Issue
Numerical Modeling of Peridynamic Richards’ Equation with Piecewise Smooth Initial Conditions Using Spectral Methods
Previous Article in Journal
The Metastable State and the Finite-Size Effect of the First-Order Phase Transition
Previous Article in Special Issue
Analysis of an M/G/1 Retrial Queue with Delayed Repair and Feedback under Working Vacation policy with Impatient Customers
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Empirical Bayes Decision for a Generalized Exponential Distribution with Contaminated Data

1
Department of Statistics, College of Science, Wuhan University of Technology, Wuhan 430070, China
2
Department of Epidemiology and Biostatistics, College of Public Health, University of South Florida, Tampa, FL 33612, USA
*
Author to whom correspondence should be addressed.
Symmetry 2023, 15(2), 511; https://doi.org/10.3390/sym15020511
Submission received: 9 January 2023 / Revised: 2 February 2023 / Accepted: 10 February 2023 / Published: 14 February 2023
(This article belongs to the Special Issue Mathematical Models: Methods and Applications)

Abstract

:
The two-sided and one-sided empirical Bayes test (EBT) rules for the parameter of a generalized exponential distribution with contaminated data (errors in variables) are constructed by a deconvolution kernel method, respectively. Under the type of the supersmooth error distributions and the supersmooth errors with the error level can be controlled situations, the asymptotically optimal uniformly over a class of prior distributions and uniform rates of convergence of the corresponding regret for the proposed EBT rules are obtained with suitable conditions. The example study shows that the assumptions and conditions of the main results of this paper are satisfied easily by calculating.

1. Introduction

Much of the more recent literature has looked at the empirical Bayes test (EBT). EBT for the parameter of some common distributions are investigated [1,2,3,4]. Under non-identical components case, empirical Bayes testing for a lifetime guarantee is considered for the double parameter exponential distribution [5]. Merging Bayesian and empirical Bayes posterior distributions in total variation is discussed [6]. A double empirical Bayes decision is obtained for multi-experiment studies by means of an empirical Bayes analytical method [7]. In order to study the relationship between empirical Bayes posterior distributions and false discovery rate control, a spike and slab empirical Bayes multiple testing is constructed [8]. An empirical Bayes multiple testing procedure for the sparse sequence model is investigated [9]. In earlier times, on the EBT for the continuous one-parameter exponential family has lots of work and is asymptotically optimal and the optimal convergence rates of EBT are obtained [10,11,12,13,14]. Most of the studies have discussed the empirical Bayes decision problem in the case of non-contaminated data, which is not the case for pure data cases. However, in practical application problems, contaminated data (errors in variables) are involved in many fields, and it has been widely studied [15,16,17,18,19]. In recent literature, the one-sided empirical Bayes decision problem is investigated for the continuous one-parameter exponential family with contaminated data [20].
Suppose that the random variable X has the generalized exponential distribution (GED) with probability density function (PDF) of the following forms [21]
f ( x | θ ) = ( μ x + θ ) exp ( θ x μ x 2 / 2 ) ,
where θ is a unknown parameter with the natural space Θ = { θ > 0 | Ω f ( x | θ ) d x = 1 } . In this article, we assume μ > 0 is a known constant, and the sample space is Ω = { x | x > 0 } .
GED is also called linear exponential distribution, it is a combinatorial distribution, and the exponential and Rayleigh distributions are considered as special cases of GED when θ = 0 and μ = 0 , respectively. The hazard function of GED is a linear function about time and age in linear exponential models, and it is one of the reasonable models for lifetime distributions of random phenomena. Progressive type-II censored competing risks data when the lifetimes are assumed to be a linear exponential distribution [21]. Recurrence relations for single and product moments of generalized order statistics have been derived with the linear exponential distribution [22]. The linear exponential distribution has been used in the area of reliability and life-testing see, for example, Broadbent [23] and Bain [24].
The two-sided and one-sided EBT rules are constructed for GED with contaminated data in this article. Deconvolution kernel method is employed to develop the two-sided and one-sided EBT rules with contaminated data, respectively. For errors in the variables model, deconvolution kernel method can eliminate the effect of the additive noise kernel density estimation. Furthermore, under the supersmooth error distributions and the supersmooth errors, the error level can be controlled, the asymptotically optimal and uniform rates of convergence are obtained with suitable conditions.
In practical problems, we often encounter measurement errors due to observation conditions, so the analysis of contaminated data is very important. For the pair random variables ( X , θ ) , assume that θ has a prior distribution G ( θ ) , X is one dimensional real random variable with a marginal density function f X | θ ( x ) when θ is given, ( X , θ ) is not directly observable. We observe only Y, where Y = X + ε , and ε are the random error. Suppose that ε follows a known distribution F ε on ( , ) , and independent on ( X , θ ) .
Firstly, we consider the two-sided test problem as follows
H 0 : θ 1 θ θ 2 H 1 : θ < θ 1 o r θ > θ 2
where θ 1 and θ 2 are known constants.
Define θ 0 = ( θ 1 + θ 2 ) / 2 , γ 0 = ( θ 2 θ 1 ) / 2 , then (2) is equivalent to
H 0 * : | θ θ 0 | γ 0 H 1 * : | θ θ 0 | > γ 0
For hypothesis test (3), let i = 0 , 1 . taking 0–1 weighted square loss function in the following
L i ( θ , d i ) = ( 1 i ) a [ ( θ θ 0 ) 2 γ 0 2 ] I [ | θ θ 0 | > γ 0 ] + i a [ γ 0 2 ( θ θ 0 ) 2 ] I [ | θ θ 0 | γ 0 ] ,
where a > 0 is a constant, and d = { d 0 , d 1 } is the decision space, d 0 indicates accepting H 0 * , d 1 indicates rejecting H 0 * .
When i = 0 , then we obtain L 0 θ , d 0 = a θ θ 0 2 γ 0 2 I θ θ 0 γ 0 ; when i = 1 then we have L 1 θ , d 1 = a γ 0 2 θ θ 0 2 I θ θ 0 γ 0 .
Let the parameter θ be distributed according to an unknown prior G ( θ ) , and assume that G ( θ ) belongs to the following class of distributions
ϑ = { G : G is a prior on Ω such that sup x f X ( m ) ( x ) B } ,
where f x m ( x ) denotes the m order derivative of f X ( x ) = Θ f X | θ ( x ) d G ( θ ) , which is the marginal density of X, and m 2 is an integer, B > 0 is a constant.
We define the randomized decision rule for hypothesis test (3) as follows
δ ( y ) = P ( a c c e p t i n g H 0 * | Y = y ) .
Then, the Bayes risk of δ ( y ) is given by
R ( δ ( y ) , G ( θ ) ) = Θ [ L 0 ( θ , d 0 ) δ ( y ) + L 1 ( θ , d 1 ) ( 1 δ ( y ) ] f Y | θ ( y ) d G ( θ ) d y = a β G ( y ) δ ( y ) d y + C G ,
where
C G = Θ L 1 ( θ , d 1 ) d G ( θ ) , β G ( y ) = Θ [ ( θ θ 0 ) 2 γ 0 2 ] f Y | θ ( y ) d G ( θ )
with
f Y ( y ) = Θ f Y | θ ( y ) d G ( θ ) .
and f Y | θ ( y ) denotes the density of Y given θ , i.e., f Y | θ ( y ) = f X | θ ( y x ) d F ε ( x ) .
Let P X ( x ) = Θ e θ x 1 2 μ x 2 d G ( θ ) , and
P X ( 1 ) ( x ) = Θ ( μ x + θ ) e θ x 1 2 μ x 2 d G ( θ ) = f X ( x ) ,
thus, we have x f X ( x ) d x = P X ( x ) .
From (7), we obtain
β G ( y ) = f X ( 2 ) ( y x ) d F ε ( x ) + Q ( y x ) f X ( 1 ) ( y x ) d F ε ( x ) + ϕ ( y x ) f X ( y x ) d F ε ( x ) μ Q ( y x ) p X ( y x ) d F ε ( x ) ,
where f X ( x ) = θ f X θ ( x ) d G ( θ ) is the marginal PDF of random variable X, f X ( 1 ) ( x ) and f X ( 2 ) ( x ) denote the first order and the second order derivative of f X ( x ) , respectively.
In (9), let
Q ( y x ) = 2 u ( y x ) + 2 θ 0 a n d ϕ ( y x ) = μ 2 ( y x ) 2 + 2 μ θ 0 ( y x ) + 3 μ + θ 0 2 γ 0 2 .
So from (9), we define the best Bayes decision minimizing R ( δ ( y ) , G ( θ ) ) as follows
δ G ( y ) = 1 , if β G ( y ) 0 , 0 , elsewhere .
A test is called a Bayes test with respect to G ( θ ) if
R δ G , G = inf δ R δ , G = a β G ( y ) δ G ( y ) d y + C G .
Since G ( θ ) is unknown in this paper, δ G ( y ) is unavailable to use, so this leads us to use the empirical Bayes approach in the following.
The rest of this article is organized as follows. In Section 2, the two-sided EBT rule for GED with contaminated data is proposed; Section 3 is devoted to obtaining asymptotic properties and the uniform convergence rate of two-sided EBT rule; the main results of two-sided EBT are proved in Section 4; Section 5 investigated one-sided EBT rule for GED with contaminated data; an example study is presented in Section 6.

2. The Proposed Two-Sided EBT Rule of GED with Contaminated Data

It is well-known that we usually make the following assumptions in the empirical Bayes framework, let ( Y 1 , θ 1 ) , ( Y 2 , θ 2 ) , ⋯, ( Y n , θ n ) , and ( Y , θ ) be independent pair of random variables, the parameters θ i ( 1 i n ) and θ have a common prior distribution G ( θ ) ; Y i ( 1 i n ) and Y are distributed according to the same marginal distribution F Y with density function f Y ( y ) = Θ f Y θ ( y ) d G ( θ ) , Y 1 , ⋯, Y n denotes historical samples and Y is called the present sample.
Deconvolution is a very important problem. It is often encountered when modeling unobservable data or to estimate conditional moments useful in likelihood calculations. When dealing with non-parametric estimation of priors or in measurement error models, the sample data are noisy because of the measurement error; deconvolution kernel method is adopted to eliminate the effect of the additive noise kernel density estimation. In order to obtain the empirical Bayes decision, we employ the deconvolution kernel method in the following by Fan [17,18,19].
Let φ Y ( t ) and φ ε ( t ) be the characteristic function (c.f.) of Y and ε , respectively. Note that
f X ( x ) = 1 2 π exp ( i t x ) φ Y ( t ) φ ε ( t ) d t
Thus, a deconvoluted kernel density estimation of f X ( r ) ( x ) ( r = 0 , 1 , 2 ) is defined by
f n ( r ) ( x ) = 1 2 π exp ( i t x ) ( i t ) r φ K t h n φ n ( t ) φ ε ( t ) d t ,
where 0 < h n 0 as n and φ n ( t ) = 1 n j = 1 n exp i t Y j is called the empirical c.f. of random variable Y. Note that f X ( 0 ) ( x ) = f X ( x ) and f n ( 0 ) ( x ) = f n ( x ) .
We can also rewrite (13) as kernel type of estimation as follows
f n ( r ) ( x ) = 1 n h n 1 + r j = 1 n K n r x Y j h n , r = 0 , 1 , 2 .
where
K n r ( x ) = 1 2 π exp ( i t x ) ( i t ) r φ K ( t ) φ ε t / h n d t
We define an estimator of the p X ( x ) of the random variable X by
p n ( x ) = M n x f n ( t ) d t ,
where f n ( x ) is the kernel density estimator given by (13), and M n ( ) is a sequence of constants.
Hence, we define an estimator of the β G ( y ) as follows
β n ( y ) = f n ( 2 ) ( y x ) d F ε ( x ) + Q ( y x ) f n ( 1 ) ( y x ) d F ε ( x ) + ϕ ( y x ) f n ( y x ) d F ε ( x ) μ Q ( y x ) p n ( y x ) d F ε ( x ) .
Furthermore, an empirical Byes test rule is defined as
δ n ( y ) = 1 , if β n ( y ) 0 0 , elsewhere .
In the following, let E be the expectation with respect to the joint distribution of Y 1 , Y 2 , , Y n . Then, the overall Bayes risk of δ n ( y ) would be
R δ n , G = a β G ( y ) E δ n ( y ) d y + C G .
By the definition, for any G ( θ ) ϑ , if sup G ϑ R δ n , G R δ G , G = O n q , where q > 0 , then δ n ( y ) is called asymptotically optimal uniformly with uniform convergence rate O n q .

3. Asymptotic Properties of Two-Sided EBT Rule

In this section, asymptotic properties of R δ n , G R δ G , G be investigated, some assumptions on the kernel function K ( x ) and the error variable ε are given in the following.
(A1)
The K ( x ) is a symmetric function about zero on ( , + ) and satisfies
K ( x ) d x = 1 , x j K ( x ) d x = 0 for j = 1 , , ( r 1 ) and x r K ( x ) d x 0 for some integer r > 0 ,
(A2)
φ K ( t ) is a symmetric function, having m + 2 bounded integral derivatives on ( , + ) ,
(A3)
φ K ( t ) = 1 + O | t | m as t 0 ,
(A4)
The characteristic function of ε satisfies φ ε ( t ) 0 for any t,
(A5)
Θ θ 2 f X θ ( y x ) d F ε ( x ) d G ( θ ) < uniformly in y, and
(A6)
for some 0 < λ < 1 , β G ( y ) 1 λ d y < and
β G ( y ) 1 λ | Q ( y x ) | d F ε ( x ) λ d y < and
β G ( y ) 1 λ | ϕ ( y x ) | d F ε ( x ) λ d y < , where β G ( y ) is given by (9).
Next, theorem below about the two-sided EBT establish the rates of convergence of the regret R δ n , G R δ G , G , where R δ G , G and R δ n , G are given by (11) and (18), respectively.
Theorem 1.
For some integer m 2 and constants 0 < λ < 1 , ϑ is defined by (4). Suppose that K ( x ) and F ε ( x ) are such that (A1)–(A6) hold, and the following conditions are satisfied:
(B1)
φ K ( t ) = 0 for | t | 1 ,
(B2)
φ s ( t ) | t | β 0 exp | t | β / γ γ 0 as | t | for some positive constants β , γ , γ 0 and a constant β 0 .
Then, by the choosing the bandwidth h n = ( 4 / γ ) 1 / β ( log n ) 1 / β , we obtain
sup G ϑ R δ n , G R δ G , G = O ( log n ) λ ( m 2 ) / β .
Remark 1.
If its characteristic function φ ε ( t ) satisfies condition (B2) of Theorem 1, then the distribution of a random variable ε is called supersmooth of order β. The common examples of supersmooth distributions are normal, Cauchy, mixture normal, etc. In practice, the conditions of Theorem 1 are easy to verify. It can be seen from the result of the Theorem 1 that the rate of convergence of EBT is very slow for very common error distributions, such as normal. Fan [17,18] pointed out the supersmooth error distribution will result in a worse convergence rate than of the smooth distribution.
It appears that the optimal rate of convergence for Gaussian deconvolution is extremely slow. Since the normal distribution is frequently used in applications, we need to study how to large a noise level is acceptable. Thus, considering the following model, let us assume that the data Y 1 , , Y n are independent identical distribution samples from
Y = X + ε ,
where ε = σ 0 ε ˜ , σ 0 parameterizes the noise level.
Theorem 2.
For some integer m 2 and constants 0 < λ < 1 , ϑ is defined by (4). Suppose that K ( x ) and F ε ( x ) are such that (A1)–(A6) hold with ε = σ 0 ε ˜ . Then, let σ 0 = O n 1 / ( 2 m + 1 ) and by choosing the bandwidth h n = O n 1 / ( 2 m + 1 ) , we have
sup G ϑ R n R G = O n λ ( m 2 ) / ( 2 m + 1 ) .
Remark 2.
Although all the data are contaminated with supersmooth errors, the results of Theorem 2 can also be as good as that of the uncontaminated data case. Suppose that all the data are contaminated with supersmooth errors, while the error level can be controlled, namely, φ ε ( t ) = φ ε ˜ σ 0 t . Fan [19] had been considered model (20). Theorem 2 indicates that the convergence rate is also very slow. The result of the following Lemma 3 is as good as ordinary smooth errors distribution but the result of the following Lemma 4 cause to the worse convergence rate of empirical Bayes estimator.

4. Proofs

In this section, first we need some lemmas to prove the main results of this paper. Lemmas 1 and 2 are due to Fan [17,18], Lemmas 3, and 4 are due to Fan [19]. The proof of Lemma 4 can be found in Johns and Van Ryzin [10]. Theorems 1 and 2 shall be proved, since the proofs of Theorems 1 and 2 are similar, only Theorem 1 is proved in detail. In the following, c , c 1 , c 2 , always stand for some positive constants and may be different even with the same notations.
Lemma 1.
Let f n ( r ) ( x ) be given by (14), under the assumptions (A1)–(A4) and the conditions (B1)–(B2) of Theorem 1 are satisfied, by the choosing the bandwidth h n = ( 4 / γ ) 1 / β ( log n ) 1 / β , we have
sup x sup G ϑ E f n ( r ) ( x ) f X ( r ) ( x ) 2 c ( log n ) 2 ( m r ) / β ,
where f X ( r ) ( x ) denotes the r order derivative of f X ( x ) and ϑ is given by (4).
Lemma 2.
Let p n ( x ) be given by (15), suppose that φ K ( t ) is a symmetric function, having m + 3 bounded integrable derivatives on ( , ) , and satisfying φ K ( t ) = 1 + O | t | m + 1 as t 0 . Under the assumption (A4) and the conditions (B1)–(B2) of Theorem 1 hold, with the choice h n = ( 4 / γ ) 1 / β ( log n ) 1 / β of the bandwidth and M n = n 1 / 3 , we have
sup P X Ω * sup G ϑ E p n ( x ) p X ( x ) 2 c ( log n ) 2 ( m + 1 ) / β
where ϑ is given by (4), and Ω * = p : p X ( x ) ϑ , p ( n ) + 1 p ( n ) = o ( log n ) ( m + 1 ) / β .
Lemma 3.
Let f X ( r ) ( x ) be given by (14). If the assumptions of (A1)–(A4) hold, let σ 0 = O n 1 / ( 2 m + 1 ) , then by choosing the bandwidth h n = O n 1 / ( 2 m + 1 ) , we have
sup x Ω sup G ϑ E n f n ( r ) ( x ) f X ( r ) ( x ) 2 c n 2 ( m r ) / ( 2 m + 1 ) ,
where f X ( r ) ( x ) denote the r order derivative of f X ( x ) and ϑ is given by (4).
Lemma 4.
Let p n ( x ) is given by (15), suppose that ϕ K ( · ) and ϕ ε ( · ) are bounded, respectively. Let K ( x ) satisfy (A1)–(A4) with m = 2 , and σ 0 = O n 1 / 5 , then we have
sup x Ω sup G ϑ E n p n ( x ) p X ( x ) 2 c n 1 .
Lemma 5.
Let R δ G , G and R δ n , G be defined by (11) and (18), respectively, then
0 R δ n , G R δ G , G a β G ( y ) p β n ( y ) β G ( y ) β G ( y ) d y ,
where β G ( y ) and β n ( y ) are given by (9) and (16), respectively.
Proof of Theorem 1.
By Lemma 5 and by the Markov inequality, for any 0 < λ < 1 ,
0 R δ n , G R δ G , G a β G ( y ) p β n ( y ) β G ( y ) β G ( y ) d y a β G ( y ) 1 λ E β n ( y ) β G ( y ) λ d y .
By applying the C r -inequality followed by Lyapunov’s inequality and using Fubini’s Theorem, we obtain
E β n ( y ) β G ( y ) λ c 1 E f n ( 2 ) ( y x ) f G ( 2 ) ( y x ) d F ε ( x ) λ + E Q ( y x ) f n ( 1 ) ( y x ) f G ( 1 ) ( y x ) d F ε ( x ) λ + E ϕ ( y x ) f n ( y x ) f G ( y x ) d F ε ( x ) λ μ E Q ( y x ) p n ( y x ) p G ( y x ) d F ε ( x ) λ c 1 E f n ( 2 ) ( y x ) f G ( 2 ) ( y x ) d F ε ( x ) λ + c 1 | Q ( y x ) | E f n ( 1 ) ( y x ) f G ( 1 ) ( y x ) d F ε ( x ) λ + c 1 | ϕ ( y x ) | E f n ( y x ) f G ( y x ) d F ε ( x ) λ + c 2 | Q ( y x ) | E p n ( y x ) p G ( y x ) d F ε ( x ) λ .
Furthermore, by (25) and (26), we obtain
sup G ϑ ( R δ n , G R δ G , G β G ( y ) 1 λ sup G ϑ E β n ( y ) β G ( y ) λ d y c 1 β G ( y ) 1 λ × sup G ϑ E f n ( 2 ) ( y x ) f G ( 2 ) ( y x ) d F ε ( x ) λ d y + c 1 β G ( y ) 1 λ × | Q ( y x ) | sup G ϑ E f n ( 1 ) ( y x ) f G ( 1 ) ( y x ) d F ε ( x ) λ d y + c 1 β G ( y ) 1 λ × | ϕ ( y x ) | sup G ϑ E f n ( y x ) f G ( y x ) d F ε ( x ) λ d y + c 2 β G ( y ) 1 λ × | Q ( y x ) | sup G ϑ E p n ( y x ) p G ( y x ) d F ε ( x ) λ d y = A n + B n + C n + D n
From Lemmas 1 and 2, by the assumption conditions of Theorem 1, we have
A n c 3 ( log n ) λ ( m 2 ) / β β G ( y ) 1 λ d y c ( log n ) λ ( m 2 ) / β , B n c 4 ( log n ) λ ( m 1 ) / β β G ( y ) 1 λ | Q ( y x ) | d F ε ( x ) λ d y
c ( log n ) λ ( m 1 ) / β ,
C n c 5 ( log n ) λ m / β β G ( y ) 1 λ | ϕ ( y x ) | d F ε ( x ) λ d y c ( log n ) λ m / β , D n c 6 ( log n ) λ ( m + 1 ) / β β G ( y ) 1 λ | Q ( y x ) | d F ε ( x ) λ d y
c ( log n ) λ ( m + 1 ) / β
Substituted (27) by (28) to (31), we obtain
sup G ϑ R δ n , G R δ G , G = O ( log n ) λ ( m 2 ) / β
So the proof of Theorem 1 was completed. □
Proof of Theorem 2.
The proof is similar to that of Theorem 1 above, except that we let Lemmas 3 and 4 in the place of Lemmas 1 and 2 in the proof of Theorem 1, respectively. □

5. One-Sided EBT Rule and Its Asymptotic Properties

In this section, we study one-sided EBT for the parameter θ of GED with contaminated data. Considering the problem of testing the hypotheses H 0 : θ θ 0 versus H 1 : θ > θ 0 , where θ 0 be a known positive constant. Let linear loss function of testing the hypotheses as follows
L 0 θ , d 0 = a θ θ 0 I θ > θ 0 , L 1 θ , d 1 = a θ θ 0 I θ θ 0 ,
where a is a positive constant and d = d 0 , d 1 is the action space, d 0 indicates accepting H 0 , d 1 indicates rejecting H 0 , I [ A ] is the indicator of the set A.
The same as above, assume that X is not directly observable and because of measurement error or the nature of environment, we can only observe Y = X + ε , where the error variable ε has a known distribution F ε on ( , ) . It is assumed that ε and ( X , θ ) are independent. It is assumed that the parameter θ is a realization of a random variable having an unknown prior distribution G ( θ ) over the natural parameter space Θ . Let randomized decision rule for the preceding testing problem is δ * ( y ) = P a c c e p t i n g   H 0 Y = y . For one-sided test, we assume that G ( θ ) belongs to the following class of distributions
ϑ * = G : G is a prior on Ω such that sup x f X ( m ) ( x ) B ,
where f X ( m ) ( x ) denotes the m order derivative of f X ( x ) = Θ f X θ ( x ) d G ( θ ) , which is the marginal density of X, and m 1 is an integer, B > 0 is a constant.
Let R δ * , G denotes the Bayes risk of the test δ * when G is the prior distribution, it can be expressed as
R δ * ( y ) , G ( θ ) = Θ Ω L 0 θ , d 0 δ * ( y ) + L 1 θ , d 1 1 δ * ( y ) f Y θ ( y ) d y d G ( θ ) = a Ω β G * ( y ) δ * ( y ) d y + C G ,
where
C G = Θ L 1 θ , d 1 d G ( θ ) , β G * ( y ) = Θ θ θ 0 f Y θ ( y ) d G ( θ ) .
From (35), we obtain
β G * ( y ) = μ p X ( y x ) d F ε ( x ) τ ( y x ) f X ( y x ) d F ε ( x ) f X ( 1 ) ( y x ) d F ε ( x ) ,
where τ ( y x ) = μ ( y x ) + θ 0 .
Therefore, the Bayes test δ G * can be presented as
δ G * ( y ) = 1 , if β G * ( y ) 0 0 , if β G * ( y ) > 0 .
The Bayes risk of δ G * ( y ) is
R δ G * , G = inf δ * R δ * , G = a Ω β G * ( y ) δ G * ( y ) d y + C G .
Thus, we defined the estimation of β G * ( y ) as
β n * ( y ) = μ p n ( y x ) d F ε ( x ) τ ( y x ) f n ( y x ) d F ε ( x ) f n ( 1 ) ( y x ) d F ε ( x ) .
Furthermore, one-sided empirical Byes test rule is defined by
δ n * ( y ) = 1 , if β n * ( y ) 0 0 , elsewhere .
Then, the overall Bayes risks of δ n * ( y ) would be
R δ n * , G = a Ω β n * ( y ) E δ n * ( y ) d y + C G .
It is necessary state that Lemmas 1–4 still hold over a class of new prior distributions ϑ * for one-sided EB decision problem. So by Lemmas 1–5, Theorem below establish the rates of convergence of the regret R δ n * , G R δ G * , G , where R δ G * , G and R δ n * , G are given by (38) and (41), respectively. For one-sided EBT, we assume that the following conditions are satisfied:
(C1)
Θ | θ | f X θ ( y x ) d F ε ( x ) d G ( θ ) < uniformly in y, and
(C2)
for some 0 < λ < 1 , β G * ( y ) 1 λ d y < and
β G * ( y ) 1 λ | τ ( y x ) | d F ε ( x ) λ d y < , where β G * ( y ) is given by (36).
Theorem 3.
For any 0 < λ < 1 , let ϑ * be defined by (33), suppose that K ( x ) , such that (A1)–(A4) and (B1)–(B2) of Theorem 1 hold and satisfying conditions (C1) and (C2). Then, by choosing the bandwidth h n = ( 4 / γ ) 1 / β ( log n ) 1 / β , we obtain
sup G g * R δ n * , G R δ G * , G = O ( log n ) λ ( m 1 ) / β .
Theorem 4.
For any 0 < λ < 1 and some integer m 1 , let ϑ * be defined by (33), suppose that K ( x ) and F ε ( x ) are such that (A1)–(A4) hold with ε = σ 0 ε ˜ , and satisfying conditions (C1) and (C2). Then, let σ 0 = O n 1 / ( 2 m + 1 ) and by choosing the bandwidth h n = O n 1 / ( 2 m + 1 ) , we have
sup G ϑ * R δ n * , G R δ G * , G = O n λ ( m 1 ) / ( 2 m + 1 ) .
Remark 3.
For one-sided EBT, Similar to Theorem 1, the supersmooth distribution of a random variable ε is also considered to Theorem 3, its characteristic function φ ε ( t ) satisfies condition (B2). Under all the data are contaminated while the error level can be controlled situation, for model (20), by Lemmas 1–5, Theorem 4, Theorem 4 obtained the rate of convergence of one-sided EBT, this result can also be as good as that of the uncontaminated data case.
Proof of Theorem 3.
By Lemma 5 and by the Markov inequality, for 0 < λ < 1 ,
0 R δ n * , G R δ G * , G a β G * ( y ) p β n * ( y ) β G * ( y ) β G * ( y ) d y a β G * ( y ) 1 λ E β n * ( y ) β G * ( y ) λ d y .
By applying the C r -inequality followed by Lyapunov’s inequality and using Fubini’s Theorem, we obtain
E β n * ( y ) β G * ( y ) λ c 1 E f n ( 1 ) ( y x ) f G ( 1 ) ( y x ) d F ε ( x ) λ + E τ ( y x ) f n ( y x ) f G ( y x ) d F ε ( x ) λ μ E p n ( y x ) p G ( y x ) d F ε ( x ) λ c 1 E f n ( 1 ) ( y x ) f G ( 1 ) ( y x ) d F ε ( x ) λ + c 1 | τ ( y x ) | E f n ( y x ) f G ( y x ) d F ε ( x ) λ + c 2 E p n ( y x ) p G ( y x ) d F ε ( x ) λ .
Furthermore, by (44) and (45), we obtain
sup G ϑ * R δ n * , G R δ G * , G β G * ( y ) 1 λ sup G ϑ * E β n * ( y ) β G * ( y ) λ d y c 1 β G * ( y ) 1 λ × sup G ϑ * E f n ( 1 ) ( y x ) f G ( 1 ) ( y x ) d F ε ( x ) λ d y + c 1 β G * ( y ) 1 λ × | τ ( y x ) | sup G ϑ * E f n ( y x ) f G ( y x ) d F ε ( x ) λ d y + c 2 β G * ( y ) 1 λ × sup G ϑ * E p n ( y x ) p G ( y x ) d F ε ( x ) λ d y = A n + B n + C n .
From Lemmas 1 and 2, by the assumption conditions of Theorem 3, we have
A n c 3 ( log n ) λ ( m 1 ) / β β G * ( y ) 1 λ d y c ( log n ) λ ( m 1 ) / β ,
    B n c 4 ( log n ) λ m / β β G * ( y ) 1 λ | τ ( y x ) | d F ε ( x ) λ d y c ( log n ) λ m / β ,
C n c 6 ( log n ) λ ( m + 1 ) / β β G * ( y ) 1 λ d y c ( log n ) λ ( m + 1 ) / β .
Substituted (46) by (47) to (49), we obtain
sup G ϑ * R δ n * , G R δ G * , G = O ( log n ) λ ( m 1 ) / β .
So the proof of Theorem 3 was completed. □
Proof of Theorem 4.
The proof is similar to that of Theorem 3 above, except that we let Lemmas 3 and 4 in the place of Lemmas 1 and 2 in the proof of Theorem 3, respectively. □

6. An Example Study

In this section, an example study is presented to verify the GED and the prior distribution which satisfies theorems in this paper exist. Suppose that the probability density function of random variable X as follows
f ( x θ ) = ( θ + 2 x ) e θ x + x 2 ,
where θ is a given parameter, and the sample space is Ω = { x x > 0 } , the parameter space is Θ = { θ θ > 0 } . Let the prior distribution of parameter θ is
g ( θ ) = 1 Γ ( r ) θ ( r + 1 ) e 1 / θ ,
where r is a positive known parameter and θ is a positive unknown parameter. By calculating we obtain
f X ( x ) = 0 f ( x θ ) g ( θ ) d θ = e x 2 r ( x + 1 ) r + 1 + 2 x ( x + 1 ) r = e x 2 q ( x ) ,
where q ( x ) = r ( x + 1 ) r + 1 + 2 x ( x + 1 ) r .
Obviously, f X ( m ) ( x ) is existence, and f X ( m ) ( x ) = e x 2 p ( x ) ( x + 1 ) 2 m ( r + 1 ) , where p ( x ) is polynomial with respect to x and ( p ( x ) ) 2 m 1 ( r + 1 ) 1 . Since lim x f X ( m ) ( x ) = 0 , f X ( m ) ( x ) is bounded on x Ω , where m 1 is an integer. Thus, G ( θ ) ϑ and G ( θ ) ϑ * are satisfied.
Let the supersmooth error distribution F ε be N(0,1), it is easy to check that φ ε ( t ) satisfies the condition (B2) of Theorem 1. Moreover, we can take b n = 2 ( log n ) 1 / 2 .
For the two-sided EBT case, we used the following kernel function
K ( x ) = 6144 sin x π x 5 + 18320 cos x π x 6 3225600 sin x π x 7 + 3360 × 13 ! π x 16 cos x ,
where < x < , and we choose the Fourier transform of the above kernel is
φ K ( t ) = 1 t 2 4 I [ | t | 1 ] .
Then, the deconvolution kernel density estimators (14) are the following kernels, for the type of supersmooth error distribution case,
K n l ( x ) = 1 2 π ( cos t x i sin t x ) ( i t ) l 1 t 4 4 exp t 2 2 h n 2 d t
Similar to [20], it is easily shown that assumptions and conditions of Theorems 1 and 2 are satisfied with the above specifications.
For one-sided EBT case, we choose φ K ( t ) = 1 t 2 3 I [ | t | 1 ] , then the Fourier transform of φ K ( t ) is a second order kernel as follows
K ( x ) = 48 cos x π x 4 1 15 x 2 144 sin x π x 5 2 15 x 2 , < x < .
The corresponding deconvolution kernel density estimators (14) are kernel in the following
K n l ( x ) = ( 1 ) l π 0 1 t l ( cos t x ) 1 l ( sin t x ) l 1 t 2 3 exp t 2 2 h n 2 d t , l = 0 , 1 .
Then, similar to literature [20], it is easily shown that assumptions and conditions of Theorem 3 and 4 are satisfied with the above specifications.
Actually, we can take φ K ( t ) = 1 t 2 k I [ | t | 1 ] , when k 4 , at the same time it may suit for one-sided and two-sided EBT. However, if k = 3 , the second order kernel (53) only satisfies kernel conditions of Theorems 3 and 4.

7. Conclusions

In this paper, we had studied the empirical Bayes decision for the parameter of a generalized exponential distribution with contaminated data, two-sided and one-sided empirical Bayes test rules were constructed by a deconvolution kernel method, respectively. For the type of the supersmooth error distributions the asymptotically optimal uniformly over a class of prior distributions and uniform rates of convergence of the corresponding regret for the proposed EBT rules are obtained under the conditions of Theorems 1 and 3. Furthermore, we also investigated the supersmooth errors with the error level can be controlled case, Y = X + ε , where ε = σ 0 ε ˜ , σ 0 parameterizes the noise level, that is, φ ε ( t ) = φ ε ˜ σ 0 t , and obtained Theorems 2 and 4. As an example, let the supersmooth error distribution F ε be N(0,1), we proved the assumptions and conditions of the main results of this paper are satisfied easily by calculating.
In many practical problems, not all the observations are contaminated, but there may be a partially contaminated case. Suppose that only 100 p %   ( 0 < p < 1 ) of the data are measured with error and the remaining data are error free. We consider the mode Y = X + ε , taking P ( ε = 0 ) = 1 p and P ε = ε * = p , where ε * is an error variable with distribution F ε * and the characteristic function φ ε * . Thus, the characteristic function of ε is denoted by φ ε ( t ) = ( 1 p ) + p φ ε * ( t ) . In this regard, we can consider extending the current research work to this situation, which is believed to be a very interesting topic.

Author Contributions

Conceptualization, J.C.; Data curation, H.Q.; Formal analysis, H.Q.; Funding acquisition, J.C.; Investigation, H.Q.; Methodology, H.Q.; Project administration, Z.Y. and Y.H.; Supervision, J.C.; visualization, Z.Y.; Validation, Y.H.; Writing—original draft, H.Q.; Writing—review and editing, H.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China under Grant 81671633 to J. Chen.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data in this paper have been presented in the manuscript.

Acknowledgments

Many thanks to reviewers for their positive feedback, valuable comments and constructive suggestions that helped improve the quality of this article. Many thanks to editors’ great help and coordination for the publish of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Karunamuni, R.; Li, J.; Wu, J. Robust empirical Bayes tests for continuous distributions. J. Stat. Plan. Inference 2010, 140, 268–282. [Google Scholar] [CrossRef]
  2. Chen, L.S.; Yang, M.C. Empirical Bayes testing for equivalence. J. Stat. Plan. Inference 2011, 141, 2670–2681. [Google Scholar] [CrossRef]
  3. Yuan, M.; Wei, L. Two-sided empirical Bayes test for location parameter in the gamma distribution. Commun. Stat.-Theory Methods 2017, 46, 4215–4225. [Google Scholar] [CrossRef]
  4. Yuan, M.; Zhang, Q.; Wei, L.S. One-sided empirical Bayes test for location parameter in Gamma distribution. Appl. Math. J. Chin. Univ. 2018, 33, 287–297. [Google Scholar] [CrossRef]
  5. Chen, L.S. Empirical Bayes testing for guarantee lifetime: Non identical components case. Commun. Stat.-Theory Methods 2017, 46, 683–705. [Google Scholar] [CrossRef]
  6. Petrone, S.; Rousseau, J.; Scricciolo, C. Bayes and empirical Bayes: Do they merge? Biometrika 2014, 101, 285–302. [Google Scholar] [CrossRef]
  7. Tansey, W.; Wang, Y.; Rabadan, R.; Blei, D. Double empirical bayes testing. Int. Stat. Rev. 2020, 88, S91–S113. [Google Scholar] [CrossRef]
  8. Castillo, I.; Roquain, É. On spike and slab empirical Bayes multiple testing. Ann. Statist. 2020, 48, 2548–2574. [Google Scholar] [CrossRef]
  9. Abraham, K.; Castillo, I.; Roquain, É. Empirical Bayes cumulative -value multiple testing procedure for sparse sequences. Electron. J. Stat. 2022, 16, 2033–2081. [Google Scholar]
  10. Johns, M., Jr.; Van Ryzin, J. Convergence rates for empirical Bayes two-action problems II. Continuous case. Ann. Math. Stat. 1972, 43, 934–947. [Google Scholar] [CrossRef]
  11. Singh, R.S.; Laisheng, W. Nonparametric empirical bayes procedures, asymptotic optimality And rates Of convergence For two-tail tests In exponential family. J. Nonparametric Stat. 2000, 475–501. [Google Scholar] [CrossRef]
  12. Pensky, M. Rates of convergence of empirical Bayes tests for a normal mean. J. Stat. Plan. Inference 2003, 111, 181–196. [Google Scholar] [CrossRef]
  13. Liang, T.C. On optimal convergence rate of empirical Bayes tests. Stat. Probab. Lett. 2004, 68, 189–198. [Google Scholar] [CrossRef]
  14. Gupta, S.S.; Li, J. On empirical Bayes procedures for selecting good populations in a positive exponential family. J. Stat. Plan. Inference 2005, 129, 3–18. [Google Scholar] [CrossRef]
  15. Carroll, R.J.; Hall, P. Optimal rates of convergence for deconvolving a density. J. Am. Stat. Assoc. 1988, 83, 1184–1186. [Google Scholar] [CrossRef]
  16. Stefanski, L.A. Rates of convergence of some estimators in a class of deconvolution problems. Stat. Probab. Lett. 1990, 9, 229–235. [Google Scholar] [CrossRef]
  17. Fan, J. On the optimal rates of convergence for nonparametric deconvolution problems. Ann. Stat. 1991, 19, 1257–1272. [Google Scholar] [CrossRef]
  18. Fan, J. Global behavior of deconvolution kernel estimates. Stat. Sin. 1991, 1, 541–551. [Google Scholar]
  19. Fan, J. Deconvolution with supersmooth distributions. Can. J. Stat. 1992, 20, 155–169. [Google Scholar] [CrossRef]
  20. Karunamuni, R.J.; Zhang, S. Empirical Bayes two-action problem for the continuous one-parameter exponential family with errors in variables. J. Stat. Plan. Inference 2003, 113, 437–449. [Google Scholar] [CrossRef]
  21. Davies, K.F.; Volterman, W. Progressively Type-II censored competing risks data from the linear exponential distribution. Commun. Stat.-Theory Methods 2022, 51, 1444–1460. [Google Scholar] [CrossRef]
  22. Ahmad, A.E.B.A. Single and product moments of generalized order statistics from linear exponential distribution. Commun. Stat.-Theory Methods 2008, 37, 1162–1172. [Google Scholar] [CrossRef]
  23. Broadbent, S. Simple mortality rates. Appl. Stat. 1958, 7, 86–95. [Google Scholar] [CrossRef]
  24. Bain, L.J. Analysis for the linear failure-rate life-testing distribution. Technometrics 1974, 16, 551–559. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Qiu, H.; Chen, J.; Yuan, Z.; Huang, Y. Empirical Bayes Decision for a Generalized Exponential Distribution with Contaminated Data. Symmetry 2023, 15, 511. https://doi.org/10.3390/sym15020511

AMA Style

Qiu H, Chen J, Yuan Z, Huang Y. Empirical Bayes Decision for a Generalized Exponential Distribution with Contaminated Data. Symmetry. 2023; 15(2):511. https://doi.org/10.3390/sym15020511

Chicago/Turabian Style

Qiu, Han, Jiaqing Chen, Zihao Yuan, and Yangxin Huang. 2023. "Empirical Bayes Decision for a Generalized Exponential Distribution with Contaminated Data" Symmetry 15, no. 2: 511. https://doi.org/10.3390/sym15020511

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop