Next Article in Journal
Geary’s c and Spectral Graph Theory: A Complement
Next Article in Special Issue
Nonparametric Estimation of Multivariate Copula Using Empirical Bayes Methods
Previous Article in Journal
Hopf Bifurcation Analysis of a Class of Saperda populnea Infectious Disease Model with Delay
Previous Article in Special Issue
On the Reliability of Machine Learning Models for Survival Analysis When Cure Is a Possibility
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Non-Parametric Test for Decreasing Uncertainty of Residual Life Distribution (DURL) †

by
Hassina Benaoudia
1,*,‡ and
Amar Aissani
2,‡
1
Department of Probability and Statistics, Faculty of Mathematics, University of Science and Technology Houari Boumedienne, Bab Ezzouar, Algiers 16111, Algeria
2
Laboratory of Research in Intelligent Informatics, Mathematics and Applications (RIIMA), Faculty of Computer Science, University of Science and Technology Houari Boumedienne, Bab Ezzouar, Algiers 16111, Algeria
*
Author to whom correspondence should be addressed.
ICCSA 2022, The 22th International Conference on Computational Science and Its Applications, The University of Malaga, Malaga, Spain, 4–7 July 2022; pp. 188–196. Lecture Notes in Computer Science, Volume 13376. Springer, Cham. https://doi.org/10.1007/978-3-031-10450-3_14.
These authors contributed equally to this work.
Mathematics 2023, 11(20), 4227; https://doi.org/10.3390/math11204227
Submission received: 29 August 2023 / Revised: 22 September 2023 / Accepted: 23 September 2023 / Published: 10 October 2023
(This article belongs to the Special Issue Nonparametric Statistical Methods and Their Applications)

Abstract

:
In this paper, we propose a new statistic to test the monotonicity of uncertainty based on derivative criteria and the histogram method. We test the null hypothesis that residual entropy is constant against the fact that it decreases over time. Hence, by the fact that the exponential distribution is the distribution with a constant uncertainty, we establish the test exponential distribution against the decreasing uncertainty of residual life distribution. Consistency and asymptotic normality are proved. The critical values of the statistics are given by means of the Monte Carlo simulation method to decide on the test. Then, the power estimates of the new test are compared to those of the test based on the criteria of monotonicity of residual entropy. Finally, we show, with real survival data, that the distributions belong to a decreasing uncertainty residual life class. Moreover, by applying a test of goodness of fit, we confirm that the data follow parametric distributions belonging to a decreasing uncertainty of residual life class.

1. Introduction

Several attempts have been made, in terms of effective procedures, at inherent uncertainty reasoning in medical diagnosis, expert systems, Artificial Intelligence and other engineering studies. “All of the time, agents are forced to make decisions based on incomplete information. Even when an agent senses the world to find out more information, it rarely finds out the exact state of the world. A robot does not know exactly where an object is. A doctor does not know exactly what is wrong with a patient. A teacher does not know exactly what a student understands. When intelligent agents must make decisions, they have to use whatever information they have” (Poole and Mackworth (2023) [1]). Probability theory is perhaps an adequate way to give a mathematical measure of an uncertain (or random) event from the point of view of the criteria enumerated by Waley (1996) [2], amongst others (interpretation, imprecision, calculus, consistency, assessment and computation), since it is naturally associated with mathematical statistics for inference (see also Poole and Mackworth (2023) [1], Zio and Pedroni (2013) [3]). Several objections have been made against probability models, whose role in dealing with uncertainty—particularly with clinical diagnosis—was the core of overheated debate that arose from misunderstanding questions. This led to new theories with different purposes of application: probability bounds, entropy, imprecise probability, fuzzy theory, possibility theory, belief functions, evidence theory, and so on, all of which tried to give more comprehensive insight into the notion of uncertainty.
In Colyvan (2008) [4], the author addressed this question through the famous Cox’s theorem, which states that “Any measure of believe is isomorphic to a probability measure”. His strategy was to first show that the claim that probability theory is the only coherent means of dealing with uncertainty is implausible. He did this by “considering different kinds of uncertainty and showing that probability theory seems ill-suited to the uncertainty arising in situations where the logical principle of excluded middle fails”.
In this paper, we do not enter into this debate, but we refer to an idea that goes back to the mathematical theory of communication by Shannon (1948) [5], which defines uncertainty through the notion of entropy.
Recall that all probability distributions considered in classical uncertainty theories (probabilistic or not) are qualified as “parametric” in the sense that they depend on some parameters. For example, exponential distribution depends on only one parameter (the mean); normal distribution depends on two parameters (the mean and the variance). In any practical situation, we are satisfied with these parametric laws, which are simple and intuitive: they give a first answer to any given question. As a second step, we can question ourselves about how to improve the model from the “uncertainty” point of view. Indeed, we can have doubts about the chosen probability distribution (law), although it is given by the well-established statistical mathematic of decision or inference:
  • First, experimentation with different statistical samples for a given variable can lead to the identification (estimation, in the statistical language) of different probability laws, without which that theory is false. This points, simply, to the fact that the identification is not unique, which is well known in theory and in practice.
  • In practice, engineers are not always interested in the probability (parametric) distribution (or law) itself but simply in some “physical” property, such as the “aging” property of the component or the system. A more simple example is given by reliability or survival analysis studies (see Shaked and Shantikumar (1994) [6] and Lai and Xie (2006) [7]), in which practitioners are interested only in a stage of aging or rejuvenation: for example, “Increasing (Decreasing) Failure Rate”, i.e., IFR (DFR). Therefore, we consider the class of all probability distributions with such a property IFR (DFR). Such a class is non-parametric rather than parametric, because it contains a collection of probability distribution functions (PDFs) that have in common a given aging property.
  • Reliability studies have shown that it is possible to associate bounds (majoration/minotation) with a given class of aging distribution. Similar bounds can be obtained for any structured (coherent) system (Barlow and Proshan (1975) [8])—series, parallel, etc.—on the basis of the information about the class of the elements (stability or preservation properties).
  • The properties of such classes are also of some interest for other stochastic models, such as queuing, insurance, networks, medicine or biological models (see Feng et al. (2017) [9], Feng et al. (2020) [10], Marshall and Olkin (1979) [11], Shaked and Shantikumar (1994) [6] and Senouci et al. (2012) [12] for more details).
Non-parametric probability distributions classes are always associated with the monotonicity and comparability properties of some stochastic order. In this paper, we consider the non-parametric class of DURL (IURL)—decreasing (increasing) uncertainty of residual life—which is based on the monotonicity of an uncertain order. These classes of probability distribution functions (PDFs), which were introduced by Ebrahimi (1996) [13], also provide several mathematical properties, particularly preservation results. Further results and references, particularly in connection to order statistics and record values, are provided in Asadi and Ebrahimi (2000) [14], Ebrahimi and Soofi (1994) [15], Ebrahimi and Pellery (1995) [16] and Ebrahimi and Kirmani (1996) [17]. Ebrahimi (1997) [18] provides some statistical hypothesis tests for DURL class based on the monotonicity of the residual entropy.
In this paper, we propose a new criterion based on a derivative criteria that tests such an uncertainty property. The paper is organized as follows: In the following Section 2, we present the notions of stochastic order and non-parametric aging probability distributions (NPDFs). The Less Uncertainty (LU) order and the DURL (IURL) non-parametric classes based on monotonicity of uncertainty are presented in Section 3. In Section 4, we propose the procedure of the test based on the histogram method and inspired by the work of Ebrahimi (1997) [18] but which is based on a derivative criteria rather than the monotonicity one. Section 5 is devoted to determining the mean and variance of the proposed statistic. The consistency and the asymptotic normality are discussed, respectively, in Section 6 and Section 7. We determine in Section 8 the critical point values of the statistic by means of Monte Carlo simulation. Afterwards, we compare the power estimates of this test whith those of Ebrahimi (1997) [18] in Section 9. Finally, illustrative examples are provided in Section 10 followed by a conclusion in Section 11.

2. Stochastic Orders and Non-Parametric Aging Distribution

The problem arises in reliability theory when we have to compare reliability (or other stochastic) models; see Marshall and Olkin (1979) [11] and Shaked and Shantikumar (1994) [6]. This can be completed by using the notion of some partial stochastic ordering (it is a binary relation that is reflexive, transitive and anti-symmetric).
We consider a partial ordering defined on the set (or its suitable subsets) of all distribution functions of real-valued random variables. For example, if X and Y are two random variables with their distribution function F and G satisfying F ( x ) (resp. ) G ( x ) for every x, then we say that X is stochastically larger (resp.smaller) than Y: we write F s t ( r e s p . s t ) G or X s t ( r e s p . s t ) Y . It must be noted that even if X and Y have been defined in the same probability space, we can have anti-symmetry holding for F and G without it necessarily being the case that X and Y are the same random variables. However, stochastic ordering between two distributions, if it holds, is more informative than simply comparing their means or dispersions only. Thus, the proposed approach leads to a qualitative rather than purely quantitative estimation of the system under study. For example, such an approach can be used to design a better system. Since an agent can find two situations incomparable, one situation may be better in some stochastic sense but worse in another one.
In economic theory, this is known as the first-order stochastic dominance and is denoted by F F S D (resp. F S D ) G . There is a growth in literature related to stochastic comparability (or dominance) and various stochastic orders have been introduced; most of them can be found in monographs by Marshall and Olkin (1979) [11] or Shaked and Shantikumar (1994) [6]. Note that we can use the terms “more variable”, “riskier” and “more uncertain” synonymously, although the term “more variable” is related to specific variability orders.
For example, we say that X is smaller than Y in the increasing convex (concave) order ( X i c x Y ) if E ( f ( X ) ) E ( f ( Y ) ) for all increasing convex (concave) functions f: R R where R is the space of real numbers. There is in the literature of reliability various variability orders:
  • The convex (concave) order c x : the functions f ( . ) are convex (concave);
  • The convex transform order: X c Y if G 1 F ( x ) is convex in x on the support of F, where G 1 is the inverse of G;
  • The star-shaped order * : X * Y if G 1 F ( x ) / x increases in x on the support of F.
Now, in reliability, we are interested in the lifetime of a component (or system of such components). Let X be a non-negative random variable considered as the failure time of such a component (or system) and denote by F ( t ) = P ( X t ) , t 0 , its probability distribution function, which is assumed to be absolutely continuous with the probability density function f ( t ) .
The function F ¯ ( t ) = 1 F ( t ) = P ( X > t ) represents the survival (or reliability) function, i.e., the probability of survival of the system over the period ( 0 , t ) . We are interested also in the residual lifetime X t = X t / X > t of a system which has survived until time t.
We denote by F t ( x ) = P ( X t < x / X > t ) = P ( t X < t + x ) P ( X > t ) , F ¯ t ( x ) = P ( X t > t ) = F ¯ ( t + x ) F ¯ ( t ) the reliability of a system of age t and λ F ( t ) = f ( t ) F ¯ ( t ) the failure (or hazard) rate of the component at time t also called hazard or risk function. This is a local characteristic as the density function and λ ( t ) d t represents the probability of no failure during the time interval ( t , t + d t ) given no failure until time t.
Now, let us define some usual non-parametric PDFs. In the following, by increasing (decreasing), we mean non-decreasing (non-increasing).
Definition 1. 
A non-negative random variable X is said to be IFR (Increasing Failure Rate) (resp. DFR (Decreasing Failure Rate)) if the failure rate λ F ( t ) is increasing (resp. decreasing) over ( 0 , t ) [6,11].
This definition assumes that F ( . ) is absolutely continuous. Otherwise, the definition remains valid when considering the relation between comparability (or monotonicity) and reliability theory. A non-negative random variable X is I F R ( D F R ) if and only if X t s t ( s t ) X t for all t t . So, the stochastic order s t characterizes the IFR (DFR) (non-parametric) probability distribution.
The order i c x can characterize another usual aging notion in reliability, i.e., DMRL (IMRL): Decreasing (Increasing) Mean Residual Life.
Definition 2. 
The random variable X has DMRL (IMRL) distribution if m ( t ) = t F ¯ ( x ) d x F ¯ ( t ) is decreasing (increasing) in t [ 0 , t ] [6,11].
Now, the random variable X is DMRL (IMRL) if, and only if, X t i c x ( i c x ) X t whenever t t .
The convex c and the star-shaped * orders can be used to characterize IFR and IFRA in the following sense.
Definition 3. 
A non-negative random variable X is said to be IFRA (Increasing Failure Rate in Average) if the average of its cumulative failure rate over ( 0 , t ) is increasing in t 0 [6,11].
Proposition 1. 
Let Exp denote any exponential distributed random variable (no matter what its mean). Let X be a non-negative random variable. Then, X is IFR (resp. IFRA) if and only if X c (resp * ) E x p [6,11].
There are some relations between these non-parametric distributions (classification). If F is IFR, then it is also IFRA and DMRL, in the sense that for the class IFR ⊂ IFRA and IFR ⊂ DMRL, the inclusion is strict. However, there are no relations between IFRA and DMRL. The same relations hold for the dual classes DFR, DFRA and IMRL [6,11].
In the following section, we will consider a non-parametric probability distribution that is not based on aging properties but rather on differential entropy in the sense of Shannon.

3. Uncertainty Order and NPDFs DURL (IURL)

It is well known that we can define an uncertainty measure for the probability distribution function F (with density f) of the non-negative random variable X via differential entropy in the sense of Shannon (Shannon (1948) [5]):
H ( f ) = 0 f ( x ) l o g ( f ( x ) ) d x = E ( l o g ( f ( X ) ) )
commonly referred to Shannon information measure, where l o g ( x ) denotes the natural logarithm. The entropy is interpreted as the expected uncertainty contained in the density f ( x ) about the predictability of an outcome of the random variable X, that is the quantity of information contained in the probability distribution of the random variable X. It measures the concentration of probabilities: low entropy distributions are more concentrated and hence more informative than higher ones. In this sense, the entropy can be used for qualitative studies. In the fields of reliability, survival analysis or insurance, we have additional information about the current age of the system S (or component) under study and we must reassess the uncertainty of the remaining lifetime of S.
Ebrahimi and Pellery (1995) [16] propose a similar notion of dynamic measure of entropy by considering the age of the component at time t , t > 0 .
More precisely, the residual entropy gives the uncertainty contained in the conditional density of the residual lifetime X t = X t / X > t on the predictability of the residual time of the component, which is denoted by H ( f ; t ) , where:
H ( f ; t ) = t f ( x ) F ¯ ( t ) l o g f ( x ) F ¯ ( t ) d x
After executing some algebraic manipulations and integration by parts, we can represent the residual entropy under the following form [9]:
H ( f ; t ) = 1 E [ l o g λ F ( X ) / X > t ]
= l o g F ¯ ( t ) 1 F ¯ ( t ) t f ( x ) l o g f ( x ) d x
= 1 1 F ¯ ( t ) t f ( x ) l o g λ F ( x ) d x
which can be seen as a dynamic measure of uncertainty about S associated to its lifetime distribution. In other words, H ( f ; t ) measures the expected uncertainty contained in conditional density of the residual lifetime X t of a system (or component) of age t, i.e., given X > t . In this sense, H ( f ; t ) measures the concentration of conditional probability distributions. Note finally that the dynamic entropy of a new component (of age 0) H ( f ; 0 ) = H ( f ) is the ordinary Shannon’s entropy and that the function H ( f ; t ) uniquely determines the reliability function F ¯ , or equivalently, the PDF F.
We can now define an uncertainty ordering. The non-negative random variable X has less uncertainty than Y; we note X L U Y , if H ( f ; t ) H ( g ; t ) , t 0 [17]. If X and Y are the lifetimes of two systems S and S and if X L U Y , then the expected uncertainty contained in the conditional density of X t , about the predictability of the residual lifetime of the first system S, is less than the expected uncertainty contained in the conditional density of Y t about the remaining lifetime of the second system S (see DiCrescenzo (2002) [19] for another notion of order related to past entropy). Note that the usual stochastic orderings used in the literature [6,11] can be interpreted in terms of aging properties, and in general, there is no relation between these orderings and the above-defined uncertainty order. So, intuitively speaking, the better system is the system which lives longer, and there is less uncertainty about its residual lifetime. This motivates the introduction of several definitions of preference based on aging and on uncertainty. This aspect is not considered here.
Note that Ebrahimi and Soofi (1994) [15] view X as less uncertain than Y, written X E N T Y , if E [ l o g f ( X ) ] E [ l o g f ( Y ) ] . But the order L U is stronger than the order E N T .
On the basis of the measure of uncertainty H ( f ; t ) , Ebrahim (1996) [13] defines two non-parametric classes of life distributions.
Definition 4. 
A survival function F ¯ has decreasing (resp. increasing) uncertainty of residual life DURL (resp. IURL) if H ( f ; t ) is decreasing (resp. increasing) in t [13].
If a component has a survival PDF belonging to the class DURL, then as the component ages, the conditional probability density function becomes more informative. So, if a component of age X has a DURL distribution (resp. IURL), its residual life can be predicted with more (resp. less) precision [13].
Remark 1. 
The exponential distribution is the only continuous distribution which is both DURL and IURL [13]. In fact, the exponential distribution has a constant uncertainty. Hence, the derivative of the uncertainty is null.
Example 1. 
Let X be a random variable having Weibull distribution with survival function F ¯ ( t ) = e λ t α , t > 0 . Then, X is DURL (IURL) for α > 1 (resp. 0 < α < 1 ) [13].
Remark 2. 
The relation between aging non-parametric PDFs [6,11] and uncertainty ones [13] is that:
  • If F is  I F R ( D F R ) then it is  D U R L ( I U R L ) . So that  I F R D U R L  and  D F R I U R L .
  • If F is  D M R L , then it is  D U R L So that  D M R L D U R L .
  • IF F is  I U R L , then it is  I M R L . So that  I U R L I M R L .
But there is no connection with IFRA (DFRA).
We can point out an interesting mathematical property of such DURL non-parametric probability distribution (Ebrahimi (1996) [13]):
Lemma 1. 
A survival function F ¯ is DURL (IURL), if L ( f ; t ) = H ( f ; t ) 0 ( H ( f ; t ) 0 ) . That is, if we have negative (positive) local reduction of uncertainty, then F ¯ is DURL (IURL) [13].
Lemma 2. 
A survival function F ¯ is D U R L if and only if H ( f ; x ) H ( f ; y ) 0 for all y x [13].
As DURL (IURL) is a non-parametric class of probability distributions, then if we want to test H 0 : F DURL, we cannot compute the statistical criteria under H 0 . So, it is more convenient to test H 0 : F Exp, against H 1 : F DURL, where Exp is the class of all exponential distributions. Ebrahimi (1997) [18] has initiated this test on the basis of the monotonicity property of uncertainty (Lemma 2). We note that under exponentiality, the residual entropy is constant (Ebrahimi (1996) [13]). So, in this paper, we propose a new criterion based on Lemma 1, which tests this uncertainty property (see also Baratpour and Habibi Rad (2012, 2016) [20,21]).
In the following, we explain the procedure of the test H 0 : exponentiality against H 1 : DURL, using the histogram method to estimate the density and the probability function [18] and derivative criteria.
We obtain the mean and variance of the proposed new statistic. The consistency and the asymptotic normality are discussed. We determine the critical point values of the statistic by means of Monte Carlo simulation. Then, we compare power estimates of this test with those of Ebrahimi (1997) [18]. Finally, results of the illustrative examples are discussed.

4. A Test Statistic Based on the Histogram Method and Derivative Criteria

Recall that E x p denotes the class of all exponential distributions (no matter what its parameter) and DURL denotes the class of all NPDFs which have the uncertain property of Definition 4, i.e., decreasing residual entropy.
In order to discriminate between the hypothesis:
H 0 : F E x p
against
H 1 : F D U R L ( a n d n o t e x p o n e n t i a l d i s t r i b u t i o n )
We use the derivative property of Lemma 1, contrary to the test of Ebrahimi (1997) [18], which uses the monotonicity of the dynamic entropy (Lemma 2).
The derivative of the uncertainty measure H ( f ; t ) can be written as [13]:
H ( f ; t ) = [ H ( f ; t ) 1 + l o g λ F ( t ) ] λ F ( t ) .
Under the hypothesis H 0 , the exponential has a constant residual entropy; then, the function H ( f ; t ) = 0 and λ F ( t ) 0 , t > 0 .
Let the function
g ( t ) = H ( f ; t ) 1 + l o g λ F ( t ) .
Then, g ( t ) = 0 under the hypothesis H 0 .
Consider now the expectation:
S = E [ g ( t ) ] .
We have
S = 0 g ( t ) f ( t ) d t =
= 0 [ H ( f ; t ) 1 + l o g λ F ( t ) ] f ( t ) d t =
= 0 f ( t ) l o g f ( t ) d t + 0 f ( t ) F ¯ ( t ) t f ( y ) l o g f ( y ) d y d t 1
We use the empirical probability density function to estimate f ( t ) using a histogram method [18] based on a partition of the order statistics. Let t 1 , t 2 , , t n be n ( n > 2 ) observations from F. Let t ( 1 ) , t ( 2 ) , t ( n ) be the corresponding order statistics. We choose k as an integer ( 0 < k < n ) such that:
λ i = ( n 1 ) i k a i 1 + 1 , i = 1 , , k ,
where a 0 = 1 and [ . ] indicates the nearest integer function.
We consider a partition of [ t ( 1 ) , t ( 2 ) , t ( n ) ] into k subintervals:
I i = [ t ( a i 1 ) , t ( a i ) [ , i = 1 , 2 , , k
where
a i = 1 + j = 1 i λ j , i = 1 , 2 , , k .
then,
a i = ( n 1 ) i k + 1 , i = 1 , 2 , , k .
The empirical density function is (see [22,23]):
f n ( x ) = { λ i ( n 1 ) t ( a i ) t ( a i 1 ) , if x I i , 1 i k , 0 , otherwise .
The empirical cumulative function is:
F n ( x ) = { 0 , if x < t ( 1 ) , a i 1 1 n 1 + λ i n 1 x t ( a i 1 ) t ( a i ) t ( a i 1 ) , if x I i , 1 i k , 1 , if x t ( n ) .
Let d i ( k ) = t ( a i ) t ( a i 1 ) , i = 1 , 2 , , k . Some algebras give:
0 f ( t ) l o g f ( t ) d t = i = 1 k I i λ i ( n 1 ) d i ( k ) l o g λ i ( n 1 ) d i ( k ) d x
= i = 1 k λ i ( n 1 ) l o g λ i ( n 1 ) d i ( k )
and
t f ( y ) l o g f ( y ) d y = λ i ( n 1 ) d i ( k ) l o g λ i ( n 1 ) d i ( k ) y | t t ( a i )
+ i > i λ i ( n 1 ) d i ( k ) l o g λ i ( n 1 ) d i ( k ) y | t ( a i 1 ) t ( a i )
= λ i ( n 1 ) d i ( k ) l o g λ i ( n 1 ) d i ( k ) [ t ( a i ) t ]
+ i > i λ i ( n 1 ) l o g λ i ( n 1 ) d i ( k )
where i is the i-th index such that the interval [ t ( a i 1 ) , t ( a i ) ] contains t.
Now, if we replace ( 12 ) in the second term of expression ( 8 ) , we obtain
0 f ( t ) F ¯ ( t ) t f ( y ) l o g f ( y ) d y d t =
i = 1 k I i b i ( n ) λ i ( n 1 ) d i ( k ) l o g λ i ( n 1 ) d i ( k ) [ t ( a i ) t ]
+ i = 1 k 1 I i b i ( n ) i > i λ i n 1 l o g λ i ( n 1 ) d i ( k ) d t
where
b i ( n ) = λ i ( n a i 1 ) d i ( k ) + λ i t ( a i 1 ) λ i t .
so,
S n ( k ) = i = 1 k 1 1 n 1 C n ( i ) l o g ( n 1 ) d i ( k ) λ i 1 ,
C n ( i ) = 2 λ i + ( λ i a i 1 + n + j > i λ j ) l o g n a i 1 n a i + λ k k 1 .
Note that under H 0 , we have g ( t ) = 0 and under H 1 , g ( t ) < 0 according to Lemma 1. It is the same logic for the statistic S n ( k ) . So, S n ( k ) is null under H 0 and negative under H 1 . Then, small values of S n ( k ) favor H 1 or equivalently, large values of W n ( k ) = S n ( k ) favor H 1 .
So, we reject H 0 in favor of H 1 at significance level α if W n ( k ) C k , n ( α ) . Note that C k , n ( α ) is the critical point value determined by the ( 1 α ) -quantile of the distribution of W n ( k ) under exponentiality.

5. Mean and Variance of the Statistic W n ( k )

We now derive the mean and variance of S n ( k ) by using the linear property of mathematical expectation and some results cited in [18]. Let:
B n ( i , k ) = E t ( a i ) t ( a i 1 ) =
= j = a i 1 a i 1 Γ ( 1 ) l o g ( n j ) Π m = a i 1 , m j a i 1 n m j m
and
D n ( i , k ) = V a r t ( a i ) t ( a i 1 ) =
= j = a i 1 a i 1 ( Γ ( 1 ) + 2 Γ ( 1 ) l o g ( n j )
+ l o g 2 ( n j ) ) m = a i 1 a i 1 n m j m B n 2 ( i , k ) ,
where Γ ( 1 ) and Γ ( 1 ) are, respectively, the first and second derivatives of the Gamma function:
Γ ( α ) = 0 x α 1 e x d x
evaluated at α = 1 [18].
Then, we have:
E W n ( k ) = i = 1 k 1 C n ( i ) n 1 B n ( i , k ) + i = 1 k 1 C n ( i ) n 1 l o g n 1 λ i + 1
and
V a r W n ( k ) = 1 n 1 2 i = 1 k 1 C n ( i ) 2 D n ( i , k ) .
Inspired by the proofs cited in [18], we have established proofs in Section 6 and Section 7 relating to the asymptotic properties of the new test.

6. Consistency of the Test W n ( k )

In this section, we prove the convergence of the test statistic W n ( k ) .
Theorem 1. 
The statistic W n ( k ) converges to S as n ,   k , and k n 0 .
Proof. 
Let
U n = i = 1 k 1 l o g n 1 λ i F t ( a i ) F t ( a i 1 ) C n ( i ) n 1
and
V n = i = 1 k 1 l o g F t ( a i ) F t ( a i 1 ) t ( a i ) t ( a i 1 ) C n ( i ) n 1 .
So, we have
W n ( k ) = U n ( k ) + V n ( k )
where F ( t ( a i ) ) are order statistics from a uniformly distributed population. Next, we have
E ( U n ) = i = 1 k 1 C n ( i ) n 1 E ( l o g F t ( a i ) F t ( a i 1 ) )
+ 1 n 1 i = 1 k 1 C n ( i ) l o g ( n 1 λ i )
= i = 1 k 1 C n ( i ) n 1 Ψ ( n 1 ) i k + 1 + 1 n 1 i = 1 k 1 C n ( i ) l o g ( n 1 λ i )
i = 1 k 1 C n ( i ) n 1 Ψ ( n + 1 )
= i = 1 k 1 C n ( i ) n 1 ( Ψ ( a i ) Ψ ( n + 1 ) ) + 1 n 1 i = 1 k 1 C n ( i ) l o g ( n 1 λ i )
where Ψ is the digamma function defined for each natural integer n by
Ψ ( n ) = 1 + 1 2 + 1 3 + + 1 n 1 γ .
where γ 0.557 is the approximation of the Euler–Mascheroni constant.
It follows that E ( U n ) 0 as n . So, U n ( k ) 0 in probability as n .
Now, let G be the primitive of the fraction:
S ( t ) l o g ( f ( t ) ) = f ( t ) f ( t ) F ¯ ( t ) l o g ( f ( t ) ) t f ( y ) l o g ( f ( y ) ) d y
+ f ( t ) l o g ( f ( t ) )
We express the statistic V n ( k ) as a Stieltjes sum of the function l o g ( f ( x ) ) with respect to the measure G n , where G n is the empirical estimation of G.
Then,
V n ( k ) = i = 1 k 1 l o g F t ( a i ) F t ( a i 1 ) t ( a i ) t ( a i 1 ) ( G n t ( a i ) G n ( t ( a i 1 ) ) )
Recall that the function t ( . ) and the sequence a i are chosen such as I i , t ( a i ) t ( a i 1 ) 0 when n . So, from ( 26 ) , we deduce that if G n ( x ) G x almost surely uniformly over x [24], then V n ( k ) S almost surely as n . Under this restriction, the test is consistent. □

7. Normality of the Statistic W n ( k )

In this section, we prove the normality of the statistic W n ( k ) as n .
Theorem 2. 
Under the null hypothesis H 0 , the normalized statistic k 2 [ W n ( k ) 1 ] converges to a standard normal distribution as n , k and k n 0 .
Proof. 
We can write W n ( k ) as follows,
W n ( k ) = i = 1 k 1 C n ( i ) n 1 l o g j = a i 1 a i 1 t ( j + 1 ) t ( j )
+ 1 n 1 i = 1 k 1 C n ( i ) l o g ( n 1 λ i ) + 1 .
As proposed by David and Nagaraja (2003) [25] and using the fact that:
k 2 ( n 1 ) i = 1 k 1 C n ( i ) l o g ( n 1 λ i ) 0
when n , k and k n 0 , the statistic W k ( n ) is asymptotically equivalent to:
1 n 1 i = 1 k 1 C n ( i ) l o g U n a i 1 ( n a i 1 + 1 )
where U m ( i ) is the order statistic of a uniform distribution of size m. Then, the statistic ( 28 ) is statistically equivalent to:
I n ( k ) = 1 n 1 i = 1 k 1 C n ( i ) Z i ,
where Z i , i = 1 , 2 , , k 1 are independent and identically distributed random variables.
Now, let the normalized random variable:
k 2 I n ( k ) = i = 1 k 1 k C n ( i ) 2 ( n 1 ) Z i
and for n , k and k n 0 , we have k C n ( i ) 2 ( n 1 ) 1 .
By the central limit theorem, we conclude that k 2 I n ( k ) is asymptotically normal, and k 2 [ W n ( k ) 1 ] is also asymptotically normal. □

8. Critical Values of the Test W n ( k )

In this section, by means of Monte Carlo simulation, we determine critical values C k , n ( α ) of the statistic W n ( k ) . For different confidence levels of 1 α : 0.900 , 0.950 , 0.975 and 0.990 and different values of sample size n, simulations are provided. A total of 5000 samples of exponential distribution with mean 1 are generated. Note that we compute for each sample the corresponding values of the empirical statistic distribution W n ( k ) for large spacings of the observations and W n ( k ) for little spacings. The established algorithm gives the critical values for each α and n, which are summarized in Table 1 and Table 2.

9. Power Estimates

Under the alternative hypothesis, the distribution of the statistic W n ( k ) is complicated. However, we can obtain power estimates by using Monte Carlo simulation method. We have provided a large number of experiments under Weibull ( 1 , θ )
F ¯ ( x ) = e x p ( x θ ) , θ > 0 ,
and Gamma ( θ , 1 )
G ( x ) = 1 Γ ( θ ) x y θ 1 e y d y
alternatives.
This choice is motivated by the fact that when θ = 1 , both Weibull and Gamma distributions become an exponential distribution corresponding to the null hypothesis. In this case, the power estimates are very weak. So, we can compare our results with those of Ebrahimi (1997) [18]. In Table 3 and Table 4, we give simulated powers for sample size n. Simulated results are based on 10,000 iterations. We notice that the proposed test based on the derivative criteria performs relatively well when the parameter θ of the distributions reaches 20.

10. Illustrative Examples

Example 2. 
As a first illustration, we consider survival times (in days) for 50 patients’ head and neck cancer cited in Lai and Xie (2006) [7] and Efron (1988) [26]. We have computed the empirical value of the statistic W n ( k ) which is equal to W n ( k ) = 26.4194 . When comparing the result with the critical values given in Table 1, we deduce that the statistic exceeds all percentiles.
We can conclude that this sample is provided from a D U R L distribution. So, the remaining lifetimes of a patient from this population can be predicted with more precision.
From Figure 1, we notice that the more t ( t > 0 ) increases, the more the curve decreases. Thus, the empirical residual entropy from the dataset cited in [7,26] is DURL. It confirms the established test based on the derivative criteria. However, we notice that there is a part of the curve parallel to the abscissa axis. This is because during the time interval [ 600 , 1100 ] , there are no data available regarding survival times, and therefore, the measure of information does not change.
By applying the Chi-square goodness-of-fit test to the data cited in [7,26], we can test whether these data belong to Weibull distribution against another one. The null hypothesis is accepted at a 0.05 level of significance, where the estimated shape and scale parameters given by Matlab are, respectively, as follows: 365.7673 and 1.0223 . This decision is further supported by the Weibull probability plot given by Matlab in Figure 2, where the data points (in cross line, representing the empirical data) appear along the reference line (in dashed line) representing the theoretical distribution.
Example 3. 
As a second illustration, we consider times to failure of 18 electronics devices from Lai and Xie (2006) [7] and Wang (2000) [27]. The compilation of the algorithm gives the estimation of the statistic W n ( k ) = 24.2029 . This value exceeds all percentiles given in Table 1 when n = 18 . We conclude also that this real sample is provided from D U R L distribution.
In Figure 3, we once again observe that the curve of the empirical residual entropy specific to the dataset cited in [7] decreases as t ( t > 0 ) increases. This confirms the result of the test based on the derivative criteria.
The same conclusion leads by applying the Chi-square goodness-of-fit test to the data cited in [7,27]. Once again, we accept that the distribution is a Weibull at the 0.05 level of significance; see Figure 4, and the estimated shape and scale parameters given by Matlab are, respectively, 179.656 and 1.1458 .
Notice that the results above show that the shape parameters exceed 1, so the Weibull distributions are also DURL (Example 1). This is due to the fact that the DURL class does not only contain non-parametric distributions, but it also includes parametric ones.
Another example in medicine has been provided by Benaoudia and Aissani (2022) [28], on different real data by Bryson and Siddiqui, as cited in Lai and Xie (2006) [7]. It leads to the same results.

11. Conclusions

In this paper, we have proposed a new statistical test for detecting the monotonicity of uncertainty based on derivative criteria different from that of Ebrahimi (1997) [18]. The choice of the exponential distribution in the null hypothesis is based on the fact that this distribution gives a constant residual entropy. The derivative of this last one is null. Hence, we test whether the residual entropy is constant against the decreasing one. The consistency and the asymptotic normality properties have been discussed. Power estimates have been calculated and compared to those of the test of Ebrahimi (1997) [18]. Illustrations based on real survival data have been provided, and the application of the Chi-square goodness-of-fit test confirms the datasets distributions belong to parametric distributions that are DURL at the same time. A second new test has been studied using the kernel estimation method, which provides the same conclusions. This is the subject of a further paper.

Author Contributions

Conceptualization, A.A. and H.B.; methodology, H.B.; software, B.H; validation, A.A.; formal analysis, H.B.; investigation, H.B.; resources, A.A. and H.B.; data curation; writing—original draft preparation, A.A. and H.B.; writing—review and editing, A.A. and H.B.; visualization, A.A. and H.B.; supervision, A.A.; project administration, A.A.; funding acquisition, A.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded, in part, by The Ministry of Higher Education and Scientific Research (DGRSDT) grant number C00L07UN160420220004.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data sharing is not applicable.

Acknowledgments

Many thanks for Antonio Di Crescenzo for his help with some calculus and the documentation given when the first author had a grant for some days in the University of Salerno (Italy). We appreciate the fruitful discussions with Mounir Mesbah when the second author visited the LPSM laboratory, University of Jussieu-Sorbonne in 2018. Finally, many thanks for the reviewers for the interest brought to this work and their useful comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Poole, D.L.; Mackworth, A.K. Artificial Intelligence, Foundations of Computational Agents, 2nd ed.; Cambridge University Press: Cambridge, UK, 2017. [Google Scholar]
  2. Walley, P. Measures of uncertainty in expert systems. Artif. Intell. 1996, 83, 1–58. [Google Scholar] [CrossRef]
  3. Zio, E.; Pedroni, N. Literature Review of Methods for Representing Uncertainty; Cahiers de la Securité Industrielle; Foundation for an Industrial Safety Culture (FonCSI): Toulouse, France, 2013. [Google Scholar]
  4. Colyvan, M. Is the probability the only coherent approch to uncertainty? Risck Anal. 2008, 28, 645–652. [Google Scholar] [CrossRef]
  5. Shannon, C.E.S. A mathematical theory of communication. Bell System. Technol. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  6. Shaked, M.; Shantikumar, J.G. Stochastic Orders and Their Applications; Academic Press: San Diego, CA, USA, 1994. [Google Scholar]
  7. Lai, C.D.; Xie, M. Stochastic Ageing and Dependance in Reliability; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
  8. Barlow, R.E.; Proshan, F. Statistical Theory of Reliability and Life Testing; Holt, Rinehalt and Winston: New York, NY, USA, 1975. [Google Scholar]
  9. Feng, M.; Qu, H.; Yi, Z.; Kurths, J. Distribution derived from evolving networks whith variable elements. IEEE Trans. Cybernet 2017, 48, 2556–2568. [Google Scholar] [CrossRef] [PubMed]
  10. Feng, M.; Deng, L.J.; Chen, F.; Perc, M.; Kurths, J. The accumulative law and its probability model, an extension of the pareto distribution and the log-normal distribution. Proc. R. Soc. A 2020, 476, 20200019. [Google Scholar] [CrossRef] [PubMed]
  11. Marshall, A.W.; Olkin, I. Inequalities, Theory of Majorization and Its Application; Academic Press: New York, NY, USA, 1979. [Google Scholar]
  12. Senouci, M.R.; Mellouk, A.; Oukhelou, L.; Aissani, A. Using belief functions theory to deploy static wireless sensor networks. In Belief Functions, Theory and Applications; Springer: Berlin/Heidelberg, Germany, 2012; pp. 425–432. [Google Scholar]
  13. Ebrahimi, N. How to measure uncertainty about lifetime. Sankhya 1996, 58, 48–57. [Google Scholar]
  14. Asadi, M.; Ebrahimi, N. Residual entropy and its characterizations in terms of hazard function and mean residual life function. Stat. Probab. Lett. 2000, 49, 263–269. [Google Scholar] [CrossRef]
  15. Ebrahimi, N.; Soofi, E.S. Information functions for Reliability. In Mathematical Reliability, an Expository Perspective; Springer: Boston, MA, USA, 1994; pp. 127–155. [Google Scholar]
  16. Ebrahimi, N.; Pellery, F. New partial ordering of survival functions based on notion of uncertainty. J. Appl. Probab. 1995, 32, 202–211. [Google Scholar] [CrossRef]
  17. Ebrahimi, N.; Kirmani, S. Some results on ordering of survival functions through uncertainty. Stat. Probab. Lett. 1996, 28, 167–176. [Google Scholar] [CrossRef]
  18. Ebrahimi, N. Testing whether lifetime distribution is decreasing uncertainty. J. Stat. Plan. Inference 1997, 64, 9–19. [Google Scholar] [CrossRef]
  19. DiCrescenzo, A.; Longobardi, M. Entropy-based measure of uncertainty in past lifetime. J. Appl. Probab. 2002, 39, 434–440. [Google Scholar] [CrossRef]
  20. Baratpour, S.; Habibi Rad, N. Testing goodness of fit exponential distribution based on cumulative residual entropy. Commun. Stat. Theory 2012, 41, 1387–1396. [Google Scholar] [CrossRef]
  21. Baratpour, S.; Habibi Rad, N. Exponentiality test based on the progression type II censoring via cumulative entropy. Commun. Stat.-Simul. Comput. 2016, 45, 2625–2637. [Google Scholar] [CrossRef]
  22. Härdle, W. Smoothing Techniques; Springer: Berlin/Heidelberg, Germany, 1991. [Google Scholar]
  23. Silverman, B.W. Density for Statistics and Data Analysis; Chapman and Hall: New York, NY, USA, 1986. [Google Scholar]
  24. Mack, S.P.A. A Comparative Study of Entropy Estimators and Entropy Based Goodness of Fit Tests. Ph.D. Thesis, University of California, San Diego, CA, USA, 1988. [Google Scholar]
  25. David, H.A.; Nagaraja, H.N. Order Statistics, 3rd ed.; Wiley: New York, NY, USA, 2003. [Google Scholar]
  26. Efron, B.N. Logistic regression, survival analysis and the Kaplan Meier curve. J. Am. Stat. Assoc. 1988, 2883, 414–425. [Google Scholar] [CrossRef]
  27. Wang, F.K. A new model with bathtub shaped failure rate using an additive Burr XII distribution. Reliab. Eng. Syst. Saf. 2000, 70, 305–312. [Google Scholar] [CrossRef]
  28. Benaoudia, H.; Aissani, A. A non-parametric test for decreasing uncertainty life distribution (DURL). In International Conference on Computational Science and Its Applications; Springer International Publishing: Cham, Switzerland, 2022; pp. 188–196. [Google Scholar]
Figure 1. Curve of residual entropy of survival times (in days) for patients’ head and neck cancer.
Figure 1. Curve of residual entropy of survival times (in days) for patients’ head and neck cancer.
Mathematics 11 04227 g001
Figure 2. Weibull probability plot of Chi-2 goodness-of-fit test of survival times (in days) for patients’ head and neck cancer.
Figure 2. Weibull probability plot of Chi-2 goodness-of-fit test of survival times (in days) for patients’ head and neck cancer.
Mathematics 11 04227 g002
Figure 3. Curve of residual entropy of times to failure of 18 electronics devices.
Figure 3. Curve of residual entropy of times to failure of 18 electronics devices.
Mathematics 11 04227 g003
Figure 4. Weibull probability plot of Chi-2 goodness-of-fit-test of times to failure of 18 electronics devices.
Figure 4. Weibull probability plot of Chi-2 goodness-of-fit-test of times to failure of 18 electronics devices.
Mathematics 11 04227 g004
Table 1. Critical values of W n ( k ) .
Table 1. Critical values of W n ( k ) .
α 0.1000.0500.0250.010
n = 3 4.80065.53346.30697.0340
n = 4 3.97384.53215.08345.5868
n = 5 6.89377.83428.60389.5242
n = 6 5.74766.41447.10877.6660
n = 7 6.20657.02067.63908.3710
n = 8 6.67507.69588.37038.9823
n = 9 7.22827.95328.49979.1910
n = 10 7.45268.21488.87009.5785
n = 11 6.82197.52948.18058.7718
n = 12 6.97617.75438.34268.9653
n = 13 7.17697.83418.58099.2141
n = 14 7.34608.06988.76849.5144
n = 15 6.94587.66148.26888.7816
n = 19 7.63128.38438.96919.5865
n = 20 7.85898.56779.10779.7696
n = 25 7.87958.58139.18138.8255
n = 30 7.78848.48829.06889.7825
n = 40 8.44859.18319.712410.3109
n = 43 8.45449.20739.729710.3426
n = 50 8.50879.20049.807410.3617
n = 60 8.47379.11959.657010.3008
n = 70 8.98389.580210.195410.7712
n = 80 9.05569.706610.241210.8923
n = 90 8.93359.587010.051210.6993
n = 100 8.93639.574510.121410.7857
Table 2. Critical values of W n ( k ) .
Table 2. Critical values of W n ( k ) .
α 0.1000.0500.0250.010
n = 3 2.35643.92975.35917.7634
n = 4 1.33782.43783.31154.4102
n = 5 2.38164.27276.14518.5942
n = 6 0.82251.96773.25724.7316
n = 7 1.06332.39073.53095.2719
n = 8 1.61473.06264.69116.3835
n = 9 −0.46530.78711.87143.1158
n = 10 −0.03371.25302.65254.1937
n = 11 −0.49630.73621.81062.8472
n = 12 −0.43290.87871.99713.1916
n = 13 −0.33410.82871.95193.0975
n = 14 −0.13021.11042.17763.9166
n = 15 −0.43730.70191.81442.8823
n = 19 −1.65320.71070.35141.5840
n = 20 −1.6547−0.57650.42951.5230
n = 25 −1.6166−0.53380.46301.6354
n = 30 −1.4796−0.43970.65851.8028
n = 40 −2.5892−1.5634−0.72440.4818
n = 43 −2.6190−1.7370−0.81460.3497
n = 50 −2.5228−1.4527−0.51320.6715
n = 60 −2.7075−1.8218−0.79030.4079
n = 70 −3.5011−2.6071−1.8078−0.6376
n = 80 −3.4030−2.5228−1.7164−0.5773
n = 90 −3.4025−2.4769−1.5283−0.3414
n = 100 −3.4964−2.5482−1.6047−0.4008
Table 3. Power estimates for Weibull ( θ , 1 ) of W n ( k ) .
Table 3. Power estimates for Weibull ( θ , 1 ) of W n ( k ) .
n20105
θ = 20 0.99900.98620.8921
θ = 10 0.92880.78310.5686
θ = 5 0.55680.37500.2591
θ = 3 0.25060.17840.1321
θ = 1 0.05550.05220.0526
Table 4. Power estimates for Gamma ( θ , 1 ) of W n ( k ) .
Table 4. Power estimates for Gamma ( θ , 1 ) of W n ( k ) .
n20105
θ = 20 0.90810.84100.7513
θ = 10 0.79170.69400.6000
θ = 5 0.56800.47430.3966
θ = 3 0.36250.29260.2397
θ = 1 0.05010.05110.0496
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Benaoudia, H.; Aissani, A. Non-Parametric Test for Decreasing Uncertainty of Residual Life Distribution (DURL). Mathematics 2023, 11, 4227. https://doi.org/10.3390/math11204227

AMA Style

Benaoudia H, Aissani A. Non-Parametric Test for Decreasing Uncertainty of Residual Life Distribution (DURL). Mathematics. 2023; 11(20):4227. https://doi.org/10.3390/math11204227

Chicago/Turabian Style

Benaoudia, Hassina, and Amar Aissani. 2023. "Non-Parametric Test for Decreasing Uncertainty of Residual Life Distribution (DURL)" Mathematics 11, no. 20: 4227. https://doi.org/10.3390/math11204227

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop