Next Article in Journal
Grazing and Symmetry-Breaking Bifurcations Induced Oscillations in a Switched System Composed of Duffing and van der Pol Oscillators
Next Article in Special Issue
Inference and Local Influence Assessment in a Multifactor Skew-Normal Linear Mixed Model
Previous Article in Journal
Decision-Making of Transnational Supply Chain Considering Tariff and Third-Party Logistics Service
Previous Article in Special Issue
Bayesian Inference for Stochastic Cusp Catastrophe Model with Partially Observed Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Cumulative Residual Tsallis Entropy-Based Test of Uniformity and Some New Findings

by
Mohamed S. Mohamed
1,*,
Haroon M. Barakat
2,
Salem A. Alyami
3 and
Mohamed A. Abd Elgawad
4,5
1
Department of Mathematics, Faculty of Education, Ain Shams University, Cairo 11341, Egypt
2
Department of Mathematics, Faculty of Science, Zagazig University, Zagazig 44519, Egypt
3
Department of Mathematics and Statistics, Faculty of Science, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 13318, Saudi Arabia
4
Department of Mathematics, Faculty of Science, Benha University, Benha 13518, Egypt
5
School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430070, China
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(5), 771; https://doi.org/10.3390/math10050771
Submission received: 27 January 2022 / Revised: 22 February 2022 / Accepted: 23 February 2022 / Published: 28 February 2022
(This article belongs to the Special Issue Mathematical and Computational Statistics and Their Applications)

Abstract

:
The Tsallis entropy is an extension of the Shannon entropy and is used extensively in physics. The cumulative residual Tsallis entropy, which is a generalization of the Tsallis entropy, plays an important role in the measurement uncertainty of random variables and has simple relationships with other important information and reliability measures. In this paper, some novel properties of the cumulative residual Tsallis entropy are disclosed. Moreover, this entropy measure is applied to testing the uniformity, where the limit distribution and an approximation of the distribution of the test statistic are derived. In addition, the property of stability is discussed. Furthermore, the percentage points and power against seven alternative distributions of this test statistic are presented. Finally, to compare the power of the suggested test with that of other tests of uniformity, a simulation study is conducted.

1. Introduction

The classical measure of uncertainty in a discrete distribution (Shannon [1]) has been used in many areas, such as computer science [2], communication theory [2], the physical and chemical fields [3], fuzzy sets [4], and finance [5,6]. A straightforward extension of the discrete case to continuous distributions based on a probability density function (PDF) f X ( . ) of a continuous random variable (RV) X , called differential entropy, reads (cf. [7])
S ( X ) = E ( log f X ( X ) ) = f X ( x ) log f X ( x ) d x .
Many generalizations of the Shannon entropy have been published by inserting some additional parameters, making these generalizations more responsive to diverse shapes of probability distributions. Rao et al. [7] (see also, Wang et al. [8]) suggested a non-negative measure of uncertainty and referred to it as the cumulative residual entropy (CRE). This suggested measure is obtained by replacing the PDF f X ( x ) in (1) by the survival function F ¯ X ( x ) : = P ( X > x ) = 1 F X ( x ) . Thus, for any continuous RV X with a cumulative distribution function (CDF) F X ( x ) , the CRE is specified by
C R E ( F ) = 0 F ¯ X ( x ) log ( F ¯ X ( x ) ) d x ,
where F ¯ X ( x ) = P ( X > x ) .
The Tsallis entropy of order θ is a generalization of the Shannon entropy that was first given by Havrda and Charvat [9]. Then, Tsallis [10] used its properties and placed it in a physical context. This measure is defined for any continuous RV X as
T θ ( X ) = 1 θ 1 E ( 1 ( f X ( X ) ) θ 1 ) = 1 θ 1 1 ( f X ( x ) ) θ d x ,
where 1 θ > 0 . Clearly, as θ 1 , T θ ( X ) S ( X ) .
Motivated by the wide applicability of the Tsallis entropy, Sati and Gupta [11] proposed the cumulative residual Tsallis entropy (CRTE) of order θ , which is given by
C T θ * ( X ) = 1 θ 1 1 0 ( F ¯ X ( x ) ) θ d x , 1 θ > 0 .
Rajesh and Sunoj [12] introduced an alternate measure of CRTE of order θ , which possesses certain interesting properties with C T θ * ( X ) , as
C T θ ( X ) = C T θ ( F ) = 1 θ 1 0 ( F ¯ X ( x ) ( F ¯ X ( x ) ) θ ) d x , 1 θ > 0 ;
when θ 1 , then C T θ ( X ) C R E ( F ) but C T θ * ( X ) C R E ( F ) , and C T θ ( X ) = C T θ * ( X ) + E ( X ) 1 θ 1 , for more details see Mohamed [13]. In this work we focus only on the measure (2).
Rajesh and Sunoj [12] detected several eminent features of the CRTE (2). For example, the CRTE has more interesting mathematical features than the CRE, it can be easily estimated from sample data, and these estimates asymptotically converge to the true values. Moreover, the CRTE handles the information in residual life. For the standard uniform distribution, denoted by U ( 0 , 1 ) , Rajesh and Sunoj [12] determined the value of the CRTE, which is 1 2 ( θ + 1 ) . The literature teems with several results of the Shannon entropy and its related measures. Interested readers may refer to [14,15,16,17,18] for the Shannon entropy, Kullback–Leibler divergence, and Fisher information number; [19,20] for fractional cumulative residual entropy and cumulative residual entropy, respectively; [13,21,22,23,24] for the Tsallis entropy and its related measures; and finally [25,26] for the extropy and Rényi entropy and its applications, respectively.
Stephens [27] provided a useful guide to goodness-of-fit tests using statistics based on the empirical CDF. Furthermore, power comparisons of several uniformity tests were performed in [27]. The power attributes of an entropy-based test when employed for measuring uniformity were examined by Dudewicz and Van der Meulen [28]. Furthermore, Dudewicz and Van der Meulen [28] demonstrated that the entropy-based test has good power qualities for various alternatives by comparing it to other uniformity tests. Noughabi [29] developed a test for uniformity based on the CRE and studied some of its features. In addition, he compared the percentage points and power of seven alternative distributions. Mohamed et al. [30] used the fractional and weighted CRE measures to test the uniformity.
In this paper, we study the CRTE (2) for testing uniformity. The outcome of a simulation study reveals that the test under CRTE is competing with the test based on CRE in terms of power. In addition, some interesting statistical properties of the CRTE are revealed. We also use the Monte Carlo method via simulation and normality asymptotic, as well as the beta approximation, to derive the percentage points under the CRTE. In addition, the CRTE and other tests are compared in terms of power analysis.

Work Motivation

The Tsallis entropy of order θ , which was introduced by Tsallis [10], plays an important role in the measurement uncertainty of RVs and leads nonextensive statistics. The Tsallis entropy is the basis of the so-called nonextensive statistical mechanics, which generalizes the Boltzmann-Gibbs theory (cf. [31]). Tsallis statistics have found applications in a wide range of phenomena in diverse disciplines such as physics, chemistry, biology, medicine, economics, geophysics, etc. For example, Cartwright [32] proposed applications of the Tsallis entropy in various fields, such as describing the fluctuation of the magnetic field in the solar wind and signs of breast cancer in mammograms. Sati and Gupta [11] introduced a cumulative residual Tsallis entropy of order θ and studied its various properties in the context of reliability modeling. After one year, Rajesh and Sunoj [12] introduced an alternate measure of CRTE (defined by (2)) and studied its properties. Unlike the CRTE of Sati and Gupta [11], the proposed measure had some additional features and had simple relationships with other important information and reliability measures.
There are many different types of probability distributions, and the uniform distribution is perhaps the simplest of them all. For a continuous distribution, the uniform distribution defines equal probability over a given range. As a result, it is valuable as a reference distribution. Random number generation is one of the most important uses of uniform distribution. Moreover, in the field of economics, usually, demand and replenishment may not follow the expected normal distribution. As a result, different distribution models are employed to better anticipate probabilities and trends. According to Wanke [33], uniform distribution is more effective when evaluating lead-time for inventory management at the beginning of the lifecycle when a brand new product is being studied. Furthermore, social scientists use uniform distribution to represent a lack of knowledge. For example, in a simulation where distribution is not known, uniform random variates are often used. Uniform distribution is also used to describe the measurement error of some instruments or measuring systems. All of these factors (cf. [34]) explain the increasing interest in the choice of simple and computationally efficient tests for hypotheses about the uniform law of analyzed samples.
The aforesaid theoretical and practical importance of the statistic CRTE defined in (2) and the tests for uniformity provides a sufficient motivation to study and reveal some important properties of that statistic and use it for testing the uniformity.
The rest of the paper is organized as follows. In Section 2, we obtain some new findings of the CRTE. In Section 3, we propose the CRTE test statistic for uniformity and discuss some of its properties, including the property of stability. In Section 5, we propose the methods of finding the percentage points of CRTE. In addition, we estimate the percentage points of CRTE. In Section 6, we use a Monte Carlo simulation to carry out the power comparison of the uniformity of different tests against seven alternative distributions.

2. Some Properties of CRTE

In what follows, the symbols ( n p ), ( n d ), and ( n a . s . ) stand for convergence in probability, convergence in distribution and almost surely, as n . In this section, we derive some properties of the measure C T θ ( X ) , which is defined in (2).
Theorem 1.
Let X = ( X 1 , X 2 , , X N ) be a random vector in R N . Furthermore, for all 1 i N and some q > N , let X i L q , i.e., E [ | X i | q ] < . Then, C T θ ( X ) < , θ > 1 .
Proof. 
For all 0 x 1 and 1 θ > 0 , we can easily check that the function g ( x ; θ ) : = 1 θ 1 ( x x θ ) attains its maximum value θ θ 1 θ at x 0 = θ 1 θ 1 . Moreover,
0 g ( x ; θ ) θ θ 1 θ 1 , 0 x 1 , 1 θ > 0 .
On the other hand, for each 0 < p < 1 , we are now going to prove the inequality
g ( x ; θ ) = 1 θ 1 ( x x θ ) θ θ 1 θ x p 1 p , 0 x 1 ,   θ > 1 .
Clearly, in view of the relation (3), the inequality (4) holds if L ( x ; p ) : = x p 1 p 1 . Since, for L ( x ; p ) < 1 , we obtain x < ( 1 p ) 1 p , we consider the ratio g ( x ; θ ) θ θ θ 1 L ( x ; p ) in the interval x < ( 1 p ) 1 p , where θ > 1 . By using (3), we obtain
g ( x ; θ ) θ θ θ 1 L ( x ; p ) 1 p θ 1 ( x 1 p x θ p ) ( 1 p ) x 1 p θ 1 1 p θ 1 ( 1 p ) 1 p p = ( 1 p ) 1 p θ 1 .
Thus, the inequality (4) holds if ( 1 p ) 1 p θ 1 1 or, equivalently, θ 1 + ( 1 p ) 1 p . On the other hand, since p may be arbitrarily chosen in the interval ( 0 , 1 ) , we can choose it to be sufficiently close to 1, in order that the inequality (4) be held for any θ > 1 . This proves the inequality (4). It is worth mentioning that this inequality is not satisfied for θ < 1 , in general, Figure 1 shows this fact.
Now, for any 0 < p < 1 and 1 i N , we can use the inequality (4) to obtain
1 θ 1 P [ | X i | > x i , 1 i N ] P θ [ | X i | > x i , 1 i N ] θ θ 1 θ 1 p P p [ | X i | > x i , 1 i N ] θ θ 1 θ 1 p i = 1 N F ¯ X i p N ( x i ) .
Integrating both sides of (5) over R + N = { x i R N ; x i 0 } and utilizing the Markov inequality, we obtain
C T θ ( X ) θ θ 1 θ 1 p R + N i = 1 N F ¯ X i p N ( x i ) d x i = θ θ 1 θ 1 p i = 1 N 0 F ¯ X i p N ( x i ) d x i = θ θ 1 θ 1 p i = 1 N 0 1 F ¯ X i p N ( x i ) d x i + 1 F ¯ X i p N ( x i ) d x i θ θ 1 θ 1 p i = 1 N 1 + 1 1 x i q E [ | X i | q ] p N d x i ,
which is finite if p q N > 1 . Thus, for any q > N , we can choose p < 1 (sufficiently close to one) to satisfy p q N > 1 and the result follows. □
Remark 1.
For any RV X , it is well-known that the existence of Var ( X ) implies X L 2 . Thus, the existence of Var ( X ) is a sufficient condition for C T θ ( X ) < , θ > 1 .
Theorem 2 (Weak convergence).
Let the sequence X n of N-dimensional random vectors converge in distribution to a random vector X . Furthermore, for all n , let X n L q , q > N . Then,
lim n + C T θ ( X n ) = C T θ ( X ) .
Proof. 
Since X n n d X , we have
lim n + F ¯ | X n | ( x ) F ¯ | X n | θ ( x ) = F ¯ | X | ( x ) F ¯ | X | θ ( x ) , x R + N .
Meanwhile, from (4), we obtain
1 θ 1 F ¯ | X n | ( x ) F ¯ | X n | θ ( x ) θ θ 1 θ 1 p i = 1 N F ¯ | X n i | p N ( x i ) θ θ 1 θ 1 p i = 1 N I [ 0 , 1 ] ( x i ) + x i q I [ 1 , ) ( x i ) E ( | X n i | q ) p N ,
where X n i is the ith component of the random vector X n and I A ( x ) is the indicator function, i.e., I A ( x ) = 1 ,   x A , I A ( x ) = 0 ,   x A . Therefore, if q p N > 1 , 1 θ 1 F ¯ | X n | ( x ) F ¯ | X n | θ ( x ) is bounded by an integrable function. Meanwhile, for any q > N , we can choose p < 1 sufficiently close to one to satisfy p q N > 1 . The use of the dominated convergence theorem completes the proof. □
We show below that the measure C T θ ( X ) dominates the differential entropy (1), which may exist when X has density.
Theorem 3.
Suppose that X is a non-negative RV with CDF F X ( x ) ; then,
C T θ ( X ) C ( θ ) e S ( X ) , 1 θ > 0 ,
where C ( θ ) = e 0 1 log 1 θ 1 ( x x θ ) d x < , and S ( X ) is the differential entropy defined in (1).
Proof. 
Utilizing the log-sum inequality, we obtain
0 f X ( x ) log f X ( x ) 1 θ 1 F ¯ X ( x ) F ¯ X θ ( x ) d x log 1 0 1 θ 1 F ¯ X ( x ) F ¯ X θ ( x ) d x = log C T θ ( X ) .
Furthermore, the left-hand side in (6) is derived as
0 f X ( x ) log f X ( x ) 1 θ 1 F ¯ X ( x ) F ¯ X θ ( x ) d x = S ( X ) 0 1 log 1 θ 1 x x θ d x .
Thus,
log C T θ ( X ) S ( X ) + 0 1 log 1 θ 1 x x θ d x .
The result follows after exponentiating both sides of (7) and using (3), where C ( θ ) = e 0 1 ln 1 θ 1 ( x x θ ) d x θ θ 1 θ < 1 is finite. This completes the proof. □

3. Further Theoretical Aspects and Test Statistic

To establish the test with a null hypothesis H 0 of uniformity, we need the following theorem.
Theorem 4.
Let X be a non-negative RV with a continuous CDF F with a support [ 0 , 1 ] . Then, 0 C T θ ( F ) θ θ 1 θ . Moreover, the value 1 2 ( 1 + θ ) is uniquely attained by the uniform distribution U ( 0 , 1 ) for all 1 θ > 0 .
Proof. 
The proof of inequality 0 C T θ ( F ) θ θ 1 θ follows directly from (3). Meanwhile, using the strict concavity of g ( x ; θ ) = 1 θ 1 ( x x θ ) , we obtain C T θ ( . ) as a concave function of distributions (with support [ 0 , 1 ] ). Thus, C T θ ( F ) = 1 2 ( 1 + θ ) is uniquely acquired by the distribution U ( 0 , 1 ) . This completes the proof. □
Let X 1 , X 2 , , X n be a random sample with a continuous CDF F defined on [ 0 , 1 ] . Furthermore, let X ( 1 ) X ( 2 ) X ( n ) be the corresponding order statistics. Clearly, we can suggest an estimator of C T θ ( F ) by C T θ ( F n ) = 0 g ( F ¯ n ( x ) ; θ ) d x , 1 θ > 0 , where F ¯ n ( x ) = 1 F n ( x ) and F n ( x ) is the empirical CDF, which is given by
F n ( x ) = i = 1 n 1 i n I [ X ( i ) , X ( i + 1 ) ) ( x ) + I [ X ( n ) , ) ( x ) , x R .
Moreover, in order to get a consistent test of the hypothesis of uniformity, we propose the consistent statistic test
R n ( θ ) = 1 θ 1 i = 1 n 1 ( 1 i n ) 1 i n θ ( X ( i + 1 ) X ( i ) ) = i = 1 n 1 A i Δ i ,
where A i = 1 θ 1 ( 1 i n ) 1 i n θ , 1 θ > 0 , and Δ i = ( X ( i + 1 ) X ( i ) ) , i = 1 , 2 , , n 1 .
Theorem 5.
The test based on the sample estimate R n ( θ ) is consistent.
Proof. 
From the Glivenko-Cantelli theorem, see Howard [35], we have sup t | F n ( t ) F ( t ) | n a . s . 0 . Moreover, it is easily asserted that C T θ ( F n ) n a . s . C T θ ( F ) , which proves the theorem. □
Remark 2.
Since C T θ ( F n ) n p C T θ ( F ) , we obtain R n ( θ ) n p 1 2 ( 1 + θ ) , under the null hypothesis H 0 . On the other hand, under the alternative hypothesis (that F is any continuous CDF defined on [ 0 , 1 ] , which is not the uniform) we have R n ( θ ) n p q , where q is a smaller or larger number than 1 2 ( 1 + θ ) .
Theorem 6.
Let the random sample X 1 , X 2 , , X n be drawn from an unknown continuous CDF F defined on [ 0 , 1 ] . Then, 0 R n ( θ ) θ θ 1 θ , 1 θ > 0 .
Proof. 
In view of (3), we obtain
0 R n ( θ ) i = 1 n 1 θ θ 1 θ Δ i = θ θ 1 θ ( X ( n ) X ( 1 ) ) θ θ 1 θ .
This completes the proof. □
Theorem 7.
Under H 0 , the mean and variance of R n ( θ ) are given, respectively, by
E ( R n ( θ ) ) = 1 n + 1 i = 1 n 1 A i , a n d V a r ( R n ( θ ) ) = n ( n + 1 ) 2 ( n + 2 ) i = 1 n 1 A i 2 .
Proof. 
Clearly, for any i = 1 , 2 , , n 1 , the RV Δ i , based on the uniform distribution U ( 0 , 1 ) has a beta distribution with parameter-vector ( 1 , n ) , written Δ i B e t a ( . ; 1 , n ) (cf. [36]). This completes the proof. □
Remark 3.
Under H 0 , we have lim n E ( R n ( θ ) ) = 1 2 ( 1 + θ ) , and lim n V a r ( R n ( θ ) ) = 0 .
The critical region, which specifies the uniformity test, is defined by
C T θ ( F n ) C T θ , α 2 : = lower ,   or   C T θ ( F n ) C T θ , 1 α 2 : = upper ,
where α is the desired level of significance, and C T θ , α is the α quantile of the asymptotic, or approximated, CDF of the test statistic C T θ ( F n ) , under H 0 .

The Stability of CRTE

The stability of measures of information has been studied by several works of literature, see [19,37,38,39,40]. Analogously, we define the stability of the CRTE as the following.
Definition 1.
Let X 1 , X 2 , , X n be a random sample with a continuous CDF F and X 1 , X 2 , , X n be any small deformation of X 1 , X 2 , , X n . Then, the empirical CRTE is stable if ϵ > 0 , δ > 0 , and n Z + , we have i = 1 n | X i X i | < δ | C T θ ( F ¯ n ( X ) ) C T θ ( F ¯ n ( X ) ) | < ϵ .
The next theorem gives a sufficient condition of the stability of the empirical CRTE.
Theorem 8.
For any continuous RV X , the empirical CRTE is stable if X is distributed on a finite interval.
Proof. 
Suppose that the RV X is supported in the finite interval [ a , b ] such that a 0 and b < . In view of (8), the empirical CRTE can be derived as
C T θ ( F ¯ n ( X ) ) = 1 1 n j = 1 n X j , θ = 0 , 1 θ 1 j = 1 n 1 ( F ¯ n ( X ( j ) ) F ¯ n θ ( X ( j ) ) ) Δ j , 1 θ > 0 .
For θ = 0 , the stability of empirical CRTE is obvious. In brief, denote g j = g F ¯ n ( X ( j ) ) ; θ , g j = g F ¯ n ( X ( j ) ) ; θ , and Δ j = X ( j + 1 ) X ( j ) . Thus, when j = 1 n | X j X j | < δ , we obtain
C T θ ( F ¯ n ( X ) ) C T θ ( F ¯ n ( X ) ) = j = 1 n 1 g j Δ j j = 1 n 1 g j Δ j = j = 1 n 1 ( g j g j ) Δ j + j = 1 n 1 g j Δ j Δ j j = 1 n 1 g j g j Δ j + j = 1 n 1 g j | ( X ( j + 1 ) X ( j + 1 ) ) | + | ( X ( j ) X ( j ) ) | ϵ 2 ( b a ) ( X ( n ) X ( 1 ) ) + 2 δ ,
where the second term in the second inequality in (10) is legitimated in view of (3). On the other hand, the first term in that inequality is legitimated from the fact that for any x , x , and arbitrary small ϵ > 0 , δ > 0 , such that | F ¯ n ( x ) F ¯ n ( x ) ) | < ϵ , whenever | x x | < δ (cf. [19]), which implies g j g j ϵ 2 ( b a ) , whenever, j = 1 n | X j X j | < δ . Now, choose δ = ϵ 4 , we obtain ϵ 2 ( b a ) ( X ( n ) X ( 1 ) ) + 2 δ ϵ . This completes the proof. □

4. Percentage Points of the Test Statistic

In this section, we obtain the asymptotic distribution of R n ( θ ) under H 0 . From (8), we can write R n ( θ ) = i = 1 n 1 T i , where T i = A i Δ i , i = 1 , 2 , , n 1 , and Δ i B e t a ( . ; 1 , n ) . Thus, the RV T i has the PDF
f T i ( t ) = n A i 1 t A i n 1 , i = 1 , 2 , , n 1 .
The mean and variance of T i are μ i = E ( T i ) = A i E ( Δ i ) = A i n + 1 and σ i 2 = Var ( T i ) = A i 2 Var ( Δ i ) = n A i 2 ( n + 1 ) 2 ( n + 2 ) , respectively. By using the Lyapunov central limit theorem (cf. Billingsley [41]), we obtain i = 1 n 1 ( T i μ i ) i = 1 n 1 σ i 2 = R n ( θ ) E ( R n ( θ ) ) Var ( R n ( θ ) ) n d Z , where Z is the standard normal RV. Therefore, under H 0 , the percentage point ( α quantile) C T θ , α is estimated for large n by using the asymptotic normality of R n ( θ ) as follows
C T ^ θ , α = E ( R n ( θ ) ) + Var ( R n ( θ ) ) Z α ,
where Z α is the quantile ( α × 100 ) of the standard normal distribution Φ Z ( . ) .
Johannesson and Giri [42] suggested an approximation of the CDF of the linear combination of the finite number of beta RVs. Noughabi [29] utilized this result to approximate the percentage points of the CRE for finite n . By following a similar method, an approximation of R n ( θ ) for finite n can be obtained as follows:
R n ( θ ) i = 1 n 1 A i η ,
where the RV η has the beta distribution B e t a ( . ; a , b ) with
a = ( n + 2 ) ( i = 1 n 1 A i ) 2 ( n + 1 ) ( i = 1 n 1 A i 2 ) 1 n + 1 ,   and b = n n + 1 ( n + 2 ) ( i = 1 n 1 A i ) 2 i = 1 n 1 A i 2 1 .
According to (12), the mean and variance of R n ( θ ) are given, respectively, by
E ( R n ( θ ) ) = i = 1 n 1 A i a a + b , and Var ( R n ( θ ) ) = i = 1 n 1 A i 2 a b ( a + b ) 2 ( a + b + 1 ) .
Using this approximation of R n ( θ ) , the quantiles of order α 2 and 1 α 2 of the approximated CDF of the test statistic C T θ ( F n ) under H 0 are given, respectively, by
l o w e r : = i = 1 n 1 A i B e t a 1 α 2 ; a , b ,   and u p p e r : = i = 1 n 1 A i B e t a 1 1 α 2 ; a , b ,
where B e t a 1 ( . ; a , b ) is the quantile function of the beta distribution, B e t a ( . ; a , b ) , and the parameter-vector ( a , b ) is defined in (13).

Percentage Points

We generated 50,000 samples of sizes n = 10, 20, 30, 40, 50, 70, and 100 from U ( 0 , 1 ) . Utilizing (8), the test statistic R n ( θ ) was estimated by the empirical CRTE for each sample. Moreover, we can see that C T 0.1 ( U ) = 0.4545 , C T 0.5 ( U ) = 0.3333 , C T 0.9 ( U ) = 0.2631 , C T 2 ( U ) = 1 6 , C T 5 ( U ) = 1 12 , and C T 10 ( U ) = 1 22 , where C T θ ( U ) is the CRTE of the CDF U ( 0 , 1 ) . Consequently, for R n ( θ ) , we present the percentage points of the Monte Carlo method, asymptotic normality, and beta approximation by using (9), (11), and (14), respectively. Table 1 shows that as n increased, the difference between percentage points (upper–lower) decreased. Furthermore, the Monte Carlo approach was more accurate than the other two methods for R n ( θ ) , because it had almost minimum differences between percentage points.
Figure 2 and Figure 3 depict the empirical PDFs of the test statistics via Monte Carlo samples for n = 10, 20, 30, 50, and 100, θ = 0.1 , 0.5 , and 0.9 , and n = 10, 20, 30, 50, and 100, θ = 2 , 5 , and 10, respectively. It is noted that the means of the empirical PDFs of R n ( θ ) became nearer to the exact values ( C T θ ( U ) ) as n increased, which indicates that the bias and variance decreased with an increase in n . Moreover, the six corresponding figures to θ = 0.1 , 0.5 , 0.9 , 2 , 5 , and 10 in Figure 2 and Figure 3 reveal that the improvement in the bias and variance does not depend almost on θ .

5. Power Analysis

In this section, we examine the power test of the Monte Carlo method under alternative distributions. The power of R n ( θ ) is estimated by the proportion of the generated samples that are in the critical region. Under seven alternative distributions, the power of R n ( θ ) is calculated by the Monte Carlo method for the generated 50,000 samples each of size n = 10 , 20 , and 40 . The alternative CDFs introduced by Stephens [27] in the power study of uniformity tests are
A l : F ( y ) = 1 ( 1 y ) l , 0 y 1 , l = 1.5 , 2 , B l : F ( y ) = 2 l 1 y l , 0 y 0.5 , 1 2 l 1 ( 1 y ) l , 0.5 y 1 , l = 1.5 , 2 , 3 , C l : F ( y ) = 0.5 2 l 1 ( 0.5 y ) l , 0 y 0.5 , 0.5 + 2 l 1 ( y 0.5 ) l , 0.5 y 1 , l = 1.5 , 2 .
On the report of Stephens [27], the family A l gives points nearer to zero than predictable under the hypothesis of uniformity and is interpreted as a change in the mean, the family B l gives points near 0.5 and is interpreted as a change toward a smaller variance, and the family C l shows two clusters close to 0 , 1 and is interpreted as a change toward a larger variance.
In Table 2, we recorded the power values of the proposed test statistics R n ( θ ) , Kolmogorov–Smirnov (K-S), Kuiper (V), Cramer-von Mises ( W 2 ), Watson ( U 2 ), and Anderson-Darling ( A 2 ), for n = 10 , 20 , and 40, and α = 0.05 . From Table 2, we can draw the following conclusions:
  • For a fixed θ and as n increases, we see that the power of R n ( θ ) increases.
  • For the alternatives B l and C l , the power of R n ( θ ) increases and gives better performance against the other tests when θ tends to 1 ( θ 1 ).
  • For the alternative A l , when n and θ increases, the power of R n ( θ ) increases and gives a better performance than the other tests.

6. Conclusions

Some novel properties of the CRTE quantity were presented such as sufficient conditions for the CRTE to be finite, the weak convergence of the CRTE, the connections between the CRTE, the CRE, and classic differential entropy, and the stability of the empirical CRTE. Furthermore, for the CDFs with support [ 0 , 1 ] , we exhibited that the value of C T θ was within [ 0 , θ θ 1 θ ] . Moreover, the test of uniformity was proposed by calculating the percentage points and power analysis of C T θ . In addition, for C T θ , we obtained the percentage points by using the Monte Carlo method via simulation and normality asymptotic, as well as the beta approximation. A power comparison was performed between the CRTE and other tests, where, by changing the value of θ , we indicated when the test had higher and lower power compared with the other tests.
When we talk about the prospects for future research, we consider here two problems. The first one is to extend the result of this work to a multivariate version of the entropy measures; see, for example, the Formulas (4), (5), and (6) of [43], as a starting point for that future work. The second future research goal is to apply the proposed test to a recent real-world dataset to help solve one of society’s practical concerns.

Author Contributions

The authors contributed equally to the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The simulated data used to support the findings of this study are included within the article.

Acknowledgments

The authors are grateful to Mata Wang and the four anonymous reviewers for their careful and diligent reading, which improved the readability and presentation substantially.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Shannon, C.E. A mathematical theory of communication. Bell. Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
  2. Li, Z.; Li, W.; Liu, R. Applications of entropy principles in power systems: A survey. In Proceedings of the 2005 IEEE/PES Transmission & Distribution Conference & Exposition: Asia and Pacific, Dalian, China, 15–18 August 2005; pp. 1–4. [Google Scholar]
  3. Schmid, R.; Miah, A.M.; Sapunov, V.N. A new table of the thermodynamic quantities of ionic hydration: Values and some applications (enthalpy–entropy compensation and Born radii). Phys. Chem. Chem. Phys. 2000, 2, 97–102. [Google Scholar] [CrossRef]
  4. Song, Y.; Fu, Q.; Wang, Y.F.; Wang, X. Divergence-based cross entropy and uncertainty measures of Atanassov’s intuitionistic fuzzy sets with their application in decision making. Appl. Soft Comput. 2019, 84, 105703. [Google Scholar] [CrossRef]
  5. Gu, R. Multiscale Shannon entropy and its application in the stock market. Physica A 2017, 484, 215–224. [Google Scholar] [CrossRef]
  6. Zhou, R.; Cai, R.; Tong, G. Applications of entropy in finance: A review. Entropy 2013, 15, 4909–4931. [Google Scholar] [CrossRef]
  7. Rao, M.; Chen, Y.; Vemuri, B.C.; Wang, F. Cumulative residual entropy: A new measure of information. IEEE Trans. Inf. Theory 2004, 50, 1220–1228. [Google Scholar] [CrossRef]
  8. Wang, F.; Vemuri, B.C. Non-rigid multi-model image registration using cross-cumulative residual entropy. Int. J. Comp. Vision 2007, 74, 201–215. [Google Scholar] [CrossRef] [Green Version]
  9. Havrda, J.; Charvat, F. Quantification method of classification process: Concept of structural α-entropy. Kybernetika 1967, 3, 30–35. [Google Scholar]
  10. Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [Google Scholar] [CrossRef]
  11. Sati, M.M.; Gupta, N. Some characterization results on dynamic cumulative residual Tsallis entropy. J. Probab. Stat. 2015, 2015, 694203. [Google Scholar] [CrossRef] [Green Version]
  12. Rajesh, G.; Sunoj, S.M. Some properties of cumulative Tsallis entropy of order α. Stat. Pap. 2019, 60, 933–943. [Google Scholar] [CrossRef]
  13. Mohamed, M.S. On cumulative Tsallis entropy and its dynamic past version. Indian J. Pure Appl. Math. 2020, 51, 1903–1917. [Google Scholar] [CrossRef]
  14. Abd Elgawad, M.A.; Alawady, M.A.; Barakat, H.M.; Xiong, S. Concomitants of generalized order statistics from Huang-Kotz Farlie-Gumbel-Morgenstern bivariate distribution: Some information measures. Bull. Malays. Math. Sci. Soc. 2020, 43, 2627–2645. [Google Scholar] [CrossRef]
  15. Abd Elgawad, M.A.; Barakat, H.M.; Xiong, S.; Alyami, S.A. Information measures for generalized order statistics and their concomitants under general framework from Huang-Kotz FGM bivariate distribution. Entropy 2021, 23, 335. [Google Scholar] [CrossRef]
  16. Alawady, M.A.; Barakat, H.M.; Abd Elgawad, M.A. Concomitants of generalized order statistics from bivariate Cambanis family of distributions under a general setting. Bull. Malays. Math. Sci. Soc. 2021, 44, 3129–3159. [Google Scholar] [CrossRef]
  17. Barakat, H.M.; Husseiny, I.A. Some information measures in concomitants of generalized order statistics under iterated Farlie-Gumbel-Morgenstern bivariate type. Quaest. Math. 2021, 44, 581–598. [Google Scholar] [CrossRef]
  18. Park, S. Information measure in terms of the hazard function and its estimate. Entropy 2021, 23, 298. [Google Scholar] [CrossRef]
  19. Xiong, H.; Shang, P.; Zhang, Y. Fractional cumulative residual entropy. Comm. Nonlin. Sci. Num. Simul. 2019, 78, 104879. [Google Scholar] [CrossRef]
  20. Zhang, Y.; Shang, P.; He, J.; Xiong, H. Cumulative Tsallis entropy based on power spectrum of financial time series. Chaos 2019, 29, 103–118. [Google Scholar] [CrossRef]
  21. Irshad, M.R.; Maya, R.; Buono, F.; Longobardi, M. Kernel estimation of cumulative residual Tsallis entropy and its dynamic version under ρ-mixing dependent data. Entropy 2022, 24, 9. [Google Scholar] [CrossRef]
  22. Mohamed, M.S. On cumulative residual Tsallis entropy and its dynamic version of concomitants of generalized order statistics. Commun. Stat. Theory Methods 2020. [CrossRef]
  23. Mohamed, M.S.; Abdulrahman, A.T.; Almaspoor, Z.; Yusuf, M. Ordered variables and their concomitants under extropy via COVID-19 data application. Complexity 2021, 2021, 114. [Google Scholar] [CrossRef]
  24. Toomaj, A.; Atabay, H.A. Some new findings on the cumulative residual Tsallis entropy. J. Comput. Appl. Math. 2021, 400, 113669. [Google Scholar] [CrossRef]
  25. Mohamed, M.S. A measure of inaccuracy in concomitants of ordered random variables under Farlie-Gumbel-Morgenstern family. Filomat 2019, 33, 4931–4942. [Google Scholar] [CrossRef]
  26. Mohamed, M.S. Some new findings on the survival Rényi entropy and application of COVID-19 data. Results Phys. 2021, 31, 104966. [Google Scholar] [CrossRef] [PubMed]
  27. Stephens, M.A. EDF statistics for goodness of fit and some comparisons. J. Am. Stat. Assoc. 1974, 69, 730–737. [Google Scholar] [CrossRef]
  28. Dudewicz, E.J.; Van der Meulen, E.C. Entropy-based tests of uniformity. J. Am. Stat. Assoc. 1981, 76, 967–974. [Google Scholar] [CrossRef]
  29. Noughabi, H.A. Cumulative residual entropy applied to testing uniformity. Commun. Stat. Theory Methods 2020, 50, 1811339. [Google Scholar] [CrossRef]
  30. Mohamed, M.S.; Barakat, H.M.; Alyami, S.A.; Abd Elgawad, M.A. Fractional entropy-based test of uniformity with power comparisons. J. Math. 2021, 2021, 5331260. [Google Scholar] [CrossRef]
  31. Anastasiadis, A. Special Issue: Tsallis Entropy. Entropy 2012, 14, 174–176. [Google Scholar] [CrossRef] [Green Version]
  32. Cartwright, J. Roll over, Boltzmann. Phys. World 2014, 27, 31–35. [Google Scholar] [CrossRef]
  33. Wanke, P. The uniform distribution as a first practical approach to new product inventory management. Int. J. Prod. Econ. 2008, 114, 811–819. [Google Scholar] [CrossRef]
  34. Blinov, P.Y.; Lemeshko, B.Y. A review of the properties of tests for uniformity. In Proceedings of the 2014 12th International Conference on Actual Problems of Electronic Instrument Engineering, Novosibirsk, Russia, 2–4 October 2014. [Google Scholar] [CrossRef]
  35. Howard, G.T. A generalization of the Glivenko-Cantelli theorem. Ann. Math. Stat. 1959, 30, 828–830. [Google Scholar] [CrossRef]
  36. Arnold, B.C.; Balakrishnan, N.; Nagaraja, H.N. A First Course in Order Statistics; Wiley: New York, NY, USA, 1992. [Google Scholar]
  37. Abe, S. Stability of Tsallis entropy and instabilities of Renyi and normalized Tsallis entropies: A basis for q-exponential distributions. Phys. Rev. E 2002, 66, 046134. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Abe, S.; Kaniadakis, G.; Scarfone, A.M. Stabilities of generalized entropies. J. Phys. A Math. Gen. 2004, 37, 10513. [Google Scholar] [CrossRef] [Green Version]
  39. Lesche, B. Instabilities of Renyi entropies. J. Stat. Phys. 1982, 27, 419–422. [Google Scholar] [CrossRef]
  40. Ubriaco, M.R. Entropies based on fractional calculus. Phys. Lett. A 2009, 373, 2516–2519. [Google Scholar] [CrossRef] [Green Version]
  41. Billingsley, P. Probability and Measure; John Wiley & Sons: New York, NY, USA, 2008. [Google Scholar]
  42. Johannesson, B.; Giri, N. On approximations involving the beta distribution. Commun. Stat. Simul. Comput. 1995, 24, 489–503. [Google Scholar] [CrossRef]
  43. Mesiar, R.; Sheikhi, A. Nonlinear random forest classification, a copula-based approach. Appl. Sci. 2021, 11, 7140. [Google Scholar] [CrossRef]
Figure 1. The relation between the functions g ( x ; θ ) and m ( x ) : = θ θ 1 θ L ( x ; p ) with different values of θ and 0 < p < 1 .
Figure 1. The relation between the functions g ( x ; θ ) and m ( x ) : = θ θ 1 θ L ( x ; p ) with different values of θ and 0 < p < 1 .
Mathematics 10 00771 g001
Figure 2. The estimated PDFs of R n ( θ ) under U ( 0 , 1 ) , for 0 < θ < 1 .
Figure 2. The estimated PDFs of R n ( θ ) under U ( 0 , 1 ) , for 0 < θ < 1 .
Mathematics 10 00771 g002
Figure 3. The estimated PDFs of R n ( θ ) under U ( 0 , 1 ) , for θ > 1 .
Figure 3. The estimated PDFs of R n ( θ ) under U ( 0 , 1 ) , for θ > 1 .
Mathematics 10 00771 g003
Table 1. Percentage points of the suggested test statistic R n ( θ ) at level α = 0.05 .
Table 1. Percentage points of the suggested test statistic R n ( θ ) at level α = 0.05 .
n θ R n ( θ )
Normal ApproximationBeta ApproximationMonte Carlo Method
LowerUpperLowerUpperLowerUpper
100.10.12850.63110.16820.66550.22610.5302
0.50.10590.47770.13410.50230.18090.3872
0.90.08730.38150.10930.40070.14860.3056
20.056090.24390.070060.25610.09660.1956
50.02690.12260.03420.129020.04680.1002
100.014070.0669080.018180.07040.02470.0554
200.10.21630.61670.23990.63810.30590.5228
0.50.16830.45810.18470.47310.23810.3781
0.90.13510.36310.14790.37480.19190.2976
20.08560.23090.09380.23840.122010.19017
50.042080.116140.046330.120010.05990.0967
100.0226020.06350.02490.06570.03210.0534
300.10.25910.59930.27570.61470.33910.5145
0.50.19790.44250.20940.45320.26080.3723
0.90.15780.350010.16680.35830.20880.2933
20.09980.22230.10560.22760.13210.1868
50.04930.11170.05230.11450.065060.09504
100.026640.061130.028310.06260.035050.0524
400.10.28530.58590.29820.59790.35810.5098
0.50.21590.43130.22480.43960.27330.3688
0.90.17170.34080.17860.34730.21780.2904
20.108630.21630.113050.22050.13780.1847
50.05370.108690.056080.11080.06790.0938
100.02910.05940.03040.06060.03670.0516
500.10.30340.57550.31390.58530.37040.5052
0.50.22840.42290.23560.42970.28110.3661
0.90.18130.334050.18690.33930.22390.2883
20.11460.21190.11820.21530.14110.1834
50.05680.10640.05870.10820.06970.0929
100.030820.05820.03180.05920.03770.05111
700.10.32730.56040.33490.56760.38530.4993
0.50.24480.411090.250050.416080.29070.3617
0.90.194030.32450.19810.32840.23090.28504
20.12270.20580.12530.20830.14590.1811
50.060940.10330.06230.10460.07220.0916
100.033080.0565090.03380.05720.039080.0503
1000.10.34860.54560.354040.55080.39820.4943
0.50.25940.39970.26310.40330.298830.3581
0.90.20540.31550.20830.31830.23730.2822
20.12990.200080.13170.20180.14970.1792
50.06460.10030.06550.10130.07410.0904
100.03510.05480.03560.05530.040180.0496
Table 2. Power estimates of the tests at the level α = 0.05 .
Table 2. Power estimates of the tests at the level α = 0.05 .
nAlternative R n ( θ ) K-SV W 2 U 2 A 2
θ = 0 . 1 θ = 0.9 θ = 2 θ = 10
10 A 1.5 0.0790.066560.080720.105580.126160.07560.14560.077760.1877
A 2 0.114040.125640.187060.272160.302980.16310.35510.163080.4761
B 1.5 0.0950.118680.137620.105040.073520.09710.07410.10170.1349
B 2 0.222020.305720.35770.254580.11840.23070.11040.24810.3269
B 3 0.53490.724440.79970.617280.24240.53940.21540.56990.72308
C 1.5 0.08040.118340.130020.10760.03420.09740.02390.10310.0222
C 2 0.126660.243660.278960.185780.04020.23330.011140.24750.00924
20 A 1.5 0.107580.072720.131040.240560.21790.12260.252080.12250.3235
A 2 0.152180.18760.395460.639340.56160.34860.62410.33580.7538
B 1.5 0.130840.253160.304620.183580.08690.16340.07810.17860.1774
B 2 0.34620.668580.751460.506040.18490.46470.1620.50670.52802
B 3 0.761440.98520.995940.93190.45880.87110.46150.89780.93998
C 1.5 0.08970.195960.22440.141420.05090.16210.024060.17910.0213
C 2 0.13610.423380.538460.25190.11620.46330.04620.50480.0338
40 A 1.5 0.200480.083980.249620.510940.31440.180020.3660.17210.4498
A 2 0.26630.296040.725380.949420.75220.54470.81050.50710.8973
B 1.5 0.16680.51490.615880.349120.10210.24770.08730.26670.2281
B 2 0.48240.958940.983220.84080.27060.66950.251080.70760.7002
B 3 0.90634110.999520.67010.975060.72370.98190.99104
C 1.5 0.101340.327460.44690.182160.070.24920.03030.26780.0271
C 2 0.15110.646960.870820.328540.20770.67110.12580.71110.1105
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Mohamed, M.S.; Barakat, H.M.; Alyami, S.A.; Abd Elgawad, M.A. Cumulative Residual Tsallis Entropy-Based Test of Uniformity and Some New Findings. Mathematics 2022, 10, 771. https://doi.org/10.3390/math10050771

AMA Style

Mohamed MS, Barakat HM, Alyami SA, Abd Elgawad MA. Cumulative Residual Tsallis Entropy-Based Test of Uniformity and Some New Findings. Mathematics. 2022; 10(5):771. https://doi.org/10.3390/math10050771

Chicago/Turabian Style

Mohamed, Mohamed S., Haroon M. Barakat, Salem A. Alyami, and Mohamed A. Abd Elgawad. 2022. "Cumulative Residual Tsallis Entropy-Based Test of Uniformity and Some New Findings" Mathematics 10, no. 5: 771. https://doi.org/10.3390/math10050771

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop