1. Introduction
The main purpose of this article is to introduce a family of lifetime distributions, called the Tsallis log-scale-location family. For this family, we analyze different stochastic orders as well as the ordering of some characteristics of the distributions, such as the Gini index and the moments. Given the specificity of our research, we briefly present below the main ingredients of our work, together with some classical and recent references: (i) survival analysis; (ii) the Gini index and its applications in economics, demography and the social sciences in general; (iii) Tsallis statistics and related concepts, such as the Tsallis exponential and logarithm functions, the q-exponential family and the maximum entropy principle; (iv) stochastic orders and related properties. Since our work lies at the crossroads of these research directions, we think it is useful to give a short overview of the problems in these fields that are related to it.
In risk theory, life expectancy is a significant measure. Because the average standard of living has risen in some countries, there is growing interest in the extent to which this standard of living is equally accessible to all people. For this reason, measures of variability in lifespan have recently received considerable attention (Anand et al. [1]). Changes in mortality are measured by variables such as age-specific death rates, life expectancy at birth, probabilities of death and the survival function. Survival analysis is used in medicine, biology, social sciences such as economics, engineering (reliability and failure time analysis) and many other sciences. Survival analysis methods depend on the survival distribution and on the hazard function. Parametric models are easy to fit and handle in practice because they are defined by a small, fixed number of unknown parameters. This allows one to use standard statistical methods to carry out inference, although these techniques depend on the adequacy of the specific parametric model used. In biomedical applications, for example, nonparametric models (e.g., estimators of the survival curve) and semi-parametric models (e.g., the Cox proportional hazards model) are most important because they are flexible enough to adapt to a wide range of forms of the hazard function. Even so, parametric models are used in biomedical research and may be appropriate when the survival data approximately follow a parametric form.
Another important subject for our work is the Gini coefficient, or Gini index. The Gini coefficient is the most common statistical index of diversity or inequality in the social sciences (see, e.g., Gini [2,3], Nygård and Sandström [4], Kakwani [5], Kendall et al. [6], Allison [7]). It is used in econometrics as a standard measure of inter-individual or inter-household inequality in income and wealth (Atkinson [8,9], Sen [10], Anand [11]). Illsley and Le Grand [12], who justified the use of the Gini coefficient for the analysis of inequality in health in the 1980s, stressed that the individual-based measurement of inequality in health is a way to achieve universal comparability of degrees of inequality over time and across countries. They also computed the Gini coefficient from distributions of deaths by age in real populations. Other researchers linked the Gini coefficient and other measures of inter-individual inequality in age at death with the life table (Hanada [13], Silber [14], Wilmoth and Horiuchi [15]). Hicks [16] proposed using the Gini coefficient to adjust average life expectancy for variability in order to construct the inequality-adjusted human development index. Recent interesting articles were proposed by Kim and Kim [17], Bonetti et al. [18], Ostasiewicz and Mazurek [19]. Among the recent works related to the Gini index and its statistical extensions and applications, we mention the Gini regressions and the principal component analysis based on the Gini correlation matrix developed in Charpentier et al. [20,21], and the linear discriminant analysis based on generalized Gini correlation indexes proposed in Condevaux et al. [22]. It is worth noting that the Gini index can be expressed in terms of the Lorenz curve as the area between the first diagonal (equality) and the Lorenz curve, divided by the whole area below the diagonal. If we compute the Gini index on a finite sample of observations, it is equal to zero if all individuals die at the same age and it is equal to one if all the individuals except one die at birth and the remaining one dies at a positive age.
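The finite-sample behaviour described above can be checked numerically. The following is a minimal sketch, assuming the Gini index is computed as the mean absolute difference divided by twice the mean; the function name `gini_index` and the sample ages are illustrative and are not taken from the cited works. Under this particular normalization the extreme case "all but one die at birth" gives (n − 1)/n for a sample of size n, which approaches one as the sample grows; other normalizations rescale this value.

```python
def gini_index(ages):
    """Gini index of a finite sample: mean absolute difference / (2 * mean)."""
    n = len(ages)
    total = sum(ages)
    if n == 0 or total <= 0:
        raise ValueError("need a non-empty sample with a positive total")
    xs = sorted(ages)
    # Sum of |x_i - x_j| over all ordered pairs, written with order statistics:
    # the k-th smallest value (0-based) enters with coefficient 2k - n + 1.
    pair_sum = 2.0 * sum((2 * k - n + 1) * x for k, x in enumerate(xs))
    return pair_sum / (2.0 * n * total)


if __name__ == "__main__":
    print(gini_index([70, 70, 70, 70]))  # everyone dies at the same age
    print(gini_index([0, 0, 0, 80]))     # all but one die at birth
```

Sorting first keeps the computation at O(n log n) instead of looping over all pairs of observations.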
An important concept in demography is longevity, which denotes a long duration of life and is used as a synonym for high life expectancy. It is well known that a significant increase in longevity has been observed during the past several centuries: in particular, best-practice life expectancy at birth has risen by 2.5 years per decade since 1840 (Oeppen and Vaupel [23]). Such an increase in life expectancy is one of the consequences of changes in the survival distribution of the population over different cohorts. Evidence suggests that higher life expectancy at birth is associated with a lower concentration of survival times, both across countries and over time. An empirical analysis of approximately 45 countries for the years 1960–1990 reveals a tight negative association between life expectancy at birth and the Gini coefficient (Shkolnikov et al. [24]). Specifically, during the first three quarters of the 20th century the inter-individual inequality in length of life declined.
Several distributions have been proposed to model real lifetime data. The Weibull distribution is one of the most commonly used for this purpose. In practice, it has been shown to be very flexible in modeling various types of lifetime data with monotone failure rates, but it is not useful for modeling the bathtub-shaped and unimodal failure rates that are common in reliability and biological studies. The Weibull distribution is of great interest because of its many special features and its ability to fit data from various fields, ranging from life data to observations made in economics and business administration, meteorology, hydrology, quality control, acceptance sampling, statistical process control, inventory control, physics, chemistry, geology, geography, astronomy, medicine, psychology, material science, engineering and biology. Iriarte et al. [25] introduced a new class of probability distributions (the Lambert-F distributions class) and analyzed its hazard rate function. Al-Mofleh et al. [26] proposed a new two-parameter generalized Ramos–Louzada distribution and analyzed its hazard rate function. Gigliarano et al. [27] worked with the log-scale-location model and analyzed the Gini index and the first moment of this model. Important population analyses were made by Haberman and Renshaw [28], Finkelstein [29], Debon et al. [30], Canudas-Romo [31], Brown et al. [32], Booth and Tickle [33]. Hazra et al. [34] considered the location-scale family of distributions and derived conditions under which the largest order statistic of a set of random variables with different or equal location parameters, as well as different or equal scale parameters, dominates that of another set of random variables with respect to various stochastic orders.
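The remark above that the Weibull distribution only produces monotone failure rates can be illustrated with its hazard rate, h(t) = (k/λ)(t/λ)^(k−1) for shape k and scale λ. The sketch below is illustrative only: the function names and the grid of time points are assumptions made for this example.

```python
def weibull_hazard(t, shape, scale=1.0):
    """Hazard rate of a Weibull(shape, scale) lifetime at time t > 0."""
    return (shape / scale) * (t / scale) ** (shape - 1.0)


def is_monotone(values):
    """True if the sequence is non-decreasing or non-increasing."""
    pairs = list(zip(values, values[1:]))
    return all(a <= b for a, b in pairs) or all(a >= b for a, b in pairs)


if __name__ == "__main__":
    grid = [0.1 * k for k in range(1, 101)]  # time points in (0, 10]
    for shape in (0.5, 1.0, 2.0):
        hazards = [weibull_hazard(t, shape) for t in grid]
        print(shape, is_monotone(hazards))
```

For shape below one the hazard decreases, for shape equal to one it is constant, and for shape above one it increases, so no choice of shape yields a bathtub or unimodal curve.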
Recently, the notion of the exponential family has been generalized by Naudts [35,36,37,38,39,40]. The same definition of the generalized exponential family has been introduced in the mathematical literature by Naudts [36,40], Grunwald and Dawid [41], Eguchi [42], Briggs and Beck [43]. This class of models was also derived using the maximum entropy principle in Abe [44], Hanel and Thurner [45] and in the context of game theory by Topsoe [46,47]. The notion of q-exponential family is connected with Amari’s family (see Amari [48]), studied in the context of information geometry. The geometric approach is also very appealing in the context of statistical physics (see, for instance, [49,50]). The q-deformed exponential and logarithmic functions were first introduced in Tsallis’ statistics in 1994 [51].
The last important concept for our work is that of stochastic order. Stochastic orders provide methods for comparing random variables and vectors, and they are now used in many areas such as statistics, operations research, biomathematics, actuarial sciences, economic theory, queuing theory, risk management and other related fields. For a comprehensive review of the properties and characterizations of stochastic orderings, including a variety of applications, the reader is referred to the monographs of Shaked and Shanthikumar [52], Levy [53], Denuit et al. [54], Balakrishnan et al. [55]. Many of these orders can be characterized as so-called integral stochastic orders, which are obtained by comparing expectations of functions in a certain class. Lando and Bertoli-Barsotti [56] obtained a method for deriving second-order stochastic dominance between multiparametric families which can be decomposed into a functional composition of two cumulative distributions and a quantile function. The method is applied to stochastic comparisons of order statistics. Recently, Sarabia et al. [57] introduced a general class of multivariate GB2 distributions based on a generalization of the order statistics distribution, whose construction results in a multivariate GB2 distribution with support above the diagonal. Aijaz et al. [58] introduced a new Hamza two-parameter distribution and studied its properties, including the moments, stochastic orderings, Bonferroni and Lorenz curves, Rényi entropy, order statistics, hazard rate function and mean residual function. Analytic representations of the multivariate Lorenz surface for a relevant type of models based on the class of distributions with given marginals described by Sarmanov and Lee have been obtained recently by Sarabia and Jorda [59]. Das and Kayal [60] obtained ordering results for the largest and the smallest order statistics arising from dependent heterogeneous exponentiated location-scale random observations, for the case where the sets of observations follow a common Archimedean copula or different ones. Moreover, they derived sufficient conditions under which the usual stochastic order and the reversed hazard rate order between the extreme order statistics hold. Aijaz et al. [61] proposed the inverse analogue of the Ailamujia distribution. The statistical properties investigated for the new distribution include moments, moment generating function, order statistics, survival measures, Shannon entropy, mode and median. Recently, Castaño-Martínez et al. [62] extended the results related to the increasing convex order of relative spacings for two distributions from consecutive spacings to the case of general spacings. Panja et al. [63] considered stochastic comparisons of lifetimes of series and parallel systems with dependent and heterogeneous components, where the lifetimes follow the proportional odds model and the joint distribution of the component lifetimes is modeled by an Archimedean survival copula. By comparing heterogeneous series systems with location-scale family distributed components, Kundu and Chowdhury [64] proved that systems with dependent series components modeled by an Archimedean copula with more dispersion in the location or scale parameters perform better in the sense of the usual stochastic order.
Taking into account all these bibliographic references and the associated discussions presented so far, we can now state that the main objective of our work is to introduce a generalized log-scale-location family of distributions that extends existing classes from the literature. We thus obtain a more flexible model, of interest for lifetime applications in various fields. Then, for this family of distributions, we show that some types of stochastic orders are preserved under certain conditions. To illustrate our findings, we consider the data from a simpler model existing in the literature and, applying our generalized log-scale-location model, we illustrate graphically that certain stochastic orders are preserved, which is consistent with some of the theoretical results obtained in the article.
The paper is organized as follows. Section 2 introduces some preliminary notions and results. In Section 3 we define the generalized -log-scale-location model and we analyze the moments of these models in Section 4. Necessary or sufficient conditions for usual stochastic order are derived in Section 5, while necessary conditions for the Lorenz order, order for Gini index and generalized Gini index are obtained in Section 6. The hazard rate order is analyzed in Section 7, while in Section 8 the excess wealth order and convex order are studied. A real data application is presented in Section 9 and some general conclusions of the article are given in the last section.
2. Preliminaries
In this section we introduce some notation and basic definitions that will be used throughout the article. Besides some classical notions and notation, we give here the definitions of the Lorenz curve and the Gini index of a random variable, the notions of the q-deformed Tsallis exponential function and the q-deformed Tsallis logarithm function, the associated q-Weibull and q-Normal distributions, some notions of stochastic ordering and the relationships between them.
Let be a probability space and a random variable. We denote by the corresponding distribution function, and by the corresponding survival function, We set .
For a function , and We say that a function is:
(i) non-decreasing (or increasing) if $f(x) \le f(y)$ for all $x, y$ with $x \le y$;
(ii) non-increasing (or decreasing) if $f(x) \ge f(y)$ for all $x, y$ with $x \le y$.
If X is absolutely continuous with respect to the Lebesgue measure, then we denote its density function. We also denote the inferior quantile function of We will also use the notation
If is differentiable, we define the hazard rate function , where for a function ,
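For a positive random variable $X$ with $E(X) < \infty$ we introduce the Lorenz curve of $X$, defined by
$$L_X(p) = \frac{1}{E(X)} \int_0^p F_X^{-1}(u)\,du, \quad p \in [0,1],$$
where $F_X^{-1}$ denotes the quantile function of $X$, and also the Gini index of $X$ (see Arnold [65,66]), defined by
$$G_X = 1 - 2\int_0^1 L_X(p)\,dp.$$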
Other formulas for computing the Gini index of
X can also be derived (see, e.g., Gigliarano et al. [
27]) are the following:
and
For
, one can define the so-called generalized
a-Gini index by
We notice that
and a simple calculation shows that
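As a numerical illustration of computing the Gini index of a lifetime from its distribution, the sketch below assumes the standard survival-function representation for a positive random variable with finite mean, G = 1 − (∫ S(t)² dt) / (∫ S(t) dt), where S is the survival function and both integrals run over (0, ∞). The helper names, the truncation point `upper` and the trapezoidal rule are choices made for this example only.

```python
import math


def trapezoid(func, lower, upper, steps):
    """Composite trapezoidal rule for func on [lower, upper]."""
    width = (upper - lower) / steps
    total = 0.5 * (func(lower) + func(upper))
    for k in range(1, steps):
        total += func(lower + k * width)
    return total * width


def gini_from_survival(survival, upper, steps=200_000):
    """Approximate G = 1 - int S^2 / int S, truncating both integrals at `upper`.

    `upper` must be large enough that survival(upper) is negligible.
    """
    mean = trapezoid(survival, 0.0, upper, steps)
    squared = trapezoid(lambda t: survival(t) ** 2, 0.0, upper, steps)
    return 1.0 - squared / mean


def exponential_survival(rate):
    """Survival function t -> exp(-rate * t) of an exponential lifetime."""
    return lambda t: math.exp(-rate * t)


if __name__ == "__main__":
    print(gini_from_survival(exponential_survival(2.0), upper=25.0))
```

For an exponential lifetime the two integrals are 1/(2·rate) and 1/rate, so the result should be close to 1/2 whatever the rate, which reflects the scale invariance of the Gini index.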
Let us now recall the definitions of the
q-deformed Tsallis logarithm function and of the
q-deformed Tsallis exponential function introduced by Tsallis [
51] and also give some properties.
Definition 1. For a real number the q-deformed Tsallis logarithm function is with
and
All along this article we will use the notations or for the q-deformed Tsallis logarithm function computed in a point
Remark 1. The function has the following properties:
(i)
(ii) is strictly non-decreasing function on because for all
Definition 2. For a real number the q-deformed Tsallis exponential function is with
and
All along this article we will use the notations or for the q-deformed Tsallis exponential function computed in a point
Remark 2. The function has the following properties:
(i)
(ii) for all and for all
(iii) is non-decreasing function on
(iv) For , we have
(v) For , we have
These functions are also studied in Naudts [67], which also provides the notion of the q-exponential family.
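For readers who want to experiment with these functions, here is a minimal sketch of the q-deformed logarithm and exponential in their usual form from Tsallis statistics: log_q(x) = (x^(1−q) − 1)/(1−q) for q ≠ 1, with log_1 = log, and exp_q(x) = [1 + (1−q)x]^(1/(1−q)) where the bracket is positive, with exp_1 = exp. The function names and the conventions used outside the natural domain (0 for q < 1, +∞ for q > 1) are assumptions of this example and may differ from the conventions adopted in the definitions above.

```python
import math


def log_q(x, q):
    """q-deformed logarithm of x > 0 (usual Tsallis form)."""
    if x <= 0:
        raise ValueError("log_q is defined here for x > 0 only")
    if q == 1.0:
        return math.log(x)
    return (x ** (1.0 - q) - 1.0) / (1.0 - q)


def exp_q(x, q):
    """q-deformed exponential, inverse of log_q on its range."""
    if q == 1.0:
        return math.exp(x)
    base = 1.0 + (1.0 - q) * x
    if base <= 0.0:
        # Assumed convention outside the domain where the bracket is positive:
        # cut off at 0 for q < 1, treat as +infinity for q > 1.
        return 0.0 if q < 1.0 else math.inf
    return base ** (1.0 / (1.0 - q))


if __name__ == "__main__":
    for q in (0.5, 1.0, 1.5):
        y = log_q(2.5, q)
        print(q, y, exp_q(y, q))
```

The round trip exp_q(log_q(x)) returns x for every q, and log_q(x) approaches the natural logarithm as q tends to 1.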
The q-deformed Tsallis exponential function and q-deformed Tsallis logarithm function allow one to introduce new random variables, by analogy with the ones defined using classical exponential and logarithm functions. We will introduce now the notions of q-Weibull distribution and q-Normal distribution.
Definition 3. We say that X is q -Weibull distributed with and denote it by if In this case:
For the general form of this distribution and further investigations, see Picoli et al. [68].
Definition 4. We say that X is q-Normal distributed with parameters and denote it by if
For further investigations on this distribution and related topics, such as the q-Central Limit Theorem, see Umarov et al. [69].
Let us now recall some definitions of stochastic orders and also some properties of these orders. All these definitions and results can be found in Shaked and Shanthikumar [52].
Definition 5 (cf. [
52]).
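Let X, Y be two random variables. X is said to be smaller than Y in the
(i) stochastic order (written as $X \le_{st} Y$) if $\bar{F}_X(t) \le \bar{F}_Y(t)$ for all $t$;
(ii) hazard rate order (written as $X \le_{hr} Y$) if $h_X(t) \ge h_Y(t)$ for all $t$;
(iii) Lorenz order (written as $X \le_{Lorenz} Y$) if $L_X(p) \ge L_Y(p)$ for all $p \in [0,1]$;
(iv) dispersive order (written as $X \le_{disp} Y$) if $F_X^{-1}(\beta) - F_X^{-1}(\alpha) \le F_Y^{-1}(\beta) - F_Y^{-1}(\alpha)$ ∀ $0 < \alpha \le \beta < 1$;
(v) excess wealth order (written as $X \le_{ew} Y$) if $\int_{F_X^{-1}(p)}^{\infty} \bar{F}_X(t)\,dt \le \int_{F_Y^{-1}(p)}^{\infty} \bar{F}_Y(t)\,dt$ for all $p \in (0,1)$.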
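To make the first two orders concrete, the sketch below checks them numerically on a grid for two exponential lifetimes. It assumes the standard characterizations from the stochastic orders literature: the usual stochastic order holds when the survival function of X lies below that of Y everywhere, and the hazard rate order holds when the ratio of the survival function of Y to that of X is non-decreasing. The function names, the grid and the tolerance are illustrative, and a check on a finite grid is numerical evidence, not a proof.

```python
import math


def survival_exponential(rate):
    """Survival function t -> exp(-rate * t) of an exponential lifetime."""
    return lambda t: math.exp(-rate * t)


def is_smaller_st(surv_x, surv_y, grid, tol=1e-12):
    """Grid check of X <=_st Y: S_X(t) <= S_Y(t) at every grid point."""
    return all(surv_x(t) <= surv_y(t) + tol for t in grid)


def is_smaller_hr(surv_x, surv_y, grid, tol=1e-12):
    """Grid check of X <=_hr Y: S_Y(t) / S_X(t) non-decreasing along the grid."""
    ratios = [surv_y(t) / surv_x(t) for t in grid]
    return all(a <= b + tol for a, b in zip(ratios, ratios[1:]))


if __name__ == "__main__":
    grid = [0.05 * k for k in range(201)]   # time points in [0, 10]
    s_x = survival_exponential(2.0)         # higher failure rate
    s_y = survival_exponential(1.0)         # lower failure rate
    print(is_smaller_st(s_x, s_y, grid), is_smaller_hr(s_x, s_y, grid))
    print(is_smaller_st(s_y, s_x, grid), is_smaller_hr(s_y, s_x, grid))
```

With rates 2 and 1, the lifetime with the higher failure rate is smaller in both orders, and reversing the roles makes both checks fail.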
An equivalent definition of the stochastic order is given next.
Definition 6 (cf. [52]).
Let X, Y be two random variables. X is said to be smaller than Y in the stochastic order (written as $X \le_{st} Y$) if $E[\phi(X)] \le E[\phi(Y)]$ for all non-decreasing functions $\phi$, provided that the means exist.
Definition 7 (cf. [52]).
Let X, Y be two random variables. X is said to be smaller than Y in the convex order (written as $X \le_{cx} Y$) if $E[\phi(X)] \le E[\phi(Y)]$ for all convex functions $\phi$, provided that the means exist.
The following two results concern some properties of the stochastic order and of the dispersive order, respectively.
Theorem 1 (cf. [
52]).
Let random variable and the functions If then Theorem 2 (cf. [
52], Theorem 3.B.10, p. 152).
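Let X, Y be two random variables such that $X \le_{disp} Y$.
(i) If $X \le_{st} Y$, then $\phi(X) \le_{disp} \phi(Y)$ for all non-decreasing convex or non-increasing concave functions $\phi$.
(ii) If $X \ge_{st} Y$, then $\phi(X) \le_{disp} \phi(Y)$ for all non-increasing convex or non-decreasing concave functions $\phi$.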
Let us now state some well known properties between these stochastic orders.
Proposition 1 (cf. [52], Theorem 1.B.1, p. 18).
If $X \le_{hr} Y$, then $X \le_{st} Y$.
Proposition 2 (cf. [
52]).
Proposition 3 (cf. [
52]).
Proposition 4 (cf. [
52], 2007, p. 166).
If is finite then In particular, if is finite, then Proposition 5 (cf. [
52], 3.C.9, p. 166).
3. Generalized -Log-Scale-Location Model
In this section we propose a new log-scale-location class of lifetime distributions that extends the existing classes presented in [27]. We compute several characteristics of these distributions, such as the hazard rate, the Gini index and the generalized a-Gini index.
Definition 8. For a real random variable , functions and we say that the positive random variable T follows the -log-scale-location model if
In this definition of the -log-scale-location model, the real number x stands for any continuous covariate on which the distribution of the lifetime T depends. One possible generalization of this model would be the introduction of several covariates.
For
we obtain the model (5) from Gigliarano et al. [
27].
If the random variable
is absolutely continuous with respect to the Lebesgue measure, with density
then we can immediately obtain several characteristics of the random variable
namely the density
the hazard rate
the Gini index
and the generalized
a-Gini index
and
We also have the alternative expression of the generalized
a-Gini index in terms of the Lorenz curve of
4. The Moments of
In this section we analyze the stochastic order and, as a consequence, the inequalities between the moments of two log-scale-location models. The next theorem provides a formula for computing the mean of
Theorem 3. Let a random variable and the positive random variable T follows the -log-scale-location model, with . If there exists then Proof. We make the change of variable
and we obtain
□
In particular, for
we have
and, for
it results that
where
R is a random variable with survival function
It is worth noticing that we have thus recovered Proposition 2 from Gigliarano et al. [27].
In the next results we investigate several particular cases, namely the cases where the baseline distribution of is a q-Weibull distribution, a q-Normal distribution or a bounded distribution.
Corollary 1. Let and T following the -log-scale-location model, , with , Then
Proof. If
then
In this case we have
Thus there exists
such that
It results
If
then
In this case, we have
It is obvious that
thus
□
Corollary 2. Let and T following the -log-scale-location model, . Then Proof. If
then
thus there exists an
such that
It results
and then
Thus
If
then
It results that
Thus
□
Corollary 3. Let be a real random variable with the property that there exists such that a.s. Then:
(i) For , there exists (ii) For , there exists Proof. Since a.s., then for all and we obtain that
Then
Applying Theorem 3 we obtain the desired result. □
5. Stochastic Order of These Models
If we want to establish an order between the moments of two random variables, it is complicated to compute all the moments and then compare them. For this reason, we give a theorem which characterizes the stochastic order of these models.
Theorem 4. Let be a random variable, let be a positive random variable that follows the -log-scale-location model and let be a positive random variable that follows the -log-scale-location model. Then if and only if Proof. Let us consider
. We have:
For we have
We have
and
Then:
and
Then and
Now, let us prove the converse. For
we have
Thus □
The next proposition is a particular case where we examine the behavior of the survival distribution.
Proposition 6. If u is constant and b is a non-decreasing (non-increasing) function, then:
(i) for is a non-decreasing (non-increasing) function;
(ii) for is a non-increasing (non-decreasing) function; and
(iii) for is a constant function.
Proof. We consider only the case where b is a non-decreasing function (the non-increasing case can be proved similarly). Then:
(i) If
i.e.,
, and if we assume that
then
(ii) If
i.e.,
and if we assume that
then
(iii) If
i.e.,
and if we assume that
then
□
This proposition generalizes Corollary 1 from Gigliarano et al. [
27] when
The next two results are consequences of Theorem 4.
Corollary 4. Let us consider a random variable , a random variable following the -log-scale-location model and a random variable following the -log-scale-location model. If then
Proof. This is a consequence of Theorem 4 and Definition 6. □
Proposition 7. Let be a random variable following the -log-scale-location model and let be a random variable following the -log-scale-location model, with
Then if and only if Proof. Let us consider
. We have:
For we have
We have
and
Then:
and
Then and
Now, let us prove the converse. For
we have
Thus □
6. Lorenz Order and Gini Index
In this section we give some results on stochastic orderings according to Lorenz curve and Gini index. We also give some results related to the generalized Gini index.
The next theorem characterizes the dispersive order.
Theorem 5. If b is a non-decreasing (non-increasing) function and is a positive random variable following the -log-scale-location model, is a positive random variable following the -log-scale-location model, , then Proof. We consider only the case where b is a non-decreasing function (the non-increasing case can be proved similarly). Let us consider and with It results that Let
Then
and
It results
But
Then
Therefore Thus □
The next theorem characterizes the Lorenz order.
Theorem 6. If b is a non-decreasing (non-increasing) function and is a positive random variable following the -log-scale-location model, is a positive random variable following the -log-scale-location model, , with () then
Proof. As previously, we consider only the case where b is a non-decreasing function (the non-increasing case can be proved similarly). From Theorem 5 we have Notice that and the function
is increasing and convex. Then It results Thus □
Let us now state a consequence of this result.
Corollary 5. If b is a non-decreasing (non-increasing) function and T is a positive random variable following the -log-scale-location model, , then the function is non-decreasing (non-increasing).
Proof. This is a direct consequence of Theorem 6. □
This result shows that, under the class of -log-scale-location model, the Gini index is non-decreasing (non-increasing) as x increases, if the shape parameter is non-decreasing (non-increasing).
Theorem 6 and Corollary 5 generalize Theorem 1 from Gigliarano et al. [
27] when
Let us now focus on the generalized a-Gini index. The following two results characterize the generalized a-Gini index.
Proposition 8. Let X, Y be two random variables. If then
Proof. □
The next result generalizes Theorem 6 and Corollary 5.
Corollary 6. If b is a non-decreasing (non-increasing) function and T follows the -log-scale-location model, , then:
(i) The function is non-decreasing (non-increasing), for
(ii) The function is non-increasing (non-decreasing), for
Proof. This is a consequence of Proposition 8. □
This result shows that, under the class of -log-scale-location model, the a-Gini index is non-decreasing (non-increasing) as x increases, if the shape parameter is non-decreasing (non-increasing).
7. The Hazard Rate Order
In this section we give results for the hazard rate order and hazard rate functions.
Theorem 7. If are non-decreasing (non-increasing) functions and is a positive real random variable following the -log-scale-location model, is a positive real random variable following the -log-scale-location model, then ().
Proof. The proof will be done for non-decreasing
(the non-increasing case can be proved similarly). For
and
we have
Proposition 9. If and T is a positive real random variable that follows the -log-scale-location model, with non-decreasing, then is non-increasing.
Proof. It is clear that is a non-decreasing function of This implies that is non-increasing. □
Proposition 10. If , where μ can be (), , , , , , , , , and T follows the -log-scale-location model, with non-increasing (non-decreasing), then is non-increasing (non-decreasing).
Proof. We have that is non-decreasing. □
8. The Excess Wealth and Convex Order
In this section we analyze the excess wealth and convex orders. Theorem 8 gives a sufficient condition for the excess wealth order of two -log-scale-location models, while Theorem 9 gives sufficient conditions for the convex order of these models.
Theorem 8. Let us consider some positive real random variables and If b is non-decreasing (non-increasing) and follows the -log-scale-location model, while follows the -log-scale-location model, , with () then Proof. Without loss of generality we take b to be non-decreasing.
From Theorem 5 we have We have and the function is increasing and convex. Then
It results that Thus □
Theorem 9. If b is a non-decreasing (non-increasing) function, is a positive random variable that follows the -log-scale-location model, is a positive random variable that follows the -log-scale-location model, , with and () then
Proof. It results from Theorem 8 and Proposition 4. □
The next result is a consequence of Theorem 9 and Proposition 4.
Corollary 7. If b is a non-decreasing (non-increasing) function, is a positive random variable that follows the -log-scale-location model, is a positive random variable that follows the -log-scale-location model, , with finite ( finite) and () then Proof. It results from Theorem 9 and Proposition 4. □
9. Real Data Application
In this section we illustrate the theoretical results obtained in the paper. We use the data and the estimated parameters of a Pareto distribution from Nadarajah et al. [
70]. The data represents automobile insurance claims from a large midwestern US property. As we already mentioned, the Pareto distribution
is used in [
70] for this application.
Let Let also be random variables that follow the -log-scale-location,
-log-scale-location, -log-scale-location, -log-scale-location models, respectively.
In each plot we will represent
when
x takes the values 0, 5, 10 and 15 for two Pareto distributions with estimated parameters
and
and different values of
The survival functions are very important because their ordering implies the ordering between the moments or the Gini coefficients (when the means are equal). The estimated values of the parameters are given in Table 1.
For each of the following graphs, the black line corresponds to the case the red line corresponds to the case the green line corresponds to the case and the yellow one corresponds to the case
Figure 1 displays the plot of
y for
.
Figure 2 displays the plot of
y for
.
Figure 3 displays the plot of
y for
Figure 4 displays the plot of
y for
Figure 5 displays the plot of
y for
Figure 6 displays the plot of
y for
From these six graphs we observe that, for
the functions are convex and, for
the functions are concave. We observe also that, for
the graphs of these four functions are closer than in the other cases. Another conclusion is that the function
is increasing on
This implies that
Moreover, from Theorem 7 we have
10. Conclusions
In this article we proposed a new generalized log-scale-location family of distributions based on Tsallis statistics, and we gave results on different stochastic orders for this family. For this family of lifetime distributions, we studied different stochastic orders, the moments and the Gini indexes as functions of the parameters. On the one hand, the interest of this research comes from the fact that we have developed new classes of lifetime distributions that extend existing classes from the literature. We thus obtain a modeling tool that is more flexible, from a certain point of view, than those existing in the literature. On the other hand, for the models defined in this work, we show that some types of stochastic orders are preserved under certain conditions. Given the various potential fields of application for this family of lifetime distributions (e.g., risk theory, reliability, survival analysis, epidemiology, insurance, demography), these stochastic orderings are extremely important. Last but not least, our research is a contribution to the growing literature on applications of Tsallis statistics.
As for future work, it would be interesting to study this topic for other values of the Tsallis parameter and also to carry out extended simulations and detailed real data applications of the type of models and techniques developed in the present article.