Next Article in Journal
A Many-Objective Evolutionary Algorithm Based on Indicator and Decomposition
Previous Article in Journal
Does ESG Predict Systemic Banking Crises? A Computational Economics Model of Early Warning Systems with Interpretable Multi-Variable LSTM based on Mixture Attention
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multivariate Extension of Raftery Copula

1
Departement of Mathematics, Universiti Teknologi Malaysia, Johor Bahru 81310, Malaysia
2
Département de Mathématiques et D’informatique, Université du Québec à Trois-Rivières, Trois-Rivières, QC G8Z 4M3, Canada
3
Department of Statistics, United Arab Emirates University, Al Ain 15551, United Arab Emirates
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(2), 414; https://doi.org/10.3390/math11020414
Submission received: 14 December 2022 / Revised: 7 January 2023 / Accepted: 9 January 2023 / Published: 12 January 2023
(This article belongs to the Section Probability and Statistics)

Abstract

:
This paper introduces a multivariate extension of Raftery copula. The proposed copula is exchangeable and expressed in terms of order statistics. Several properties of this copula are established. In particular, the multivariate Kendall’s tau and Spearman’s rho, as well as the density function, of the suggested copula are derived. The lower and upper tail dependence of the proposed copula are also established. The dependence parameter estimator of this new copula is examined based on the maximum likelihood procedure. A simulation study shows a satisfactory performance of the presented estimator. Finally, the proposed copula is successfully applied to a real data set on black cherry trees.

1. Introduction

The bivariate exponential distribution has been receiving more attention in research studies for many decades [1]. It has been applied in a variety of statistical practices, such as reliability theory, queuing theory and physics.
A significant class of the bivariate family of distributions is the Marshall–Olkin bivariate exponential distribution (MOBE) introduced in [1]. The MOBE model provides an interesting physical interpretation in terms of fatal shocks as it contains both continuous and singular components. Furthermore, the MOBE provides an interesting copula, which has been embraced in several practical applications, such as finance [2]. The bivariate exponential literature demonstrated the existence of countless models and classes that have been further developed in recent decades (see references from [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18]) especially.
In [19], Raftery has introduced another class of bivariate exponential distribution that shows an interesting interpretation in physics. Unlike the Marshall–Olkin model, Fréchet’s upper bound belongs to the class of Raftery’s bivariate exponential distributions, which allows the correlation to be modeled broadly. Moreover, this distribution is continuous without any singular part, which makes the dependence parameter estimation more tractable [20].
As outlined in [19], there are several versions of Raftery bivariate and multivariate exponential distributions. An important version of these kinds of distributions is defined as follows. Consider Z 1 , Z 2 , and Z 3 identical and independent exponential random variables with parameter λ . Assume J is a Bernoulli random variable with parameter θ ( 0 , 1 ) assumed independent of Z 1 , Z 2 and Z 3 . The random pair ( X , Y ) defined by
X = ( 1 θ ) Z 1 + J Z 3 and Y = ( 1 θ ) Z 2 + J Z 3 ,
has a Raftery bivariate exponential distribution with parameters λ > 0 and θ ( 0 , 1 ) . As shown in [19], the marginal random variables X and Y are exponentially distributed with parameter λ . This is a simple and efficient model in the context of its ability to model the full range of positive correlations using only one dependence parameter, namely θ ( 0 , 1 ) . In fact, it is easily seen that the model includes the bivariate Fréchet distribution when θ tends to 1 and reaches the independence case when θ tends to 0. Furthermore, it could be seen that the random vector ( X , Y ) is exchangeable, so the model can only be used to describe an exponential random pair with the same marginal distributions.
To avoid this limitation and considering [7], one can adopt the comonotonic shocks method introduced in [9] in order to adapt Model (1) for no exchangeable random pair ( X , Y ) . The idea behind this method is to replace the common shock Z 3 with the pair of shocks ( Z 3 , Z 4 ) , such that Z 3 and Z 4 are perfectly positively dependent and exponentially distributed with parameters λ 1 and λ 2 , respectively. In other words, the distribution of the random vector ( Z 3 , Z 4 ) is the upper Fréchet bound expressed by M ( x , y ) = min { 1 exp ( λ 1 x ) , 1 exp ( λ 2 x ) } . This means that there exists an exponential random variable Z with parameter 1 such that Z 3 = λ 1 Z and Z 4 = λ 2 Z . To define the proposed alternative model, let Z 1 , Z 2 and Z be exponential random variables with parameters λ 1 , λ 2 and 1, respectively. Let J be Bernoulli random variable with parameter θ ( 0 , 1 ) and presume that the random variables Z 1 , Z 2 , Z and J are independent. Hence, the suggested bivariate exponential random pair ( X 1 , Y 2 ) is defined by
X 1 = ( 1 θ ) Z 1 + λ 1 J Z and X 2 = ( 1 θ ) Z 2 + λ 2 J Z .
It can be shown that X 1 and X 2 are exponentially distributed with different parameters λ 1 and λ 2 , correspondingly. Finally, in contrast to Model (1), Representation (2) provides a bivariate exponential random vector that is not necessarily exchangeable. Similarly, the family of random pairs generated by Representation (2) contains the Fréchet upper bound, so it models the full range of positive correlation, namely [ 0 , 1 ] . Moreover, Model (1) is a special case of Model (2) obtained when λ 1 = λ 2 = λ . Furthermore, the joint survival function of ( X , Y ) is given, for all ( x , y ) R + × R + , by
H ¯ ( x , y ) =   exp [ max ( λ 1 x , λ 2 y ) ] +   1 θ 1 + θ exp λ 1 x + λ 2 y 1 θ 1 exp 1 + θ 1 θ max ( λ 1 x , λ 2 y ) .
Note that both of the random vectors generated by Models (1) and (2) have the same survival copula expressed, for all ( u , v ) [ 0 , 1 ] 2 , by
C ^ ( u , v ) = min ( u , v ) + 1 θ 1 + θ u 1 1 θ v 1 1 θ 1 max ( u , v ) 1 + θ 1 θ .
The latter is called the Raftery copula and its Spearman’s rho and Kendall’s tau of this copula are given in terms of θ by
ρ = θ ( 4 3 θ ) ( 2 θ ) 2 ,
and
τ = 2 θ 3 θ ,
respectively. Hence, the goal of this paper is to extend the Raftery copula to the multivariate setting and study its properties.
The paper is structured as follows. Section 2 establishes a multivariate extension of the Raftery copula extracted from a multivariate version of the model described in Model (2). Section 3 derives the Kendall’s tau, the Spearman’s rho and the density function corresponding to the proposed copula. Section 4 establishes the lower and upper tail dependence of the proposed survival copula. Section 5 is devoted to the estimation of the dependence parameter of this copula and presents a simulation study showing its performance. The proposed approach has been applied successfully to fit a multivariate distribution to a real data set about black cherry trees.

2. Multivariate Raftery Copula with One Parameter of Dependence

The aim of this section is to propose a multivariate extension of the Raftery copula presented in Equation (4). To do so, we start by briefly discussing the multivariate Raftery exponential distribution. In fact, Raftery has presented a multivariate exponential model in [19] that extends the bivariate distribution given in Model (1). As outlined in this paper, the resulting model is exchangeable and the number of its parameters decreases exponentially in terms of the dimension of the random vector. Here, we propose a nonexchangeable multivariate extension of the bivariate model (Model (1)) with fewer parameters. This can be carried out by using the concept of the comonotonic shocks method introduced in [9]. Specifically, let Z 1 , , Z d , Z be independent exponential random variables with parameters λ 1 , λ d , 1 , respectively. Let J be a Bernoulli random variable with parameter θ ( 0 , 1 ) . Assume further that J is independent of Z 1 , , Z d , Z . A multivariate exponential random vector ( X 1 , , X d ) of the Raftery type is constructed as follows:
X j = ( 1 θ ) Z j + J λ j 1 Z , j = 1 , , d .
The above construction provides a class of multivariate distributions with given marginals that are exponentially distributed with parameters λ 1 , , λ d . Note that the value of θ [ 0 , 1 ] can be viewed as a dependence parameter of this set of distributions. In addition, this family of distributions describes only the positive dependence and contains the Fréchet upper bound, obtained when θ tends to 1 . An alternative formulation of the above model is
X j = λ j 1 ( 1 θ ) ln ( U j ) + J ln ( U ) , j = 1 , , d .
where U 1 , , U d , U are independent random variables uniformly distributed over [0,1]. These random variables are independent of J. Since the marginal random variable X j is exponentially distributed with parameters λ j , j = 1 , , d . It is easy to check that the survival copula C ^ θ associated with ( X 1 , , X d ) is the distribution of the uniform random vector ( V 1 , , V d ) defined by
V j = exp ( λ j X j ) = U j 1 θ U J , j = 1 , , d .
Hereafter, we explicit the form of the survival copula C ^ θ .
Proposition 1.
The survival copula of the random vector ( X 1 , , X d ) is given, for all u = ( u 1 , , u d ) [ 0 , 1 ] d , by
C ^ θ ( u ) = u ( 1 ) + ( 1 θ ) ( 1 d ) 1 θ d j = 1 d u j 1 1 θ i = 2 d θ ( 1 θ ) ( 1 θ i ) ( 2 θ i ) j = 1 i 1 u ( j ) 1 1 θ u ( i ) 2 θ i 1 θ
where u ( 1 ) , , u ( d ) denote the order statistics of u 1 , , u d .
Proof. 
For all u = ( u 1 , , u d ) [ 0 , 1 ] d , one has
C ^ θ ( u ) = P V 1 u 1 , , V d u d = ( 1 θ ) P U 1 1 θ u 1 , , U d 1 θ u d + θ P U 1 1 θ U u 1 , , U d 1 θ U u d = ( 1 θ ) j = 1 d u i 1 1 θ + θ P U 1 1 θ U u 1 , , U d 1 θ U u d .
Let F θ be the distribution function of U 1 1 θ . Conditioning on the random variable U, one obtains
P U 1 1 θ U u 1 , , U d 1 θ U u d = 0 1 P U 1 1 θ u 1 t , , U d 1 θ u d t d t = 0 1 F θ u ( 1 ) t × × F θ u ( d ) t d t = i = 0 d u ( i ) u ( i + 1 ) F θ u ( 1 ) t × × F θ u ( d ) t d t ,
where u ( 0 ) = 0 and u ( d + 1 ) = 1 . Note that, for all t [ u ( i ) , u ( i + 1 ) ] , i = 1 , , d , one has F θ u ( j ) t = u ( j ) t 1 1 θ , if 1 j i and F θ u ( j ) t = 1 , if i < j d . Hence, Equation (11) becomes,
u ( 1 ) + i = 1 d j = 1 i u ( j ) 1 1 θ u ( i ) u ( i + 1 ) t i 1 θ d t = u ( 1 ) + i = 1 d 1 θ 1 θ i j = 1 i u ( j ) 1 1 θ u ( i + 1 ) 1 θ i 1 θ u ( i ) 1 θ i 1 θ .
By inserting (12) into (10), one obtains
C ^ θ ( u ) = ( 1 θ ) j = 1 d u j 1 1 θ + θ u ( 1 ) + i = 1 d θ ( 1 θ ) 1 θ i j = 1 i u ( j ) 1 1 θ u ( i + 1 ) 1 θ i 1 θ u ( i ) 1 θ i 1 θ = u ( 1 ) + ( 1 θ ) ( 1 d ) 1 θ d j = 1 d u j 1 1 θ i = 2 d θ ( 1 θ ) ( 1 θ i ) ( 2 θ i ) j = 1 i 1 u ( j ) 1 1 θ u ( i ) 2 θ i 1 θ ,
which ends the proof. □
Observe from Equation (9) that the survival copula C ^ θ coincides with the independence copula when θ = 0 , and it reaches the Fréchet upper bound when θ tends to 1 . Furthermore, one checks that the Raftery bivariate survival copula given in Equation (4) is a special case of Equation (9), obtained when d = 2 . In fact, for d = 2 , one has
C ^ θ ( u ) = u ( 1 ) + 1 θ 1 + θ j = 1 2 u ( j ) 1 1 θ 1 θ 1 + θ u ( 1 ) 1 1 θ u ( 2 ) θ 1 θ = u ( 1 ) + 1 θ 1 + θ u 1 1 1 θ u 2 1 1 θ 1 u ( 2 ) 1 + θ 1 θ .
For illustration, let us express Formula (9) for d = 3 in terms of the order statistics u ( 1 ) , u ( 2 ) and u ( 3 ) . Indeed, for d = 3 , standard calculations show from Equation (9) that
C ^ θ ( u ) = u ( 1 ) + 1 θ ( 1 + θ ) ( 2 + θ ) u 1 1 1 θ u 2 1 1 θ u 3 1 1 θ 2 ( 1 + θ ) θ u ( 3 ) 2 + θ 1 θ ( 2 + θ ) u ( 2 ) 1 + θ 1 θ u ( 3 ) 1 1 θ .
Note that it is easy to simulate from the survival copula C ^ θ . This follows from the fact that the proposed copula is deduced from the stochastic representation described in Equation (8). Hence, the following algorithm allows simulating data from the survival copula C ^ θ .
Algorithm of simulation:
1.
Generate independent values u , u 1 , , u d from uniform [ 0 , 1 ] ;
2.
Generate j from Bernoulli distribution with parameter θ ;
3.
Set v 1 = u 1 1 θ u j , , v d = u d 1 θ u j ;
4.
The desired vector is ( v 1 , , v d ) .
As a corollary of Proposition 1, one obtains the survival function of the multivariate exponential random vector given by the stochastic representation shown in Equation (7).
Corollary 1.
The survival function of the random vector ( X 1 , , X d ) is given, for all x = ( x 1 , , x d ) [ 0 , ) d , by
H ¯ θ ( x ) = exp ( x ˜ ( d ) ) + ( 1 θ ) ( 1 d ) 1 θ d exp 1 1 θ i = 1 d x ˜ i i = 2 d θ ( 1 θ ) ( 1 θ i ) ( 2 θ i ) exp 1 1 θ j = 1 i 1 x ˜ ( d j + 1 ) 2 θ i 1 θ x ˜ ( d i + 1 ) ,
where x ˜ ( 1 ) , , x ˜ ( d ) denote the order statistics of x ˜ 1 = λ 1 x 1 , , x ˜ d = λ d x d .
Proof. 
From Sklar’s theorem, one observes that
H ¯ θ ( x ) = C ^ θ exp ( λ 1 x 1 ) , , exp ( λ d x d ) .
Let u i = exp ( λ i x i ) = exp ( x ˜ i ) , i = 1 , , d . This implies that u ( i ) = exp ( x ˜ ( d i + 1 ) ) . It results from Equation (13) that the multivariate survival function H ¯ θ provided in the above corollary is directly deduced by substituting u ( i ) into exp ( x ˜ ( d i + 1 ) ) in Equation (9). □
Simple calculations allow checking that, for d = 2 , the multivariate survival function H ¯ θ given in Corollary 1 reduces to the bivariate survival function given in Equation (3). Hereafter, we derive the Pearson correlation coefficients of random pairs selected from a random vector following the survival function H ¯ θ .
Proposition 2.
Let ( X 1 , , X d ) be an exponential random vector with survival function given in Corollary 1. Then for any, 1 i j d ,
C o r r θ ( X i , X j ) = θ ( 2 θ ) , θ [ 0 , 1 ] .
Proof. 
The components of ( X 1 , , X d ) are defined through Equation (7). Thus, one has for any, 1 i j d ,
X i = ( 1 θ ) Z i + J λ i 1 Z and X j = ( 1 θ ) Z j + J λ j 1 Z .
Since X i and X j are exponentially distributed with parameters λ i and λ j , var ( X i ) = 1 / λ i 2 and var ( X j ) = 1 / λ j 2 . Using the fact that, respectively, Z i , Z j , Z and J are independent,
C o r r θ ( X i , X j ) = C o v ( 1 θ ) Z i + J λ i 1 Z , ( 1 θ ) Z j + J λ j 1 Z var ( X i ) var ( X i ) = var ( J Z ) = E ( J 2 Z 2 ) ( E ( J Z ) ) 2 = θ E ( Z 2 ) θ 2 ( E ( Z ) ) 2 = θ ( 2 θ ) ,
which ends the proof. □
Note that the function θ C o r r θ ( X i , X j ) = θ ( 2 θ ) increases from [ 0 , 1 ] to [ 0 , 1 ] . This implies that the range of C o r r θ ( X i , X j ) is exactly [ 0 , 1 ] . This is not surprising because the family of survival functions derived in Corollary 1 reaches the Fréchet upper bound when θ tends to 1 .

3. Properties of C ^ θ

This section provides some properties of the proposed survival copula C ^ θ .

3.1. Density Function of the Survival Copula C ^ θ

The next result states the expression of the density c θ of the survival copula C ^ θ . This formula will be used to estimate the dependence parameter θ through the maximum likelihood method.
Proposition 3.
The density function of the survival copula C ^ θ is given by
c θ ( u ) = 1 ( 1 θ ) d 1 ( 1 θ d ) 1 d θ u ( d ) 1 θ d 1 θ j = 1 d u j θ 1 θ .
Proof. 
The density function of c θ is the derivative of C ^ θ with respect to each of its arguments. Therefore, one has
c θ ( u ) = d C ^ ( u ) u 1 u d = d C ^ ( u ) u ( 1 ) u ( d ) .
To more easily handle the above derivatives, let us decompose C ^ θ as follows:
C ^ θ ( u ) = f ( u ( 1 ) , , u ( d ) ) + g ( u ( 1 ) , , u ( d 1 ) ) ,
where
g ( u ( 1 ) , , u ( d 1 ) ) = u ( 1 ) i = 2 d 1 θ ( 1 θ ) ( 1 θ i ) ( 2 θ i ) j = 1 i 1 u ( j ) 1 1 θ u ( i ) 2 θ i 1 θ ,
and
f ( u ( 1 ) , , u ( d ) ) = ( 1 θ ) ( 1 d ) 1 θ d j = 1 d u ( j ) 1 1 θ θ ( 1 θ ) ( 1 θ d ) ( 2 θ d ) j = 1 d 1 u ( j ) 1 1 θ u ( d ) 2 θ d 1 θ .
Notice that the partial derivative of g ( u ( 1 ) , , u ( d 1 ) ) with respect to u ( d ) vanishes. Hence, standard calculations lead to
c θ ( u ) = d f ( u ( 1 ) , , u ( d ) ) u ( 1 ) u ( d ) = 1 ( 1 θ ) d 1 ( 1 θ d ) 1 d θ u ( d ) 1 θ d 1 θ j = 1 d u j θ 1 θ ,
which ends the proof. □

3.2. Spearman’s Rho

Spearman’s rho is an important measure of dependence. It measures the strength of association among the components of a random pair ( X , Y ) . It is well-known that this dependence measure is independent of the marginal distributions of X and Y, and it can be written with regards to the copula of ( X , Y ) . There exists several ways to extend the Spearman’s rho to a multivariate case. For instance, Schmid and Schmidt have studied in [21] three multivariate extensions of the population version of Spearman’s rho. This section provides an expression of Spearman’s rho related to the proposed survival copula C ^ θ . To this end, we study the version of the multivariate Spearman’s rho defined by
ρ d = d + 1 2 d d 1 2 d [ 0 , 1 ] 2 i = 1 d u i d C ^ θ ( u ) 1 = d + 1 2 d d 1 2 d E i = 1 d V i 1 ,
where ( V 1 , , V d ) is a uniform random vector distributed as the survival copula C θ ^ .
Proposition 4.
The expression of the above Spearman’s rho for the survival copula C θ ^ is given by
ρ d = ( d + 1 ) ( 2 d ( 2 θ ) d ) 2 d θ d ( 2 θ ) d ( 2 d d 1 ) .
Proof. 
Since the random vector ( U 1 1 θ U J , , U d 1 θ U J ) follows the survival copula C ^ θ , then from Equation (14),
ρ d = d + 1 2 d d 1 2 d E i = 1 d U i 1 θ U J 1 = d + 1 2 d d 1 2 d E U d J i = 1 d E U i 1 θ 1 .
Moreover,
E U d J = ( 1 θ ) d + 1 + θ and E U i 1 θ = 1 2 θ , i = 1 , , d .
Hence, Equation (15) is immediately obtained from Equations (16) and (17). □
It can be easily seen that for d = 2 , the general formula of Spearman’s rho given in Equation (15) reduces to Equation (5). In fact, for d = 2 , one has
ρ 2 = ( 2 + 1 ) ( 2 2 ( 2 θ ) 2 ) 2 2 θ × 2 ( 2 θ ) 2 ( 2 2 2 1 ) = θ ( 4 3 θ ) ( 2 θ ) 2 .
Similarly, for d = 3 , and after some elementary calculations, one obtains
ρ 3 = ( 3 + 1 ) ( 2 3 ( 2 θ ) 3 ) 2 3 θ × 3 ( 2 θ ) 3 ( 2 3 3 1 ) = θ [ ( 3 θ ) 2 3 ] ( 2 θ ) 3 .

3.3. Kendall’s Tau of C ^ θ

The multivariate Kendall’s tau is introduced in [14,17]. For the proposed survival copula, this measure is defined by
τ d = 2 d E C ^ θ ( V 1 , , V d ) 1 2 d 1 1 ,
where the uniform random vector ( V 1 , , V d ) is distributed as the survival copula C ^ θ .
Proposition 5.
The Kendall’s tau of the survival copula C ^ θ is given by
τ d = 2 d 1 d ! ( 2 d 1 1 ) i = 2 d ( i + 1 θ ) + ( 1 θ ) 2 ( d 2 1 ) ( d 1 + θ ) ( d + 1 θ ) ( 2 d 1 1 ) 2 d d ! 2 d 1 1 k = 2 d θ ( 1 θ ) ( 2 θ ) 2 k ( k 1 ) ! ( 1 θ k ) ( 2 θ k ) i = k d ( i + 1 θ ) 1 2 d 1 1 .
Proof. 
Since the uniform random vector ( U 1 1 θ U J , , U d 1 θ U J ) is distributed as the survival copula C ^ θ , then
τ d = 2 d E C ^ θ ( U 1 1 θ U J , , U d 1 θ U J ) 1 2 d 1 1 .
The fact that the survival copula C ^ θ is exchangeable implies that
E C ^ θ ( U 1 1 θ U J , , U d 1 θ U J ) = d ! E C ^ θ ( U 1 1 θ U J , , U d 1 θ U J ) I [ U 1 < U 2 < < U d ] ,
where I [ . ] stands for the indicator function. From Equation (9), one observes that the expectation, E C ^ θ ( U 1 1 θ U J , , U d 1 θ U J ) I [ U 1 < U 2 < < U d ] , is reduced to the next expression:
I 1 + ( 1 θ ) ( 1 d ) 1 θ d I 2 k = 2 d θ ( 1 θ ) ( 1 θ k ) ( 2 θ k ) I k ,
where
I 1 = E U 1 1 θ U J I [ U 1 < U 2 < < U d ] ,
I 2 = E U d J 1 θ j = 1 d U i I [ U 1 < U 2 < < U d ] ,
and for k = 2 , , d ,
I k = E U J U k 2 θ k j = 1 k 1 U j I [ U 1 < U 2 < < U d ] .
By virtue of Equations (19)–(21), one observes that
τ d = 2 d d ! 2 d 1 1 I 1 + ( 1 θ ) ( 1 d ) d ! 2 d ( 2 d 1 1 ) ( 1 θ d ) I 2 k = 2 d θ ( 1 θ ) d ! 2 d ( 2 d 1 1 ) ( 1 θ k ) ( 2 θ k ) I k 1 2 d 1 1 .
The quantities I k , k = 1 , 2 , , d , involved in the above expression of τ d can be calculated as follows:
I 1 = E U J E U 1 1 θ I [ U 1 < U 2 < < U d ] = 2 θ 2 0 1 0 u d 0 u 3 0 u 2 u 1 1 θ d u 1 d u 2 d u d 1 d u d = 2 θ 2 × 1 ( 2 θ ) ( 3 θ ) ( d + 1 θ ) = 1 2 i = 2 d ( i + 1 θ ) ,
I 2 = E U d J 1 θ E j = 1 d U i I [ U 1 < U 2 < < U d ] = ( 1 θ ) ( d + 1 ) d + 1 θ 0 1 0 u d 0 u 3 0 u 2 u 1 u 2 u d 1 u d d u 1 d u 2 d u d 1 d u d = ( 1 θ ) ( d + 1 ) d + 1 θ × 1 2 d d ! = ( 1 θ ) ( d + 1 ) 2 d ( d + 1 θ ) d ! ,
and for k = 2 , , d ,
I k = E U J E U k 2 θ k j = 1 k 1 U j I [ U 1 < U 2 < < U d ] = 2 θ 2 0 1 0 u d 0 u 3 0 u 2 u k 2 θ k u k 1 u 1 d u 1 d u 2 d u d 1 d u d = 2 θ 2 0 1 0 u d 0 u k + 1 u k 2 θ k 0 u k 0 u 2 u 1 u k 1 d u 1 d u k 1 d u k = 2 θ 2 0 1 0 u d 0 u k + 1 u k 2 θ k u k 2 ( k 1 ) 2 k 1 ( k 1 ) ! d u k d u d 1 d u d = 2 θ 2 k ( k 1 ) ! 0 1 0 u d 0 u k + 1 u k k θ d u k d u d 1 d u d = 2 θ 2 k ( k 1 ) ! × 1 ( k + 1 θ ) ( k + 2 θ ) ( d + 1 θ ) = 2 θ 2 k ( k 1 ) ! × 1 i = k d ( i + 1 θ ) .
Hence, Expression (18) is deduced by inserting Equations (23)–(25) into Equation (22), which ends the proof. □
Hereafter, one shows that for d = 2 , the general formula of Kendall’s tau described in Equation (18) reduces to Equation (6). In fact, for d = 2 , one see from Equation (18) that
τ 2 = 4 ( 3 θ ) + 3 ( 1 θ ) 2 ( 1 + θ ) ( 3 θ ) 2 ( 1 θ ) ( 2 θ ) ( 1 + θ ) ( 3 θ ) 1 = 2 θ ( 3 θ ) .
Remark that both ρ d and τ d derived in Propositions 4 and 5, respectively, are nonincreasing functions in terms of d. This behavior is illustrated in Figure 1 and Figure 2. In fact, Figure 1 presents the curves of ρ 2 , ρ 3 and ρ 4 in terms of θ [ 0 , 1 ] . Since the calculations show that τ 2 = τ 3 , Figure 2 exposes the curves of τ 2 , τ 4 and τ 5 in terms of θ .

4. Lower and Upper Tail Dependence

There are many ways to define the lower and upper tail dependence in the multivariate setting. In this section, we adopt the definition of these parameters provided in [13]. Specifically, let ( U 1 , , U d ) be a uniform random vector with copula C. According to [13], the multivariate lower and upper tail dependence associated with C are defined by
λ L = lim u 0 + P U 1 u , , U d u | U d u = lim u 0 + C ( u , , u ) u ,
and
λ U = lim u 0 + P U 1 1 u , , U d 1 u | U d 1 u = lim u 0 + C ¯ ( 1 u , , 1 u ) u ,
where C ¯ denotes the survival function of ( U 1 , , U d ) . Hereafter, we derive the expressions of λ U and λ L of the proposed survival copula.
Proposition 6.
The lower and upper tail dependence of the survival copula C ^ θ are expressed by
λ L = 1 d θ + ( d 1 ) ( 1 θ ) 1 d θ and λ U = 0 .
Proof. 
From Equation (9), one has
C ^ θ ( u , , u ) u = 1 + ( 1 θ ) ( 1 d ) 1 θ d u d 1 + θ 1 θ θ ( 1 θ ) i = 2 d 1 ( 1 θ i ) ( 2 θ i ) .
Clearly, u d 1 + θ 1 θ tends to 0 as u tends to 0 + because d 1 + θ 1 θ > 0 . Therefore, the above sum can be simplified as follows:
θ ( 1 θ ) i = 2 d 1 ( 1 θ i ) ( 2 θ i ) = θ ( 1 θ ) i = 2 d 1 1 θ i 1 2 θ i = θ ( 1 θ ) 1 1 θ d + 1 θ = ( 1 d ) ( 1 θ ) 1 θ d .
From Equations (26) and (27), one deduces that
λ L = lim u 0 + C ^ θ ( u , , u ) u = 1 ( 1 d ) ( 1 θ ) 1 θ d = 1 d θ + ( d 1 ) ( 1 θ ) 1 d θ .
To establish λ U , first note that
0 C ^ ¯ θ ( 1 u , , 1 u ) u C ^ ¯ θ ( 1 u , 1 u ) u
It is well-known that for the Bivariate Raftery copula, λ U = 0 (see Example 5.21 in [16]). This means that
lim u 0 + C ^ ¯ θ ( 1 u , 1 u ) u = 0 .
Combining Equations (28) and (29), one obtains
λ U = lim u 0 + C ^ ¯ θ ( 1 u , , 1 u ) u = 0 .
Notice that for d = 2 , the lower tail dependence provided in Proposition 6 reduces to 2 θ / ( 1 + θ ) , which is the expression of λ L related to the bivariate Raftery copula (see Example 5.21 in [16]). It is similar for the upper tail dependence.

5. Parameter Estimation and Simulation Study

5.1. Dependence Parameter Estimation

In this section, we discuss the estimation of the dependence parameter θ using the maximum likelihood procedure; moreover, we will examine the finite-sample accuracy of the estimates for several sample sizes. To this end, let u i = ( u i 1 , , u i d ) , i = 1 , , n be a sample that has been established earlier from the survival copula C ^ θ . The log-likelihood function is given by
L ( θ , u 1 , , u n ) = i = 1 n ln { c θ ( u i ) } = n ( d 1 ) ln ( 1 θ ) n ln ( 1 θ d ) + i = 1 n ln 1 d θ u ( i d ) 1 θ d 1 θ + θ 1 θ ln j = 1 d u i j ,
where u ( i d ) = max { u i 1 , , u i d } , i = 1 , , n . The maximum likelihood estimator θ ^ of θ is achieved by maximizing the above log-likelihood function in terms of θ . More specifically,
θ ^ = argmax θ [ 0 , 1 ] L ( θ , u 1 , , u n ) .
This maximization cannot be explicitly solved. Because, there is no closed-form solution of the next equation,
L ( θ , u 1 , , u n ) θ = 0 .
To solve the problem shown in Equation (30), one adopts numerical maximization, which provides efficient results, as shown by the following simulation study established for d = 3 .

5.2. Simulation Study

The tables below present the outcomes of the estimator θ ^ , the bias and the mean squared error (MSE) of θ . The following scenarios present, when investigated, simulations that demonstrate that θ ^ provides a good estimator for the dependence parameter θ . Being a comprehensible result, the effectiveness of our estimator θ ^ increases as n increases, and the bias and MSE of θ ^ decrease. Hence, the estimator becomes narrower as the sample size grows. In addition to the fact that we observed many more simulations, which are not presented here, where the estimator θ ^ performed very well when n was getting larger.
This can be brought forward by looking at the behavior of the estimator θ ^ in three different scenarios: weak dependence ( ρ 3 = 0.25 , i.e., θ = 0.2880 ) in Table 1; moderate dependence ( ρ 3 = 0.5 , i.e., θ = 0.5155 ) in Table 2; and strong dependence ( ρ 3 = 0.75 , i.e., θ = 0.7162 ) in Table 3.

5.3. Real Data

An actual data set is utilized to assess the effectiveness of this extension. The data set trees may be found in the datasets R package, as well as in [12]. The information was gathered from a sample size of 31 black cherry trees from the forest in order to calculate the volume of a tree based on its height and diameter. The data set includes 31 observations of the variables. The variables are given as Diam, which represents the diameter in inches, Height, which represents the height in feet, and Volume, which represents the volume in cubic feet. Our goal is to fit a multivariate distribution describing the data by using the proposed copula. This can be conducted in two steps. In the first step, we select the marginal distributions. Then, we look for the copula in the second step. These two steps are achieved through goodness-of-fit procedures.
First step. Using the bootstrap technique based on the Kolmogorov–Smirnov (KS) Test, Table 4 demonstrates that the variables Diam and Volume are distributed as a gamma distribution and the Height follows the Weibull distribution. The maximum likelihood estimates (MLEs) of the model parameters are given in Table 5, Table 6 and Table 7.
Second step. Now, we evaluate the goodness-of-fit test (GOF) of the proposed copula C ^ θ based on Cramér-von Mises statistics using the bootstrap algorithm proposed by [10]. The estimator of the dependence parameter of the proposed copula is obtained by θ ^ = 0.6102115 . The p-value of this GOF test is 0.813, which is much higher than 0.05. This confirms that our proposed copula C ^ θ describes the dependency among the components of the data set reasonably well. The next Table 8 summarizes this information.

Author Contributions

Conceptualization, T.S., M.M. and A.S.; Methodology, T.S., M.M. and A.S.; Software, T.S. and A.S.; Validation, T.S., M.M. and A.S.; Formal analysis, T.S.; Investigation, M.M.; Data curation, T.S., M.M. and A.S.; Writing—original draft, T.S.; Writing—review & editing, T.S., M.M. and A.S.; Visualization, A.S.; Supervision, M.M. and A.S.; Funding acquisition, M.M. All authors have read and agreed to the published version of the manuscript.

Funding

The work of the second author is funded by the Natural Sciences and Engineering Research Council of Canada No. 06536-2018.

Data Availability Statement

This The data set may be found in the R package called: datasets, or in [12].

Acknowledgments

Mhamed Mesfoui acknowledges the Financial support of the Natural Sciences and Engineering Research Council of Canada No. 06536-2018.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Marshall, A.; Olkin, I. A multivariate exponential distribution. J. Am. Stat. Assoc. 1967, 62, 30–44. [Google Scholar] [CrossRef]
  2. Bouye, E.; Durlleman, V.; Nikeghbali, A.; Riboulet, G.; Roncalli, T. Copulas for Finance; Financial Econometrics Research Centre: Paris, France, 2000. [Google Scholar]
  3. Arnold, B.C.; Strauss, D. Bivariate distributions with exponential conditionals. J. Am. Stat. Assoc. 1988, 83, 522–527. [Google Scholar] [CrossRef]
  4. Basu, A.P. Multivariate exponential distributions and their applications in reliability. Handb. Stat. 1988, 7, 467–477. [Google Scholar]
  5. Basu, A.P. Bivariate exponential distributions. In The Exponential Distribution: Theory, Methods, and Applications; Routledge: London, UK, 1995; pp. 327–331. [Google Scholar]
  6. Block, H.W.; Basu, A.P. A Continuous Bivariate Exponential Extension. J. Am. Stat. Assoc. 1974, 69, 1031–1037. [Google Scholar]
  7. Bukovšek, D.K.; Košir, T.; Mojškerc, B.; Omladič, M. Extreme generators of shock induced copulas. Appl. Math. Comput. 2022, 429, 127214. [Google Scholar] [CrossRef]
  8. David, H.A. Order Statistics, 2nd ed.; Wiley: New York, NY, USA, 1981. [Google Scholar]
  9. Genest, C.; Mesfioui, M.; Schulz, J. A new bivariate Poisson common shock model covering all possible degrees of dependence. Stat. Probabil. Lett. 2018, 140, 202–209. [Google Scholar] [CrossRef]
  10. Genest, C.; Rémillard, B.; Beaudoin, D. Goodness-of-fit tests for copulas: A review and a power study. Insur. Math. Econ. 2009, 44, 199–213. [Google Scholar] [CrossRef]
  11. Gumbel, E.J. Bivariate exponential distributions. J. Am. Stat. Assoc. 1960, 55, 698–707. [Google Scholar] [CrossRef]
  12. Hand, D.J. A Handbook of Small Data Sets; Chapman & Hall/CRC: Boca Raton, FL, USA, 1994. [Google Scholar]
  13. Joe, H.; Li, H.; Nikoloulopoulos, A.K. Tail dependence functions and vine copulas. J. Multivariate Anal. 2010, 101, 252–270. [Google Scholar] [CrossRef] [Green Version]
  14. Joe, H. Multivariate concordance. J. Multivariate Anal. 1990, 35, 12–30. [Google Scholar] [CrossRef] [Green Version]
  15. Kundu, D.; Gupta, R.D. Absolute continuous bivariate generalized exponential distribution. Adv. Stat. Anal. 2011, 95, 169–185. [Google Scholar] [CrossRef]
  16. Nelsen, R.B. An Introduction to Copulas; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
  17. Nelsen, R.B. Concordance and copulas: A survey. Distrib. Given Marginals Stat. Model. 2002, 37, 169–177. [Google Scholar]
  18. Paulson, A.S. A characterization of the exponential distribution and a bivariate exponential distribution. Sankhy Indian J. Stat. Ser. A 1973, 35, 69–78. [Google Scholar]
  19. Raftery, A.E. A continuous multivariate exponential distribution. Commun. Stat. Theory Methods 1984, 13, 947–965. [Google Scholar] [CrossRef]
  20. Regoli, G. A class of bivariate exponential distributions. J. Multivariate Anal. 2009, 100, 1261–1269. [Google Scholar] [CrossRef] [Green Version]
  21. Schmid, F.; Schmidt, R. Multivariate extensions of Spearman’s rho and related statistics. Stat. Probab. Lett. 2007, 77, 407–416. [Google Scholar] [CrossRef]
Figure 1. The curves of ρ 2 , ρ 3 and ρ 4 in terms of θ are indicated with colors red, blue and green, respectively.
Figure 1. The curves of ρ 2 , ρ 3 and ρ 4 in terms of θ are indicated with colors red, blue and green, respectively.
Mathematics 11 00414 g001
Figure 2. The curves of τ 2 , τ 4 and τ 5 in terms of θ are indicated with colors blue, green and purple, respectively.
Figure 2. The curves of τ 2 , τ 4 and τ 5 in terms of θ are indicated with colors blue, green and purple, respectively.
Mathematics 11 00414 g002
Table 1. Estimation for θ corresponding to weak dependence.
Table 1. Estimation for θ corresponding to weak dependence.
n θ ^ Bias( θ ^ )MSE( θ ^ )
θ = 0.2880 200.2927000000.318500000.144625
500.290500000.002500000.00884725
1000.27625000−0.011750000.00208025
2000.27600000−0.012000000.00113675
Table 2. Estimation for θ corresponding to moderate dependence.
Table 2. Estimation for θ corresponding to moderate dependence.
n θ ^ Bias( θ ^ )MSE( θ ^ )
θ = 0.5155 20 0.52640000 0.01090000 0.01062985
50 0.52080000 0.00530000 0.00396345
100 0.52000000 0.00450000 0.00294825
200 0.51680000 0.00130000 0.00274345
Table 3. Estimation for θ corresponding to strong dependence.
Table 3. Estimation for θ corresponding to strong dependence.
n θ ^ Bias( θ ^ )MSE( θ ^ )
θ = 0.7162 20 0.730600 0.0144000 0.003619
50 0.7216000 0.0054000 0.0022986
100 0.72000000 0.00380000 0.00166244
200 0.71675000 0.00055000 0.00041259
Table 4. Test statistics and p-value tests.
Table 4. Test statistics and p-value tests.
Test Statisticp-Value
Dim0.104580.85217
Height0.110790.80158
Volume0.121170.70799
Table 5. MLEs of the model parameters for Dimension.
Table 5. MLEs of the model parameters for Dimension.
Dim
Gammashape = 4.4408, scale = 1.5095, location = 6.5452
Table 6. MLEs of the model parameters for Height.
Table 6. MLEs of the model parameters for Height.
Height
Weibullshape = 6.2766, scale = 35.628, location = 42.914
Table 7. MLEs of the model parameters for Volume.
Table 7. MLEs of the model parameters for Volume.
Volume
Gammashape = 3.3689, scale = 8.9557, location = 0
Table 8. GOF test for C ^ θ .
Table 8. GOF test for C ^ θ .
Cramér-von Mises
Test statistic S n = 0.4188187
p-value0.818
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Saali, T.; Mesfioui, M.; Shabri, A. Multivariate Extension of Raftery Copula. Mathematics 2023, 11, 414. https://doi.org/10.3390/math11020414

AMA Style

Saali T, Mesfioui M, Shabri A. Multivariate Extension of Raftery Copula. Mathematics. 2023; 11(2):414. https://doi.org/10.3390/math11020414

Chicago/Turabian Style

Saali, Tariq, Mhamed Mesfioui, and Ani Shabri. 2023. "Multivariate Extension of Raftery Copula" Mathematics 11, no. 2: 414. https://doi.org/10.3390/math11020414

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop