Next Article in Journal
Variance Swap Replication: Discrete or Continuous?
Next Article in Special Issue
Hierarchical Transmuted Log-Logistic Model: A Subjective Bayesian Analysis
Previous Article in Journal
Effectiveness of Interest Rate Policy of the Fed in Management of Subprime Mortgage Crisis
Previous Article in Special Issue
Does the Assumption on Innovation Process Play an Important Role for Filtered Historical Simulation Model?
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Generalization of the Pareto Distribution and Its Application to Insurance Data

by
Mohamed E. Ghitany
1,*,
Emilio Gómez-Déniz
2 and
Saralees Nadarajah
3
1
Department of Statistics and Operations Research, Faculty of Science, Kuwait University, Safat 13060, Kuwait
2
Department of Quantitative Methods and TiDES Institute, University of Las Palmas de Gran Canaria, 35017 Gran Canaria, Spain
3
School of Mathematics, University of Manchester, Manchester M13 9PL, UK
*
Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2018, 11(1), 10; https://doi.org/10.3390/jrfm11010010
Submission received: 26 November 2017 / Revised: 28 January 2018 / Accepted: 2 February 2018 / Published: 7 February 2018
(This article belongs to the Special Issue Extreme Values and Financial Risk)

Abstract

:
The Pareto classical distribution is one of the most attractive in statistics and particularly in the scenario of actuarial statistics and finance. For example, it is widely used when calculating reinsurance premiums. In the last years, many alternative distributions have been proposed to obtain better adjustments especially when the tail of the empirical distribution of the data is very long. In this work, an alternative generalization of the Pareto distribution is proposed and its properties are studied. Finally, application of the proposed model to the earthquake insurance data set is presented.
MSC:
62E10; 62F10; 62F15; 62P05

Graphical Abstract

1. Introduction

In general insurance, only a few large claims arising in the portfolio represent the largest part of the payments made by the insurance company. Appropriate estimation of these extreme events is crucial for the practitioner to correctly assess insurance and reinsurance premiums. On this subject, the single parameter Pareto distribution (Arnold 1983; Brazauskas and Serfling 2003; Rytgaard 1990), among others has been traditionally considered as a suitable claim size distribution in relation to rating problems. Concerning this, the single parameter Pareto distribution, apart from its favourable properties, provides a good depiction of the random behaviour of large losses (e.g., the right tail of the distribution). Particularly, when calculating deductibles and excess–of–loss levels for reinsurance, the simple Pareto distribution has been demonstrated convenient, see for instance (Boyd 1988; Mata 2000; Klugman et al. 2008), among others.
In this work, an alternative to the Pareto distribution will be carried out. Properties and applications of this distribution will be studied here. As far as we know, these properties have not been studied for this distribution. In particular, we concentrate our attention to results connected with financial risk and insurance.
The paper is organized as follows. In Section 2, the new proposed distribution is shown, including some of its more relevant properties. Section 3 presents some interesting results connecting with financial risk and insurance. Next, Section 4 deals with parameter estimation, paying special attention to the maximum likelihood method. In Section 5, numerical application by using real insurance data is considered. Finally, some conclusions are given in the last section.

2. The Proposed Distribution

2.1. Probability Density Function

A continuous random variable X is said to have a generalized truncated log-gamma (GTLG) distribution if its probability density function (p.d.f.) is given by
f ( x ) = θ λ α Γ ( λ ) x α θ 1 log x α λ 1 , x α , α , θ , λ > 0 ,
where Γ ( z ) = 0 t z 1 exp ( t ) d t is the Euler gamma function. Note that, for all α , θ > 0 , we have
f ( α ) = , if 0 < λ < 1 , θ / α , if λ = 1 , 0 , if λ > 1 , f ( ) = 0 .
As it can be easily seen, the parameter α marks a lower bound on the possible values that (1) can take on. When α = 1 , the GTLG distribution reduced to the log-gamma distribution proposed by Consul and Jain (1971) with p.d.f.
f Z ( z ) = θ λ Γ ( λ ) z θ 1 ( log z ) λ 1 , z > 1 , θ , λ > 0 .
Note that Consul and Jain (1971) considered only the case λ 1 . For this case, they derived the raw moments and the distribution of the product of two independent log-gamma random variables. The p.d.f. (1) can now be obtained by the transformation X = α Z .
Expression (1) is a particular case of the generalized truncated log–gamma distribution proposed in Amini et al. (2014) and related with the family proposed by Zografos and Balakrishnan (2009). When λ = 1 , we obtain the famous Pareto distribution. In addition, when λ = 2 , we obtain a distribution reminiscent of the distribution proposed in Gómez-Déniz and Calderín (2014). Properties and applications of this distribution will be studied here. In particular, we concentrate attention to results connecting with financial risk and insurance.
Theorem 1.
For all α , θ > 0 ,   f ( x ) is decreasing (increasing-decreasing) if 0 < λ 1 ( λ > 1 ) .
Proof. 
The first derivative of f ( x ) given by
f ( x ) = ( θ + 1 ) + λ 1 log ( x / α ) f ( x ) x ,
which can be seen to be strictly negative if 0 < λ 1 and has a unique zero at x m = α exp ( λ 1 ) / ( θ + 1 ) , if λ > 1 .  ☐
Note that the mode of f ( x ) is given by α if 0 < λ 1 ( x m if λ > 1 ) .
Figure 1 shows the p.d.f. (1) for selected values of λ and θ when α = 1 .

2.2. Hazard Rate Function

The survival function (s.f.) of the GTLG distribution is given by
F ¯ ( x ) = P ( X > x ) = Γ λ , θ log x α Γ ( λ ) , x α .
where Γ ( a , z ) = z t a 1 exp ( t ) d t is the incomplete gamma function. When λ is a positive integer, we have
F ¯ ( x ) = ( x / α ) θ k = 0 λ 1 [ θ log ( x / α ) ] k k ! , x α .
The hazard rate function (h.r.f.) of the GTLG distribution is given by
h ( x ) = f ( x ) F ¯ ( x ) = θ λ α Γ ( λ , θ log x α ) x α θ 1 log x α λ 1 , x α , α , θ , λ > 0 .
Note that, h ( α ) = f ( α ) and h ( ) = 0 .
Theorem 2.
For all α , θ > 0 , h ( x ) is decreasing ( increasing-decreasing) if 0 < λ 1 ( λ > 1 ) .
Proof. 
Let
η ( x ) = f ( x ) f ( x ) = ( θ + 1 ) λ 1 log ( x / α ) 1 x .
It is straightforward to show that η ( x ) is decreasing if 0 < λ 1 and η ( x ) is increasing-decreasing if λ > 1 . Now by Glaser (1980), h ( x ) is decreasing if λ 1 and increasing-decreasing if λ > 1 , since f ( α ) = h ( α ) = 0 when λ > 1 .  ☐
Figure 2 shows the h.r.f. (3) for selected values of λ and θ when α = 1 .

2.3. Mean Residual Life Function

For the GTLG ( α , λ , θ ) , we have
x y f ( y ) d y = μ x f θ 1 ( y ) d y = μ F ¯ θ 1 ( x ) , x > α , θ > 1 .
where, for θ > 1 , μ = E ( X ) = α 1 1 θ λ is the mean of the GTLG distribution, and f θ 1 ( x ) ( F ¯ θ 1 ( x ) ) is the p.d.f. (1) (s.f. (2)) when θ is replaced by θ 1 .
The mean residual life function (m.r.l.f.) of the GTLG distribution is given by
e ( x ) = E ( X x | X > x ) = 1 F ¯ ( x ) x y f ( y ) d y x = μ F ¯ θ 1 ( x ) F ¯ θ ( x ) x , x > α , θ > 1 .
Theorem 3.
For all α > 0 , θ > 1 , the m.r.l.f. e ( x ) is increasing ( decreasing-increasing) if 0 < λ 1 ( λ > 1 ) .
Proof. 
Since h ( x ) is decreasing for 0 < λ 1 , it follows that, in this case, e ( x ) is increasing. In addition, since h ( x ) is increasing-decreasing for λ > 1 and f ( α ) e ( α ) = 0 , it follows that, in this case, e ( x ) is decreasing-increasing, by Gupta and Akman (1995). ☐
From the point of view of a risk manager, the expression e ( x ) + x = E ( X | X > x ) is the so-called Expected Shortfall, that is the conditional mean of X given X exceeds a given quantile value x. This is a risk measurement appropriate to evaluate the market risk or credit risk of a portfolio.
Figure 3 shows the m.r.l.f. (4) for selected values of λ and θ when α = 1 .
It is noted that, unlike the classical Pareto distribution, this expression is not a linear function of x.

2.4. Moments

The GTLG distribution with p.d.f. (1) can be obtained from a monotonic transformation of the gamma distribution, as it can be seen in the next result.
Theorem 4.
Let us assume that Y follows a G a m m a ( λ , θ ) distribution with p.d.f. f ( y ) y λ 1 exp ( θ y ) , where λ > 0 and θ > 0 . Then the random variable
X = α e Y
has p.d.f. (1).
Proof. 
The proof follows after a simple change of variable. ☐
Note that Z = e Y has a log-gamma distribution over ( 1 , ) . That is X = α Z as indicated before.
Now, by using representation (5) and the moments of the Gamma distribution, the expression for the r-th moment about zero of distribution (1) is easily obtained,
μ r = E ( X r ) = α r M Y ( r ) = α r 1 r θ λ , r = 1 , 2 , ,
provided θ > r and λ > 0 .
In particular, the mean is given by
μ = α 1 1 θ λ , θ > 1 ,
and the variance is given by
σ 2 = α 2 1 2 θ λ 1 1 θ 2 λ , θ > 2 ,
Furthermore, by using the representation given by (5) the following result is obtained
E log X α r = E ( Y r ) = Γ ( λ + r ) θ r Γ ( λ ) , r = 1 , 2 , .
Solving the equation
μ = α 1 1 θ λ ,
in θ , we obtain
θ = μ / α 1 / λ μ / α 1 / λ 1 , μ > α .
This implies that the covariates can be introduced into the model in a simple way.

2.5. Conjugate Distributions

The following results show that both the inverse Gaussian distribution and the gamma distribution are conjugate with respect to the distribution proposed in this work.
Theorem 5.
Let X i , i = 1 , 2 , , n independent and identically distributed random variables following the p.d.f. (1). Let us suppose that θ follows a prior inverse Gaussian distribution π ( θ ) with parameters τ and ϕ, i.e., π ( θ ) θ 3 / 2 exp 1 2 ϕ τ 2 θ + ϕ θ . Then the posterior distribution of θ given the sample information ( X 1 , , X n ) is a generalized inverse Gaussian distribution G I G ( λ , τ , ϕ ) , where
λ = n λ 1 2 , τ = τ 1 + 2 τ 2 ϕ i = 1 n log ( x i / α ) , ϕ = ϕ .
Proof. 
The result follows after some computations by applying Bayes’ Theorem and arranging parameters. ☐
Theorem 6.
Let X i , i = 1 , 2 , , n independent and identically distributed random variables following the p.d.f. (1). Let us suppose that θ follows a prior gamma distribution π ( θ ) with a shape parameter τ > 0 and a scale parameter σ > 0 , i.e., π ( θ ) θ τ 1 exp ( σ θ ) . Then the posterior distribution of θ given the sample information ( X 1 , , X n ) is again a gamma distribution with shape parameter τ + n λ and scale parameter σ + log ( x i / α ) .
Proof. 
Again, the result follows after some algebra by using Bayes’ Theorem and arranging parameters. ☐

2.6. Stochastic Ordering

Stochastic ordering of positive continuous random variables is an important tool for judging the comparative behavior. We will recall some basic definitions, see (Shaked and Shanthikumar 2007).
Let X and Y be random variables with p.d.f.s f ( x ) and g ( y ) (s.f.s F ¯ ( x ) and G ¯ ( y ) ) (h.r.f.s h ( x ) and r ( y ) ), respectively.
A random variable X is said to be smaller than a random variable Y in the
(i)
stochastic order (denoted by X S T Y ) if F ¯ ( x ) G ¯ ( x ) for all x ,
(ii)
hazard rate order (denoted by X H R Y ) if h ( x ) r ( x ) for all x ,
(iii)
likelihood ratio order (denoted by X L R Y ) if f ( x ) g ( x ) decreases for all x.
The following implications are well known:
X L R Y X H R Y X S T Y .
Members of the family of distributions with p.d.f. (1) are ordered with respect to the strongest “likelihood ratio” ordering, as shown in the following theorem.
Theorem 7.
Let X and Y be two continuous random variables distributed according to (1) with p.d.f.’s given by f ( x ) = f ( x ; α , θ 1 , λ ) and g ( x ) = f ( x ; α , θ 2 , λ ) , respectively. If θ 1 θ 2 > 0 , then X L R Y ( X H R Y ) ( X S T Y )
Proof. 
Firstly, let us observe that the ratio
f ( x ) g ( x ) = θ 1 θ 2 λ α θ 1 θ 2 x θ 2 θ 1
wih derivative
f ( x ) g ( x ) = θ 1 θ 2 λ α θ 1 θ 2 ( θ 2 θ 1 ) x θ 2 θ 1 1 0 ,
for all θ 1 θ 2 > 0 , proving the theorem. ☐
Properties for higher-order stochastic dominance in financial economics can be obtained following the line of the work of (Guo and Wong 2016). In this regard, let X and Y be random variables defined on [ a , b ] with p.d.f.’s f ( x ) , g ( y ) and s.f.’s F ¯ ( x ) , G ¯ ( y ) , respectively, satisfying
F j D ( x ) = x b F j 1 D ( y ) d y , G j D ( x ) = x b G j 1 D ( y ) d y , j 1 ,
where F 0 D ( x ) = f ( x ) ,   G 0 D ( x ) = g ( x )   F 1 D ( x ) = F ¯ ( x ) , and G 1 D ( x ) = G ¯ ( x ) .
A random variable X is said to be smaller than a random variable Y
(i)
in the first-order descending stochastic dominance (denoted by X 1 Y ) iff F 1 D ( x ) G 1 D ( x ) for each x [ a , b ] .
(ii)
in the second-order descending stochastic dominance (denoted by X 2 Y ) iff F 2 D ( x ) G 2 D ( x ) for each x [ a , b ] .
(iii)
in the N-order descending stochastic dominance (denoted by X N Y ) iff F N D ( x ) G N D ( x ) for each x [ a , b ] and F k D ( a ) G k D ( a ) for 2 k N 1 , N 3 .
Theorem 8.
Let X and Y be two continuous random variables distributed according to (1) with p.d.f.’s given by f ( x ) = f ( x ; α , θ 1 , λ ) and g ( y ) = f ( y ; α , θ 2 , λ ) , respectively.
(i) 
If θ 1 θ 2 > 0 , then X 1 Y .
(ii) 
If θ 1 θ 2 > 0 , then X 2 Y .
(iii) 
If θ 1 θ 2 > 1 , then X N Y for N 3 .
Proof. 
(i)
For θ 1 θ 2 > 0 , we have
F 1 D ( x ) = F ¯ θ 1 ( x ) = Γ ( λ , θ 1 log ( x / α ) ) Γ ( λ ) Γ ( λ , θ 2 log ( x / α ) ) Γ ( λ ) = G ¯ θ 2 ( x ) = G 1 D ( x ) .
Therefore, for θ 1 θ 2 > 0 , X 1 Y .
(ii)
For θ 1 θ 2 > 0 , we have
F 2 D ( x ) = x F 1 D ( y ) d y x G 1 D ( y ) d y = G 2 D ( x ) .
Therefore, for θ 1 θ 2 > 0 , X 2 Y .
(iii)
For θ 1 θ 2 > 1 , we have
F 3 D ( x ) = x F 2 D ( y ) d y x G 2 D ( y ) d y = G 3 D ( x ) .
Also, for θ 1 θ 2 > 1 , we have
F 2 D ( α ) = α F ¯ θ 1 ( y ) d y = μ θ 1 μ θ 2 = α G ¯ θ 2 ( y ) d y = G 2 D ( α ) .
Therefore, for θ 1 θ 2 > 1 , X 3 Y .
Now assume that, for θ 1 θ 2 > 1 , X N Y for some N 3 , i.e., F N D ( x ) G N D ( x ) for each x [ a , b ] and F k D ( a ) G k D ( a ) for 2 k N 1 , N 3 .
Now for θ 1 θ 2 > 1 , we have
F N + 1 D ( x ) = x F N D ( y ) d y x G N D ( y ) d y = G N + 1 D ( x ) .
Also, for θ 1 θ 2 > 1 , we have
F N D ( α ) = α F N 1 D ( y ) d y μ θ 2 = α G N 1 D ( y ) d y = G N D ( α ) .
Therefore, for θ 1 θ 2 > 1 , X N Y for all N 3 . ☐

3. Some Theoretical Financial Results

The integrated tail distribution function (also known as equilibrium distribution function):
F I ( x ) = 1 E ( X ) α x F ¯ ( y ) d y , x > α .
is an important probability model that often appears in insurance and many other applied fields (see for example Yang 2004).
For the GTLG ( α , λ , θ ) , we have
α x y f ( y ) d y = μ α x f θ 1 ( y ) d y = μ F θ 1 ( x ) , x > α , θ > 1 .
The integrated tail distribution of the GTLG ( α , λ , θ ) is given by
F I ( x ) = 1 μ α x F ¯ θ ( y ) d y = 1 μ x F ¯ θ ( x ) α + μ α x f θ 1 ( y ) d y = 1 μ x F ¯ θ ( x ) α + μ F θ 1 ( x ) , x > α , θ > 1 .
Under the classical model (see Yang 2004) and assuming a positive security loading, ρ , for the claim size distributions with regularly varying tails we have that, by using (3), it is possible to obtain an approximation of the probability of ruin, Ψ ( u ) , when u . In this case the asymptotic approximations of the ruin function is given by
Ψ ( u ) 1 ρ F ¯ I ( u ) , u .
where F ¯ I ( u ) = 1 F I ( u ) .
The use of heavy right-tailed distribution is of vital importance in general insurance. In this regard, Pareto and log-normal distributions have been employed to model losses in motor third liability insurance, fire insurance or catastrophe insurance. It is already known that any probability distribution, that is specified through its cumulative distribution function F ( x ) on the real line, is heavy right-tailed if and only if for every t > 0 , e t x F ¯ ( x ) has an infinite limit as x tends to infinity. On this particular subject, (1) decays to zero slower than any exponential distribution and it is long-tailed since for any fixed t > 0 (see Rytgaard 1990) it is verified that
F ¯ ( x + t ) F ¯ ( x ) , x .
Therefore, as a long-tailed distribution is also heavy right-tailed, the distribution introduced in this manuscript is also heavy right–tailed.
Another important issue in extreme value theory is the regular variation (see Bingham 1987; Rytgaard 1990). A distribution function is called regular varying at infinity with index β if
lim x F ¯ ( t x ) F ¯ ( x ) = t β ,
where the parameter β 0 is called the tail index.
Theorem 9.
The GTLG distribution is regularly varying at infinity with index θ .
Proof. 
Using L’Hospital rule, we have
lim x F ¯ ( t x ) F ¯ ( x ) = lim x Γ ( λ , θ log t x α ) Γ ( λ , θ log x α ) = lim x θ log t x α λ 1 e θ log t x α θ x θ log x α λ 1 e θ log x α θ x = lim x 1 + log t log x α λ 1 t θ = t θ ,
for all α , θ , λ > 0 . ☐
As a consequence of this result we have that if X , X 1 , , X n are i.i.d. random variables with common s.f. (2) and S n = i = 1 n X i , n 1 , then
Pr ( S n > x ) Pr ( X > x ) a s x .
Therefore, if P n = max i = 1 , , n X i , n 1 , we have that
Pr ( S n > x ) n Pr ( X > x ) Pr ( P n > x ) .
This means that for large x the event { S n > x } is due to the event { P n > x } . Therefore, exceedance of high thresholds by the sum S n are due to the exceedance of this threshold by the largest value in the sample.
On the other hand, let the random variable X represent either a policy limit or reinsurance deductible (from an insurer’s perspective); then the limited expected value function L of X with cdf F ( x ) , is defined by
L ( x ) = E [ min ( X , x ) ] = α x y f θ ( y ) d y + x F ¯ θ ( x ) = μ F θ 1 ( x ) + x F ¯ θ ( x ) , x > α , θ > 1 .
Note that L ( x ) represents the expected amount per claim retained by the insured on a policy with a fixed amount deductible of x.
Figure 4 shows the limited expected value function (10) for selected values of λ and θ when α = 1 .
On the other hand, the new distribution can also be applied in rating excess–of–loss reinsurance as it can be seen in the next result.
Theorem 10.
Let X be a random variable denoting the individual claim size taking values only for individual claims greater than d. Let us also assumed that X follows the pdf (1), then the expected cost per claim to the reinsurance layer when the losses excess of m subject to a maximum of l is given by
E [ min ( l , max ( 0 , X m ) ) ] = θ λ Γ ( λ ) m R ( λ , θ , m + l ) R ( λ , θ , m ) + α R ( λ , θ 1 , m ) R ( λ , θ 1 , m + 1 ) + l F ¯ ( m + l ) ,
where R ( a , b , z ) = log λ ( z / α ) E 1 a ( b log ( z / α ) ) , being E n ( z ) = 1 t n exp ( z t ) d t the exponential integral function.
Proof. 
The result follows by having into account that
E [ min ( l , max ( 0 , X m ) ) ] = m m + l ( x m ) f ( x ) d x + l F ¯ ( m + l ) ,
from which we get the result after some tedious algebra. ☐

4. Maximum Likelihood Estimation

In the following it will be assumed that x = ( x 1 , x 2 , , x n ) is a random sample selected from the GTLG distribution with known parameter α and unknown parameters ν = ( θ , λ ) from the p.d.f. (1). Then, the log–likelihood function is given by
( ν ; x ) = n λ log θ log α Γ ( λ ) + i = 1 n ( θ + 1 ) log x i α + ( λ 1 ) log log x i α .
The maximum likelihood estimates (MLEs) ν ^ = ( θ ^ , λ ^ ) , of the parameters ν = ( θ , λ ) are obtained by solving the score equations:
θ = n λ θ i = 1 n log ( x i / α ) = 0 , λ = n log θ ψ ( λ ) + i = 1 n log log ( x i / α ) = 0 ,
where ψ ( · ) is the digamma function. Therefore,
θ ^ = n λ ^ i = 1 n log ( x i / α ) ,
where λ ^ is the solution of the equation:
log ( λ ^ ) ψ ( λ ^ ) log 1 n i = 1 n log ( x i / α ) + 1 n i = 1 n log log ( x i / α ) = 0 .
The second partial derivatives are given by
2 θ 2 = n λ θ 2 , 2 θ λ = n θ , 2 λ 2 = n ψ ( λ ) .
The expected Fisher’s information matrix is given by
I ( ν ) = n λ θ 2 n θ n θ n ψ ( λ ) .
Now the estimated variance-covariance matrix of the MLEs ν ^ is given by the inverse matrix I 1 ( ν ^ ) .
It is known that under certain regularity conditions, the maximum likelihood estimator ν ^ converges in distribution to a bivariate normal distribution with mean equal to the true parameter value and variance-covariance matrix given by the inverse of the information matrix. That is, ν ^ D N ( ν , I 1 ( ν ) ) , which provides a basis for constructing tests of hypotheses and confidence regions. The regularity conditions are verified by taking into account that the Fisher’s information matrix exists and is non-singular and that the parameter space is a subset of the real line and the range of x is independent of ν . Furthermore, additional computations provides that E f ( x ) ν = 0 and that 3 f ( x ) ν 3 is bounded.

5. Numerical Application

Because the main application of the heavy tail distributions is the so-called extreme value theory, we consider a data set coming from catastrophic events. The data set represents loss ratios (yearly data in billion of dollars) for earthquake insurance in California from 1971 through 1993 for values larger than zero. The data are given in Embrechts et al. (1999).
For comparison with other heavy tail distributions, we consider the following models:
(1)
Pareto distribution:
f ( x ) = θ x α x θ , x α , α , θ > 0 .
(2)
Shifted log-normal (SLN):
f ( x ) = θ 2 π x α exp 1 2 θ 2 ( log ( x α ) λ ) 2 , x α , α , θ > 0 , λ I R .
(3)
Burr distribution:
f ( x ) = θ λ ( x α ) θ 1 1 ( x α ) θ λ + 1 , x α , α , θ , λ > 0 .
(4)
Stoppa distribution:
f ( x ) = λ θ x α x θ 1 α x θ λ 1 , x α , α , θ , λ > 0 .
(5)
Log-gamma distribution (LG):
f ( x ) = ( 1 + x α ) 1 1 / θ θ λ Γ ( λ ) log λ 1 ( 1 + x α ) , x α , α , θ , λ > 0 .
Table 1 provides parameter estimates together with standard errors (in brackets) using the maximum likelihood estimation method of the parameters θ and λ when α = 0.1 . This table also gives the negative log-likelihood (NLL), Akaike’s Information Criteria (AIC), Bayesian information criterion (BIC), and Consistent Akaike’s Information Criteria (CAIC).
A lower value of these measures is desirable. These results show that the proposed GTLG distribution provides better fit than the considered competing distributions. Table 2 shows three goodness-of-fit tests for all considered models and that the classical Pareto model is rejected for this data set.

6. Conclusions

In this paper, a continuous probability distribution function with positive support suitable for fitting insurance data has been introduced. The distribution, that arises from a monotonic transformation of the classical Gamma distribution, can be considered as a generalization of the log-gamma distribution. This new development, which has a promising approach for data modeling in the actuarial field, may be very useful for practitioners who handle large claims. For that reason, it can be deemed as an alternative to the classical Pareto distribution. Besides, an extensive analysis of its mathematical properties has been provided.

Acknowledgments

The authors would like to express their gratitude to two anonymous referees for their relevant and useful comments. Emilio Gómez-Déniz was partially funded by grant ECO2013-47092 (Ministerio de Economía y Competitividad, Spain and ECO2017-85577-P (Ministerio de Economía, Industria y Competitividad, Agencia Estatal de Investigación)).

Author Contributions

Mohamed E. Ghitany, Emilio Gómez-Déniz and Saralees Nadarajah have contributed jointly to all of the sections of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Amini, Morteza, S. M. T. K. MirMostafaee, and Jafar Ahmadi. 2014. Log-gamma-generated families of distributions. Statistics 48: 913–32. [Google Scholar] [CrossRef]
  2. Arnold, Barry C. 1983. Pareto Distributions. Silver Spring: International Cooperative Publishing House. [Google Scholar]
  3. Bingham, Nicholas H. 1987. Regular Variation. Cambridge: Cambridge University Press. [Google Scholar]
  4. Boyd, Albert V. 1988. Fitting the truncated Pareto distribution to loss distributions. Journal of the Staple Inn Actuarial Society 31: 151–58. [Google Scholar] [CrossRef]
  5. Brazauskas, Vytaras, and Robert Serfling. 2003. Favorable estimator for fitting Pareto models: A study using goodness-of-fit measures with actual data. ASTIN Bulletin 33: 365–81. [Google Scholar] [CrossRef]
  6. Consul, Prem C., and Gaurav C. Jain. 1971. On the log-Gamma distribution and its properties. Statistische Hefte 12: 100–6. [Google Scholar] [CrossRef]
  7. Embrechts, Paul, Sidney I. Resnick, and Gennady Samorodnitsky. 1999. Extreme value theory as a risk management tool. North American Actuarial Journal 3: 30–41. [Google Scholar] [CrossRef]
  8. Glaser, Ronald E. 1980. Bathtub and related failure rate characterizations. Journal of the American Statistical Association 75: 667–72. [Google Scholar] [CrossRef]
  9. Gómez-Déniz, Emilio, and Enrique Calderín. 2014. A suitable alternative to the Pareto distribution. Hacettepe Journal of Mathematics and Statistics 43: 843–60. [Google Scholar]
  10. Guo, Xu, and Wing-Keung Wong. 2016. Multivariate stochastic dominance for risk averters and risk seekers. RAIRO-Operations Research 50: 575–86. [Google Scholar] [CrossRef]
  11. Gupta, Ramesh C., and Akman H. Olcay. 1995. Mean residual life function for certain types of non-monotonic ageing. Communications in Statistics-Stochastic Models 11: 219–25. [Google Scholar]
  12. Klugman, Stuart A., Harry H. Panjer, and Gordon E. Willmot. 2008. Loss Models: From Data to Decisions, 3rd ed. Hoboken: Wiley. [Google Scholar]
  13. Mata, Ana J. 2000. Princing excess of loss reinsurance with reinstatements. ASTIN Bulletin 30: 349–68. [Google Scholar] [CrossRef]
  14. Rytgaard, Mette. 1990. Estimation in the Pareto distribution. ASTIN Bulletin 20: 201–16. [Google Scholar] [CrossRef]
  15. Shaked, Moshe, and Jeyaveerasingam G. Shanthikumar. 2007. Stochastic Orders. Series: Springer Series in Statistics; New York: Springer. [Google Scholar]
  16. Yang, Hailiang. 2004. Crámer-Lundberg asymptotics. In Encyclopedia of Actuarial Science. New York: Wiley, pp. 1–6. [Google Scholar]
  17. Zografos, Konstantinos, and Narayanaswamy Balakrishnan. 2009. On families of beta and generalized gamma-generated distributions and associated inference. Statistical Methodology 6: 344–62. [Google Scholar] [CrossRef]
Figure 1. Probability density function of GTLG distribution for selected values of θ and λ when α = 1 .
Figure 1. Probability density function of GTLG distribution for selected values of θ and λ when α = 1 .
Jrfm 11 00010 g001
Figure 2. Hazard rate function of GTLG distribution for selected values of θ and λ when α = 1 .
Figure 2. Hazard rate function of GTLG distribution for selected values of θ and λ when α = 1 .
Jrfm 11 00010 g002
Figure 3. Mean residual life function of GTLG distribution for selected values of θ and λ when α = 1 .
Figure 3. Mean residual life function of GTLG distribution for selected values of θ and λ when α = 1 .
Jrfm 11 00010 g003
Figure 4. Limited expected value function of GTLG distribution for selected values of θ and λ when α = 1 .
Figure 4. Limited expected value function of GTLG distribution for selected values of θ and λ when α = 1 .
Jrfm 11 00010 g004
Table 1. Estimated values of the considered models when α = 0.1 .
Table 1. Estimated values of the considered models when α = 0.1 .
DistributionEstimates (S.E.)NLLAICBICCAIC
Pareto θ = 0.249 ( 0.057 ) 77.939157.878158.822159.822
SLN θ = 1.477 ( 0.239 ) 66.080136.161138.05140.05
λ = 1.668 ( 0.339 )
Burr θ = 2.287 ( 0.895 ) 67.352138.703140.592142.592
λ = 0.243 ( 0.106 )
Stoppa θ = 0.768 ( 0.159 ) 66.321136.643138.532140.532
λ = 12.013 ( 6.065 )
LG θ = 0.802 ( 0.271 ) 66.273136.547138.435140.435
λ = 2.474 ( 0.755 )
GTLG θ = 1.845 ( 0.606 ) 65.987135.974137.863139.863
λ = 7.401 ( 2.352 )
Table 2. Test statistics (p-values) of goodness-of-fit tests of the considered models when α = 0.1 .
Table 2. Test statistics (p-values) of goodness-of-fit tests of the considered models when α = 0.1 .
DistributionKolmogorov-SmirnovCramér-Von MissesAnderson-Darling
Pareto0.360 (0.010)0.706 (0.012)3.574 (0.014)
SLN0.116 (0.933)0.031 (0.970)0.205 (0.989)
Burr0.182 (0.500)0.090 (0.633)0.483 (0.761)
Stoppa0.148 (0.746)0.040 (0.933)0.242 (0.974)
LN0.149 (0.731)0.043 (0.918)0.257 (0.966)
GTLG0.148 (0.745)0.040 (0.932)0.242 (0.974)

Share and Cite

MDPI and ACS Style

Ghitany, M.E.; Gómez-Déniz, E.; Nadarajah, S. A New Generalization of the Pareto Distribution and Its Application to Insurance Data. J. Risk Financial Manag. 2018, 11, 10. https://doi.org/10.3390/jrfm11010010

AMA Style

Ghitany ME, Gómez-Déniz E, Nadarajah S. A New Generalization of the Pareto Distribution and Its Application to Insurance Data. Journal of Risk and Financial Management. 2018; 11(1):10. https://doi.org/10.3390/jrfm11010010

Chicago/Turabian Style

Ghitany, Mohamed E., Emilio Gómez-Déniz, and Saralees Nadarajah. 2018. "A New Generalization of the Pareto Distribution and Its Application to Insurance Data" Journal of Risk and Financial Management 11, no. 1: 10. https://doi.org/10.3390/jrfm11010010

APA Style

Ghitany, M. E., Gómez-Déniz, E., & Nadarajah, S. (2018). A New Generalization of the Pareto Distribution and Its Application to Insurance Data. Journal of Risk and Financial Management, 11(1), 10. https://doi.org/10.3390/jrfm11010010

Article Metrics

Back to TopTop