Next Article in Journal
FHA Loans in Foreclosure Proceedings: Distinguishing Sources of Interdependence in Competing Risks
Next Article in Special Issue
Modified Stieltjes Transform and Generalized Convolutions of Probability Distributions
Previous Article in Journal
Estimating and Forecasting Generalized Fractional Long Memory Stochastic Volatility Models
Previous Article in Special Issue
Bivariate Kumaraswamy Models via Modified FGM Copulas: Properties and Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Burr X Pareto Distribution: Properties, Applications and VaR Estimation

1
Department of Measurement and Evaluation, Artvin Çoruh University, Artvin 08000, Turkey
2
Department of Statistics, Hacettepe University, Ankara 06800, Turkey
3
Department of Statistics, Mathematics and Insurance, Benha University, Benha 13511, Egypt
4
School of Mathematics, University of Manchester, Manchester M13 9PL, UK
*
Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2018, 11(1), 1; https://doi.org/10.3390/jrfm11010001
Submission received: 31 October 2017 / Revised: 30 November 2017 / Accepted: 18 December 2017 / Published: 21 December 2017
(This article belongs to the Special Issue Extreme Values and Financial Risk)

Abstract

:
In this paper, a new three-parameter Pareto distribution is introduced and studied. We discuss various mathematical and statistical properties of the new model. Some estimation methods of the model parameters are performed. Moreover, the peaks-over-threshold method is used to estimate Value-at-Risk (VaR) by means of the proposed distribution. We compare the distribution with a few other models to show its versatility in modelling data with heavy tails. VaR estimation with the Burr X Pareto distribution is presented using time series data, and the new model could be considered as an alternative VaR model against the generalized Pareto model for financial institutions.

1. Introduction

The Pareto (P) distribution is very versatile, and a variety of uncertainties can be usefully modelled by it. It has several applications in actuarial science, economics, finance, life testing, survival analysis and telecommunications because of its heavy tail properties. The probability density function (pdf) and cumulative distribution function (cdf) of the P distribution are given (for x > β ) by:
g ( x ; α , β ) = α x x β α and G ( x ; α , β ) = 1 x β α ,
where β > 0 is a scale parameter and α > 0 is a shape parameter. This distribution is a special form of the Pearson Type VI distribution. Since the P distribution has a reversed-J pdf shape and a decreasing hazard rate function (hrf), it may sometimes be insufficient to model data. Generally, practical problems require a wider range of possibilities for the medium risk, for example when the lifetime data present a bathtub-shaped hrf, such as human mortality and machine life cycles. For this reason, researchers developed various extensions and modified forms of the P distribution to obtain a more flexible model with different numbers of parameters. Some of them can be cited as follows: Exponentiated P (EP) (Stoppa 1990; Gupta et al. 1998), Beta P (BP) (Akinsete et al. 2008), Kumaraswamy P (KwP) (Bourguignon et al. 2013), Kumaraswamy generalized P (Nadarajah and Eljabri 2013), P ArcTan (PAT) (Gómez-Déniz and Calderín-Ojeda 2015), exponentiated Weibull P (Afify et al. 2016) and Weibull P (WP) distributions (Tahir et al. 2016). On the other hand, Yousof et al. (2016) defined the cdf of the Burr X-G(BX-G) family (for x ) by:
F ( x ; δ , ξ ) = 1 exp G ( x ; ξ ) G ¯ ( x ; ξ ) 2 δ ,
where δ > 0 is the shape parameter and ξ = ξ k = ( ξ 1 , ξ 2 , ) is a parameter vector. The BX-G density function becomes:
f ( x ; δ , ξ ) = 2 δ g ( x ; ξ ) G ( x ; ξ ) G ¯ ( x ; ξ ) 3 exp G ( x ; ξ ) G ¯ ( x ; ξ ) 2 1 exp G ( x ; ξ ) G ¯ ( x ; ξ ) 2 δ 1 .
This generator can supply the flexibility of pdf and hrf to any baseline distribution model (Yousof et al. 2016).
In this paper, we introduce a new extended P distribution, called the Burr X Pareto (BXP) model, based on the BX-G family. With this idea, we construct the new BXP distribution as more flexible than the P distribution and provide a comprehensive description of some of its mathematical properties. We prove empirically that the BXP model provides better fits than some extensions and generalizations of the P, some of which have one extra model parameter, and the others have the same number of parameters, by means of two applications to real data. We hope that the new distribution will attract wider applications in reliability, engineering and other areas of research.
The rest of the paper is organized as follows. In Section 2, we define the BXP model. In Section 3, we provide a useful mixture representation for its pdf. In Section 4, we derive some of its general mathematical properties. Some estimation methods of the model parameters are performed in Section 5. In Section 6, simulation results to assess the performance of the proposed maximum likelihood estimation procedure are discussed. In Section 7, we provide two applications to real data to illustrate the importance and flexibility of the new family. Value-at-Risk estimation with the BXP distribution is presented in Section 8. Finally, some concluding remarks are presented in Section 9.

2. The New Model

In this section, we define the BXP model and provide some plots for its pdf and hrf. The BXP cdf is given by:
F ( x ; δ , α , β ) = 1 exp x β α 1 2 δ , x > β > 0 , α , δ > 0 .
The pdf corresponding to (3) is given by:
f ( x ; δ , α , β ) = 2 δ α x x β 2 α 1 x β α exp x β α 1 2 × 1 exp x β α 1 2 δ 1 .
Lemma 1 provides random number generations from the BXP and some relations and of the BXP distribution with the well-known Burr X and uniform distributions.
Lemma 1.
(a) If a random variable Y follows the Burr X distribution with shape parameter δ and scale parameter one, then the random variable X = β ( 1 + Y ) ( 1 / α ) follows the BXP ( δ , α , β ) distribution.
(b) If a random variable Y follows the uniform distribution on [0,1], then the random variable:
X = β 1 + log ( 1 Y 1 / δ ) 1 / α
follows the BXP ( δ , α , β ) distribution.
Proof. 
The proofs of (a) and (b) are obtained by the transformation method. ☐
The hrf, reversed hazard rate function and cumulative hazard rate function of X are given, respectively, by:
h ( x ; δ , α , β ) = 2 δ α x x β 2 α 1 x β α exp x β α 1 2 1 exp x β α 1 2 δ 1 1 1 exp x β α 1 2 δ ,
r x ; δ , α , β = 2 δ α x x β 2 α 1 x β α exp x β α 1 2 1 exp x β α 1 2
and:
H ( x ; δ , α , β ) = log 1 exp x β α 1 2 δ .
In Figure 1, we sketched the possible pdf and hrf shapes of the BXP distribution for some selected parameter values. Figure 1 shows that the BXP distribution has various pdf and hrf shapes.

3. Expansions of pdf and cdf

In this section, we provide a very useful linear representation for the BXP density function. If  z < 1 and b > 0 is a real non-integer, the power series holds:
1 z b 1 = i = 0 1 i Γ b i ! Γ b i z i .
For simplicity, ignoring the dependence of G ( x ) and g ( x ) on ξ and applying (5) to (4), we have:
f ( x ) = 2 δ α x x β 2 α 1 x β α i = 0 1 i Γ δ i ! Γ δ i exp i + 1 x β α 1 2 .
Applying the power series to the term exp i + 1 x β α 1 2 , Equation (6) becomes:
f ( x ) = 2 δ α x x β α i , j = 0 1 i + j i + 1 j Γ δ i ! j ! Γ δ i 1 x β α 2 j + 1 x β α 2 j + 3 .
Consider the series expansion:
1 z b = k = 0 Γ b + k k ! Γ b z k , z < 1 , b > 0 .
Applying the expansion in (8) to (7) for the term x β α 2 j + 3 , Equation (7) becomes:
f ( x ) = j , k = 0 Ω j , k π 2 j + 1 + k ( x ; α , β ) ,
where:
Ω j , k = 2 δ 1 j Γ δ Γ 2 j + k + 3 j ! k ! Γ 2 j + 3 2 j + k + 2 i = 0 1 i i + 1 j i ! Γ δ i
and:
π 2 j + 1 + k ( x ; α , β ) = 2 j + 1 + k g ( x ; α , β ) G ( x ; α , β ) 2 j + k + 1 .
Equation (9) reveals that the density of X can be expressed as expansions of the EP densities. Therefore, several mathematical properties of the new family can be obtained by knowing those of the EP distribution. Similarly, the cdf of the BXP family can also be expressed as a mixture of EP cdfs given by:
F ( x ) = j , k = 0 Ω j , k Π 2 j + 1 + k x ; α , β
where:
Π 2 j + 1 + k ( x ) = G ( x ; α , β ) 2 j + 1 + k
is the cdf of the EP family with power parameter 2 j + 1 + k .

4. Properties

In this section, we will provide some mathematical properties of the BXP distribution.

4.1. Moments

The r-th ordinary moment of X is given by μ r = E ( X r ) = x r f x d x . By using Equation (9), we obtain:
μ r = j , k = 0 Ω j , k E ( Y 2 ( j + 1 ) + k r ) ,
E ( Y 2 ( j + 1 ) + k r ) is the r-th ordinary moment of EP distribution with power parameter 2 ( j + 1 ) + k .
The j-th order central moment can be obtained by the following relationship:
μ j = E [ ( X μ ) j ] = r = 0 j j r μ r ( μ ) j r for j = 2 , 3 , ,
where μ = E ( X ) .
For the skewness and kurtosis coefficients, we have:
β 1 = μ 3 2 μ 2 3 and β 2 = μ 4 μ 2 2 .
The values for mean, variance, β 1 and β 2 for selected values of δ , α and β are shown in Table 1. We can say that the BXP model can be useful for various data modelling in terms of skewness and kurtosis.

4.2. Residual and Reversed Residual Life

The n-th moment of the residual life, say m n ( t ) = E [ ( X t ) n X > t ] , n = 1 , 2 , …, uniquely determines F ( x ) . The n-th moment of the residual life of X is given by:
m n ( t ) = 1 1 F ( t ) t ( x t ) n d F ( x ) .
Therefore,
m n ( t ) = 1 1 F ( t ) j , k = 0 r = 0 n Ω j , k n r t n r β n 2 j + 1 + k B t 1 n α , 2 j + 1 + k , n α .
where:
B z a , b = 0 z w a 1 1 w b 1 d w
is the incomplete beta function.
The Mean Residual Life (MRL) function or the life expectation at age t defined by m 1 ( t ) = E ( X t ) X > t follows by setting n = 1 in the last equation.
The n-th moment of the reversed residual life, say M n ( t ) = E ( t X ) n X t for t > 0 and n = 1 , 2 , … uniquely determines F ( x ) . We obtain:
M n ( t ) = 1 F ( t ) 0 t ( t x ) n d F ( x ) .
Then, the n-th moment of the reversed residual life of X becomes:
M n ( t ) = 1 F ( t ) j , k = 0 r = 0 n Ω j , k 1 r n r t n r β n 2 j + 1 + k B t 1 n α , 2 j + 1 + k , n α .
The mean inactivity time (MIT) or mean waiting time is given by M 1 ( t ) = E [ ( t X ) X t ] , and it can be obtained easily by setting n = 1 in the above equation.

4.3. Order Statistics

Order statistics make their appearance in many areas of statistical theory and practice. Let X 1 , , X n be a random sample from the BXP of distributions, and let X 1 , , X n be the corresponding order statistics. The pdf of the i-th order statistic, say X i : n , can be written as:
f i : n x = f x ; δ , α , β B i , n i + 1 j = 0 n i 1 j n i j F j + i 1 x .
Using (3), (4) and (10), we get:
f x F ( x ) j + i 1 = w , k = 0 t w , k π 2 w + 1 + k x ,
where:
t w , k = 2 δ 1 w Γ 2 w + k + 3 w ! k ! Γ 2 w + 3 2 w + k + 2 m = 0 1 m m + 1 w δ j + i 1 m .
The pdf of X i : n can be expressed as:
f i : n x = w , k = 0 j = 0 n i 1 j ( n i j ) b w , k B i , n i + 1 π 2 w + 1 + k x .
Then, the density function of the BXP order statistics is a mixture of EP densities. Based on the last equation, we note that the properties of X i : n follow from those properties of Y 2 w + k + 2 . For example, the moments of X i : n can be expressed as:
E X i : n q = w , k = 0 j = 0 n i 1 j ( n i j ) t w , k B i , n i + 1 β q 2 w + 1 + k B 1 q α , 2 w + 1 + k , q α .

5. Estimation Methods

In this section, we consider the maximum likelihood, least square and weighted least square estimation of the parameters of the BXP distribution.

5.1. Maximum Likelihood Estimation

We consider the estimation of the unknown parameters of the BXP model from complete samples by the maximum likelihood method. The maximum likelihood estimators (MLEs) of the parameters of the BXP ( δ , α , β ) model are now discussed. Let x 1 , , x n be a random sample of this distribution with parameter vector Θ = ( δ , α , β ) . The log-likelihood function for δ is given by:
= n log 2 + n log δ + n log α 2 α log β + i = 1 n log 1 β α x i α + 2 α 1 i = 1 n log x i + i = 1 n log s i + δ 1 i = 1 n log 1 s i ,
where s i = exp x i β α 1 2 .
The last equation can be also maximized either by using the different programs such as R (optim function), SAS (PROC NLMIXED) or by solving the nonlinear likelihood equations obtained by differentiating . We note that since x β , , the MLE of the β parameter cannot be obtained in the usual way. Hence, the MLE of β is the first order statistic X ( 1 ) (Johnson et al. 1994).
The components of the score vector, U Θ = Θ = δ , α , are:
U δ = n δ + i = 1 n log 1 exp x i β α 1 2 ,
and:
U α = n α 2 log β i = 1 n x β α log x β 1 β α x i α + 2 i = 1 n log x i + i = 1 n m i s i δ 1 i = 1 n m i 1 s i
where:
m i = 2 s i x i β α 1 x i β α log x i β .
For fixed β , the interval estimation of the model parameters requires the 2 × 2 observed information matrix J ( Θ ) = { J i j } for i , j = δ , α . The multivariate normal N 2 ( 0 , J ( Θ ^ ) 1 ) distribution, under standard regularity conditions, can be used to provide approximate confidence intervals for the unknown parameters, where J ( Θ ^ ) is the total observed information matrix evaluated at Θ ^ . Then, approximate 100 ( 1 δ ) % confidence intervals for δ and α can be determined by:
δ ^ ± z ζ / 2 J ^ δ δ and α ^ ± z ζ / 2 J ^ α α , where z ζ / 2 is the upper ζ -th percentile of the standard normal model.

5.2. Ordinary and Weighted Least Squares

In this section, we use the least square (LS) and weighted least square (WLS) estimators (Swain et al. 1988) to estimate the parameters of the BXP distribution. Let X 1 , , X n be the order statistics of a random sample of size n from the BXP defined in (4), then the least square estimators (LSEs) of the unknown parameters δ , α and β of the BXP distribution can be obtained by minimizing:
i = 1 n 1 exp x i β α 1 2 δ i n + 1 2 ,
with respect to unknown parameters δ , α and β .
The weighted least square estimators (WLSEs) of the unknown parameters δ , α and β follow by minimizing:
i = 1 n n + 1 2 n + 2 n i + 1 1 exp x i β α 1 2 δ i n + 1 2 ,
with respect to unknown parameters δ , α and β .

6. Simulation Study

Here, we perform the simulation study for MLEs of the BXP distribution. We generate N = 1000 samples of sizes n = 50 , 100, 200 from selected BXP distributions. The random numbers generation is simulated by:
x = β 1 + log ( 1 u 1 / δ ) 1 / α ,
where u is a uniform random number on [0,1]. We also calculate the empirical mean, standard deviations (sd), bias and mean square error (MSE) of the MLEs. The empirical bias and MSE are calculated by:
B i a s h ^ = 1 N i = 1 N h h ^ i
and:
M S E h ^ = 1 N i = 1 N h h ^ i 2
respectively, where h = ( δ , α , β ) . All results of MLEs were obtained using the optim-CG routine in the R programme. The empirical results of this simulation study are reported in Table 2. Table 2 shows that when the sample size increases, the empirical means approach the true parameter value. For the same case, the standard deviations, biases and MSEs decrease in all the cases as expected. Therefore, the MLE method works very well to estimate the model parameters of the BXP distribution.

7. Real Data Modelling

In this section, we present two applications based on the real datasets to show the flexibility of the BXP distribution. The BXP model is compared with the WP, BP, KwP, PAT and P distributions. The cdfs of the above distributions are given (for x > β and a , α , δ > 0 ) by:
F W P x = 1 exp x / β α 1 δ ,
F K w P x = 1 1 1 x / β α a δ ,
F E P x = 1 x / β α δ ,
F P A T x = 1 tan 1 α β / x δ tan 1 α 1
and:
F B P x = 1 B a , δ 0 1 β x 1 α w a 1 1 w δ 1 d w .
In order to see the best model, we obtain the Akaike Information Criteria (AIC), Corrected Akaike Information Criterion (CAIC), Bayesian Information Criterion (BIC), Hannan–Quinn Information Criterion (HQIC) and Kolmogorov–Smirnov (KS) goodness of-fit statistic to see the fitting of the models to dataset. In general, the best model can be chose as the one that has the smallest values of the AIC, CAIC, BIC, HQIC and KS statistics. All computations of the MLEs are performed by the maxLik routine in the R program.
The first dataset gives the survival times, in weeks, of 33 patients suffering from acute myelogenous leukaemia. These data have been introduced by Feigl and Zelen (1965) and analysed by Mead et al. (2017). The data are: 65, 156, 100, 134, 16, 108, 121, 4, 39, 143, 56, 26, 22, 1, 1, 5, 65, 56, 65, 17, 7, 16, 22, 3, 4, 2, 3, 8, 4, 3, 30, 4, 43. This dataset is well known as being bathtub hrf-shaped.
The second data-set shows the time intervals of the successive earthquakes in the last century in the North Anatolia fault zone between 39 . 00 to 42 . 00 north latitude and 39 . 00 to 40 . 00 east longitude. This dataset was introduced and analysed by Kuş (2007). This dataset is well known as being decreasing hrf-shaped.
For both datasets, the estimated parameters based on the MLE method are given in Table 3, whereas the values of the information criteria and goodness-of-fit statistics are given in Table 4. Since MLE of the β equals the minimum order statistics, we suppose it as known to be the minimum value the dataset. Table 4 shows that the BXP distribution has the lowest values of these statistics among all the fitted models. Hence, it could be chosen as the best model under these criteria for both datasets.
The histogram of these datasets and the estimated pdfs and cdfs of the application models are displayed in Figure 2 and Figure 3. From the this figure, we show that the BXP model provides the best fit to these datasets as compared to other models.

8. Value-at-Risk Estimation with the BXP Distribution

In this section, the performance of BXP distribution in estimating Value-at-Risk (VaR) is discussed and compared with the Generalized P (GP) distribution. GP is a widely-used distribution in actuarial sciences, economics and statistics to model the tail of the distribution that contains extreme events. VaR is one of the most popular approaches to measure market risk. From a statistical point of view, the VaR entails the estimation of the quantile of the distribution of returns. The VaR for a long position (left tail of the distribution function) over a given time horizon tis defined as:
V a R p = F 1 ( p ) ,
where F is the distribution function of financial losses, F 1 denotes the inverse of F and p is the quantile at which VaR is calculated.
The Peaks-Over-Threshold (POT) method is used to model the tail of the distribution. POT is based on the distribution of exceedances over a given threshold. The conditional excess distribution, F u , can be defined as follows:
F u ( y ) = P ( X u y / X > u ) , 0 y x F u ,
where random variable X represents the financial losses, u is the threshold, y = X u are the excesses and x F is the right endpoint of F. F u ( y ) can be re-defined as follows:
F u ( y ) = Pr { X u y , X > u } Pr ( X > u ) = F ( y + u ) F ( u ) 1 F ( u ) = F ( x ) F ( u ) 1 F ( u ) .
The Balkema and De Haan (1974) and Pickands (1975) theorem shows that for a sufficiently high threshold u, the excess distribution function F u can be approximated by the GP distribution:
F u ( y ) G ξ , σ ( y ) , u G ξ , σ ( y ) = 1 ( 1 + ξ y σ ) 1 / ξ , ξ 0 1 e y / σ , ξ = 0 ,
where y 0 for ξ 0 and 0 y σ ξ for ξ < 0 and ξ and σ are shape and scale parameters of the GP distribution, respectively. Isolating F x from (14), we get:
F ( x ) = ( 1 F ( u ) ) F u ( y ) + F ( u ) ,
where F u y is the GP distribution and F u = ( n N u ) ( n N u ) n n . Then, substituting (14) in (16), the following estimate for F x is obtained:
F ^ ( x ) = 1 N u n ( 1 + ξ ^ σ ^ ( x u ^ ) ) 1 / ξ ^ ,
where ξ ^ and σ ^ are maximum likelihood estimates of ξ and σ , respectively. Inverting (17) for a given probability p, V a R p can be obtained as:
V a R p = u ^ + σ ^ ξ ^ n N u 1 p ξ ^ 1 .
Threshold selection is a difficult task and an essential part for tail modelling with the GP distribution. The most used method is the Mean Excess (ME) plot for the determination of the threshold. The ME function can be defined as follows:
e n ( u ) = i = 1 n ( X i u ) i = 1 n I X i > u ,
where I is the indicator function. When the empirical ME function is a positively sloped straight line above a certain threshold u, it is evidence that the used dataset follows the GP distribution with a positive ξ parameter.
Here, the BXP distribution is adopted in the POT method. It is assumed that BXP provides a good approximation to F u y for a sufficiently high threshold u. Then, substituting the cdf of BXP in (16), the new estimate for F x can be obtained as:
F ^ x = 1 n N u n 1 exp x β ^ α ^ 1 2 δ ^ + n N u n .
The V a R p can be obtained by inverting (20) for a given probability p, as follows:
V a R p = u ^ + β ^ 1 + log 1 n p 1 N u 1 δ ^ 1 2 1 α ^ ,
where β ^ , δ ^ and α ^ are the maximum likelihood estimates of β , δ and α , respectively.

8.1. S&P-500

To evaluate and compare the performance of the BXP with GP distribution in terms of VaR accuracy, the S&P-500 index is used. The used time series data contain 1465 daily log returns from 4 January 2012 to 27 October 2017. The descriptive statistics of S&P-500 are given in Table 4.
Table 5 shows that the mean returns are closed to zero. The Jarque–Bera statistics in Table 5 also show that the null hypothesis of normality is rejected at any level of significance, as evidenced by the high excess kurtosis and negative skewness. Thus, it is clear that log returns of S&P-500 indexes have non-normal characteristics, excess kurtosis and fat tails. The result of the Ljung–Box test indicates that the raw returns are free from autocorrelation. Therefore, BXP and GP distributions could be applied to the independent and identically distributed observations.
The ME plot is used to determine the optimal threshold value for the POT method. Figure 4 displays the ME plot of the S&P-500 dataset. The optimal threshold could be chosen as 0.02 for the used dataset. It is near the 90% quantile value of the S&P-500.
Table 6 shows the estimated parameters of BXP distribution and GP distribution using the POT method for the S&P-500 dataset. Based on the figures in Table 6, we conclude that since the BXP distribution has the lowest values of these statistics, BXP provides better fits than the GP distribution for tail modelling of S&P-500 indexes. Figure 5 displays the fitted pdf and cdfs of the BXP and GP distributions. Figure 5 reveals that the BXP distribution provides superior fits to the used dataset.
Here, VaR is estimated with the GP and BXP distribution using the POT method for values of p = 0.95 , 0.975 and 0.99 . The rolling window estimation method is used to evaluate the out-of-sample performance of the GP and BXP models. The first 1064 daily returns are used as the window length, and the next 400 data points are considered as out-of-sample period. Figure 6 displays daily VaR estimates of the BXP and GP models. Based on Figure 6, it is clear that the BXP and GP models produce similar VaR estimates. Therefore, the BXP model could be considered as an alternative VaR model against to GP model for financial institutions.
In VaR estimation, using the POT method is applied to raw return data assuming the distribution to be stationary or unconditional without considering the time-varying volatility. The POT method can also be considered as a dynamic model, where the conditional distribution of F is taken into account and the volatility of returns is captured. The dynamic POT method based on the BXP distribution, combined with the generalized autoregressive conditional heteroscedasticity type process, introduced by Bollerslev (1986), could be considered as future work of this study.

9. Conclusions

In this study, we proposed a new distribution that was referred to as the Burr X Pareto (BXP) using the Burr X generator. Some mathematical properties were obtained. The estimation of the model parameters is performed by the MLE, LS and WLS methods. We compare the distribution with a few other models using two real datasets. It is expected that the BXP distribution will serve as a better alternative in modelling real-life datasets. Value-at-Risk estimation with the BXP distribution is presented using time series data, we showed that the new model could be considered as an alternative VaR model against the generalized Pareto model for financial institutions.

Author Contributions

Mustafa Ç. Korkmaz, Emrah Altun, Haitham M. Yousof, Ahmed Z. Afify and Saralees Nadarajah have contributed jointly to all of the sections of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Afify, Ahmed Z., Haitham M. Yousof, Gholamhossein Hamedani, and Gokarna Aryal. 2016. The exponentiated Weibull-Pareto distribution with Application. Journal of Statistical Theory and Applications 15: 326–44. [Google Scholar] [CrossRef]
  2. Akinsete, Alfred, Felix Famoye, and Carl Lee. 2008. The beta-Pareto distribution. Statistics 42: 547–63. [Google Scholar] [CrossRef]
  3. Balkema, August Aimé, and Laurens De Haan. 1974. Residual life time at great age. The Annals of Probability 2: 792–804. [Google Scholar] [CrossRef]
  4. Bourguignon, Marcelo, Rodrigo B. Silva, Luz M. Zea, and Gauss M. Cordeiro. 2013. The Kumaraswamy Pareto distribution. Journal of Statistical Theory and Applications 12: 129–44. [Google Scholar] [CrossRef]
  5. Bollerslev, Tim. 1986. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics 31: 307–27. [Google Scholar] [CrossRef]
  6. Feigl, Polly, and Marvin Zelen. 1965. Estimation of exponential survival probabilities with concomitant information. Biometrics 21: 826–38. [Google Scholar] [CrossRef] [PubMed]
  7. Gómez-Déniz, Emilio, and Enrique Calderín-Ojeda. 2015. Modelling insurance data with the Pareto ArcTan distribution. ASTIN Bulletin The Journal of the International Actuarial Association 45: 639–60. [Google Scholar] [CrossRef]
  8. Gupta, Ramesh C., Pushpa Gupta, and Rameshwar Gupta. 1998. Modeling failure time data by Lehmann alternatives. Communications in Statistics-Theory and Methods 27: 887–904. [Google Scholar] [CrossRef]
  9. Johnson, Norman L., Samuel Kotz, and Narayanaswamy Balakrishnan. 1994. Continuous Univariate Distributions. New York: Wiley, vol. 1. [Google Scholar]
  10. Kuş, Coşkun. 2007. A new lifetime distribution. Computational Statistics & Data Analysis 51: 4497–509. [Google Scholar]
  11. Mead, Mohamed E., Ahmed Z. Afify, Gholamhossein Hamedani, and Indranil Ghosh. 2017. The beta exponential Frechet distribution with applications. Austrian Journal of Statistics 46: 41–63. [Google Scholar] [CrossRef]
  12. Nadarajah, Saralees, and Sumaya Eljabri. 2013. The Kumaraswamy GP distribution. Journal of Data Science 11: 739–66. [Google Scholar]
  13. Pickands, James, III. 1975. Statistical inference using extreme order statistics. The Annals of Statistics 3: 119–31. [Google Scholar]
  14. Stoppa, Gabriele. 1990. A new model for income size distribution. In Income and Wealth Distribution, Inequality and Poverty. Berlin: Springer, pp. 33–41. [Google Scholar]
  15. Swain, James J., Sekhar Venkatraman, and James R. Wilson. 1988. Least squares estimation of distribution function in Johnsons translation system. Journal of Statistical Computation and Simulation 29: 271–97. [Google Scholar] [CrossRef]
  16. Tahir, Muhammad H., Gauss M. Cordeiro, Ayman Alzaatreh, M. Mansoor, and M. Zubair. 2016. A New Weibull-Pareto Distribution: Properties and Applications. Communications in Statistics-Simulation and Computation 45: 3548–67. [Google Scholar] [CrossRef]
  17. Yousof, Haitham M., Ahmed Z. Afify, Gholamhossein Hamedani, and Gokarna Aryal. 2016. The Burr X generator of distributions for lifetime data. Journal of Statistical Theory and Applications 16: 1–19. [Google Scholar] [CrossRef]
Figure 1. Plots of the Burr XPareto (BXP) pdf (top) and plots of the BXP hazard rate function (hrf) (bottom).
Figure 1. Plots of the Burr XPareto (BXP) pdf (top) and plots of the BXP hazard rate function (hrf) (bottom).
Jrfm 11 00001 g001
Figure 2. Fitted pdfs (left panel) and cdfs (right panel) of leukaemia data.
Figure 2. Fitted pdfs (left panel) and cdfs (right panel) of leukaemia data.
Jrfm 11 00001 g002
Figure 3. Fitted pdfs (left panel) and cdfs (right panel) of earthquake data.
Figure 3. Fitted pdfs (left panel) and cdfs (right panel) of earthquake data.
Jrfm 11 00001 g003
Figure 4. Mean excess plot of the S&P-500 dataset.
Figure 4. Mean excess plot of the S&P-500 dataset.
Jrfm 11 00001 g004
Figure 5. Fitted pdfs (left) and cdfs (right) of the BXP and GP distribution for the S&P-500 dataset.
Figure 5. Fitted pdfs (left) and cdfs (right) of the BXP and GP distribution for the S&P-500 dataset.
Jrfm 11 00001 g005
Figure 6. Daily VaR estimates of the BXP and GP models.
Figure 6. Daily VaR estimates of the BXP and GP models.
Jrfm 11 00001 g006
Table 1. Mean, variance, coefficients of skewness and kurtosis for different values of parameters.
Table 1. Mean, variance, coefficients of skewness and kurtosis for different values of parameters.
( δ , α , β ) μ V a r ( X ) β 1 β 2
(0.5, 0.5, 0.5)1.28011.13950.73114.4238
(1, 1, 1)1.63300.9671−1.25393.2132
(2, 2, 2)2.53651.7311−1.96444.3986
(1, 2, 3)2.96065.9323−0.83551.3785
(4, 2, 0.5)0.74110.0415−4.121817.7934
(10, 2, 0.25)0.40740.0011−6.371097.4674
(0.25, 5, 2)0.49621.26711.42872.6058
(0.9, 5, 1.8)1.06331.54400.02550.7191
Table 2. The empirical means, sds (given in · ), biases (given in [ · ] ) and MSEs (given in { · } ) for the special BXP distributions.
Table 2. The empirical means, sds (given in · ), biases (given in [ · ] ) and MSEs (given in { · } ) for the special BXP distributions.
Parameters n = 50 n = 100 n = 200
δ , α , β δ ^ α ^ β ^ δ ^ α ^ β ^ δ ^ α ^ β ^
3, 1.5, 23.01441.54952.02862.99951.52472.01593.00011.51252.0060
(0.1831)(0.1585)(0.1155)(0.0399)(0.1059)(0.0757)(0.0400)(0.0648)(0.0485)
[0.0144][0.0494][0.0286][−0.0005][0.0247][0.0160][0.0001][0.0125][0.0059]
{0.0330}{0.0270}{0.0140}{0.0016}{0.0117}{0.0060}{0.0016}{0.0043}{0.0023}
3, 2, 13.07722.05501.00403.00192.00931.00213.00162.00731.0013
(0.2928)(0.2053)(0.0443)(0.0212)(0.0976)(0.0211)(0.0203)(0.0851)(0.0182)
[0.0772][0.0550][0.0040][0.0019][0.0093][0.0021][0.0016][0.0073][0.0013]
{0.0900}{0.0443}{0.0020}{0.0004}{0.0095}{0.0004}{0.0004}{0.0072}{0.0003}
5, 0.5, 55.08630.51115.12165.00440.50125.00654.99540.49964.9970
(0.2792)(0.0290)(0.3641)(0.0404)(0.0095)(0.0490)(0.0400)(0.0084)(0.0439)
[0.0863][0.0111][0.1216][0.0044][0.0012][0.0065][−0.0046][−0.0004][−0.0030]
{0.0838}{0.0010}{0.1447}{0.0072}{0.00008}{0.0071}{0.0054}{0.00007}{0.0070}
10, 30, 2010.040730.043820.002410.000930.001319.99989.998429.998020.0001
(0.2318)(0.2809)(0.0101)(0.0110)(0.0130)(0.0086)(0.0101)(0.0120)(0.0059)
[0.0406][0.0438][0.0024][0.0009][0.0013][−0.0002][−0.0016][−0.0020][0.0001]
{0.0543}{0.0793}{0.0001}{0.0001}{0.0001}{0.00007}{0.0001}{0.0001}{0.00004}
4, 0.5, 0.53.90770.51470.52654.01790.51210.52034.00120.50520.5079
(0.1261)(0.0532)(0.0926)(0.1010)(0.0411)(0.0711)(0.0878)(0.0246)(0.0440)
[−0.0923][0.0147][0.0265][0.0179][0.0121][0.0203][0.0012][0.0052][0.0079]
{0.0356}{0.0030}{0.0100}{0.0164}{0.0018}{0.0054}{0.0076}{0.0006}{0.0019}
Table 3. MLEs and their standard errors (in parentheses) for both datasets. P, Pareto; PAT, P ArcTan; KwP, Kumaraswamy P; WP, Weibull P; BP, Beta P; EP, Exponentiated P.
Table 3. MLEs and their standard errors (in parentheses) for both datasets. P, Pareto; PAT, P ArcTan; KwP, Kumaraswamy P; WP, Weibull P; BP, Beta P; EP, Exponentiated P.
Leukaemia Data
Model δ ^ α ^ a ^ β ^
BXP0.85050.1900 1
(0.1785)(0.0146)
PAT0.860312.6124 1
(0.1428)(6.6619)
KwP2.39920.00071,828,0151
(0.0291)(0.0001)(5.9317)
WP1.82740.1994 1
(0.2846)(0.0145)
BP51.98000.02393.85401
(0.1240)(0.0048)(0.6551)
EP4.36060.7089 1
(1.3221)(0.1192)
P 0.3319 1
(0.0596)
Earthquake Data
BXP1.99160.1678 9
(0.5622)(0.0117)
PAT1.1704168.1574 9
(0.0667)(5.9619)
WP2.98430.1408 9
(0.4949)(0.0074)
BP60.83410.042812.55929
(1.0981)(0.0053)(0.9570)
EP26.98370.8707 9
(5.7196)(0.0770)
P 0.2264 9
(0.0472)
Table 4. Goodness-of-fit statistics for both datasets. CAIC, Corrected Akaike Information Criterion; HQIC, Hannan–Quinn Information Criterion.
Table 4. Goodness-of-fit statistics for both datasets. CAIC, Corrected Akaike Information Criterion; HQIC, Hannan–Quinn Information Criterion.
Leukaemia Data
ModelAICCAICBICHQICKS
BXP295.0115295.4401297.8795295.94640.1328
PAT301.1477301.5763304.0157302.08260.1398
KwP298.9148299.8037303.2167300.31710.1486
WP295.2830295.7116298.1510296.21790.1418
BP301.5970302.4859305.8990302.99940.1494
EP300.9643301.3929303.8323301.89920.1630
P319.1294319.2673320.5634319.59680.2733
Earthquake Data
BXP381.9004382.5004384.1714382.47150.0817
PAT383.7187384.3187385.9897384.28990.0971
WP382.3901382.9901384.6610382.96120.0962
BP384.5029385.7661387.9094385.35970.0819
EP384.3233384.9233386.5943384.89440.1038
P420.6338420.8243421.7693420.91940.4218
Table 5. Summary statistics for the S&P-500 index.
Table 5. Summary statistics for the S&P-500 index.
Descriptive StatisticsS&P-500
Number of observations1465
Minimum−0.0402
Maximum0.0383
Mean0.0004
Median0.0004
Std.Deviation0.007
Skewness−0.322
Kurtosis5.403
Jarque–Bera377.839 (<0.001)
Ljung–Box28.516 (0.098)
Table 6. MLEs, corresponding standard errors (in second line) and goodness-of-fit statistics for the S&P-500.
Table 6. MLEs, corresponding standard errors (in second line) and goodness-of-fit statistics for the S&P-500.
ModelsParametersGoodness-of-Fit
ξ δ σ α β KS A * W *
BXP 3.2480 0.18934.89818 × 10 5 −93.40160.14270.38090.0556
1.0266 0.0120-
GP0.0847 0.0057 −88.71710.14980.40390.0661
0.1996 0.0015

Share and Cite

MDPI and ACS Style

Korkmaz, M.Ç.; Altun, E.; Yousof, H.M.; Afify, A.Z.; Nadarajah, S. The Burr X Pareto Distribution: Properties, Applications and VaR Estimation. J. Risk Financial Manag. 2018, 11, 1. https://doi.org/10.3390/jrfm11010001

AMA Style

Korkmaz MÇ, Altun E, Yousof HM, Afify AZ, Nadarajah S. The Burr X Pareto Distribution: Properties, Applications and VaR Estimation. Journal of Risk and Financial Management. 2018; 11(1):1. https://doi.org/10.3390/jrfm11010001

Chicago/Turabian Style

Korkmaz, Mustafa Ç., Emrah Altun, Haitham M. Yousof, Ahmed Z. Afify, and Saralees Nadarajah. 2018. "The Burr X Pareto Distribution: Properties, Applications and VaR Estimation" Journal of Risk and Financial Management 11, no. 1: 1. https://doi.org/10.3390/jrfm11010001

Article Metrics

Back to TopTop