Article

Credibility Distribution Estimation with Weighted or Grouped Observations

by Georgios Pitselis 1,2
1 Department of Statistics & Insurance Science, University of Piraeus, 80 Karaoli & Dimitriou Str. T. K., 18534 Piraeus, Greece
2 Department of Mathematics & Statistics, Concordia University, 1455 De Maisonneuve Blvd. W., Montreal, QC H3G 1M8, Canada
Risks 2024, 12(1), 10; https://doi.org/10.3390/risks12010010
Submission received: 17 November 2023 / Revised: 26 December 2023 / Accepted: 26 December 2023 / Published: 3 January 2024

Abstract

In non-life insurance practice, actuaries are often faced with the challenge of predicting the number of claims and claim amounts to be incurred at any given time, which serve to implement fair pricing and reserves given the nature of the risk. This paper extends Jewell’s credible distribution in terms of forecasting the distribution of individual risk in cases where the observations are weighted or are grouped in intervals. More specifically, we show how empirical distribution functions can be embedded within Bühlmann’s and Straub’s credibility model. The optimal projection theorem is applied for credibility estimation and more insight into the derivation of the credibility distribution estimators is also provided. In addition, distribution credibility estimators are established and numerical illustrations are presented herein. Two examples of distribution credibility estimation are given, one with insurance loss data and the other with industry financial data.

1. Introduction

In actuarial science, one of the fundamental problems is that of predicting future claims of individual risk given one’s past experience of a collective of heterogeneous risks. Credibility is a ratemaking technique that serves to forecast future premiums for a group of insurance contracts for which we have experience, whilst we have a lot more experience for a collection of contracts that are similar but not exactly the same.
In the insurance industry, some legislated rules indicate that some changes over time occurred across the claim distribution. Therefore, it is essential to examine these changes at different points of the distribution. An empirical distribution function provides a way to model and sample cumulative probabilities for a data sample that does not fit a standard probability distribution. Its value at a given point is equal to the proportion of observations from the sample that are less than or equal to that point.
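As a minimal illustration of this definition, the following Python sketch evaluates an empirical distribution function at a point. The function name `ecdf` and the loss amounts are hypothetical and are not taken from the paper's data.

```python
def ecdf(sample, x):
    """Return the proportion of observations in `sample` that are <= x."""
    if not sample:
        raise ValueError("sample must contain at least one observation")
    return sum(1 for s in sample if s <= x) / len(sample)


# Hypothetical loss amounts, for illustration only.
losses = [120.0, 340.0, 340.0, 910.0, 1500.0]

print(ecdf(losses, 340.0))   # three of the five losses are <= 340
print(ecdf(losses, 100.0))   # no loss is <= 100
```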
In non-life insurance practice, actuaries are often faced with the challenge of predicting the number of claims and the claim amounts to be incurred at any given time, which serve to implement fair pricing and reserves given the nature of the risk. Actuaries usually deal with events that are uncertain and their economic consequences. The aim of this paper is to carry out the credibility estimation of empirical distribution functions in measuring and managing these uncertainties.
In the first part of this paper, we extend the work of Jewell (1974b) in terms of forecasting the distribution of individual risk in cases where the observations are weighted, for both the non-homogeneous and homogeneous models. Here, the weights (sizes) $w_{ij}$, $i = 1, \dots, n_j$, $j = 1, \dots, K$, change over time. Contract $j$ might result from grouping and averaging $w_{ij}$ independent and identically distributed observations $S_{lij}$, $l = 1, \dots, w_{ij}$, during year $i$, i.e., $X_{ij} = \frac{1}{w_{ij}} \sum_{l=1}^{w_{ij}} S_{lij}$, and then taking the conditional mean of the indicator function, $E[I(X_{ij} \le x) \mid \Theta_j]$. Alternatively, in the case of raw data, contract $j$ might result from grouping and averaging the indicator functions within year $i$, $\bar{I}(S_{ij} \le x) = \frac{1}{w_{ij}} \sum_{l=1}^{w_{ij}} I(S_{lij} \le x)$, and then taking $E[\bar{I}(S_{ij} \le x) \mid \Theta_j]$.
Here, we proceed with the former, treating credibility distribution estimation as point estimation of $F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j) = E[I(X_{ij} \le x) \mid \Theta_j]$. Optimal linearized estimators of $F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j)$ are obtained by the classical least squares approach as well as by the optimal projection theorem of random variables on planes as presented by De Vylder (1976, 1996).
In the second part of this paper, we consider credibility distribution estimation based on grouped data formed by aggregating the individual observations of a variable into groups. The empirical distribution based on grouped data can be constructed by obtaining the point values of the empirical distribution function whenever possible. Then, we approximate the distribution function by connecting those points with straight lines and apply premium estimation in a credibility framework. An alternative model of credibility estimation is also obtained, similarly to the Bühlmann and Straub (1970) model.

Related Works

Bühlmann (1967) and Bühlmann and Straub (1970) established the theoretical foundation of modern credibility theory, presented as a distribution-free credibility estimation. The method was extended in the regression model by Hachemeister (1975), where the credibility premium depends linearly on a number of risk characteristics.
Jewell (1974a) has shown that credibility is exactly Bayesian for a certain exponential family of distributions with natural conjugate priors. Furthermore, Landsman and Makov (1998, 1999) extended the results on the exponential family to the exponential dispersion family. The following key references are related to new developments in credibility estimation: Makov et al. (1996), Christiansen and Schinzinger (2016), Tsai and Lin (2017), Gong et al. (2018), Xacur and Garrido (2018), Tsai and Wu (2020), Tsai and Zhang (2019), Bozikas and Pitselis (2020, 2021), Youn et al. (2021), Wang et al. (2021), Yan and Song (2022), and Kim et al. (2022).
Credibility distribution estimation is closely connected to the area of quantile credibility estimation. The quantile function is the inverse of the distribution function. It specifies the value of the random variable such that the probability of the variable that is less than or equal to that value is equal to the given probability. Kim and Jeon (2013) proposed a credibility theory by truncating the loss data based on quantiles. Some other references related to quantile estimation are: Pitt (2006), Pitselis (2009, 2013, 2017), Kudryavtsev (2009), Gebizlioglu and Yagci (2008), Denuit (2008) and Landsman (1996).
Jewell (1974b) extended the classical Bühlmann (1967) model to the problem of forecasting the distribution of individual risk based upon collective statistics and individual experience data and solved the problem by finding a Bayesian conditional distribution. Jewell (1974b) also obtained additional insight into the nature of credibility estimation by assuming that the true value of $\Theta_j$ is known, and obtained credible distributions and credible densities by carrying out simulations for some conjugate prior families of distributions (e.g., Poisson–gamma, etc.). He also considered the problem of finding a credibility approximation to the true distribution of the next observation.
Korwar and Hollander (1973) defined a sequence of empirical Bayes estimators for estimating a distribution function. Zehnwirth (1981) established the asymptotic optimality of the empirical Bayes distribution function created from the Bayes rule relative to the Dirichlet process prior with unknown parameter. Cai et al. (2015) combined Bühlmann’s credibility theory and Ferguson’s (1973) nonparametric Bayes analysis to develop a completely nonparametric estimation for loss distributions and established a unified distribution-free approach to experience rating for arbitrary premium principles.
This paper is organized as follows. In Section 2, both the linearized non-homogeneous and homogeneous estimators in the weighted credibility distribution model are obtained, and the credibility parameters are estimated. Optimal credibility distribution estimators are also obtained using the optimal projection theorem. In Section 3, the credibility distribution estimation for grouped data is presented. In Section 4, an alternative model of credibility distribution estimation is obtained when the observations are grouped in intervals. Applications to real data are presented in Section 5, one with insurance loss data and the other with industry financial data. Some concluding remarks are presented in Section 6.

2. Weighted Credibility Distribution Estimation

In the following, we consider the credibility model with several contracts and weighted observations. For an insurance portfolio, $X_{ij}$ are the average losses of $w_{ij}$ observations for contract $j = 1, \dots, K$ and period $i = 1, \dots, n_j$. For industry portfolios, $X_{ij}$ denotes the average returns (losses/gains) of $w_{ij}$ firms in $j = 1, \dots, K$ portfolios for period $i = 1, \dots, n_j$.

2.1. Assumptions

We have the following assumptions:
(i) The contracts are independent and the variables $\Theta_1, \dots, \Theta_K$ are identically distributed.
(ii) $F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j) = P(X_{ij} \le x \mid \Theta_j) = E[I(X_{ij} \le x) \mid \Theta_j]$, where $I(X_{ij} \le x)$ is an indicator function that is equal to 1 if $X_{ij} \le x$ and 0 otherwise.
(iii) $\mathrm{Cov}[I(X_{ij} \le x), I(X_{rj} \le x) \mid \Theta_j] = \delta_{ir} \frac{1}{w_{ij}} \sigma_{F_x}^2(\Theta_j)$, where $\delta_{ir} = 1$ if $i = r$ and 0 otherwise.

2.2. Structural Parameters

The structural parameters $F_{X_{ij}}(x)$, $s_{F_x}^2$, and $a_{F_x}$ are as follows:
(SP1) $F_{X_{ij}}(x) = E[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j)] = E[P(X_{ij} \le x \mid \Theta_j)]$;
(SP2) $s_{F_x}^2 = E[\sigma_{F_x}^2(\Theta_j)]$;
(SP3) $a_{F_x} = \mathrm{Var}\{E[I(X_{ij} \le x) \mid \Theta_j]\} = \mathrm{Var}[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j)]$.

2.3. Notation

Here, we present the weighted empirical distribution function as well as some notations that are useful for the derivation of the credibility distribution estimation.
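$$F_{nw}^{j}(x) = \sum_{i=1}^{n_j} \frac{w_{ij}}{w_{\cdot j}} I(X_{ij} \le x), \qquad F_{nw}^{w}(x) = \sum_{j=1}^{K} \frac{w_{\cdot j}}{w_{\cdot\cdot}} F_{nw}^{j}(x), \qquad w_{\cdot j} = \sum_{i=1}^{n_j} w_{ij}, \qquad w_{\cdot\cdot} = \sum_{j=1}^{K} w_{\cdot j},$$
$$F_{nw}^{z}(x) = \sum_{j=1}^{K} \frac{Z_j^{F_x}}{z_{\cdot}^{F_x}} F_{nw}^{j}(x), \qquad z_{\cdot}^{F_x} = \sum_{j=1}^{K} Z_j^{F_x}, \qquad Z_j^{F_x} = \frac{a_{F_x} w_{\cdot j}}{a_{F_x} w_{\cdot j} + s_{F_x}^2}. \tag{1}$$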
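The quantities in (1) can be sketched numerically as follows. This is a minimal illustration, not the paper's implementation: the function names, the two-contract portfolio, and the values of `a` and `s2` (standing in for $a_{F_x}$ and $s_{F_x}^2$) are all assumptions made for the example.

```python
def weighted_ecdf(values, weights, x):
    """F_nw^j(x) = sum_i (w_ij / w_.j) * I(X_ij <= x) for a single contract."""
    w_total = sum(weights)
    return sum(w for v, w in zip(values, weights) if v <= x) / w_total


def credibility_factor(w_total, a, s2):
    """Z_j = a * w_.j / (a * w_.j + s2)."""
    return a * w_total / (a * w_total + s2)


# Hypothetical portfolio: contract -> (average losses X_ij, weights w_ij).
portfolio = {
    "A": ([1.0, 2.0, 3.0], [10.0, 20.0, 10.0]),
    "B": ([2.5, 4.0], [30.0, 30.0]),
}
a, s2 = 0.02, 1.0   # assumed structural parameters, not estimated from data
x = 2.0

F_j = {j: weighted_ecdf(v, w, x) for j, (v, w) in portfolio.items()}
w_j = {j: sum(w) for j, (_, w) in portfolio.items()}
Z_j = {j: credibility_factor(w_j[j], a, s2) for j in portfolio}

w_all = sum(w_j.values())
z_all = sum(Z_j.values())
F_w = sum(w_j[j] / w_all * F_j[j] for j in portfolio)   # weight-averaged ECDF
F_z = sum(Z_j[j] / z_all * F_j[j] for j in portfolio)   # credibility-weighted ECDF

print(F_j, F_w, F_z)
```

Contracts with larger total weight receive a credibility factor closer to 1, so the credibility-weighted average leans toward their experience more than the plain weight average does.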
Lemma 1
(Expectation and Covariance Relations). Based on the above assumptions, we can obtain expressions for the conditional expectations and covariances as follows,
$$E[I(X_{ij} \le x)] = E[F_{nw}^{j}(x)] = E[F_{nw}^{w}(x)] = E[F_{nw}^{z}(x)] = F_{X_{ij}}(x), \tag{2}$$
$$\mathrm{Cov}[I(X_{ij} \le x), I(X_{rj} \le x)] = \delta_{ir} \frac{1}{w_{ij}} s_{F_x}^2 + a_{F_x}, \tag{3}$$
$$\mathrm{Cov}[I(X_{ij} \le x), F_{nw}^{j}(x)] = \mathrm{Cov}[F_{nw}^{j}(x), F_{nw}^{j}(x)] = \frac{1}{w_{\cdot j}} s_{F_x}^2 + a_{F_x} = \frac{a_{F_x}}{Z_j^{F_x}}, \tag{4}$$
$$\mathrm{Cov}[I(X_{ij} \le x), F_{nw}^{z}(x)] = \mathrm{Cov}[F_{nw}^{j}(x), F_{nw}^{z}(x)] = \mathrm{Cov}[F_{nw}^{z}(x), F_{nw}^{z}(x)] = \frac{a_{F_x}}{z_{\cdot}^{F_x}}, \tag{5}$$
$$\mathrm{Cov}[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j), I(X_{ij'} \le x)] = \delta_{jj'} a_{F_x}. \tag{6}$$
Proof.
Relation (2) is straightforward. Relation (3) results from
$$\begin{aligned}
\mathrm{Cov}[I(X_{ij} \le x), I(X_{rj} \le x)] &= E\{\mathrm{Cov}[I(X_{ij} \le x), I(X_{rj} \le x) \mid \Theta_j]\} + \mathrm{Cov}\{E[I(X_{ij} \le x) \mid \Theta_j], E[I(X_{rj} \le x) \mid \Theta_j]\} \\
&= \delta_{ir} \frac{1}{w_{ij}} s_{F_x}^2 + a_{F_x}.
\end{aligned}$$
The first part of (4) results from
$$\begin{aligned}
\mathrm{Cov}[I(X_{ij} \le x), F_{nw}^{j}(x)] &= \mathrm{Cov}\Big[I(X_{ij} \le x), \sum_{r=1}^{n_j} \frac{w_{rj}}{w_{\cdot j}} I(X_{rj} \le x)\Big] = \sum_{r=1}^{n_j} \frac{w_{rj}}{w_{\cdot j}} \mathrm{Cov}[I(X_{ij} \le x), I(X_{rj} \le x)] \\
&= \sum_{r=1}^{n_j} \frac{w_{rj}}{w_{\cdot j}} \Big( \delta_{ir} \frac{1}{w_{ij}} s_{F_x}^2 + a_{F_x} \Big) = \frac{1}{w_{\cdot j}} s_{F_x}^2 + a_{F_x},
\end{aligned}$$
since only the term $r = i$ contributes to the $\delta_{ir}$ part and the weights $w_{rj}/w_{\cdot j}$ sum to one.
Similarly, we can prove the second and third parts of (4). For the proof of the first part of relation (5), we use the independence of the contracts, so that only the term $j' = j$ contributes:
$$\begin{aligned}
\mathrm{Cov}[I(X_{ij} \le x), F_{nw}^{z}(x)] &= \mathrm{Cov}\Big[I(X_{ij} \le x), \sum_{j'=1}^{K} \frac{Z_{j'}^{F_x}}{z_{\cdot}^{F_x}} F_{nw}^{j'}(x)\Big] = \sum_{j'=1}^{K} \frac{Z_{j'}^{F_x}}{z_{\cdot}^{F_x}} \mathrm{Cov}[I(X_{ij} \le x), F_{nw}^{j'}(x)] \\
&= \frac{Z_j^{F_x}}{z_{\cdot}^{F_x}} \mathrm{Cov}[I(X_{ij} \le x), F_{nw}^{j}(x)] = \frac{Z_j^{F_x}}{z_{\cdot}^{F_x}} \cdot \frac{a_{F_x}}{Z_j^{F_x}} = \frac{a_{F_x}}{z_{\cdot}^{F_x}}.
\end{aligned}$$
In the same way, we can prove the second and third parts of (5). Finally, (6) can be proved as
$$\begin{aligned}
\mathrm{Cov}[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j), I(X_{ij'} \le x)] &= E\{\mathrm{Cov}[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j), I(X_{ij'} \le x) \mid \Theta_j]\} + \mathrm{Cov}\{E[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j) \mid \Theta_j], E[I(X_{ij'} \le x) \mid \Theta_j]\} \\
&= \delta_{jj'} a_{F_x}. \qquad \square
\end{aligned}$$
Similarly, as in Bühlmann and Straub (1970), by the following theorems, we will provide the optimal linearized non-homogeneous (as well as the homogeneous) credibility estimators and provide some useful estimators for the structure parameters.
Theorem 1
(Linearized non-homogeneous credibility distribution estimator). Under the assumptions (i)–(iii), the optimal linearized non-homogeneous estimator of $F_{X_{ij}}(x \mid \Theta_j)$ is obtained by
$$F_{X_{ij}}^{Cred}(x \mid \Theta_j) = Z_j^{F_x} F_{nw}^{j}(x) + (1 - Z_j^{F_x}) F_{X_{ij}}(x), \tag{7}$$
with $F_{nw}^{j}(x)$ and $Z_j^{F_x}$ as in (1).
Proof.
We have to find $c_0^j, c_{11}^j, \dots, c_{n_K K}^j$ in
$$g_j[I(X_{11} \le x), \dots, I(X_{n_K K} \le x)] = c_0^j + \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j I(X_{il} \le x), \tag{8}$$
such that
$$Q = E\Big[\Big( P(X_{ij} \le x \mid \Theta_j) - c_0^j - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j I(X_{il} \le x) \Big)^2\Big] \tag{9}$$
is minimum. Differentiating (9) with respect to $c_0^j$, we have
$$\begin{aligned}
\frac{\partial Q}{\partial c_0^j} &= -2 E\Big[ F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j) - c_0^j - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j I(X_{il} \le x) \Big] = 0 \\
\Longrightarrow \; c_0^j &= E[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j)] - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j E[I(X_{il} \le x)] = F_{X_{ij}}(x) - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j F_{X_{ij}}(x).
\end{aligned} \tag{10}$$
Substituting the value of $c_0^j$ in (9) and differentiating with respect to $c_{rl'}^j$, we obtain
$$\begin{aligned}
\frac{\partial Q}{\partial c_{rl'}^j} &= \frac{\partial}{\partial c_{rl'}^j} E\Big[\Big( F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j) - F_{X_{ij}}(x) + \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j F_{X_{ij}}(x) - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j I(X_{il} \le x) \Big)^2\Big] \\
&= -2 E\Big[\Big( F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j) - F_{X_{ij}}(x) - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j [I(X_{il} \le x) - F_{X_{ij}}(x)] \Big) [I(X_{rl'} \le x) - F_{X_{ij}}(x)]\Big] = 0 \\
\Longrightarrow \; & \mathrm{Cov}[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j), I(X_{rl'} \le x)] = \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j \mathrm{Cov}[I(X_{il} \le x), I(X_{rl'} \le x)].
\end{aligned} \tag{11}$$
For $l' = j$, the right-hand side of (11) becomes
$$\begin{aligned}
\sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j \mathrm{Cov}[I(X_{il} \le x), I(X_{rj} \le x)] &= \sum_{i=1}^{n_j} c_{ij}^j \mathrm{Cov}[I(X_{ij} \le x), I(X_{rj} \le x)] + \sum_{l \ne j}^{K} \sum_{i=1}^{n_l} c_{il}^j \mathrm{Cov}[I(X_{il} \le x), I(X_{rj} \le x)] \\
&= \sum_{i=1}^{n_j} c_{ij}^j \Big( \delta_{ir} \frac{1}{w_{rj}} s_{F_x}^2 + a_{F_x} \Big),
\end{aligned}$$
where the second sum vanishes because the contracts are independent. Since the left-hand side of (11) equals $a_{F_x}$ by (6), (11) implies that
$$a_{F_x} = a_{F_x} \sum_{i=1}^{n_j} c_{ij}^j + c_{rj}^j \frac{1}{w_{rj}} s_{F_x}^2. \tag{12}$$
Multiplying (12) by $w_{rj}$ and summing with respect to $r = 1, \dots, n_j$ (writing $\sum_{i=1}^{n_j} c_{ij}^j = c_{\cdot j}^j$), we obtain
$$c_{\cdot j}^j = \frac{a_{F_x} w_{\cdot j}}{a_{F_x} w_{\cdot j} + s_{F_x}^2} = Z_j^{F_x} \; \Longrightarrow \; c_{ij}^j = \frac{w_{ij}}{w_{\cdot j}} Z_j^{F_x}.$$
For $l' \ne j$, the left-hand side of (11) is zero by (6), and the same argument gives $c_{il'}^j = 0$. Then, (8) becomes
$$\begin{aligned}
F_{X_{ij}}^{Cred}(x \mid \Theta_j) &= F_{X_{ij}}(x) - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j F_{X_{ij}}(x) + \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j I(X_{il} \le x) \\
&= F_{X_{ij}}(x) - \sum_{i=1}^{n_j} c_{ij}^j F_{X_{ij}}(x) - \sum_{l \ne j}^{K} \sum_{i=1}^{n_l} c_{il}^j F_{X_{ij}}(x) + \sum_{i=1}^{n_j} c_{ij}^j I(X_{ij} \le x) + \sum_{l \ne j}^{K} \sum_{i=1}^{n_l} c_{il}^j I(X_{il} \le x) \\
&= Z_j^{F_x} \sum_{i=1}^{n_j} \frac{w_{ij}}{w_{\cdot j}} I(X_{ij} \le x) + (1 - Z_j^{F_x}) F_{X_{ij}}(x),
\end{aligned}$$
which provides (7). □
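The estimator (7) can be sketched in a few lines of Python. This is an illustrative sketch only: the function name `cred_nonhomogeneous`, the observations, and the structural parameters (`F_collective`, `a`, `s2`, standing in for $F_{X_{ij}}(x)$, $a_{F_x}$ and $s_{F_x}^2$) are assumed values, treated here as known rather than estimated.

```python
def cred_nonhomogeneous(values, weights, x, F_collective, a, s2):
    """Z_j * F_nw^j(x) + (1 - Z_j) * F_X(x) for one contract, as in (7)."""
    w_total = sum(weights)
    # Weighted empirical distribution function of the contract at x.
    F_individual = sum(w for v, w in zip(values, weights) if v <= x) / w_total
    # Credibility factor Z_j = a * w_.j / (a * w_.j + s2).
    Z = a * w_total / (a * w_total + s2)
    return Z * F_individual + (1.0 - Z) * F_collective


# Hypothetical contract: average losses X_ij with weights w_ij.
values = [1.0, 2.0, 3.0]
weights = [10.0, 20.0, 10.0]

estimate = cred_nonhomogeneous(values, weights, x=2.0,
                               F_collective=0.30, a=0.02, s2=1.0)
print(estimate)
```

With these inputs the individual value is 0.75 and the credibility factor is 4/9, so the estimate is pulled from 0.75 toward the collective value 0.30. Setting `s2 = 0` gives full credibility to the contract, and setting `a = 0` returns the collective value.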
Theorem 2
(Linearized homogeneous credibility distribution estimator). Under the assumptions (i)–(iii), the optimal linearized homogeneous estimator of $F_{X_{ij}}(x \mid \Theta_j)$ is obtained by
$$F_{X_{ij} \mid \Theta_j}^{Cred}(x \mid \Theta_j) = Z_j^{F_x} F_{nw}^{j}(x) + (1 - Z_j^{F_x}) F_{nw}^{z}(x), \tag{13}$$
with $F_{nw}^{j}(x)$, $F_{nw}^{z}(x)$ and $Z_j^{F_x}$ as defined in (1).
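Proof.
Letting
$$g_j[I(X_{11} \le x), \dots, I(X_{n_K K} \le x)] = \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j I(X_{il} \le x), \tag{14}$$
we have to minimize
$$Q = E\Big[\Big( F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j) - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j I(X_{il} \le x) \Big)^2\Big], \tag{15}$$
such that
$$E[F_{X_{ij}}(x \mid \Theta_j)] = \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j E[I(X_{il} \le x)] \tag{16}$$
holds, that is, under the restriction $\sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j = 1$. Introducing the Lagrange multiplier $2\lambda$ leads to the quantity
$$Q = E\Big[\Big( F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j) - F_{X_{ij}}(x) - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j [I(X_{il} \le x) - F_{X_{ij}}(x)] \Big)^2\Big] - 2\lambda \Big[ \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j F_{X_{ij}}(x) - F_{X_{ij}}(x) \Big]. \tag{17}$$
From (16), we obtain
$$F_{X_{ij}}(x) = E[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j)] = \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j E[I(X_{il} \le x)] = F_{X_{ij}}(x) \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j \; \Longrightarrow \; \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j = 1. \tag{18}$$
Differentiating (17) with respect to $c_{i'l'}^j$, we obtain
$$\begin{aligned}
& \mathrm{Cov}[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j), I(X_{i'l'} \le x)] - \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j \mathrm{Cov}[I(X_{il} \le x), I(X_{i'l'} \le x)] + \lambda F_{X_{ij}}(x) = 0 \\
\Longrightarrow \; & \delta_{jl'} a_{F_x} + \lambda F_{X_{ij}}(x) = \sum_{i=1}^{n_{l'}} c_{il'}^j \mathrm{Cov}[I(X_{il'} \le x), I(X_{i'l'} \le x)] = \sum_{i=1}^{n_{l'}} c_{il'}^j \Big( a_{F_x} + \delta_{ii'} \frac{1}{w_{il'}} s_{F_x}^2 \Big) = c_{\cdot l'}^j a_{F_x} + c_{i'l'}^j \frac{1}{w_{i'l'}} s_{F_x}^2.
\end{aligned} \tag{19}$$
Multiplying both sides by $w_{i'l'}$ and taking the sum over $i'$, we obtain for each $l'$:
$$c_{\cdot l'}^j = \frac{[\delta_{jl'} a_{F_x} + \lambda F_{X_{ij}}(x)] \, w_{\cdot l'}}{a_{F_x} w_{\cdot l'} + s_{F_x}^2} = \Big[ \delta_{jl'} + \frac{\lambda}{a_{F_x}} F_{X_{ij}}(x) \Big] Z_{l'}^{F_x}. \tag{20}$$
Substituting (20) into (19) and dropping the primes, we obtain
$$c_{il}^j = [\delta_{jl} a_{F_x} + \lambda F_{X_{ij}}(x)] \frac{(1 - Z_l^{F_x})}{s_{F_x}^2} w_{il}.$$
The restriction on the coefficients then gives
$$1 = \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j = \sum_{l=1}^{K} c_{\cdot l}^j = \sum_{l=1}^{K} \Big[ \delta_{jl} + \frac{\lambda}{a_{F_x}} F_{X_{ij}}(x) \Big] Z_l^{F_x} = \frac{\lambda}{a_{F_x}} F_{X_{ij}}(x) \sum_{l=1}^{K} Z_l^{F_x} + \sum_{l \ne j}^{K} Z_l^{F_x} \delta_{jl} + \delta_{jj} Z_j^{F_x} \; \Longrightarrow \; \lambda F_{X_{ij}}(x) = \frac{a_{F_x} (1 - Z_j^{F_x})}{z_{\cdot}^{F_x}}.$$
Then, using $(1 - Z_l^{F_x}) / s_{F_x}^2 = Z_l^{F_x} / (a_{F_x} w_{\cdot l})$, the optimal linearized homogeneous estimator of $F_{X_{ij}}(x \mid \Theta_j)$ becomes
$$\begin{aligned}
F_{X_{ij}}^{Cred}(x \mid \Theta_j) &= \sum_{l=1}^{K} \sum_{i=1}^{n_l} c_{il}^j I(X_{il} \le x) = \sum_{l=1}^{K} \sum_{i=1}^{n_l} [\delta_{jl} a_{F_x} + \lambda F_{X_{ij}}(x)] \frac{(1 - Z_l^{F_x})}{s_{F_x}^2} w_{il} I(X_{il} \le x) \\
&= (1 - Z_j^{F_x}) \sum_{l=1}^{K} \frac{Z_l^{F_x}}{z_{\cdot}^{F_x}} \sum_{i=1}^{n_l} \frac{w_{il}}{w_{\cdot l}} I(X_{il} \le x) + \sum_{l=1}^{K} \sum_{i=1}^{n_l} Z_l^{F_x} \frac{w_{il}}{w_{\cdot l}} \delta_{jl} I(X_{il} \le x) \\
&= (1 - Z_j^{F_x}) \sum_{l=1}^{K} \frac{Z_l^{F_x}}{z_{\cdot}^{F_x}} F_{nw}^{l}(x) + Z_j^{F_x} \sum_{i=1}^{n_j} \frac{w_{ij}}{w_{\cdot j}} I(X_{ij} \le x),
\end{aligned}$$
resulting in (13). □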
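A minimal Python sketch of the homogeneous estimator (13) follows, in which the collective value is replaced by the credibility-weighted average $F_{nw}^{z}(x)$. The function name `cred_homogeneous`, the portfolio, and the parameter values `a` and `s2` are assumptions made for illustration.

```python
def cred_homogeneous(portfolio, x, a, s2):
    """Return {contract: Z_j * F_nw^j(x) + (1 - Z_j) * F_nw^z(x)}, as in (13)."""
    F, Z = {}, {}
    for j, (values, weights) in portfolio.items():
        w_total = sum(weights)
        # Weighted empirical distribution function of contract j at x.
        F[j] = sum(w for v, w in zip(values, weights) if v <= x) / w_total
        # Credibility factor of contract j.
        Z[j] = a * w_total / (a * w_total + s2)
    z_total = sum(Z.values())
    # Credibility-weighted average over all contracts.
    F_z = sum(Z[j] / z_total * F[j] for j in portfolio)
    return {j: Z[j] * F[j] + (1.0 - Z[j]) * F_z for j in portfolio}


# Hypothetical portfolio: contract -> (average losses X_ij, weights w_ij).
portfolio = {
    "A": ([1.0, 2.0, 3.0], [10.0, 20.0, 10.0]),
    "B": ([2.5, 4.0], [30.0, 30.0]),
}

estimates = cred_homogeneous(portfolio, x=2.0, a=0.02, s2=1.0)
print(estimates)
```

Each estimate lies between the contract's own value and the credibility-weighted average, so no external collective distribution has to be supplied.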
The following theorem will prove that $F_{nw}^{z}(x)$ has a smaller variance than $F_{nw}^{j}(x)$, i.e., based on the heterogeneity and the fluctuation of the risk, $F_{nw}^{z}(x)$ has a minimal mean square error.
Theorem 3.
The variance $\mathrm{Var}\big[\sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} I(X_{ij} \le x)\big]$ is minimum over all $c_{ij}$ such that $\sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} = 1$ for $c_{ij} = \frac{w_{ij}}{w_{\cdot j}} \frac{Z_j^{F_x}}{z_{\cdot}^{F_x}}$.
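Proof.
We have to minimize the following quantity
$$Q = E\Big[\Big( \sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} I(X_{ij} \le x) - E\Big[ \sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} I(X_{ij} \le x) \Big] \Big)^2\Big] - 2\lambda F_{X_{ij}}(x) \Big( \sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} - 1 \Big). \tag{23}$$
Taking the derivative of (23) with respect to $c_{i'j'}$ for $i' = 1, \dots, n_{j'}$ and $j' = 1, \dots, K$, we obtain
$$\begin{aligned}
\frac{\partial Q}{\partial c_{i'j'}} &= 2 E\Big[\Big( \sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} I(X_{ij} \le x) - E\Big[ \sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} I(X_{ij} \le x) \Big] \Big) \big( I(X_{i'j'} \le x) - F_{X_{ij}}(x) \big)\Big] - 2\lambda F_{X_{ij}}(x) = 0 \\
\Longrightarrow \; & E\Big[ \sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} [I(X_{ij} \le x) - F_{X_{ij}}(x)] [I(X_{i'j'} \le x) - F_{X_{ij}}(x)] \Big] = \lambda F_{X_{ij}}(x) \\
\Longrightarrow \; & \sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} \mathrm{Cov}[I(X_{ij} \le x), I(X_{i'j'} \le x)] = \lambda F_{X_{ij}}(x).
\end{aligned} \tag{24}$$
This is the same as
$$\begin{aligned}
& \sum_{j \ne j'}^{K} \sum_{i=1}^{n_j} c_{ij} \mathrm{Cov}[I(X_{ij} \le x), I(X_{i'j'} \le x)] + \sum_{i=1}^{n_{j'}} c_{ij'} \mathrm{Cov}[I(X_{ij'} \le x), I(X_{i'j'} \le x)] = \lambda F_{X_{ij}}(x) \\
\Longrightarrow \; & \sum_{i=1}^{n_{j'}} c_{ij'} \Big( a_{F_x} + \delta_{ii'} \frac{s_{F_x}^2}{w_{ij'}} \Big) = \lambda F_{X_{ij}}(x),
\end{aligned} \tag{25}$$
where the first sum vanishes because the contracts are independent. This gives
$$\begin{aligned}
& a_{F_x} \sum_{i=1}^{n_{j'}} c_{ij'} + \sum_{i \ne i'}^{n_{j'}} c_{ij'} \delta_{ii'} \frac{s_{F_x}^2}{w_{ij'}} + c_{i'j'} \delta_{i'i'} \frac{s_{F_x}^2}{w_{i'j'}} = \lambda F_{X_{ij}}(x) \\
\Longrightarrow \; & a_{F_x} \sum_{i=1}^{n_{j'}} c_{ij'} + c_{i'j'} \frac{s_{F_x}^2}{w_{i'j'}} = \lambda F_{X_{ij}}(x) \; \Longrightarrow \; a_{F_x} w_{i'j'} c_{\cdot j'} + c_{i'j'} s_{F_x}^2 = \lambda F_{X_{ij}}(x) w_{i'j'} \\
\Longrightarrow \; & c_{i'j'} = \frac{w_{i'j'} [\lambda F_{X_{ij}}(x) - a_{F_x} c_{\cdot j'}]}{s_{F_x}^2}.
\end{aligned} \tag{26}$$
Summing over $i'$, we have
$$\begin{aligned}
& \sum_{i'=1}^{n_{j'}} a_{F_x} w_{i'j'} c_{\cdot j'} + \sum_{i'=1}^{n_{j'}} c_{i'j'} s_{F_x}^2 = \sum_{i'=1}^{n_{j'}} \lambda F_{X_{ij}}(x) w_{i'j'} \; \Longrightarrow \; a_{F_x} w_{\cdot j'} c_{\cdot j'} + c_{\cdot j'} s_{F_x}^2 = \lambda F_{X_{ij}}(x) w_{\cdot j'} \\
\Longrightarrow \; & c_{\cdot j'} = \frac{w_{\cdot j'} \lambda F_{X_{ij}}(x)}{a_{F_x} w_{\cdot j'} + s_{F_x}^2} = \frac{\lambda F_{X_{ij}}(x) Z_{j'}^{F_x}}{a_{F_x}}.
\end{aligned}$$
We know that
$$\sum_{j=1}^{K} \sum_{i=1}^{n_j} c_{ij} = 1 \; \Longrightarrow \; \sum_{j=1}^{K} c_{\cdot j} = 1 \; \Longrightarrow \; \sum_{j=1}^{K} \frac{\lambda F_{X_{ij}}(x) Z_j^{F_x}}{a_{F_x}} = \frac{\lambda F_{X_{ij}}(x) z_{\cdot}^{F_x}}{a_{F_x}} = 1 \; \Longrightarrow \; \lambda = \frac{a_{F_x}}{F_{X_{ij}}(x) z_{\cdot}^{F_x}}.$$
We therefore obtain
$$c_{\cdot j} = \frac{\lambda F_{X_{ij}}(x) Z_j^{F_x}}{a_{F_x}} = \frac{a_{F_x} F_{X_{ij}}(x) Z_j^{F_x}}{a_{F_x} F_{X_{ij}}(x) z_{\cdot}^{F_x}} = \frac{Z_j^{F_x}}{z_{\cdot}^{F_x}}$$
and from (26), we have
$$c_{ij} = \frac{w_{ij} (\lambda F_{X_{ij}}(x) - a_{F_x} c_{\cdot j})}{s_{F_x}^2} = \frac{w_{ij} \Big( \frac{a_{F_x}}{z_{\cdot}^{F_x}} - \frac{a_{F_x} Z_j^{F_x}}{z_{\cdot}^{F_x}} \Big)}{s_{F_x}^2} = \frac{w_{ij}}{w_{\cdot j}} \frac{Z_j^{F_x}}{z_{\cdot}^{F_x}}.$$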
Theorem 4.
Under assumptions (i)–(iii), the quadratic loss for the credibility distribution estimator is given by
$$E[F_{X_{ij}}^{Cred}(x \mid \Theta_j) - F_{X_{ij}}(x \mid \Theta_j)]^2 = a_{F_x} (1 - Z_j^{F_x}). \tag{29}$$
Proof.
We have
$$\begin{aligned}
E[F_{X_{ij}}^{Cred}(x \mid \Theta_j) - F_{X_{ij}}(x \mid \Theta_j)]^2 &= E\big\{ Z_j^{F_x} [F_{nw}^{j}(x) - F_{X_{ij}}(x)] - [F_{X_{ij}}(x \mid \Theta_j) - F_{X_{ij}}(x)] \big\}^2 \\
&= (Z_j^{F_x})^2 \mathrm{Var}[F_{nw}^{j}(x)] + \mathrm{Var}[F_{X_{ij}}(x \mid \Theta_j)] - 2 Z_j^{F_x} \mathrm{Cov}[F_{nw}^{j}(x), F_{X_{ij}}(x \mid \Theta_j)] \\
&= Z_j^{F_x} \frac{a_{F_x} w_{\cdot j}}{a_{F_x} w_{\cdot j} + s_{F_x}^2} \Big( \frac{s_{F_x}^2}{w_{\cdot j}} + a_{F_x} \Big) + a_{F_x} - 2 Z_j^{F_x} a_{F_x} = a_{F_x} (1 - Z_j^{F_x}),
\end{aligned}$$
that provides (29). □

2.4. Optimal Projection Theorem

In the following, De Vylder’s (1976, 1996) optimal projection theorem of random variables in the plane is applied in order to derive the optimal estimators of $F_{X_{ij}}(x)$ and $F_{X_{ij}}(x \mid \Theta_j)$. Practically, $F_{X_{ij}}(x)$ is replaced by $F_{nw}^{z}(x)$ in (7).
Theorem 5.
The optimal estimator of $F_{X_{ij}}(x)$ in the plane $H_F(I(X_{ij} \le x),\ i = 1, \dots, n_j,\ j = 1, \dots, K)$ is
$$F_{X_{ij}}^{Proj}(x) = \mathrm{Proj}[F_{X_{ij}}(x) \mid H_F(I(X_{ij} \le x),\ i = 1, \dots, n_j,\ j = 1, \dots, K)] = F_{nw}^{z}(x).$$
Proof.
Directly from (2) and (5). □
Theorem 6.
The optimal credibility estimator of $F_{X_{ij}}(x \mid \Theta_j)$ based on $I(X_{11} \le x), \dots, I(X_{n_K K} \le x)$ is
$$F_{X_{ij} \mid \Theta_j}^{Cred}(x \mid \Theta_j) = \mathrm{Proj}[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j) \mid H_F(I(X_{ij} \le x),\ i = 1, \dots, n_j,\ j = 1, \dots, K)] = Z_j^{F_x} F_{nw}^{j}(x) + (1 - Z_j^{F_x}) F_{nw}^{z}(x). \tag{30}$$
Proof.
In order to prove (30), it is sufficient to prove the unbiasedness and covariance conditions of the optimal projection theorem of random variables on planes not through the origin (see De Vylder (1976, 1996)), that is
$$E[F_{X_{ij} \mid \Theta_j}^{Cred}(x \mid \Theta_j)] = E[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j)] = F_{X_{ij}}(x)$$
and
$$\mathrm{Cov}[F_{X_{ij} \mid \Theta_j}^{Cred}(x \mid \Theta_j) - F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j), I(X_{i'j'} \le x)] = \text{const}.$$
The unbiasedness condition results from (2) and
$$E[F_{X_{ij} \mid \Theta_j}^{Cred}(x \mid \Theta_j)] = E\big[ Z_j^{F_x} F_{nw}^{j}(x) + (1 - Z_j^{F_x}) F_{nw}^{z}(x) \big] = F_{X_{ij}}(x).$$
The covariance condition results from the independence of the contracts and the covariance relations of Lemma 1, which gives
$$\begin{aligned}
& \mathrm{Cov}[F_{X_{ij} \mid \Theta_j}^{Cred}(x \mid \Theta_j) - F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j), I(X_{i'j'} \le x)] \\
&= \mathrm{Cov}\big[ Z_j^{F_x} F_{nw}^{j}(x) + (1 - Z_j^{F_x}) F_{nw}^{z}(x) - F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j), I(X_{i'j'} \le x) \big] \\
&= Z_j^{F_x} \delta_{jj'} \mathrm{Cov}[F_{nw}^{j}(x), I(X_{i'j} \le x)] + (1 - Z_j^{F_x}) \mathrm{Cov}[F_{nw}^{z}(x), I(X_{i'j'} \le x)] - \delta_{jj'} \mathrm{Cov}[F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j), I(X_{i'j} \le x)] \\
&= Z_j^{F_x} \delta_{jj'} \frac{a_{F_x}}{Z_j^{F_x}} + (1 - Z_j^{F_x}) \frac{a_{F_x}}{z_{\cdot}^{F_x}} - \delta_{jj'} a_{F_x} = (1 - Z_j^{F_x}) \frac{a_{F_x}}{z_{\cdot}^{F_x}}.
\end{aligned}$$

2.5. Unbiased Estimators

Below, we provide unbiased estimators analogous to the Bühlmann and Straub (1970) model.
Lemma 2.
The following estimators of the structural parameters $F_{X_{ij}}(x)$, $s_{F_x}^2$ and $a_{F_x}$, presented in Section 2.2, are unbiased.
$$\hat{F}_{X_{ij}}(x) = F_{nw}^{w}(x) \quad \text{or} \quad \hat{F}_{X_{ij}}(x) = F_{nw}^{z}(x),$$
$$\hat{s}_{F_x}^2 = \frac{\sum_{j=1}^{K} \sum_{i=1}^{n_j} w_{ij} [I(X_{ij} \le x) - F_{nw}^{j}(x)]^2}{\sum_{j=1}^{K} (n_j - 1)}, \tag{33}$$
$$\hat{a}_{F_x} = \frac{w_{\cdot\cdot}}{w_{\cdot\cdot}^2 - \sum_{j=1}^{K} w_{\cdot j}^2} \Big\{ \sum_{j=1}^{K} w_{\cdot j} [F_{nw}^{j}(x) - F_{nw}^{w}(x)]^2 - (K - 1) \hat{s}_{F_x}^2 \Big\}. \tag{34}$$
Based on De Vylder (1978), an unbiased estimator of $a_{F_x}$ can take the form
$$\hat{a}_{F_x} = \frac{\sum_{j=1}^{K} Z_j^{F_x} [F_{nw}^{j}(x) - F_{nw}^{z}(x)]^2}{K - 1} \quad \text{(pseudo-estimator)}. \tag{35}$$
Proof.
The unbiasedness of $\hat{F}_{X_{ij}}(x)$ is straightforward and is omitted. The unbiasedness of $\hat{s}_{F_x}^2$ follows from
$$\begin{aligned}
\sum_{j=1}^{K} (n_j - 1) E(\hat{s}_{F_x}^2) &= E\Big[ \sum_{j=1}^{K} \sum_{i=1}^{n_j} w_{ij} [I(X_{ij} \le x) - F_{nw}^{j}(x)]^2 \Big] \\
&= \sum_{j=1}^{K} \sum_{i=1}^{n_j} w_{ij} \big\{ \mathrm{Var}[I(X_{ij} \le x)] + \mathrm{Var}[F_{nw}^{j}(x)] - 2 \mathrm{Cov}[I(X_{ij} \le x), F_{nw}^{j}(x)] \big\} \\
&= \sum_{j=1}^{K} \sum_{i=1}^{n_j} w_{ij} \Big( \frac{s_{F_x}^2}{w_{ij}} + a_{F_x} + \frac{a_{F_x}}{Z_j^{F_x}} - 2 \frac{a_{F_x}}{Z_j^{F_x}} \Big) = \sum_{j=1}^{K} (n_j - 1) s_{F_x}^2,
\end{aligned}$$
resulting in (33). For the proof of the unbiasedness of (34), we refer to Bühlmann and Straub (1970). Finally, the unbiasedness of $\hat{a}_{F_x}$ in (35) results from
$$\begin{aligned}
(K - 1) E(\hat{a}_{F_x}) &= E\Big[ \sum_{j=1}^{K} Z_j^{F_x} [F_{nw}^{j}(x) - F_{nw}^{z}(x)]^2 \Big] \\
&= \sum_{j=1}^{K} Z_j^{F_x} \big\{ \mathrm{Var}[F_{nw}^{j}(x)] + \mathrm{Var}[F_{nw}^{z}(x)] - 2 \mathrm{Cov}[F_{nw}^{j}(x), F_{nw}^{z}(x)] \big\} \\
&= \sum_{j=1}^{K} Z_j^{F_x} \Big( \frac{a_{F_x}}{Z_j^{F_x}} + \frac{a_{F_x}}{z_{\cdot}^{F_x}} - 2 \frac{a_{F_x}}{z_{\cdot}^{F_x}} \Big) = (K - 1) a_{F_x},
\end{aligned}$$
which implies (35). □
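The estimators (33) and (34) can be sketched as follows. The function name `structural_estimates` and the two-contract portfolio are hypothetical. The sketch applies the formulas as stated, without any truncation, so $\hat{a}_{F_x}$ may come out negative on some data.

```python
def structural_estimates(portfolio, x):
    """Return (F_w, s2_hat, a_hat) at the point x, following (33) and (34)."""
    K = len(portfolio)
    # Indicators I(X_ij <= x), contract totals w_.j and weighted ECDFs F_nw^j(x).
    ind = {j: [1.0 if v <= x else 0.0 for v in vals]
           for j, (vals, _) in portfolio.items()}
    w_j = {j: sum(wts) for j, (_, wts) in portfolio.items()}
    F_j = {j: sum(w * i for w, i in zip(portfolio[j][1], ind[j])) / w_j[j]
           for j in portfolio}
    w_all = sum(w_j.values())
    F_w = sum(w_j[j] / w_all * F_j[j] for j in portfolio)

    # (33): weighted within-contract sum of squares over sum_j (n_j - 1).
    within = sum(w * (i - F_j[j]) ** 2
                 for j in portfolio
                 for w, i in zip(portfolio[j][1], ind[j]))
    s2_hat = within / sum(len(ind[j]) - 1 for j in portfolio)

    # (34): weighted between-contract sum of squares, corrected by s2_hat.
    between = sum(w_j[j] * (F_j[j] - F_w) ** 2 for j in portfolio)
    scale = w_all / (w_all ** 2 - sum(w ** 2 for w in w_j.values()))
    a_hat = scale * (between - (K - 1) * s2_hat)
    return F_w, s2_hat, a_hat


# Hypothetical portfolio: contract -> (average losses X_ij, weights w_ij).
portfolio = {
    "A": ([1.0, 2.0, 3.0], [10.0, 20.0, 10.0]),
    "B": ([2.5, 4.0], [30.0, 30.0]),
}

F_w, s2_hat, a_hat = structural_estimates(portfolio, x=2.0)
print(F_w, s2_hat, a_hat)
```

The estimates depend on the evaluation point, so they have to be recomputed for every $x$ of interest.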

3. Credible Distribution for Grouped Data

Grouped data are formed by aggregating the individual observations of a variable into groups. For example, a histogram is a density approximation for grouped data. The construction of the empirical distribution based on grouped data can be achieved by obtaining the point values of the empirical distribution function whenever possible. Then, we can approximate the distribution function by connecting those point values with straight lines.
Empirical distribution for grouped data is evaluated at a point estimate x. We consider the case where the point estimate x is at a boundary and the case where the value of x is between the boundaries.

3.1. Empirical Distribution for Grouped Data at Boundary

For contract $j$, let the group boundaries be $c_{0j} < c_{1j} < \dots < c_{nj}$, where $c_{0j} = 0$ and $c_{n+1,j} = \infty$. Let $m_{ij}$ be the number of observations in the interval $(c_{i-1,j}, c_{ij})$, $i = 1, 2, \dots, n_j$, $j = 1, 2, \dots, K$, and let $m_{\cdot j} = \sum_{i=1}^{n_j} m_{ij}$ be the total number of observations for the $j$th contract. For grouped data, the empirical distribution function at each group boundary $c_{rj}$ is defined as
$$F_{m}^{j}(c_{rj}) = \frac{1}{m_{\cdot j}} \sum_{i=1}^{r} m_{ij}.$$
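The boundary values of the grouped empirical distribution function are cumulative relative frequencies, which the following sketch computes for a single contract. The function name, the boundaries and the interval counts are hypothetical.

```python
from itertools import accumulate


def grouped_ecdf_at_boundaries(counts):
    """Return F_m^j(c_rj) = (1 / m_.j) * sum_{i <= r} m_ij for r = 1, ..., n_j."""
    total = sum(counts)
    return [cumulative / total for cumulative in accumulate(counts)]


# Hypothetical grouped data for one contract.
boundaries = [0.0, 100.0, 500.0, 1000.0]   # c_0j < c_1j < c_2j < c_3j
counts = [20, 50, 30]                      # m_1j, m_2j, m_3j

F_at_boundaries = grouped_ecdf_at_boundaries(counts)
for c, F in zip(boundaries[1:], F_at_boundaries):
    print(c, F)
```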
For grouped data, there is no problem if the distribution function has to be estimated at a boundary. When all of the information is available, working with the empirical estimate of the distribution function is straightforward (see Klugman et al. (2012)). We have the following assumptions:

3.1.1. Assumptions

(i*) The contracts are independent and the variables $\Theta_1, \dots, \Theta_K$ are identically distributed. The observations $X_{ij}$ have finite variance,
(ii*) $E[I(X_{ij} \le x) \mid \Theta_j] = E[F_{m}^{j}(x \mid \Theta_j)] = F_{X_{ij} \mid \Theta_j}(x \mid \Theta_j)$,
(iii*) $\mathrm{Var}[I(X_{ij} \le x) \mid \Theta_j] = \frac{1}{m_{ij}} \sigma_x^2(\Theta_j)$ and $\mathrm{Var}[F_{m}^{j}(x \mid \Theta_j)] = \frac{1}{m_{\cdot j}} \sigma_x^2(\Theta_j)$.

3.1.2. Structural Parameters

$$\mu_x = E[F_{m}^{j}(x)] = F_{X_{ij}}(x), \qquad s_x^2 = E[\sigma_x^2(\Theta_j)], \qquad a_x = \mathrm{Var}\{E[I(X_{ij} \le x) \mid \Theta_j]\}. \tag{38}$$

3.1.3. Notation

Here, we adopt the following notation:
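$$F_{m}^{j}(x) = \sum_{i=1}^{n_j} \frac{m_{ij}}{m_{\cdot j}} I(X_{ij} \le x), \qquad F_{m}^{m}(x) = \sum_{j=1}^{K} \frac{m_{\cdot j}}{m_{\cdot\cdot}} F_{m}^{j}(x), \qquad F_{m}^{z}(x) = \sum_{j=1}^{K} \frac{Z_j^{x}}{z_{\cdot}^{x}} F_{m}^{j}(x),$$
$$m_{\cdot j} = \sum_{i=1}^{n_j} m_{ij}, \qquad m_{\cdot\cdot} = \sum_{j=1}^{K} m_{\cdot j}, \qquad z_{\cdot}^{x} = \sum_{j=1}^{K} Z_j^{x}, \qquad Z_j^{x} = \frac{m_{\cdot j} a_x}{m_{\cdot j} a_x + s_x^2}. \tag{39}$$
Based on the above assumptions, a credibility distribution estimator for $F_{X_{ij}}(x \mid \Theta_j)$ is obtained as
$$F_{X_{ij}}^{Cred}(x \mid \Theta_j) = Z_j^{x} F_{m}^{j}(x) + (1 - Z_j^{x}) F_{X_{ij}}(x). \tag{40}$$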
With the following theorem, we can obtain the credibility distribution estimator of $F_{X_{ij}}(x \mid \Theta_j)$.
Theorem 7.
Under the assumptions (i*)–(iii*), the credibility factor in (40) is given by
$$Z_j^{x} = \frac{m_{\cdot j} a_x}{m_{\cdot j} a_x + s_x^2},$$
with $a_x$ as in (38) and $m_{\cdot j}$ as in (39).
Proof.
The proof of the theorem can be obtained by minimizing the expression
$$Q = E\big[ F_{X_{ij}}(x \mid \Theta_j) - Z_j^{x} F_{m}^{j}(x) - (1 - Z_j^{x}) F_{X_{ij}}(x) \big]^2,$$
with respect to $Z_j^{x}$. □

3.1.4. Credibility Estimators

Lemma 3.
The credibility point estimators of $F_{X_{ij}}(x)$, $s_x^2$ and $a_x$ are given as follows:
$$\hat{F}_{X_{ij}}(x) = F_{m}^{m}(x) \quad \text{or} \quad \hat{F}_{X_{ij}}(x) = F_{m}^{z}(x),$$
$$\hat{s}_x^2 = \frac{\sum_{j=1}^{K} \sum_{i=1}^{n_j} m_{ij} [I(X_{ij} \le x) - F_{m}^{j}(x)]^2}{\sum_{j=1}^{K} (n_j - 1)},$$
$$\hat{a}_x = \frac{m_{\cdot\cdot}}{m_{\cdot\cdot}^2 - \sum_{j=1}^{K} m_{\cdot j}^2} \Big\{ \sum_{j=1}^{K} m_{\cdot j} [F_{m}^{j}(x) - F_{m}^{m}(x)]^2 - (K - 1) \hat{s}_x^2 \Big\} \quad \text{or} \quad \hat{a}_x = \frac{\sum_{j=1}^{K} Z_j^{x} [F_{m}^{j}(x) - F_{m}^{z}(x)]^2}{K - 1}.$$
Proof. 
Similarly to the proof of Lemma 2. □

3.2. Empirical Distribution for Grouped Data at Value x between Boundaries

Now, suppose that the value of $x$ is between the boundaries $c_{i-1,j}$ and $c_{ij}$. Then, for contract $j$, the empirical distribution function is given by
$$F_{m}^{j}(x) = \begin{cases} 0, & x \le c_0, \\ \dfrac{(c_{ij} - x) F_{m}^{j}(c_{i-1,j}) + (x - c_{i-1,j}) F_{m}^{j}(c_{ij})}{c_{ij} - c_{i-1,j}}, & c_{i-1,j} \le x \le c_{ij}, \\ 1, & x > c_n. \end{cases} \tag{41}$$
This function is differentiable at all values except for the group boundaries. Based on (41), we can obtain the following
$$E[F_{m}^{j}(x \mid \Theta_j)] = \frac{(c_{ij} - x) F_{X_{ij}}(c_{i-1,j} \mid \Theta_j) + (x - c_{i-1,j}) F_{X_{ij}}(c_{ij} \mid \Theta_j)}{c_{ij} - c_{i-1,j}}$$
and
$$F_{X_{ij}}(x) = E[F_{m}^{j}(x)] = \frac{(c_{ij} - x) F_{X_{ij}}(c_{i-1,j}) + (x - c_{i-1,j}) F_{X_{ij}}(c_{ij})}{c_{ij} - c_{i-1,j}}.$$
Note that the above estimator is biased although it is an unbiased estimator of the true interpolated value (see Klugman et al. (2012)).
The conditional variance of the empirical distribution is
$$\mathrm{Var}[F_{m}^{j}(x \mid \Theta_j)] = \frac{(c_{ij} - x)^2 \mathrm{Var}[F_{m}^{j}(c_{i-1,j} \mid \Theta_j)] + (x - c_{i-1,j})^2 \mathrm{Var}[F_{m}^{j}(c_{ij} \mid \Theta_j)]}{(c_{ij} - c_{i-1,j})^2} + \frac{2 (c_{ij} - x)(x - c_{i-1,j}) \mathrm{Cov}[F_{m}^{j}(c_{i-1,j} \mid \Theta_j), F_{m}^{j}(c_{ij} \mid \Theta_j)]}{(c_{ij} - c_{i-1,j})^2},$$
where
$$\mathrm{Var}[F_{m}^{j}(c_{i-1,j} \mid \Theta_j)] = \frac{1}{m_{\cdot j}} F_{X_{ij}}(c_{i-1,j} \mid \Theta_j) [1 - F_{X_{ij}}(c_{i-1,j} \mid \Theta_j)],$$
$$\mathrm{Var}[F_{m}^{j}(c_{ij} \mid \Theta_j)] = \frac{1}{m_{\cdot j}} F_{X_{ij}}(c_{ij} \mid \Theta_j) [1 - F_{X_{ij}}(c_{ij} \mid \Theta_j)]$$
and
$$\mathrm{Cov}[F_{m}^{j}(c_{i-1,j} \mid \Theta_j), F_{m}^{j}(c_{ij} \mid \Theta_j)] = \frac{1}{m_{\cdot j}} \big( F_{X_{ij}}(\min\{c_{i-1,j}, c_{ij}\} \mid \Theta_j) - F_{X_{ij}}(c_{i-1,j} \mid \Theta_j) F_{X_{ij}}(c_{ij} \mid \Theta_j) \big).$$
Then, we can proceed as in Section 3.1 for obtaining the credibility distribution estimator of F X i j ( x | Θ j ) , when the value of x is between boundaries.
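The straight-line interpolation between group boundaries can be sketched as below. This is a minimal illustration that assumes finite boundaries; the function name `ogive` and the data are hypothetical.

```python
from bisect import bisect_left


def ogive(boundaries, counts, x):
    """Grouped empirical distribution function, linearly interpolated at x."""
    if x <= boundaries[0]:
        return 0.0
    if x > boundaries[-1]:
        return 1.0
    total = sum(counts)
    # Cumulative counts at the boundaries c_0, c_1, ..., c_n.
    cumulative = [0]
    for m in counts:
        cumulative.append(cumulative[-1] + m)
    # Smallest i with boundaries[i] >= x, so that c_{i-1} < x <= c_i.
    i = bisect_left(boundaries, x)
    lower, upper = boundaries[i - 1], boundaries[i]
    weighted = (upper - x) * cumulative[i - 1] + (x - lower) * cumulative[i]
    return weighted / (upper - lower) / total


# Hypothetical grouped data for one contract.
boundaries = [0.0, 100.0, 500.0, 1000.0]
counts = [20, 50, 30]

print(ogive(boundaries, counts, 300.0))   # halfway between c_1 = 100 and c_2 = 500
print(ogive(boundaries, counts, 500.0))   # exactly at the boundary c_2
```

At a boundary the interpolated value coincides with the cumulative relative frequency, so the two cases of Section 3 agree there.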

4. Alternative Credibility Distribution Approach for Grouped Data

For grouped data, the previous approaches yield credibility point estimates. If we want to find the credibility estimation in the framework of Bühlmann and Straub (1970), we may assume a uniform distribution within each interval $(c_{i-1,j}, c_{ij})$, so that the first two moments can be estimated from
$$\hat{\mu}_j^{(k)} = \sum_{i=1}^{n_j} \frac{m_{ij}}{m_{\cdot j}} \frac{c_{ij}^{k+1} - c_{i-1,j}^{k+1}}{(k + 1)(c_{ij} - c_{i-1,j})},$$
for $k = 1, 2$. Thus, for contract $j$, the empirical estimate of the mean ($k = 1$) is the weighted average of the interval midpoints, where the weight $m_{ij} / m_{\cdot j}$ for an interval is the proportion of the observations that are in the interval (histogram), i.e.,
$$\hat{\mu}_j = \sum_{i=1}^{n_j} \frac{m_{ij}}{m_{\cdot j}} \frac{c_{ij} + c_{i-1,j}}{2}.$$
Letting $C_{ij} = \frac{c_{ij} + c_{i-1,j}}{2}$ and assuming that $E(C_{ij} \mid \Theta_j) = \mu(\Theta_j)$ and $\mathrm{Cov}(C_{rj}, C_{ij} \mid \Theta_j) = \delta_{ri} \frac{1}{m_{ij}} \sigma^2(\Theta_j)$, the credibility estimation based on grouped data can be obtained similarly to the Bühlmann and Straub (1970) model
$$\mu^{Cred}(\Theta_j) = Z_j \hat{\mu}_j + (1 - Z_j) \mu,$$
with parameters
$$\mu = E[\mu(\Theta_j)], \qquad s^2 = E[\sigma^2(\Theta_j)], \qquad a = \mathrm{Var}[\mu(\Theta_j)], \qquad Z_j = \frac{a m_{\cdot j}}{a m_{\cdot j} + s^2}.$$
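The grouped moments and the resulting credibility mean can be sketched as follows. The function names, the grouped data, and the structural parameters `mu`, `a` and `s2` are assumptions chosen for illustration; in practice the parameters would be estimated.

```python
def grouped_moment(boundaries, counts, k):
    """k-th empirical moment, assuming a uniform spread within each interval."""
    total = sum(counts)
    moment = 0.0
    for i, m in enumerate(counts, start=1):
        lower, upper = boundaries[i - 1], boundaries[i]
        # k-th moment of a uniform distribution on (lower, upper).
        interval_moment = (upper ** (k + 1) - lower ** (k + 1)) / ((k + 1) * (upper - lower))
        moment += (m / total) * interval_moment
    return moment


def credibility_mean(mu_j, m_total, mu, a, s2):
    """Z_j * mu_j + (1 - Z_j) * mu with Z_j = a * m_.j / (a * m_.j + s2)."""
    Z = a * m_total / (a * m_total + s2)
    return Z * mu_j + (1.0 - Z) * mu


# Hypothetical grouped data for one contract.
boundaries = [0.0, 100.0, 500.0, 1000.0]
counts = [20, 50, 30]

mean_j = grouped_moment(boundaries, counts, 1)      # weighted average of midpoints
second_j = grouped_moment(boundaries, counts, 2)
variance_j = second_j - mean_j ** 2

# Assumed collective mean and structural parameters.
premium = credibility_mean(mean_j, sum(counts), mu=400.0, a=50.0, s2=5000.0)
print(mean_j, variance_j, premium)
```

For $k = 1$ the interval moment reduces to the midpoint, so `mean_j` equals the weighted average of the interval midpoints.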
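Theorem 8.
The following are unbiased estimators for $\mu$, $s^2$, and $a$:
$$\hat{\mu} = \bar{C}_{\cdot\cdot} = \sum_{j=1}^{K} \frac{m_{\cdot j}}{m_{\cdot\cdot}} \bar{C}_{\cdot j}, \quad \text{with} \quad \hat{\mu}_j = \bar{C}_{\cdot j} = \sum_{i=1}^{n_j} \frac{m_{ij}}{m_{\cdot j}} C_{ij},$$
$$\hat{s}^2 = \frac{1}{K} \sum_{j=1}^{K} \hat{s}_j^2, \quad \text{with} \quad \hat{s}_j^2 = \hat{\mu}_j^{(2)} - (\hat{\mu}_j)^2$$
and
$$\hat{a} = \frac{m_{\cdot\cdot}}{m_{\cdot\cdot}^2 - \sum_{j=1}^{K} m_{\cdot j}^2} \Big\{ \sum_{j=1}^{K} m_{\cdot j} (\hat{\mu}_j - \hat{\mu})^2 - (K - 1) \hat{s}^2 \Big\},$$
or
$$\hat{a} = \frac{1}{K - 1} \sum_{j=1}^{K} Z_j (\bar{C}_{\cdot j} - \bar{C}_z)^2, \quad \text{where} \quad \bar{C}_z = \sum_{j=1}^{K} \frac{Z_j}{z_{\cdot}} \bar{C}_{\cdot j}.$$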
Proof. 

5. Numerical Illustrations

In this section, we use two datasets, one with insurance motor claims data and a second with monthly returns financial data.

5.1. Numerical Example with Insurance Data

The dataset is provided by Insurance Europe (2022) and includes a database with figures on the European insurance industry during the period 2004–2020 for 32 EU countries. Our numerical illustration is based on a complete dataset of 10 selected countries for the years 2004–2018. Our dataset also contains the motor claims paid and the number of motor claims for each country and each year. The selected countries are the following: Austria (AT); Germany (DE); Finland (FI); Greece (GR); Hungary (HR); Italy (IT); Norway (NO); Poland (PL); Portugal (PT); and Sweden (SE). Table 1 shows the summary statistics of the motor claim amounts and the claim numbers for countries j = 1 , , 10 and years i = 1 , , 15 .
Table 2 illustrates the results of the credibility distribution function for motor claim amounts during the years 2004–2018. Specifically, the upper part of the table shows the individual empirical distribution $\hat{F}_{nw_j}(x)$ of claim amounts $X_{ij} \le x$ ($x$ = 320, 800, 1000, 2000, 3000, 23,800, 23,896, 23,897), and the corresponding credibility distribution estimators $\hat{F}^{Cred}_{X_{ij}}(x \mid \Theta_j)$ are shown in the middle part of the table. The estimated credibility factors $\hat{Z}_j^{F_x}$, as well as the estimated parameters $\hat{F}_{nww}(x)$, $\hat{s}^2_{F_x}$, and $\hat{a}_{F_x}$, are presented in the lower part of Table 2. Note that $\hat{F}_{nw_j}(x) = 0$ means that all claims satisfy $X_{ij} > x$, and $\hat{F}_{nw_j}(x) = 1$ means that all claims satisfy $X_{ij} \le x$.
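The way the three parts of Table 2 fit together can be sketched in Python. The sketch assumes that the collective value $\hat{F}_{nww}(x)$ is the weight-averaged individual value across contracts and that the credibility estimate is the Bühlmann and Straub type convex combination of individual and collective values; both are assumptions here, although they reproduce the tabulated figures to the printed precision. The function names are ours, and the inputs are the mean weights from Table 1 and the $x = 800$ row of Table 2.

```python
def weighted_ecdf(values, weights, x):
    """Share of the total weight carried by observations with value <= x."""
    total = float(sum(weights))
    return sum(w for v, w in zip(values, weights) if v <= x) / total


def pooled_cdf(f_by_contract, weight_by_contract):
    """Weight-averaged (collective) distribution value across contracts."""
    total = float(sum(weight_by_contract))
    return sum(w / total * f for f, w in zip(f_by_contract, weight_by_contract))


def credibility_cdf(f_individual, f_collective, z):
    """Convex combination of individual and collective distribution values."""
    return z * f_individual + (1.0 - z) * f_collective


# Mean yearly weights from Table 1 (AT, DE, FI, GR, HR, IT, NO, PL, PT, SE).
# Every country has 15 years, so the means are proportional to total weights.
mean_weights = [1282093, 9220067, 493392, 475977, 208084,
                4317710, 728372, 2177715, 800425, 1111064]
# Individual empirical distribution values at x = 800 (Table 2).
f_800 = [0.0, 0.0, 0.1584, 0.4206, 1.0, 0.0, 0.0, 0.0, 0.0, 0.1940]

collective_800 = pooled_cdf(f_800, mean_weights)
# Credibility estimate for AT at x = 800, using the tabulated factor 0.1727.
cred_at_800 = credibility_cdf(f_800[0], collective_800, 0.1727)
print(round(collective_800, 5), round(cred_at_800, 5))
```

With these inputs, the collective value agrees with the tabulated $\hat{F}_{nww}(800) = 0.03372$ and the estimate for AT agrees with the tabulated 0.02790.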
In Table 2, we observe a lack of monotonicity of the estimated credibility distributions for all contracts. In order to obtain monotonicity, we proceed as in Cai et al. (2015) by restricting the credibility factor $Z_j^F$ to be a constant free of $x$. The results are shown in Table 3. Although monotonicity has been restored, from a risk management perspective (which serves fair pricing and reserving given the nature of the risk) more investigation is required, especially at the points where monotonicity breaks down.
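The effect of a constant credibility factor can be checked numerically. With a factor in $[0, 1]$ that does not depend on $x$, the estimate is a convex combination of two nondecreasing functions and is therefore nondecreasing, whereas an $x$-dependent factor offers no such guarantee. The Python sketch below uses the AT column of Tables 2 and 3; the helper names are ours.

```python
# Thresholds and AT values taken from Tables 2 and 3
xs = [320, 800, 1000, 2000, 3000, 23800]
f_at = [0.0, 0.0, 0.0, 0.2518, 1.0, 1.0]                    # individual, AT
f_pool = [0.01256, 0.03372, 0.06920, 0.22117, 0.32113, 0.97070]
z_x = [0.0, 0.1727, 0.6020, 0.9669, 0.9340, 0.0]            # x-dependent factors
z_const = 0.43775                                           # factor free of x


def blend(f_ind, f_col, z):
    """Pointwise credibility estimate z * individual + (1 - z) * collective."""
    return [zi * fi + (1.0 - zi) * fc for fi, fc, zi in zip(f_ind, f_col, z)]


def is_nondecreasing(seq):
    return all(b >= a for a, b in zip(seq, seq[1:]))


cred_x = blend(f_at, f_pool, z_x)                       # as in Table 2
cred_const = blend(f_at, f_pool, [z_const] * len(xs))   # as in Table 3

print(is_nondecreasing(cred_x), is_nondecreasing(cred_const))
```

For AT, the $x$-dependent version dips between $x = 800$ and $x = 1000$, while the constant-factor version is nondecreasing and matches the AT column of Table 3.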
Remark 1.
Another way of obtaining monotonicity of the credibility-estimated distribution functions is to sort the resulting credibility estimates. In the relevant literature, there are also methods for extracting a monotone function from non-monotonic data. One such method is monotonic regression, which achieves monotonicity and smoothness of the regression by introducing a regularization term and solving a constrained optimization problem. Some key references are Friedman and Tibshirani (1984), Mukerjee (1988), Shively et al. (2009), and Zhang (2004). These approaches could similarly be applied to our model.
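Both ideas in this remark can be sketched in a few lines of Python. The first function simply sorts the estimates; the second applies plain least-squares isotonic regression via the pool-adjacent-violators algorithm, which is a simpler stand-in for the regularized monotone smoothers cited above and is not the method of those references. The input is the AT row of the credibility estimates in Table 2; the function names are ours.

```python
def monotone_by_sorting(values):
    """Rearrange the estimates in increasing order."""
    return sorted(values)


def monotone_by_pava(values):
    """Unweighted least-squares isotonic regression (pool adjacent violators)."""
    blocks = []                      # each block stores [sum, count]
    for v in values:
        blocks.append([v, 1])
        # Merge backwards while the previous block mean exceeds the last one
        while (len(blocks) > 1 and
               blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            s, c = blocks.pop()
            blocks[-1][0] += s
            blocks[-1][1] += c
    fitted = []
    for s, c in blocks:
        fitted.extend([s / c] * c)   # every point in a block gets the block mean
    return fitted


# AT credibility estimates at x = 320, 800, 1000, 2000, 3000, 23,800, 23,896, 23,897
cred_at = [0.01256, 0.02790, 0.02754, 0.25080, 0.95520, 0.97070, 0.97070, 1.0]

print(monotone_by_sorting(cred_at))
print(monotone_by_pava(cred_at))
```

Sorting swaps the two offending values, whereas pooling replaces them by their average and leaves the remaining points unchanged.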
For values of motor claims larger than $x$ = 23,800 and less than or equal to $x$ = 23,897 ($x$ = 23,897 is the maximum threshold of contract DE, which is the contract with the largest values of motor claims, as shown in Table 1), the values of the estimated credibility distribution $\hat{F}^{Cred}_{X_{ij}}(x \mid \Theta_j)$ remain the same up to the fifth decimal place. By letting $x$ > 23,897, the estimated credibility distribution goes to 1 (see Table 2).
Remark 2.
As in the Bühlmann and Straub (1970) model, $\hat{a}_{F_x}$ can be negative. This means that there is no detectable difference between the risks. In this case we set $\hat{a}_{F_x} = 0$, as in our cases for $x$ = 23,800, 23,896, 23,897.
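The consequence of this truncation is easy to see in code: once $\hat{a}_{F_x}$ is set to zero, the credibility factor vanishes and every contract receives the collective value. The Python sketch below assumes the Bühlmann and Straub form of the credibility factor; the function name, the negative raw estimate, and the weight are hypothetical, while the remaining figures come from the $x$ = 23,800 rows of Table 2.

```python
def credibility_factor(a_hat, s2_hat, weight):
    """Credibility factor with a negative variance estimate replaced by zero."""
    a_used = max(a_hat, 0.0)
    return a_used * weight / (a_used * weight + s2_hat)


# Hypothetical negative raw estimate and weight; s2 as tabulated for x = 23,800
z = credibility_factor(a_hat=-0.002, s2_hat=1610559.0, weight=1.0e6)

# DE at x = 23,800: individual value 0.9339, collective value 0.9707 (Table 2)
estimate = z * 0.9339 + (1.0 - z) * 0.9707
print(z, estimate)
```

The estimate equals the collective value 0.9707, which is what Table 2 reports for all contracts at $x$ = 23,800.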
Figure 1 displays the individual empirical distribution in each contract. Note that the red bullets indicate the corresponding credibility estimate at specific points presented in Table 2.

Credibility Coefficients for Motor Claims Data

In the following, we provide an intuitive interpretation of the form of the credibility distribution estimator given in Theorem 1 for motor claims by presenting the coefficients in Table 4, which were derived from the lower part of Table 2. These are: the between-risk coefficient of variation $BRV = \sqrt{\hat{a}_{F_x}} / \hat{F}_{nww}(x)$, which is a good measure of the heterogeneity of the portfolio (i.e., of the between-risk variability), and the average within-risk coefficient of variation $WRV = \hat{s}_{F_x} / \hat{F}_{nww}(x)$, which is a good measure of the within-risk variability. The smaller the credibility coefficient $CC = \hat{s}^2_{F_x} / \hat{a}_{F_x}$, the greater the credibility factor $\hat{Z}_j^{F_x}$.
Remark 3.
The results of Table 2 and Remark 2 for $x$ = 23,800, with $\hat{a}_{F_x} = 0$ and $\hat{s}^2_{F_x}$ = 61,020, imply that $BRV = 0$ and $CC = \infty$. Similarly, for $x$ = 23,900, $\hat{a}_{F_x} = 0$ and $\hat{s}^2_{F_x}$ = 1,736,923 imply that $BRV = 0$ and $CC = \infty$.

5.2. Example of Credibility Distribution Estimation with Financial Data

The dataset was created (see Fama and French (2022)) as follows: each NYSE, AMEX, and NASDAQ stock was assigned to an industry portfolio at the end of June of year $t$ based on its four-digit SIC code at that time. Compustat SIC codes were used for the fiscal year ending in calendar year $t - 1$. Whenever Compustat SIC codes were not available, CRSP SIC codes for June of year $t$ were used. Returns were then computed from July of year $t$ to June of year $t + 1$. The weights are the numbers of firms in the portfolios.
In particular, the portfolios are constructed with monthly returns from July 1926 to July 2022, and the dataset contains the return values for 10 industry portfolios. The credibility distribution for each of these portfolios needs to be estimated. We consider a random variable $X$ whose positive values represent a profit (P) and whose negative values represent a loss (L). The 10 industry portfolios are as follows:
(1) NoDur: Consumer non-durables (food, tobacco, textiles, apparel, leather, and toys).
(2) Durbl: Consumer durables (cars, TVs, furniture, household appliances).
(3) Manuf: Manufacturing (machinery, trucks, planes, chemicals, off-furn, and paper).
(4) Enrgy: Oil, gas, and coal extraction and products.
(5) HiTec: Business equipment (computers, software, and electronic equipment).
(6) Telcm: Telephone and television transmission.
(7) Shops: Wholesale, retail, and some services (laundries, repair shops).
(8) Hlth: Healthcare, medical equipment, and drugs.
(9) Utils: Utilities.
(10) Other: Other (mines, construction, building material, transportation, hotels, bus service, entertainment, and finance).
Table 5 provides some descriptive statistics of the (P/L) monthly returns of the 10 industry portfolios. The number of observations in each portfolio is $n = 1155$.
Table 6 illustrates the results of the credibility distribution function for the monthly returns of the 10 industry portfolios from July 1926 to July 2022. Specifically, the upper part of the table shows the individual empirical distribution $\hat{F}_{nw_j}(x)$ of the returns $X_{ij} \le x$ ($x = -15, -10, -5, 0, 10, 15, 34.17, 59, 60, 79.79$), and the corresponding credibility distribution estimators $\hat{F}^{Cred}_{X_{ij} \mid \Theta_j}(x \mid \Theta_j)$ are shown in the middle part of the table. The estimated credibility factors $\hat{Z}_j^{F_x}$, as well as the estimated parameters $\hat{F}_{nww}(x)$, $\hat{s}^2_{F_x}$, and $\hat{a}_{F_x}$, are presented in the lower part of Table 6. The monotonicity of the estimated distribution function is shown in Table 6. For returns larger than $x = 59$ and less than or equal to $x = 79.79$ ($x = 79.79$ is the maximum threshold of portfolio Durbl, which is the portfolio with the largest return values, as shown in Table 5), the values of the estimated credibility distribution $\hat{F}^{Cred}_{X_{ij}}(x \mid \Theta_j)$ remain the same up to the fifth decimal place. By letting $x > 79.79$, $\hat{F}^{Cred}_{X_{ij}}(x \mid \Theta_j)$ goes to 1 (see Table 6).
Figure 2 displays the individual empirical distribution in each contract. Again, note that the red bullets indicate the corresponding credibility estimate at specific points presented in Table 6.

Credibility Coefficients for Industry Portfolios Data

Here, we provide an intuitive interpretation of the form of the credibility distribution estimator for the monthly returns of the 10 industry portfolios by presenting the following credibility coefficients. Table 7 illustrates the between-risk coefficient of variation $BRV = \sqrt{\hat{a}_{F_x}} / \hat{F}_{nww}(x)$, the average within-risk coefficient of variation $WRV = \hat{s}_{F_x} / \hat{F}_{nww}(x)$, and the credibility coefficient $CC = \hat{s}^2_{F_x} / \hat{a}_{F_x}$ for the industry portfolio data.
Remark 4.
The results of Table 6 and Remark 2 for $x$ = 50, with $\hat{a}_{F_x} = 0$ and $\hat{s}^2_{F_x}$ = 0.0469, imply that $BRV = 0$ and $CC = \infty$.
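The three coefficients can be computed directly from the lower part of Table 6. The Python sketch below takes BRV as $\sqrt{\hat{a}_{F_x}}$ divided by the collective value and WRV as $\sqrt{\hat{s}^2_{F_x}}$ divided by the collective value, a reading that reproduces the $x = -15$ column of Table 7; the function name is ours, and a zero $\hat{a}_{F_x}$ is mapped to an infinite credibility coefficient as in Remark 4.

```python
import math


def credibility_coefficients(a_hat, s2_hat, f_collective):
    """Return (BRV, WRV, CC) for one threshold x."""
    brv = math.sqrt(a_hat) / f_collective        # between-risk variability
    wrv = math.sqrt(s2_hat) / f_collective       # within-risk variability
    cc = s2_hat / a_hat if a_hat > 0 else math.inf
    return brv, wrv, cc


# x = -15 in Table 6: a = 2.118e-5, s^2 = 2.637, collective value 0.00894
brv, wrv, cc = credibility_coefficients(2.118e-5, 2.637, 0.00894)
print(brv, wrv, cc)

# A threshold where a was set to zero (collective value used here is illustrative)
brv0, wrv0, cc0 = credibility_coefficients(0.0, 0.0469, 1.0)
print(brv0, cc0)
```

A large CC means that the within-risk variance dominates the between-risk variance, so the credibility factor is small and the estimate stays close to the collective value.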

5.3. Example of Credibility Distribution Estimation with Financial Grouped Data

The empirical distribution function for the grouped data is represented by the step function of the Fama and French (2022) data. The grouping (see Table 8) is a subjective element in this fit, and other analysts might choose different intervals. The total number of observations in each portfolio is the same ($m_{\cdot j} = 1155$).
Table 9 illustrates the results of the credibility distribution function for the monthly returns of the 10 industry portfolios from July 1926 to July 2022. Specifically, the upper part of the table shows the individual empirical distribution $\hat{F}_{m_j}(x)$ of returns $X_{ij} \le x$ ($x = -15, -10, -5, 0, 10, 15$), and the corresponding credibility distribution estimators $\hat{F}^{Cred}_{X_{ij} \mid \Theta_j}(x \mid \Theta_j)$ are shown in the middle part of the table. The estimated credibility factors $Z_j^{x}$, as well as the estimated parameters $\hat{F}_{mm}(x)$, $\hat{s}^2_{x}$, and $\hat{a}_{x}$, are presented in the lower part of Table 9. The monotonicity of the estimated distribution function is shown in Table 9, but the convergence to one of the estimated credibility distribution for grouped data should be further investigated.
Figure 3 displays the smoothed individual empirical distribution for grouped data in each contract. Again, the red bullets indicate the corresponding credibility estimate at specific points presented in Table 9.

Credibility Coefficients for Financial Grouped Data

Table 10 illustrates the between-risk coefficient of variation $BRV$, the average within-risk coefficient of variation $WRV$, and the credibility coefficient $CC$ for the industry portfolios of grouped data.

5.4. Example of the Classical Credibility Estimation with Financial Grouped Data

For grouped data, the previous approach gives credibility estimates of the distribution function at specific points. To derive the classical credibility estimate, we can assume a uniform distribution within each interval of returns and take the interval midpoints as the return values. The weights are the numbers of observations in each interval. Table 11 shows the individual average return $\hat{\mu}_j$ for the 10 industry portfolios and the credibility estimate of returns $\mu^{Cred}(\Theta_j)$ for these portfolios, along with the credibility factor $Z_j$ and the estimated parameters $\hat{\mu}$, $\hat{a}$, and $\hat{s}^2$.
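The midpoint step can be illustrated with the counts of Table 8. The Python sketch below computes the grouped average return of each portfolio as the count-weighted mean of the interval midpoints. The interval end points are read off Table 8, and the variable names are ours; the resulting values are grouped approximations and need not coincide with the figures reported in Table 11.

```python
# Interval end points of Table 8 (returns in percent)
boundaries = [-35, -20, -13, -6, -4, -1, 2, 8, 10, 22, 80]
portfolios = ["NoDur", "Durbl", "Manuf", "Enrgy", "HiTec",
              "Telcm", "Shops", "Hlth", "Utils", "Other"]
# counts[i][j]: number of observations of portfolio j in interval i (Table 8)
counts = [
    [4, 12, 6, 4, 10, 1, 6, 4, 3, 8],
    [2, 20, 19, 16, 21, 9, 13, 8, 13, 18],
    [60, 105, 84, 92, 108, 61, 73, 70, 64, 94],
    [58, 86, 67, 80, 77, 64, 78, 77, 68, 65],
    [199, 188, 195, 210, 181, 205, 197, 204, 200, 195],
    [383, 245, 273, 274, 234, 394, 300, 304, 351, 272],
    [406, 357, 434, 364, 380, 362, 388, 408, 391, 415],
    [20, 55, 30, 52, 58, 29, 52, 42, 28, 48],
    [20, 72, 41, 56, 81, 29, 43, 33, 33, 34],
    [3, 15, 6, 7, 5, 1, 5, 5, 4, 6],
]

# Midpoint of each interval, used as the representative return
midpoints = [(lo + hi) / 2.0 for lo, hi in zip(boundaries[:-1], boundaries[1:])]
# Total number of observations per portfolio (m_.j)
totals = [sum(row[j] for row in counts) for j in range(len(portfolios))]
# Grouped average return per portfolio
mu_hat = {
    name: sum(row[j] * c for row, c in zip(counts, midpoints)) / totals[j]
    for j, name in enumerate(portfolios)
}

for name in portfolios:
    print(name, round(mu_hat[name], 4))
```

These grouped means are the individual inputs to the classical credibility formula; the structural parameters would then be estimated from the same counts and midpoints.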

6. Concluding Remarks

The objective of this paper was to present a credibility distribution model that adequately describes insurance losses and that can be used for risk management purposes.
The main contribution of the paper is that it embeds the empirical distribution into credibility modeling in the form of the Bühlmann and Straub (1970) model. In the first part of the paper, we present the model of the weighted credibility distribution, and in the second part, a model that applies to data grouped in intervals.
With our models, we examine two datasets: one with motor claim amounts and the number of motor claims from 10 selected European countries during the period 2004–2018, and a second with monthly returns from July 1926 to July 2022 for 10 industry portfolios. To apply our credibility distribution model with grouped data, we grouped the second dataset (Fama/French financial data) into intervals of returns. Under this setting, the grouping is subjective, the weights are the numbers of observations within each interval, and the total weight of each portfolio is the same.
The monotonicity (or non-monotonicity) and the convergence to one of the estimated distribution functions are shown numerically in Table 2, Table 3, Table 6 and Table 9. From a theoretical point of view, the monotonicity and the convergence of the estimated distribution functions need further investigation. Furthermore, sufficient conditions for the asymptotic optimality of the empirical credibility distribution estimators could also be investigated in future work.

Funding

This research received no external funding.

Data Availability Statement

The datasets used in this study are available online at the following links: https://www.insuranceeurope.eu/statistics (accessed on 10 September 2022) and https://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html (accessed on 20 September 2022).

Acknowledgments

This work has been partly supported by the University of Piraeus Research Center. The author thanks the anonymous referees for their suggestions and comments that helped to improve the paper.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Bozikas, Apostolos, and Georgios Pitselis. 2020. Incorporating crossed classification credibility into the Lee–Carter model for multi-population mortality data. Insurance: Mathematics and Economics 93: 353–68.
  2. Bozikas, Apostolos, and Georgios Pitselis. 2021. Multi-population mortality modelling and forecasting: A hierarchical credibility regression approach. European Actuarial Journal 11: 231–67.
  3. Bühlmann, Hans. 1967. Experience rating and credibility. ASTIN Bulletin 4: 199–207.
  4. Bühlmann, Hans, and Erwin Straub. 1970. Glaubwürdigkeit für Schadensätze. Mitteilungen der Vereinigung Schweizerischer Versicherungsmathematiker 70: 111–33.
  5. Cai, Xiaoqiang, Limin Wen, Xianyi Wu, and Xian Zhou. 2015. Credibility Estimation of distribution functions with applications to experience rating in general insurance. North American Actuarial Journal 19: 311–35.
  6. Christiansen, Marcus, and Edo Schinzinger. 2016. A Credibility Approach for Combining Likelihoods Generalized Linear Models. Astin Bulletin 46: 531–69.
  7. Denuit, Michel. 2008. Comonotonic approximations to quantiles of life annuity conditional expected present value. Insurance: Mathematics and Economics 42: 831–38.
  8. De Vylder, Etienne F. 1976. Geometrical Credibility. Scandinavian Actuarial Journal 3: 121–49.
  9. De Vylder, Etienne F. 1978. Parameter Estimation in Credibility Theory. ASTIN Bulletin 10: 99–112.
  10. De Vylder, Etienne F. 1996. Advanced Risk Theory-A Self-Contained Introduction. Brussels: Editions de L’Universite de Bruxelles.
  11. Fama, Eugene F., and Kenneth R. French. 2022. CRSP Data. Available online: https://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html (accessed on 20 September 2022).
  12. Ferguson, Thomas. 1973. A Bayesian analysis of some non-parametric problems. Annals of Statistics 1: 209–30.
  13. Friedman, Jerome, and Robert Tibshirani. 1984. The monotone smoothing of scatterplots. Technometrics 26: 243–50.
  14. Gebizlioglu, Omer L., and Banu Yagci. 2008. Tolerance intervals for quantiles of bivariate risks and risk measurement. Insurance: Mathematics and Economics 42: 1022–27.
  15. Gong, Yikai (Maxwell), Zhuangdi Li, Maria Milazzo, Kristen Moore, and Matthew Provencher. 2018. Credibility methods for individual life insurance. Risks 6: 144.
  16. Hachemeister, Charles A. 1975. Credibility for regression models with application to trend. In Credibility, Theory and Applications. Edited by P. Kahn. New York: Academic Press, Inc., pp. 307–48.
  17. Insurance Europe. 2022. Available online: https://www.insuranceeurope.eu/statistics (accessed on 10 September 2022).
  18. Jewell, William S. 1974a. Credible means are exact Bayesian for exponential families. ASTIN Bulletin 8: 77–90.
  19. Jewell, William S. 1974b. The Credible distribution. ASTIN Bulletin 7: 237–69.
  20. Kim, Joseph H.T., and Yongho Jeon. 2013. Credibility theory based on trimming. Insurance: Mathematics and Economics 53: 46–57.
  21. Kim, Minwoo, Himchan Jeong, and Dipak Dey. 2022. Approximation of Zero-Inflated Poisson Credibility Premium via Variational Bayes Approach. Risks 10: 54.
  22. Klugman, Stuart A., Harry Panjer, and Gordon E. Willmot. 2012. Loss Models: From Data to Decisions. New York: Wiley.
  23. Korwar, Ramesh M., and Myles Hollander. 1973. Contributions to the theory of Dirichlet processes. Annals of Statistics 1: 705–11.
  24. Kudryavtsev, Andrey. 2009. Using quantile regression for rate-making. Insurance: Mathematics and Economics 45: 296–304.
  25. Landsman, Zinoviy. 1996. Sample quantiles and additive statistics: Information, sufficiency, estimation. Journal of Statistical Planning and Inference 52: 93–108.
  26. Landsman, Zinoviy M., and Udi E. Makov. 1998. Exponential dispersion models and credibility. Scandinavian Actuarial Journal 1: 89–96.
  27. Landsman, Zinoviy M., and Udi E. Makov. 1999. Credibility evaluations for exponential dispersion families. Insurance: Mathematics and Economics 24: 33–9.
  28. Makov, Udi E., Adrian F. M. Smith, and Yu H. Liu. 1996. Bayesian methods in actuarial science. Journal of the Royal Statistical Society Series D 45: 503–15.
  29. Mukerjee, Hari. 1988. Monotone nonparametric regression. Annals of Statistics 16: 741–50.
  30. Pitselis, Georgios. 2009. Solvency Supervision based on a total balance sheet approach. Journal of Computational and Applied Mathematics 233: 83–96.
  31. Pitselis, Georgios. 2013. Quantile credibility models. Insurance: Mathematics and Economics 52: 477–89.
  32. Pitselis, Georgios. 2017. Risk measures in a quantile regression credibility framework with Fama/French data applications. Insurance: Mathematics and Economics 74: 122–34.
  33. Pitt, David G. W. 2006. Regression quantile analysis of claim termination rates for income protection insurance. Annals of Actuarial Science 1: 345–57.
  34. Shively, Thomas S., Thomas W. Sager, and Stephen G. Walker. 2009. A Bayesian Approach to Non-Parametric Monotone Function Estimation. Journal of the Royal Statistical Society Series B: Statistical Methodology 71: 159–75.
  35. Tsai, Cary Chi-Liang, and Adelaide Di Wu. 2020. Bühlmann credibility-based approaches to modelling mortality rates for multiple populations. North American Actuarial Journal 24: 290–315.
  36. Tsai, Cary Chi-Liang, and Tzuling Lin. 2017. Incorporating the Bühlmann credibility into mortality models to improve forecasting performances. Scandinavian Actuarial Journal 5: 419–40.
  37. Tsai, Cary Chi-Liang, and Ying Zhang. 2019. A multi-dimensional Bühlmann credibility approach to modelling multi-population mortality rates. Scandinavian Actuarial Journal 5: 406–31.
  38. Wang, Wei, Limin Wen, Zhixin Yang, and Quan Yuan. 2021. Quantile Credibility Models with Common Effects. Risks 8: 100.
  39. Xacur, Oscar Alberto Quijano, and José Garrido. 2018. Bayesian credibility for GLMs. Insurance: Mathematics and Economics 83: 180–89.
  40. Yan, Yujie, and Kai-Sheng Song. 2022. A general optimal approach to Bühlmann credibility theory. Insurance: Mathematics and Economics 104: 262–82.
  41. Youn, Ahn Jae, Himchan Jeong, and Yang Lu. 2021. On the ordering of credibility factors. Insurance: Mathematics and Economics 101: 626–38.
  42. Zehnwirth, Benjamin. 1981. A Note on the Asymptotic Optimality of the Empirical Bayes Distribution Function. Annals of Statistics 9: 221–24.
  43. Zhang, Jin-Ting. 2004. A simple and efficient monotone smoother using smoothing splines. Journal of Nonparametric Statistics 16: 779–96.
Figure 1. Individual empirical distribution and credibility distribution point estimates (in red) for motor claims per contract.
Figure 2. Individual empirical distribution and credibility distribution point estimates (in red) for industry portfolios per contract.
Figure 3. Individual empirical distribution and credibility distribution point estimates (in red) for grouped data per contract.
Table 1. Summary statistics for 10 selected European countries.
Motor claims and number of motor claims for the years 2004–2018

$X_{ij}$: Motor claims amount in millions for the years $i = 1, \dots, 15$ and countries $j = 1, \dots, 10$

| Country | AT | DE | FI | GR | HR | IT | NO | PL | PT | SE |
|---|---|---|---|---|---|---|---|---|---|---|
| Min. | 1923 | 18,789 | 692 | 438 | 207 | 12,791 | 1062 | 1582 | 987 | 311 |
| 1st Qu. | 1989 | 19,322 | 803 | 530 | 226 | 12,968 | 1205 | 1935 | 1187 | 912 |
| Median | 2032 | 20,222 | 914 | 978 | 248 | 15,239 | 1396 | 2219 | 1261 | 987 |
| Mean | 2104 | 20,692 | 909 | 840 | 256 | 15,112 | 1359 | 2322 | 1218 | 1098 |
| 3rd Qu. | 2201 | 21,828 | 1024 | 1083 | 288 | 16,492 | 1514 | 2620 | 1282 | 1504 |
| Max. | 2430 | 23,897 | 1141 | 1224 | 330 | 18,210 | 1637 | 3235 | 1362 | 1670 |

$w_{ij}$: Weights (number of motor claims)

| Country | AT | DE | FI | GR | HR | IT | NO | PL | PT | SE |
|---|---|---|---|---|---|---|---|---|---|---|
| Min. | 1,177,269 | 8,673,000 | 368,898 | 403,604 | 170,205 | 3,389,677 | 562,981 | 1,307,003 | 643,713 | 890,304 |
| 1st Qu. | 1,238,392 | 9,002,000 | 420,122 | 427,265 | 199,179 | 3,467,180 | 659,063 | 1,465,112 | 712,819 | 1,043,110 |
| Median | 1,279,586 | 9,247,000 | 497,201 | 474,875 | 204,421 | 4,541,671 | 758,814 | 1,749,483 | 837,694 | 1,098,411 |
| Mean | 1,282,093 | 9,220,067 | 493,392 | 475,977 | 208,084 | 4,317,710 | 728,372 | 2,177,715 | 800,425 | 1,111,064 |
| 3rd Qu. | 1,323,949 | 9,425,500 | 573,942 | 515,984 | 218,964 | 5,026,480 | 783,394 | 1,891,870 | 876,323 | 1,168,650 |
| Max. | 1,396,250 | 9,750,000 | 641,513 | 580,985 | 238,904 | 5,249,558 | 908,663 | 4,515,087 | 959,781 | 1,328,331 |

Note: $X_{ij}$ are the average claims per year and $w_{ij}$ represents the number of motor claims that correspond to each $X_{ij}$.
Table 2. Credibility distribution estimation for motor claims.
Motor claim amounts and number of motor claims from 10 selected European countries during the period 2004–2018

Individual empirical distribution with claim amount $X_{ij} \le x$ ($x$ = 320, 800, 1000, 2000, 3000, 23,800, 23,896, 23,897)

| Country | AT | DE | FI | GR | HR | IT | NO | PL | PT | SE |
|---|---|---|---|---|---|---|---|---|---|---|
| $\hat{F}_{nw_j}(320)$ | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.9235 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0624 |
| $\hat{F}_{nw_j}(800)$ | 0.0000 | 0.0000 | 0.1584 | 0.4206 | 1.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.1940 |
| $\hat{F}_{nw_j}(1000)$ | 0.0000 | 0.0000 | 0.5286 | 0.5501 | 1.0000 | 0.0000 | 0.0000 | 0.0000 | 0.1428 | 0.5359 |
| $\hat{F}_{nw_j}(2000)$ | 0.2518 | 0.0000 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | 1.0000 | 0.2128 | 1.0000 | 1.0000 |
| $\hat{F}_{nw_j}(3000)$ | 1.0000 | 0.0000 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | 1.0000 | 0.7278 | 1.0000 | 1.0000 |
| $\hat{F}_{nw_j}(23{,}800)$ | 1.0000 | 0.9339 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| $\hat{F}_{nw_j}(23{,}896)$ | 1.0000 | 0.9339 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| $\hat{F}_{nw_j}(23{,}897)$ | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |

Credibility distribution estimation with claim amount $X_{ij} \le x$ ($x$ = 320, 800, 1000, 2000, 3000, 23,800, 23,896, 23,897)

| Country | AT | DE | FI | GR | HR | IT | NO | PL | PT | SE |
|---|---|---|---|---|---|---|---|---|---|---|
| $\hat{F}^{Cred}_{X_{ij}}(320 \mid \Theta_j)$ | 0.01256 | 0.01256 | 0.01256 | 0.01256 | 0.01256 | 0.01256 | 0.01256 | 0.01256 | 0.01256 | 0.01256 |
| $\hat{F}^{Cred}_{X_{ij}}(800 \mid \Theta_j)$ | 0.02790 | 0.01348 | 0.04299 | 0.06155 | 0.06539 | 0.01980 | 0.03015 | 0.02490 | 0.02983 | 0.05827 |
| $\hat{F}^{Cred}_{X_{ij}}(1000 \mid \Theta_j)$ | 0.02754 | 0.00583 | 0.23820 | 0.24210 | 0.25270 | 0.01136 | 0.03722 | 0.01939 | 0.10500 | 0.33390 |
| $\hat{F}^{Cred}_{X_{ij}}(2000 \mid \Theta_j)$ | 0.25080 | 0.00105 | 0.93630 | 0.93420 | 0.86420 | 0.00223 | 0.95570 | 0.21300 | 0.95950 | 0.97040 |
| $\hat{F}^{Cred}_{X_{ij}}(3000 \mid \Theta_j)$ | 0.95520 | 0.00313 | 0.89460 | 0.89140 | 0.79400 | 0.00660 | 0.92490 | 0.71150 | 0.93090 | 0.94880 |
| $\hat{F}^{Cred}_{X_{ij}}(23{,}800 \mid \Theta_j)$ | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 |
| $\hat{F}^{Cred}_{X_{ij}}(23{,}896 \mid \Theta_j)$ | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 | 0.9707 |
| $\hat{F}^{Cred}_{X_{ij}}(23{,}897 \mid \Theta_j)$ | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |

Credibility factor, $X_{ij} \le x$ ($x$ = 320, 800, 1000, 2000, 3000, 23,800, 23,896, 23,897)

| Parameter | $\hat{Z}_1^{F_x}$ | $\hat{Z}_2^{F_x}$ | $\hat{Z}_3^{F_x}$ | $\hat{Z}_4^{F_x}$ | $\hat{Z}_5^{F_x}$ | $\hat{Z}_6^{F_x}$ | $\hat{Z}_7^{F_x}$ | $\hat{Z}_8^{F_x}$ | $\hat{Z}_9^{F_x}$ | $\hat{Z}_{10}^{F_x}$ |
|---|---|---|---|---|---|---|---|---|---|---|
| $x$ = 320 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| $x$ = 800 | 0.1727 | 0.6002 | 0.0744 | 0.0719 | 0.0328 | 0.4128 | 0.1060 | 0.2618 | 0.1153 | 0.1532 |
| $x$ = 1000 | 0.6020 | 0.9158 | 0.3679 | 0.3596 | 0.1971 | 0.8359 | 0.4622 | 0.7198 | 0.4857 | 0.5672 |
| $x$ = 2000 | 0.9669 | 0.9953 | 0.9182 | 0.9155 | 0.8256 | 0.9899 | 0.9431 | 0.9802 | 0.9479 | 0.9619 |
| $x$ = 3000 | 0.9340 | 0.9903 | 0.8448 | 0.8400 | 0.6965 | 0.9794 | 0.8893 | 0.9600 | 0.8983 | 0.9246 |
| $x$ = 23,800 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| $x$ = 23,896 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| $x$ = 23,897 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |

Parameter estimation, $X_{ij} \le x$ ($x$ = 320, 800, 1000, 2000, 3000, 23,800, 23,896, 23,897)

| $x$ | $\hat{F}_{nww}(x)$ | $\hat{a}_{F_x}$ | $\hat{s}^2_{F_x}$ |
|---|---|---|---|
| 320 | 0.01256 | 0.00000 | 456,695 |
| 800 | 0.03372 | 0.00412 | 379,951 |
| 1000 | 0.06920 | 0.02147 | 273,007 |
| 2000 | 0.22117 | 0.09854 | 64,969 |
| 3000 | 0.32113 | 0.14983 | 203,743 |
| 23,800 | 0.97070 | 0.00000 | 1,610,559 |
| 23,896 | 0.97070 | 0.00000 | 1,610,559 |
| 23,897 | 1.00000 | 0.00000 | 1,736,923 |
Table 3. Credibility distribution estimation for motor claims.
Motor claim amounts and number of motor claims from 10 selected European countries during the period 2004–2018, $Z^F$ free of $x$

Individual empirical distribution with claim amount $X_{ij} \le x$ ($x$ = 320, 800, 1000, 2000, 3000, 23,800, 23,896, 23,897)

| Country | AT | DE | FI | GR | HR | IT | NO | PL | PT | SE |
|---|---|---|---|---|---|---|---|---|---|---|
| $\hat{F}_{nw_j}(320)$ | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.9235 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0624 |
| $\hat{F}_{nw_j}(800)$ | 0.0000 | 0.0000 | 0.1584 | 0.4206 | 1.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.1940 |
| $\hat{F}_{nw_j}(1000)$ | 0.0000 | 0.0000 | 0.5286 | 0.5501 | 1.0000 | 0.0000 | 0.0000 | 0.0000 | 0.1428 | 0.5359 |
| $\hat{F}_{nw_j}(2000)$ | 0.2518 | 0.0000 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | 1.0000 | 0.2128 | 1.0000 | 1.0000 |
| $\hat{F}_{nw_j}(3000)$ | 1.0000 | 0.0000 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | 1.0000 | 0.7278 | 1.0000 | 1.0000 |
| $\hat{F}_{nw_j}(23{,}800)$ | 1.0000 | 0.9339 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| $\hat{F}_{nw_j}(23{,}896)$ | 1.0000 | 0.9339 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| $\hat{F}_{nw_j}(23{,}897)$ | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |

Credibility distribution estimation with claim amount $X_{ij} \le x$ ($x$ = 320, 800, 1000, 2000, 3000, 23,800, 23,896, 23,897)

| Country | AT | DE | FI | GR | HR | IT | NO | PL | PT | SE |
|---|---|---|---|---|---|---|---|---|---|---|
| $\hat{F}^{Cred}_{X_{ij}}(320 \mid \Theta_j)$ | 0.00706 | 0.00190 | 0.00966 | 0.00974 | 0.11476 | 0.00347 | 0.00871 | 0.00541 | 0.00845 | 0.03262 |
| $\hat{F}^{Cred}_{X_{ij}}(800 \mid \Theta_j)$ | 0.01896 | 0.00511 | 0.06246 | 0.12047 | 0.14212 | 0.00931 | 0.02338 | 0.01452 | 0.02269 | 0.09829 |
| $\hat{F}^{Cred}_{X_{ij}}(1000 \mid \Theta_j)$ | 0.03891 | 0.01049 | 0.17511 | 0.17703 | 0.17362 | 0.01911 | 0.04798 | 0.02979 | 0.09327 | 0.25723 |
| $\hat{F}^{Cred}_{X_{ij}}(2000 \mid \Theta_j)$ | 0.23458 | 0.03352 | 0.40072 | 0.39581 | 0.30854 | 0.06106 | 0.46002 | 0.21640 | 0.47592 | 0.53495 |
| $\hat{F}^{Cred}_{X_{ij}}(3000 \mid \Theta_j)$ | 0.61831 | 0.04866 | 0.477641 | 0.47336 | 0.39729 | 0.08866 | 0.52932 | 0.55269 | 0.54317 | 0.59463 |
| $\hat{F}^{Cred}_{X_{ij}}(23{,}800 \mid \Theta_j)$ | 0.98352 | 0.93947 | 0.97745 | 0.97727 | 0.97398 | 0.99191 | 0.97968 | 0.98738 | 0.98028 | 0.98250 |
| $\hat{F}^{Cred}_{X_{ij}}(23{,}896 \mid \Theta_j)$ | 0.98352 | 0.93947 | 0.97745 | 0.97727 | 0.97398 | 0.99191 | 0.97968 | 0.98738 | 0.98028 | 0.98250 |
| $\hat{F}^{Cred}_{X_{ij}}(23{,}897 \mid \Theta_j)$ | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |

$\hat{F}_{nww}(320) = 0.01256$, $\hat{F}_{nww}(800) = 0.03372$, $\hat{F}_{nww}(1000) = 0.06920$, $\hat{F}_{nww}(2000) = 0.22117$, $\hat{F}_{nww}(3000) = 0.32113$, $\hat{F}_{nww}(23{,}800) = 0.97070$, $\hat{F}_{nww}(23{,}896) = 0.97070$, $\hat{F}_{nww}(23{,}897) = 1.000$

Parameter estimation free of $x$: $\hat{a}_F = 0.008457228$, $\hat{s}^2_F = 208898.6$

| Credibility factor | $\hat{Z}_1^F$ | $\hat{Z}_2^F$ | $\hat{Z}_3^F$ | $\hat{Z}_4^F$ | $\hat{Z}_5^F$ | $\hat{Z}_6^F$ | $\hat{Z}_7^F$ | $\hat{Z}_8^F$ | $\hat{Z}_9^F$ | $\hat{Z}_{10}^F$ |
|---|---|---|---|---|---|---|---|---|---|---|
| $Z_j^F$ free of $x$ | 0.43775 | 0.84846 | 0.23054 | 0.22423 | 0.11218 | 0.72391 | 0.30667 | 0.56942 | 0.32708 | 0.40288 |
Table 4. Credibility coefficients.

| $x$ | 320 | 800 | 1000 | 2000 | 23,800 | 23,900 |
|---|---|---|---|---|---|---|
| WRV | 23,262.8543 | 5787.8587 | 3718.7216 | 1152.5669 | 254.4523 | 1317.9237 |
| BRV | 8.448499 | 4.140172 | 3.170754 | 1.976505 | 0 | 0 |
| CC | 7,581,705 | 1,954,336 | 1,375,506 | 340,045 | $\infty$ | $\infty$ |
Table 5. Summary statistics for 10 industry portfolios.
Monthly returns and number of firms in portfolios from July 1926–July 2022

$X_{ij}$: Value returns for $i = 1, \dots, 1155$, $j = 1, \dots, 10$

| Portfolio | NoDur | Durbl | Manuf | Enrgy | HiTec | Telcm | Shops | Hlth | Utils | Other |
|---|---|---|---|---|---|---|---|---|---|---|
| Min. | −24.6900 | −34.800 | −29.820 | −34.490 | −33.870 | −21.5600 | −30.240 | −34.080 | −33.0500 | −30.0200 |
| 1st Qu. | −1.3900 | −2.855 | −2.000 | −2.470 | −2.600 | −1.5200 | −2.030 | −1.920 | −1.6850 | −2.0750 |
| Median | 1.0800 | 0.980 | 1.350 | 0.890 | 1.320 | 0.9000 | 1.090 | 1.100 | 1.0500 | 1.2800 |
| Mean | 0.9524 | 1.158 | 1.002 | 1.027 | 1.116 | 0.8198 | 1.014 | 1.072 | 0.8738 | 0.9013 |
| 3rd Qu. | 3.6450 | 4.845 | 4.235 | 4.590 | 5.030 | 3.2400 | 4.090 | 4.060 | 3.6200 | 4.1850 |
| Max. | 34.1700 | 79.790 | 57.200 | 38.990 | 53.490 | 28.1700 | 42.450 | 37.130 | 43.4600 | 58.6700 |

$w_{ij}$: Weight (number of firms in portfolios) for $i = 1, \dots, 1155$, $j = 1, \dots, 10$

| Portfolio | NoDur | Durbl | Manuf | Enrgy | HiTec | Telcm | Shops | Hlth | Utils | Other |
|---|---|---|---|---|---|---|---|---|---|---|
| Min. | 87.0 | 37.0 | 125.0 | 45.0 | 18.0 | 4.00 | 41.0 | 4.0 | 21.0 | 110.0 |
| 1st Qu. | 136.0 | 56.0 | 313.0 | 55.0 | 44.0 | 8.00 | 84.0 | 18.0 | 72.0 | 156.0 |
| Median | 173.0 | 92.0 | 449.0 | 116.0 | 358.0 | 41.00 | 276.0 | 122.0 | 102.0 | 1002.0 |
| Mean | 230.2 | 101.2 | 507.7 | 131.1 | 428.1 | 53.76 | 298.4 | 237.2 | 106.9 | 887.5 |
| 3rd Qu. | 334.0 | 148.0 | 772.5 | 173.0 | 797.5 | 99.00 | 472.5 | 509.0 | 179.5 | 1619.0 |
| Max. | 547.0 | 213.0 | 967.0 | 404.0 | 1465.0 | 189.00 | 823.0 | 868.0 | 204.0 | 2249.0 |

Note: $X_{ij}$ denotes the values of monthly returns per year and $w_{ij}$ represents the number of firms in portfolios that correspond to each $X_{ij}$.
Table 6. Credibility distribution estimation for industry portfolios.
Table 6. Credibility distribution estimation for industry portfolios.
Monthly returns for 10 industry portfolios from July 1926–July 2022
Individual empirical distribution with returns X i j x ,   ( x = 15 , 10 , 5 , 0 , 10 , 15 , 34.17 , 59 , 60 , > 79.79 )
PortfoliosNoDurDurblManufEnrgyHiTecTelcmShopsHlthUtilsOther
F ^ n w j ( 15 ) 0.003090.016700.007450.010000.018400.004980.006300.002130.001770.00952
F ^ n w j ( 10 ) 0.021130.044860.026800.040530.057380.028310.021710.010050.014720.03030
F ^ n w j ( 5 ) 0.072670.138720.107600.121740.162530.106110.101990.105530.071470.11525
F ^ n w j ( 0 ) 0.385870.429460.408250.417810.438660.390400.398270.379400.390130.40579
F ^ n w j ( 10 ) 0.976180.933030.964680.950400.907790.974790.959210.976280.984550.96935
F ^ n w j ( 15 ) 0.995000.977000.993000.982000.962000.998000.995000.997000.996000.99200
F ^ n w j ( 34.17 ) 0.999590.996760.998960.999290.999851.000000.999800.999960.999820.99974
F ^ n w j ( 59 ) 1.0000000.99961.000001.000001.000001.000001.000001.000001.000001.00000
F ^ n w j ( 60 ) 1.0000000.99961.000001.000001.000001.000001.000001.000001.000001.00000
F ^ n w j ( x > 79.79 ) 1.0000001.000001.000001.000001.000001.000001.000001.000001.000001.00000
Credibility distribution estimation with returns X i j x ,   ( x = 15 , 10 , 5 , 0 , 10 , 15 , 34.17 , 59 , 60 , > 79.79 )
PortfoliosNoDurDurblManufEnrgyHiTecTelcmShopsHlthUtilsOther
F ^ X i j C r e d ( 15 Θ j ) 0.004960.012700.007710.009520.016500.007620.007000.004260.005370.00946
F ^ X i j C r e d ( 10 Θ j ) 0.022600.040730.027110.038130.055010.029390.022830.013230.019260.03032
F ^ X i j C r e d ( 5 Θ j ) 0.079190.131600.108100.120000.158200.109800.103600.104800.083710.11520
F ^ X i j C r e d ( 0 Θ j ) 0.401000.408000.405000.407000.414000.404000.403000.400000.404000.40600
F ^ X i j C r e d ( 10 Θ j ) 0.975000.936000.965000.949000.910000.971000.959000.975000.981000.96900
F ^ X i j C r e d ( 15 Θ j ) 0.994400.978900.992800.982800.963200.995300.994600.996300.994700.99190
F ^ X i j C r e d ( 34.17 Θ j ) 0.999520.999510.999510.999520.999530.999520.999520.999520.999520.99953
F ^ X i j C r e d ( 59 Θ j ) 0.999990.999990.999990.999990.999990.999990.999990.999990.999990.99999
F ^ X i j C r e d ( 60 Θ j ) 0.999990.999990.999990.999990.999990.999990.999990.999990.999990.99999
F ^ X i j C r e d ( x > 79.79 Θ j ) 1.0000001.000001.000001.000001.000001.000001.000001.000001.000001.00000
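Credibility factor with returns $X_{ij} \le x$, ($x = -15, -10, -5, 0, 10, 15, 34.17, 59, 60, > 79.79$)

| Parameter | $\hat{Z}_1^{F_x}$ | $\hat{Z}_2^{F_x}$ | $\hat{Z}_3^{F_x}$ | $\hat{Z}_4^{F_x}$ | $\hat{Z}_5^{F_x}$ | $\hat{Z}_6^{F_x}$ | $\hat{Z}_7^{F_x}$ | $\hat{Z}_8^{F_x}$ | $\hat{Z}_9^{F_x}$ | $\hat{Z}_{10}^{F_x}$ |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| $x = -15$ | 0.68105 | 0.48412 | 0.82488 | 0.54873 | 0.79887 | 0.33276 | 0.73462 | 0.68754 | 0.49796 | 0.89169 |
| $x = -10$ | 0.84475 | 0.70514 | 0.92310 | 0.75602 | 0.91008 | 0.55964 | 0.87584 | 0.84866 | 0.71652 | 0.95451 |
| $x = -5$ | 0.84567 | 0.70659 | 0.92359 | 0.75731 | 0.91066 | 0.56137 | 0.87660 | 0.84955 | 0.71794 | 0.95481 |
| $x = 0$ | 0.20416 | 0.10132 | 0.36138 | 0.12747 | 0.32303 | 0.05653 | 0.24956 | 0.20908 | 0.10648 | 0.49727 |
| $x = 10$ | 0.92860 | 0.85111 | 0.96632 | 0.88105 | 0.96031 | 0.75234 | 0.94401 | 0.93057 | 0.85799 | 0.98045 |
| $x = 15$ | 0.91814 | 0.83134 | 0.96115 | 0.86463 | 0.95426 | 0.72373 | 0.93565 | 0.92037 | 0.83897 | 0.97740 |
| $x = 34.17$ | 0.09671 | 0.04494 | 0.19106 | 0.05747 | 0.16608 | 0.02440 | 0.12188 | 0.09937 | 0.04738 | 0.29220 |
| $x = 59$ | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| $x = 60$ | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| $x > 79.79$ | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |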
Parameter estimation $X_{ij} \le x$, ($x = -15, -10, -5, 0, 10, 15, 34.17, 59, 60, > 79.79$)

| $x$ | $\hat{F}_{nww}(x)$ | $\hat{a}_{F_x}$ | $\hat{s}^2_{F_x}$ |
| --- | --- | --- | --- |
| $x = -15$ | 0.00894 | $2.118 \times 10^{-5}$ | 2.637 |
| $x = -10$ | 0.03077 | 0.00018110 | 8.848 |
| $x = -5$ | 0.11460 | 0.00062020 | 30.09 |
| $x = 0$ | 0.40530 | $6.935 \times 10^{-5}$ | 71.87 |
| $x = 10$ | 0.95820 | 0.00057730 | 11.80 |
| $x = 15$ | 0.98810 | 0.00014610 | 3.463 |
| $x = 34.17$ | 0.99952 | $5.731 \times 10^{-8}$ | 0.1423 |
| $x = 59$ | 0.99999 | 0.00000000 | 0.00424 |
| $x = 60$ | 0.99999 | 0.00000000 | 0.00424 |
| $x > 79.79$ | 1.00000 | 0.00000000 | 0.00000 |
Table 7. Credibility coefficients.
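Credibility Coefficients for Industry Portfolios

| $x$ | −15 | −10 | −5 | 0 | 10 | 15 | 50 |
| --- | --- | --- | --- | --- | --- | --- | --- |
| WRV | 181.642532 | 96.670744 | 47.865927 | 20.916895 | 3.584964 | 1.883325 | 21.656408 |
| BRV | 0.51478450 | 0.43735262 | 0.21731078 | 0.02054692 | 0.02507521 | 0.01223275 | 0 |
| CC | 124504.25 | 48856.99 | 48516.61 | 1036337.42 | 20439.98 | 23702.94 | |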
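
The CC row above appears to equal the ratio $\hat{s}^2_{F_x} / \hat{a}_{F_x}$ of the weighted-data parameter estimates, which is the usual credibility coefficient in a Bühlmann-Straub setting. This reading is an inference from the printed numbers, not a definition stated in the tables. The following minimal Python sketch checks it for the six points where both tables report values; the function and variable names are hypothetical, and the exponents printed as "× 10 5" are taken to be negative powers of ten.

```python
# Parameter estimates (a_hat, s2_hat) for the weighted data, copied from the
# "Parameter estimation" block; exponents are assumed to be negative.
params = {
    -15: (2.118e-5, 2.637),
    -10: (0.0001811, 8.848),
    -5: (0.0006202, 30.09),
    0: (6.935e-5, 71.87),
    10: (0.0005773, 11.80),
    15: (0.0001461, 3.463),
}

# CC row of the credibility-coefficient table for the same points.
cc_table = {
    -15: 124504.25,
    -10: 48856.99,
    -5: 48516.61,
    0: 1036337.42,
    10: 20439.98,
    15: 23702.94,
}


def credibility_coefficient(a_hat, s2_hat):
    """Ratio s^2 / a (assumed definition of CC)."""
    return s2_hat / a_hat


# Relative difference between the recomputed ratio and the tabulated CC.
rel_errors = {}
for x, (a_hat, s2_hat) in params.items():
    cc = credibility_coefficient(a_hat, s2_hat)
    rel_errors[x] = abs(cc - cc_table[x]) / cc_table[x]
    print(f"x = {x:>4}: s2/a = {cc:.2f}, table CC = {cc_table[x]:.2f}")
```

The recomputed ratios differ from the tabulated CC values only through the rounding of the printed parameter estimates.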
Table 8. Credibility distribution estimation for grouped data.
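10 Industry Portfolios
Grouped Monthly Returns in 10 Intervals from July 1926 to July 2022
$m_{ij}$: Number of data points in the interval for each portfolio

| Interval of return | NoDur | Durbl | Manuf | Enrgy | HiTec | Telcm | Shops | Hlth | Utils | Other |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| $-35 < X_{ij} \le -20$ | 4 | 12 | 6 | 4 | 10 | 1 | 6 | 4 | 3 | 8 |
| $-20 < X_{ij} \le -13$ | 2 | 20 | 19 | 16 | 21 | 9 | 13 | 8 | 13 | 18 |
| $-13 < X_{ij} \le -6$ | 60 | 105 | 84 | 92 | 108 | 61 | 73 | 70 | 64 | 94 |
| $-6 < X_{ij} \le -4$ | 58 | 86 | 67 | 80 | 77 | 64 | 78 | 77 | 68 | 65 |
| $-4 < X_{ij} \le -1$ | 199 | 188 | 195 | 210 | 181 | 205 | 197 | 204 | 200 | 195 |
| $-1 < X_{ij} \le 2$ | 383 | 245 | 273 | 274 | 234 | 394 | 300 | 304 | 351 | 272 |
| $2 < X_{ij} \le 8$ | 406 | 357 | 434 | 364 | 380 | 362 | 388 | 408 | 391 | 415 |
| $8 < X_{ij} \le 10$ | 20 | 55 | 30 | 52 | 58 | 29 | 52 | 42 | 28 | 48 |
| $10 < X_{ij} \le 22$ | 20 | 72 | 41 | 56 | 81 | 29 | 43 | 33 | 33 | 34 |
| $22 < X_{ij} \le 80$ | 3 | 15 | 6 | 7 | 5 | 1 | 5 | 5 | 4 | 6 |
| Total # of observations | 1155 | 1155 | 1155 | 1155 | 1155 | 1155 | 1155 | 1155 | 1155 | 1155 |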
Table 9. Grouped credibility distribution estimation.
Grouped monthly returns in 10 intervals for 10 industry portfolios, from July 1926 to July 2022

Individual empirical distribution with returns $X_{ij} \le x$, ($x = -15, -10, -5, 0, 10, 15$)

| Portfolios | NoDur | Durbl | Manuf | Enrgy | HiTec | Telcm | Shops | Hlth | Utils | Other |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| $\hat{F}_{mj}(-15)$ | 0.00470 | 0.02276 | 0.01694 | 0.01336 | 0.02165 | 0.00643 | 0.01323 | 0.00841 | 0.01064 | 0.01800 |
| $\hat{F}_{mj}(-10)$ | 0.02746 | 0.06667 | 0.05281 | 0.05145 | 0.06691 | 0.03129 | 0.04354 | 0.03636 | 0.03760 | 0.05739 |
| $\hat{F}_{mj}(-5)$ | 0.08225 | 0.15580 | 0.12340 | 0.13160 | 0.15370 | 0.08918 | 0.11340 | 0.10430 | 0.09870 | 0.13200 |
| $\hat{F}_{mj}(0)$ | 0.39020 | 0.42660 | 0.40000 | 0.42710 | 0.41130 | 0.40810 | 0.40430 | 0.40200 | 0.40260 | 0.40750 |
| $\hat{F}_{mj}(10)$ | 0.98009 | 0.92468 | 0.95931 | 0.94545 | 0.92554 | 0.97403 | 0.95844 | 0.96710 | 0.96797 | 0.96537 |
| $\hat{F}_{mj}(15)$ | 0.98730 | 0.95065 | 0.97410 | 0.96566 | 0.95476 | 0.98449 | 0.97395 | 0.97900 | 0.97987 | 0.97763 |
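Credibility distribution estimation with returns $X_{ij} \le x$, ($x = -15, -10, -5, 0, 10, 15$)

| Portfolios | NoDur | Durbl | Manuf | Enrgy | HiTec | Telcm | Shops | Hlth | Utils | Other |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| $\hat{F}^{Cred}_{X_{ij}}(-15 \mid \Theta_j)$ | 0.00472 | 0.02270 | 0.01690 | 0.01340 | 0.02160 | 0.00645 | 0.01320 | 0.00842 | 0.01060 | 0.01810 |
| $\hat{F}^{Cred}_{X_{ij}}(-10 \mid \Theta_j)$ | 0.02764 | 0.06649 | 0.05276 | 0.05141 | 0.06673 | 0.03143 | 0.04357 | 0.03646 | 0.03769 | 0.0573 |
| $\hat{F}^{Cred}_{X_{ij}}(-5 \mid \Theta_j)$ | 0.08268 | 0.15540 | 0.12330 | 0.13140 | 0.15330 | 0.08953 | 0.11350 | 0.10450 | 0.09893 | 0.13180 |
| $\hat{F}^{Cred}_{X_{ij}}(0 \mid \Theta_j)$ | 0.39200 | 0.42470 | 0.40080 | 0.42520 | 0.41100 | 0.40810 | 0.40470 | 0.40260 | 0.40310 | 0.40705 |
| $\hat{F}^{Cred}_{X_{ij}}(10 \mid \Theta_j)$ | 0.97986 | 0.92499 | 0.95929 | 0.94556 | 0.92584 | 0.97386 | 0.95842 | 0.96700 | 0.96786 | 0.96529 |
| $\hat{F}^{Cred}_{X_{ij}}(15 \mid \Theta_j)$ | 0.98717 | 0.95084 | 0.97409 | 0.96572 | 0.95492 | 0.98439 | 0.97394 | 0.97895 | 0.97981 | 0.97759 |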
Parameter estimation $X_{ij} \le x$, ($x = -15, -10, -5, 0, 10, 15$)

| $x$ | $\hat{F}_{mm}(x)$ | $\hat{a}_x$ | $\hat{s}^2_x$ | $\hat{Z}_j^x$ |
| --- | --- | --- | --- | --- |
| $x = -15$ | 0.013617 | $3.8391 \times 10^{-5}$ | $7.9089 \times 10^{-7}$ | 0.997944 |
| $x = -10$ | 0.047149 | 0.000196743 | $1.7862 \times 10^{-5}$ | 0.991002 |
| $x = -5$ | 0.011843 | 0.0006369784 | $7.6405 \times 10^{-5}$ | 0.950530 |
| $x = 0$ | 0.407970 | 0.000118016 | $0.1327 \times 10^{-3}$ | 0.9881473 |
| $x = 10$ | 0.956798 | 0.000362038 | $3.5504 \times 10^{-5}$ | 0.990288 |
| $x = 15$ | 0.972741 | 0.000146344 | $1.2874 \times 10^{-5}$ | 0.9912793 |
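
To make the link between the three blocks of this table concrete, the sketch below combines the individual empirical distribution, the collective value and the credibility factor at $x = -15$. It assumes the standard linear credibility form $\hat{F}^{Cred} = \hat{Z}\,\hat{F}_{mj} + (1 - \hat{Z})\,\hat{F}_{mm}$, which is consistent with the tabulated figures but is written here as an assumption; the function name `credibility_cdf` is hypothetical.

```python
def credibility_cdf(f_individual, f_collective, z):
    """Credibility-weighted distribution value at a fixed point x.

    Assumed form: Z * individual + (1 - Z) * collective.
    """
    return z * f_individual + (1.0 - z) * f_collective


portfolios = ["NoDur", "Durbl", "Manuf", "Enrgy", "HiTec",
              "Telcm", "Shops", "Hlth", "Utils", "Other"]

# Individual empirical distribution values at x = -15 (grouped data).
f_individual = [0.00470, 0.02276, 0.01694, 0.01336, 0.02165,
                0.00643, 0.01323, 0.00841, 0.01064, 0.01800]

f_collective = 0.013617  # collective estimate at x = -15
z_hat = 0.997944         # credibility factor at x = -15

cred = {
    name: credibility_cdf(f, f_collective, z_hat)
    for name, f in zip(portfolios, f_individual)
}

for name, value in cred.items():
    print(f"{name}: {value:.5f}")
```

For NoDur this gives roughly 0.00472, in line with the tabulated credibility estimate. Because the credibility factor is close to 1, the estimates stay very near the individual empirical values.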
Table 10. Credibility coefficients.
Credibility Coefficients for Industry Portfolios for Grouped Data

| $x$ | −15 | −10 | −5 | 0 | 10 | 15 |
| --- | --- | --- | --- | --- | --- | --- |
| WRV | 0.06530954 | 0.08963808 | 0.11209926 | 0.05720951 | 0.00622752 | 0.00368866 |
| BRV | 0.45502292 | 0.29749328 | 0.15540177 | 0.09426977 | 0.01988643 | 0.01243626 |
| CC | 0.02060092 | 0.09078849 | 0.52034766 | 0.36829137 | 0.09806570 | 0.08797441 |
Table 11. Classical credibility model for grouped data.
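Individual Average Return for the 10 Industry Portfolios

| Portfolios | NoDur | Durbl | Manuf | Enrgy | HiTec | Telcm | Shops | Hlth | Utils | Other |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| $\hat{\mu}_j$ | 1.15022 | 1.52554 | 1.20217 | 1.18052 | 1.34199 | 1.01862 | 1.27099 | 1.25368 | 1.12597 | 1.06277 |

Credibility estimation of returns for the 10 industry portfolios

| Portfolios | NoDur | Durbl | Manuf | Enrgy | HiTec | Telcm | Shops | Hlth | Utils | Other |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| $\mu(\Theta_j)^{Cred}$ | 1.18166 | 1.36974 | 1.20769 | 1.19685 | 1.27776 | 1.11571 | 1.24219 | 1.23351 | 1.16951 | 1.13784 |

Credibility parameter estimation

$\hat{\mu} = 1.213247$, $\hat{a} = 4.807552$, $\hat{s}^2 = 5528.034$, $Z_j = 0.501114$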
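
The classical credibility figures above can be retraced from the printed parameter estimates. The sketch below assumes the Bühlmann form $Z = n\hat{a} / (n\hat{a} + \hat{s}^2)$ with $n = 1155$ monthly observations per portfolio (the total reported for the grouped returns), and the linear estimator $Z\hat{\mu}_j + (1 - Z)\hat{\mu}$. Both are assumptions that happen to reproduce the printed $Z_j$; the names `buhlmann_factor` and `n_obs` are hypothetical.

```python
n_obs = 1155        # observations per portfolio (assumed equal for all j)
mu_hat = 1.213247   # collective mean return
a_hat = 4.807552    # between-portfolio variance estimate
s2_hat = 5528.034   # within-portfolio variance estimate


def buhlmann_factor(n, a, s2):
    """Credibility factor Z = n*a / (n*a + s2) (assumed classical form)."""
    return n * a / (n * a + s2)


z = buhlmann_factor(n_obs, a_hat, s2_hat)

# Individual average returns by portfolio.
individual_means = {
    "NoDur": 1.15022, "Durbl": 1.52554, "Manuf": 1.20217, "Enrgy": 1.18052,
    "HiTec": 1.34199, "Telcm": 1.01862, "Shops": 1.27099, "Hlth": 1.25368,
    "Utils": 1.12597, "Other": 1.06277,
}

# Credibility estimate: shrink each individual mean toward the collective mean.
cred_means = {
    name: z * mean_j + (1.0 - z) * mu_hat
    for name, mean_j in individual_means.items()
}

print(f"Z = {z:.6f}")
for name, value in cred_means.items():
    print(f"{name}: {value:.5f}")
```

With a factor near 0.5, each credibility estimate lies about halfway between the portfolio's own average return and the collective mean.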
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pitselis, G. Credibility Distribution Estimation with Weighted or Grouped Observations. Risks 2024, 12, 10. https://doi.org/10.3390/risks12010010
