1. Introduction
While high dimensional data and its validation is important for many machine learning experiments, it is also true that many machine learning applications combine mathematical statistical methods with prior knowledge. The difficulty is to include this prior knowledge to upgrade the statistical analysis without violating the fundamental principles of mathematical statistics. These kinds of applications are omnipresent in insurance reserving, which often cite the original paper of Bornhuetter and Ferguson from 1972. This combination of prior knowledge and mathematical statistics is the purpose of this paper, where we are able to make it while sticking to the classical maximum likelihood technique of mathematical statistics.
The chain ladder method is the basic actuarial tool for reserving in general insurance. This method is based on the paid run-off triangle and provides estimates for the ultimate reserve along with development factors that are used for determining cash flow. In practice, the actuary usually adjusts the ultimates using additionally available information. With the
Bornhuetter et al. (
1972) method the chain ladder ultimates are adjusted using prior knowledge while the adjusted cash flow is proportional to the original chain ladder cash flow.
Mack (
2000) gave a credibility interpretation of the Bornhuetter–Ferguson method.
The adjustment of the ultimates can be done in two ways. Either by correcting the levels of the ultimates or the relative levels of the ultimates. By this, we distinguish between the situation where the actuary has an estimate for the ultimate for a given policy year and the situation where the actuary is more comfortable with the forecast that the ultimate for a given policy year is 10% higher, say, than in the previous year. Such an estimate could, for instance, come from chain ladder analysis of incurred data. Indeed, we provide an empirical illustration where this is the case. The levels approach is most common in the literature; see for instance (
Mack 2000,
2006),
Taylor (
2000),
Verrall (
2004),
Wüthrich and Merz (
2008) and
Heberle and Thomas (
2016). The relative levels approach is more recent; see (
Martínez-Miranda et al. 2013,
2015).
There are potentially two concerns with the traditional Bornhuetter–Ferguson correction. It may move the reserves too much, and the cash flow distribution is not adjusted in light of the external information.
Verrall (
2004) addressed this in a Bayesian setup and
Mack (
2006) proposed an alternative approach where new weights are computed by combining actual payments and the externally estimated reserves.
Our proposal is related to that of
Mack (
2006), but with weights derived from a likelihood function. Adjusting relative ultimates as opposed to level ultimates is natural when working with the likelihood function in the same way as traditional chain ladder development factors are concerned with relative effects. A feature of our approach is, therefore, that external information is linked directly to the parameters of the underlying Poisson model and it is possible to express the Bornhuetter–Ferguson adjustment in terms of adjustments to the development factors. Another feature of this approach is that we can evaluate how much the adjustment moves the reserves and establish inequalities relating our approach and the traditional Bornhuetter–Ferguson adjustments.
A fundamental interpretation of the Bornhuetter–Ferguson method arises when combining chain ladder with credibility formulas. Credibility formulas have been investigated in reserving by, for instance,
de Vylder (
1982),
Mack (
2000). and more recently,
Bühlmann and Moriconi (
2015). We have been particularly influenced by
Mack (
2000), who gives a credibility formula showing that adjusting the ultimates with prior knowledge yields a partial adjustment of the reserves. He then continues to show that the iterations of the credibility formula leads to the
Benktander (
1976) approach. These ideas are taken a step further by
Gigante et al. (
2013), whereas
Taylor (
2000) and
Wüthrich and Merz (
2008) give general overviews of the Bornhuetter–Ferguson method. Our first contribution is to show that the credibility formula also applies when adjusting the relative levels of the ultimates.
It is useful to recall that the chain ladder method has the nice interpretation as maximum likelihood in a Poisson model.
Kremer (
1985) (see also
Mack 1991) showed that the chain ladder forecasts are maximum likelihood. These forecasts are the product of observed accident year row sums and functions of the development factors; see (
4).
Renshaw and Verrall (
1998) showed that the development factors themselves are maximum likelihood estimators in a conditional Poisson model conditioning on row sums, while
Kuang et al. (
2009) showed that they are also maximum likelihod in the unconditional Poisson model. The maximum likelihood result means that it is possible to compute the chain ladder estimates using generalised linear model methods. In practice the Poisson assumption is not realistic as the paid data typically have considerable over-dispersion; see for instance
England and Verrall (
2002). Nonetheless, the chain ladder method provides good reserve estimates that are, at least, anchored in a quasi-likelihood.
The main idea of our approach is to impose the externally estimated relative ultimates on the Poisson likelihood. Initially, it is useful to work with the standard parametrisation of the generalised linear model as opposed to the development factors. We can then formulate the relative ultimates’ constraint as a linear constraint on the parameters and derive maximum likelihood estimators. Subsequently, we translate these estimators into adjusted development factors.
The constrained maximum likelihood approach satisfies a monotonicity result. If, for instance, all the relative ultimates are increased relative to the chain ladder ultimates, then it follows that the reserves are increased. However, these new reserves increase less than the traditional Bornhuetter–Ferguson reserves that would arise by combining the adjusted relative ultimates with the chain ladder development factors.
In this paper we focus on classical mathematical statistics through the maximum likelihood method. Recent work in reserving has emerged in the literature using modern machine learning techniques.
Kuo (
2019) proposes deep neural networks to joint modelling paid data and total claims outstanding, claiming that no manual input is required during model updates or forecasting. Additionally, using neural networks
Gabrielli and Wüthrich (
2018) develop a “stochastic simulation machine” that generates individual claims histories of non-life insurance claims. Another individual claims approach but based on the classification and regression trees is suggested by
De Felice and Moriconi (
2019).
Chukhrova and Johannssen (
2017) describe a state space model for cumulative payments (that extend the chain ladder method) in combination with the Kalman filter.
The rest of the paper is organised as follows. In
Section 2 we first describe the chain ladder forecasts in terms of the development factors and certain weights. Using this formulation we describe two Bornhuetter–Ferguson approaches in the literature: the interpretation offered by
Mack (
2000) that uses levels of ultimates, and the approach of
Martínez-Miranda et al. (
2013) using relative ultimates. From these two approaches the future cash flow is not influenced by the external information. We then present our proposed Bornhuetter–Ferguson reserves which are determined by a Poisson likelihood constrained by the external information. The formal derivation of our proposal is provided in
Section 3 in the generalised, lineal model framework. Our proposal is derived as the solution of a constrained maximum likelihood approach, where the constraint is given by the imposed external information. Later, in
Section 3.5 we show that the approach by
Martínez-Miranda et al. (
2013) is a mixed approach which combines unconstrained and constrained maximum likelihood estimators. Reserve forecasts and cash flow are described from our proposal, the mixed approach and the traditional chain ladder method. Only our proposed cash flow is affect by the external information, which is explicitly shown in terms of some pseudo development factors introduced in
Section 3.6. In
Section 4 we illustrate our proposal using a motor portfolio from a Greek insurer. These data include both paid and incurred triangles. In addition, an external estimate of the reserve is available so that this example nicely illustrates the practical issues that lead to the use of the Bornhuetter–Ferguson method. Conclusions and final remarks are provided in
Section 5.
3. Generalised Linear Model Framework
We present a Generalised Linear Model framework for Bornhuetter–Ferguson analysis. The usual chain ladder estimators are maximum likelihood in a Poisson model; see
Kremer (
1985). In practice, reserving data have considerable over-dispersion; see
England and Verrall (
2002), so that Poisson likelihood becomes a quasi likelihood. In the present paper this distinction is not so important as we will only be concerned with point forecasts. Now, if we maximise the likelihood while imposing constraints from external relative levels of ultimates, we get a closed form cash flow forecast that adapts to both data and the imposed constraints.
Next we describe the unconstrained and constrained Poisson likelihood. The first one provides the chain ladder forecasts without external information in
Section 3.2, and the second one provides our proposed forecasts in
Section 3.3. The approach of
Martínez-Miranda et al. (
2013) is shown in
Section 3.5 as a mixed approach that combines constrained and unconstrained maximum likelihood estimates. Our proposed forecasts have an equivalent expression involving new column-wise development factors provided in
Section 3.6. These development factors will be different for different accident years. For this reason we refer to them as pseudo development factors. This kind of formulation is also possible for the mixed approach, but with the standard chain ladder development factors, as we show in
Section 3.7. A monotonicity result provided in
Section 3.8 gives some insight about the effect that the imposed external information has on our proposed forecasts and those from the mixed approach.
3.1. Statistical Model
We assume that the incremental observations
are independent Poissons with log expectation
, where the predictor is given by
Here
is the level of the accident year effect,
is the level of the development year effect and
is an overall level. The parametrisation presented in (
23) does not identify the distribution, so we switch to the invariant parametrisation of
Kuang et al. (
2009); that is,
with the convention that empty sums are zero. Here
is the relative accident year effect and
is the relative development year effect, while the overall level is determined by
. The Poisson log likelihood function is
This is a regular exponential family with canonical parameters
.
3.2. The Chain Ladder
The chain ladder arises by maximising the unconstrained likelihood. Theorem 3 in
Kuang et al. (
2009) shows that the maximum likelihood estimators are
When inserting these estimators into Equation (
23) we get estimators
. In turn, the relative ultimates are estimated by
which are the relative ultimates entering in Equation (
16). It is convenient to define
, as this says that the ultimate for first accident year equals the claims observed. With this definition, we find that
is the maximum likelihood estimator for the expected ultimates
for the first accident year. In turn, the maximum likelihood estimators for the ultimate levels satisfy
Now, insert the expression for
in (
26) to get
which are the ultimates in (
6). Thus, in both cases the ultimate formulas are closely linked to the estimated relative accident year effects
.
An additional result from Theorem 3 in
Kuang et al. (
2009) is that the forward factors
and
can be viewed as maximum likelihood estimators for certain combinations of the canonical parameters
and
, respectively. These combinations are, for
,
with the convention that empty sums are zero. The development factors are the corresponding maximum likelihood estimators; that is,
and
.
3.3. Imposing External Information on the Relative Ultimates
Suppose some external values are available for the relative ultimates,
. Equivalently, we have external values for the relative accident year effects
; that is,
We could impose these as a constraint on the likelihood (
25). The constraint is linear and the likelihood remains that of a regular exponential family.
The constrained maximum likelihood estimators have a simple analytical form. In line with the parameters
defined in (
32), define
We then have the following result, which is proven in the
Appendix A.
Theorem 1. Consider the Poisson likelihood (25) with known for and define as (32), computed using . The constrained maximum likelihood estimator is unique if and only if for all and given by As a consequence, the out-of-sample forecast from the constrained chain ladder has a simple explicit form, as shown in the following result, which is proven in the
Appendix A. The result resembles the forecast in the unrestricted chain ladder computed from column sums and row-wise development factors as described in (
11).
Theorem 2. Consider the setup in Theorem 1. Point forecasts for the lower triangle are given by We can now compute a Bornhuetter–Ferguson reserve based on Theorem 2. For each accident year we get
In the case where we impose external relative ultimates, the above expressions reduce to those presented previously in (
22). In the above expression the notation reflects that the external information is concerned with the relative accident year parameters
. Now, suppose the relative ultimates
are taken as given. We then apply the Formula (
30) to get cumulated relative accident parameters
Inserting this in the expression (
34) for
in (
34) gives
which is the expression for
in (
21). Since
we see that the point forecast
in (
37) equals the point forecast
in (
22). In turn the reserve
in (
38) equals the reserve
in (
22).
3.4. Implementation in GLM Software
The constrained model can also be estimated using ready-made algorithms for generalised linear models. The analysis presented above shows that the constrained model is a regular exponential family so the algorithms should perform well.
For the implementation we organise the triangle
Y as a vector
, say, of dimension
. A design matrix
can be constructed from the formula (
24). It has dimension
and the row corresponding to entry
is given by
where the indicator function
takes the value unity if
and zero otherwise. The unrestricted model is then estimated through a generalised linear model regression of
on
using the Poisson distribution with a log-link function.
In the constrained model the parameters are known. Deleting the corresponding columns from gives a design matrix with k columns. The deleted columns are collected as , say. The model is then estimated as a generalised linear model regression of on using the Poisson distribution with a log-link function and offset given by .
3.5. A Mixed Approach
By now we have two maximum likelihood approaches: the classical chain ladder and the restricted maximum likelihood approach derived above. These give different point forecasts for the lower triangle. A third type of point forecast arises from the Bornhuetter–Ferguson double chain ladder (BDCL) method in
Martínez-Miranda et al. (
2013). In the following it is shown how the three are connected.
Let us first summarise the results we obtained so far in terms of the log likelihood. In the classical chain ladder approach, we maximise the unrestricted likelihood in (
25), which leads to the unrestricted estimator
The restricted likelihood from
Section 3.3 with restriction
has a restricted likelihood maximum likelihood estimator given by
Notice, that if , then and .
A third estimator is achieved by mixing the above estimators. This combines the unrestricted estimators for
and
with the given
, such that
In the following, parameters resulting from this mixed approach will be marked with the index “‡,” just as parameters resulting from the constrained method will be marked with “†.” The forecast for future payments computed from
is
In the case when the known accident parameters are derived by applying chain ladder on the incurred data, such that
, this method gives exactly the same results as the Bornhuetter–Ferguson double chain ladder (BDCL) method in
Martínez-Miranda et al. (
2013).
The log likelihood function evaluated in the three points satisfies
The first inequality holds since is maximum likelihood, while is restricted maximum likelihood. The second inequality holds since satisfies the restriction, but it is not maximum likelihood.
3.6. Pseudo Development Factors
It is common practice to think about the classical chain ladder method in terms of row sums
and column wise development factors
given in (
1) and (
3). For the restricted maximum likelihood approach there are no natural development factors in a maximum likelihood sense. Since development factors are important in daily actuarial work it is of interest to develop pseudo-development factors that keep the chain ladder pattern.
In the classical chain ladder, the forecasts for the lower triangle are computed using the Formula (
11) by forwarding the row sums
using the factors
. However, in this classical setting the predicted value for the row sum equals the row sum. In the likelihood analysis, this stems from a likelihood equation of the type
; see Equation (
20) in
Kuang et al. (
2009). Thus, we can also interpret the chain ladder forecast as forwarding the predicted row sums.
Once we have imposed external information on the relative ultimates, then the forecast changes and we break the link to the original row sums and development factors. We can, however, construct pseudo forecasts of the row sums and pseudo forward factors that satisfy a relationship like (
11) but with the new forecasts.
Under the constraint that
we compute estimates
and
using (
35) and (
36) in Theorem 1. From these we compute pseudo forward factors from (
32); that is,
and a pseudo first row sum from (
28) as
and then the remaining pseudo row sums from (
26) as
We show in the
Appendix A that the forecast from (
37) can be computed as
The above formulas for predicted reserve and the cash flow can also be written in the credibility format we saw in (
17) and (
18). To see this introduce the weights
where, as before,
. Introducing the ultimates and relative ultimates
we can then write the predicted reserve and cash flow as
3.7. Chain Ladder Forecasts with the Mixed Approach
In the mixed approach we follow a similar procedure to satisfy a relationship like (
11) in order to obtain the new forecasts. The difference to the constrained method is that we can keep the forward factors from the unconstrained chain ladder model,
. However, we need to construct pseudo row sums
as follows.
We fix the pseudo first row sum as
and then compute the remaining pseudo row sums from (
26) as
We show in the
Appendix A that the forecast from (
44) can be computed as
The forecast can be written in terms of weights, as before. Since the cash flow is derived from the chain ladder development factors, the weights are as defined in (
7) and (
8). In particular we have the ultimates and relative ultimates
We can then write the forecast of future payments and the cash flow as
3.8. Monotonicity
The idea of the Bornhuetter–Ferguson approach is to first compute the chain ladder, and then adjust it by imposing values for the ultimates. This is a quite complicated approach and it is not immediately clear what the effect is. However, when all adjustments are in the same direction it is actually possible to show a monotonicity result for the effect of the Bornhuetter–Ferguson adjustment.
Let us consider the case when the known accident parameters,
, are bigger than the accident parameters we obtain from the chain ladder method on paid data,
. The following theorem, proved in the
Appendix A, shows monotonicity results regarding the remaining parameters given in Theorem 1, and the resulting forecasts we obtain from the constrained approach in
Section 3.3,
, and the mixed approach in
Section 3.5,
.
Theorem 3. Suppose for all . Then,
- (a)
for all ;
- (b)
for all ;
- (c)
;
- (d)
for all so that ;
- (e)
for all ;
- (f)
for all ;
- (g)
for all .
To interpret this, suppose all imposed relative ultimates
are larger than the chain ladder forecasts of the relative ultimates
. Suppose also that the imposed relative ultimates are taken from incurred estimates as in the Bornhuetter–Ferguson double chain ladder (BDCL) method in
Martínez-Miranda et al. (
2013). We then get that the point forecasts for the lower triangle are ordered so that the Bornhuetter–Ferguson double chain ladder forecast
is larger than the Bornhuetter–Ferguson-restricted maximum likelihood forecast
, which is larger than the chain ladder forecast
. This will be the situation in the empirical illustration in
Section 4.
4. Empirical Illustration
We illustrate the new methods by an example where the external knowledge comes from incurred payments. In practice, the external knowledge may also come from incurred counts, from other business lines or from other sources.
We used data from a Greek non-life insurer for motor third party liability, aggregated over bodily injury and property damage. The data are presented as cumulative run-off triangles for accident years from 2005 to 2013.
Table 1 shows payments, while
Table 2 shows incurred amounts.
Table 3 shows parameter estimates for the paid data computed using the chain ladder and the Bornhuetter–Ferguson constrained model. For the moment we focus on the canonical parameters
for the relative accident year effect,
for the relative development year effect and
for the overall level. First, the chain ladder estimates are reported as
,
and
. Second, for the constrained model we first applied chain ladder to the incurred data. The estimates for the relative accident year effect are reported as
. The estimates
and
were then computed from the paid data using Theorem 1. We note that the ordering
applies for these data for all
. Thus, the monotonicity results from Theorem 3 apply. In particular, we see that
for all
and
in
Table 3.
A third approach is to use the mixed approach outlined in
Section 3.5. Here we use the external estimate
for the relative accident year effects along with the chain ladder estimates
and
. When the external estimate is based on the incurred data, as in here, this is the same as the Bornhuetter–Ferguson double chain ladder (BDCL) approach of
Martínez-Miranda et al. (
2013).
Table 4 presents the estimated (pseudo) forward factors and the (pseudo) row sums. For the chain ladder, we have the observed row sums
and the traditional forward factors
computed by (
1) and (
3). For the Bornhuetter–Ferguson constrained model we have the pseudo row sums
and the pseudo forward factors
computed by (
46)–(
48). For the mixed approach we have the pseudo row sums
computed by (
50) and (
51) and the traditional forward factors
. Once again we see that the monotonicity results from Theorem 3 apply so that
and
.
Table 5 shows the reserves resulting from the classical chain ladder method,
from (
5); the constrained approach,
from (
38); and the mixed approach,
from (
53). We see that the ordering from Theorem 3 applies. For comparison we note that this portfolio was evaluated at 137 million by an external actuary, with the comment that this figure may be slightly too low. This valuation is based on the information that since 2009, the case reserves incurred were gradually increased, but the gap between incurred and paid reserves was not fully closed as of 2014. In light of this, the Bornhuetter–Ferguson constrained method appears to apply rather well in this situation.
5. Conclusions
The paper introduces a Bornhuetter–Ferguson approach that replaces the relative ultimates rather than levels of ultimates. This approach has been suggested in the Bornhuetter–Ferguson double chain ladder (BDCL) method in
Martínez-Miranda et al. (
2013). The traditional Bornhuetter–Ferguson method uses chain ladder weights, whereas we have estimated weights.
We made use of the fact that the chain ladder method has a nice interpretation as maximum likelihood in a Poisson model, and we formulated the relative ultimates constraint as a linear constraint on the parameters and derived maximum likelihood estimators. Furthermore, we followed this approach to reproduce the results of the BDCL method in a mixed approach, combining the constrained method with the classical chain ladder.
Monotonicity results compare the constrained method, the mixed approach and the original chain ladder results. An example illustrates the mentioned results with data from a Greek general insurer. The example shows that, when comparing all methods mentioned above, including chain ladder, the reserve given by the constrained method is in fact the closest estimate to the number given by an external expert.
Our proposal incorporates prior knowledge in a transparent way, keeping the standard principles of maximum likelihood and its well known mathematical properties. In this sense we recommend our approach over traditional Bornuetter-Ferguson adjustments as a formal statistical method for the same purpose, which keeps the simplicity and the intuition of traditional reserving. This is further shown in the convenient formulation of the forecasts in terms of the pseudo development factors provided above. Apart from this, one would also benefit from the practical advantages of using maximum likelihood that include standard inference and distribution forecasting. This cannot be done with such a level of formality in the classical approach, while for the BDCL method it has been done using intense bootstrap techniques. Another advantage of our approach is that it can, unlike the BDCL method, be applied using only one triangle, usually the payments triangle. On the other hand, this has the disadvantage of not being able to distinguish between IBNR and RBNS reserves as the BDCL method does. Another limitation of our proposal is that it cannot handle negative cells, as it is sometimes the case in the payments triangle. Further refinements are required to deal with this problem.
An outstanding problem is to provide distribution forecasts of Bornhuetter–Ferguson adjusted reserves. In practice the data will have considerable over-dispersion. By modelling that we could complement the point forecasts with distribution forecasts. Recently,
Harnau and Nielsen (
2018) developed an asymptotic distribution theory for the chain ladder within an over-dispersed Poisson framework. The present situation is a special case of their setup so it could potentially be extended with Bornhuetter–Ferguson adjustments. That was beyond the scope of this paper though.
Finally, because there is a full statistical model specification incorporating prior knowledge, one could implement the same type of cash-flow data validation as in
Agbeko et al. (
2014), based on back-testing (see also
De Felice and Moriconi 2019). However, this approach has several drawbacks, more so for small datasets. Controversy also exits about which error criteria should be considered. We did not consider empirical validation in this paper and focused on theoretical statistical properties when comparing reserving methods under the generalised linear models framework. A recent discussion on empirical validation methods in reserving can be found in
Matinek (
2019). These can be potentially used in the context of this paper.