Understanding Reporting Delay in General Insurance

Verrall, Richard J.; Wüthrich, Mario V.

doi:10.3390/risks4030025

Open AccessFeature PaperArticle

Understanding Reporting Delay in General Insurance

by

Richard J. Verrall

^1,† and

Mario V. Wüthrich

^2,3,*,†

¹

Cass Business School, City University London, 106 Bunhill Row, London EC1Y 8T2, UK

²

ETH Zurich, RiskLab, Department of Mathematics, Zurich 8092, Switzerland

³

Swiss Finance Institute, Walchestrasse 9, Zurich CH-8006, Switzerland

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Risks 2016, 4(3), 25; https://doi.org/10.3390/risks4030025

Submission received: 9 February 2016 / Revised: 10 June 2016 / Accepted: 29 June 2016 / Published: 8 July 2016

(This article belongs to the Special Issue Non-Life Insurance Mathematics beyond Risk Theory: Pricing and Claims Reserving)

Download

Browse Figures

Versions Notes

Abstract

:

The aim of this paper is to understand and to model claims arrival and reporting delay in general insurance. We calibrate two real individual claims data sets to the statistical model of Jewell and Norberg. One data set considers property insurance and the other one casualty insurance. For our analysis we slightly relax the model assumptions of Jewell allowing for non-stationarity so that the model is able to cope with trends and with seasonal patterns. The performance of our individual claims data prediction is compared to the prediction based on aggregate data using the Poisson chain-ladder method.

Keywords:

claims arrival; reporting delay function; individual claims data; granular claims data; claims reserving; Poisson chain-ladder method

Graphical Abstract

1. Introduction

The aim of this paper is to understand the reporting delay function of general insurance claims. The reporting delay function is an important building block in claims reserving. We study this reporting delay function in continuous time based on real individual claims data. The data is used to calibrate a non-stationary version of the statistical model considered in Jewell [1,2]. This calibration then provides an estimation of the number of incurred but not yet reported (IBNYR) claims. The second building block in claims reserving is the modeling of the cost process of insurance claims. Jewell [1] stated in 1989: “Currently, the development of a good model for cost evolution over continuous time appears to require a long-term research effort, one that we believe will use the basic understanding of the event generation and reporting processes developed here, but will require much additional empirical effort to develop an understanding of cost-generating mechanisms and their evolution over time.” Meanwhile, there have been some improvements into this direction, see Bühlmann et al. [3], Arjas [4], Norberg [5,6], Haastrup-Arjas [7], Taylor [8,9], Herbst [10], Larsen [11], Taylor et al. [12], Jessen et al. [13], Rosenlund [14], Pigeon et al. [15], Agbeko et al. [16], Antonio-Plat [17] and Badescu et al. [18,19]. But we believe that state-of-the-art modeling is still far from having a good statistical model that can be used in daily industry practice. This may partly be explained by the fact that it is rather difficult to get access to real individual claims data.

In this paper we study individual claims data of two different portfolios: a property insurance portfolio and a casualty insurance portfolio. We choose explicit distributional models for the individual claims arrival process modeling and we calibrate these models dynamically to the data. The calibration is back-tested against the observations and compared to the (Poisson) chain-ladder method (which is applied to aggregated data). The main conclusion is that the chain-ladder method has a good performance as long as the claims process is stationary, but in non-stationary environments our individual claims estimation approach clearly outperforms the chain-ladder method.

In the next two sections we introduce the underlying statistical model and we describe the available claims data. In Section 4 we model seasonality of the claims arrival process. In Section 5 we calibrate the reporting delay function and underpin this by statistical analysis. Finally, in Section 6 we compare the individual claims modeling estimate to the classical chain-ladder method on aggregated data and we back-test the results of these two approaches. The figures and the proofs are deferred to the appendix.

2. Individual Claims Arrival Modeling

We extend the model considered in Jewell [1,2] to an inhomogeneous marked Poisson point process. We define

Λ (t) \geq 0

to be the instantaneous claims frequency and

w (t) \geq 0

to be the instantaneous exposure at time

t \geq 0

. The total exposure

W_{Λ}

on time interval

(0, τ_{m}]

is given by

W_{Λ} = \int_{0}^{τ_{m}} w (t) Λ (t) d t

(1)

We consider the run-off situation after time

τ_{m}

meaning that the exposure expires at time

τ_{m}

, i.e.,

w (t) = w (t) 1_{{t \leq τ_{m}}}

for

t \geq 0

. This implies that the total exposure is assumed to be finite

W_{Λ} < \infty

.

Assume that the claims of the total exposure

W_{Λ}

on

(0, τ_{m}]

occur at times

T_{ℓ}

(called accident dates or claims occurrence dates) and the claims counting process

{(N (t))}_{t \geq 0}

is given by

N (t) = \sum_{ℓ \geq 1} 1_{{T_{ℓ} \leq t}}, for t \geq 0

and the total number of claims is given by

N = N (τ_{m}) = lim_{t \to \infty} N (t)

Model Assumptions 1.

We assume that the claims counting process

{(N (t))}_{t \geq 0}

is an inhomogeneous Poisson point process with intensity

{(w (t) Λ (t))}_{t \geq 0}

.

The second ingredient that we consider is the reporting date. Assume that a given claim ℓ occurs at time

T_{ℓ}

then we denote its reporting date at the insurance company by

S_{ℓ} \geq T_{ℓ}

. For accident date

T_{ℓ}

and corresponding reporting date

S_{ℓ}

of claim ℓ we define the reporting delay by

U_{ℓ} = S_{ℓ} - T_{ℓ} \geq 0

This motivates the study of the following inhomogeneous marked Poisson point process.

Model Assumptions 2.

We assume that

{({(T_{ℓ}, U_{ℓ})}_{ℓ = 1, \dots, N (t)})}_{t \geq 0}

describes an inhomogeneous marked Poisson process with accident dates

{(T_{ℓ})}_{ℓ \geq 1}

generated by an inhomogeneous Poisson point process

{(N (t))}_{t \geq 0}

having intensity

{(w (t) Λ (t))}_{t \geq 0}

and with mutually independent reporting delays (marks)

U_{ℓ} = U (T_{ℓ})

having a time-dependent distribution

U = U (t) \sim F_{U | t, Θ}

for

t \geq 0

and being independent of

{(N (t))}_{t \geq 0}

.

From Jewell [1,2] and Norberg [5,6] we immediately obtain the following likelihood function for parameters Λ and Θ

L_{N, {(T_{ℓ}, S_{ℓ})}_{ℓ = 1, \dots, N}} (Λ, Θ) = e^{- W_{Λ}} \frac{W_{Λ}^{N}}{N!} \prod_{ℓ = 1}^{N} \frac{w (T_{ℓ}) Λ (T_{ℓ})}{W_{Λ}} f_{U | T_{ℓ}, Θ} (S_{ℓ} - T_{ℓ})

(2)

where

f_{U | t, Θ}

denotes the density of

F_{U | t, Θ}

(for parameter Θ). Observe that we use a slight abuse of notation here. In Norberg [5,6] there is an additional factor

N!

because strictly speaking Model Assumptions 2 consider ordered claims arrivals

T_{(ℓ)} \leq T_{(ℓ + 1)}

for all

ℓ = 1, \dots, N - 1

, whereas in Jewell [1] and in Equation (2) claims arrivals are not necessarily ordered. The aim is to calibrate this model to individual claims data, that is, we would like to calibrate claims frequency Λ and reporting delay distribution

F_{U | t, Θ}

. One difficulty in this calibration lies in the fact that we have missing data, because information about occurred claims with reporting dates

S_{ℓ} > τ

is not available at time

τ \geq τ_{m}

. Therefore, we only observe a thinned inhomogeneous Poisson process, we also refer to Norberg [5,6].

We choose

τ \geq τ_{m}

. This implies that all claims have occurred at time τ, providing

N = N (τ) = N (τ_{m})

. By

M = M (τ) \leq N

we denoted the number of claims that are reported at time τ. The intractable likelihood (2) is then converted to, for details we refer to Jewell [1],

\begin{matrix} L_{{(T_{ℓ}, S_{ℓ})}_{ℓ = 1, \dots, M}} (Λ, Θ) & = & e^{- π_{Λ, Θ} (τ) W_{Λ}} \frac{W_{Λ}^{M}}{M!} \prod_{ℓ = 1}^{M} \frac{w (T_{ℓ}) Λ (T_{ℓ})}{W_{Λ}} f_{U | T_{ℓ}, Θ} (S_{ℓ} - T_{ℓ}) \\ = & e^{- π_{Λ, Θ} (τ) W_{Λ}} \frac{{(π_{Λ, Θ} (τ) W_{Λ})}^{M}}{M!} \prod_{ℓ = 1}^{M} \frac{w (T_{ℓ}) Λ (T_{ℓ})}{π_{Λ, Θ} (τ) W_{Λ}} f_{U | T_{ℓ}, Θ} (S_{ℓ} - T_{ℓ}) \end{matrix}

(3)

where we only consider reported claims ℓ with

S_{ℓ} \leq τ

and

π_{Λ, Θ} (τ)

denotes the probability that an incurred claim is reported by time

τ \geq τ_{m}

. This is given by

π_{Λ, Θ} (τ) = \int_{0}^{τ_{m}} \frac{w (t) Λ (t)}{W_{Λ}} F_{U | t, Θ} (τ - t) d t

3. Description of the Data

For the statistical analysis we consider two different European insurance portfolios: (1) line of business (LoB) Property, and (2) LoB Casualty. For both portfolios data is available from 1/1/2001 until 31/10/2010. In Figure 1, Figure 2 and Figure 3 we illustrate the data. Generally, LoB Property is colored blue and LoB Casualty is colored green (depending on the context special features may also be highlighted with other colors, this will be described in the corresponding captions). Figure 1 gives daily claims counts on the left-hand side (lhs) and monthly claims counts on the right-hand side (rhs). The following needs to be remarked for Figure 1:

The monthly claims counts on the rhs show a clear annual seasonality.
The daily claims counts on the lhs show a weekly seasonality with blue/green dots for weekdays, violet dots for Saturdays and orange dots for Sundays, in Table 1 we present the corresponding statistics.
In general, these graphs are decreasing because of missing IBNYR claims (late reportings) that affect younger accident years more than older ones.

In Figure 2 (lhs) we give the daily claims reporting and Figure 2 (rhs) plots accident dates

T_{ℓ}

versus reporting delays

U_{ℓ} = S_{ℓ} - T_{ℓ}

:

Daily reporting differs between weekdays (blue/green) and weekends (violet for Saturdays and orange for Sundays). Basically there is no reporting on weekends because claims staff in insurance companies does not work on weekends, however there is a visible change in LoB Property after 2006.
There is a change in reporting policy in LoB Property after 2006 (top, lhs), this is visible by the change of reportings on weekends (and will become more apparent in the statistical analysis below). We do not have additional information on this, but it may be caused by web-based reporting and needs special attention in modeling. We call 1/1/2006 “break point” in our analysis because it leads to non-stationarity, this will analyzed in detail below.
Figure 2 (rhs) gives the accident dates $T_{ℓ}$ versus the reporting delays $U_{ℓ} = S_{ℓ} - T_{ℓ}$ . We observe that the big bulk of the claims has a reporting delay of less than 1 year, and for both LoBs the resulting dots are located densely for $U_{ℓ} \leq 1$ . Bigger reporting delays are more sparse and LoB Casualty has more heavy-tailed reporting delays than LoB Property, the former having several claims with a reporting delay $U_{ℓ}$ of more than 3 years.

Finally, Figure 3 gives the box plots on the yearly scale of the logged reporting delays

log (U_{ℓ})

:

LoB Property has a change in reporting policy that leads to a faster reporting after break point 1/1/2006.
The graphs are generally decreasing because IBNYR claims (late reportings) are still missing, this corresponds to the upper-right (white) triangles in Figure 2 (rhs).

4. Seasonal Claims Frequency Modeling

4.1. Likelihood Function with Seasonality

We discuss the modeling of the claims frequency

Λ = {(Λ (t))}_{t \geq 0}

which can be any measurable function having finite integral on interval

(0, τ_{m}]

. Time

t \in R

is measured in daily units (unless stated otherwise). In order to get an appropriate model we study annual and weekly seasonality that both influence Λ, see Figure 1. For the weekly seasonality we choose a stationary periodic pattern. This is an appropriate choice unless the insurance product or the portfolio changes. For the annual seasonality we split the time interval

(0, τ_{m}]

into smaller sub-intervals on which statistical estimation is carried out. These smaller time intervals

(τ_{i - 1}, τ_{i}]

are given by finitely many integer-valued endpoints

0 = τ_{0} < τ_{1} < \dots < τ_{m - 1} < τ_{m}, and we set Δ τ_{i} = τ_{i} - τ_{i - 1}

(4)

for

1 \leq i \leq m

. Typically,

Δ τ_{i}

corresponds to a calendar month: this is naturally given if data becomes available on a monthly time scale. The following has to be considered: (i) the monthly time grid is not equidistant because months differ in the number of days; (ii) months and weekdays do not have the same periodicity; (iii)

Δ τ_{i}

should be sufficiently large so that reliable estimation can be done, and sufficiently small so that we have homogeneity on these time intervals; and (iv) smoothing between neighboring time intervals can be applied later on. Such a (monthly) seasonal split is reasonable because often insurance claims are influenced by external factors (such as winter and summer) that (may) only affect bounded time intervals for claims occurrence. This approximation can be seen as a reasonable modeling assumption; if more randomness is involved then we should switch to a hidden Markov model, such as the Cox model presented in Badescu et al. [18,19].

On this time grid we then make the following assumptions: for all

1 \leq i \leq m

and

t \in (τ_{i - 1}, τ_{i}]

w_{i} : = w (τ_{i}) = w (t), Λ_{i} λ_{⌈ t ⌉} = Λ_{i} λ_{t} = Λ (t)

(5)

with weekly periodic (piece-wise constant) pattern

λ_{⌈ t ⌉} = λ_{t} = λ_{t + 7}

for all

t > 0

fulfilling normalization

\sum_{k = 1}^{7} λ_{k} = 7

and global parameter

Λ_{i}

for interval

(τ_{i - 1}, τ_{i}]

. We remark the following:

We have a weekly periodic piece-wise constant pattern that is assumed to be stationary and a (monthly) seasonal parameter $Λ_{i}$ . The total exposure on $(τ_{i - 1}, τ_{i}]$ is given by

$W_{Λ}^{(i)} = w_{i} Λ_{i} \int_{τ_{i - 1}}^{τ_{i}} λ_{t} d t = w_{i} Λ_{i} \sum_{k = τ_{i - 1} + 1}^{τ_{i}} λ_{k} = : w_{i} Λ_{i} λ_{i}^{+}$

(6)

$λ_{i}^{+} = \sum_{k = τ_{i - 1} + 1}^{τ_{i}} λ_{k}$ in general differs from $Δ τ_{i}$ because different months may have different weekday constellations.
In the special case of $λ_{t} \equiv 1$ we obtain the piece-wise homogeneous case

$w_{i} = w (τ_{i}) = w (t), Λ_{i} = Λ (τ_{i}) = Λ (t), λ_{i}^{+} = Δ τ_{i}$

(7)

This is a step function for the claims frequency providing total exposure $W_{Λ}^{(i)} = w_{i} Λ_{i} Δ τ_{i}$ . Note that model (7) was studied in Section 4.2 of Antonio-Plat [17]. For our real data examples, the influence of the additional weekly periodic parameter ${(λ_{k})}_{k = 1, \dots, 7}$ will be visualized in Figure 4, below.
We could also choose a yearly seasonal pattern for ${(Λ_{i})}_{1 \leq i \leq m}$ if, for instance, $Δ τ_{i}$ correspond to calendar months. This is supported by Figure 1 (rhs) and would reduce the number of parameters. This is particularly important for claims prediction, i.e., for predicting the number of claims of future exposure years. In our analysis we refrain from choosing additional structure for ${(Λ_{i})}_{1 \leq i \leq m}$ because we will concentrate on inference of past exposures and because the volumes of the two LoBs are sufficiently large to get reliable inference results on a monthly scale.

Time grid (4) defines a natural partition on which the inhomogeneous marked Poisson point process decouples into independent components, see Theorem 2 in Norberg [6]. The (τ-observable) likelihood on time interval

(τ_{i - 1}, τ_{i}]

under the above assumptions is given by

L_{{(T_{ℓ}^{(i)}, S_{ℓ}^{(i)})}_{ℓ = 1, \dots, M_{i}}}^{(i)} (Λ, Θ) = e^{- π_{Θ}^{(i)} (τ) W_{Λ}^{(i)}} \frac{{(W_{Λ}^{(i)})}^{M_{i}}}{M_{i}!} \prod_{ℓ = 1}^{M_{i}} \frac{λ_{T_{ℓ}^{(i)}}}{λ_{i}^{+}} f_{U | T_{ℓ}^{(i)}, Θ} (S_{ℓ}^{(i)} - T_{ℓ}^{(i)})

(8)

where

M_{i} = M_{i} (τ)

denotes the number of reported claims at time

τ \geq τ_{m}

with accident dates

T_{ℓ}^{(i)} \in (τ_{i - 1}, τ_{i}]

and corresponding reporting dates

S_{ℓ}^{(i)} \leq τ

for

ℓ = 1, \dots, M_{i}

. The probability that an incurred claim with accident date in

(τ_{i - 1}, τ_{i}]

is reported at time

τ \geq τ_{i}

simplifies to

π_{Θ}^{(i)} (τ) = \frac{1}{W_{Λ}^{(i)}} \int_{τ_{i - 1}}^{τ_{i}} w_{i} Λ_{i} λ_{t} F_{U | t, Θ} (τ - t) d t = \frac{1}{λ_{i}^{+}} \int_{τ_{i - 1}}^{τ_{i}} λ_{t} F_{U | t, Θ} (τ - t) d t

Observe that probability

π_{Θ}^{(i)} (τ)

only depends on the (weekly-) seasonal pattern

{(λ_{t})}_{t \in (τ_{i - 1}, τ_{i}]}

and on Θ, but not on global parameter

Λ_{i}

of time interval

(τ_{i - 1}, τ_{i}]

.

Formula (8) specifies the likelihood on time interval

(τ_{i - 1}, τ_{i}]

. Due to the independent splitting property of inhomogeneous marked Poisson processes under partitions the total likelihood function at time

τ \geq τ_{m}

is given by

L_{{(T_{ℓ}, S_{ℓ})}_{ℓ = 1, \dots, M}} (Λ, Θ) = \prod_{i = 1}^{m} L_{{(T_{ℓ}^{(i)}, S_{ℓ}^{(i)})}_{ℓ = 1, \dots, M_{i}}}^{(i)} (Λ, Θ)

(9)

In the estimation procedure below we assume that the weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

is given and parameters

{(Λ_{i})}_{1 \leq i \leq m}

and Θ are estimated with maximum likelihood estimation (MLE) from (9), based on the knowledge of

{(λ_{k})}_{k = 1, \dots, 7}

. In fact, in the applications below we will use a plug-in estimate for

{(λ_{k})}_{k = 1, \dots, 7}

. We could also consider the full likelihood, including

{(λ_{k})}_{k = 1, \dots, 7}

, but for computational reasons we refrain from doing so.

4.2. Analysis of the MLE System

In this section we derive the maximum likelihood estimate (MLE) of

{(Λ_{i})}_{1 \leq i \leq m}

and Θ based on the knowledge of the weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

. We calculate the derivatives of the logarithms of (9) and (8), respectively, and set them equal to zero to find the MLE. This provides the following lemma.

Lemma 3.

Under Model Assumptions 2 with exposure (5), the MLE

({({\hat{Λ}}_{i}^{(τ)})}_{1 \leq i \leq m}, {\hat{Θ}}^{(τ)})

of parameter

({(Λ_{i})}_{1 \leq i \leq m}, Θ)

at time

τ \geq τ_{m}

for given weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

is obtained by the solution of

\frac{\partial}{\partial Θ} \sum_{i = 1}^{m} \sum_{ℓ = 1}^{M_{i}} log (\frac{λ_{T_{ℓ}^{(i)}} f_{U | T_{ℓ}^{(i)}, Θ} (S_{ℓ}^{(i)} - T_{ℓ}^{(i)})}{λ_{i}^{+} π_{Θ}^{(i)} (τ)}) = 0

and

Λ_{i} = \frac{M_{i}}{π_{Θ}^{(i)} (τ) w_{i} λ_{i}^{+}}

The proof is given in Appendix A. Observe that for known weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

the MLE decouples in the sense that the MLE of Θ can be calculated independently of

{(Λ_{i})}_{1 \leq i \leq m}

. This substantially helps in the calibration below because it reduces complexity. Secondly, we remark that the function

(t, s) \mapsto f_{T, S | {τ_{i - 1} < T \leq τ_{i}}, {S \leq τ}, Θ} (s - t) = \frac{λ_{t} f_{U | t, Θ} (s - t)}{λ_{i}^{+} π_{Θ}^{(i)} (τ)} 1_{{s \geq t}}

(10)

gives a density on

(τ_{i - 1}, τ_{i}] \times (0, τ]

.

4.3. Calibration of the Weekly Periodic Pattern

The MLE in Lemma 3 assumes that the weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

is given (and known). In this section we compute a plug-in estimate

{({\hat{λ}}_{k})}_{k = 1, \dots, 7}

. This has the advantage that the MLE remains tractable. We estimate this weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

under one additional assumption which we only use for this purpose (and drop again thereafter): assume

τ \geq τ_{m}

is fixed and that there exists

m^{*} \in {1, \dots, m}

such that

F_{U | t, Θ} (τ - t) = 1

(11)

for all

t \leq τ_{m^{*}}

and all Θ. Assumption Equation (11) is an approximation that we only use for choosing

{(λ_{k})}_{k = 1, \dots, 7}

. It has the advantage that all time points

t \leq τ_{m^{*}}

are fully experienced at time τ. Estimation of

{(λ_{k})}_{k = 1, \dots, 7}

is then only done based on claims with

T_{ℓ} \leq τ_{m^{*}}

because for these occurrence days there are no missing values. Of course, this neglects the latest information but often (in stationary cases, if

τ_{m^{*}}

is not too small and if late reportings do not distort the weekly periodic pattern) this estimation is sufficiently robust. The following lemma is proved in Appendix A.

Lemma 4.

Under Model Assumptions 2 with exposure (5) on a weekly time grid

Δ τ_{i} = 7

(for all

1 \leq i \leq m^{*}

) and under assumption (11), the MLE

{({\hat{λ}}_{k})}_{k = 1, \dots, 7}

of the weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

with side constraint

\sum_{k = 1}^{7} λ_{k} = 7

based on claims with accident dates before

τ_{m^{*}}

is for

k = 1, \dots, 7

given by

{\hat{λ}}_{k} = 7 \frac{\sum_{i = 1}^{m^{*}} \sum_{ℓ = 1}^{M_{i}} 1_{{⌈ {\tilde{T}}_{ℓ}^{(i)} ⌉ = k}}}{\sum_{i = 1}^{m^{*}} M_{i}}

where

{\tilde{T}}_{ℓ}^{(i)} = T_{ℓ}^{(i)} - τ_{i - 1}

We may examine the robustness of these estimates by choosing different time horizons

τ_{m^{*}}

, this is done in Figure 5 (lhs); these graphs also show the confidence bounds we explain how to construct. We have

{\hat{λ}}_{k}^{- 1} = \frac{1}{7} \frac{\sum_{i = 1}^{m^{*}} M_{i}}{\sum_{i = 1}^{m^{*}} \sum_{ℓ = 1}^{M_{i}} 1_{{⌈ {\tilde{T}}_{ℓ}^{(i)} ⌉ = k}}} = \frac{1}{7} (1 + \frac{\sum_{i = 1}^{m^{*}} \sum_{ℓ = 1}^{M_{i}} 1_{{⌈ {\tilde{T}}_{ℓ}^{(i)} ⌉ \neq k}}}{\sum_{i = 1}^{m^{*}} \sum_{ℓ = 1}^{M_{i}} 1_{{⌈ {\tilde{T}}_{ℓ}^{(i)} ⌉ = k}}}) = : \frac{1}{7} (1 + \frac{X_{1}}{X_{2}})

The latter ratio considers the realization of two independent Poisson distributed random variables with means and variances, respectively,

μ_{1} = E [X_{1}] = Var (X_{1}) = \frac{7 - λ_{k}}{7} \sum_{i = 1}^{m^{*}} w_{i} Λ_{i}, μ_{2} = E [X_{1}] = Var (X_{1}) = \frac{λ_{k}}{7} \sum_{i = 1}^{m^{*}} w_{i} Λ_{i}

We set

v^{*} = \sum_{i = 1}^{m^{*}} w_{i} Λ_{i}

. The central limit theorem provides for

i = 1, 2

(X_{i} - μ_{i}) / \sqrt{μ_{i}} \Rightarrow N (0, 1) for v^{*} \to \infty

For this reason we approximate

X_{i} \overset{(d)}{\approx} Z_{i} \sim N (μ_{i}, μ_{i})

and similarly

{\hat{λ}}_{k}^{- 1} = \frac{1}{7} (1 + \frac{X_{1}}{X_{2}}) \overset{(d)}{\approx} \frac{1}{7} (1 + \frac{Z_{1}}{Z_{2}})

Following Hinkley [20] we can study the asymptotic behavior of the latter using

μ_{2} / \sqrt{μ_{2}} \to \infty

for

v^{*} \to \infty

. This provides approximation for large

v^{*}

\begin{matrix} P [{\hat{λ}}_{k} \leq x] & \approx & 1 - P [\frac{Z_{1}}{Z_{2}} \leq 7 x^{- 1} - 1] \approx 1 - Φ (\frac{μ_{2} (7 x^{- 1} - 1) - μ_{1}}{\sqrt{μ_{2} {(7 x^{- 1} - 1)}^{2} + μ_{1}}}) \end{matrix}

This allows us to derive approximate confidence bounds of the weekly periodic pattern estimates. Choose confidence level

α \in (1 / 2, 1)

, then we get a two-sided confidence bound estimate

{\hat{λ}}_{k} \in [x_{-} ((1 - α) / 2), x_{+} ((1 - α) / 2)]

(12)

with for

p \in (0, 1)

\begin{matrix} x_{\pm} (p) & = & 7 \frac{{(Φ^{- 1} (p))}^{2} μ_{2} - μ_{2}^{2}}{{(Φ^{- 1} (p))}^{2} μ_{2} - μ_{2}^{2} - μ_{1} μ_{2} \mp Φ^{- 1} (p) \sqrt{μ_{1} μ_{2}} \sqrt{μ_{1} + μ_{2} - {(Φ^{- 1} (p))}^{2}}} \end{matrix}

Replacing all parameters by their MLEs given by Lemma 4 and

w_{i} {\hat{Λ}}_{i} = M_{i}

(which is the MLE for

i \leq m^{*}

under assumption (11)) we get an estimate for the confidence bounds (12). These are plotted in Figure 5.

In Figure 5 (rhs) we present the resulting MLEs

{({\hat{λ}}_{k})}_{k = 1, \dots, 7}

and the corresponding (estimated) confidence bounds for confidence level

α = 90 %

. We observe narrow confidence bounds and substantial daily differences. In particular, claims frequencies in LoB Casualty are much lower on weekends than on weekdays (this may suggest that we consider commercial casualty insurance business). For LoB Property we observe higher frequencies on Fridays and Saturdays, also this is directly related to the underlying business. Figure 5 (lhs) gives the corresponding time series as a function of

τ_{m^{*}}

. We observe convergence of the estimates after roughly 3 years of observations.

5. Calibration of the Reporting Delay Distribution

In this section we study the choice of the reporting delay distribution

F_{U | t, Θ}

. In an empirical analysis we identify three regimes of reporting delays which we will model separately. In short, (1) small reporting delay layer where we consider a weekday structure, see Figure 6; (2) middle reporting delay layer with the main bulk of reportings, see Figure 14 (top); and (3) large reporting delay layer that should have an appropriate tail for late reportings, see Figure 14 (bottom). We call these the small, middle and large layers, and label them by

n = 1, 2, 3

.

5.1. Decoupling of the Reporting Delay Distribution

As introduced in (4), we choose a monthly time grid

0 = τ_{0} < τ_{1} < \dots < τ_{m}

with

τ_{m}

being the end point of the last observed calendar month. The monthly time grid is naturally given because available data is provided on that time scale. From a statistical point of view also finer or wider time grids are possible. In LoB Property we have about 400 claims per month which gives a coefficient of variation of 5% (for the Poisson distribution) and in LoB Casualty we have between 80 and 100 claims which gives a coefficient of variation of roughly 10%, see Figure 1 (rhs). If we would have stationarity we could (or even should) take bigger time intervals, but because of the yearly seasonal pattern, monthly time intervals are preferred to capture these seasonal differences.

The end point

τ_{m}

of the last observed calendar month will also be considered as a variable that evolves when more and more information becomes available. First (rather limited) information is available at 31/1/2001 and latest available information is as of 31/10/2010. Thus,

τ_{m}

will run from 31/1/2001 to 31/10/2010 and we perform dynamic calibration based on actual information.

We make assumption (5) on that monthly time grid and we remark that the monthly time grid is not equally spaced in number of days. Therefore it is convenient to measure time t in daily units. We then assume that the weekly seasonal pattern

{(λ_{k})}_{k = 1, \dots, 7}

is estimated (and fixed, see Figure 5) through Lemma 4 (and we drop the upper hat in the notation of

λ_{k}

). Note that fixing the weekly periodic pattern reduces the computational complexity in the sequel.

Next we need to choose the reporting delay distribution

F_{U | t, Θ}

. We choose three layers with thresholds

0 = u^{(0)} < u^{(1)} < u^{(2)} < u^{(3)} = \infty

and density

f_{U | t, Θ} (u) = \sum_{n = 1}^{3} p_{U | t, Θ}^{(n)} f_{U | t, Θ}^{(n)} (u) 1_{{u^{(n - 1)} \leq u < u^{(n)}}}

(13)

where the probability weights

p_{U | t, Θ}^{(n)} \geq 0

are normalized

\sum_{n = 1}^{3} p_{U | t, Θ}^{(n)} = 1

, and

f_{U | t, Θ}^{(n)} (\cdot)

are densities supported on

[u^{(n - 1)}, u^{(n)})

for

n = 1, 2, 3

. We make the following assumptions:

Assumption 5.

We choose time units in days and make the following (additional) assumptions for the density in (13): For

t \in (τ_{i - 1}, τ_{i}]

and

n = 1, 2, 3

we assume

p_{U | t, Θ}^{(n)} = p_{U | τ_{i}, Θ}^{(n)}, f_{U | t, Θ}^{(1)} = f_{U | ⌈ t ⌉, Θ}^{(1)}, f_{U | t, Θ}^{(2)} = f_{U | τ_{i}, Θ}^{(2)}, f_{U | t, Θ}^{(3)} = f_{U | τ_{i}, Θ}^{(3)}

This assumption says that the density of the small layer depends on weekdays

⌈ t ⌉

and the remaining terms only depend on the accident month

(τ_{i - 1}, τ_{i}]

. The cumulative distribution function is under Assumption 5 for

t \in (τ_{i - 1}, τ_{i}]

given by

F_{U | t, Θ} (u) = \{\begin{matrix} p_{U | τ_{i}, Θ}^{(1)} F_{U | ⌈ t ⌉, Θ}^{(1)} (u) & for u^{(0)} \leq u < u^{(1)} \\ p_{U | τ_{i}, Θ}^{(1)} + p_{U | τ_{i}, Θ}^{(2)} F_{U | τ_{i}, Θ}^{(2)} (u) & for u^{(1)} \leq u < u^{(2)} \\ p_{U | τ_{i}, Θ}^{(1)} + p_{U | τ_{i}, Θ}^{(2)} + p_{U | τ_{i}, Θ}^{(3)} F_{U | τ_{i}, Θ}^{(3)} (u) & for u^{(2)} \leq u < u^{(3)} \end{matrix}

(14)

This split in layers again defines a partition and the likelihood decouples into independent parts; see also Theorem 2 of Norberg [6]. The log-likelihood of (9) is then at time

τ_{m}

given by

\begin{matrix} \sum_{i = 1}^{m} \sum_{n = 1}^{3} ℓ_{{(T_{ℓ}^{(i, n)}, S_{ℓ}^{(i, n)})}_{ℓ = 1, \dots, M_{i, n}}}^{(i, n)} (Λ, Θ) & \propto & \sum_{i = 1}^{m} \sum_{n = 1}^{3} - π_{Θ}^{(i, n)} (τ_{m}) W_{Λ, Θ}^{(i, n)} + M_{i, n} log (W_{Λ, Θ}^{(i, n)}) \\ + \sum_{ℓ = 1}^{M_{i, n}} log (\frac{λ_{T_{ℓ}^{(i, n)}}}{λ_{i}^{+}} f_{U | T_{ℓ}^{(i, n)}, Θ}^{(n)} (U_{ℓ}^{(i, n)})) \end{matrix}

(15)

with

M_{i, n} = M_{i, n} (τ_{m})

being the number of reported claims at time

τ_{m}

with accident dates

T_{ℓ}^{(i, n)} \in (τ_{i - 1}, τ_{i}]

and reporting delays

U_{ℓ}^{(i, n)} = S_{ℓ}^{(i, n)} - T_{ℓ}^{(i, n)} \in [u^{(n - 1)}, u^{(n)})

. The total exposures for this partition are given by

W_{Λ, Θ}^{(i, n)} = w_{i} Λ_{i} \int_{τ_{i - 1}}^{τ_{i}} λ_{t} \int_{u^{(n - 1)}}^{u^{(n)}} f_{U | t, Θ} (u) d u d t = w_{i} Λ_{i} p_{U | τ_{i}, Θ}^{(n)} \int_{τ_{i - 1}}^{τ_{i}} λ_{t} d t = w_{i} Λ_{i} λ_{i}^{+} p_{U | τ_{i}, Θ}^{(n)}

(16)

The probability that these claims are reported at time

τ_{m}

is given by

π_{Θ}^{(i, n)} (τ_{m}) = \frac{1}{λ_{i}^{+}} \int_{τ_{i - 1}}^{τ_{i}} λ_{t} F_{U | t, Θ}^{(n)} (τ_{m} - t) d t

(17)

Note that

F_{U | t, Θ}^{(n)} (τ_{m} - t) = 1

for

τ_{m} - t \geq u^{(n)}

because in that case all claims have been reported at time

τ_{m}

with reporting delay less than

u^{(n)}

. This may substantially simplify the analysis in the lower and middle layers

n = 1, 2

, and is similar to (11).

5.2. Calibration of the Small Reporting Delay Layer

5.2.1. Model in the Small Reporting Delay Layer

We start by considering the small reporting delay layer

[u^{(0)}, u^{(1)})

. Data shows that reporting delays have a weekly pattern because claims divisions do not (necessarily) work at weekends and a claim occurring, for instance, on a Saturday can only be reported on Monday. This is illustrated in Figure 6. This indicates that we need a (week-) daily modeling approach. In order to not over-parametrize our model we try to keep this (week-) daily modeling layer as small as possible. The canonical choice then is to set

u^{(1)} = 7

because after one week all claims have experienced a full weekly cycle and reporting should be on a similar level for all weekdays, this is supported by Figure 6 (middle), though not fully.

We could now try to maximize the log-likelihood (15) by brute force. Observe that this includes a coupling between all layers through exposures

W_{Λ, Θ}^{(i, n)}

because we have

p_{U | t, Θ}^{(3)} = 1 - p_{U | t, Θ}^{(1)} - p_{U | t, Θ}^{(2)}

. This is unpleasant from a computational point of view, and we therefore propose an approximation. For

i < m

we have

τ_{m} \geq u^{(1)} + τ_{i}

which implies

π_{Θ}^{(i, 1)} (τ_{m}) = 1

for all

i < m

. For

i = m

we have from (17) and under Assumption 5, assumption

λ_{t} = λ_{⌈ t ⌉}

and a change of variable

u = τ_{m} - t

\begin{matrix} π_{Θ}^{(m, 1)} (τ_{m}) & = & \frac{1}{λ_{m}^{+}} \int_{0}^{Δ τ_{m}} λ_{τ_{m} - u} F_{U | τ_{m} - u, Θ}^{(1)} (u) d u \\ = & 1 - \frac{1}{λ_{m}^{+}} \sum_{v = 0}^{u^{(1)} - 1} λ_{τ_{m} - v} \int_{v}^{v + 1} 1 - F_{U | τ_{m} - v, Θ}^{(1)} (u) d u \end{matrix}

(18)

This shows that if parameter

Θ = (Θ_{1}, Θ_{2}, Θ_{3}, p_{1}, p_{2})

splits into parameters

Θ_{n}

for

f_{U | t, Θ}^{(n)} = f_{U | t, Θ_{n}}^{(n)}

,

n = 1, 2, 3

, and

p_{U | t, Θ}^{(1)} = p_{1}

and

p_{U | t, Θ}^{(2)} = p_{2}

(we omit possible time dependence in the notation of

Θ_{n}

and

p_{n}

), then the coupling of the lower layer with the other two layers happens through

π_{Θ_{1}}^{(m, 1)} (τ_{m})

and

p_{U | t, Θ}^{(1)} = p_{1}

and

p_{U | t, Θ}^{(2)} = p_{2}

in exposures

W_{Λ, Θ}^{(i, n)}

. If we neglect in (18) the latest 7 days of observations, i.e., claims with occurrence dates

T_{ℓ}^{(m, 1)} \in (τ_{m} - u^{(1)}, τ_{m}]

, then the calibration of the lower layer density

f_{U | t, Θ}^{(1)} = f_{U | t, Θ_{1}}^{(1)}

completely decouples from the other two layers because we have full information (no missing data) for accidents with occurrence dates in

(τ_{0}, τ_{m} - u^{(1)}]

at time

τ_{m}

. This is indicated by the dashed red line in Figure 7 (lhs). In most cases this provides a reasonable approximation because the last

u^{(1)} = 7

days will not completely change the calibration of the lower reporting delay distribution

F_{U | t, Θ_{1}}^{(1)}

if we have observations over, say, 10 years ≈ 3652 days. For this reason we shorten the last time interval to

(τ_{m - 1}, τ_{m} - u^{(1)}]

which provides in view of (15) MLE for

Θ_{1}

0 \overset{!}{=} \frac{\partial}{\partial Θ_{1}} [\sum_{i = 1}^{m - 1} \sum_{ℓ = 1}^{M_{i, 1}} log f_{U | T_{ℓ}^{(i, 1)}, Θ_{1}}^{(1)} (U_{ℓ}^{(i, 1)}) + \sum_{ℓ = 1}^{M_{m, 1}^{'}} log f_{U | T_{ℓ}^{(m, 1)}, Θ_{1}}^{(1)} (U_{ℓ}^{(m, 1)})]

(19)

where

M_{m, 1}^{'}

denotes the number of reported claims at time

τ_{m}

with accident dates

T_{ℓ}^{(m, 1)} \in (τ_{m - 1}, τ_{m} - u^{(1)}]

and reporting delays

U_{ℓ}^{(m, 1)} \in [0, u^{(1)})

. The first component

Θ_{1}

of Θ is assumed to fully characterize density

f_{U | t, Θ_{1}}^{(1)}

but no other part of the reporting delay distribution.

Next we discuss the explicit choice of

f_{U | t, Θ_{1}}^{(1)}

. In Figure 6 (lhs) we plot the empirical distribution of all claims with accident dates before 01/2006 and a maximal reporting delay of 365 days. Figure 6 only shows the claims with reporting delays

U_{ℓ} < u^{(1)} = 7

, i.e., belonging to the small layer. The lhs shows the individual delay distributions per weekday of occurrence, the middle pictures the same distributions but compressed by the weekends (because there are (almost) no reportings on Saturdays and Sundays) and the rhs shows the compressed graph that is normalized to 1 at time

u^{(1)}

. The graphs indicate that we should start by modeling weekdays individually. We make the following Ansatz: choose discrete distributions with, for

u = 0, \dots, u^{(1)} - 1 = 6

and

t \geq 0

,

f_{U | t, Θ_{1}}^{(1)} (u) = θ_{⌈ t ⌉} (u) = θ_{⌈ t ⌉ + u^{(1)}} (u)

(20)

with

θ_{⌈ t ⌉} (u) \geq 0

such that

\sum_{v = 0}^{u^{(1)} - 1} θ_{⌈ t ⌉} (v) = 1

. The second identity in (20) implies that we obtain a weekly periodic reporting delay distribution with 42 parameters

Θ_{1} = {(θ_{s} (u))}_{0 \leq u \leq u^{(1)} - 1, 1 \leq s \leq u^{(1)} - 1}

, if we assume stationarity (20) on the weekly time grid. We may even ask for more parameters because of potential non-stationarity of the reporting behavior, see Figure 2. We refrain from doing so but we use a rolling window to detect and capture non-stationarity. We should also remark that the 42 parameters raise questions about over-parametrization. We will investigate this question below and we will find that we can reduce the number of parameters in LoB Casualty, in LoB Property we will work with parametrization (20) which will provide rather stable results due to sufficient volume in this LoB.

Optimization problem (19) provides Lagrangian with Lagrange multipliers

χ = {(χ_{s})}_{1 \leq s \leq u^{(1)}}

\begin{matrix} L_{τ_{m}} (Θ_{1}, χ) = \sum_{i = 1}^{m - 1} \sum_{ℓ = 1}^{M_{i, 1}} log f_{U | T_{ℓ}^{(i, 1)}, Θ_{1}}^{(1)} (U_{ℓ}^{(i, 1)}) & + & \sum_{ℓ = 1}^{M_{m, 1}^{'}} log f_{U | T_{ℓ}^{(m, 1)}, Θ_{1}}^{(1)} (U_{ℓ}^{(m, 1)}) \\ - & \sum_{s = 1}^{u^{(1)}} χ_{s} (\sum_{v = 0}^{u^{(1)} - 1} θ_{s} (v) - 1) \end{matrix}

The MLE of (19) is then found by setting the derivatives of

L_{τ_{m}} (Θ_{1}, χ)

w.r.t.

Θ_{1}

and χ equal to zero and solving this system of equations. For

s = 1, \dots, u^{(1)}

and

u = 0, \dots, u^{(1)} - 1

we define

M_{s, u}^{''} = M_{s, u}^{''} (τ_{m})

to be the number of reported claims at time

τ_{m}

with accident dates in

(τ_{0}, τ_{m} - u^{(1)}]

, reported on weekday s and having reporting delay u (we think of

s = 1

being Mondays and

s = 7

being Sundays). The MLE of

θ_{s} (u)

at time

τ_{m}

is then given by

{\hat{θ}}_{s}^{(τ_{m})} (u) = \frac{M_{s, u}^{''} (τ_{m})}{\sum_{u = 0}^{u^{(1)} - 1} M_{s, u}^{''} (τ_{m})}

The lower reporting delay layer distribution is at time

τ_{m}

estimated by

f_{U | t, {\hat{Θ}}_{1}^{(τ_{m})}}^{(1)} (u) = {\hat{θ}}_{⌈ t ⌉}^{(τ_{m})} (u) = \frac{M_{⌈ t ⌉, u}^{''} (τ_{m})}{\sum_{u = 0}^{u^{(1)} - 1} M_{⌈ t ⌉, u}^{''} (τ_{m})}

(21)

To capture potential non-stationarity we will choose a fixed window length K and consider this estimate based on observations in

(τ_{(m - K) \lor 0}, τ_{m} - u^{(1)}]

.

5.2.2. Empirics and Fitting the Small Reporting Delay Layer Distributions

We estimate distributions (21) in the lower reporting delay layer

[u^{(0)}, u^{(1)})

for the 2 LoBs.

In Table 2 and Table 3 we give the observed number of reported claims

M_{s, u}^{''} (τ_{m})

for weekdays

1 \leq s \leq 7

and reporting delays

0 \leq u \leq 6 = u^{(1)} - 1

at time

τ_{m} = 31 / 10 / 2010

for claims with accident dates before 26/10/2010. We see that there is a weekly pattern with no reportings on weekends in LoB Casualty and fewer reportings on weekends in LoB Property. For the latter we estimate all parameters

θ_{s} (u)

individually, for the former we may also discuss other approaches, for instance, compressing weekends and shift the weekdays in Table 3 to the left (which is done on the rhs of Table 3). Moreover, the numbers in Table 3 are rather small and of similar size which may also suggest that we should not distinguish different reporting delays. We investigate this more formally below.

We start with LoB Property. We show in Figure 8 the resulting estimates (see (20)) with a rolling window of length 2·365 days (solid lines) which is compared to the estimate considering all observations (dotted lines). We clearly see the non-stationarity after the break point at 1/1/2006, and the rolling window seems to capture it rather well. Therefore, we do not use any other measures here, but work with the rolling window of length 2·365 days (solid lines).

For LoB Casualty the non-stationarity is less obvious, see Figure 9. In fact, the resulting estimates with a rolling window of length 2·365 days (solid lines) are rather volatile which is a clear sign of over-parametrization. The dotted lines show the estimates based on all available observations, these are much smoother with a slight positive trend for some of the weekdays. At this point we could investigate more thoroughly this non-stationarity, we refrain from doing so because in this case study it may only marginally influence the estimation of the number of IBNYR claims: the potential trend has a very moderate slope which affects less then 10% of the claims in LoB Casualty (small reporting delay layer). For this reason, we simply choose a stationary model and we mainly aim at studying whether we can further reduce the number of parameters in

Θ_{1}

. First, we compress the weekends (in Table 3 we go from the lhs denoted

0, \dots, 6

to the rhs denoted

0^{*}, \dots, 4^{*}

). Then we test the null hypothesis whether for all weekdays

s = 1, \dots, 7

and compressed reporting delays

u^{*} = 1^{*}, \dots, 4^{*}

we can choose the same (empirical) probability, and for all weekdays

s = 1, \dots, 7

we can choose the same probability for reporting delay

u^{*} = 0^{*}

, that is, we test the null hypothesis

θ_{1} (0^{*}) = \dots = θ_{7} (0^{*})

(delay

0^{*}

does not differ between weekdays

s = 1, \dots, 7

) and

θ_{1} (1^{*}) = \dots = θ_{7} (4^{*})

(delays

1^{*}, \dots, 4^{*}

and weekdays

s = 1, \dots, 7

do not differ). We perform for every weekday

s = 1, \dots, 7

a Pearson’s

χ^{2}

-test (for the information at time

τ_{m} = 31 / 10 / 2010

). The corresponding test statistics is

χ_{s}^{2} = \sum_{u^{*} = 0^{*}}^{4^{*}} \frac{{(M_{s, u^{*}}^{''} (τ_{m}) - M_{s, •}^{''} (τ_{m}) {\hat{θ}}_{s}^{(τ_{m})} (u^{*}))}^{2}}{M_{s, •}^{''} (τ_{m}) {\hat{θ}}_{s}^{(τ_{m})} (u^{*})},

(22)

with

M_{s, u^{*}}^{''} (τ_{m})

being the number of reported claims with occurrence day s and compressed reporting delay

u^{*}

,

M_{s, •}^{''} (τ_{m}) = \sum_{u^{*} = 0^{*}}^{4^{*}} M_{s, u^{*}}^{''} (τ_{m})

and

{\hat{θ}}_{s}^{(τ_{m})} (u^{*})

being the corresponding MLE under the null hypothesis.

We provide the resulting p-values in Table 4. The resulting p-value for claims with accident dates on Wednesdays

s = 3

is 3.2% and for all other weekdays s we obtain p-values bigger than 20%. We consider these p-values to be sufficiently large so that we do not reject the null hypothesis. This leads to a substantial reduction in the number of parameters, and we only choose three different values for

Θ_{1} = {(θ_{s} (u))}_{0 \leq u \leq u^{(1)} - 1, 1 \leq s \leq u^{(1)}}

at any time point

τ_{m}

in this reduced case (the third one being 0% for weekends). In Figure 10 we show the results, the solid line gives the estimates under the null hypothesis and the dotted lines the estimates of the model with 42 parameters. For LoB Casualty we choose this reduced model (under the null hypothesis).

Using (18) and the fact that we choose a step function for

F_{U | t, Θ_{1}}^{(1)}

we obtain estimated probability in the small reporting delay layer given by

π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(m, 1)} (τ_{m}) = 1 - \frac{1}{λ_{m}^{+}} \sum_{v = 0}^{u^{(1)} - 1} λ_{τ_{m} - v} (1 - \sum_{u = 0}^{v} {\hat{θ}}_{τ_{m} - v}^{(τ_{m})} (u)) .

(23)

The results are presented in Figure 4 and they are compared to the case where we do not choose a weekly periodic pattern, that is, where we set

{(λ_{k})}_{k = 1, \dots, 7} \equiv 1

. This latter model is the one used in Section 4.2 of Antonio-Plat [17]. We see that the weekly periodic pattern essentially smooths the estimates, in particular, for weekends in LoB Casualty. This confirms the findings of Section 4.3 and, in particular, of Figure 5. For this reason we continue with the model allowing us for the modeling of a weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

. We fix the resulting estimates

π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(m, 1)} (τ_{m})

and then calibrate the middle and large layers, this is explained next.

5.3. Calibration of Middle and Large Reporting Delay Layers

We come back to log-likelihood (15). We replace in the small reporting delay layer parameter

Θ_{1}

by its estimate

{\hat{Θ}}_{1}^{(τ_{m})}

derived in the previous subsection. This provides the log-likelihood at time

τ_{m}

, we only show the terms including the unknown parameters

(Λ, Θ_{- 1}) : = (Λ, Θ_{2}, Θ_{3}, p_{1}, p_{2})

,

\begin{matrix} ℓ_{τ_{m}} (Λ, Θ_{- 1}) & \propto & \sum_{i = 1}^{m} [- π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(i, 1)} (τ_{m}) W_{Λ, Θ}^{(i, 1)} + \sum_{n = 2}^{3} - π_{Θ_{n}}^{(i, n)} (τ_{m}) W_{Λ, Θ}^{(i, n)} + \sum_{n = 1}^{3} M_{i, n} log (W_{Λ, Θ}^{(i, n)})] \\ + \sum_{i = 1}^{m} \sum_{n = 2}^{3} \sum_{ℓ = 1}^{M_{i, n}} log (\frac{λ_{T_{ℓ}^{(i, n)}}}{λ_{i}^{+}} f_{U | T_{ℓ}^{(i, n)}, Θ_{n}}^{(n)} (U_{ℓ}^{(i, n)})) \end{matrix}

(24)

where

π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(i, 1)} (τ_{m}) = 1

for

i < m

. From this we compute the MLE of

Λ = {(Λ_{i})}_{i = 1, \dots, m}

(see also Lemma 3):

Λ_{i} = \frac{M_{i} (τ_{m})}{w_{i} λ_{i}^{+} (π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(i, 1)} (τ_{m}) p_{U | τ_{i}, Θ}^{(1)} + \sum_{n^{'} = 2}^{3} π_{Θ_{n^{'}}}^{(i, n^{'})} (τ_{m}) p_{U | τ_{i}, Θ}^{(n^{'})})} for i = 1, \dots, m

(25)

If we insert this back into (24) we get (only stating relevant terms for parameter estimation)

\begin{matrix} ℓ_{τ_{m}} (Θ_{- 1}) & \propto & \sum_{i = 1}^{m} \sum_{n = 1}^{3} M_{i, n} log (\frac{p_{U | τ_{i}, Θ}^{(n)}}{π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(i, 1)} (τ_{m}) p_{U | τ_{i}, Θ}^{(1)} + \sum_{n^{'} = 2}^{3} π_{Θ_{n^{'}}}^{(i, n^{'})} (τ_{m}) p_{U | τ_{i}, Θ}^{(n^{'})}}) \\ + \sum_{i = 1}^{m} \sum_{n = 2}^{3} \sum_{ℓ = 1}^{M_{i, n}} log (f_{U | τ_{i}, Θ_{n}}^{(n)} (U_{ℓ}^{(i, n)})) \end{matrix}

where we have used Assumption 5 for

f_{U | t, Θ_{n}}^{(n)}

with

n = 2, 3

. Recall that we have normalization

\sum_{n^{'} = 1}^{3} p_{U | τ_{i}, Θ}^{(n^{'})} = 1

of the layer probabilities because the reporting delays need to be in one of the three layers. Assuming

p_{U | τ_{i}, Θ}^{(3)} > 0

we can normalize these probabilities by dividing by this third probability and setting

q_{τ_{i}}^{(n)} = p_{U | τ_{i}, Θ}^{(n)} / p_{U | τ_{i}, Θ}^{(3)} \geq 0

. From this we see that we can rewrite the last log-likelihood in terms of

q = {(q_{τ_{i}}^{(1)}, q_{τ_{i}}^{(2)}, q_{τ_{i}}^{(3)} = 1)}_{1 \leq i \leq m}

. This provides the log-likelihood

\begin{matrix} ℓ_{τ_{m}} (Θ_{2}, Θ_{3}, q) & \propto & - \sum_{i = 1}^{m} M_{i} log (π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(i, 1)} (τ_{m}) q_{τ_{i}}^{(1)} + \sum_{n^{'} = 2}^{3} π_{Θ_{n^{'}}}^{(i, n^{'})} (τ_{m}) q_{τ_{i}}^{(n^{'})}) \\ + \sum_{i = 1}^{m} \sum_{n = 1}^{2} M_{i, n} log (q_{τ_{i}}^{(n)}) + \sum_{i = 1}^{m} \sum_{n = 2}^{3} \sum_{ℓ = 1}^{M_{i, n}} log (f_{U | τ_{i}, Θ_{n}}^{(n)} (U_{ℓ}^{(i, n)})) \end{matrix}

(26)

To implement MLE of (26) there remains the calculation of

π_{Θ_{n^{'}}}^{(i, n^{'})} (τ_{m})

for

n^{'} = 2, 3

. This is what we are going to discuss next.

5.3.1. Choice of Layers and Approximate Log-likelihood

We still need to specify threshold

u^{(2)}

. The lower limit was chosen to be

u^{(1)} = 7

days. For

u^{(2)}

we test different reporting delays

κ = 3, 6, 9

or 12 months. Note that κ months is not well-defined in terms of number of days. We set

u^{(2)}

equal to

89, 181, 273

or 365 which is the minimal number of days that κ consecutive calendar months can have. The accident periods

(τ_{i - 1}, τ_{i}]

with

i \leq m - κ

are then fully observed in the middle layer at time

τ_{m}

, and we have

π_{Θ_{2}}^{(i, 2)} (τ_{m}) = 1 for 1 \leq i \leq m - κ

Thus, we only need to study

m - κ + 1 \leq i \leq m

in more detail for the middle reporting delay layer. As for the large reporting delay layer we see that there are no observations possible at time

τ_{m}

for accident periods

(τ_{i - 1}, τ_{i}]

with

m - κ + 2 \leq i \leq m

and, therefore,

π_{Θ_{3}}^{(i, 3)} (τ_{m}) = 0 for m - κ + 2 \leq i \leq m

The remaining layers and probabilities are more involved, and we consider these next.

Large reporting delay layer

[u^{(2)}, \infty)

with

n = 3

. For

i \leq m - κ + 1

we have under Assumption 5

π_{Θ_{3}}^{(i, 3)} (τ_{m}) = \frac{1}{λ_{i}^{+}} \int_{τ_{i - 1}}^{τ_{i}} λ_{t} F_{U | τ_{i}, Θ_{3}}^{(3)} (τ_{m} - t) d t

To simplify optimization (26) we assume that for reportings in the large layer the weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

only has a marginal influence. This is justified by the argument that between claims occurrence and reporting there are at least κ months of delay, and therefore the specific weekday of the accident should only marginally influence the late reporting (as long as a particular claim type cannot only occur on one specific weekday). For

i \leq m - κ

this provides the approximation

\begin{matrix} π_{Θ_{3}}^{(i, 3)} (τ_{m}) \approx {\tilde{π}}_{Θ_{3}}^{(i, 3)} (τ_{m}) = \frac{1}{Δ τ_{i}} \int_{τ_{m} - τ_{i}}^{τ_{m} - τ_{i - 1}} F_{U | τ_{i}, Θ_{3}}^{(3)} (u) d u \end{matrix}

The situation

i = m - κ + 1

is more delicate. We have

u^{(2)} \in {89, 181, 273, 365}

which is the minimal number of days that κ consecutive calendar months can have, the maximal number of days being

92, 184, 276

or 366 days, respectively. Therefore, the following integral may also be non-zero on time interval

(τ_{m - κ}, τ_{m - κ + 1}]

at time

τ_{m}

\begin{matrix} π_{Θ_{3}}^{(m - κ + 1, 3)} (τ_{m}) \approx {\tilde{π}}_{Θ_{3}}^{(m - κ + 1, 3)} (τ_{m}) & = & \frac{1}{Δ τ_{m - κ + 1}} \int_{τ_{m} - τ_{m - κ + 1}}^{τ_{m} - τ_{m - κ}} F_{U | τ_{m - κ + 1}, Θ_{3}}^{(3)} (u) d u \\ = & \frac{1}{Δ τ_{m - κ + 1}} \int_{u^{(2)}}^{τ_{m} - τ_{m - κ}} F_{U | τ_{m - κ + 1}, Θ_{3}}^{(3)} (u) d u \end{matrix}

Note that

Δ τ_{m - κ + 1} \geq 28

and

τ_{m} - τ_{m - κ} - u^{(2)} \leq 3

which implies that

{\tilde{π}}_{Θ_{3}}^{(m - κ + 1, 3)} (τ_{m}) \leq 3 / 28 < 1 / 9

for all m. Therefore, this term only marginally influences the results (and it could also be skipped for parameter estimation but we will keep it).

Middle reporting delay layer

[u^{(1)}, u^{(2)})

with

n = 2

. We still need to treat the cases

m - κ + 1 \leq i \leq m - 1

and

i = m

. We have for

m - κ + 1 \leq i \leq m

\begin{matrix} π_{Θ_{2}}^{(i, 2)} (τ_{m}) & = & \frac{1}{λ_{i}^{+}} \int_{τ_{i - 1}}^{τ_{i}} λ_{t} F_{U | τ_{i}, Θ_{2}}^{(2)} (τ_{m} - t) d t \\ = & \frac{1}{λ_{i}^{+}} \sum_{v = 0}^{Δ τ_{i} - 1} λ_{τ_{i} - v} \int_{v}^{v + 1} F_{U | τ_{i}, Θ_{2}}^{(2)} (τ_{m} - τ_{i} + u) d u \end{matrix}

As we indicate below, this can be implemented and MLE can be performed. In order to speed up the MLE optimization we also approximate

π_{Θ_{2}}^{(i, 2)} (τ_{m})

. However, this approximation is only used for parameter estimation, for the number of IBNYR claims estimation we will use the exact form

π_{Θ_{2}}^{(i, 2)} (τ_{m})

. For the approximation in MLE we also neglect weekday differences which provides for

m - κ + 1 \leq i \leq m - 1

\begin{matrix} π_{Θ_{2}}^{(i, 2)} (τ_{m}) \approx {\tilde{π}}_{Θ_{2}}^{(i, 2)} (τ_{m}) & = & \frac{1}{Δ τ_{i}} \int_{τ_{m} - τ_{i}}^{τ_{m} - τ_{i - 1}} F_{U | τ_{i}, Θ_{2}}^{(2)} (u) d u \end{matrix}

For

i = m

we have

F_{U | τ_{m}, Θ_{2}}^{(2)} (u) = 0

for

u \leq u^{(1)}

. This provides the approximation

\begin{matrix} π_{Θ_{2}}^{(m, 2)} (τ_{m}) \approx {\tilde{π}}_{Θ_{2}}^{(m, 2)} (τ_{m}) & = & \frac{1}{Δ τ_{m}} \int_{u^{(1)}}^{Δ τ_{m}} F_{U | τ_{m}, Θ_{2}}^{(2)} (u) d u \end{matrix}

because otherwise reporting delays belong to the small layer.

This allows us to approximate the log-likelihood (26) by (we also refer to Corollary 7, below)

\begin{matrix} {\tilde{ℓ}}_{τ_{m}} (Θ_{2}, Θ_{3}, q) & \propto & - \sum_{i = 1}^{m} M_{i} log (π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(i, 1)} (τ_{m}) q_{τ_{i}}^{(1)} + \sum_{n^{'} = 2}^{3} {\tilde{π}}_{Θ_{n^{'}}}^{(i, n^{'})} (τ_{m}) q_{τ_{i}}^{(n^{'})}) \\ + \sum_{i = 1}^{m} \sum_{n = 1}^{2} M_{i, n} log (q_{τ_{i}}^{(n)}) + \sum_{i = 1}^{m} \sum_{n = 2}^{3} \sum_{ℓ = 1}^{M_{i, n}} log (f_{U | τ_{i}, Θ_{n}}^{(n)} (U_{ℓ}^{(i, n)})) \end{matrix}

(27)

where several of the π’s and

\tilde{π}

’s are either 0 or 1 (we give them in detail below). To calculate them we need the following lemma.

Lemma 6.

Assume

X \sim F

and choose

x_{1} < x_{2}

. We have

\begin{matrix} \int_{x_{1}}^{x_{2}} F (x) d x & = & x_{2} F (x_{2}) - x_{1} F (x_{1}) - E [X 1_{{x_{1} < X \leq x_{2}}}] \end{matrix}

Proof of Lemma 6.

The proof follows by applying integration by parts. ☐

Corollary 7.

We choose

U^{(i, n)} \sim F_{U | τ_{i}, Θ_{n}}^{(n)}

and denote the corresponding expectation by

E_{U | τ_{i}, Θ_{n}}^{(n)}

.

Small layer $[0, u^{(1)}) = [0, 7)$ . For $1 \leq i \leq m - 1$ we have probability $π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(i, 1)} (τ_{m}) = 1$ and the case $i = m$ is given by (23).
Middle layer $[u^{(1)}, u^{(2)}) = [7, u^{(2)})$ . For $1 \leq i \leq m - κ$ we have probability $π_{Θ_{2}}^{(i, 2)} (τ_{m}) = {\tilde{π}}_{Θ_{2}}^{(i, 2)} (τ_{m}) = 1$ . For $m - κ + 1 \leq i \leq m - 1$ we have

$\begin{matrix} {\tilde{π}}_{Θ_{2}}^{(i, 2)} (τ_{m}) & = & \frac{τ_{m} - τ_{i - 1}}{Δ τ_{i}} F_{U | τ_{i}, Θ_{2}}^{(2)} (τ_{m} - τ_{i - 1}) - \frac{τ_{m} - τ_{i}}{Δ τ_{i}} F_{U | τ_{i}, Θ_{2}}^{(2)} (τ_{m} - τ_{i}) \\ - \frac{1}{Δ τ_{i}} E_{U | τ_{i}, Θ_{2}}^{(2)} [U^{(i, 2)} 1_{{τ_{m} - τ_{i} < U^{(i, 2)} \leq τ_{m} - τ_{i - 1}}}] \end{matrix}$

and for $i = m$

$\begin{matrix} {\tilde{π}}_{Θ_{2}}^{(m, 2)} (τ_{m}) & = & F_{U | τ_{m}, Θ_{2}}^{(2)} (Δ τ_{m}) - \frac{1}{Δ τ_{m}} E_{U | τ_{m}, Θ_{2}}^{(2)} [U^{(m, 2)} 1_{{u^{(1)} < U^{(m, 2)} \leq Δ τ_{m}}}] \end{matrix}$
Large layer $[u^{(2)}, \infty)$ . For $m - κ + 2 \leq i \leq m$ we have probability $π_{Θ_{3}}^{(i, 3)} (τ_{m}) = {\tilde{π}}_{Θ_{3}}^{(i, 3)} (τ_{m}) = 0$ . For $1 \leq i \leq m - κ$ we have

$\begin{matrix} {\tilde{π}}_{Θ_{3}}^{(i, 3)} (τ_{m}) & = & \frac{τ_{m} - τ_{i - 1}}{Δ τ_{i}} F_{U | τ_{i}, Θ_{3}}^{(3)} (τ_{m} - τ_{i - 1}) - \frac{τ_{m} - τ_{i}}{Δ τ_{i}} F_{U | τ_{i}, Θ_{3}}^{(3)} (τ_{m} - τ_{i}) \\ - \frac{1}{Δ τ_{i}} E_{U | τ_{i}, Θ_{3}}^{(3)} [U^{(i, 3)} 1_{{τ_{m} - τ_{i} < U^{(i, 3)} \leq τ_{m} - τ_{i - 1}}}] \end{matrix}$

and for $i = m - κ + 1$

$\begin{matrix} {\tilde{π}}_{Θ_{3}}^{(m - κ + 1, 3)} (τ_{m}) & = & \frac{τ_{m} - τ_{m - κ}}{Δ τ_{m - κ + 1}} F_{U | τ_{m - κ + 1}, Θ_{3}}^{(3)} (τ_{m} - τ_{m - κ}) \\ - \frac{1}{Δ τ_{m - κ + 1}} E_{U | τ_{m - κ + 1}, Θ_{3}}^{(3)} [U^{(m - κ + 1, 3)} 1_{{u^{(2)} < U^{(m - κ + 1, 3)} \leq τ_{m} - τ_{m - κ}}}] \end{matrix}$

Finally, we need to choose explicit distribution functions

F_{U | τ_{i}, Θ_{n}}^{(n)}

for the reporting delay layers

n = 2, 3

. This is what we are going to do in the next subsection.

5.3.2. Choice of Explicit Distributions and Layer Probabilities

There remains the modeling of the reporting delay distributions

F_{U | τ_{i}, Θ_{n}}^{(n)}

in the middle and large layers as well as the relative layer probabilities

q_{τ_{i}}^{(n)}

for

n = 1, 2

. We have considered different models, compared them to each other, checked them for robustness and applied statistical model selection criteria such as Akaike’s information criterion. Our favorite model that is at the same time not too difficult and gives appropriate results is the following: (i) for the middle layer

n = 2

we choose a stationary truncated log-normal distribution, (ii) for the upper layer

n = 3

we choose a stationary shifted log-normal distribution, and (iii) we choose non-stationary relative layer probabilities

q_{τ_{i}}^{(n)}

. We justify these choices by some statistical analysis below.

Lemma 8.

Assume

X^{(2)}

has a truncated log-normal distribution with parameters

μ_{2} \in R

and

σ_{2} > 0

supported in a non-empty interval

[ν_{i - 1}, ν_{i}] \subset R_{+}

. The density of

X^{(2)}

is given by

f^{(2)} (x) = \frac{1}{Φ (\frac{log ν_{i} - μ_{2}}{σ_{2}}) - Φ (\frac{log ν_{i - 1} - μ_{2}}{σ_{2}})} \frac{1}{\sqrt{2 π} σ_{2}} \frac{1}{x} exp \{- \frac{{(log x - μ_{2})}^{2}}{σ_{2}^{2}}\} 1_{{ν_{i - 1} \leq x \leq ν_{i}}}

The distribution of

X^{(2)}

is given by

F^{(2)} (x) = \frac{Φ (\frac{log (x \land ν_{i}) - μ_{2}}{σ_{2}}) - Φ (\frac{log ν_{i - 1} - μ_{2}}{σ_{2}})}{Φ (\frac{log ν_{i} - μ_{2}}{σ_{2}}) - Φ (\frac{log ν_{i - 1} - μ_{2}}{σ_{2}})} 1_{{ν_{i - 1} \leq x}}

The expectation of

X^{(2)}

on layer

(x_{1}, x_{2}] \subset [ν_{i - 1}, ν_{i}]

is given by

E [X^{(2)} 1_{{x_{1} < X^{(2)} \leq x_{2}}}] = exp {μ_{2} + σ_{2}^{2} / 2} \frac{Φ (\frac{log x_{2} - (μ_{2} + σ_{2}^{2})}{σ_{2}}) - Φ (\frac{log x_{1} - (μ_{2} + σ_{2}^{2})}{σ_{2}})}{Φ (\frac{log ν_{i} - μ_{2}}{σ_{2}}) - Φ (\frac{log ν_{i - 1} - μ_{2}}{σ_{2}})}

Assume

X^{(3)} = ν + Z \sim F^{(3)}

has a shifted log-normal distribution with Z being log-normally distributed with parameters

μ_{3} \in R

and

σ_{3} > 0

. We have on the layer

(x_{1}, x_{2}] \subset [ν, \infty)

, set

z_{1} = x_{1} - ν

and

z_{2} = x_{2} - ν

,

\begin{matrix} \int_{x_{1}}^{x_{2}} F^{(3)} (x) d x & = & z_{2} Φ (\frac{log z_{2} - μ_{3}}{σ_{3}}) - z_{1} Φ (\frac{log z_{1} - μ_{3}}{σ_{3}}) \\ - exp \{μ_{3} + \frac{σ_{3}^{2}}{2}\} [Φ (\frac{log z_{2} - (μ_{3} + σ_{3}^{2})}{σ_{3}}) - Φ (\frac{log z_{1} - (μ_{3} + σ_{3}^{2})}{σ_{3}})] \end{matrix}

Proof of Lemma 8.

The proof is a straightforward consequence of calculations with log-normal distributions, see also Section 3.2.3 in [21]. ☐

The model of Lemma 8 will be chosen below. We will compare it to the situation where we also have a truncated log-normal distribution for

F^{(3)}

in the upper layer, and we also compare it to the case of replacing the log-normal by gamma distributions. More details are provided below.

There remains the choice of

q_{τ_{i}}^{(n)} = p_{U | τ_{i}, Θ}^{(n)} / p_{U | τ_{i}, Θ}^{(3)} \geq 0

for

n = 1, 2

. We consider for break point

τ_{m_{0}} = 1 / 1 / 2006

,

γ > 0

and

t \in (τ_{i - 1}, τ_{i}]

the functional forms

\begin{matrix} p_{U | τ_{i}, Θ}^{(1)} / p_{U | τ_{i}, Θ}^{(3)} & = & q_{τ_{i}}^{(1)} = {\bar{q}}^{(1)} exp \{α {(i - m_{0})}_{+}^{γ}\} \\ p_{U | τ_{i}, Θ}^{(2)} / p_{U | τ_{i}, Θ}^{(3)} & = & q_{τ_{i}}^{(2)} = {\bar{q}}^{(2)} exp \{- α \frac{{\bar{q}}^{(1)}}{{\bar{q}}^{(2)}} {(i - m_{0})}_{+}^{γ}\} \end{matrix}

(28)

with given trend parameter

α \geq 0

after break point

τ_{m_{0}}

. For

α \to 0

we have

\begin{matrix} p_{U | t, Θ}^{(1)} & = & {\bar{q}}^{(1)} p_{U | τ_{i}, Θ}^{(3)} + α {\bar{q}}^{(1)} p_{U | τ_{i}, Θ}^{(3)} {(i - m_{0})}_{+}^{γ} + o (α) \\ p_{U | t, Θ}^{(2)} & = & {\bar{q}}^{(2)} p_{U | τ_{i}, Θ}^{(3)} - α {\bar{q}}^{(1)} p_{U | τ_{i}, Θ}^{(3)} {(i - m_{0})}_{+}^{γ} + o (α) \end{matrix}

This shows that we roughly model a γ-power increase/decrease after the break point

τ_{m_{0}}

. Normalization

p_{U | τ_{i}, Θ}^{(1)} + p_{U | τ_{i}, Θ}^{(2)} + p_{U | τ_{i}, Θ}^{(3)} = 1

implies

\begin{matrix} p_{U | τ_{i}, Θ}^{(3)} & = & {({\bar{q}}^{(1)} exp {α {(i - m_{0})}_{+}^{γ}} + {\bar{q}}^{(2)} exp {- α {(i - m_{0})}_{+}^{γ}} + 1)}^{- 1} \\ = & {({\bar{q}}^{(1)} + {\bar{q}}^{(2)} + 1 + o (α))}^{- 1} as α \to 0 \end{matrix}

This shows that

p_{U | τ_{i}, Θ}^{(3)}

is almost constant for α small, i.e., the break point only marginally influences late reportings (because it mainly speeds up immediate reporting after claims occurrence, this will be seen below and corresponds to the red graphs in Figure 11).

With these choices we revisit log-likelihood (27) which is now given by, set

\bar{q} = ({\bar{q}}^{(1)}, {\bar{q}}^{(2)}, α, γ)

,

\begin{matrix} {\bar{ℓ}}_{τ_{m}} (Θ_{2}, Θ_{3}, \bar{q}) & \propto & - \sum_{i = 1}^{m} M_{i} log (π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(i, 1)} (τ_{m}) q_{τ_{i}}^{(1)} + {\tilde{π}}_{Θ_{2}}^{(i, 2)} (τ_{m}) q_{τ_{i}}^{(2)} + {\tilde{π}}_{Θ_{3}}^{(i, 3)} (τ_{m})) \\ + \sum_{i = 1}^{m} \sum_{n = 1}^{2} M_{i, n} log (q_{τ_{i}}^{(n)}) + \sum_{i = 1}^{m} \sum_{n = 2}^{3} \sum_{ℓ = 1}^{M_{i, n}} log (f_{U | τ_{i}, Θ_{n}}^{(n)} (U_{ℓ}^{(i, n)})) \end{matrix}

(29)

If we use the explicit truncated/translated log-normal distributions introduced in Lemma 8 there are 9 parameters

({\bar{q}}^{(1)}, {\bar{q}}^{(2)}, α, γ, μ_{2}, σ_{2}, μ_{3}, σ_{3}, u^{(2)})

involved in this optimization. Note that we use canonical translation

ν = u^{(2)}

for the upper layer distribution

F_{U | t, Θ_{3}}^{(3)}

.

5.3.3. Preliminary Model Selection

We need to choose optimal parameters

({\bar{q}}^{(1)}, {\bar{q}}^{(2)}, α, γ, μ_{2}, σ_{2}, μ_{3}, σ_{3}, u^{(2)})

for the model presented in Lemma 8. This can be achieved by applying MLE to log-likelihood (29). Since, eventually, we would like to do this for any time point

τ_{m} \in {31 / 12 / 2001, \dots, 31 / 10 / 2010}

, this would be computationally too expensive, and also not sufficiently robust (over time). For this reason we do a preliminary model selection based on the data as of 31/10/2010. In this preliminary model selection analysis we determine (a) the explicit distributions, (b) the upper layer threshold

u^{(2)}

as well as (c) the parameter

γ > 0

of power function (28). Based on these three choices we then calibrate the model for the time series

τ_{m} \in {31 / 12 / 2001, \dots, 31 / 10 / 2010}

. These three choices are done based on Table 5, Table 6 and Table 7.

We start by comparing the following models for the pair

(F^{(2)}, F^{(3)})

: (i) truncated/shifted log-normal (as in Lemma 8); and (ii) truncated/truncated log-normal. For these two models we consider the static version

α = 0

in (28) and the dynamic version

α > 0

. We set

γ = 1 / 4

and

u^{(2)} = 6

months (this is further considered below) and then we calculate the MLE of

({\bar{q}}^{(1)}, {\bar{q}}^{(2)}

,

α = 0

,

μ_{2}, σ_{2}, μ_{3}, σ_{3})

(for the static version) and the MLE of

({\bar{q}}^{(1)}, {\bar{q}}^{(2)}, α, μ_{2}, σ_{2}, μ_{3}, σ_{3})

(for the dynamic version). Finally, we calculate Akaike’s information criterion (AIC) and the Bayesian information criterion (BIC) for these model choices. The model with the smallest AIC and BIC, respectively, should be preferred. The results are presented in Table 5 (top). LoB Property: the dynamic (

α > 0

) truncated/truncated log-normal model should be preferred, closely followed by the dynamic (

α > 0

) truncated/shifted log-normal one. LoB Casualty: the truncated/truncated log-normal model should be preferred, the static and the dynamic versions are judged by BIC rather similarly. We have decided in favor of the static version because this reduces the number of parameters to be selected.

Next we analyze AIC and BIC of the optimal layer limit

u^{(2)}

for the dynamic truncated/truncated log-normal model in LoB Property (for given

γ = 1 / 4

) and the static truncated/truncated log-normal model in LoB Casualty. The results are presented in Table 5 (bottom). We observe that we prefer for both LoBs a small threshold of

u^{(2)} = 3

months.

Next we do the same analysis for the parameter γ in the dynamic (

α > 0

) version of LoB Property, see Table 6 and we compare the log-normal model to the gamma model, see Table 7. From this preliminary analysis our conclusions are:

⊳: LoB Property: We choose the dynamic ( $α > 0$ ) truncated/truncated log-normal model with $γ = 1 / 4$ and $u^{(2)} = 3$ months.
⊳: LoB Casualty: We choose the static ( $α = 0$ ) truncated/truncated log-normal model with $u^{(2)} = 3$ months.

These are our preferred models as of

τ_{m} = 31 / 10 / 2010

and the remaining model parameters

({\bar{q}}^{(1)}, {\bar{q}}^{(2)}, α, μ_{2}, σ_{2}, μ_{3}, σ_{3})

are obtained by MLE from (29).

5.3.4. Model Calibration Over Time

In the previous subsection we have identified the optimal models as of

τ_{m} = 31 / 10 / 2010

using AIC and BIC. Observe that this is the optimal model selection with having maximal available data. This selection will be revised in this subsection because we study the models when incoming information increases over

τ_{m} \in {31 / 12 / 2001, \dots, 31 / 10 / 2010}

. In Figure 12 and Figure 13 we present the MLEs of parameters

(μ_{2}, σ_{2}, μ_{3}, σ_{3})

using (29) for (lhs) the truncated/truncated log-normal models and (rhs) the truncated/shifted log-normal models. LoB Property considers the dynamic version

α > 0

with

γ = 1 / 4

and LoB Casualty considers the static version

α = 0

. From this analysis we see that the parameters behave much more robust over time in the truncated/shifted log-normal model, see Figure 12 (rhs) and Figure 13 (rhs). For this reason we abandon our previous choice, and we select the truncated/shifted log-normal model for both LoBs. This is in slight contrast to the AIC and BIC analysis in the previous subsection, but the differences between the two models in Table 5 (lhs) are rather small which does not severely contradict the truncated/shifted model selection. In addition, for LoB Property we choose threshold

u^{(2)} = 3

months (Figure 12 (top, rhs)) and for LoB Casualty we decide for the bigger threshold

u^{(2)} = 6

months (Figure 13 (bottom, rhs)), also because estimation over time is more robust for this latter choice.

In Figure 11 we provide the estimated layer probabilities

p_{U | τ_{m}, Θ}^{(n)}

,

n = 1, 2, 3

, in the truncated/shifted log-normal model for LoB Property (with

u^{(2)} = 3

months) and for LoB Casualty (with

u^{(2)} = 6

months) for

τ_{m} \in {31 / 12 / 2010, \dots, 31 / 10 / 2010}

. The solid lines give the dynamic versions

α > 0

with

γ = 1 / 4

and the dotted lines the static versions

α = 0

. In LoB Property we observe stationarity of these layer probabilities up to break point

τ_{m_{0}} = 1 / 1 / 2006

, and our modeling approach Equation (28) with

γ = 1 / 4

seems to capture the non-stationarity after the break point rather well (note that the thin solid line shows an exact function

x^{1 / 4}

after the break point by considering the map

i \mapsto p_{U | τ_{i}, Θ}^{(n)}

at time

τ_{m} = 31 / 10 / 2010

and the dotted lines show the stationary case

α = 0

). In this analysis we also see that LoB Casualty may be slightly non-stationary after 1/1/2008, see Figure 11 (rhs). This was not detected previously, but is also supported by the time series of the trend parameter α estimates given in Figure 7 (rhs). However, since the resulting trend parameter α is comparably small we remain with the static version for LoB Casualty.

Our conclusions are as follows. For LoB Property we choose the dynamic (

α > 0

) truncated/shifted log-normal model with

γ = 1 / 4

and

u^{(2)} = 3

months; for LoB Casualty we choose the static (

α = 0

) truncated/shifted log-normal model with

u^{(2)} = 6

months. In Figure 14 we present the resulting calibration as of

τ_{m} = 31 / 10 / 2010

. In the middle layer the fitted distribution looks convincing, see Figure 14 (top). In the upper layer the fitted distribution is more conservative than the observations, see Figure 14 (bottom), this is supported by the fact that the empirical distribution is not sufficiently heavy-tailed because of missing information about late reportings (IBNYR claims in upper left triangles in Figure 2). This missing information has a bigger influence in casualty insurance because reporting delays are more heavy-tailed. Moreover, in both LoBs the shifted modeling approach is more conservative than the truncated one.

The model is now fully specified and we can estimate the number of IBNYR claims (late reportings). Using Equations (6), (25) and replacing all parameters Θ by their MLEs

{\hat{Θ}}^{(τ_{m})}

at time

τ_{m}

we receive estimate for the total number of incurred claims in time interval

(τ_{i - 1}, τ_{i}]

{\hat{N}}_{i}^{(τ_{m})} = w_{i} {\hat{Λ}}_{i}^{(τ_{m})} λ_{i}^{+} = \frac{M_{i} (τ_{m})}{\sum_{n = 1}^{3} π_{{\hat{Θ}}_{n}^{(τ_{m})}}^{(i, n)} (τ_{m}) p_{U | τ_{i}, {\hat{Θ}}^{(τ_{m})}}^{(n)}}

(30)

The number of estimated IBNYR claims is given by the difference

{\hat{IBNYR}}_{i}^{(τ_{m})} = {\hat{N}}_{i}^{(τ_{m})} - M_{i} (τ_{m})

. For illustration we choose three time points

τ_{m} \in {31 / 03 / 2004, 31 / 07 / 2007, 31 / 10 / 2010}

, and we always use the relevant available information at those time points

τ_{m}

. The results are presented in Figure 15, the blue/green line gives the number of reported claims

M_{i} (τ_{m})

and the red line the estimated number of incurred claims

{\hat{N}}_{i}^{(τ_{m})}

(the spread being the number of estimated IBNYR claims). We note that the spread is bigger for LoB Casualty than LoB Property (which is not surprising in view of our previous analysis). For LoB Property in Figure 15 (top) we also compare the static (gray) to the dynamic (red) estimation. The static version seems inappropriate since it cannot sufficiently capture the non-stationarity and estimation is too conservative after the break point

τ_{m_{0}} = 1 / 1 / 2006

.

In the final section of this paper, we back-test our calibration and estimation, and compare it to the chain-ladder estimation.

6. Homogeneous (Poisson) Chain-Ladder Case and Back-Testing

In this section we compare our individual claims modeling calibration to the classical chain-ladder method on aggregate data. The chain-ladder method is probably the most popular method in claims reserving. Interestingly, the cross-classified chain-ladder estimation can be derived under Model Assumptions 2 and additional suitable homogeneity assumptions.

6.1. Cross-Classified Chain-Ladder Model

We choose an equidistant grid

0 = τ_{0} < τ_{1} < τ_{2} < \dots with Δ τ_{i} = τ_{i} - τ_{i - 1} = τ_{1} for all i \in N .

Denote by

M_{i, j}

the number of claims with accident dates in

(τ_{i - 1}, τ_{i}]

and reporting dates in

(τ_{i + j - 1}, τ_{i + j}]

, for

i \in N

and

j \in N_{0}

. Under Model Assumptions 2 the random variables

M_{i, j}

are independent and Poisson distributed with exposures

W_{i, j} = \int_{τ_{i - 1}}^{τ_{i}} w (t) Λ (t) \int_{τ_{i + j - 1} - t}^{τ_{i + j} - t} f_{U | t, Θ} (u) d u d t

We now make the following homogeneity assumptions for

t \in (τ_{i - 1}, τ_{i}]

and

i \in N

w (t) Λ (t) = w_{τ_{i}} Λ_{τ_{i}}, f_{U | t, Θ} = f_{U | 0, Θ} = f_{U | Θ}

(31)

i.e., we may drop time index t in

f_{U | t, Θ}

and

F_{U | t, Θ}

, respectively. We define

W_{i} = Δ τ_{i} w_{τ_{i}} Λ_{τ_{i}}

. Assumptions (31) imply

W_{i, j} = W_{i} \frac{1}{Δ τ_{i}} \int_{τ_{i - 1}}^{τ_{i}} \int_{τ_{i + j - 1} - t}^{τ_{i + j} - t} f_{U | Θ} (u) d u d t = W_{i} \frac{1}{Δ τ_{1}} \int_{0}^{τ_{1}} \int_{τ_{j} - t}^{τ_{j + 1} - t} f_{U | Θ} (u) d u d t

If we now define reporting pattern

{(γ_{j})}_{j \geq 0}

by

γ_{j} = γ_{j} (Θ) = \frac{1}{Δ τ_{1}} \int_{0}^{τ_{1}} \int_{τ_{j} - t}^{τ_{j + 1} - t} f_{U | Θ} (u) d u d t = \frac{1}{Δ τ_{1}} \int_{0}^{τ_{1}} F_{U | Θ} (τ_{j + 1} - t)) - F_{U | Θ} (τ_{j} - t)) d t

we see that under (31) the random variables

M_{i, j}

are independent and Poisson distributed with

M_{i, j} \sim Poisson (W_{i} γ_{j})

(32)

Moreover, we have normalization

\sum_{j \geq 0} γ_{j} = 1

. This provides the cross-classified Poisson version of the chain-ladder model. Under the additional assumption that

\sum_{j = 0}^{J} γ_{j} = 1

for a finite J the MLE exactly provides the chain-ladder estimator. This result goes back to Hachemeister-Stanard [22], Kremer [23] and Mack [24], and for more details and the calculation of the chain-ladder estimator

{\hat{M}}_{i, j}^{C L (τ_{m})}

of

M_{i, j}

with

i + j > m

at time

τ_{m} \geq τ_{J}

we refer to Theorem 3.4 in Wüthrich-Merz [25]. Using these chain-ladder estimators we get estimate

{\hat{N}}_{i}^{C L (τ_{m})} = \sum_{j = 0}^{m - i} M_{i, j} + \sum_{j \geq m - i + 1} {\hat{M}}_{i, j}^{C L (τ_{m})} = M_{i} (τ_{m}) + \sum_{j \geq m - i + 1} {\hat{M}}_{i, j}^{C L (τ_{m})}

(33)

for the estimated number of claims in period

(τ_{i - 1}, τ_{i}]

with

τ_{i} \leq τ_{m}

. This chain-ladder estimate

{\hat{N}}_{i}^{C L (τ_{m})}

is compared to the estimate

{\hat{N}}_{i}^{(τ_{m})}

provided in (30).

6.2. Back-Testing

In this section we back-test the estimations obtained by calibration (21) and (29) and compare it to the homogeneous chain-ladder case (33). We therefore calculate for each exposure period

(τ_{i - 1}, τ_{i}]

with

τ_{i} \leq τ_{m} \in {31 / 12 / 2001, \dots, 31 / 10 / 2010}

the estimates

{\hat{N}}_{i}^{(τ_{m})}

and

{\hat{N}}_{i}^{C L (τ_{m})}

. These estimates are compared to the latest estimates

{\hat{N}}_{i}^{(τ_{e})}

and

{\hat{N}}_{i}^{C L (τ_{e})}

at time

τ_{e} = 31 / 10 / 2010

. In particular, we back-test the estimates with time lags

j = 0, 1

against the latest available estimation. We therefore define the ratios

\begin{matrix} χ_{i, 0} = \frac{{\hat{N}}_{i}^{(τ_{i})}}{{\hat{N}}_{i}^{(τ_{e})}} & and & χ_{i, 1} = \frac{{\hat{N}}_{i}^{(τ_{i + 1})}}{{\hat{N}}_{i}^{(τ_{e})}} \\ χ_{i, 0}^{C L} = \frac{{\hat{N}}_{i}^{C L (τ_{i})}}{{\hat{N}}_{i}^{C L (τ_{e})}} & and & χ_{i, 1}^{C L} = \frac{{\hat{N}}_{i}^{C L (τ_{i + 1})}}{{\hat{N}}_{i}^{C L (τ_{e})}} \end{matrix}

The first ratio

χ_{i, 0}

compares the estimation of the number of claims in period

(τ_{i - 1}, τ_{i}]

based on the information available at time

τ_{i}

to the latest available estimation at time

τ_{e} = 31 / 10 / 2010

, and the second variable

χ_{i, 1}

considers the same ratio but with the estimation based on the information at time

τ_{i + 1}

(i.e., after one period of development). Moreover, we calculate the relative process uncertainties for

j = 0, 1

defined by

ς_{i, j} = \frac{{({\hat{N}}_{i}^{(τ_{i + j})} - M_{i} (τ_{i + j}))}^{1 / 2}}{{\hat{N}}_{i}^{(τ_{e})}}

this is the relative standard deviation of the number of IBNYR claims at time

τ_{i + j}

for a Poisson random variable, normalized with

{\hat{N}}_{i}^{(τ_{e})}

to make it comparable to

χ_{i, j}

. We present the results in Figure 16:

Figure 16 (top), LoB Property: We see that the chain-ladder estimate ${\hat{N}}_{i}^{C L (τ_{i + j})}$ , $j = 0, 1$ , clearly over-estimates the number of claims after the break point $τ_{m_{0}} = 1 / 1 / 2006$ , whereas our non-stationary approach (28) can capture this change rather well and estimations $χ_{i, j}$ are centered around 1. Remarkable is that after 5 years of observations, the volatility of the estimation can almost completely be explained by process uncertainty (we plot confidence bounds of $2 ς_{i, j}$ ), which means that model uncertainty is comparably low. We also see that the faster reporting behavior after break point $τ_{m_{0}}$ has substantially decreased the uncertainty and the volatility in the number of IBNYR claims. Before the break point, uncertainty for $j = 0$ is comparably high, this can partly be explained by the fact that claims history is too short for model calibration.
Figure 16 (bottom), LoB Casualty: After roughly 5 years of claims experience the estimation in the individual claims model and the chain-ladder model are very similar. This indicates that in a stationary situation, the performance of the (simpler) chain-ladder model is sufficient. However, this latter statement needs to be revised because non-stationarity can be detected more easily in the individual claims history modeling approach, in particular, the individual claims model reacts more sensitively to structural changes. The confidence bounds have a reasonable size because the back-testing exercise does not violate them too often (i.e., the observations are mostly within the confidence bounds), however, in practice they should be chosen slightly bigger because they do not consider model uncertainty.

We close this section with a brief discussion of two related modeling approaches. Our approach can be viewed as a refined model of the one used in Antonio-Plat [17]. The first refinement we use is that we consider the weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

which was supported by the statistical analysis given in Figure 5. Neglecting this weekly periodic pattern would lead to less smooth small layer probabilities, in particular, in LoB Casualty, see Figure 4. The second refinement is that we choose a reporting delay distribution that depends on the weekday of the claims occurrence. This is especially important for the small reporting delay layer because weekday configuration essentially influences the short reporting delays. In our analysis, this then leads to a mixture model with three different layers. Antonio-Plat [17] use a mixture of a Weibull distribution and 9 degenerate components which fits their purposes well. Our approach raises the issue of over-parametrization which we analyze graphically, i.e., we observe rather stable parameter estimates after 4 years of observations, see for instance Figure 12 (rhs) and Figure 13 (rhs).

The issue of over-parametrization is of essential relevance if we would like to do prediction for future exposure periods. That is, in our analysis we have mainly concentrated on making statistical inference of the instantaneous claims frequency

Λ (t)

of past exposures

t \leq τ_{m}

at a given time point

τ \geq τ_{m}

. Going forward, we may also want to model and predict the claims occurrence and the claims reporting processes, respectively, of future exposures. An over-parametrized model will have a low predictive power because it involves too much model uncertainty, therefore we should choose as few parameters as necessary. Moreover, for predictive modeling it will also be necessary to model stochastically the instantaneous claims frequency process Λ, i.e., our statistical inference method is not sufficient for predicting claims of future exposures, but it will help to calibrate a stochastic model for Λ. A particularly interesting model is the marked Cox model proposed in Badescu et al. [18,19]. Similarly to Antonio-Plat [17], Badescu et al. [18,19] consider the piece-wise homogeneous case (7), but with

{(Λ_{i})}_{i}

being a state-dependent process which is driven by a time-homogeneous hidden Markov process (in the sense of a state-space model). We believe that this is a promising modeling approach for claims occurrence and reporting prediction, if merged with our weekday dependent features (both for claims occurrence and claims reporting delays).

6.3. Conclusions

We have provided an explicit calibration of a reporting delay model to individual claims data. For the two LoBs considered it takes about 5 years of observations to have sufficient information to calibrate the model (this is true for the stationary case and needs to be analyzed in more detail in a non-stationary situation). As long as the claims reporting process is stationary the individual claims reporting model and the aggregate chain-ladder model provide very similar estimates for the number of IBNYR claims, but as soon as we have non-stationarity the chain-ladder model fails to provide reliable estimates and one should use individual claims modeling. Moreover, our individual claims modeling approach is able to detect non-stationarity more quickly than the aggregate chain-ladder method. Going forward, there are two different directions that need to be considered. First, for the evaluation of parameter uncertainty one can embed our individual claims reporting model into a Bayesian framework (this is similar to the Cox model proposed in Badescu et al. [18,19]) or one can use bootstrap methods. Secondly, the far more difficult problem is the modeling of the cost evolution and the claims cash flow process. We believe that this is still an open problem and it is the next building block of individual claims reserving that should be studied on real data examples.

Author Contributions

Both authors have contributed equally to this paper.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Proofs

Proof of Lemma 3.

We maximize likelihood function (9). The derivatives of its logarithm provide for

i = 1, \dots, m

the requirements (denoted by

\overset{!}{=}

)

\frac{\partial {log L}_{{(T_{ℓ}, S_{ℓ})}_{ℓ = 1, \dots, M}} (Λ, Θ)}{\partial Λ_{i}} = - π_{Θ}^{(i)} (τ) w_{i} λ_{i}^{+} + \frac{M_{i}}{Λ_{i}} \overset{!}{=} 0

and for the derivative w.r.t. the reporting delay parameter Θ we obtain

\frac{\partial log L_{{(T_{ℓ}, S_{ℓ})}_{ℓ = 1, \dots, M}} (Λ, Θ)}{\partial Θ} = \sum_{i = 1}^{m} [- w_{i} Λ_{i} λ_{i}^{+} \frac{\partial}{\partial Θ} π_{Θ}^{(i)} (τ) + \sum_{ℓ = 1}^{M_{i}} \frac{\frac{\partial}{\partial Θ} f_{U | T_{ℓ}^{(i)}, Θ} (S_{ℓ}^{(i)} - T_{ℓ}^{(i)})}{f_{U | T_{ℓ}^{(i)}, Θ} (U_{ℓ}^{(i)} - S_{ℓ}^{(i)})}] \overset{!}{=} 0

The first requirements provide identity

Λ_{i} = \frac{M_{i}}{π_{Θ}^{(i)} (τ) w_{i} λ_{i}^{+}}

Plugging this into the second requirement provides identity

\begin{matrix} 0 & = & \sum_{i = 1}^{m} [- M_{i} \frac{\partial}{\partial Θ} log (π_{Θ}^{(i)} (τ)) + \sum_{ℓ = 1}^{M_{i}} \frac{\partial}{\partial Θ} log (f_{U | T_{ℓ}^{(i)}, Θ} (S_{ℓ}^{(i)} - T_{ℓ}^{(i)}))] \\ = & \sum_{i = 1}^{m} \sum_{ℓ = 1}^{M_{i}} \frac{\partial}{\partial Θ} [log (f_{U | T_{ℓ}^{(i)}, Θ} (S_{ℓ}^{(i)} - T_{ℓ}^{(i)})) - log (π_{Θ}^{(i)} (τ))] \\ = & \sum_{i = 1}^{m} \sum_{ℓ = 1}^{M_{i}} \frac{\partial}{\partial Θ} log (\frac{λ_{T_{ℓ}^{(i)}} f_{U | T_{ℓ}^{(i)}, Θ} (S_{ℓ}^{(i)} - T_{ℓ}^{(i)})}{λ_{i}^{+} π_{Θ}^{(i)} (τ)}) \end{matrix}

where in the last identity we have added constants in Θ which vanish under the derivative (note that these constants were added to indicate that we obtain the densities (10)). ☐

Proof of Lemma 4.

We maximize likelihood function (9) for

i \leq m^{*}

on weekly time grid

Δ τ_{i} = 7

under side constraint

\sum_{k = 1}^{7} λ_{k} = 7

and under assumption (11) which implies

π_{Θ}^{(i)} (τ) = 1

and

\frac{\partial}{\partial λ_{k}} π_{Θ}^{(i)} (τ) = 0

for

i \leq m^{*}

. The corresponding Lagrangian is given by

\sum_{i = 1}^{m^{*}} log L_{{(T_{ℓ}^{(i)}, S_{ℓ}^{(i)})}_{ℓ = 1, \dots, M_{i}}}^{(i)} (Λ, Θ) - χ (\sum_{k = 1}^{7} λ_{k} - 7)

Under the above assumptions we calculate the derivative w.r.t.

λ_{k}

,

1 \leq k \leq 7

, of the Lagrangian

\begin{matrix} \frac{\partial}{\partial λ_{k}} (\sum_{i = 1}^{m^{*}} log L_{{(T_{ℓ}^{(i)}, S_{ℓ}^{(i)})}_{ℓ = 1, \dots, M_{i}}}^{(i)} (Λ, Θ) - χ (\sum_{k = 1}^{7} λ_{k} - 7)) & = & \sum_{i = 1}^{m^{*}} - w_{i} Λ_{i} + \sum_{ℓ = 1}^{M_{i}} \frac{1}{λ_{k}} 1_{{⌈ {\tilde{T}}_{ℓ}^{(i)} ⌉ = k}} - χ \\ \overset{!}{=} & 0 \end{matrix}

and the derivative w.r.t.

Λ_{i}

are given by

\frac{\partial}{\partial Λ_{i}} (\sum_{i = 1}^{m^{*}} log L_{{(T_{ℓ}^{(i)}, S_{ℓ}^{(i)})}_{ℓ = 1, \dots, M_{i}}}^{(i)} (Λ, Θ) - χ (\sum_{k = 1}^{7} λ_{k} - 7)) = - w_{i} λ_{i}^{+} + \frac{M_{i}}{Λ_{i}} = - 7 w_{i} + \frac{M_{i}}{Λ_{i}} \overset{!}{=} 0

where we have used

λ_{i}^{+} = 7

on the weekly time grid

Δ τ_{i} = 7

. The latter implies

Λ_{i} = M_{i} / (7 w_{i})

and plugging this into the former requirement provides

\sum_{i = 1}^{m^{*}} - M_{i} / 7 + \sum_{ℓ = 1}^{M_{i}} \frac{1}{λ_{k}} 1_{{⌈ {\tilde{T}}_{ℓ}^{(i)} ⌉ = k}} - χ \overset{!}{=} 0

This implies that

λ_{k} = \frac{\sum_{i = 1}^{m^{*}} \sum_{ℓ = 1}^{M_{i}} 1_{{⌈ {\tilde{T}}_{ℓ}^{(i)} ⌉ = k}}}{χ + \sum_{i = 1}^{m^{*}} M_{i} / 7}

Lagrange multiplier χ is found from normalization

\sum_{k = 1}^{7} λ_{k} = 7

which provides the claim. ☐

References

W.S. Jewell. “Predicting IBNYR events and delays I. Continuous time.” ASTIN Bull. 19 (1989): 25–55. [Google Scholar] [CrossRef]
W.S. Jewell. “Predicting IBNYR events and delays II. Discrete time.” ASTIN Bull. 20 (1990): 93–111. [Google Scholar] [CrossRef]
H. Bühlmann, R. Schnieper, and E. Straub. “Claims reserves in casualty insurance based on a probabilistic model.” Bull. Swiss Assoc. Actuaries 1980 (1980): 21–45. [Google Scholar]
E. Arjas. “The claims reserving problem in non-life insurance: Some structural ideas.” ASTIN Bull. 19 (1989): 139–152. [Google Scholar] [CrossRef]
R. Norberg. “Prediction of outstanding liabilities in non-life insurance.” ASTIN Bull. 23 (1993): 95–115. [Google Scholar] [CrossRef]
R. Norberg. “Prediction of outstanding liabilities II. Model variations and extensions.” ASTIN Bull. 29 (1999): 5–25. [Google Scholar] [CrossRef]
S. Haastrup, and E. Arjas. “Claims reserving in continuous time; a nonparametric Bayesian approach.” ASTIN Bull. 26 (1996): 139–164. [Google Scholar] [CrossRef]
G. Taylor. The Statistical Distribution of Incurred Losses and Its Evolution Over Time I: Non-Parametric Models. Arlington, VA, USA: Casualty Actuarial Society, 1999, working paper. [Google Scholar]
G. Taylor. The Statistical Distribution of Incurred Losses and Its Evolution Over Time II: Parametric Models. Arlington, VA, USA: Casualty Actuarial Society, 1999, working paper. [Google Scholar]
T. Herbst. “An application of randomly truncated data models in reserving IBNR claims.” Insur. Math. Econ. 25 (1999): 123–131. [Google Scholar] [CrossRef]
C.R. Larsen. “An individual claims reserving model.” ASTIN Bull. 37 (2007): 113–132. [Google Scholar] [CrossRef]
G. Taylor, G. McGuire, and J. Sullivan. “Individual claim loss reserving conditioned by case estimates.” Ann. Actuar. Sci. 3 (2008): 215–256. [Google Scholar] [CrossRef]
A.H. Jessen, T. Mikosch, and G. Samorodnitsky. “Prediction of outstanding payments in a Poisson cluster model.” Scand. Actuar. J. 2011 (2011): 214–237. [Google Scholar] [CrossRef]
S. Rosenlund. “Bootstrapping individual claims histories.” ASTIN Bull. 42 (2012): 291–324. [Google Scholar]
M. Pigeon, K. Antonio, and M. Denuit. “Individual loss reserving with the multivariate skew normal framework.” ASTIN Bull. 43 (2013): 399–428. [Google Scholar] [CrossRef]
T. Agbeko, M. Hiabu, M.D. Martínez-Miranda, J.P. Nielsen, and R. Verrall. “Validating the double chain ladder stochastic claims reserving model.” Variance 8 (2014): 138–160. [Google Scholar]
K. Antonio, and R. Plat. “Micro-level stochastic loss reserving for general insurance.” Scand. Actuar. J. 2014 (2014): 649–669. [Google Scholar] [CrossRef]
A.L. Badescu, X.S. Lin, and D. Tang. “A marked Cox model for the number of IBNR claims: Theory.” Insur. Math. Econ. 69 (2016): 29–37. [Google Scholar] [CrossRef]
A.L. Badescu, X.S. Lin, and D. Tang. A Marked Cox Model for the Number of IBNR Claims: Estimation and Application. Version 14 March 2016; Rochester, NY, USA: SSRN, 2016. [Google Scholar]
D.V. Hinkley. “On the ratio of two correlated normal random variables.” Biometrika 56 (1969): 635–639. [Google Scholar] [CrossRef]
M.V. Wüthrich. Non-Life Insurance: Mathematics & Statistics. Version 15 April 2016; Rochester, NY, USA: SSRN, 2016. [Google Scholar] [CrossRef]
C.A. Hachemeister, and J.N. Stanard. “IBNR claims count estimation with static lag functions.” ASTIN Colloq., 1975. [Google Scholar]
E. Kremer. Einführung in die Versicherungsmathematik. Göttingen, Germany: Vandenhoek & Ruprecht, 1985. [Google Scholar]
T. Mack. “A simple parametric model for rating automobile insurance or estimating IBNR claims reserves.” ASTIN Bulletin 21 (1991): 93–109. [Google Scholar] [CrossRef]
M.V. Wüthrich, and M. Merz. Stochastic Claims Reserving Manual: Advances in Dynamic Modeling. Version 21 August 2015; Swiss Finance Institute Research Paper No. 15-34; Rochester, NY, USA: SSRN, 2015. [Google Scholar] [CrossRef]

Figure 1. Observed claims counts from 1/1/2001 until 31/10/2010: (top) LoB Property colored blue and (bottom) LoB Casualty colored green. The lhs gives daily claims counts and the rhs monthly claims counts; the red lines are the rolling averages over 30 days on the lhs and over 2 months on the rhs; violet dots show claims occurrence on Saturdays and orange dots claims occurrence on Sundays, the resulting statistics are provided in Table 1.

Figure 2. Observed and reported claims counts from 1/1/2001 until 31/10/2010: (top) LoB Property colored blue and (bottom) LoB Casualty colored green. The lhs gives daily claims reporting; the red lines are the 30 days rolling averages; violet dots show claims reporting on Saturdays and orange dots claims reporting on Sundays. The rhs plots accident dates

T_{ℓ}

versus reporting delays

U_{ℓ} = S_{ℓ} - T_{ℓ}

; the upper-right (white) triangle corresponds to the missing data (IBNYR claims); the blue/green dots illustrate reported and settled claims; the orange dots reported but not settled (RBNS) claims.

Figure 2. Observed and reported claims counts from 1/1/2001 until 31/10/2010: (top) LoB Property colored blue and (bottom) LoB Casualty colored green. The lhs gives daily claims reporting; the red lines are the 30 days rolling averages; violet dots show claims reporting on Saturdays and orange dots claims reporting on Sundays. The rhs plots accident dates

T_{ℓ}

versus reporting delays

U_{ℓ} = S_{ℓ} - T_{ℓ}

; the upper-right (white) triangle corresponds to the missing data (IBNYR claims); the blue/green dots illustrate reported and settled claims; the orange dots reported but not settled (RBNS) claims.

Figure 3. Box plots of the logged reporting delays

log (U_{ℓ})

on the yearly scale (lhs) LoB Property, (rhs) LoB Casualty.

Figure 3. Box plots of the logged reporting delays

log (U_{ℓ})

on the yearly scale (lhs) LoB Property, (rhs) LoB Casualty.

Figure 4. Estimated probability

π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(m, 1)} (τ_{m})

, see (23), for

τ_{m} \in {31 / 1 / 2001, \dots, 31 / 10 / 2010}

using weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

(blue/green) and setting

{(λ_{k})}_{k = 1, \dots, 7} \equiv 1

(black) for (lhs) LoB Property under the full model (21), and (rhs) LoB Casualty under the null hypothesis reduced model.

Figure 4. Estimated probability

π_{{\hat{Θ}}_{1}^{(τ_{m})}}^{(m, 1)} (τ_{m})

, see (23), for

τ_{m} \in {31 / 1 / 2001, \dots, 31 / 10 / 2010}

using weekly periodic pattern

{(λ_{k})}_{k = 1, \dots, 7}

(blue/green) and setting

{(λ_{k})}_{k = 1, \dots, 7} \equiv 1

(black) for (lhs) LoB Property under the full model (21), and (rhs) LoB Casualty under the null hypothesis reduced model.

Figure 5. Weekly periodic pattern estimate

{({\hat{λ}}_{k})}_{k = 1, \dots, 7}

(top) LoB Property and (bottom) LoB Casualty: (lhs) time series as a function of

τ_{m^{*}}

and (rhs) for maximal

τ_{m^{*}}

such that (11) holds at time

τ_{m} = 31 / 10 / 2010

for a maximal reporting delay of 2 years (LoB Property) and of 4 years (LoB Casualty). The confidence bounds in all plots are given by (12) for confidence level

α = 90 %

.

Figure 5. Weekly periodic pattern estimate

{({\hat{λ}}_{k})}_{k = 1, \dots, 7}

(top) LoB Property and (bottom) LoB Casualty: (lhs) time series as a function of

τ_{m^{*}}

and (rhs) for maximal

τ_{m^{*}}

such that (11) holds at time

τ_{m} = 31 / 10 / 2010

for a maximal reporting delay of 2 years (LoB Property) and of 4 years (LoB Casualty). The confidence bounds in all plots are given by (12) for confidence level

α = 90 %

.

Figure 6. (top) LoB Property and (bottom) LoB Casualty: empirical distribution of reporting delays separated by weekdays of claims occurrence of all data with accident date prior to 01/2006 and maximal reporting delay of

U_{ℓ} \leq 365

days (lhs) per day, (middle) compressed by weekends, and (rhs) compressed and normalized to 1 after a delay of one week.

Figure 6. (top) LoB Property and (bottom) LoB Casualty: empirical distribution of reporting delays separated by weekdays of claims occurrence of all data with accident date prior to 01/2006 and maximal reporting delay of

U_{ℓ} \leq 365

days (lhs) per day, (middle) compressed by weekends, and (rhs) compressed and normalized to 1 after a delay of one week.

Figure 7. (lhs) small reporting delay layer as of 31/10/2010 for occurrence dates in 10/2010 of LoB Property, and (rhs) estimated trend parameter α at times

τ_{m} \in {31 / 12 / 2001, \dots, 31 / 10 / 2010}

for LoB Property (blue/light blue) and LoB Casualty (green/light green) with dark colors are for

u^{(2)} = 3

months and light colors for

u^{(2)} = 6

months.

Figure 7. (lhs) small reporting delay layer as of 31/10/2010 for occurrence dates in 10/2010 of LoB Property, and (rhs) estimated trend parameter α at times

τ_{m} \in {31 / 12 / 2001, \dots, 31 / 10 / 2010}

for LoB Property (blue/light blue) and LoB Casualty (green/light green) with dark colors are for

u^{(2)} = 3

months and light colors for

u^{(2)} = 6

months.

Figure 8. LoB Property, small reporting delay layer: estimated cumulative distribution

F_{U | t, Θ}^{(1)}

per weekday

s = ⌈ t ⌉

. Dotted lines show calibration based on all observations since

τ_{0}

and lines show calibration based on a rolling window of length 2·365 days.

Figure 8. LoB Property, small reporting delay layer: estimated cumulative distribution

F_{U | t, Θ}^{(1)}

per weekday

s = ⌈ t ⌉

. Dotted lines show calibration based on all observations since

τ_{0}

and lines show calibration based on a rolling window of length 2·365 days.

Figure 9. LoB Casualty, small reporting delay layer: estimated cumulative distribution

F_{U | t, Θ}^{(1)}

per weekday

s = ⌈ t ⌉

. Dotted lines show calibration based on all observations since

τ_{0}

and lines show calibration based on a rolling window of length 2·365 days.

Figure 9. LoB Casualty, small reporting delay layer: estimated cumulative distribution

F_{U | t, Θ}^{(1)}

per weekday

s = ⌈ t ⌉

. Dotted lines show calibration based on all observations since

τ_{0}

and lines show calibration based on a rolling window of length 2·365 days.

Figure 10. LoB Casualty, small reporting delay layer: estimated cumulative distribution

F_{U | t, Θ}^{(1)}

per weekday

s = ⌈ t ⌉

. Dotted lines show calibration based on individual weekdays and reporting delays and lines show calibration under compressed weekends and the null hypothesis that we only need three parameters.

Figure 10. LoB Casualty, small reporting delay layer: estimated cumulative distribution

F_{U | t, Θ}^{(1)}

per weekday

s = ⌈ t ⌉

. Dotted lines show calibration based on individual weekdays and reporting delays and lines show calibration under compressed weekends and the null hypothesis that we only need three parameters.

Figure 11. Estimated layer probabilities

p_{U | τ_{m}, Θ}^{(n)}

,

n = 1, 2, 3

, in the truncated/shifted log-normal model (lhs) LoB Property with

u^{(2)} = 3

and (rhs) LoB Casualty with

u^{(2)} = 6

months for

τ_{m} \in {31 / 12 / 2010, \dots, 31 / 10 / 2010}

; the solid lines give the dynamic versions

α > 0

with

γ = 1 / 4

and the dotted lines the static versions

α = 0

.

Figure 11. Estimated layer probabilities

p_{U | τ_{m}, Θ}^{(n)}

,

n = 1, 2, 3

, in the truncated/shifted log-normal model (lhs) LoB Property with

u^{(2)} = 3

and (rhs) LoB Casualty with

u^{(2)} = 6

months for

τ_{m} \in {31 / 12 / 2010, \dots, 31 / 10 / 2010}

; the solid lines give the dynamic versions

α > 0

with

γ = 1 / 4

and the dotted lines the static versions

α = 0

.

Figure 12. LoB Property: parameter estimates of

(μ_{2}, σ_{2}, μ_{3}, σ_{3})

for the dynamic model

α > 0

with

γ = 1 / 4

for

τ_{m} \in {31 / 12 / 2010, \dots, 31 / 10 / 2010}

(top, lhs)

u^{(2)} = 3

months truncated/truncated and (top, rhs) and truncated/shifted; and (bottom, lhs)

u^{(2)} = 6

months truncated/truncated and (bottom, rhs) and truncated/shifted.

Figure 12. LoB Property: parameter estimates of

(μ_{2}, σ_{2}, μ_{3}, σ_{3})

for the dynamic model

α > 0

with

γ = 1 / 4

for

τ_{m} \in {31 / 12 / 2010, \dots, 31 / 10 / 2010}

(top, lhs)

u^{(2)} = 3

months truncated/truncated and (top, rhs) and truncated/shifted; and (bottom, lhs)

u^{(2)} = 6

months truncated/truncated and (bottom, rhs) and truncated/shifted.

Figure 13. LoB Casualty: parameter estimates of

(μ_{2}, σ_{2}, μ_{3}, σ_{3})

for the static model

α = 0

for

τ_{m} \in {31 / 12 / 2010, \dots, 31 / 10 / 2010}

(top, lhs)

u^{(2)} = 3

months truncated/truncated and (top, rhs) and truncated/shifted; and (bottom, lhs)

u^{(2)} = 6

months truncated/truncated (bottom, rhs) and truncated/shifted.

Figure 13. LoB Casualty: parameter estimates of

(μ_{2}, σ_{2}, μ_{3}, σ_{3})

for the static model

α = 0

for

τ_{m} \in {31 / 12 / 2010, \dots, 31 / 10 / 2010}

(top, lhs)

u^{(2)} = 3

months truncated/truncated and (top, rhs) and truncated/shifted; and (bottom, lhs)

u^{(2)} = 6

months truncated/truncated (bottom, rhs) and truncated/shifted.

Figure 14. (top) calibration of truncated log-normal distribution in the middle layer

[u^{(1)}, u^{(2)})

for (lhs) LoB Property with

u^{(2)} = 3

months and (rhs) LoB Casualty with

u^{(2)} = 6

months; (bottom) calibration of shifted and truncated log-normal distributions in the large layer

[u^{(2)}, \infty)

for (lhs) LoB Property with

u^{(2)} = 3

months (static and dynamic versions) and (rhs) LoB Casualty with

u^{(2)} = 6

months (only static versions).

Figure 14. (top) calibration of truncated log-normal distribution in the middle layer

[u^{(1)}, u^{(2)})

for (lhs) LoB Property with

u^{(2)} = 3

months and (rhs) LoB Casualty with

u^{(2)} = 6

months; (bottom) calibration of shifted and truncated log-normal distributions in the large layer

[u^{(2)}, \infty)

for (lhs) LoB Property with

u^{(2)} = 3

months (static and dynamic versions) and (rhs) LoB Casualty with

u^{(2)} = 6

months (only static versions).

Figure 15. Estimation of the number of incurred claims

{\hat{N}}_{i}^{(τ_{m})}

for (top) LoB Property and (bottom) LoB Casualty at times

τ_{m} \in {31 / 03 / 2004, 31 / 07 / 2007, 31 / 10 / 2010}

using the truncated/shifted log-normal model with

u^{(2)} = 3

months and

u^{(2)} = 6

months, respectively; “observed” (blue/green) gives the number of reported claims

M_{i} (τ_{m})

in each period

(τ_{i - 1}, τ_{i}]

at time

τ_{m}

, “dynamic/static” (red) gives the total number of estimated claims

{\hat{N}}_{i}^{(τ_{m})}

(the spread

{\hat{N}}_{i}^{(τ_{m})} - M_{i} (τ_{m})

giving the estimated number of IBNYR claims).

Figure 15. Estimation of the number of incurred claims

{\hat{N}}_{i}^{(τ_{m})}

for (top) LoB Property and (bottom) LoB Casualty at times

τ_{m} \in {31 / 03 / 2004, 31 / 07 / 2007, 31 / 10 / 2010}

using the truncated/shifted log-normal model with

u^{(2)} = 3

months and

u^{(2)} = 6

months, respectively; “observed” (blue/green) gives the number of reported claims

M_{i} (τ_{m})

in each period

(τ_{i - 1}, τ_{i}]

at time

τ_{m}

, “dynamic/static” (red) gives the total number of estimated claims

{\hat{N}}_{i}^{(τ_{m})}

(the spread

{\hat{N}}_{i}^{(τ_{m})} - M_{i} (τ_{m})

giving the estimated number of IBNYR claims).

Figure 16. Back-test LoB Property (top) and LoB Casualty (bottom): we compare the (non-stationary) estimate

χ_{i, j}

(blue/green) to the (stationary) chain-ladder estimate

χ_{i, j}^{C L}

(orange) for (lhs) time lag

j = 0

and (rhs) time lag

j = 1

. The black line shows the process uncertainty confidence bounds of 2 (relative) standard deviations

ς_{i, j}

.

Figure 16. Back-test LoB Property (top) and LoB Casualty (bottom): we compare the (non-stationary) estimate

χ_{i, j}

(blue/green) to the (stationary) chain-ladder estimate

χ_{i, j}^{C L}

(orange) for (lhs) time lag

j = 0

and (rhs) time lag

j = 1

. The black line shows the process uncertainty confidence bounds of 2 (relative) standard deviations

ς_{i, j}

.

Table 1. Statistics per weekday: average daily claims counts and empirical standard deviation for LoB Property and LoB Casualty.

**Table 1.** Statistics per weekday: average daily claims counts and empirical standard deviation for LoB Property and LoB Casualty.
	Mon	Tue	Wed	Thu	Fri	Sat	Sun
LoB Property
average	11.48	11.02	11.87	12.15	14.23	15.46	10.73
standard deviation	4.51	4.05	4.26	4.30	4.66	4.95	4.26
LoB Casualty
average	2.89	2.92	2.89	2.65	2.61	1.09	0.82
standard deviation	2.72	2.82	2.63	2.40	2.45	2.08	1.80

Table 2. Observed number of reported claims

M_{s, u}^{''} (τ_{m})

for weekdays

1 \leq s \leq 7

and reporting delays

0 \leq u \leq 6 = u^{(1)} - 1

at time

τ_{m} = 31 / 10 / 2010

for claims with accident dates before 26/10/2010 of LoB Property.

**Table 2.** Observed number of reported claims $M_{s, u}^{''} (τ_{m})$ for weekdays $1 \leq s \leq 7$ and reporting delays $0 \leq u \leq 6 = u^{(1)} - 1$ at time $τ_{m} = 31 / 10 / 2010$ for claims with accident dates before 26/10/2010 of LoB Property.
$u =$	0	1	2	3	4	5	6
Mon $s = 1$	164	482	254	220	172	5	3
Tue $s = 2$	172	504	255	221	3	5	207
Wed $s = 3$	178	545	258	17	4	246	248
Thu $s = 4$	163	542	20	9	355	239	257
Fri $s = 5$	193	88	15	651	369	296	287
Sat $s = 6$	49	33	754	445	314	266	266
Sun $s = 7$	20	470	307	211	195	197	6

Table 3. Observed number of reported claims

M_{s, u}^{''} (τ_{m})

for weekdays

1 \leq s \leq 7

and reporting delays

0 \leq u \leq 6 = u^{(1)} - 1

at time

τ_{m} = 31 / 10 / 2010

for claims with accident dates before 26/10/2010 of LoB Casualty, the rhs is compressed by weekends.

**Table 3.** Observed number of reported claims $M_{s, u}^{''} (τ_{m})$ for weekdays $1 \leq s \leq 7$ and reporting delays $0 \leq u \leq 6 = u^{(1)} - 1$ at time $τ_{m} = 31 / 10 / 2010$ for claims with accident dates before 26/10/2010 of LoB Casualty, the rhs is compressed by weekends.
$u =$	0	1	2	3	4	5	6	0*	1*	2*	3*	4*
Mon $s = 1$	17	31	33	41	34	0	0	17	31	33	41	34
Tue $s = 2$	15	36	25	42	0	0	30	15	36	25	42	30
Wed $s = 3$	16	40	25	0	0	43	53	16	40	25	43	53
Thu $s = 4$	19	43	0	0	33	27	38	19	43	33	27	38
Fri $s = 5$	12	0	0	29	40	33	40	12	29	40	33	40
Sat $s = 6$	0	0	9	10	12	10	15	9	10	12	10	15
Sun $s = 7$	0	5	12	7	5	10	0	5	12	7	5	10

Table 4. p-values of the

χ^{2}

-tests under the corresponding null hypotheses for test statistics

χ_{s}^{2}

, see Equation (22), for weekdays

s = 1, \dots, 7

(with 4 degrees of freedom).

**Table 4.** p-values of the $χ^{2}$ -tests under the corresponding null hypotheses for test statistics $χ_{s}^{2}$ , see Equation (22), for weekdays $s = 1, \dots, 7$ (with 4 degrees of freedom).
	Mon $s = 1$	Tue $s = 2$	Wed $s = 3$	Thu $s = 4$	Fri $s = 5$	Sat $s = 6$	Sun $s = 7$
p-values	79%	29%	3.2%	37%	44%	52%	47%

Table 5. AIC and BIC as of 31/10/2010 (top)

γ = 1 / 4

and

u^{(2)} = 6

months and (bottom) LoB Property truncated/truncated dynamic (

α > 0

) log-normal model with

γ = 1 / 4

and LoB Casualty truncated/truncated static (

α = 0

) log-normal model.


Method: Log-normal Distribution	AIC	BIC
LoB Property
truncated/shifted (static $α = 0$ )	347’532	347’592
truncated/truncated (static $α = 0$ )	347’461	347’522
truncated/shifted (dynamic $α > 0$ )	339’270	339’348
truncated/truncated (dynamic $α > 0$ )	339’204	339’274
LoB Casualty
truncated/shifted (static $α = 0$ )	87’979	88’028
truncated/truncated (static $α = 0$ )	87’941	87’990
truncated/shifted (dynamic $α > 0$ )	87’973	88’028
truncated/truncated (dynamic $α > 0$ )	87’934	87’990


Threshold	AIC	BIC
LoB Property
$u^{(2)} = 3$ months	339’033	339’103
$u^{(2)} = 6$ months	339’204	339’274
$u^{(2)} = 9$ months	339’291	339’361
$u^{(2)} = 12$ months	339’376	339’445
LoB Casualty
$u^{(2)} = 3$ months	87’908	87’957
$u^{(2)} = 6$ months	87’941	87’990
$u^{(2)} = 9$ months	87’963	88’013
$u^{(2)} = 12$ months	87’988	88’037

Table 6. AIC and BIC as of 31/10/2010 for LoB Property truncated/truncated dynamic log-normal model with

u^{(2)} = 6

months.

**Table 6.** AIC and BIC as of 31/10/2010 for LoB Property truncated/truncated dynamic log-normal model with $u^{(2)} = 6$ months.
Threshold	AIC	BIC
LoB Property
$γ = 1 / 2$ months	339’594	339’663
$γ = 1 / 3$ months	339’267	339’337
$γ = 1 / 4$ months	339’204	339’274
$γ = 1 / 5$ months	339’222	339’292

Table 7. AIC and BIC as Table 5 (top) but with log-normal distributions replaced by gamma distributions.

**Table 7.** AIC and BIC as Table 5 (top) but with log-normal distributions replaced by gamma distributions.
Method: Gamma Distribution	AIC	BIC
LoB Property
truncated/shifted (static $α = 0$ )	348’121	348’174
truncated/truncated (static $α = 0$ )	348’109	348’161
truncated/shifted (dynamic $α > 0$ )	339’935	340’005
truncated/truncated (dynamic $α > 0$ )	339’856	339’926
LoB Casualty
truncated/shifted (static $α = 0$ )	88’000	88’042
truncated/truncated (static $α = 0$ )	88’887	88’929
truncated/shifted (dynamic $α > 0$ )	87’995	88’051
truncated/truncated (dynamic $α > 0$ )	88’888	88’943

© 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Verrall, R.J.; Wüthrich, M.V. Understanding Reporting Delay in General Insurance. Risks 2016, 4, 25. https://doi.org/10.3390/risks4030025

AMA Style

Verrall RJ, Wüthrich MV. Understanding Reporting Delay in General Insurance. Risks. 2016; 4(3):25. https://doi.org/10.3390/risks4030025

Chicago/Turabian Style

Verrall, Richard J., and Mario V. Wüthrich. 2016. "Understanding Reporting Delay in General Insurance" Risks 4, no. 3: 25. https://doi.org/10.3390/risks4030025

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Understanding Reporting Delay in General Insurance

Abstract

1. Introduction

2. Individual Claims Arrival Modeling

3. Description of the Data

4. Seasonal Claims Frequency Modeling

4.1. Likelihood Function with Seasonality

4.2. Analysis of the MLE System

4.3. Calibration of the Weekly Periodic Pattern

5. Calibration of the Reporting Delay Distribution

5.1. Decoupling of the Reporting Delay Distribution

5.2. Calibration of the Small Reporting Delay Layer

5.2.1. Model in the Small Reporting Delay Layer

5.2.2. Empirics and Fitting the Small Reporting Delay Layer Distributions

5.3. Calibration of Middle and Large Reporting Delay Layers

5.3.1. Choice of Layers and Approximate Log-likelihood

5.3.2. Choice of Explicit Distributions and Layer Probabilities

5.3.3. Preliminary Model Selection

5.3.4. Model Calibration Over Time

6. Homogeneous (Poisson) Chain-Ladder Case and Back-Testing

6.1. Cross-Classified Chain-Ladder Model

6.2. Back-Testing

6.3. Conclusions

Author Contributions

Conflicts of Interest

Appendix A. Proofs

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI