Next Article in Journal
Actuarial Geometry
Previous Article in Journal
Maximum Market Price of Longevity Risk under Solvency Regimes: The Case of Solvency II
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

State Space Models and the Kalman-Filter in Stochastic Claims Reserving: Forecasting, Filtering and Smoothing

by
Nataliya Chukhrova
and
Arne Johannssen
*
Faculty of Business Administration, University of Hamburg, 20146 Hamburg, Germany
*
Author to whom correspondence should be addressed.
Risks 2017, 5(2), 30; https://doi.org/10.3390/risks5020030
Submission received: 1 April 2017 / Revised: 7 May 2017 / Accepted: 15 May 2017 / Published: 27 May 2017

Abstract

:
This paper gives a detailed overview of the current state of research in relation to the use of state space models and the Kalman-filter in the field of stochastic claims reserving. Most of these state space representations are matrix-based, which complicates their applications. Therefore, to facilitate the implementation of state space models in practice, we present a scalar state space model for cumulative payments, which is an extension of the well-known chain ladder (CL) method. The presented model is distribution-free, forms a basis for determining the entire unobservable lower and upper run-off triangles and can easily be applied in practice using the Kalman-filter for prediction, filtering and smoothing of cumulative payments. In addition, the model provides an easy way to find outliers in the data and to determine outlier effects. Finally, an empirical comparison of the scalar state space model, promising prior state space models and some popular stochastic claims reserving methods is performed.

1. Introduction

At the end of each fiscal year, non-life insurance companies face the situation that the earned premiums are known, but not the outstanding loss liabilities. Consequently, one of the main tasks for actuaries in non-life insurance is to quantify accurately the outstanding loss liabilities. The outstanding claims reserves are often a large share of the liability side of the balance sheet, so it is very important for every non-life insurer to handle claims reserving adequately. It is not surprising, therefore, that over the past 40 years, numerous reserving methods have been developed, particularly since the early 1990s. These methods are based on various models (see Wüthrich and Merz (2008)), but rarely on time series models. This is surprising, especially in light of the fact that the claims process is a stochastic process, and claims data, as a sequence of discrete time data, represent time series.
In this paper, methods of claims reserving are considered, which are based on time series models, particularly on state space models. A state space model consists of a state equation describing the dynamics of the system and an observation equation establishing a link between the unobservable states of the system and the observations. Compared to other models, state space models have the advantage that the temporal dynamics of a system can often be detected more accurately. In addition, state space representations can be used flexibly to model univariate and multivariate, stationary and non-stationary time series or in cases of structural changes, interventions, missing data or other data irregularities. A consideration of state space models leads directly to the application of the Kalman-filter algorithms for parameter estimation, forecasting, filtering and smoothing. The Kalman-filter is generally distribution-free and provides the best linear predictors in the sense of minimizing the mean squared error (see Kalman (1960)).
There are currently 16 research papers in stochastic claims reserving on the topic of state space models and the Kalman-filter. Most of these state space representations are based on a calendar year approach, i.e., all available observations of one calendar year are stacked into one observation vector. The matrix-based approach of these models and the resulting Kalman recursions can complicate their applications. Therefore, to facilitate the implementation of state space models in practice, a new scalar approach is introduced in this paper. The resulting distribution-free scalar state space model for cumulative payments can be considered as an extension of the chain ladder (CL) method under the assumption that the observations in the upper triangle are based on unobservable states.
According to this content, the paper is structured as follows. Section 2 gives a brief overview of claims development triangles and the CL method. In Section 3, we present the previous papers in stochastic claims reserving on the topic of state space models, their contentual similarities and selected modeling approaches. Section 4 introduces the scalar state space model and gives derivations of the formulas for claims reserves and the mean squared error of prediction (MSEP). An empirical comparison of the scalar state space model and promising state space models, as well as popular stochastic claims reserving methods is given in Section 5. Finally, Section 6 provides concluding remarks.

2. Development Triangles and the CL Method

Statistical analysis of claims payments data is usually done by using claims development triangles. In this paper, we denote accident years by i = 0 , , I , development years by j = 0 , , J and calendar years by t = i + j . We assume further that all claims are settled after the J-th development year. Cumulative payments in accident year i after j development years are denoted by C i , j , and incremental payments in the j-th development year are denoted by X i , j , respectively. An exemplary claims development triangle (see Table 1) is the dataset of Taylor and Ashe (1983), which includes data from the motor bodily injury class of business in one Australian state (1972–1981, quoted in Australian Dollars). Due to its popularity in claims reserving literature, the dataset of Taylor and Ashe (1983) acts like a kind of benchmark development triangle for claims reserving methods.
The CL method is probably the most popular loss reserving technique, and there are several stochastic models that could be used as a basis for the application of the CL method. In this paper, we introduce briefly the distribution-free CL model of Mack (1993).
Model Assumptions 1
(Distribution-free CL model of Mack (1993)).
Cumulative payments C i , j of different accident years i are stochastically independent.
There exist factors f 0 , , f J 1 > 0 and variance parameters σ 0 2 , , σ J 1 2 > 0 , such that for all i = 0 , , I and all j = 0 , , J 1 , we have:
E [ C i , j + 1 | C i , 0 , , C i , j ] = E [ C i , j + 1 | C i , j ] = f j C i , j Var ( C i , j + 1 | C i , 0 , , C i , j ) = Var ( C i , j + 1 | C i , j ) = σ j 2 C i , j
The development factors and variance parameters are then estimated using the (unconditionally) unbiased estimators:
f ^ j = i = 0 I j 1 C i , j + 1 i = 0 I j 1 C i , j
and:
σ ^ j 2 = 1 I j 1 i = 0 I j 1 C i , j C i , j + 1 C i , j f ^ j 2
for f j and σ j 2 , respectively (for more details, see Mack (1993)).

3. Prior Applications in Stochastic Claims Reserving

This section presents the previous papers in claims reserving literature on the topic of state space models and the Kalman-filter, their relationships and selected modeling approaches of claims development data, as well as their state space representations.

3.1. Chronology and Categorization of the Papers

There are 16 papers in the claims reserving literature that are based on state space models and the Kalman-filter. Figure 1 displays these papers in chronological order and classifies them into four categories considering their substantial contentual similarities. The formulated categories “parametric development”, “log-normal model”, “different approaches” and “comparisons” need not be taken as mutually exclusive. Instead, the choice of the appropriate category is made considering the main approach used in the respective paper. In the papers of the first category a parametric development of claims data over the development years is assumed. The papers of the second category are based on the log-normal model for incremental payments, while in the third category, different approaches are aggregated. The fourth category unites comparisons of various reserving methods. The solid arrows in Figure 1 represent the contentual similarities among the papers in modeling approaches. The dashed arrows indicate, however, that the respective models are included in the papers of the fourth category that compares various models.

3.2. Modeling of Claims Development Data

In the first category, the authors De Jong and Zehnwirth (1983), Wright (1990), Zehnwirth (1997), Taylor et al. (2003) and Pang and He (2012) mainly use variations on the Hoerl curve to model the claims data over the development years. The general exponential-logarithmic Hoerl curve is given as β j = exp ( κ j + δ log j ) with development year parameter β j for all j = 0 , , J and κ , δ R . An advantage of treating development time j as a continuous covariate is that extrapolation is possible beyond the range of development times observed (see Frees (2010) and Kaas et al. (2009)). As a variation, De Jong and Zehnwirth (1983), for example, define incremental payments X i , j as:
X i , j = α i ( j + 1 ) e j + w i , j
with accident year parameter α i and w i , j W N 0 ; σ w 2 for all i = 0 , , I and j = 0 , , J . To allow a dynamic recursive estimation of the accident year parameters and to avoid over-parametrization, they assume the accident year parameters evolve in the way α i + 1 = α i + v i with α 0 = 0 and v i W N 0 ; σ v 2 . The modeling (4) leads to the general shape of incremental payments curve over the development years, i.e., the incremental payments are supposed to rise very fast in early development years and to decrease exponentially over the following development years.
The models in the papers of the first category are mostly distribution-free in contrast to the papers of the second category (Verrall (1989,1994), Ntzoufras and Dellaportas (2002)), in which incremental payments are assumed to follow a log-normal distribution. The logarithmic incremental payments are specified by the log-normal model:
Y i , j = log X i , j = μ + α i + β j + w i , j
with X i , j > 0 , mean μ , accident year parameter α i , development year parameter β j and the Gaussian white noise process w i , j W N 0 ; σ w 2 for all i = 0 , , I and j = 0 , , J . The accident and development year parameters are further assumed to evolve as follows for the same reasons as in De Jong and Zehnwirth (1983):
α i + 1 = α i + v i ( α ) β j + 1 = β j + v j ( β )
with α 0 = β 0 = 0 and white noise processes v i ( α ) and v j ( β ) for all i = 0 , , I and j = 0 , , J . Model (5) was also named as the “linear CL model” by Verrall (1989,1994), since it is very similar to an additive representation of the CL method (see also Kremer (1982)). In addition to the basic model (5), which is used by Verrall (1989) and Li (2006), Verrall (1994) and Ntzoufras and Dellaportas (2002) extend the basic model by integrating varying run-off evolutions.
As a first example of the third category, Atherino et al. (2010) consider a different data ordering of non-cumulative run-off triangles, in which the stacked rows of a triangle form a univariate time series y t with several runs of missing data. They choose a structural model for the incremental payments with a local level component ν t , a stochastic periodic component η t and a regression term h t T u ,
y t = ν t + η t + h t T u + w t
ν t + 1 = ν t + v t ( ν )
η t + 1 = d = 1 J 1 η t + 1 d + v t ( η ) ( t = J 1 , J , )
with the Gaussian white noise processes w t , v t ( ν ) , v t ( η ) . This model and its components are motivated by the claims process behavior: The level component shall respond for the average value of claims along each accident year, while the periodic component is supposed to capture the development year effect. The regression term is mainly motivated by the need for intervention effects due to the presence of outliers.
Another example of the third category to model the claims data can be found in Alpuim and Ribeiro (2003). They assume that the incremental payments X i , j for the i-th accident year ( i = 0 , , I ) in the j-th development year ( j = 1 , , J ) depend on the payments X i , 0 of the respective accident year,
X i , j = λ i , j X i , 0 + w i , j
with w i , j W N 0 ; σ w 2 . That is, the total amount of claims incurred in year i and paid j years later is proportional to the claims incurred and paid in accident year i. This proportion varies randomly with development and accident year, so that they assume for the proportion parameter λ i , j a first order autoregressive process:
λ i , j = μ j + ϕ j ( λ i 1 , j μ j ) + v i , j
with mean μ j and v i , j W N 0 ; σ v 2 . Moreover, the X i , j of different development years j = 1 , , J are stochastically independent.
Since the fourth category (England and Verrall (2002), Verrall (2004), Li (2006)) provides comparisons of various methods, there is only a recapitulation of existing approaches in the remaining papers.
All of these modeling approaches can be converted into a state space representation, and then, the Kalman-filter can be used for prediction, filtering and smoothing of the claims development data.

3.3. Modeling Approaches of State Space Representations

Most of state space representations in the introduced papers are based on the calendar year approach, which provides the claims data of each calendar year to be stacked into separate observation vectors of the respective calendar years. In addition to this approach, there are further differing approaches considering an accident year- or a development year-based modeling of the observation vectors (see Figure 2) as also detached approaches, which model run-off triangle data, for example as a univariate time series.
The popularity of the calendar year-based approach can be justified as follows:
  • Annually-added observations build a new diagonal in the run-off triangle. Therefore, the calendar year approach corresponds to natural modeling of the claims data.
  • The observations of the same calendar year are subjected to calendar year effects of the same level, such as the inflation factor or changes in legislation.
  • As for estimating and forecasting, the recent observations should be weighted higher compared to past observations. This proposition is also consistent with the view of many authors such as Verrall (1994), Taylor (2000) or De Jong (2005) and De Jong (2006). Therefore, the use of the Kalman-filter is justified here. Its recursive and dynamic nature complies with this requirement especially in relation to the calendar year approach.
Four representative state space models, which are also used for the empirical comparison in Section 5, are presented below. Two of these models (Verrall (1989), Li (2006)) are based on the calendar year approach, and the other two models (Atherino et al. (2010), Alpuim and Ribeiro (2003)) consider detached approaches. As for accident year and development year approaches, they can only be found in Taylor et al. (2003) and De Jong and Zehnwirth (1983). However, these approaches have no significant advantages compared to the calendar year approach, so they are not introduced in this paper.
The state space representation of the log-normal model for incremental payments in Verrall (1989) has the observation equation:
Y 0 , t Y 1 , t 1 Y 2 , t 2 Y t 1 , 1 Y t , 0 observation vector = 1 0 0 1 1 1 0 0 1 0 0 1 0 0 1 0 0 1 0 0 0 0 1 0 1 0 0 1 0 0 0 1 0 0 1 0 system matrix μ α 1 β 1 α t β t state vector in t + w 0 , t w 1 , t 1 w 2 , t 2 w t 1 , 1 w t , 0 measurement noise vector
for calendar year t = 0 , , I , which implies (5) for each Y i , j of calendar year t = i + j , and the state equation:
μ α 1 β 1 α t + 1 β t + 1 state vector in t + 1 = 1 0 0 0 0 0 0 1 0 1 0 0 0 1 transition matrix μ α 1 β 1 α t β t state vector in t + 0 0 0 v t ( α ) v t ( β ) process noise vector
for t = 0 , , I , where (12) allows a dynamic recursive estimation of the accident and development year parameters via (6). The state space model of Verrall (1989) is also used in Li (2006), but slightly modified.
The state space representation of the structural model in Atherino et al. (2010) has the observation equation:
y t = 1 1 0 0 system matrix ν t η t η t 1 η t J + 2 state vector in t + h t T u + w t ,
that stands for (7), as well as the state equation:
ν t + 1 η t + 1 η t η t J + 3 state vector in t + 1 = 1 0 0 0 0 1 1 1 0 1 0 0 0 0 1 0 transition matrix ν t η t η t 1 η t J + 2 state vector in t + v t ( ν ) v t ( η ) 0 0 process noise vector ,
which includes (8) and (9). In both state space representations Verrall (1989) and Atherino et al. (2010), the Gaussian white noise processes are assumed to be stochastically independent as also independent of the initial state.
Both examples and the most state space representations in the other papers have a matrix-based approach in common, so the Kalman recursions also contain numerous matrices of high dimensions (see, for example, Brockwell and Davis (2006)). This complicates parameter estimation and practical application considerably. Therefore, a scalar structure could likewise be a preferable approach. One alternative is the state space model in Alpuim and Ribeiro (2003), which consists of the observation Equation (10), the state Equation (11) and the additional assumption E [ v i , j w k , l ] = 0 for all i , k = 0 , , I and j , l = 1 , , J . Another promising alternative is the newly-developed scalar state space model introduced in Section 4.

4. Scalar State Space Model for Cumulative Payments

The idea behind the scalar state space model is a modification of the CL method to get a meaningful state space representation and to use the Kalman-filter for calculating the claims reserves, as well as for measuring their precision. Since the CL method operates under the assumption that the cumulative payments are a linear function of the cumulative payments of the previous development year, we consider a linear state space model.

4.1. Model Assumptions and Kalman Recursions

It is assumed that a run-off triangle of observed cumulative payments C i , j obs is based on a run-off triangle of unobservable states C i , j with i + j I for all i = 0 , , I and j = 0 , , J with I = J . Therefore, there is a probable observation error in the claims data, and we can not definitely observe “real cumulative payments”, i.e., the payments made do not necessarily correspond with the payments actually incurred. One possible reason for this difference is that a claim is not reported correctly, for example if the claim is not reported, only partially reported or reported too late. Therefore, we model these unobservable “real cumulative payments” as latent variables. The scalar state space model for cumulative payments presented below provides a basis for determining the entire unobservable upper and lower run-off triangles using Kalman-filter, that is prediction, filtering and smoothing of all states C i , j with i = 0 , , I and j = 0 , , J (see Figure 3).
Model Assumptions 2
(Scalar state space model for cumulative payments).
There exist parameters g j > 0 and σ w 2 > 0 , such that:
C i , j obs = g j C i , j + w i , j ( observation equation )
with w i , j W N 0 ; σ w 2 for i = 0 , , I and j = 0 , , J .
There exist parameters f j > 0 and σ v 2 > 0 , such that:
C i , j + 1 = f j C i , j + v i , j ( state equation )
with v i , j W N 0 ; σ v 2 for i = 0 , , I and j = 0 , , J 1 .
The white noise processes ( w i , j ) j = 0 , , J i = 0 , , I and ( v i , j ) j = 0 , , J 1 i = 0 , , I are uncorrelated and therefore satisfy E [ v i , j w k , l ] = 0 for all i , k = 0 , , I , j = 0 , , J 1 and l = 0 , , J .
Cumulative payments C i , j of different accident years i are stochastically independent.
The assumption of uncorrelated white noise processes is motivated by the fact that there is no reason to assume a systematic relationship between the measurement noise ( w i , j ) j = 0 , , J i = 0 , , I and the process noise ( v i , j ) j = 0 , , J 1 i = 0 , , I . Nevertheless, this assumption is not necessary, but simplifies the Kalman recursions. The further assumption of stochastically independent accident years is frequently applied to claims reserving and especially used in the CL method (see Model Assumptions 1).
The state equation and the observation equation can also be stated as follows:
C i , j = f j 1 C i , j 1 + v i , j 1 = = a i , j ( C i , 0 , v i , 0 , , v i , j 2 , v i , j 1 )
C i , j obs = g j C i , j + w i , j = = b i , j ( C i , 0 , v i , 0 , , v i , j 2 , v i , j 1 , w i , j )
Here, a i , j and b i , j for i = 0 , , I and j = 0 , , J are appropriate linear functions. Considering (15), (16) and the assumptions regarding the measurement and process noise, it is clear that:
E [ C i , j v i , l ] = 0 and E [ C i , j w i , k ] = 0
for all j , k = 0 , , J , l = 0 , , J 1 with j k , j l . Consequently, the initial state C i , 0 of an accident year i = 0 , , I is uncorrelated with v i , j and w i , j for all j.
Remark 1.
  • To forecast future cumulative payments C i , j with i + j > I for i = 1 , , I , j = 1 , , J (lower triangle) the corresponding Kalman predictions are required. The one-step predictor provides forecasts for the next calendar year t = I + 1 , while the h-step predictor shall be used for forecasting cumulative payments in calendar years t = I + h with h > 1 .
  • As for the underlying states C i , j of the observations C i , j obs in the upper triangle, the Kalman filterings (for i + j = I ) and smoothings (for i + j < I ) are useful to identify outliers in the observations and to replace them with smoothed or filtered observations, as well as to obtain an adjusted presentation of the observed quantities and to determine outlier effects. Another key application of smoothing and filtering is the determination of missing values in the upper run-off triangle (for example, resulting from a merger) to interpolate gaps in the data.
Remark 2.
  • We denote the one-step predictor by C ^ i , j + 1 ( P ) and its error variance by ( σ ^ i , j + 1 ( P ) ) 2 = E [ ( C i , j + 1 C ^ i , j + 1 ( P ) ) 2 ] for j = 0 , , I i , as well as the h-step predictor by C ^ i , j + h ( P ) and its error variance by ( σ ^ i , j + h ( P ) ) 2 = E [ ( C i , j + h C ^ i , j + h ( P ) ) 2 ] for j = 0 , , J h . The superscript (P) stands for “prediction” and the subscript indicates the cell ( i , j + 1 ) or ( i , j + h ) in the lower triangle, for which we predict cumulative payments.
  • We denote the filtering by C ^ i , j ( F ) and its error variance by ( σ ^ i , j ( F ) ) 2 = E [ ( C i , j C ^ i , j ( F ) ) 2 ] for j = 0 , , I i , as well as the smoothings by C ^ i , j ( S ) and their error variances by ( σ ^ i , j ( S ) ) 2 = E [ ( C i , j C ^ i , j ( S ) ) 2 ] for j = I i , , 0 . The superscript (F) or (S) stands for “filtering” or “smoothing”, and the subscript indicates the ( i , j )-cell in the upper triangle, for which we filter or smooth cumulative payments.
The Kalman recursions for prediction, filtering and smoothing are given below. There are two different smoothing approaches: fixed-point and fixed-interval smoothing. While the fixed-point approach calculates smoothed values for a few fixed, predetermined points of time, the fixed-interval approach provides an ex post reconstruction of the behavior of a system in order to understand the phenomenon underlying observations. Since we are going to smooth all observations in the upper run-off triangle for identifying outliers in the data, the fixed-interval algorithm is presented (the fixed-point algorithm can be found for example in Brockwell and Davis (2006)).
Theorem 1 (Kalman-filter algorithms for the scalar state space model).
Given the Model Assumptions 2, the one-step, h-step, filtering and fixed-interval smoothing predictors, as well as their error variances are uniquely determined by the initial conditions C ^ i , 0 ( P ) and σ ^ i , 0 ( P ) 2 and the recursions ( i = 0 , , I , h 2 ):
C ^ i , j + 1 ( P ) = f j C ^ i , j ( P ) + f j γ i , j Δ i , j C i , j obs g j C ^ i , j ( P ) ( j = 0 , , I i )
C ^ i , j + h ( P ) = ( f j + h 1 f j + 1 ) C ^ i , j + 1 ( P ) ( j = 0 , , J h )
C ^ i , j ( F ) = C ^ i , j ( P ) + γ i , j Δ i , j C i , j obs g j C ^ i , j ( P ) ( j = 0 , , I i )
C ^ i , j ( S ) = C ^ i , j ( F ) + ψ i , j C ^ i , j + 1 ( S ) C ^ i , j + 1 ( P ) ( j = I i , , 0 )
and:
σ ^ i , j + 1 ( P ) 2 = f j 2 σ ^ i , j ( P ) 2 + σ v 2 f j γ i , j 2 Δ i , j ( j = 0 , , I i )
σ ^ i , j + h ( P ) 2 = f j + h 1 2 σ ^ i , j + h 1 ( P ) 2 + σ v 2 ( j = 0 , , J h )
σ ^ i , j ( F ) 2 = σ ^ i , j ( P ) 2 γ i , j 2 Δ i , j ( j = 0 , , I i )
σ ^ i , j ( S ) 2 = σ ^ i , j ( F ) 2 + ψ i , j 2 σ ^ i , j + 1 ( S ) 2 σ ^ i , j + 1 ( P ) 2 ( j = I i , , 0 )
where γ i , j = σ ^ i , j ( P ) 2 g j , Δ i , j = g j 2 σ ^ i , j ( P ) 2 + σ w 2 and ψ i , j = f j + 1 σ ^ i , j ( F ) 2 σ ^ i , j + 1 ( P ) 2 .
Proof: 
See Appendix A. □
Remark 3.
  • The Kalman gain 0 γ i , j Δ i , j 1 represents the relative importance of the innovation ε i , j = C i , j obs g j C ^ i , j ( P ) with respect to the prior predictor C ^ i , j ( P ) . The higher the covariance γ i , j between the innovation and the state to be predicted and/or the lower the variance Δ i , j of the innovation, the higher the trust in the new observation C i , j obs and therefore the higher the Kalman gain.
  • Due to the fact that there is no observation after the recent calendar year, and therefore no innovation ε i , j , the covariance γ i , j is equal to zero for i + j > I . This implies that the Kalman gain is equal to zero in the h-step recursions.
  • Since the Kalman smoother is a backwards recursive algorithm and its initializations C ^ i , I i ( F ) and σ ^ i , I i ( F ) 2 are values of the last filtering recursion, the smoothings and filterings are identical for the current calendar year t = I .

4.2. Determination of Kalman Reserves and MSEP

Using Theorem 1, the ultimate claims C i , J of accident years i = 1 , , I can be forecasted. The ultimate claim of the first accident year is predicted by the one-step predictor:
C ^ 1 , J ( P ) = f J 1 C ^ 1 , J 1 ( P ) + f J 1 γ 1 , J 1 Δ 1 , J 1 C 1 , J 1 obs g J 1 C ^ 1 , J 1 ( P ) ,
and the ultimate claims of the accident years i = 2 , , I are predicted by the h-step predictors:
C ^ i , J ( P ) = ( f J 1 f J i + 1 ) C ^ i , J i + 1 ( P ) .
Since we predict the ultimate claims in (26), we have h = i . The Kalman reserve R ^ i for a single accident year i = 1 , , I , as well as the total Kalman reserve R ^ across all accident years can therefore be determined by:
R ^ i = C ^ i , J ( P ) C i , I i obs and R ^ = i = 1 I R ^ i ,
respectively. The error variance of the predictor C ^ 1 , J ( P ) according to (25) results in:
σ ^ 1 , J ( P ) 2 = f J 1 2 σ ^ 1 , J 1 ( P ) 2 + σ v 2 f J 1 γ 1 , J 1 2 Δ 1 , J 1 ,
while the error variances of C ^ i , J ( P ) for i = 2 , , I according to (26) are given by:
σ ^ i , J ( P ) 2 = f J 1 2 σ ^ i , J 1 ( P ) 2 + σ v 2 .
The error variances (27) and (28) provide a basis for estimating the (unconditional) MSEP. The MSEP for a single accident year i = 1 , , I using the Kalman predictor C ^ i , J ( P ) is defined as follows:
MSEP C ^ i , J ( P ) = E C i , J C ^ i , J ( P ) 2
Therefore, an appropriate estimator of the MSEP for a single accident year results in:
Definition 1 (Kalman estimator of MSEP for single accident years).
Using Model Assumptions 2 and the Kalman predictor (25) for i = 1 or (26) for i = 2 , , I at time t = I , an estimator of the MSEP for a single accident year i = 1 , , I is given by:
MSEP ^ C ^ i , J ( P ) = σ ^ i , J ( P ) 2
To determine an estimator of the MSEP for aggregated accident years, we consider two different accident years i , k = 1 , , I with i < k in the first step.
MSEP C ^ i , J ( P ) + C ^ k , J ( P ) = E C i , J C ^ i , J ( P ) + C k , J C ^ k , J ( P ) 2 = E C i , J C ^ i , J ( P ) 2 + E C k , J C ^ k , J ( P ) 2 + 2 E C i , J C ^ i , J ( P ) C k , J C ^ k , J ( P ) = MSEP C ^ i , J ( P ) + MSEP C ^ k , J ( P ) + 2 E C i , J C ^ i , J ( P ) C k , J C ^ k , J ( P )
Therefore, the MSEP for two accident years is the sum of two single accident year MSEPs and a mixed term based on both accident years. Using independence of the different accident years (see Model Assumptions 2), we obtain for the expectation in (29):
E C i , J C ^ i , J ( P ) C k , J C ^ k , J ( P ) = E C i , J C ^ i , J ( P ) E C k , J C ^ k , J ( P ) = E C i , J E C ^ i , J ( P ) E C k , J E C ^ k , J ( P )
Since C ^ i , J ( P ) is an unbiased predictor for the ultimate claim C i , J , the term (30) is equal to zero. Consequently, the MSEP for two accident years results from the sum of two single accident year MSEPs:
MSEP C ^ i , J ( P ) + C ^ k , J ( P ) = MSEP C ^ i , J ( P ) + MSEP C ^ k , J ( P )
The MSEP of all considered accident years is therefore, by independence, determined as:
MSEP i = 1 I C ^ i , J ( P ) = i = 1 I MSEP C ^ i , J ( P )
Thus, an estimator of the MSEP for aggregated accident years is given by:
Definition 2 (Kalman estimator of MSEP for aggregated accident years).
Using Model Assumptions 2, we have the following estimator for the MSEP of the ultimate claim for aggregated accident years:
MSEP ^ i = 1 I C ^ i , J ( P ) = i = 1 I MSEP ^ C ^ i , J ( P )

5. Empirical Applications

In this section, various empirical applications are considered. The applications are based on the claims development triangle of Taylor and Ashe (1983) (see Table 1). Firstly, the scalar state space model is used to calculate predicted, filtered and smoothed values for cumulative payments, as well as outlier effects in the data. Secondly, an empirical comparison of representative state space models and other popular methods in claims reserving is made.

5.1. Applications of Scalar State Space Model

Using Theorem 1, we can calculate forecasts, as well as filtered and smoothed cumulative payments, but in a prior step, we need to estimate the model parameters g j , f j , σ w 2 , σ v 2 and the initializations C ^ i , 0 ( P ) and σ ^ i , 0 ( P ) 2 for all i , j . Alternatively, they can also be determined by consulting expert opinion, market statistics or similar portfolios. In this section, we estimate the parameters as follows. Taking conditional expectations of both sides of (14) with respect to C i , j leads directly to E [ C i , j + 1 | C i , j ] = f j C i , j . If we compare this result to (1), we find that f j plays the role of the usual CL factor. This fact motivates the estimation of the factors f 0 , , f J 1 using the CL estimator (2). To avoid over-parametrization of the model, the parameter g j = g is assumed to be time-invariant. The parameters g, σ w 2 , σ v 2 are estimated using maximum likelihood (ML) method in conjunction with the expectation-maximization (EM) algorithm. Since a direct maximization of the Gaussian log likelihood can cause local maxima or convergence problems, we use the EM algorithm, which is an iterative method to find ML estimates based on the expected conditional Gaussian log likelihood (for more details on the EM algorithm, see Shumway and Stoffer (1982), Shumway and Stoffer (2010) or Johannssen (2016)).
Taking the observations C i , 0 obs in the respective accident years i = 0 , , I and the CL variance parameter σ ^ 0 2 (see (3)) as initializations, we get the following parameter estimates for the run-off triangle of Taylor and Ashe (1983) in Table 2:
Since individual estimates of g for different years slightly deviate between 0 . 98 and 1 . 02 , the assumption of time invariance has no significant impact on the calculated results. On the contrary: Due to the small number of observations in more recent years, the quality of the respective estimates would be questionable. Moreover, the estimated value g ^ 1 suggests the presumption for the dataset of Taylor and Ashe (1983) that the observations do not differ systematically upwards or downwards from the underlying states.
The predicted, filtered and smoothed cumulative payments are given in Table 3.
Using the smoothed values C ^ i , j ( S ) for i + j 9 (upper triangle in Table 3), outliers in the data can be identified in the first place. Subsequently, outlier effects can be isolated, and observations can be robustified. Table 4 shows the outlier effects Z ^ i , j = C i , j obs C ^ i , j ( S ) for all i = 0 , , 9 and j = 0 , , 9 i .
The outlier effects at j = 0 are negligible or equal to zero for all i = 0 , , 9 due to the chosen initializations. As for the magnitude of the observed cumulative payments, the outlier effects are relatively minor with a few exceptions. The largest outlier effect Z ^ 3 , 3 has an (absolute) deviation of 199148. For example, the outlier effect Z ^ 0 , 9 = 6470 indicates that we have an observation error in the data, which implies that the cumulative payments made are way too low compared to the unobservable “real cumulative payments”, i.e., there are still outstanding payments. It is for some claims reserving methods (such as the CL method), which are not robust against outliers, of high relevance, if the observations C i , I i obs for i = 1 , , I of the current calendar year represent outliers. Therefore, two cumulative payments C 3 , 6 obs and C 7 , 2 obs in the run-off triangle of Taylor and Ashe (1983) should be treated with caution. In particular, such observations should be robustified, for example, they could be replaced by the Kalman smoothed values. As for the Kalman recursion algorithms, outliers entail less problems because of the lack of credibility of these observations and, consequently, the minor Kalman gain, i.e., outliers do not change the forecast decisively.
The robustness of the CL method and the impact of outliers were also considered by Verdonck et al. (2009), who primarily surveyed how (simulated) outliers affect the CL total reserve. The results of their study show that the most problematic areas in a run-off triangle are the lower left corner and the upper right corner, since there are too few observations. In these areas, the CL method is particularly sensitive to outliers, because the CL reserves directly and indirectly (via the CL estimators (2)) depend on the observations of the recent calendar year. Further papers on the subject of robustness of the CL method are, for example, Van Wouwe et al. (2009), Verdonck and Debruyne (2011) and Verdonck and Van Wouwe (2011).

5.2. Empirical Comparison of Selected Models

The models considered in the empirical comparison are the scalar state space model, but the models in Verrall (1989), Alpuim and Ribeiro (2003), Li (2006), Atherino et al. (2010) and the well-known methods CL and Bornhuetter–Ferguson (BF), as well as the (overdispersed) Poisson (ODP) model. The chosen BF method is rather conservative, i.e., we use a priori estimates for the expected ultimate claims, which generally exceed estimates based on development triangle data. It should be pointed out that the BF results depend largely on the quality of the a priori estimates.
The most results in Table 5 suggest total claims reserves of approximately 18500000 ± 200000 . The model in Atherino et al. (2010) and the conservative BF method present differing results with more optimistic and more conservative total reserves, respectively. Higher reserves tend to lead to higher MSEP, just like in the ODP model, Li (2006), Verrall (1989) and the CL method, with the exception of the BF method and the scalar state space model. The model of Atherino et al. (2010), which provides by far the lowest total claims reserves, also leads to the smallest MSEP for aggregated accident years, but not to the smallest variational coefficient ( VCO = MSEP / reserve ) for aggregated accident years.
The conservative BF method produces the smallest VCO for aggregated accident years, closely followed by Atherino et al. (2010), the scalar state space model and Alpuim and Ribeiro (2003). The model of Alpuim and Ribeiro (2003) and the scalar state space model provide a large VCO in the first accident year, resulting from the interaction of relatively low reserves and a larger MSEP for this year. In particular, for the scalar state space model the VCO decreases noticeably in subsequent accident years. Other models, like the log-normal models in Verrall (1989) and Li (2006), lead to relatively low VCO in the first accident years, but without a remarkable reduction in later accident years, so they remain mostly at the level of 25 % ± 3 % .
Compared with the results of the other seven models, the scalar state space model produces quite precise results with relatively low MSEPs and VCOs. Thus, the scalar structure of this model leads not only to facilitated practical application, but it is also in no way inferior to more complex models for the considered dataset.

6. Conclusions

The previously written papers in the stochastic claims reserving literature based on state space models and the Kalman-filter have numerous contentual similarities, and we can classify them into four categories related to the methodology or modeling used in the respective papers. Models for incremental payments dominate in this field of research, in particular variations on the Hoerl curve, the log-normal model, as well as calendar year-based state space representations. However, most state space representations in these papers entail a matrix-based approach, which complicates their direct application in practice.
In contrast, the newly-developed scalar state space model for cumulative payments is quite elegant due to its simple, yet powerful structure. Since the CL method is the most commonly-used reserving method in practice and the scalar state space model is an extension of the CL method, the scalar state space model can readily be applied to claims reserving practice. In particular, this model provides facilitated calculation of forecasts for the cumulative payments in the lower run-off triangle and of smoothed values for the cumulative payments in the upper triangle. Moreover, the determination of claims reserves and the estimation of their MSEP for single and aggregated accident years are straightforward calculations using Kalman-filter. The scalar state space model can also be used to identify and smooth outliers. This subject is of particular importance for less robust claims reserving methods such as the CL method. Summarizing, the scalar state space model is a very promising, robustified extension of the CL method, which renounces a complex matrix structure compared to most state space models in claims reserving and therefore simplifies its practical applications.
Because of its recursive and dynamic nature, the Kalman-filter is predestined for use in stochastic claims reserving. The authors McGuire (2007), Taylor and McGuire (2008) and Chima-Okereke (2013) even recommend the use of state space models as a part of a nearly completely automated script, a so-called reserving robot in stochastic claims reserving. Due to the high flexibility of state space models and the Kalman-filter, it is also possible to perform multivariate analyses in a simple manner and to take several run-off triangles simultaneously into account. In this way, dependencies, in particular correlations, between run-off triangles can be surveyed and included into the modeling.

Acknowledgments

The authors would like to thank three anonymous reviewers for their valuable feedback and suggestions.

Author Contributions

The authors have equally contributed to the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 1

This Appendix provides a brief proof of Theorem 1; for a more detailed proof, see Johannssen (2016) and Shumway and Stoffer (2010). For the derivation of the Kalman recursions, we do not need any distributional assumptions. However, to simplify the notation, we can think of E for conditional expectations as a projection operator instead of an expectation. Then, the predictors obtained are the minimum MSEP predictors within the class of linear predictors.

Appendix A.1. One-Step Predictors and Error Variances

To derive recursion (17), we use the decomposition:
E C i , j + 1 | C i , 0 obs , , C i , j obs 1 st component = E C i , j + 1 | C i , 0 obs , , C i , j 1 obs 2 nd component + E C i , j + 1 | ε i , j 3 rd component .
For the first component, we have:
E C i , j + 1 | C i , 0 obs , , C i , j obs = C ^ i , j + 1 ( P ) .
For the second component, the following expression can be specified with (14):
E C i , j + 1 | C i , 0 obs , , C i , j 1 obs = E f j C i , j + v i , j | C i , 0 obs , , C i , j 1 obs = f j C ^ i , j ( P )
The third component:
E C i , j + 1 | ε i , j = E C i , j + 1 ε i , j 1 st term ( E ε i , j 2 2 nd term ) 1 ε i , j 3 rd term
consists of three terms, which we determine separately below. Since we need the innovation ε i , j for each of these three terms, we start with the third term using (13):
ε i , j = C i , j obs g j C ^ i , j ( P ) = g j C i , j C ^ i , j ( P ) + w i , j
Then, the first term in (A4) can be specified with the help of (14) and (A5) as follows:
E C i , j + 1 ε i , j = E f j C i , j + v i , j g j C i , j C ^ i , j ( P ) + w i , j = f j σ ^ i , j ( P ) 2 g j
The second term in (A4) can be determined using (A5):
E ε i , j 2 = E g j C i , j C ^ i , j ( P ) + w i , j 2 = g j 2 σ ^ i , j ( P ) 2 + σ w 2
Substituting three terms (A5), (A6) and (A7) into (A4) leads to:
E C i , j + 1 | ε i , j = f j σ ^ i , j ( P ) 2 g j g j 2 σ ^ i , j ( P ) 2 + σ w 2 1 C i , j obs g j C ^ i , j ( P )
Inserting components (A2), (A3) and (A8) into (A1) directly implies:
C ^ i , j + 1 ( P ) = f j C ^ i , j ( P ) + f j σ ^ i , j ( P ) 2 g j g j 2 σ ^ i , j ( P ) 2 + σ w 2 1 C i , j obs g j C ^ i , j ( P ) ,
which is equal to (17) with γ i , j = σ ^ i , j ( P ) 2 g j and Δ i , j = g j 2 σ ^ i , j ( P ) 2 + σ w 2 . □
The verification of recursion (21) can be carried out using the decomposition:
σ ^ i , j + 1 ( P ) 2 = E C i , j + 1 2 1 st component E C ^ i , j + 1 ( P ) 2 2 nd component .
For the first component, we have with (14):
E C i , j + 1 2 = E f j C i , j + v i , j 2 = f j 2 E C i , j 2 + σ v 2 ,
while we get for the second component with (17), (A5) and (A7):
E C ^ i , j + 1 ( P ) 2 = E f j C ^ i , j ( P ) + f j γ i , j Δ i , j C i , j obs g j C ^ i , j ( P ) 2 = f j 2 E C ^ i , j ( P ) 2 + f j γ i , j 2 Δ i , j
Inserting (A10) and (A11) into (A9) directly leads to:
σ ^ i , j + 1 ( P ) 2 = f j 2 E C i , j 2 + σ v 2 f j 2 E C ^ i , j ( P ) 2 + f j γ i , j 2 Δ i , j = f j 2 σ ^ i , j ( P ) 2 + σ v 2 f j γ i , j 2 Δ i , j

Appendix A.2. h-Step Predictors and Error Variances

Recursion (18) can be easily determined by updating the one-step predictions (17) (see Remark 3, second bullet point):
C ^ i , j + h ( P ) = f j + h 1 C ^ i , j + h 1 ( P ) = = ( f j + h 1 f j + 1 ) C ^ i , j + 1 ( P )
Recursion (22) for the error variances σ ^ i , j + h ( P ) 2 with h 2 can be obtained using (14) and (A12):
σ ^ i , j + h ( P ) 2 = E f j + h 1 C i , j + h 1 + v i , j + h 1 f j + h 1 C ^ i , j + h 1 ( P ) 2 = f j + h 1 2 σ ^ i , j + h 1 ( P ) 2 + σ v 2

Appendix A.3. Filtering Predictors and Error Variances

To derive recursion (19), we use the decomposition:
E C i , j | C i , 0 obs , , C i , j obs 1 st component = E C i , j | C i , 0 obs , , C i , j 1 obs 2 nd component + E C i , j | ε i , j 3 rd component .
The specification of three components in (A13) is analogous to the derivation of the recursion for one-step prediction. Therefore, we have for the first and second component:
E C i , j | C i , 0 obs , , C i , j obs = C ^ i , j ( F )
E C i , j | C i , 0 obs , , C i , j 1 obs = C ^ i , j ( P )
The third component consists of three terms, whose second and third terms are identical to those in (A4), but the first term differs slightly:
E C i , j | ε i , j = E C i , j ε i , j 1 st term ( E ε i , j 2 2 nd term ) 1 ε i , j 3 rd term
Whereas the last both terms in (A16) are already given by (A5) and (A7), the first term can be specified using (A5):
E C i , j ε i , j = E C i , j g j C i , j C ^ i , j ( P ) + w i , j = σ ^ i , j ( P ) 2 g j
Substituting three terms (A5), (A7) and (A17) into (A16) leads to:
E C i , j | ε i , j = σ ^ i , j ( P ) 2 g j g j 2 σ ^ i , j ( P ) 2 + σ w 2 1 C i , j obs g j C ^ i , j ( P )
Inserting components (A14), (A15) and (A18) into (A13) directly implies:
C ^ i , j ( F ) = C ^ i , j ( P ) + σ ^ i , j ( P ) 2 g j g j 2 σ ^ i , j ( P ) 2 + σ w 2 1 C i , j obs g j C ^ i , j ( P ) ,
which is equal to (19) with γ i , j = σ ^ i , j ( P ) 2 g j and Δ i , j = g j 2 σ ^ i , j ( P ) 2 + σ w 2 . □
To establish recursion (23), we use (19) as follows:
C i , j C ^ i , j ( P ) = C i , j C ^ i , j ( F ) + C ^ i , j ( F ) C ^ i , j ( P ) = C i , j C ^ i , j ( F ) + γ i , j ε i , j Δ i , j
Hence, from (A19), we obtain:
C ^ i , j ( P ) = C ^ i , j ( F ) γ i , j ε i , j Δ i , j
Using (A7) and (A20), we get the following expression:
σ ^ i , j ( P ) 2 = E C i , j C ^ i , j ( F ) + γ i , j ε i , j Δ i , j 2 = σ ^ i , j ( F ) 2 + γ i , j 2 Δ i , j ,
which directly leads to (23). □

Appendix A.4. Fixed-Interval Smoothing Predictors and Error Variances

In order to obtain fixed-interval recursion (20), first, we need to derive the Kalman fixed-point recursion:
C ^ i , j | s ( S ) = C ^ i , j | s 1 ( S ) + K i , j , s C i , s obs g s C ^ i , s ( P )
with K i , j , s = γ i , j , s Δ i , s , γ i , j , s = g s E C i , j C ^ i , j ( P ) C i , s C ^ i , s ( P ) , Δ i , s = g s 2 σ ^ i , s ( P ) 2 + σ w 2 for s = j + 1 , j + 2 , , I i , a fixed i = 0 , , I and a fixed j = 0 , , I i . For derivation of fixed-point recursion (A21), we use the decomposition:
E C i , j | C i , 0 obs , , C i , s obs 1 st component = E C i , j | C i , 0 obs , , C i , s 1 obs 2 nd component + E C i , j | ε i , s 3 rd component .
The specification of three components in (A22) is analogous to the derivation of the recursion for one-step prediction. Therefore, we have for the first and second component:
E C i , j | C i , 0 obs , , C i , s obs = C ^ i , j | s ( S )
E C i , j | C i , 0 obs , , C i , s 1 obs = C ^ i , j | s 1 ( S )
The third component consists of three terms, whose second and third terms are identical to those in (A4), but the first term differs slightly:
E C i , j | ε i , s = E C i , j ε i , s 1 st term ( E ε i , s 2 2 nd term ) 1 ε i , s 3 rd term
Whereas the last both terms in (A25) are already given by (A5) and (A7), the first term can be specified using (A5):
E C i , j ε i , s = E C i , j g s C i , s C ^ i , s ( P ) + w i , s = E C i , j C ^ i , j ( P ) + C ^ i , j ( P ) g s C i , s C ^ i , s ( P ) + w i , s = g s E C i , j C ^ i , j ( P ) C i , s C ^ i , s ( P )
Substituting three terms (A5), (A7) and (A26) into (A25) leads to:
E C i , j | ε i , s = g s E C i , j C ^ i , j ( P ) C i , s C ^ i , s ( P ) g s 2 σ ^ i , s ( P ) 2 + σ w 2 1 C i , s obs g s C ^ i , s ( P )
Inserting components (A23), (A24) and (A27) into (A22) directly implies:
C ^ i , j | s ( S ) = C ^ i , j | s 1 ( S ) + g s E C i , j C ^ i , j ( P ) C i , s C ^ i , s ( P ) g s 2 σ ^ i , s ( P ) 2 + σ w 2 1 C i , s obs g s C ^ i , s ( P ) ,
which is equal to (A21) with γ i , j , s = g s E C i , j C ^ i , j ( P ) C i , s C ^ i , s ( P ) , Δ i , s = g s 2 σ ^ i , s ( P ) 2 + σ w 2 and K i , j , s = γ i , j , s Δ i , s .
Now, we can derive fixed-interval recursion (20) by considering (A21), i.e.,
C ^ i , j | s ( S ) = C ^ i , j | s 1 ( S ) + K i , j , s ε i , s = = C ^ i , j ( F ) + l = j + 1 s K i , j , l ε i , l .
In a similar way, we get:
C ^ i , j + 1 | s ( S ) = C ^ i , j + 1 ( P ) + l = j + 1 s K i , j + 1 , l ε i , l
Comparing (A28) and (A29), we find that the relation K i , j , l = ψ i , j K i , j + 1 , l with ψ i , j = f j + 1 σ ^ i , j ( F ) 2 σ ^ i , j + 1 ( P ) 2 holds. Therefore, using these findings, we get the following equation:
C ^ i , j | s ( S ) = C ^ i , j ( F ) + ψ i , j l = j + 1 s K i , j + 1 , l ε i , l = C ^ i , j ( F ) + ψ i , j C ^ i , j + 1 | s ( S ) C ^ i , j + 1 ( P )
Since we smooth observations in the upper triangle within the fixed-interval recursion using all available observations, we have s = I i (s is therefore no longer required) and can simplify the notation C ^ i , j | s ( S ) to C ^ i , j ( S ) for all j = I i , , 0 . Then, (A30) is equivalent to (20). □
The verification of (24) follows from (20) with some straightforward calculations. Using (20), we obtain:
C i , j C ^ i , j ( S ) + ψ i , j C ^ i , j + 1 ( S ) = C i , j C ^ i , j ( F ) + ψ i , j C ^ i , j + 1 ( P ) .
Squaring and taking expectations of both sides of (A31), as also using the fact the cross products are zero, imply:
σ ^ i , j ( S ) 2 + ψ i , j 2 E C ^ i , j + 1 ( S ) 2 = σ ^ i , j ( F ) 2 + ψ i , j 2 E C ^ i , j + 1 ( P ) 2 .
Now, we use decompositions for the remaining expectations (see (A9)):
E C ^ i , j + 1 ( S ) 2 = E C i , j + 1 2 σ ^ i , j + 1 ( S ) 2
E C ^ i , j + 1 ( P ) 2 = E C i , j + 1 2 σ ^ i , j + 1 ( P ) 2
Inserting the right-hand sides of (A33) and (A34) into (A32) directly leads to (24). □

References

  1. Alpuim, Teresa, and Isabel Ribeiro. 2003. A State Space Model for Run-Off Triangles. Applied Stochastic Models in Business and Industry 19: 105–20. [Google Scholar] [CrossRef]
  2. Atherino, Rodrigo, Adrian Pizzinga, and Cristiano Fernandes. 2010. A row-wise Stacking of the Runoff Triangle: State Space Alternatives for IBNR Reserve Prediction. ASTIN Bulletin 40: 917–46. [Google Scholar]
  3. Brockwell, Peter J., and Richard A. Davis. 2006. Time Series: Theory and Methods, 2nd ed. New York: Springer. [Google Scholar]
  4. Chima-Okereke, Chibisi. 2013. A practical Approach to Claims Reserving using State Space Models with Growth Curves. Paper presented at the R in Insurance Conference 2013, London, UK, July 15. [Google Scholar]
  5. De Jong, Piet. 2005. State Space Models in Actuarial Science. Paper presented at the Second Brazilian Conference on Statistical Modelling in Insurance, Institute of Mathematics and Statistics, University of São Paulo, Maresias, Brazil, August 28–September 3. [Google Scholar]
  6. De Jong, Piet. 2006. Forecasting Runoff Triangles. North American Actuarial Journal 10: 28–38. [Google Scholar] [CrossRef]
  7. De Jong, Piet, and Ben Zehnwirth. 1983. Claims Reserving, State-Space Models and the Kalman Filter. Journal of the Institute of Actuaries 110: 157–81. [Google Scholar] [CrossRef]
  8. England, Peter D., and Richard J. Verrall. 2002. Stochastic Claims Reserving in General Insurance. British Actuarial Journal 8: 443–518. [Google Scholar] [CrossRef]
  9. Frees, Edward W. 2010. Regression Modeling with Actuarial and Financial Applications. Cambridge: Cambridge University Press. [Google Scholar]
  10. Johannssen, Arne. 2016. Stochastische Schadenreservierung unter Verwendung von Zustandsraummodellen und des Kalman-Filters. Hamburg: Dr. Kovac. [Google Scholar]
  11. Kaas, Rob, Marc Goovaerts, Jan Dhaene, and Michel Denuit. 2009. Modern Actuarial Risk Theory—Using R, 2nd ed. Berlin: Springer. [Google Scholar]
  12. Kalman, Rudolf E. 1960. A New Approach to Linear Filtering and Prediction Problems. Trans. of the ASME—Journal of Basic Engineering (Series D) 82: 35–45. [Google Scholar] [CrossRef]
  13. Kremer, Erhard. 1982. IBNR-Claims and the Two-Way Model of ANOVA. Scandinavian Actuarial Journal 1982: 47–55. [Google Scholar] [CrossRef]
  14. Li, Jackie. 2006. Comparison of Stochastic Reserving Methods. Australian Actuarial Journal 12: 489–569. [Google Scholar]
  15. Mack, Thomas. 1993. Distribution-free Calculation of the Standard Error of Chain Ladder Reserve Estimates. ASTIN Bulletin 23: 213–25. [Google Scholar] [CrossRef]
  16. McGuire, Gráinne. 2007. Building a Reserving Robot. Paper presented at the Biennial Convention 2007, Institute of Actuaries of Australia, Christchurch, New Zealand, September 23–26. [Google Scholar]
  17. Ntzoufras, Ioannis, and Petros Dellaportas. 2002. Bayesian Modelling of Outstanding Liabilities incorporating Claim Count Uncertainty. North American Actuarial Journal 6: 113–28. [Google Scholar] [CrossRef]
  18. Pang, Liyan, and Siqi He. 2012. The Application of State-Space Model in Outstanding Claims Reserve. Paper presented at the 2012 International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII), Sanya, China, October 20–21; pp. 271–74. [Google Scholar]
  19. Shumway, Robert H., and David S. Stoffer. 1982. An Approach to Time Series Smoothing and Forecasting using the EM Algorithm. Journal of Time Series Analysis 3: 253–64. [Google Scholar] [CrossRef]
  20. Shumway, Robert H., and David S. Stoffer. 2010. Time Series Analysis and Its Applications (With R Examples), 3rd ed. New York: Springer. [Google Scholar]
  21. Taylor, Greg C. 2000. Loss Reserving: An Actuarial Perspective. Boston: Kluwer Academic Publishers. [Google Scholar]
  22. Taylor, Greg C., and Frank R. Ashe. 1983. Second Moments of Estimates of Outstanding Claims. Journal of Econometrics 23: 37–61. [Google Scholar] [CrossRef]
  23. Taylor, Greg C., and Gráinne McGuire. 2008. Robotic Reserving. GIRO Convention 2013, Italy. [Google Scholar]
  24. Taylor, Greg C., Gráinne McGuire, and Alan Greenfield. 2003. Loss Reserving: Past, Present and Future. Research Paper No. 109. Melbourne: University of Melbourne. [Google Scholar]
  25. Van Wouwe, Martine, Tim Verdonck, and Kristel Van Rompay. 2009. Application of Classical and Robust Chain-Ladder Methods: Results for the Belgian Non-Life Business. Global Business and Economics Review 11: 99–115. [Google Scholar] [CrossRef]
  26. Verdonck, Tim, and Michiel Debruyne. 2011. The Influence of Individual Claims on the Chain-Ladder Estimates: Analysis and Diagnostic Tool. Insurance: Mathematics and Economics 48: 85–98. [Google Scholar] [CrossRef]
  27. Verdonck, Tim, and Martine Van Wouwe. 2011. Detection and Correction of Outliers in the Bivariate Chain-Ladder Method. Insurance: Mathematics and Economics 49: 188–93. [Google Scholar] [CrossRef]
  28. Verdonck, Tim, Martine Van Wouwe, and Jan Dhaene. 2009. A Robustification of the Chain-Ladder Method. North American Actuarial Journal 13: 280–98. [Google Scholar] [CrossRef]
  29. Verrall, Richard J. 1989. A State Space Representation of the Chain Ladder Linear Model. Journal of the Institute of Actuaries 116: 589–610. [Google Scholar] [CrossRef]
  30. Verrall, Richard J. 1994. A Method for Modelling Varying Run-off Evolutions in Claims Reserving. ASTIN Bulletin 24: 325–32. [Google Scholar] [CrossRef]
  31. Verrall, Richard J. 2004. Kalman Filter, Reserving Methods. In Encyclopedia of Actuarial Science. Edited by J. L. Teugels and B. Sundt. Chichester: John Wiley & Sons, vol. 1, pp. 952–55. [Google Scholar]
  32. Wright, Thomas S. 1990. A Stochastic Method for Claims Reserving in General Insurance. Journal of the Institute of Actuaries 117: 677–731. [Google Scholar] [CrossRef]
  33. Wüthrich, Mario V., and Michael Merz. 2008. Stochastic Claims Reserving Methods in Insurance. Chichester: John Wiley & Sons. [Google Scholar]
  34. Zehnwirth, Ben. 1997. Kalman Filters with Applications to Loss Reserving. Insurance: Mathematics and Economics 20: 149–218. [Google Scholar]
Figure 1. Chronology and categorization of the papers based on state space models.
Figure 1. Chronology and categorization of the papers based on state space models.
Risks 05 00030 g001
Figure 2. Modeling approaches of claims development data.
Figure 2. Modeling approaches of claims development data.
Risks 05 00030 g002
Figure 3. Unobservable states, observations and Kalman smoothings ( i + j < I ), Kalman filterings ( i + j = I ) and Kalman predictions ( i + j > I ).
Figure 3. Unobservable states, observations and Kalman smoothings ( i + j < I ), Kalman filterings ( i + j = I ) and Kalman predictions ( i + j > I ).
Risks 05 00030 g003
Table 1. Cumulative payments in the claims development triangle of Taylor and Ashe (1983).
Table 1. Cumulative payments in the claims development triangle of Taylor and Ashe (1983).
AccidentDevelopment Year j
Year i0123456789
0357848112478817353302218270274559633199943466336360628638335153901463
135211812361392170033335332237990674120063464786749140395339085
22905071292306221852532351793985995413291846289104909315
3310608141885821950473757447402992943819824588268
444316011363502128333289782134026723873311
53961321333217218071529857523691712
6440832128846324198613483130
735948014211282864498
83766861363294
9344014
Table 2. Estimated parameter values for the dataset of Taylor and Ashe (1983).
Table 2. Estimated parameter values for the dataset of Taylor and Ashe (1983).
f ^ 0 f ^ 1 f ^ 2 f ^ 3 f ^ 4 f ^ 5 f ^ 6 f ^ 7 f ^ 8 g ^
3.49061.74731.45741.17391.10381.08631.05391.07661.01771.0014
Table 3. Predictions (lower triangle), Filterings (last diagonal of upper triangle), Smoothings (upper triangle) for the data set of Taylor and Ashe (1983).
Table 3. Predictions (lower triangle), Filterings (last diagonal of upper triangle), Smoothings (upper triangle) for the data set of Taylor and Ashe (1983).
AccidentDevelopment Year j
Year i0123456789
0357846108864617247032344677280192632164823460558360873238497093907933
1352118124580522026643283457381144541861794624394491042253186015412740
2290510121843122048413256998388872242133134619217489288752676835360921
3310611130118623049553558299404680543744444652591490336552789635372401
4443156125565821034812949026344435738451214176955440209347392934823179
5396131130354521752613073819365943540392844387874462438149786085066730
6440830138419824111623494015410162545273734918086518317055802015678971
7359483145523827506224008757470588051943505642622594676064022826515602
8376686134407523485033422708401791744349774817715507739054663185563072
9344014120081520981853057894358966239622694304213453621048836834970125
Table 4. Kalman outlier effects.
Table 4. Kalman outlier effects.
AccidentDevelopment Year j
Year i0123456789
023614210627 126407 56330 1035125778 2446 16194 6470
10 9666 32631 69865 12378 66116 23473361720484
2 3 7387513684 21819 97273 80395 969316428
3 3 117672 109908 199148 16876 7538 64323
44 119308 24852 51205 41685 28190
51296725454 88067 32277
62 95735 8699 10885
7 3 34110 113876
8019219
90
Table 5. Reserves, standard errors and VCOs for selected models.
Table 5. Reserves, standard errors and VCOs for selected models.
iScalar State Space ModelVerrall (1989)
Reserve MSEP VCO Reserve MSEP VCO
173655167499227.4%1438347267550.5%
245160622166749.1%46584716643835.7%
378413327052434.5%67317519422928.9%
494986831733133.4%106079426622825.1%
5137501836600626.6%147940733975523.0%
6219584142215919.2%221873848797522.0%
7365110450733713.9%328763373566922.4%
8419977866265415.8%4517179104059623.0%
9462611179716117.2%4570683116706825.5%
aggr.1830711313766707.5%18417290262719014.3%
i Alpuim and Ribeiro (2003)Atherino et al. (2010)
Reserve MSEP VCO Reserve MSEP VCO
166860161177241.1%789041838523.3%
232142122724670.7%4337907504617.3%
355162527801751.0%6633129087413.7%
4124390032274525.9%89177410701312.0%
5153550242270927.5%133636114432710.8%
6235644062512526.5%200991320702110.3%
72817779166738159.2%291958730363710.4%
84472888144825132.4%381076941156310.8%
9494288976324115.4%472693557195912.1%
aggr.1830930416372848.9%1687134511978657.1%
i Li (2006)BF Method
Reserve MSEP VCO Reserve MSEP VCO
11013745475554.0%104097117241112.6%
245778817824238.9%51646221818742.2%
365112319874430.5%78060225540132.7%
4103573927113526.2%108337828427626.2%
5147333836071524.5%156140533428621.4%
6219041052296723.9%239540540924717.1%
7344243280806123.5%431233155006512.8%
84269816105473124.7%470687056083311.9%
95027791142552228.4%508839357856511.4%
aggr.18649811280922015.1%2054894212205255.9%
i CL MethodODP Model
Reserve MSEP VCO Reserve MSEP VCO
1946347553579.8%94634110100116.3%
246951112170025.9%46951121604346.0%
370963813355118.8%70963826087236.8%
498488926141226.5%98488930355030.8%
5141945941102829.0%141945937501426.4%
6217764155835625.6%217764149537822.7%
7392030187543022.3%392030178996120.2%
8427897297138522.7%4278972104651424.5%
94625811136338529.5%4625811198010142.8%
aggr.18680856244761813.1%18680856294566115.8%

Share and Cite

MDPI and ACS Style

Chukhrova, N.; Johannssen, A. State Space Models and the Kalman-Filter in Stochastic Claims Reserving: Forecasting, Filtering and Smoothing. Risks 2017, 5, 30. https://doi.org/10.3390/risks5020030

AMA Style

Chukhrova N, Johannssen A. State Space Models and the Kalman-Filter in Stochastic Claims Reserving: Forecasting, Filtering and Smoothing. Risks. 2017; 5(2):30. https://doi.org/10.3390/risks5020030

Chicago/Turabian Style

Chukhrova, Nataliya, and Arne Johannssen. 2017. "State Space Models and the Kalman-Filter in Stochastic Claims Reserving: Forecasting, Filtering and Smoothing" Risks 5, no. 2: 30. https://doi.org/10.3390/risks5020030

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop