Next Article in Journal
Synthesizing High-Utility Patterns from Different Data Sources
Previous Article in Journal
Linking Synthetic Populations to Household Geolocations: A Demonstration in Namibia
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Nested Stochastic Valuation of Large Variable Annuity Portfolios: Monte Carlo Simulation and Synthetic Datasets

Department of Mathematics, University of Connecticut, 341 Mansfield Road, Storrs, CT 06269-1009, USA
*
Author to whom correspondence should be addressed.
Submission received: 11 July 2018 / Revised: 24 August 2018 / Accepted: 30 August 2018 / Published: 1 September 2018

Abstract

:
Dynamic hedging has been adopted by many insurance companies to mitigate the financial risks associated with variable annuity guarantees. To simulate the performance of dynamic hedging for variable annuity products, insurance companies rely on nested stochastic projections, which is highly computationally intensive and often prohibitive for large variable annuity portfolios. Metamodeling techniques have recently been proposed to address the computational issues. However, it is difficult for researchers to obtain real datasets from insurance companies to test metamodeling techniques and publish the results in academic journals. In this paper, we create synthetic datasets that can be used for the purpose of addressing the computational issues associated with the nested stochastic valuation of large variable annuity portfolios. The runtime used to create these synthetic datasets would be about three years if a single CPU were used. These datasets are readily available to researchers and practitioners so that they can focus on testing metamodeling techniques.

1. Introduction

A variable annuity (VA) is a popular life insurance product created by insurance companies to address many people’s concerns about outliving their assets [1,2]. Under a VA policy, the policyholder agrees to make a lump-sum or a series of purchase payments to the insurer and in return the insurer agrees to make benefit payments to the policyholder, beginning either immediately or on a future date. Policyholders choose to invest their money in one or more investment funds provided by the insurance company. A main feature of VAs is that they come with guarantees or riders, which are designed to protect the policyholder’s capital against market downturns.
There are two types of guaranteed benefits embedded in VA policies: death benefits and living benefits. A guaranteed minimum death benefit (GMDB) guarantees a specified amount to the beneficiary upon the death of the policyholder regardless of the performance of the investment portfolio. Examples of living benefits include the guaranteed minimum withdrawal benefit (GMWB), the guaranteed minimum income benefit (GMIB), the guaranteed minimum maturity benefit (GMMB), and the guaranteed minimum accumulation benefit (GMAB). A GMWB guarantees that the policyholder can take systematic annual withdrawals of a specified amount from the policy over a period of time, even though the investment portfolio might be depleted. A GMIB guarantees that the policyholder can convert the VA policy to an annuity according to a specified rate. A GMMB guarantees that the policyholder can receive a specific amount at the maturity of the policy. A GMAB guarantees that the policyholder can renew the contract during a specified window after a specified waiting period.
Due to these attractive guarantees, many VA policies were sold in the past two decades. Figure 1 shows the annual VA sales in the United States during the period from 2008 to 2017. In the figure, we see that, except for 2017, the annual sales in all these years were above $100 billion. The guarantees embedded in VA policies are financial guarantees and cannot be adequately addressed by traditional actuarial methods [3]. To mitigate the financial risks associated with the VA guarantees, many insurance companies with a VA business have adopted dynamic hedging [4,5].
To simulate the performance of dynamic hedging for VA products, insurance companies rely on nested stochastic projections [5]. Nested stochastic projections are also referred to as “stochastic on stochastic” projections. Figure 2 conceptualizes the structure of a typical nested stochastic projection, which involves two layers of stochastic projections. At each node of an outer stochastic path, a set of inner stochastic paths is embedded. Usually the outer stochastic paths are real-world scenarios, which reflect a realistic pattern of underlying market prices that are used to generate realistic distributions of outcomes. In contrast, the inner stochastic paths are risk-neutral scenarios, which use unrealistic assumptions about risk premiums for purposes of calculating derivative prices under the no-arbitrage assumption.
The computation of nested stochastic projections for a large VA portfolio is highly computationally intensive and often prohibitive because every policy in the portfolio needs to be projected over many paths for a long time horizon [6]. For example, if we use 1000 real-world scenarios in the outer layer and 1000 risk-neutral paths in the inner layer, and project the cash flows at yearly steps for 30 years, then the total number of projections for each policy is
1000 × 1000 × 30 × 31 / 2 = 4.65 × 10 8 ,
which is already a big number. For a portfolio of 100,000 contracts, the number of projections would be 4.65 × 10 13 . Suppose that a single CPU can process 200,000 cash flow projections in a second. Then, it will take this CPU
4.65 × 10 13 200 , 000 × 3600 × 24 × 365 = 7.37
years to process all the cash flow projections for the portfolio. The amount of time shown in the above equation is just the runtime used to project the cash flows once. To calculate the Greeks, we need to project the cash flows multiple times at different shocks of the market. This will increase the runtime multifold.
Recently, metamodeling approaches have been proposed to address the computational issues associated with the valuation of large VA portfolios (see, for example, [7,8,9,10,11,12,13,14,15,16]). The main idea of these metamodeling approaches is to build a predictive model based on a set of representative VA policies and their fair market values (or other quantities of interest). The predictive model is then used to estimate the fair market values for all the policies in the portfolio. This can reduce the number of policies that are valued by Monte Carlo simulation. Since predictive models are usually much faster than Monte Carlo simulation, the gain in valuation runtime is significant.
However, it is difficult for academic researchers to obtain real datasets from insurance companies to assess the performance of metamodeling techniques. In this paper, we create synthetic datasets that can be used by researchers and practitioners to test metamodeling methods for the efficient valuation of large VA portfolios under nested stochastic simulation. In particular, we implement a nested stochastic valuation engine that is used to calculate the Greeks for VA policies along outer layer paths. The purpose of this work is to relieve researchers from spending time on creating such datasets, which can be extremely time-consuming to create. This paper differs from a previous paper [10] in that this paper focuses on developing comprehensive synthetic datasets while the previous paper focuses on metamodeling. This paper also differs from the paper [17] because this paper is about creating synthetic datasets under the stochastic-on-stochastic valuation framework while the paper [17] is about creating synthetic datasets for valuation only at time zero.
The remaining part of this paper is structured as follows. Section 2 presents a nested stochastic simulation engine for valuing the guarantees embedded in variable annuities. In Section 3 and Section 4, we present synthetic datasets that can be used to test the performance of metamodeling techniques. In Section 5, we conclude the paper with some remarks. The software that implements the nested Monte Carlo simulation engine is described in Appendix A.

2. Nested Stochastic Valuation

In this section, we describe the nested stochastic valuation engine. In particular, we introduce the risk-neutral scenario generator, the real-world scenario generator, and the cash flow projections.

2.1. Risk-Neutral Scenario Generator

Risk-neutral scenarios are used in the inner loop to calculate the dollar Deltas. We use a multivariate Black–Scholes model introduced by Carmona and Durrelman [18] to generate risk-neutral scenarios. This model is also described in [17].
Let S ( 1 ) , S ( 2 ) , …, S ( k ) be k indices in the financial market. Under the multivariate Black–Scholes model, the risk-neutral dynamics of the k indices are given by [18]:
d   S t ( h ) S t ( h ) = r t d   t + l = 1 k σ h l d   B t ( l ) , S 0 ( h ) = 1 , h = 1 , 2 , , k
or
S t ( h ) = exp 0 t r s d   s t 2 l = 1 k σ h l 2 + l = 1 k σ h l B t ( l ) , h = 1 , 2 , , k ,
where B t ( 1 ) , B t ( 2 ) , …, B t ( k ) are independent standard Brownian motions, r t is the short rate of interest, and the matrix ( σ h l ) is used to capture the correlation among the indices.
Let t 0 = 0 , t 1 = Δ , , t m = m Δ be time steps with equal space Δ and suppose that the continuous forward rate is constant within each period. For j = 1 , 2 , , m , the accumulation factor of the hth index for the period ( t j 1 , t j ) can be calculated as:
A j ( h ) = S j Δ ( h ) S ( j 1 ) Δ ( h ) = exp f j 1 2 l = 1 k σ h l 2 Δ + l = 1 k σ h l Δ Z j ( l ) ,
where f j is the annualized continuous forward rate for period ( t j 1 , t j ) and
Z j ( l ) = B j Δ ( l ) B ( j 1 ) Δ ( l ) Δ .
By the property of Brownian motion, we know that Z 1 ( l ) , Z 2 ( l ) , …, Z m ( l ) are independent random variables with a standard normal distribution.
The continuous return for the period ( t j 1 , t j ) is calculated as:
R j ( h ) = ln A j ( h ) = f j 1 2 l = 1 k σ h l 2 Δ + l = 1 k σ h l Δ Z j ( l ) .
The matrix
σ = σ 11 σ 12 σ 1 k σ 21 σ 22 σ 2 k σ k 1 σ k 2 σ k k
can be obtained from the following Cholesky decomposition of the covariance matrix Σ :
σ · σ = Σ = diag ( ν ) R diag ( ν ) ,
where ν = ( ν 1 , ν 2 , , ν k ) is a vector of index volatilities, diag ( v ) is a diagonal matrix with ν as diagonal elements, and R is the correlation matrix. In matrix form, Equation (4) can be expressed as
R j ( 1 ) R j ( 2 ) R j ( k ) = f j 1 2 ν 1 2 f j 1 2 ν 2 2 f j 1 2 ν k 2 Δ + σ Z j ( 1 ) Z j ( 2 ) Z j ( k ) Δ .
Algorithm 1 shows the pseudo-code of the risk-neutral scenario generator. Once we have index scenarios simulated from Equation (3), we can obtain the investment fund scenarios by blending these index scenarios as follows:
F i j ( h ) = l = 1 k w h l A i j ( l ) , h = 1 , 2 , , g ,
where g is the number of investment funds and
W = w 11 w 12 w 1 k w 21 w 22 w 2 k w g 1 w g 2 w g k
is the fund mapping that maps the k indices to the g investment funds.
Algorithm 1: Pseudo-code of the Risk-neutral Scenario Generator.
Data 03 00031 i001

2.2. Real-World Scenario Generator

Real-world scenarios are used in the outer loop to simulate the movements of the market. Risk-neutral scenarios are prospective and parameters of the risk-neutral scenario generator are calibrated to market data. Real-world scenarios are retrospective and the parameters of a real-world scenario generator are calibrated to historical data.
In practice, the regime-switching model [19] is typically used to generate real-world scenarios. Here, we introduce a multivariate two-regime regime-switching model for generating correlated real-world scenarios for multiple indices. Within a regime and a time period, the evolution of the indices follows the multivariate log-normal model.
Let ρ t denote the regime at time t and M denote the transition matrix, i.e.,
M = p 1 , 1 p 1 , 2 p 2 , 1 p 2 , 2 ,
where
p s , r = P ( ρ t + 1 = r | ρ t = s ) , , s , r = 1 , 2 .
Let π = ( π 1 , π 2 ) be the unconditional probability distribution of the regime-switching process. Then, we have
π M = π ,
which gives [19]:
π 1 = p 2 , 1 p 1 , 2 + p 2 , 1 , π 2 = p 1 , 2 p 1 , 2 + p 2 , 1 .
Let S ( 1 ) , S ( 2 ) , …, S ( k ) be k indices in the financial market. Under the multivariate two-regime regime-switching log-normal model, the risk-world dynamics of the k indices are given by:
S t ( h ) ρ t = exp μ h ( ρ t ) t + l = 1 k σ h l ( ρ t ) B t ( l ) , S 0 ( h ) = 1 , h = 1 , 2 , , k ,
where B t ( 1 ) , B t ( 2 ) , …, B t ( k ) are independent standard Brownian motions, μ h ( ρ t ) is the geometric mean of the hth index in the regime ρ t , the matrix ( σ h l ( ρ t ) ) is used to capture the correlation among the indices in the regime ρ t , and ρ t { 1 , 2 } is the regime number.
Let t 0 = 0 , t 1 = Δ , …, t m = m Δ be time steps, where Δ is the time step. Then, for j = 1 , 2 , , m , the accumulation factor of the hth index for the period ( t j 1 , t j ) in the regime ρ j can be calculated as
A j ( h ) ρ j = S j Δ ( h ) ρ j S ( j 1 ) Δ ( h ) ρ j = exp μ h ( ρ t ) Δ + l = 1 k σ h l ( ρ t ) Δ Z j ( l ) ,
where
Z j ( l ) = B j Δ ( l ) B ( j 1 ) Δ ( l ) Δ .
In matrix form, the returns can be expressed as
R j ( 1 ) R j ( 2 ) R j ( k ) ρ j = ln A j ( 1 ) ln A j ( 2 ) ln A j ( k ) ρ j = μ 1 ( ρ j ) μ 2 ( ρ j ) μ k ( ρ j ) Δ + σ ( ρ j ) Z j ( 1 ) Z j ( 2 ) Z j ( k ) Δ ,
where the matrix σ ( ρ j ) can be obtained from the following Cholesky decomposition of the covariance matrix Σ ( ρ j ) :
σ ( ρ j ) · σ ( ρ j ) = Σ ( ρ j ) = diag ( ν ( ρ j ) ) R ( ρ j ) diag ( ν ( ρ j ) ) ,
where ν ( ρ j ) is a vector of index volatilities for regime ρ j , diag ( v ( ρ j ) ) is a diagonal matrix with ν ( ρ j ) as diagonal elements, and R ( ρ j ) is the correlation matrix for regime ρ j .
Let ρ 0 be the initial regime. Then, for j = 1 , 2 , , m , the regime for period j can be determined by generating a uniform random number u as follows:
ρ j = 2 , if ρ j 1 = 1 and u p 1 , 2 ; 1 , if ρ j 1 = 1 and u > p 1 , 2 ; 1 , if ρ j 1 = 2 and u p 2 , 1 ; 2 , if ρ j 1 = 2 and u > p 2 , 1 .
The continuous return of the hth index for the period ( t j 1 , t j ) in the regime ρ j is calculated
R j ( h ) ρ j = ln A j ( h ) ρ j = μ h ( ρ t ) Δ + l = 1 k σ h l ( ρ t ) Δ Z j ( l ) .
Algorithm 2 shows the pseudo-code of the two-regime regime-switching real-world scenario generator.
Algorithm 2: Pseudo-code of the Two-regime Regime-switching Real-world Scenario Generator.
Data 03 00031 i002
From the above equation, we can derive the expectations and covariances of the conditioned returns as follows:
E R j ( h ) ρ j = μ h ( ρ t ) Δ
and
Cov R j ( h ) ρ j , R j ( m ) ρ j = l = 1 k σ h l ( ρ t ) σ m l ( ρ t ) Δ .
The return of the hth index for the period ( t j 1 , t j ) can be expressed as
R j ( h ) = R j ( h ) ρ j = 1 I { ρ j = 1 } + R j ( h ) ρ j = 2 I { ρ j = 2 } ,
where I is an indicator function.
The expected return for the period ( t j 1 , t j ) can be calculated as p177 in [20]:
E R j ( h ) = E R j ( h ) ρ j = 1 P ( ρ j = 1 ) + E R j ( h ) ρ j = 2 P ( ρ j = 2 ) = μ h ( 1 ) Δ π 1 + μ h ( 2 ) Δ π 2 .
From Equation (14), we have
R j ( h ) R j ( m ) = R j ( h ) ρ j = 1 R j ( m ) ρ j = 1 I { ρ j = 1 } + R j ( h ) ρ j = 2 R j ( m ) ρ j = 2 I { ρ j = 2 } ,
which gives
E R j ( h ) R j ( m ) = π 1 μ h ( 1 ) μ m ( 1 ) Δ 2 + l = 1 k σ h l ( 1 ) σ m l ( 1 ) Δ + π 2 μ h ( 2 ) μ m ( 2 ) Δ 2 + l = 1 k σ h l ( 2 ) σ m l ( 2 ) Δ .
Therefore, we have
Cov R j ( h ) , R j ( m ) = E R j ( h ) R j ( m ) E R j ( h ) E R j ( m ) = π 1 π 2 Δ 2 μ h ( 1 ) μ h ( 2 ) μ m ( 1 ) μ m ( 2 ) + π 1 l = 1 k σ h l ( 1 ) σ m l ( 1 ) Δ + π 2 l = 1 k σ h l ( 2 ) σ m l ( 2 ) Δ .
Letting m = h in the above equation, we get the variance of the return as follows:
Var R j ( h ) = E R j ( h ) 2 E R j ( h ) 2 = π 1 π 2 Δ 2 μ h ( 1 ) μ h ( 2 ) 2 + π 1 l = 1 k σ h l ( 1 ) 2 Δ + π 2 l = 1 k σ h l ( 2 ) 2 Δ = π 1 π 2 Δ 2 μ h ( 1 ) μ h ( 2 ) 2 + π 1 Δ ν h ( 1 ) 2 + π 2 Δ ν h ( 2 ) 2 ,
where ν h ( ρ j ) is the volatility of the hth index in the regime ρ j , i.e.,
ν h ( ρ j ) = l = 1 k σ h l ( ρ j ) 2 .
Equations (15) and (17) can be used to validate the real-world scenarios. These equations can also be used to specify parameters for the two-regime regime-switching model if we want to control the overall mean returns and the overall volatilities of the indices.

2.3. Nested Stochastic Valuation

To describe how nested stochastic projections are done, we let N 1 be the number of outer loop paths and let T 1 be the number of time nodes in the outer loop. Let X = { x 1 , x 2 , , x n } be a portfolio of n VA policies. Algorithm 3 shows a high-level sketch of the nested stochastic valuation engine. At each node along each outer loop path, we calculate the fair market values of each policy using the risk-neutral scenarios. For details about how policies are aged along a real-world path and how the cash flows are projected along a risk-neutral path, readers are referred to [17].
Algorithm 3: A High-level Sketch of the Nested Stochastic Valuation Engine.
Data 03 00031 i003
To assess the performance of dynamic hedging, partial dollar deltas are required as hedging is done by individual tradable indices. The partial dollar delta on the hth index is normally calculated as follows:
F M V ( , A V h 1 , 1.01 A V h , A V h + 1 , ) F M V ( , A V h 1 , 0.99 A V h , A V h + 1 , ) 0.02 ,
where A V h denotes the account value invested in the hth index. However, calculating partial dollar deltas using the above equation requires projecting cash flows at many index shocks. This is prohibitive under the nested stochastic valuation framework.
To reduce the runtime, we only calculate total dollar delta at each node along an outer loop path as follows:
Δ = F M V ( 1.01 A V 1 , , 1.01 A V k ) F M V ( 0.99 A V 1 , , 0.99 A V k ) 0.02 ,
where k is the number of indices. Then, we approximate the partial dollar deltas as follows:
Δ ( h ) = A V h A V 1 + + A V k Δ .
The relation given in Equation (18) can be derived as follows. Suppose that the fair market value of the guarantees embedded in a VA policy is a function of the total account value, i.e.,
F M V = f ( A V 1 + A V 2 + + A V k ) .
Then, the partial dollar delta on the hth index is calculated as
Δ ( h ) = f A V h A V h = f T A · T A A V h · A V h = f T A · T A · A V h T A = Δ · A V h T A ,
where T A = A V 1 + A V k is the total account value.

3. Synthetic Portfolio and Payoffs

In this section, we describe the synthetic portfolio and the payoffs of the guarantees embedded in the VA policies.

3.1. Synthetic Portfolio

We adopted a subset of the synthetic VA portfolio created in [17]. That synthetic portfolio contains 19 types of products, each of which has 10,000 policies. We selected 2000 policies from each product type. The subset contains 38,000 policies. Readers are referred to [17] for a description of the features or variables of the VA policies.
Table 1 shows the number of policies in each product type by gender. About 40% of the policies in each product are female. Table 2 shows the summary statistics of some numerical fields. In the table, we see that all funds have many zeros. This is because many policies generally do not invest in all the funds. The age is the number of years between the birth date and the current date. The time to maturity is calculated from the current date and the maturity date.

3.2. Guarantee Payoffs

We calculated the payoffs of the guarantees for the portfolio along each of the 1000 real-world path. The payoff is calculated as the sum of the death benefit and the living benefit. Figure 3 shows the guarantee payoffs of the portfolio at each month along the 1000 real-world path. In the figure, we see that there are some relatively large guarantee payoffs after the 300th month at some real-world paths. The large payoffs are caused the GMAB products, which allow policyholders to renew. A histogram of the present values of the guarantee payoffs along the real-world paths is shown in Figure 4. From the histogram, we see that the distribution of these present values is positively skewed.
We also calculated the present values of the guarantee payoffs along each of the 1000 real-world paths. Figure 5 shows a histogram of these present values. The histogram shows that the distribution of the present values is positively skewed. At some bad real-world paths, the guarantee payoffs are much larger than those at other real-world paths. Table 3a shows some summary statistics of these present values. At the best real-world path, the guarantee payoff of this portfolio is 11,421 millions. If the worst real-world path occurs, the guarantee payoff of this portfolio is 168,574 millions.
Table 3b shows the conditional tail expectations (CTEs) of the present values of the guarantee payoffs at three different levels. The CTE75 is calculated as the mean of present values from the worst 0.25 × 1000 = 250 real-world paths. In the table, we see that the CTE75 is around 75,595 millions.
Figure 6 shows the 1000 real-world paths of the five indices, which are the large cap equity, the small cap equity, the international equity, the fixed income, and the money market. The dark thick line in each subfigure corresponds to the worst real-world path, which produces the largest present value of the guarantee payoffs. The gray thick line in each subfigure corresponds to the best real-world path, which produces the lowest present value of the guarantee payoffs. In the figure, we see that the best path is above the worst path.
Note that the best real-world path is not the one at the very top and the worst real-world path is not the one at the very bottom. This is because the payoffs of GMAB products in bull market are large. In other words, if the real-world path at the very top occurs, the GMAB products will incur large payoffs because the policyholders can renew by reseting the benefit to the higher of the account value and the existing benefit base.
Figure 5 shows the guarantee payoffs at monthly steps along the best and the worst real-world paths. In the figure, we see that in general the payoffs along the best real-world path are higher than those along the worst real-world path. At a few months near the end of the projection horizon, the payoffs at the best path are higher than those at the worst path. This is caused by the GMAB products, which have higher payoffs at better markets due to the renew feature.

4. Partial Dollar Deltas

In this section, we present the partial dollar deltas calculated by the nested Monte Carlo simulation method described in Section 2.
As discussed in Section 2, the nested stochastic valuation program produces many matrices of the partial dollar deltas. In fact, the program produces N 1 × H matrices of partial dollar deltas, where N 1 is the number of real-world paths and H is the number of indices. For p = 1 , 2 , , N 1 and h = 1 , 2 , , H , let M ( p , h ) be the matrix of the partial dollar deltas on the hth index:
M ( p , h ) = M 1 , 1 ( p , h ) M 1 , 2 ( p , h ) M 1 , T ( p , h ) M 2 , 1 ( p , h ) M 2 , 2 ( p , h ) M 2 , T ( p , h ) M n , 1 ( p , h ) M n , 2 ( p , h ) M n , T ( p , h ) ,
where n is the number of policies in the portfolio, T is the number of time points where partial dollar deltas are calculated, and M i , j ( p , h ) denotes the partial dollar delta of the ith policy on the hth index at jth evaluation time point along the pth real-world path. Since we used N 1 = 1000 real-world paths and T = 31 evaluation time points and the number of indices is H = 5 , the number of matrices we produced is 5000. Each matrix has a size of 38 , 000 × 31 . We saved all the matrices to CSV files with only six decimal places. If zip all the CSV files, the size of the zip file is around 20 GB.

4.1. Aggregate Results

The aggregate partial dollar deltas along a real-world path are calculated as follows:
M j ( p , h ) = i = 1 n M i , j ( p , h ) .
In other words, the aggregate partial dollar deltas are the partial dollar deltas of the whole portfolio. The aggregate total dollar deltas are calculated as
M j ( p ) = h = 1 H i = 1 n M i , j ( p , h ) .
Figure 7 shows the aggregate partial dollar deltas and aggregate total dollar deltas along the 1000 real-world paths. In the figure, we have the following observations:
  • The aggregate partial dollar deltas do not approach zero at the end of the projection horizon. This is caused by the GMAB products, which behave similar to call options.
  • The guarantees are more sensitive to indices with higher volatilities. For example, the magnitudes of the aggregate partial dollar deltas on the small cap equity are larger than those on other indices.
  • The aggregate total dollar deltas along the best and the worst real-world paths have similar magnitudes. This is because the dollar deltas of the GMAB product offset those of other products.
  • For equity indices, which have high volatilities, the aggregate partial dollar deltas have similar magnitudes along the best and the worst real-world paths. For non-equity indices, which have low volatilities, the aggregate partial dollar deltas along the worst real-world path have higher magnitudes than those along the best real-world path.
Figure 8 and Figure 9 show the histograms of the aggregate partial dollar deltas at the Year 1 and the Year 30, respectively. In the figures, we see that the distributions of the aggregate partial dollar deltas at the Year 30 is more skewed that those at the Year 1.

4.2. Seriatim Results

There are many seriatim results, making it difficult to show all the results in detail. In this section, we only show the seriatim results from the best and the worst real-world paths identified before. Figure 10 shows a histogram of the seriatim partial dollar deltas at the end of Year 1 if the best real-world path occurs. Figure 11 shows a similar histogram if the worst real-world path occurs. Both figures show that the distributions of the seriatim partial dollar deltas are highly skewed. In addition, some policies have positive dollar deltas if the best real-world path occurs. This is caused by the GMAB products as a bull market can trigger the renew option embedded in such products.
Figure 12 and Figure 13 show the box plots of seriatim partial dollar deltas by product type at the end of Year 1 along the best and the worst real-world paths, respectively. In these figures, we see that the GMAB, GMIB, and GMMB products are more sensitive than the GMDB and GMWB products in terms of the magnitudes of the deltas. In addition, more policies have positive deltas when the best real-world path occurs than the case when the worst real-world path occurs.

4.3. Runtime

We implemented the nested stochastic valuation engine as a distributed multi-threading program in Java. We used the HPC (High Performance Computing) cluster (https://hpc.uconn.edu/) at the University of Connecticut to run the program. In particular, we used eight instances of the program with 20 cores for each instance to calculate the partial dollar deltas for the portfolio. Each instance of the program handles one outer loop path at a time. The coordination between different instances is done via the mechanism of file locking. Even with 160 cores, it took about two weeks to get all the calculations done.
For the convenience of comparison, we accumulate the runtime used by all threads to get the runtime that would be used by a single core. Figure 14 shows a histogram of the runtime used to calculate the partial dollar deltas for an outer loop path. In the figure, we see that, if a single core is used, it would take the core about 20–32 h to finish the calculation for a single outer loop path. If we aggregate the runtime used to process all 1000 outer loop paths, the runtime is 93,722,002.966 s or 2.97 years. In other words, if we used a single CPU to calculate the partial dollar deltas for the portfolio of 38,000 VA policies with 1000 real-world path and 1000 risk-neutral paths, it would take this CPU about 2.97 years to finish the calculation. Note that we only calculated the partial dollar deltas at 30 time points along the outer loop paths. If we want to calculate the deltas at 360 time points along the outer loop paths, it would take a single core about 36 years.

5. Concluding Remarks

Metamodeling techniques have been proposed to address the computational issues associated with the nested stochastic valuation of large VA portfolios. However, it is difficult for researchers to obtain real datasets from insurance companies to test the metamodeling techniques and publish the results in academic journals. It is the primary purpose of this paper to create synthetic datasets to address computational issues. These synthetic datasets can be used by researchers and practitioners to test techniques, especially metamodeling techniques, to speed up the nested stochastic valuation of large VA portfolios.
These synthetic datasets have some limitations. First, the synthetic VA policies are simpler than VA policies sold in the real-world. Second, the Monte Carlo simulation is also simpler than the one used in practice. For example, we did not consider the policyholder behavior in the cash flow projections. Although the synthetic datasets have limitations, we can still use them to test metamodeling techniques. If a metamodeling technique does not work for the synthetic datasets, then it is unlikely to work for real datasets.

Author Contributions

Both G.G. and E.A.V. conceived the primary idea of the methodology used. G.G. wrote the Java code to implement the methodology and create the synthetic data. In consultation with E.A.V., G.G. analyzed the data and produced the output. Both authors had periodic meetings to discuss the results and the writing of the manuscript.

Funding

This work is supported by a CAE (Centers of Actuarial Excellence) grant (http://actscidm.math.uconn.edu) from the Society of Actuaries.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Software and Datasets

We implemented the nested stochastic valuation engine as a distributed multi-threading program in Java. The datasets and the software code can be downloaded from http://www.math.uconn.edu/~gan/software.html.

References

  1. Ledlie, M.C.; Corry, D.P.; Finkelstein, G.S.; Ritchie, A.J.; Su, K.; Wilson, D.C.E. Variable Annuities. Br. Actuarial J. 2008, 14, 327–389. [Google Scholar] [CrossRef]
  2. The Geneva Association Report. Variable annuities—An analysis of financial stability. Available online: https://www.genevaassociation.org/sites/default/files/research-topics-document-type/pdf_public/ga2013-variable_annuities_0.pdf (accessed on 30 August 2018).
  3. Boyle, P.; Hardy, M. Reserving for maturity guarantees: Two approaches. Insurance Math. Econ. 1997, 21, 113–127. [Google Scholar] [CrossRef]
  4. Chopra, D.; Erzan, O.; de Gantes, G.; Grepin, L.; Slawner, C. Responding to the Variable Annuity Crisis. McKinsey Working Papers on Risk. 2009. Available online: https://www.mckinsey.com/business-functions/risk/our-insights/responding-to-the-variable-annuity-crisis (accessed on 30 August 2018).
  5. International Actuarial Association. Stochastic Modeling: Theory and Reality from an Actuarial Perspective; International Actuarial Association: Ontario, Canada, 2010. [Google Scholar]
  6. Dardis, T. Model Efficiency in the U.S. Life Insurance Industry. Model. Platform 2016, 3, 9–16. [Google Scholar]
  7. Gan, G. Application of data clustering and machine learning in variable annuity valuation. Insurance Math. Econ. 2013, 53, 795–801. [Google Scholar] [CrossRef]
  8. Gan, G.; Lin, X.S. Valuation of large variable annuity portfolios under nested simulation: A functional data approach. Insurance Math. Econ. 2015, 62, 138–150. [Google Scholar] [CrossRef]
  9. Gan, G. Application of Metamodeling to the Valuation of Large Variable Annuity Portfolios. In Proceedings of the Winter Simulation Conference, Huntington Beach, CA, USA, 6–9 December 2015; pp. 1103–1114. [Google Scholar]
  10. Gan, G.; Lin, X.S. Efficient Greek Calculation of Variable Annuity Portfolios for Dynamic Hedging: A Two-Level Metamodeling Approach. N. Am. Actuarial J. 2017, 21, 161–177. [Google Scholar] [CrossRef]
  11. Gan, G.; Valdez, E.A. An Empirical Comparison of Some Experimental Designs for the Valuation of Large Variable Annuity Portfolios. Depend. Model. 2016, 4, 382–400. [Google Scholar]
  12. Hejazi, S.A.; Jackson, K.R. A neural network approach to efficient valuation of large portfolios of variable annuities. Insurance Math. Econ. 2016, 70, 169–181. [Google Scholar] [CrossRef] [Green Version]
  13. Gan, G.; Huang, J. A Data Mining Framework for Valuing Large Portfolios of Variable Annuities. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, 13–17 August 2017; pp. 1467–1475. [Google Scholar]
  14. Hejazi, S.A.; Jackson, K.R.; Gan, G. A Spatial Interpolation Framework for Efficient Valuation of Large Portfolios of Variable Annuities. Quant. Financ. Econ. 2017, 1, 125–144. [Google Scholar]
  15. Gan, G.; Valdez, E.A. Regression Modeling for the Valuation of Large Variable Annuity Portfolios. N. Am. Actuarial J. 2018, 22, 40–54. [Google Scholar] [CrossRef]
  16. Xu, W.; Chen, Y.; Coleman, C.; Coleman, T.F. Moment matching machine learning methods for risk management of large variable annuity portfolios. J. Econ. Dyn. Control 2018, 87, 1–20. [Google Scholar] [CrossRef]
  17. Gan, G.; Valdez, E.A. Valuation of Large Variable Annuity Portfolios: Monte Carlo Simulation and Synthetic Datasets. Depend. Model. 2017, 5, 354–374. [Google Scholar] [CrossRef]
  18. Carmona, R.; Durrelman, V. Generalizing the Black-Scholes Formula to Multivariate Contingent Claims. J. Comput. Financ. 2006, 9, 43–67. [Google Scholar] [CrossRef]
  19. Hardy, M. A Regime-Switching Model of Long-Term Stock Returns. N. Am. Actuarial J. 2001, 5, 41–53. [Google Scholar] [CrossRef]
  20. Gan, G.; Ma, C.; Xie, H. Measure, Probability, and Mathematical Finance: A Problem-Oriented Approach; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2014. [Google Scholar]
Figure 1. Variable annuity sales in the United States from 2008 to 2017. The numbers are obtained from LIMRA Secure Retirement Institute.
Figure 1. Variable annuity sales in the United States from 2008 to 2017. The numbers are obtained from LIMRA Secure Retirement Institute.
Data 03 00031 g001
Figure 2. Nested stochastic projections.
Figure 2. Nested stochastic projections.
Data 03 00031 g002
Figure 3. Guarantee payoffs along the 1000 real-world paths.
Figure 3. Guarantee payoffs along the 1000 real-world paths.
Data 03 00031 g003
Figure 4. Present values of the guarantee payoffs.
Figure 4. Present values of the guarantee payoffs.
Data 03 00031 g004
Figure 5. Guarantee payoffs along the worst (the dark line) and the best (the gray line) real-world paths.
Figure 5. Guarantee payoffs along the worst (the dark line) and the best (the gray line) real-world paths.
Data 03 00031 g005
Figure 6. Real-world paths of the indices. The dark thick line is the worst real-world path. The gray thick line is the best real-world path.
Figure 6. Real-world paths of the indices. The dark thick line is the worst real-world path. The gray thick line is the best real-world path.
Data 03 00031 g006
Figure 7. Aggregate partial dollar deltas and aggregate total dollar delta along the 1000 real-world paths. The dark thick and the gray thick lines correspond to the worst and the best real-world paths, respectively.
Figure 7. Aggregate partial dollar deltas and aggregate total dollar delta along the 1000 real-world paths. The dark thick and the gray thick lines correspond to the worst and the best real-world paths, respectively.
Data 03 00031 g007
Figure 8. A histogram of the aggregate partial dollar deltas along the 1000 real-world paths at the end of Year 1. The numbers are in millions.
Figure 8. A histogram of the aggregate partial dollar deltas along the 1000 real-world paths at the end of Year 1. The numbers are in millions.
Data 03 00031 g008
Figure 9. A histogram of the aggregate partial dollar deltas along the 1000 real-world paths at the end of Year 30. The numbers are in millions.
Figure 9. A histogram of the aggregate partial dollar deltas along the 1000 real-world paths at the end of Year 30. The numbers are in millions.
Data 03 00031 g009
Figure 10. Histograms of the seriatim partial dollar deltas at the end of Year 1 of the best real-world path.
Figure 10. Histograms of the seriatim partial dollar deltas at the end of Year 1 of the best real-world path.
Data 03 00031 g010
Figure 11. Histograms of the seriatim partial dollar deltas at the end of Year 1 of the worst real-world path.
Figure 11. Histograms of the seriatim partial dollar deltas at the end of Year 1 of the worst real-world path.
Data 03 00031 g011
Figure 12. Box plots of the partial dollar deltas by product type at the end of Year 1 of the best real-world path.
Figure 12. Box plots of the partial dollar deltas by product type at the end of Year 1 of the best real-world path.
Data 03 00031 g012
Figure 13. Box plots of the partial dollar deltas by product type at the end of Year 1 of the worst real-world path.
Figure 13. Box plots of the partial dollar deltas by product type at the end of Year 1 of the worst real-world path.
Data 03 00031 g013
Figure 14. Distribution of the runtime for the 1000 real-world paths.
Figure 14. Distribution of the runtime for the 1000 real-world paths.
Data 03 00031 g014
Table 1. Distribution of gender by product type.
Table 1. Distribution of gender by product type.
GenderABRPABRUABSUDBABDBIBDBMBDBRP
F779768809787759833805
M1221123211911213124111671195
GenderDBRUDBSUDBWBIBRPIBRUIBSUMBRP
F785782789782826824775
M1215121812111218117411761225
GenderMBRUMBSUWBRPWBRUWBSU
F798798813812781
M12021202118711881219
Table 2. Summary statistics of some fields. Note that age and ttm are calculated from the birth date, valuation date, and maturity date.
Table 2. Summary statistics of some fields. Note that age and ttm are calculated from the birth date, valuation date, and maturity date.
Min1st QMean3rd QMax
gbAmt0.00187,601.23327,213.71446,403.841,060,311.72
gmwbBalance0.000.0035,501.020.00499,708.73
withdrawal0.000.0022,605.360.00499,585.73
FundValue10.000.0033,632.3050,083.85798,936.37
FundValue20.000.0038,673.1857,221.551,026,213.34
FundValue30.000.0026,778.1439,154.69752,945.34
FundValue40.000.0026,231.2539,331.61566,338.64
FundValue50.000.0022,768.9134,841.42481,399.12
FundValue60.000.0035,386.6452,585.381,042,335.65
FundValue70.000.0029,898.1844,511.19806,540.12
FundValue80.000.0030,303.8745,505.06704,720.85
FundValue90.000.0029,983.6844,034.63851,307.63
FundValue100.000.0030,092.1345,276.64691,822.70
age34.5242.1149.5656.9664.46
ttm0.5910.2614.4918.6828.52
Table 3. Summary statistics and conditional tail expectations of the present values of the guarantee payoffs. The numbers are in millions.
Table 3. Summary statistics and conditional tail expectations of the present values of the guarantee payoffs. The numbers are in millions.
(a)
Min.1st Qu.MedianMean3rd Qu.Max.
11,42130,39340,55645,53554,793168,574
(b)
CTE50CTE75CTE95
61,332.6875,594.85108,113.29

Share and Cite

MDPI and ACS Style

Gan, G.; Valdez, E.A. Nested Stochastic Valuation of Large Variable Annuity Portfolios: Monte Carlo Simulation and Synthetic Datasets. Data 2018, 3, 31. https://doi.org/10.3390/data3030031

AMA Style

Gan G, Valdez EA. Nested Stochastic Valuation of Large Variable Annuity Portfolios: Monte Carlo Simulation and Synthetic Datasets. Data. 2018; 3(3):31. https://doi.org/10.3390/data3030031

Chicago/Turabian Style

Gan, Guojun, and Emiliano A. Valdez. 2018. "Nested Stochastic Valuation of Large Variable Annuity Portfolios: Monte Carlo Simulation and Synthetic Datasets" Data 3, no. 3: 31. https://doi.org/10.3390/data3030031

Article Metrics

Back to TopTop