Time-Consistent Investment-Reinsurance Strategies for the Insurer and the Reinsurer under the Generalized Mean-Variance Criteria

Xiao, Helu; Ren, Tiantian; Bai, Yanfei; Zhou, Zhongbao

doi:10.3390/math7090857

¹ School of Business, Hunan Normal University, Changsha 410081, China

² School of Business Administration, Hunan University, Changsha 410082, China

* Author to whom correspondence should be addressed.

Mathematics 2019, 7(9), 857; https://doi.org/10.3390/math7090857

Submission received: 16 August 2019 / Revised: 10 September 2019 / Accepted: 14 September 2019 / Published: 17 September 2019

Abstract:

Most of the existing literature on optimal investment-reinsurance studies the problem only from the insurer’s perspective and treats the investment-reinsurance decision as a continuous process. In practice, however, the interests of the reinsurer cannot be ignored, nor can decision-makers trade continuously. Under the discrete-time framework, we first propose a multi-period investment-reinsurance optimization problem that considers the joint interests of the insurer and the reinsurer, whose performance is measured by two generalized mean-variance criteria. We derive the time-consistent investment-reinsurance strategies for the proposed model by maximizing the weighted sum of the insurer’s and the reinsurer’s mean-variance objectives, and we discuss these strategies under two special premium principles. Finally, we provide some numerical simulations to show the impact of the intertemporal restrictions on the time-consistent investment-reinsurance strategies. The results indicate that the intertemporal restrictions urge the insurer and the reinsurer to shrink the position invested in the risky asset; for the time-consistent reinsurance strategy, however, the impact of the intertemporal restrictions depends on who is the leader in the proposed model.

Keywords:

investment and reinsurance; insurer and reinsurer; generalized mean-variance criteria; time-consistent strategy

1. Introduction

Since the insurer and the reinsurer can be allowed to invest their wealth in the securities market, they can obtain profits not only by collecting premiums but also by investing in securities. Different from the other institutional investors, the insurer and the reinsurer will face the double risks that exist in both the insurance market and the securities market. In order to reduce the risk of claims, the insurer can purchase reinsurance contracts from the reinsurer and transfer part of the risk of claims to the reinsurer, because the reinsurer is more risk-seeking than the insurer. Therefore, how to design a suitable reinsurance contract is also the concern of the insurer and the reinsurer. Obviously, the setting of the reinsurance contract depends on the mutual agreement between the insurer and the reinsurer. However, most of the existing literature mainly focuses on the optimal investment-reinsurance problems only from the perspective of the insurer, while the interest of the reinsurer is generally ignored (e.g., Schmidli [1], Zeng and Li [2], Zhu et al. [3], Huang et al. [4], Hu and Wang [5], Deng et al. [6] and so on). Actually, the optimal reinsurance contract for the insurer may not be optimal or even unacceptable for the reinsurer. That is, the reinsurance contract should be designed to take into account the interests of both the insurer and the reinsurer. To address this problem, we will propose an investment-reinsurance optimization problem considering the joint interests of the insurer and the reinsurer, and the corresponding investment-reinsurance strategy will be investigated.

As far as we know, some researchers have paid attention to the joint interests of the insurer and the reinsurer. Li et al. [7] considered the weighted sum of an insurer’s and a reinsurer’s mean-variance objectives and aimed to find the corresponding time-consistent reinsurance-investment strategy. Li et al. [8] discussed the optimal investment-reinsurance strategy by maximizing the expected exponential utility of the weighted sum of the insurer’s and the reinsurer’s terminal wealth. Under the mean-variance criterion, Zhao et al. [9] also discussed the time-consistent investment-reinsurance strategy by maximizing the utility of a weighted sum of the insurer’s and the reinsurer’s surplus processes. Zhou et al. [10] derived the optimal investment-reinsurance strategy with consideration of the joint interests of the insurer and the reinsurer, and also assumed that the decision-maker was an ambiguity-averse manager. Huang et al. [11] investigated a robust optimal investment and reinsurance problem on considering the product of the insurer’s and the reinsurer’s utilities. Obviously, the above optimization models on the joint interests of the insurer and the reinsurer can be classified into the following two categories. The first kind of model is built by maximizing the utility of a weighted sum of the insurer’s and the reinsurer’s surplus processes, while the second one is constructed by maximizing the weighted sum/product of the insurer’s and the reinsurer’s utilities. The former assumes that the insurer and the reinsurer have the same risk aversion coefficients, while the latter considers that they have different risk aversion coefficients. Actually, the latter is more compatible with reality, since the reinsurer is more risk-seeking compared to the insurer. In this paper, we mainly focus on deriving the corresponding time-consistent strategies by maximizing the weighted sum of the insurer’s and the reinsurer’s mean-variance objectives.

Additionally, the above literature is limited to the study of continuous-time problems, while the discrete-time problems are always ignored by researchers. In fact, the discrete-time setting is more realistic to decision-makers, because they cannot trade continuously since it will generate a lot of transaction costs. Brandt [12] also pointed out that the continuous-time strategies are often inadmissible in discrete time because they may cause negative wealth. Especially for insurers and reinsurers, their surplus processes are more likely to be negative values, because they bear the double risks from the insurance market and the securities market. More importantly, Zhu et al. [13] also presented that the bankruptcies occurring in the earlier stages of investments were greater than those in the later stages. The main reason is that the classical multi-period mean-variance optimizations only consider the performance of the terminal wealth. To deal with this problem, Costa and Nabholz [14] proposed a generalized mean-variance model considering the intertemporal restrictions (i.e., the investors will maximize the weighted sum of the mean-variance objectives over all the periods). Under this framework, the terminal and intermediate performance of the portfolio will be included in the decision-making. There are still many studies on this subject, such as Costa and Araujo [15], Costa and de Oliveira [16], Cui et al. [17], He et al. [18], Zhou et al. [19], Xiao et al. [20] and so on. Inspired by the works mentioned above, in this paper, we will build a generalized multiperiod mean-variance investment-reinsurance optimization model considering the joint interests of the insurer and the reinsurer.

However, the proposed model cannot be solved directly by the dynamic programming approach, because the variance operator does not satisfy the iterated-expectation property. To the best of our knowledge, there are two methods to solve this problem. The first method is the embedding scheme proposed by Li and Ng [21], and the derived optimal strategy is called the precommitment strategy. However, some researchers point out that the precommitment strategy is not time-consistent, since future changes are not taken into account. The second method, provided by Basak and Chabakauri [22] and Björk and Murgoci [23] and called the game method, yields a time-consistent strategy for decision-makers. Under the game framework, the decision-makers treat the optimization problem as a noncooperative game, and its Nash equilibrium solution is defined as the time-consistent strategy. Since then, many researchers have applied the game method to derive time-consistent solutions of various multiperiod mean-variance optimization problems, such as Björk and Murgoci [24], Bensoussan et al. [25], Wu and Zeng [26], Zhou et al. [19], Xiao et al. [20] and so on. Based on the game method, in this paper we investigate the time-consistent investment-reinsurance strategies for the generalized multiperiod mean-variance optimization problem considering the joint interests of the insurer and the reinsurer. In this framework, both the intermediate and the terminal performance of the insurer and the reinsurer can be considered.

Motivated by the above studies, we assume the insurer and the reinsurer can invest their wealth in one risky asset and one risk-free asset, as well as the insurer can purchase a proportional reinsurance contract from the reinsurer. We use two generalized mean-variance criteria to measure the performance of the insurer and the reinsurer, and further propose a generalized multi-period investment-reinsurance optimization considering the joint interests of the insurer and the reinsurer by maximizing the weighted sum of the insurer’s and the reinsurer’s mean-variance objectives. We apply the game method to derive the time-consistent investment-reinsurance strategies for the proposed model and also discuss the time-consistent strategies under the expected and variance value principles. Finally, we provide some numerical simulations to show the impact of the intertemporal restrictions on the time-consistent strategies.

Different from the existing literature, this paper makes four contributions. (i) We first propose a generalized multi-period investment-reinsurance optimization problem under the discrete-time framework, in which the joint interests of the insurer and the reinsurer are also considered. It is more realistic to consider the joint interests of the insurer and the reinsurer in the discrete-time framework. On the one hand, it avoids the negative wealth that may be caused by the accumulation of transaction costs under continuous trading. On the other hand, it ensures that the reinsurance contract is optimal for both the insurer and the reinsurer. (ii) We consider the impact of intertemporal restrictions on the decision-making; that is, the insurer and the reinsurer are concerned not only with the terminal performance but also with the intermediate performance of their portfolios, which is absent from the existing continuous-time literature (e.g., Li et al. [7], Zhao et al. [9], Huang et al. [11] and so on). In this framework, the insurer and the reinsurer can adjust the intertemporal restrictions dynamically according to their own risk appetites and the market environment. (iii) We first derive the time-consistent investment-reinsurance strategies rather than the traditional precommitment strategies. Compared with the precommitment strategies, the time-consistent strategies might be more suitable for decision-makers who are more rational and sophisticated, since they take possible future revisions into account. (iv) We first investigate the impact of the intertemporal restrictions on the time-consistent strategies. The interesting finding is that the intertemporal restrictions urge the insurer and the reinsurer to shrink the position invested in the risky asset. For the time-consistent reinsurance strategy, however, the role of the intertemporal restrictions depends on who is the leader in the proposed model. When the insurer is the leader, the intertemporal restrictions reduce the retention level of claims; when the reinsurer is the leader, the intertemporal restrictions shrink the retention level of claims only if the impact of the reinsurer’s leading role is higher than that of the intertemporal restrictions.

The remainder of this paper is organized as follows. In Section 2, we construct a generalized multi-period mean-variance optimization model considering the joint interests of the insurer and the reinsurer. In Section 3, we derive the time-consistent strategies by using the game method. In Section 4, we give some numerical examples to show the differences of the time-consistent strategies under different settings. Finally, we summarize the conclusions of this paper.

2. Generalized Multi-Period Mean-Variance Investment-Reinsurance Optimization Considering Both the Insurer and the Reinsurer

In this paper, we assume that both an insurer and a reinsurer will simultaneously enter the financial market at time 0 to carry out investment-reinsurance activities. Suppose that the insurer and the reinsurer have the initial wealth of

w_{0}^{1}

and

w_{0}^{2}

, respectively, and they plan to invest all of this wealth in the capital market over a time horizon T. We suppose that the insurer and the reinsurer are both allowed to invest their wealth in a risk-free asset and a risky asset (note that the risky assets they invest in have different random returns), where the risk-free asset has a deterministic return

s_{t}

and the random return on the risky asset invested by the insurer is

e_{t}^{1}

, while the risky asset invested by the reinsurer has the random return

e_{t}^{2}

. Let

u_{t}^{1}

and

u_{t}^{2}

represent the amount that the insurer and the reinsurer invest in the risky asset at the beginning of the time period t, respectively. Assume that

w_{t}^{1}

and

w_{t}^{2}

denote the insurer’s and the reinsurer’s wealth at time period t, then the wealth allocated in the risk-free asset can be expressed as

w_{t}^{1} - u_{t}^{1}

and

w_{t}^{2} - u_{t}^{2}

, respectively. In addition to the investment, we also assume that a proportional reinsurance contract is applied between the insurer and the reinsurer. The proportion covered by the insurer at the time period t is denoted by

q_{t}

, where

q_{t} \in [0, 1]

,

t = 0, 1, . . ., T - 1

. Under this reinsurance contract, the insurer only needs to bear the claim amount

q_{t} z_{t}

when facing a claim

z_{t}

at the time period t. In this case,

q_{t}

can also be treated as the retention level of the claim

z_{t}

. Meanwhile, the reinsurer undertakes the rest of the claim amount

(1 - q_{t}) z_{t}

, and receives a premium

δ_{t} (q_{t})

from the insurer. Let

c_{t}

denote the premium income of the insurer at the time period t (note that

c_{t}

is assumed to be a determinate value), then the remaining premium of the insurer can be expressed as

c_{t} - δ_{t} (q_{t})

,

t = 0, 1, . . ., T - 1

. Therefore, the wealth process of the insurer and the reinsurer can be shown as follows.

\begin{aligned} w_{t + 1}^{1} & = s_{t} (w_{t}^{1} - u_{t}^{1}) + e_{t}^{1} u_{t}^{1} + c_{t} - δ_{t} (q_{t}) - q_{t} z_{t} \\ & = s_{t} w_{t}^{1} + c_{t} + P_{t}^{1} u_{t}^{1} - δ_{t} (q_{t}) - q_{t} z_{t}, \end{aligned}

(1)

and:

\begin{aligned} w_{t + 1}^{2} & = s_{t} (w_{t}^{2} - u_{t}^{2}) + e_{t}^{2} u_{t}^{2} + δ_{t} (q_{t}) - (1 - q_{t}) z_{t} \\ & = s_{t} w_{t}^{2} + P_{t}^{2} u_{t}^{2} + δ_{t} (q_{t}) - (1 - q_{t}) z_{t}, \end{aligned}

(2)

where,

P_{t}^{1} = e_{t}^{1} - s_{t}

and

P_{t}^{2} = e_{t}^{2} - s_{t}

denote the excess return of risky assets 1 and 2, respectively, and

t = 0, 1, . . ., T - 1

.
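The wealth dynamics in Equations (1) and (2) can be sketched as a single update step. This is a minimal illustration with scalar inputs; the function name `wealth_step` and the `premium` callable standing in for δ_t(q_t) are assumptions made for the example, not notation from the paper.

```python
def wealth_step(w1, w2, u1, u2, q, s, e1, e2, c, z, premium):
    """Advance the insurer's and reinsurer's wealth by one period.

    w1, w2  : current wealth of the insurer and the reinsurer
    u1, u2  : amounts invested in the respective risky assets
    q       : retention level in [0, 1]
    s       : risk-free return; e1, e2: realized risky returns
    c       : insurer's premium income; z: realized claim
    premium : hypothetical callable q -> reinsurance premium delta_t(q)
    """
    if not 0.0 <= q <= 1.0:
        raise ValueError("retention level q must lie in [0, 1]")
    delta = premium(q)
    p1 = e1 - s  # excess return of risky asset 1
    p2 = e2 - s  # excess return of risky asset 2
    # Second lines of Equations (1) and (2).
    w1_next = s * w1 + c + p1 * u1 - delta - q * z
    w2_next = s * w2 + p2 * u2 + delta - (1.0 - q) * z
    return w1_next, w2_next
```

Because the premium and the claim split are pure transfers between the two parties, the sum of the two updated wealth levels does not depend on q or on the premium function, which gives a quick sanity check for any implementation.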

Apparently, the insurer and the reinsurer have conflicts of interest because of the reinsurance contract. Therefore, the formulation of the investment-reinsurance strategy should consider both the insurer and the reinsurer. In addition, since the insurer and the reinsurer not only face the risk of claims but also bear the investment risk of the securities market, they are more risk-averse compared to other institutional investors. More importantly, Zhu et al. [13] and Zhou et al. [19] showed that the precommitment and time-consistent strategies derived from the classical mean-variance model will lead to higher bankruptcy probabilities in the earlier periods of an investment. The cause is that the classical mean-variance model only considers the terminal performance of a portfolio and its intermediate performance is ignored. Therefore, the classical multiperiod mean-variance model may not be the best choice for the insurer and the reinsurer. To address this problem, we use two generalized mean-variance criteria to measure the performance of the insurer and the reinsurer. In this setting, the intermediate and terminal performance can be both considered. Simultaneously, we take the weighted sum of the insurer’s and the reinsurer’s mean-variance criteria into account, so as to measure the joint interests of the insurer and the reinsurer. Under this generalized mean-variance framework, the corresponding multiperiod investment-reinsurance optimization problem can be formulated as follows.

\begin{aligned} \max_{π} \quad & α \sum_{t = 1}^{T} ξ_{t}^{1} [E (w_{t}^{1}) - η_{t}^{1} \mathrm{Var} (w_{t}^{1})] + (1 - α) \sum_{t = 1}^{T} ξ_{t}^{2} [E (w_{t}^{2}) - η_{t}^{2} \mathrm{Var} (w_{t}^{2})] \\ \mathrm{s.t.} \quad & \begin{cases} w_{t + 1}^{1} = s_{t} w_{t}^{1} + c_{t} + P_{t}^{1} u_{t}^{1} - δ_{t} (q_{t}) - q_{t} z_{t}, & t = 0, 1, \ldots, T - 1, \\ w_{t + 1}^{2} = s_{t} w_{t}^{2} + P_{t}^{2} u_{t}^{2} + δ_{t} (q_{t}) - (1 - q_{t}) z_{t}, & t = 0, 1, \ldots, T - 1 . \end{cases} \end{aligned}

(3)

Here, we let

π = (π_{0}, π_{1}, . . ., π_{T - 1})

,

π_{k} = (u_{k}^{1}, u_{k}^{2}, q_{k}), k = 0, 1, . . ., T - 1

, and the weight

α

satisfy the condition that

α \in [0, 1]

. Note that

α

can be regarded as the weighting coefficient between the benefit of the insurer and that of the reinsurer. Intuitively,

α > 0.5

indicates that more weight is placed on the benefit of the insurer (i.e., the insurer is the leader);

α < 0.5

means that more weight is placed on the benefit of the reinsurer (i.e., the reinsurer is the leader); and for

α = 0.5

, the interests of the insurer and the reinsurer are considered equally important. Additionally, we also assume that

ξ_{t}^{1}

and

ξ_{t}^{2}

are both 0-1 variables, where

ξ_{t}^{1} = 1

(

ξ_{t}^{2} = 1

) means that the insurer (reinsurer) considers the intertemporal restriction at time period t; otherwise, the intertemporal restriction at that period is ignored,

t = 1, 2, . . ., T

. Further, parameters

η_{t}^{1}

and

η_{t}^{2}

denote the risk aversion coefficients of the insurer and the reinsurer at time period t, respectively,

t = 1, 2, . . ., T

.
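To make the objective in Equation (3) concrete, the following sketch estimates it from simulated wealth scenarios. It is an illustration under stated assumptions: the function name, the list-of-lists scenario layout, and the use of the cross-scenario population variance as an estimate of Var(w_t) are choices made here, not part of the paper.

```python
from statistics import fmean, pvariance


def generalized_mv_objective(paths1, paths2, alpha, xi1, eta1, xi2, eta2):
    """Estimate the weighted generalized mean-variance objective.

    paths1[n][t - 1] is the insurer's wealth at period t on scenario n
    (paths2 likewise for the reinsurer), for t = 1, ..., T.
    xi1, xi2   : 0-1 switches for the intertemporal restrictions
    eta1, eta2 : risk aversion coefficients per period
    alpha      : weight on the insurer, in [0, 1]
    """
    horizon = len(xi1)
    total = 0.0
    for t in range(horizon):
        col1 = [path[t] for path in paths1]  # insurer's wealth across scenarios
        col2 = [path[t] for path in paths2]  # reinsurer's wealth across scenarios
        total += alpha * xi1[t] * (fmean(col1) - eta1[t] * pvariance(col1))
        total += (1.0 - alpha) * xi2[t] * (fmean(col2) - eta2[t] * pvariance(col2))
    return total
```

Setting every switch except the last one to zero recovers a terminal-wealth-only mean-variance criterion, while switching on earlier periods adds their mean-variance terms to the sum.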

Similar to the existing literature, we further assume that

e_{t} = {[e_{t}^{1}, e_{t}^{2}]}^{'}

and

z_{t}

are statistically independent,

t = 0, 1, . . ., T - 1

. In other words, the covariance

\mathrm{Cov} (e_{t_{1}}, e_{t_{2}}) = 0

for

t_{1} \neq t_{2}

, and the covariance

\mathrm{Cov} (e_{t_{1}}, z_{t_{2}}) = 0

, where

t_{1}, t_{2} = 0, 1, . . ., T - 1

. Let

μ_{t}^{i} = E (P_{t}^{i})

,

σ_{t}^{i} = \mathrm{Var} (P_{t}^{i})

,

θ_{t} = \mathrm{Cov} (P_{t}^{1}, P_{t}^{2})

,

{\tilde{μ}}_{t} = E (z_{t})

and

{\tilde{σ}}_{t} = \mathrm{Var} (z_{t})

, where

i = 1, 2

and

t = 0, 1, . . ., T - 1

. In addition, for convenience, we define the notations that

\sum_{t = k}^{l} (\cdot) = 0

and

\prod_{t = k}^{l} (\cdot) = 1

for

k > l

. Under this framework, we aim to provide some suitable investment-reinsurance strategies for both the insurer and the reinsurer.

3. Time-Consistent Solution of the Generalized Model

The variance operator does not satisfy the iterated-expectation property, so Equation (3) is time-inconsistent; that is, it cannot be solved directly by the dynamic programming approach. As with the classical multiperiod mean-variance model, there are two approaches to solving Equation (3). The first is the embedding scheme provided by Li and Ng [21], and the derived strategy is called the precommitment strategy. However, some researchers point out that the precommitment strategy is not time-consistent, since it does not take future modifications into account. The second, proposed by Basak and Chabakauri [22] and Björk and Murgoci [23] and known as the game approach, provides a time-consistent strategy for decision-makers. In this paper, we mainly focus on providing a time-consistent investment-reinsurance strategy for rational decision-makers who take future decision modifications into account. Similar to Björk and Murgoci [23], we define the following time-varying mean-variance optimization sub-objective.

J_{k} (w_{k}^{1}, w_{k}^{2}, π) = α \sum_{t = k}^{T} ξ_{t}^{1} [E (w_{t}^{1}) - η_{t}^{1} \mathrm{Var} (w_{t}^{1})] + (1 - α) \sum_{t = k}^{T} ξ_{t}^{2} [E (w_{t}^{2}) - η_{t}^{2} \mathrm{Var} (w_{t}^{2})],

(4)

where

k = 0, 1, . . ., T - 1

. Then, the time-consistent solution of Equation (3) can be defined similarly as follows.

Definition 1.

Consider a fixed control policy

\hat{π} = ({\hat{π}}_{0}, {\hat{π}}_{1}, . . ., {\hat{π}}_{T - 1})

. For

k = 0, 1, . . ., T - 1

, we define that:

π (k) = (π_{k}, {\hat{π}}_{k + 1}, . . ., {\hat{π}}_{T - 1}),

\hat{π} (k) = ({\hat{π}}_{k}, {\hat{π}}_{k + 1}, . . ., {\hat{π}}_{T - 1}),

where:

π_{k} = (u_{k}^{1}, u_{k}^{2}, q_{k})

is an arbitrary control variable. Then \hat{π} is said to be a time-consistent investment-reinsurance strategy if, for all

k = 0, 1, . . ., T - 1

, it satisfies:

max_{π_{k}} J_{k} (w_{k}^{1}, w_{k}^{2}, π (k)) = J_{k} (w_{k}^{1}, w_{k}^{2}, \hat{π} (k)) .

Additionally, if the time-consistent investment-reinsurance strategy

\hat{π}

exists, the corresponding value function can be defined as:

V_{k} (w_{k}^{1}, w_{k}^{2}) = J_{k} (w_{k}^{1}, w_{k}^{2}, \hat{π} (k)) .

According to the definition of the time-consistent strategy, we have the following proposition.

Proposition 1.

The value function

V_{k} (w_{k}^{1}, w_{k}^{2})

satisfies the following recursive relation.

V_{k} (w_{k}^{1}, w_{k}^{2}) = \max_{π_{k}} \left\{ \begin{aligned} & E_{k} [V_{k + 1} (w_{k + 1}^{1}, w_{k + 1}^{2})] - α \sum_{m = k + 2}^{T} ξ_{m}^{1} η_{m}^{1} \mathrm{Var}_{k} [f_{k + 1, m} (w_{k + 1}^{1})] \\ & - (1 - α) \sum_{m = k + 2}^{T} ξ_{m}^{2} η_{m}^{2} \mathrm{Var}_{k} [g_{k + 1, m} (w_{k + 1}^{2})] \\ & + α ξ_{k + 1}^{1} [E_{k} (w_{k + 1}^{1}) - η_{k + 1}^{1} \mathrm{Var}_{k} (w_{k + 1}^{1})] \\ & + (1 - α) ξ_{k + 1}^{2} [E_{k} (w_{k + 1}^{2}) - η_{k + 1}^{2} \mathrm{Var}_{k} (w_{k + 1}^{2})] \end{aligned} \right\}, \quad k = 0, 1, \ldots, T - 2,

(5)

as well as the boundary condition:

V_{T - 1} (w_{T - 1}^{1}, w_{T - 1}^{2}) = \max_{π_{T - 1}} \left\{ α [ξ_{T}^{1} E_{T - 1} (w_{T}^{1}) - ξ_{T}^{1} η_{T}^{1} \mathrm{Var}_{T - 1} (w_{T}^{1})] + (1 - α) [ξ_{T}^{2} E_{T - 1} (w_{T}^{2}) - ξ_{T}^{2} η_{T}^{2} \mathrm{Var}_{T - 1} (w_{T}^{2})] \right\},

(6)

where:

f_{k, τ} (w_{k}^{1}) = \begin{cases} E_{k} [f_{k + 1, τ} (w_{k + 1}^{1})], & \text{for } τ > k, \; τ, k = 0, 1, \ldots, T - 1, \\ w_{k}^{1}, & \text{for } τ = k, \; k = 0, 1, \ldots, T - 1, \end{cases}

(7)

g_{k, τ} (w_{k}^{2}) = \begin{cases} E_{k} [g_{k + 1, τ} (w_{k + 1}^{2})], & \text{for } τ > k, \; τ, k = 0, 1, \ldots, T - 1, \\ w_{k}^{2}, & \text{for } τ = k, \; k = 0, 1, \ldots, T - 1 . \end{cases}

(8)

Proof.

See Appendix A. □
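The separate variance terms in Equation (5) can be motivated by the law of total variance. The following is only a sketch of that step, not a reproduction of the argument in Appendix A. It assumes the continuation policy from period k + 1 onward is held fixed, takes any m ≥ k + 2, and uses the fact that the recursion in Equation (7) gives f_{k+1,m}(w_{k+1}^{1}) = E_{k+1}(w_{m}^{1}):

```latex
\begin{aligned}
\mathrm{Var}_{k}(w_{m}^{1})
  &= E_{k}\left[\mathrm{Var}_{k+1}(w_{m}^{1})\right]
   + \mathrm{Var}_{k}\left[E_{k+1}(w_{m}^{1})\right] \\
  &= E_{k}\left[\mathrm{Var}_{k+1}(w_{m}^{1})\right]
   + \mathrm{Var}_{k}\left[f_{k+1,m}(w_{k+1}^{1})\right].
\end{aligned}
```

The first term is already contained in E_{k}[V_{k+1}], since V_{k+1} carries the conditional variances from period k + 1. The second term is what appears explicitly in Equation (5). The same decomposition with g_{k+1,m} applies to the reinsurer's wealth.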

According to Proposition 1, we can derive the following theorem.

Theorem 1.

Suppose that

α \in (0, 1)

. Then, for the multi-period mean-variance investment-reinsurance optimization problem in Equation (3), the time-consistent investment-reinsurance strategies

{{\hat{π}}_{t} = ({\hat{u}}_{t}^{1}, {\hat{u}}_{t}^{2}, {\hat{q}}_{t}), t = 0, 1, . . ., T - 1}

can be expressed as follows.

{\hat{u}}_{t}^{1} = \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) μ_{t}^{1}}{2 (\sum_{m = t + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) σ_{t}^{1}},

(9)

{\hat{u}}_{t}^{2} = \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i}) μ_{t}^{2}}{2 (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) σ_{t}^{2}},

(10)

\begin{matrix} {\hat{q}}_{t} = arg max_{0 \leq q_{t} \leq 1} \{\begin{matrix} α (\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) [- δ_{t} (q_{t}) - q_{t} {\tilde{μ}}_{t}] \\ - α (\sum_{m = t + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{t} {(q_{t})}^{2} \\ + (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i}) [δ_{t} (q_{t}) - (1 - q_{t}) {\tilde{μ}}_{t}] \\ - (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{t} {(1 - q_{t})}^{2} \end{matrix}\} . \end{matrix}

(11)

Additionally, the value function

V_{t} (w_{t}^{1}, w_{t}^{2})

, and the functions

f_{t, τ} (w_{t}^{1})

and

g_{t, τ} (w_{t}^{2})

are given as follows.

V_{t} (w_{t}^{1}, w_{t}^{2}) = α (\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t}^{m - 1} s_{i}) w_{t}^{1} + (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t}^{m - 1} s_{i}) w_{t}^{2} + κ_{t},

(12)

f_{t, τ} (w_{t}^{1}) = \prod_{i = t}^{τ - 1} s_{i} w_{t}^{1} + γ_{t, τ}, τ \geq t, τ = 0, 1, . . ., T - 1,

(13)

g_{t, τ} (w_{t}^{2}) = \prod_{i = t}^{τ - 1} s_{i} w_{t}^{2} + ρ_{t, τ}, τ \geq t, τ = 0, 1, . . ., T - 1 .

(14)

For convenience, we define that

γ_{t, τ} = 0

and

ρ_{t, τ} = 0

for

t = τ

, then the parameters

κ_{t}

,

γ_{t, τ}

and

ρ_{t, τ}

satisfy the following equations:

\{\begin{matrix} κ_{t} = & \sum_{k = t}^{T} [\frac{α (\sum_{m = k + 1}^{T} ξ_{m}^{1} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{1})}^{2}}{4 (\sum_{m = k + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{1}} + \frac{(1 - α) (\sum_{m = k + 1}^{T} ξ_{m}^{2} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{2})}^{2}}{4 (\sum_{m = k + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{2}}] \\ + α \sum_{k = t}^{T} [(\sum_{m = k + 1}^{T} ξ_{m}^{1} \prod_{i = k + 1}^{m - 1} s_{i}) [c_{k} - δ_{k} ({\hat{q}}_{k}) - {\hat{q}}_{k} {\tilde{μ}}_{k}]] \\ - α \sum_{k = t}^{T} [(\sum_{m = k + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{k} {({\hat{q}}_{k})}^{2}] \\ + (1 - α) \sum_{k = t}^{T} [(\sum_{m = k + 1}^{T} ξ_{m}^{2} \prod_{i = k + 1}^{m - 1} s_{i}) [δ_{k} ({\hat{q}}_{k}) - (1 - {\hat{q}}_{k}) {\tilde{μ}}_{k}]] \\ - (1 - α) \sum_{k = t}^{T} [(\sum_{m = k + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{k} {(1 - {\hat{q}}_{k})}^{2}], \\ γ_{t, τ} = & \sum_{k = t}^{τ} [\frac{(\sum_{m = k + 1}^{T} ξ_{m}^{1} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{1})}^{2}}{2 (\sum_{m = k + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{1}}] + \sum_{k = t}^{τ} [\prod_{i = k + 1}^{τ - 1} s_{i} [c_{k} - δ_{k} ({\hat{q}}_{k}) - {\hat{q}}_{k} {\tilde{μ}}_{k}]], \\ ρ_{t, τ} = & \sum_{k = t}^{τ} [\frac{(\sum_{m = k + 1}^{T} ξ_{m}^{2} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{2})}^{2}}{2 (\sum_{m = k + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{2}}] + \sum_{k = t}^{τ} [\prod_{i = k + 1}^{τ - 1} s_{i} [δ_{k} ({\hat{q}}_{k}) - (1 - {\hat{q}}_{k}) {\tilde{μ}}_{k}]] . \end{matrix}

(15)

Proof.

See Appendix B. □
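The investment formulas (9) and (10) share the same structure: a ratio of two horizon-dependent sums, multiplied by the mean excess return and divided by twice its variance. The sketch below evaluates that structure for one party. The function names and the data layout (`s`, `mu`, `sigma` as lists indexed by 0, ..., T - 1; `xi` and `eta` as dicts keyed by 1, ..., T) are assumptions made for illustration. Note that `sigma` holds variances, following the definition of σ_t^i in Section 2.

```python
from math import prod


def horizon_sums(t, T, s, xi, eta):
    """Return the two bracketed sums that appear in Equations (9)-(11).

    A = sum_{m=t+1}^{T} xi_m * prod_{i=t+1}^{m-1} s_i
    B = sum_{m=t+1}^{T} xi_m * eta_m * prod_{i=t+1}^{m-1} s_i**2
    """
    a_sum = 0.0
    b_sum = 0.0
    for m in range(t + 1, T + 1):
        # An empty range gives prod(...) == 1, matching the paper's
        # convention for products with lower index above upper index.
        growth = prod(s[i] for i in range(t + 1, m))
        a_sum += xi[m] * growth
        b_sum += xi[m] * eta[m] * growth ** 2
    return a_sum, b_sum


def investment_amount(t, T, s, xi, eta, mu, sigma):
    """Amount invested in the risky asset at period t, per Equation (9) or (10)."""
    a_sum, b_sum = horizon_sums(t, T, s, xi, eta)
    if b_sum == 0.0:
        raise ValueError("no active restriction after period t; amount undefined")
    return a_sum * mu[t] / (2.0 * b_sum * sigma[t])
```

Passing the insurer's switches and risk aversions gives Equation (9); passing the reinsurer's gives Equation (10). The result does not depend on current wealth, on α, or on the reinsurance premium.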

Theorem 1 only shows the time-consistent strategies when the weight coefficient

α

satisfies the condition that

α \in (0, 1)

. For the case that

α = 1

or

α = 0

, that is, when Equation (3) considers only the benefit of the insurer or only that of the reinsurer, the corresponding time-consistent investment-reinsurance strategies can be derived similarly. The details are as follows.

Remark 1.

Suppose that

α = 1

. Then Equation (3) degenerates into the investment-reinsurance optimization problem that considers only the interest of the insurer. In this situation, the time-consistent strategies

{{\hat{π}}_{t} = ({\hat{u}}_{t}^{1}, {\hat{q}}_{t}), t = 0, . . ., T - 1}

can be obtained similarly as follows.

{\hat{u}}_{t}^{1} = \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) μ_{t}^{1}}{2 (\sum_{m = t + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) σ_{t}^{1}},

(16)

{\hat{q}}_{t} = \arg\max_{0 \leq q_{t} \leq 1} \left\{ (\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) [- {\tilde{μ}}_{t} q_{t} - δ_{t} (q_{t})] - (\sum_{m = t + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) {(q_{t})}^{2} {\tilde{σ}}_{t} \right\} .

(17)

In addition, the value function

V_{t} (w_{t}^{1}, w_{t}^{2})

and the function

f_{t, τ} (w_{t}^{1})

can be reduced as:

V_{t} (w_{t}^{1}, w_{t}^{2}) = (\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t}^{m - 1} s_{i}) w_{t}^{1} + κ_{t},

(18)

f_{t, τ} (w_{t}^{1}) = \prod_{i = t}^{τ - 1} s_{i} w_{t}^{1} + γ_{t, τ}, τ \geq t, τ = 0, 1, . . ., T - 1,

(19)

where:

κ_{t}

and

γ_{t, τ}

(note that

γ_{t, τ} = 0

for

t = τ

) satisfy the following equations:

\{\begin{matrix} κ_{t} = & \sum_{k = t}^{T - 1} [(\sum_{m = k + 1}^{T} ξ_{m}^{1} \prod_{i = k + 1}^{m - 1} s_{i}) {c_{k} - δ_{k} ({\hat{q}}_{k}) - {\tilde{μ}}_{k} {\hat{q}}_{k}}] \\ + \sum_{k = t}^{T - 1} [\frac{(\sum_{m = k + 1}^{T} ξ_{m}^{1} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{1})}^{2}}{4 (\sum_{m = k + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{1}}] - \sum_{k = t}^{T - 1} [(\sum_{m = k + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) {({\hat{q}}_{k})}^{2} {\tilde{σ}}_{k}], \\ γ_{t, τ} = & \sum_{k = t}^{τ - 1} [(\prod_{i = k + 1}^{τ - 1} s_{i}) {c_{k} - δ_{k} ({\hat{q}}_{k}) - {\tilde{μ}}_{k} {\hat{q}}_{k}}] + \sum_{k = t}^{τ - 1} [\frac{(\sum_{m = k + 1}^{T} ξ_{m}^{1} \prod_{i = k + 1}^{m - 1} s_{i}) (\prod_{i = k + 1}^{τ - 1} s_{i}) {(μ_{k}^{1})}^{2}}{2 (\sum_{m = k + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{1}}] . \end{matrix}

(20)

As shown in Remark 1, we can find that the function

V_{t} (w_{t}^{1}, w_{t}^{2})

and

f_{t, τ} (w_{t}^{1})

only depend on the insurer’s wealth at the time period t, i.e.,

w_{t}^{1}

. In this situation, decision-makers only consider the benefit of the insurer, which is also the traditional approach to the optimal investment-reinsurance problem. However, compared with the existing literature on optimal investment and reinsurance in the continuous-time setting (e.g., Schmidli [1], Zeng and Li [2], Zhu et al. [3], Deng et al. [6] and so on), the proposed strategies in Remark 1 also consider the intermediate performance of the insurer.

Remark 2.

Suppose that

α = 0

. Then Equation (3) degenerates into the investment-reinsurance optimization problem that considers only the interest of the reinsurer. In this case, the time-consistent strategies

{({\hat{u}}_{t}^{2}, {\hat{q}}_{t}), t = 0, . . ., T - 1}

can be similarly obtained as follows.

{\hat{u}}_{t}^{2} = \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i}) μ_{t}^{2}}{2 (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) σ_{t}^{2}},

(21)

\begin{matrix} {\hat{q}}_{t} = arg max_{0 \leq q_{t} \leq 1} \{\begin{matrix} (\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i}) [δ_{t} (q_{t}) - {\tilde{μ}}_{t} (1 - q_{t})] \\ - (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{t} {(1 - q_{t})}^{2} \end{matrix}\} . \end{matrix}

(22)

Meanwhile, the value function and the function

g_{t, τ} (w_{t}^{2})

can be reduced as follows.

V_{t} (w_{t}^{1}, w_{t}^{2}) = (\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t}^{m - 1} s_{i}) w_{t}^{2} + κ_{t},

(23)

g_{t, τ} (w_{t}^{2}) = \prod_{i = t}^{τ - 1} s_{i} w_{t}^{2} + ρ_{t, τ}, τ \geq t, τ = 0, 1, . . ., T - 1,

(24)

where:

κ_{t}

and

ρ_{t, τ}

(note that

ρ_{t, τ} = 0

for

t = τ

) satisfy the following equations:

\{\begin{matrix} κ_{t} = & \sum_{k = t}^{T - 1} [(\sum_{m = k + 1}^{T} ξ_{m}^{2} \prod_{i = k + 1}^{m - 1} s_{i}) {δ_{k} ({\hat{q}}_{k}) - {\tilde{μ}}_{k} (1 - {\hat{q}}_{k})}] + \sum_{k = t}^{T - 1} [\frac{(\sum_{m = k + 1}^{T} ξ_{m}^{2} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{2})}^{2}}{4 (\sum_{m = k + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{2}}] \\ - \sum_{k = t}^{T - 1} [(\sum_{m = k + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{k} {(1 - {\hat{q}}_{k})}^{2}], \\ ρ_{t, τ} = & \sum_{k = t}^{τ - 1} [(\prod_{i = k + 1}^{τ - 1} s_{i}) {δ_{k} ({\hat{q}}_{k}) - {\tilde{μ}}_{k} (1 - {\hat{q}}_{k})}] + \sum_{k = t}^{τ - 1} [\frac{(\sum_{m = k + 1}^{T} ξ_{m}^{2} \prod_{i = k + 1}^{m - 1} s_{i}) (\prod_{i = k + 1}^{τ - 1} s_{i}) {(μ_{k}^{2})}^{2}}{2 (\sum_{m = k + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{2}}] . \end{matrix}

(25)

Remark 2 shows the time-consistent strategies when Equation (3) only considers the interest of the reinsurer. In this situation, the value function

V_{t} (w_{t}^{1}, w_{t}^{2})

and the function

g_{t, τ} (w_{t}^{2})

only depend on the wealth value

w_{t}^{2}

(i.e., the reinsurer’s wealth at time period t).

Additionally, from Theorem 1 and Remarks 1 and 2, we can find that the reinsurance strategy and the investment strategies are independent of each other, that is, some changes in the reinsurance premium

δ_{t} (q_{t})

will not affect the form of the investment strategies shown in Theorem 1. In the following, we will discuss the time-consistent strategies under some classical premium principles (e.g., the expected value principle and the variance value principle). The detailed results are presented in Section 3.1 and Section 3.2, respectively.

3.1. Time-Consistent Investment-Reinsurance Strategies under the Expected Value Principle

In this section, we refer to Waters [27] and assume that the reinsurance premium

δ_{t} (q_{t})

is calculated according to the expected value principle, i.e.,

δ_{t} (q_{t}) = (1 + β_{t}) (1 - q_{t}) {\tilde{μ}}_{t}

, where

β_{t}

(

β_{t} > 0

) is the safety loading of the reinsurer. For convenience, we define the following notation:

b_{t} = \frac{[α (\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) - (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i})] β_{t} {\tilde{μ}}_{t} + 2 (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{t}}{2 [α (\sum_{m = t + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) + (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2})] {\tilde{σ}}_{t}}

.

Based on the conclusions shown in Theorem 1 and Remarks 1 and 2, we can derive the time-consistent strategies under some special settings. For details see Corollary 1 and Remarks 3–5.

Corollary 1.

Suppose that

α \in (0, 1)

and the reinsurance premium

δ_{t} (q_{t})

is calculated according to the above expected value principle. In this situation, the time-consistent investment strategies for Equation (3) coincide with those in Theorem 1, while the corresponding time-consistent reinsurance strategy (i.e.,

{\hat{q}}_{t}, t = 0, 1, . . ., T - 1

) can be reduced as follows.

\begin{matrix} {\hat{q}}_{t} = \{\begin{matrix} 0, & \text{if } b_{t} < 0, \\ b_{t}, & \text{if } 0 \leq b_{t} \leq 1, \\ 1, & \text{if } b_{t} > 1 . \end{matrix} \end{matrix}

(26)

As shown in Corollary 1, the time-consistent reinsurance strategy

{\hat{q}}_{t}

depends on

b_{t}

, since the value of

b_{t}

determines whether

{\hat{q}}_{t}

takes an interior point or a boundary point. When

b_{t}

is negative, the time-consistent reinsurance strategy

{\hat{q}}_{t}

is equal to 0, that is, the reinsurer bears all the risk of claims; when

b_{t} > 1

, the time-consistent reinsurance strategy is

{\hat{q}}_{t} = 1

, and the insurer retains all the risk of claims; otherwise,

{\hat{q}}_{t}

takes the interior point

b_{t}

. The intertemporal restrictions affect the form of

b_{t}

, so the time-consistent reinsurance strategy

{\hat{q}}_{t}

is also influenced by the intertemporal restrictions.
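The rule in Corollary 1 can be sketched numerically. The Python fragment below is only an illustration under stated assumptions: the function names (`weighted_sums`, `b_expected`, `retention_expected`) and the list layout (`xi[m]` and `eta[m]` indexed by m = 1, ..., T with index 0 unused, `s[i]` indexed by i = 0, ..., T - 1) are hypothetical and not part of the paper, and the safety loading and claim moments are passed as time-invariant scalars for simplicity.

```python
def weighted_sums(t, T, xi, eta, s):
    """Return (A_t, B_t) with
    A_t = sum_{m=t+1}^{T} xi[m] * prod_{i=t+1}^{m-1} s[i],
    B_t = sum_{m=t+1}^{T} xi[m] * eta[m] * prod_{i=t+1}^{m-1} s[i]**2.
    """
    A = B = 0.0
    prod = prod_sq = 1.0  # empty product for m = t + 1
    for m in range(t + 1, T + 1):
        A += xi[m] * prod
        B += xi[m] * eta[m] * prod_sq
        if m < T:  # extend the products to i = m for the next term
            prod *= s[m]
            prod_sq *= s[m] ** 2
    return A, B


def b_expected(t, T, alpha, xi1, eta1, xi2, eta2, s, beta, mu_c, sig_c):
    """b_t as defined before Corollary 1 (expected value principle).
    mu_c and sig_c stand for the claim mean and variance (assumed constant)."""
    A1, B1 = weighted_sums(t, T, xi1, eta1, s)
    A2, B2 = weighted_sums(t, T, xi2, eta2, s)
    num = (alpha * A1 - (1 - alpha) * A2) * beta * mu_c + 2 * (1 - alpha) * B2 * sig_c
    den = 2 * (alpha * B1 + (1 - alpha) * B2) * sig_c
    return num / den


def retention_expected(b):
    """Equation (26): clamp b_t to the admissible interval [0, 1]."""
    return min(1.0, max(0.0, b))
```

A call such as `retention_expected(b_expected(t, T, alpha, xi1, eta1, xi2, eta2, s, beta, mu_c, sig_c))` then returns the retention level for period t under these assumptions.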

Remark 3.

Suppose that

α \in (0, 1)

,

ξ_{t}^{1} = ξ_{t}^{2} = 0

for

t = 1, 2, . . ., T - 1

,

ξ_{T}^{1} = ξ_{T}^{2} = 1

and the reinsurance premium

δ_{t} (q_{t})

is calculated according to the above expected value principle. Therefore, the time-consistent investment-reinsurance strategies for Equation (3), i.e.,

{\hat{π}}_{t} = {({\hat{u}}_{t}^{1}, {\hat{u}}_{t}^{2}, {\hat{q}}_{t}), t = 0, 1, . . ., T - 1}

, can be reduced as:

\{\begin{matrix} {\hat{u}}_{t}^{1} = \frac{μ_{t}^{1}}{2 η_{T}^{1} (\prod_{i = t + 1}^{T - 1} s_{i}) σ_{t}^{1}}, \\ {\hat{u}}_{t}^{2} = \frac{μ_{t}^{2}}{2 η_{T}^{2} (\prod_{i = t + 1}^{T - 1} s_{i}) σ_{t}^{2}}, \end{matrix}

(27)

\begin{matrix} {\hat{q}}_{t} = \{\begin{matrix} 0, & \text{if } b_{t} < 0, \\ b_{t}, & \text{if } 0 \leq b_{t} \leq 1, \\ 1, & \text{if } b_{t} > 1 . \end{matrix} \end{matrix}

(28)

In this case,

b_{t}

can be reduced as

b_{t} = \frac{(1 - α) η_{T}^{2}}{α η_{T}^{1} + (1 - α) η_{T}^{2}} + \frac{(2 α - 1) β_{t} {\tilde{μ}}_{t}}{2 [α η_{T}^{1} + (1 - α) η_{T}^{2}] \prod_{i = t + 1}^{T - 1} s_{i} {\tilde{σ}}_{t}}

.

Remark 3 covers the case in which the insurer and the reinsurer consider only the performance of terminal wealth and the reinsurance premium is calculated according to the expected value principle. Comparing Corollary 1 and Remark 3, we find that the latter involves only the terminal risk aversion coefficients

η_{T}^{1}

and

η_{T}^{2}

, while the former is dependent on all the risk aversion coefficients at the different time periods (i.e.,

η_{t}^{1}

and

η_{t}^{2}

for

t = 1, 2, . . ., T

).
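The reduced form of b_t in Remark 3 follows from the general definition of b_t by direct substitution. The short derivation below spells out this step; the shorthand P_t is introduced here only for readability and is not notation from the paper.

```latex
% Shorthand (hypothetical, for this derivation only):
%   P_t := \prod_{i=t+1}^{T-1} s_i .
% With \xi_m^{1} = \xi_m^{2} = 0 for m < T and \xi_T^{1} = \xi_T^{2} = 1,
% only the term m = T survives in each sum (j = 1, 2):
\sum_{m=t+1}^{T} \xi_m^{j} \prod_{i=t+1}^{m-1} s_i = P_t ,
\qquad
\sum_{m=t+1}^{T} \xi_m^{j} \eta_m^{j} \prod_{i=t+1}^{m-1} (s_i)^2 = \eta_T^{j} P_t^2 .
% Substituting into the definition of b_t:
b_t
= \frac{(2\alpha - 1) P_t \beta_t \tilde{\mu}_t + 2 (1 - \alpha) \eta_T^{2} P_t^2 \tilde{\sigma}_t}
       {2 [\alpha \eta_T^{1} + (1 - \alpha) \eta_T^{2}] P_t^2 \tilde{\sigma}_t}
= \frac{(1 - \alpha) \eta_T^{2}}{\alpha \eta_T^{1} + (1 - \alpha) \eta_T^{2}}
  + \frac{(2\alpha - 1) \beta_t \tilde{\mu}_t}
         {2 [\alpha \eta_T^{1} + (1 - \alpha) \eta_T^{2}] P_t \tilde{\sigma}_t} .
```

Here the superscripts 1 and 2 on η denote the insurer and the reinsurer, not powers.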

Remark 4.

Suppose that

α = 1

and the reinsurance premium

δ_{t} (q_{t})

is calculated according to the above expected value principle. In this case, the time-consistent investment strategy for Equation (3) is the same as that in Remark 1. However, the time-consistent reinsurance strategy (i.e.,

{\hat{q}}_{t}, t = 0, . . ., T - 1

) can be reduced as follows.

\begin{matrix} {\hat{q}}_{t} = \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) β_{t} {\tilde{μ}}_{t}}{2 (\sum_{m = t + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{t}} \land 1 . \end{matrix}

(29)

Remark 4 shows the time-consistent investment-reinsurance strategies for the insurer under the expected value principle. In this case, the insurer’s decision only depends on its own performance, while the performance of the reinsurer is ignored here. However, the intertemporal restrictions still restrict the formulation of the time-consistent strategy.

Remark 5.

Suppose that

α = 0

and the reinsurance premium

δ_{t} (q_{t})

is calculated according to the above expected value principle. In this situation, the time-consistent investment strategy for Equation (3) is consistent with that in Remark 2. In addition, the time-consistent reinsurance strategy (i.e.,

{\hat{q}}_{t}, t = 0, . . ., T - 1

) can be expressed as follows.

\begin{matrix} {\hat{q}}_{t} = 0 \lor (1 - \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i}) β_{t} {\tilde{μ}}_{t}}{2 (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{t}}) \land 1 . \end{matrix}

(30)

Remark 5 shows the time-consistent investment-reinsurance strategies for the reinsurer under the expected value principle. Compared with the results shown in Remark 4, the proposed strategies in Remark 5 are derived from the other extreme, that is, the decision-maker only considers the performance of the reinsurer.
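The two extreme cases in Remarks 4 and 5 can be written as two small functions. This is a minimal sketch, assuming that the aggregated weights have already been computed: the arguments `A` and `B` are hypothetical shorthand for the sums Σ ξ_m Π s_i and Σ ξ_m η_m Π (s_i)^2 of the party in question, and `mu_c`, `sig_c` denote the claim mean and variance.

```python
def retention_insurer_only(A1, B1, beta, mu_c, sig_c):
    """Equation (29), alpha = 1: interior value capped at 1."""
    return min(A1 * beta * mu_c / (2.0 * B1 * sig_c), 1.0)


def retention_reinsurer_only(A2, B2, beta, mu_c, sig_c):
    """Equation (30), alpha = 0: one minus the loading term, floored at 0."""
    return min(max(0.0, 1.0 - A2 * beta * mu_c / (2.0 * B2 * sig_c)), 1.0)
```

Both functions agree with the general b_t of Corollary 1 evaluated at α = 1 and α = 0, respectively, which is one way to sanity-check an implementation.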

3.2. Time-Consistent Investment and Reinsurance Strategies under the Variance Value Principle

In this section, we assume that the reinsurance premium

δ_{t} (q_{t})

is calculated according to the variance value principle (see Waters [27]), that is,

δ_{t} (q_{t}) = (1 - q_{t}) {\tilde{μ}}_{t} + β_{t} {(1 - q_{t})}^{2} {\tilde{σ}}_{t}, t = 0, 1, . . ., T - 1

. Similarly, we can derive the time-consistent strategies under some special settings. For details see Corollary 2 and Remarks 6–8.

Corollary 2.

Suppose that

α \in (0, 1)

and the reinsurance premium

δ_{t} (q_{t})

is calculated according to the above variance value principle. For Equation (3), the time-consistent investment strategies also coincide with those in Theorem 1. In addition, the corresponding time-consistent reinsurance strategy (

{\hat{q}}_{t}, t = 0, 1, . . ., T - 1

) can be expressed as:

\begin{matrix} {\hat{q}}_{t} = \{\begin{matrix} 0, & \text{if } {\hat{m}}_{t} \neq 0 \text{ and } {\hat{b}}_{t} < 0, \\ {\hat{b}}_{t}, & \text{if } {\hat{m}}_{t} \neq 0 \text{ and } 0 \leq {\hat{b}}_{t} \leq 1, \\ 1, & \text{if } {\hat{m}}_{t} \neq 0 \text{ and } {\hat{b}}_{t} > 1, \\ \forall q_{t} \in [0, 1], & \text{if } {\hat{m}}_{t} = 0, \end{matrix} \end{matrix}

(31)

where:

\{\begin{matrix} {\hat{b}}_{t} = & \frac{[α (\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) - (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i})] β_{t} + (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2})}{[α (\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) - (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i})] β_{t} + [α (\sum_{m = t + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) + (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2})]}, \\ {\hat{m}}_{t} = & [α (\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) - (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i})] β_{t} \\ + [α (\sum_{m = t + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) + (1 - α) (\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2})] . \end{matrix}

Obviously, the intertemporal restrictions also restrict the formulation of the proposed strategies in Corollary 2. Comparing Corollary 1 and Corollary 2, we find that the reinsurance strategy in Corollary 1 is affected by the expectation and variance of the claim

z_{t}

(i.e.,

{\tilde{μ}}_{t}

and

{\tilde{σ}}_{t}

), while that in Corollary 2 is independent of

{\tilde{μ}}_{t}

and

{\tilde{σ}}_{t}

. That is, under the variance value principle, the retention level

{\hat{q}}_{t}

does not change with the size of

{\tilde{μ}}_{t}

and

{\tilde{σ}}_{t}

. However, as the parameters

{\tilde{μ}}_{t}

and

{\tilde{σ}}_{t}

increase, the insurer pays a higher premium to the reinsurer, since the reinsurance premium

δ_{t} (q_{t})

is an increasing function of both

{\tilde{μ}}_{t}

and

{\tilde{σ}}_{t}

.
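The case distinction of Corollary 2 can be illustrated as follows. This is a sketch under stated assumptions rather than the authors' implementation: `A1`, `B1`, `A2`, `B2` are hypothetical shorthand for the aggregated sums Σ ξ_m Π s_i and Σ ξ_m η_m Π (s_i)^2 of the insurer and the reinsurer, the degenerate case of an arbitrary retention level is signalled by `None`, and the tolerance `tol` is an assumption introduced to avoid an exact floating-point comparison with zero. The claim moments do not appear among the arguments, in line with the observation above.

```python
def retention_variance(alpha, A1, B1, A2, B2, beta, tol=1e-12):
    """Equation (31): retention level under the variance value principle."""
    c = (alpha * A1 - (1.0 - alpha) * A2) * beta
    m_hat = c + alpha * B1 + (1.0 - alpha) * B2
    if abs(m_hat) <= tol:
        return None  # any q in [0, 1] is admissible when m_hat = 0
    b_hat = (c + (1.0 - alpha) * B2) / m_hat
    return min(1.0, max(0.0, b_hat))
```

The function follows the statement of Corollary 2 literally, so it applies the same clamping rule for every nonzero value of m̂_t.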

Remark 6.

Suppose that

α \in (0, 1)

,

ξ_{t}^{1} = ξ_{t}^{2} = 0

for

t = 1, 2, . . ., T - 1

,

ξ_{T}^{1} = ξ_{T}^{2} = 1

, and the reinsurance premium

δ_{t} (q_{t})

is calculated according to the above variance value principle. The time-consistent investment-reinsurance strategies for Equation (3) (i.e.,

{\hat{π}}_{t} = {({\hat{u}}_{t}^{1}, {\hat{u}}_{t}^{2}, {\hat{q}}_{t}), t = 0, 1, . . ., T - 1}

) can be simplified as:

\{\begin{matrix} {\hat{u}}_{t}^{1} = \frac{μ_{t}^{1}}{2 η_{T}^{1} (\prod_{i = t + 1}^{T - 1} s_{i}) σ_{t}^{1}}, \\ {\hat{u}}_{t}^{2} = \frac{μ_{t}^{2}}{2 η_{T}^{2} (\prod_{i = t + 1}^{T - 1} s_{i}) σ_{t}^{2}}, \end{matrix}

(32)

\begin{matrix} {\hat{q}}_{t} = \{\begin{matrix} 0, & \text{if } {\hat{m}}_{t} \neq 0, {\hat{b}}_{t} < 0, \\ {\hat{b}}_{t}, & \text{if } {\hat{m}}_{t} \neq 0, 0 \leq {\hat{b}}_{t} \leq 1, \\ 1, & \text{if } {\hat{m}}_{t} \neq 0, {\hat{b}}_{t} > 1, \\ \forall q_{t} \in [0, 1], & \text{if } {\hat{m}}_{t} = 0, \end{matrix} \end{matrix}

(33)

where:

\{\begin{matrix} {\hat{b}}_{t} = \frac{(2 α - 1) β_{t} + (1 - α) η_{T}^{2} (\prod_{i = t + 1}^{T - 1} s_{i})}{(2 α - 1) β_{t} + [α η_{T}^{1} + (1 - α) η_{T}^{2}] (\prod_{i = t + 1}^{T - 1} s_{i})}, \\ {\hat{m}}_{t} = (2 α - 1) β_{t} + [α η_{T}^{1} + (1 - α) η_{T}^{2}] (\prod_{i = t + 1}^{T - 1} s_{i}) . \end{matrix}

Remark 6 shows the time-consistent strategies for the insurer and the reinsurer who only consider the performance of their terminal wealth, where the reinsurance premium is calculated according to the variance value principle. Comparing Corollary 2 and Remark 6, we find that the reinsurance strategy in Remark 6 involves only the terminal risk aversion coefficients

η_{T}^{1}

and

η_{T}^{2}

rather than those of all periods and, like that in Corollary 2, does not depend on the expectation and variance of a claim.

Remark 7.

Suppose that

α = 1

and the reinsurance premium

δ_{t} (q_{t})

is calculated according to the variance value principle. In this situation, the time-consistent investment strategies for Equation (3) are also coincident with those in Remark 1, while the time-consistent reinsurance strategy (i.e.,

{\hat{q}}_{t}, t = 0, . . ., T - 1

) can be simplified as follows.

\begin{matrix} {\hat{q}}_{t} = \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) β_{t}}{(\sum_{m = t + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2}) + (\sum_{m = t + 1}^{T} ξ_{m}^{1} \prod_{i = t + 1}^{m - 1} s_{i}) β_{t}} . \end{matrix}

(34)

Remark 7 shows the time-consistent strategies when the decision-makers only consider the performance of the insurer rather than that of the reinsurer. Comparing the time-consistent reinsurance strategies shown in Corollary 2 and Remark 7, we find that the former might be a boundary point, while the latter has to be an interior value.

Remark 8.

Suppose that

α = 0

and the reinsurance premium

δ_{t} (q_{t})

is calculated according to the above variance value principle. The time-consistent investment strategy is consistent with that in Remark 2. However, the corresponding time-consistent reinsurance strategy (i.e.,

{\hat{q}}_{t}, t = 0, . . ., T - 1

) is reduced as follows.

\begin{matrix} {\hat{q}}_{t} = \{\begin{matrix} 0, & \text{if } β_{t} > \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2})}{(\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i})}, \\ 1, & \text{if } β_{t} < \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2})}{(\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i})}, \\ \forall q_{t} \in [0, 1], & \text{if } β_{t} = \frac{(\sum_{m = t + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = t + 1}^{m - 1} {(s_{i})}^{2})}{(\sum_{m = t + 1}^{T} ξ_{m}^{2} \prod_{i = t + 1}^{m - 1} s_{i})} . \end{matrix} \end{matrix}

(35)

Remark 8 shows the time-consistent strategies when the decision-makers only consider the performance of the reinsurer, where the reinsurance premium is calculated according to the variance value principle. In this case, the safety loading coefficient

β_{t}

can be treated as the risk compensation coefficient of the reinsurer. That is, the larger the

β_{t}

, the more the reinsurer can obtain risk compensations from the insurer. Remark 8 indicates that when

β_{t}

is large enough, the reinsurer is willing to assume all the risk of claims; otherwise, the reinsurer is reluctant to assume any claim risks, since the insurer cannot offer a suitable reinsurance premium as the risk compensation.
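The all-or-nothing form of Equation (35) can be traced back to the objective in Equation (22). The following short derivation makes this step explicit; the shorthand \bar{A}_t and \bar{B}_t is introduced only for this illustration and is not notation from the paper, and \bar{A}_t > 0 and \tilde{\sigma}_t > 0 are assumed.

```latex
% Shorthand (hypothetical, for this derivation only):
%   \bar{A}_t := \sum_{m=t+1}^{T} \xi_m^{2} \prod_{i=t+1}^{m-1} s_i ,
%   \bar{B}_t := \sum_{m=t+1}^{T} \xi_m^{2} \eta_m^{2} \prod_{i=t+1}^{m-1} (s_i)^2 .
% Variance value principle:
%   \delta_t(q_t) = (1 - q_t)\tilde{\mu}_t + \beta_t (1 - q_t)^2 \tilde{\sigma}_t .
% Substituting into the objective of Equation (22):
\bar{A}_t \left[ \delta_t(q_t) - \tilde{\mu}_t (1 - q_t) \right]
  - \bar{B}_t \tilde{\sigma}_t (1 - q_t)^2
= \left( \bar{A}_t \beta_t - \bar{B}_t \right) \tilde{\sigma}_t (1 - q_t)^2 .
% If \beta_t > \bar{B}_t / \bar{A}_t the coefficient is positive and the
% maximum over [0, 1] is attained at q_t = 0; if \beta_t < \bar{B}_t / \bar{A}_t
% it is negative and the maximum is attained at q_t = 1; if equality holds,
% the objective is identically zero and every q_t in [0, 1] is optimal.
```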

4. Numerical Analysis

In this section, we will provide some numerical simulations to show the results presented in Section 3. We assume that the initial wealth of the insurer and the reinsurer are

w_{0}^{1} = 1

and

w_{0}^{2} = 1

, respectively. Using the data provided by Li and Ng [21], we let

μ_{t}^{1} = 1.162

,

μ_{t}^{2} = 1.228

,

σ_{t}^{1} = 0.0146

,

σ_{t}^{2} = 0.0289

,

θ_{t} = 0.0145

and

s_{t} = 1.04

,

t = 0, 1, . . ., T - 1

. Additionally, we suppose that the safety loading coefficient of the reinsurer satisfies

β_{t} = 0.8

, and the claim

z_{t}

follows an exponential distribution with the rate parameter

λ_{t} = 10

,

t = 0, 1, . . ., T - 1

. In the following, we will show the evolution process of the time-consistent strategies under different settings and investigate how the intertemporal restrictions affect the time-consistent investment and reinsurance strategies. In addition, we assume that

ξ_{t}^{1}

and

ξ_{t}^{2}

take one of the following two settings.

Case I. Suppose that $ξ_{t}^{1} = ξ_{t}^{2} = 1$ in Equation (3), $t = 1, 2, . . ., T$. In this setting, the insurer’s and the reinsurer’s decisions will take all the intertemporal restrictions into account.
Case II. Suppose that $ξ_{t}^{1} = ξ_{t}^{2} = 0$ for $t = 1, 2, . . ., T - 1$ and $ξ_{T}^{1} = ξ_{T}^{2} = 1$ in Equation (3). In this case, the decision-makers only consider the performance of their terminal wealth.
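For readers who wish to reproduce the setting, the two cases and the claim moments can be encoded as below. This is a minimal sketch: the helper names `claim_moments` and `case_weights` are hypothetical, the list layout (index 0 unused, entries m = 1, ..., T) is an assumption, and the moments are simply the standard mean 1/λ and variance 1/λ² of an exponential distribution with rate λ. Whether the figures in the paper use exactly these values is not stated.

```python
def claim_moments(rate):
    """Mean and variance of an exponentially distributed claim with the given rate."""
    return 1.0 / rate, 1.0 / rate ** 2


def case_weights(T, case):
    """Weights xi[m], m = 1..T (index 0 unused), for Case I or Case II."""
    if case == "I":
        return [0.0] + [1.0] * T   # every period counts
    if case == "II":
        return [0.0] * T + [1.0]   # only terminal wealth counts
    raise ValueError("case must be 'I' or 'II'")


mu_c, sig_c = claim_moments(10.0)  # rate parameter taken from the text
```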

Based on the above parameter settings, we will discuss the impact of the intertemporal restrictions on the time-consistent investment and reinsurance strategies. Since the reinsurance premium

δ_{t} (q_{t})

and the weight coefficient

α

have nothing to do with the investment strategies of the insurer and the reinsurer, we do not have to repeat the evolution process of time-consistent investment strategies when the

δ_{t} (q_{t})

and

α

take different settings. However, the impacts of the

δ_{t} (q_{t})

and the weight

α

on the formulation of the time-consistent reinsurance strategy cannot be ignored. On this basis, we obtain the following simulations.

4.1. Simulations of the Insurer’s and the Reinsurer’s Time-Consistent Investment Strategies

In this section, we will discuss how the intertemporal restrictions affect the time-consistent investment strategies. Motivated by Xiao et al. [20], we assume that the risk aversion coefficients of the insurer and the reinsurer satisfy the following two exponential functions:

η_{t}^{1} = η_{T}^{1} \times ϕ_{1}^{T - t}

and

η_{t}^{2} = η_{T}^{2} \times ϕ_{2}^{T - t}

, where

ϕ_{1}

and

ϕ_{2}

are both greater than the return of the risk-free asset,

t = 1, 2, . . ., T

. Additionally, since the insurer is more risk-averse compared to the reinsurer, we also assume that

η_{T}^{1} > η_{T}^{2}

and

ϕ_{1} > ϕ_{2}

. Under this assumption, we let

η_{T}^{1} = 2

,

η_{T}^{2} = 1

,

ϕ_{1} = 1.08

and

ϕ_{2} = 1.06

. Therefore, we can derive the corresponding simulation paths of the time-consistent investment strategies shown in Section 3.1 and Section 3.2. For details see Figure 1.
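The investment paths for the two cases can be sketched with a few lines of Python. The fragment assumes that the insurer's strategy has the same form as Equation (21) with superscript 1 in place of 2 (as suggested by Equation (27)); the function names, the list layout (index 0 unused for `xi` and `eta`) and the use of a constant risk-free return are assumptions made for the illustration, and the parameter values are those quoted in the text.

```python
def weighted_sums(t, T, xi, eta, s):
    """A_t = sum xi[m] * prod s[i], B_t = sum xi[m] * eta[m] * prod s[i]**2,
    with m = t+1..T and i = t+1..m-1."""
    A = B = 0.0
    prod = prod_sq = 1.0
    for m in range(t + 1, T + 1):
        A += xi[m] * prod
        B += xi[m] * eta[m] * prod_sq
        if m < T:
            prod *= s[m]
            prod_sq *= s[m] ** 2
    return A, B


def investment(t, T, xi, eta, s, mu, sigma):
    """Amount invested in the risky asset, in the form of Equation (21)."""
    A, B = weighted_sums(t, T, xi, eta, s)
    return A * mu / (2.0 * B * sigma)


def simulate(T, eta_T, phi, mu, sigma, s_val=1.04):
    """Return the Case I and Case II paths for eta_t = eta_T * phi**(T - t)."""
    s = [s_val] * T
    eta = [0.0] + [eta_T * phi ** (T - m) for m in range(1, T + 1)]
    xi_case_I = [0.0] + [1.0] * T
    xi_case_II = [0.0] * T + [1.0]
    path_I = [investment(t, T, xi_case_I, eta, s, mu, sigma) for t in range(T)]
    path_II = [investment(t, T, xi_case_II, eta, s, mu, sigma) for t in range(T)]
    return path_I, path_II


insurer_I, insurer_II = simulate(100, 2.0, 1.08, 1.162, 0.0146)
reinsurer_I, reinsurer_II = simulate(100, 1.0, 1.06, 1.228, 0.0289)
```

Under these assumptions the Case I path never exceeds the Case II path, because each growth factor ϕ is larger than the risk-free return, and the two paths meet in the last period t = T - 1.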

Figure 1 shows the evolution paths of the insurer’s and the reinsurer’s time-consistent investment strategies when the investment horizon is

T = 100

. As shown in Figure 1, compared with the time-consistent investment strategies that ignore the intertemporal restrictions, the insurer and the reinsurer both reduce the position invested in the risky asset when all the intertemporal restrictions are considered in the decision-making. This indicates that the insurer and the reinsurer increase the amount invested in the risk-free asset (i.e.,

w_{t}^{1} - {\hat{u}}_{t}^{1}

or

w_{t}^{2} - {\hat{u}}_{t}^{2}

), so as to reduce the risk in the earlier periods. In addition, the above time-consistent strategies are all increasing functions of the time period t, whether or not the intertemporal restrictions are considered. That is, the closer the insurer and the reinsurer get to the end of their investment horizon, the more wealth they invest in the risky asset. The reason is that, as the insurer’s and the reinsurer’s wealth accumulates, they become better able to bear the investment risk.

In Figure 1, we assume that the risk aversion coefficients of the insurer and the reinsurer follow the exponential functions mentioned above. In the following, we check whether the above conclusions still hold when the risk aversion coefficients take other forms. Inspired by Wang and Chen [28], we assume the risk aversion coefficients are described by the following linear functions:

η_{t}^{1} = η_{T}^{1} + k_{1} \times (T - t)

and

η_{t}^{2} = η_{T}^{2} + k_{2} \times (T - t)

,

t = 1, 2, . . ., T

, where parameters

k_{1}

and

k_{2}

are both greater than 0. Similar to Figure 1, we also assume that

η_{T}^{1} > η_{T}^{2}

and

k_{1} > k_{2}

, since the insurer is more risk-averse than the reinsurer.

Let

T = 100

,

η_{T}^{1} = 2

,

η_{T}^{2} = 1

,

k_{1} = 0.5

and

k_{2} = 0.2

, and then we can derive the corresponding simulations presented in Figure 2. As shown in Figure 2, the intertemporal restrictions also restrain the time-consistent investment strategies; meanwhile, these investment strategies are all increasing functions of the time period t. These conclusions coincide with those drawn from Figure 1.

4.2. Simulations of the Time-Consistent Reinsurance Strategy

In the following, we will discuss the evolution of the time-consistent reinsurance strategy under different settings. More importantly, we will investigate the impacts of the reinsurance premium

δ_{t} (q_{t})

and the weight coefficient

α

on the time-consistent reinsurance strategy. In this section, we assume that the insurer and the reinsurer will adopt the classical expected and variance value principles to make the time-consistent reinsurance strategy. Using the parameters shown in Figure 1, we can derive the corresponding simulation path of the time-consistent reinsurance strategy under the expected value principle and the variance value principle, respectively. For detailed simulation results see Figure 3, Figure 4, Figure 5 and Figure 6.

Under the expected value principle, Figure 3 and Figure 4 show the evolution of the time-consistent reinsurance strategy with different weight coefficients

α

(i.e.,

α = 0.4

and

α = 0.6

). Under Case II, we find that the time-consistent reinsurance strategy is a decreasing function of the time period t when the weight coefficient takes

α = 0.4

, while for the situation that

α = 0.6

, the time-consistent reinsurance strategy increases with the time period t. Actually, the condition

α = 0.4

indicates that more weight is placed on the reinsurer (i.e., the reinsurer is the leader). In this case, the reinsurer wants to obtain more reinsurance business from the insurer. Therefore, the retention level of the claim

{\hat{q}}_{t}

will decrease with the time period t. On the other hand, the condition

α = 0.6

means that more weight is placed on the insurer (i.e., the insurer is the leader). In this situation, the insurer passes most of the claim risk to the reinsurer in the earlier periods, because the bankruptcy probability is higher then. However, as the insurer’s wealth accumulates, the insurer is willing to increase the retention level of the claim

{\hat{q}}_{t}

, that is,

{\hat{q}}_{t}

increases with the time period t. Note that this monotonicity is not necessarily true for the time-consistent reinsurance strategy with consideration of the intertemporal restrictions.
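The Case II monotonicity described above can be checked directly from the closed form of b_t in Remark 3. The sketch below is an illustration only: the function names are hypothetical, the risk-free return is taken as constant so that the product of s_i becomes a power, and the claim mean 0.1 and variance 0.01 are the moments of an exponential distribution with rate 10, which is an assumption about how the figures were produced.

```python
def clamp01(x):
    """Restrict a value to the admissible interval [0, 1]."""
    return min(1.0, max(0.0, x))


def b_terminal_expected(t, T, alpha, eta1_T, eta2_T, s, beta, mu_c, sig_c):
    """Closed form of b_t from Remark 3 (terminal wealth only)."""
    P = s ** (T - 1 - t)  # prod_{i=t+1}^{T-1} s_i for a constant return s
    D = alpha * eta1_T + (1.0 - alpha) * eta2_T
    return (1.0 - alpha) * eta2_T / D + (2.0 * alpha - 1.0) * beta * mu_c / (2.0 * D * P * sig_c)


def retention_path(alpha, T=100, eta1_T=2.0, eta2_T=1.0, s=1.04,
                   beta=0.8, mu_c=0.1, sig_c=0.01):
    """Retention levels q_t for t = 0..T-1 under Case II."""
    return [clamp01(b_terminal_expected(t, T, alpha, eta1_T, eta2_T, s, beta, mu_c, sig_c))
            for t in range(T)]


path_04 = retention_path(0.4)  # reinsurer is the leader
path_06 = retention_path(0.6)  # insurer is the leader
```

With these inputs the sign of (2α - 1) decides the direction: the second term of b_t grows in magnitude as t approaches T - 1, so the path falls for α = 0.4 and rises for α = 0.6.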

We further consider the impact of the intertemporal restrictions on the time-consistent reinsurance strategy (i.e., the time-consistent reinsurance strategy under Case I in Figure 3 and Figure 4). As shown in Figure 3 and Figure 4, the intertemporal restrictions have a significant impact on the reinsurance strategy. However, this impact differs markedly between the time-consistent investment strategies and the time-consistent reinsurance strategy. As mentioned in Section 4.1, the intertemporal restrictions make the insurer and the reinsurer reduce the positions invested in the risky asset. For the reinsurance strategy, however, the intertemporal restrictions do not necessarily cause the decision-makers to reduce the retention level

{\hat{q}}_{t}

(e.g., the time-consistent reinsurance strategy under Case I in Figure 3). The reason is that when

α = 0.4

(i.e., the reinsurer has the leading role) and the intertemporal restrictions are taken into consideration, the leading role urges the reinsurer to increase reinsurance proportion (

1 - {\hat{q}}_{t}

), but the intertemporal restrictions cause the reinsurer to reduce reinsurance business as much as possible (i.e., the reinsurer wants the insurer to increase the retention level

{\hat{q}}_{t}

). Therefore, only when the impact of the intertemporal restrictions exceeds that of the reinsurer’s leading role can the intertemporal restrictions induce the reinsurer to shrink the reinsurance proportion (

1 - {\hat{q}}_{t}

). However, when

α = 0.6

(i.e., the insurer has the leading role) and the intertemporal restrictions are taken into consideration, the target of the intertemporal restrictions and that of the insurer’s leading role are consistent, that is, reducing the retention level

{\hat{q}}_{t}

. Generally speaking, the role of the intertemporal restrictions on the time-consistent reinsurance strategy depends on who is the leader in Equation (3).

Similar to Figure 3 and Figure 4, we can derive the evolution path of the time-consistent reinsurance strategy under the variance value principle. For details see Figure 5 and Figure 6.

Figure 5 and Figure 6 show the corresponding evolution path of the time-consistent reinsurance strategy under the variance value principle. When the insurer and the reinsurer do not consider the impact of the intertemporal restrictions (i.e., Case II), we find that the time-consistent reinsurance strategy

{\hat{q}}_{t}

decreases with the time period t when the weight coefficient takes

α = 0.4

, while for the case that

α = 0.6

, the time-consistent reinsurance strategy

{\hat{q}}_{t}

is an increasing function of the time period t. This conclusion coincides with that drawn from Figure 3 and Figure 4. In addition, under the different weight coefficients

α

, it is not difficult to find that the role of the intertemporal restrictions is basically consistent with that in Figure 3 and Figure 4.
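The same check can be repeated for the variance value principle using the closed form in Remark 6. This is again a sketch under assumptions: the function names are hypothetical, the risk-free return is constant, `None` marks the degenerate case in which any retention level is admissible, and the tolerance `tol` is introduced here to avoid an exact comparison with zero.

```python
def retention_terminal_variance(t, T, alpha, eta1_T, eta2_T, s, beta, tol=1e-12):
    """Equation (33) with the closed forms of b_hat and m_hat from Remark 6."""
    P = s ** (T - 1 - t)  # prod_{i=t+1}^{T-1} s_i for a constant return s
    a = (2.0 * alpha - 1.0) * beta
    m_hat = a + (alpha * eta1_T + (1.0 - alpha) * eta2_T) * P
    if abs(m_hat) <= tol:
        return None  # any q in [0, 1] is admissible
    b_hat = (a + (1.0 - alpha) * eta2_T * P) / m_hat
    return min(1.0, max(0.0, b_hat))


def variance_path(alpha, T=100, eta1_T=2.0, eta2_T=1.0, s=1.04, beta=0.8):
    """Retention levels q_t for t = 0..T-1 under Case II."""
    return [retention_terminal_variance(t, T, alpha, eta1_T, eta2_T, s, beta)
            for t in range(T)]


var_path_04 = variance_path(0.4)  # reinsurer is the leader
var_path_06 = variance_path(0.6)  # insurer is the leader
```

With the parameter values quoted in the text, the path for α = 0.4 is non-increasing in t and the path for α = 0.6 is non-decreasing, in line with the description of Case II above.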

In Figure 3, Figure 4, Figure 5 and Figure 6, we assume that the risk aversion coefficients

η_{t}^{1}

and

η_{t}^{2}

follow the two given exponential functions, respectively. Using the parameters shown in Figure 2, that is, with the risk aversion coefficients

η_{t}^{1}

and

η_{t}^{2}

both described by the linear functions, we will further discuss the evolution path of the time-consistent reinsurance strategy under the expected value principle and the variance value principle, respectively. For details see Figure 7, Figure 8, Figure 9 and Figure 10.

As shown in Figure 7, Figure 8, Figure 9 and Figure 10, we find that the evolution trends of the above reinsurance strategy are basically consistent with those in Figure 3, Figure 4, Figure 5 and Figure 6. In other words, under the two risk aversion coefficient functions mentioned above, our conclusions are robust to some extent.

5. Conclusions

In this paper, we first propose a multi-period investment-reinsurance optimization problem with consideration of the joint interests of the insurer and the reinsurer under the generalized mean-variance framework. The proposed model is constructed by maximizing the weighted sum of the insurer’s and the reinsurer’s mean-variance objectives. We use a game method to derive the time-consistent investment-reinsurance strategies, and also obtain the exact expression of the time-consistent reinsurance strategy under two special premium principles. Finally, we provide some numerical simulations to present the evolution process of the above time-consistent strategies, so as to show the impact of the intertemporal restrictions on the time-consistent strategies. Some interesting findings are concluded as follows: (a) The intertemporal restrictions will urge the insurer and the reinsurer to shrink the positions invested in the risky asset. (b) The role of the intertemporal restrictions on the time-consistent reinsurance strategy depends on who is the leader in the proposed model. When the insurer is the leader, the intertemporal restrictions will reduce the retention level of a claim, while for the case that the reinsurer is the leader, only when the impact of the reinsurer’s leading role is higher than that of the intertemporal restrictions will the intertemporal restrictions reduce the retention level of a claim.

These interesting findings also provide some useful advice for insurers and reinsurers on the actual investment-reinsurance issue. In the framework of the proposed model, insurers and reinsurers can adjust the intertemporal restriction conditions when the securities market is in different states. For example, when the securities market is in a bull market, insurers and reinsurers can appropriately reduce the intertemporal restrictions to obtain a higher terminal return, while for the case that the securities market is in a bear market, they can increase the number of the intertemporal restrictions, so as to prevent the bankruptcies that occur in the investment-reinsurance process. In addition, the proposed reinsurance strategy takes into account the common interests of the insurer and the reinsurer as well as the impact of intertemporal restrictions on the reinsurance strategy. Similarly, the insurer and the reinsurer can adjust the reinsurance strategy dynamically according to their own risk appetites and the market environment, which also provides a new idea for the actual formulation of the reinsurance contract.

While the proposed model covers many classical ones, our study also has some limitations. First, we assume that the risk aversion coefficient does not depend on the decision-makers’ current wealth level; however, some researchers point out that the greater the wealth of decision-makers, the less risk-averse they are likely to be. The optimal investment-reinsurance problem with such state-dependent risk aversion is therefore one direction for future research. Second, this paper assumes that the returns of risky assets are statistically independent across time periods. However, some empirical studies show that the returns of risky assets exhibit a certain degree of dependence across time periods. Therefore, our work can be further investigated under the weaker assumption that the returns of risky assets are serially correlated.

Author Contributions

Supervision, Z.Z.; Writing—original draft, H.X.; Writing—review and editing, T.R. and Y.B.

Funding

This research is supported by the National Natural Science Foundation of China (Nos. 71771082 and 71801091) and Hunan Provincial Natural Science Foundation of China (No. 2017JJ1012).

Acknowledgments

The authors are grateful to the anonymous reviewers and the editor for the valuable comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. The Proof of Proposition 1

When

k = T - 1

, according to the definition of a time-consistent strategy, we have:

\begin{matrix} V_{T - 1} (w_{T - 1}^{1}, w_{T - 1}^{2}) & = & max_{π_{T - 1}} \{\begin{matrix} α [ξ_{T}^{1} E_{T - 1} (w_{T}^{1}) - ξ_{T}^{1} η_{T}^{1} \mathrm{Var}_{T - 1} (w_{T}^{1})] \\ + (1 - α) [ξ_{T}^{2} E_{T - 1} (w_{T}^{2}) - ξ_{T}^{2} η_{T}^{2} \mathrm{Var}_{T - 1} (w_{T}^{2})] \end{matrix}\} . \end{matrix}

(A1)

This indicates that Proposition 1 holds for

k = T - 1

. Then for

k = 0, 1, . . ., T - 2

, the function

J_{k} (w_{k}^{1}, w_{k}^{2}, π)

can be expressed as:

\begin{matrix} \begin{matrix} J_{k} (w_{k}^{1}, w_{k}^{2}, π) & = & α \sum_{t = k + 1}^{T} ξ_{t}^{1} [E_{k} (w_{t}^{1}) - η_{t}^{1} \mathrm{Var}_{k} (w_{t}^{1})] + (1 - α) \sum_{t = k + 1}^{T} ξ_{t}^{2} [E_{k} (w_{t}^{2}) - η_{t}^{2} \mathrm{Var}_{k} (w_{t}^{2})] \\ = & α \sum_{t = k + 2}^{T} ξ_{t}^{1} [E_{k} (w_{t}^{1}) - η_{t}^{1} \mathrm{Var}_{k} (w_{t}^{1})] + (1 - α) \sum_{t = k + 2}^{T} ξ_{t}^{2} [E_{k} (w_{t}^{2}) - η_{t}^{2} \mathrm{Var}_{k} (w_{t}^{2})] \\ + α ξ_{k + 1}^{1} [E_{k} (w_{k + 1}^{1}) - η_{k + 1}^{1} \mathrm{Var}_{k} (w_{k + 1}^{1})] \\ + (1 - α) ξ_{k + 1}^{2} [E_{k} (w_{k + 1}^{2}) - η_{k + 1}^{2} \mathrm{Var}_{k} (w_{k + 1}^{2})] . \end{matrix} \end{matrix}

(A2)

By using the law of iterated expectation and the law of total variance, we have:

\begin{matrix} E_{k} (w_{t}^{i}) = E_{k} [E_{k + 1} (w_{t}^{i})], i = 1, 2, \end{matrix}

(A3)

\begin{matrix} V a r_{k} (w_{t}^{i}) = E_{k} [V a r_{k + 1} (w_{t}^{i})] + V a r_{k} [E_{k + 1} (w_{t}^{i})], i = 1, 2 . \end{matrix}

(A4)
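The two identities (A3) and (A4) can be checked numerically on a small example. The sketch below is only an illustration: the joint distribution is hypothetical, with Y standing for the information revealed at time k + 1 and X for a later wealth level, and exact fractions are used so that both sides can be compared without rounding.

```python
from fractions import Fraction

# Hypothetical joint distribution of (Y, X); the numbers are illustrative only.
# Y plays the role of the information at time k+1, X the role of w_t.
joint = {
    (0, 1): Fraction(1, 4),
    (0, 3): Fraction(1, 4),
    (1, 2): Fraction(1, 6),
    (1, 8): Fraction(1, 3),
}

def mean(dist):
    """Mean of a distribution given as {value: probability}."""
    return sum(p * v for v, p in dist.items())

def var(dist):
    """Variance of a distribution given as {value: probability}."""
    m = mean(dist)
    return sum(p * (v - m) ** 2 for v, p in dist.items())

def marginal_x(joint):
    """Marginal distribution of X."""
    out = {}
    for (_, x), p in joint.items():
        out[x] = out.get(x, 0) + p
    return out

def conditional_x(joint, y):
    """Return P(Y = y) and the conditional distribution of X given Y = y."""
    p_y = sum(p for (yy, _), p in joint.items() if yy == y)
    return p_y, {x: p / p_y for (yy, x), p in joint.items() if yy == y}

ys = sorted({y for y, _ in joint})
cond = {y: conditional_x(joint, y) for y in ys}
p_y = {y: cond[y][0] for y in ys}
cond_mean = {y: mean(cond[y][1]) for y in ys}
cond_var = {y: var(cond[y][1]) for y in ys}

# Left-hand sides: unconditional mean and variance of X.
total_mean = mean(marginal_x(joint))
total_var = var(marginal_x(joint))

# Right-hand sides of (A3) and (A4).
tower_mean = sum(p_y[y] * cond_mean[y] for y in ys)
e_of_var = sum(p_y[y] * cond_var[y] for y in ys)
var_of_e = sum(p_y[y] * (cond_mean[y] - tower_mean) ** 2 for y in ys)

print(total_mean == tower_mean, total_var == e_of_var + var_of_e)
```

The second term, the variance of the conditional mean, is the part that does not fit into a standard Bellman recursion, which is why it appears as a separate correction in the iterative formula for the value function.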

Then, J_{k} (w_{k}^{1}, w_{k}^{2}, π) can be rewritten as:

\begin{matrix} \begin{matrix} J_{k} (w_{k}^{1}, w_{k}^{2}, π) \\ = E_{k} \{α \sum_{t = k + 2}^{T} ξ_{t}^{1} [E_{k + 1} (w_{t}^{1}) - η_{t}^{1} V a r_{k + 1} (w_{t}^{1})] + (1 - α) \sum_{t = k + 2}^{T} ξ_{t}^{2} [E_{k + 1} (w_{t}^{2}) - η_{t}^{2} V a r_{k + 1} (w_{t}^{2})]\} \\ - α \sum_{t = k + 2}^{T} ξ_{t}^{1} η_{t}^{1} V a r_{k} [E_{k + 1} (w_{t}^{1})] - (1 - α) \sum_{t = k + 2}^{T} ξ_{t}^{2} η_{t}^{2} V a r_{k} [E_{k + 1} (w_{t}^{2})] \\ + α ξ_{k + 1}^{1} [E_{k} (w_{k + 1}^{1}) - η_{k + 1}^{1} V a r_{k} (w_{k + 1}^{1})] + (1 - α) ξ_{k + 1}^{2} [E_{k} (w_{k + 1}^{2}) - η_{k + 1}^{2} V a r_{k} (w_{k + 1}^{2})], \\ = E_{k} [J_{k + 1} (w_{k + 1}^{1}, w_{k + 1}^{2}, π)] - α \sum_{t = k + 2}^{T} ξ_{t}^{1} η_{t}^{1} V a r_{k} [E_{k + 1} (w_{t}^{1})] - (1 - α) \sum_{t = k + 2}^{T} ξ_{t}^{2} η_{t}^{2} V a r_{k} [E_{k + 1} (w_{t}^{2})] \\ + α ξ_{k + 1}^{1} [E_{k} (w_{k + 1}^{1}) - η_{k + 1}^{1} V a r_{k} (w_{k + 1}^{1})] + (1 - α) ξ_{k + 1}^{2} [E_{k} (w_{k + 1}^{2}) - η_{k + 1}^{2} V a r_{k} (w_{k + 1}^{2})] . \end{matrix} \end{matrix}

(A5)

Let f_{k, t} (w_{k}^{1}) = E_{k} (w_{t}^{1}) |_{\hat{π} (k)} = E_{k} [f_{k + 1, t} (w_{k + 1}^{1})] and g_{k, t} (w_{k}^{2}) = E_{k} (w_{t}^{2}) |_{\hat{π} (k)} = E_{k} [g_{k + 1, t} (w_{k + 1}^{2})]. Additionally, due to the fact that V_{k} (w_{k}^{1}, w_{k}^{2}) = max_{π_{k}} J_{k} (w_{k}^{1}, w_{k}^{2}, π (k)) = J_{k} (w_{k}^{1}, w_{k}^{2}, \hat{π} (k)), we can derive the following iterative formula:

\begin{matrix} V_{k} (w_{k}^{1}, w_{k}^{2}) \\ = & max_{π_{k}} \{\begin{matrix} E_{k} [V_{k + 1} (w_{k + 1}^{1}, w_{k + 1}^{2})] - α \sum_{t = k + 2}^{T} ξ_{t}^{1} η_{t}^{1} V a r_{k} [f_{k + 1, t} (w_{k + 1}^{1})] \\ - (1 - α) \sum_{t = k + 2}^{T} ξ_{t}^{2} η_{t}^{2} V a r_{k} [g_{k + 1, t} (w_{k + 1}^{2})] \\ + α ξ_{k + 1}^{1} [E_{k} (w_{k + 1}^{1}) - η_{k + 1}^{1} V a r_{k} (w_{k + 1}^{1})] \\ + (1 - α) ξ_{k + 1}^{2} [E_{k} (w_{k + 1}^{2}) - η_{k + 1}^{2} V a r_{k} (w_{k + 1}^{2})] \end{matrix}\}, \\ k = 0, 1, . . ., T - 2 . \end{matrix}

(A6)

Therefore, we complete the proof of Proposition 1.

Appendix B. The Proof of Theorem 1

When t = T - 1, we have:

\begin{matrix} V_{T - 1} (w_{T - 1}^{1}, w_{T - 1}^{2}) \\ = & max_{π_{T - 1}} \{\begin{matrix} α [ξ_{T}^{1} E_{T - 1} (w_{T}^{1}) - ξ_{T}^{1} η_{T}^{1} V a r_{T - 1} (w_{T}^{1})] \\ + (1 - α) [ξ_{T}^{2} E_{T - 1} (w_{T}^{2}) - ξ_{T}^{2} η_{T}^{2} V a r_{T - 1} (w_{T}^{2})] \end{matrix}\} \\ = & max_{π_{T - 1}} \{\begin{matrix} α ξ_{T}^{1} [s_{T - 1} w_{T - 1}^{1} + μ_{T - 1}^{1} u_{T - 1}^{1} + c_{T - 1} - δ_{T - 1} (q_{T - 1}) - q_{T - 1} {\tilde{μ}}_{T - 1}] \\ - α ξ_{T}^{1} η_{T}^{1} [σ_{T - 1}^{1} {(u_{T - 1}^{1})}^{2} + {\tilde{σ}}_{T - 1} {(q_{T - 1})}^{2}] \\ + (1 - α) ξ_{T}^{2} [s_{T - 1} w_{T - 1}^{2} + μ_{T - 1}^{2} u_{T - 1}^{2} + δ_{T - 1} (q_{T - 1}) - (1 - q_{T - 1}) {\tilde{μ}}_{T - 1}] \\ - (1 - α) ξ_{T}^{2} η_{T}^{2} [σ_{T - 1}^{2} {(u_{T - 1}^{2})}^{2} + {\tilde{σ}}_{T - 1} {(1 - q_{T - 1})}^{2}] \end{matrix}\} . \end{matrix}

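The objective is separable in u_{T - 1}^{1}, u_{T - 1}^{2} and q_{T - 1}. The first-order conditions for the two investment amounts and the maximization over 0 \leq q_{T - 1} \leq 1 give: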

{\hat{u}}_{T - 1}^{1} = \frac{μ_{T - 1}^{1}}{2 η_{T}^{1} σ_{T - 1}^{1}},

(A7)

{\hat{u}}_{T - 1}^{2} = \frac{μ_{T - 1}^{2}}{2 η_{T}^{2} σ_{T - 1}^{2}},

(A8)

\begin{matrix} {\hat{q}}_{T - 1} = arg max_{0 \leq q_{T - 1} \leq 1} \{\begin{matrix} (ξ_{T}^{2} - α ξ_{T}^{1} - α ξ_{T}^{2}) δ_{T - 1} (q_{T - 1}) \\ - {\tilde{μ}}_{T - 1} [α ξ_{T}^{1} q_{T - 1} + (1 - α) ξ_{T}^{2} (1 - q_{T - 1})] \\ - [α ξ_{T}^{1} η_{T}^{1} {(q_{T - 1})}^{2} + (1 - α) ξ_{T}^{2} η_{T}^{2} {(1 - q_{T - 1})}^{2}] {\tilde{σ}}_{T - 1} \end{matrix}\} . \end{matrix}

(A9)
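The terminal-period strategy can be sketched in a few lines of code. This is a minimal illustration rather than the authors' implementation: the retention level is found by a simple grid search over [0, 1] on the q-dependent part of the expanded objective for V_{T - 1}, in which {\tilde{σ}}_{T - 1} multiplies both quadratic terms, and the premium function `delta` is a hypothetical linear loading with parameter `theta` chosen only so that the result can be compared with a closed form. All function names and numerical values are assumptions.

```python
def terminal_investment(mu, eta, sigma):
    """Investment amount from (A7)/(A8): mu / (2 * eta * sigma)."""
    return mu / (2.0 * eta * sigma)

def terminal_retention(alpha, xi1, xi2, eta1, eta2,
                       mu_tilde, sigma_tilde, delta, n_grid=10001):
    """Grid search for the retention level q in [0, 1] at t = T - 1.

    `delta` is a user-supplied premium function q -> delta_{T-1}(q).
    """
    a = alpha * xi1            # weight on the insurer's terms
    b = (1.0 - alpha) * xi2    # weight on the reinsurer's terms

    def objective(q):
        return ((b - a) * delta(q)
                - mu_tilde * (a * q + b * (1.0 - q))
                - sigma_tilde * (a * eta1 * q ** 2 + b * eta2 * (1.0 - q) ** 2))

    grid = [i / (n_grid - 1) for i in range(n_grid)]
    return max(grid, key=objective)

# Hypothetical parameters, for illustration only.
theta = 0.2        # assumed safety loading of the premium
mu_tilde = 1.0     # assumed expected claim
sigma_tilde = 0.5  # assumed claim variance

def delta(q):
    # Assumed linear premium: the reinsurer covers the share (1 - q).
    return (1.0 + theta) * (1.0 - q) * mu_tilde

q_hat = terminal_retention(0.4, 1.0, 1.0, 1.0, 2.0, mu_tilde, sigma_tilde, delta)
u_hat = terminal_investment(mu=0.05, eta=1.0, sigma=0.04)
print(q_hat, u_hat)
```

With this linear premium the objective is a concave quadratic in q, so the grid result can be checked against the stationary point clipped to [0, 1]. For other premium principles only `delta` needs to change.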

Substituting (A7) to (A9) into the objective, we have:

\begin{matrix} \begin{matrix} V_{T - 1} (w_{T - 1}^{1}, w_{T - 1}^{2}) \\ = & \{\begin{matrix} α ξ_{T}^{1} s_{T - 1} w_{T - 1}^{1} + (1 - α) ξ_{T}^{2} s_{T - 1} w_{T - 1}^{2} + \frac{α {(μ_{T - 1}^{1})}^{2}}{4 η_{T}^{1} σ_{T - 1}^{1}} + \frac{(1 - α) {(μ_{T - 1}^{2})}^{2}}{4 η_{T}^{2} σ_{T - 1}^{2}} + α ξ_{T}^{1} c_{T - 1} \\ + (ξ_{T}^{2} - α ξ_{T}^{1} - α ξ_{T}^{2}) δ_{T - 1} ({\hat{q}}_{T - 1}) - {\tilde{μ}}_{T - 1} [α ξ_{T}^{1} {\hat{q}}_{T - 1} + (1 - α) ξ_{T}^{2} (1 - {\hat{q}}_{T - 1})] \\ - [α ξ_{T}^{1} η_{T}^{1} {({\hat{q}}_{T - 1})}^{2} + (1 - α) ξ_{T}^{2} η_{T}^{2} {(1 - {\hat{q}}_{T - 1})}^{2}] {\tilde{σ}}_{T - 1} \end{matrix}\} \\ = & α ξ_{T}^{1} s_{T - 1} w_{T - 1}^{1} + (1 - α) ξ_{T}^{2} s_{T - 1} w_{T - 1}^{2} + κ_{T - 1}, \end{matrix} \end{matrix}

(A10)

where:

κ_{T - 1} = \{\begin{matrix} \frac{α {(μ_{T - 1}^{1})}^{2}}{4 η_{T}^{1} σ_{T - 1}^{1}} + \frac{(1 - α) {(μ_{T - 1}^{2})}^{2}}{4 η_{T}^{2} σ_{T - 1}^{2}} + α ξ_{T}^{1} c_{T - 1} + (ξ_{T}^{2} - α ξ_{T}^{1} - α ξ_{T}^{2}) δ_{T - 1} ({\hat{q}}_{T - 1}) \\ - {\tilde{μ}}_{T - 1} [α ξ_{T}^{1} {\hat{q}}_{T - 1} + (1 - α) ξ_{T}^{2} (1 - {\hat{q}}_{T - 1})] \\ - [α ξ_{T}^{1} η_{T}^{1} {({\hat{q}}_{T - 1})}^{2} + (1 - α) ξ_{T}^{2} η_{T}^{2} {(1 - {\hat{q}}_{T - 1})}^{2}] {\tilde{σ}}_{T - 1} \end{matrix}\} .

(A11)

Assume that Theorem 1 holds for t = j + 1, j + 2, ..., T - 1. Then, when t = j, we have:

\begin{matrix} \begin{matrix} V_{j} (w_{j}^{1}, w_{j}^{2}) \\ = & max_{π_{j}} \{\begin{matrix} α (\sum_{m = j + 2}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) E_{j} (w_{j + 1}^{1}) + (1 - α) (\sum_{m = j + 2}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) E_{j} (w_{j + 1}^{2}) + κ_{j + 1} \\ - α \sum_{m = j + 2}^{T} ξ_{m}^{1} η_{m}^{1} V a r_{j} [f_{j + 1, m} (w_{j + 1}^{1})] - (1 - α) \sum_{m = j + 2}^{T} ξ_{m}^{2} η_{m}^{2} V a r_{j} [g_{j + 1, m} (w_{j + 1}^{2})] \\ + α ξ_{j + 1}^{1} [E_{j} (w_{j + 1}^{1}) - η_{j + 1}^{1} V a r_{j} (w_{j + 1}^{1})] + (1 - α) ξ_{j + 1}^{2} [E_{j} (w_{j + 1}^{2}) - η_{j + 1}^{2} V a r_{j} (w_{j + 1}^{2})] \end{matrix}\} \\ = & max_{π_{j}} \{\begin{matrix} α (\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) [s_{j} w_{j}^{1} + μ_{j}^{1} u_{j}^{1} + c_{j} - δ_{j} (q_{j}) - q_{j} {\tilde{μ}}_{j}] \\ - α (\sum_{m = j + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) [σ_{j}^{1} {(u_{j}^{1})}^{2} + {\tilde{σ}}_{j} {(q_{j})}^{2}] + κ_{j + 1} \\ + (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) [s_{j} w_{j}^{2} + μ_{j}^{2} u_{j}^{2} + δ_{j} (q_{j}) - (1 - q_{j}) {\tilde{μ}}_{j}] \\ - (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) [σ_{j}^{2} {(u_{j}^{2})}^{2} + {\tilde{σ}}_{j} {(1 - q_{j})}^{2}] \end{matrix}\} . \end{matrix} \end{matrix}

(A12)

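As in the case t = T - 1, the first-order conditions with respect to u_{j}^{1} and u_{j}^{2} and the maximization over 0 \leq q_{j} \leq 1 give: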

{\hat{u}}_{j}^{1} = \frac{(\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) μ_{j}^{1}}{2 (\sum_{m = j + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{1}},

(A13)

{\hat{u}}_{j}^{2} = \frac{(\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) μ_{j}^{2}}{2 (\sum_{m = j + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{2}},

(A14)

\begin{matrix} {\hat{q}}_{j} = arg max_{0 \leq q_{j} \leq 1} \{\begin{matrix} α (\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) [- δ_{j} (q_{j}) - q_{j} {\tilde{μ}}_{j}] \\ - α (\sum_{m = j + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{j} {(q_{j})}^{2} \\ + (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) [δ_{j} (q_{j}) - (1 - q_{j}) {\tilde{μ}}_{j}] \\ - (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{j} {(1 - q_{j})}^{2} \end{matrix}\} . \end{matrix}

(A15)
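The investment formulas (A13) and (A14) depend on the data only through two weighted sums, which can be built backward in time. The sketch below is a hedged illustration under assumed inputs: writing A_j for the sum of ξ_m times the product of s_i and B_j for the sum of ξ_m η_m times the product of (s_i)^2, it uses the recursions A_j = ξ_{j+1} + s_{j+1} A_{j+1} and B_j = ξ_{j+1} η_{j+1} + (s_{j+1})^2 B_{j+1}, which follow from splitting off the m = j + 1 term, and compares them with the direct sums. The names `A`, `B`, the list layout and all numbers are assumptions made for this example.

```python
from math import prod

def coefficients_direct(j, xi, eta, s):
    """Direct evaluation of the two sums in (A13)/(A14) for one participant.

    xi[m], eta[m] are indexed by m = 1..T (index 0 is unused padding);
    s[i] is indexed by i = 0..T-1. An empty product equals 1.
    """
    T = len(s)
    A = sum(xi[m] * prod(s[j + 1:m]) for m in range(j + 1, T + 1))
    B = sum(xi[m] * eta[m] * prod(x * x for x in s[j + 1:m])
            for m in range(j + 1, T + 1))
    return A, B

def investment_path(xi, eta, s, mu, sigma):
    """Backward recursion for the investment amounts u_j, j = T-1, ..., 0."""
    T = len(s)
    u = [0.0] * T
    A = B = 0.0
    for j in range(T - 1, -1, -1):
        if j == T - 1:
            A, B = xi[T], xi[T] * eta[T]
        else:
            A = xi[j + 1] + s[j + 1] * A
            B = xi[j + 1] * eta[j + 1] + s[j + 1] ** 2 * B
        u[j] = A * mu[j] / (2.0 * B * sigma[j])
    return u

# Hypothetical three-period data for one participant (illustration only).
xi = [0.0, 0.5, 0.5, 1.0]     # xi_1..xi_3, index 0 unused
eta = [0.0, 1.0, 1.0, 2.0]    # eta_1..eta_3, index 0 unused
s = [1.02, 1.03, 1.04]        # risk-free gross returns s_0..s_2
mu = [0.05, 0.05, 0.05]       # expected excess returns mu_0..mu_2
sigma = [0.04, 0.04, 0.04]    # return variances sigma_0..sigma_2

u = investment_path(xi, eta, s, mu, sigma)
print(u)
```

Setting every intermediate weight ξ_m (m < T) to zero in this sketch removes the intertemporal restrictions, which makes it easy to compare the two cases numerically.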

Substituting (A13) to (A15) back into (A12) and into the definitions of f_{j, τ} and g_{j, τ}, we have:

\begin{matrix} \begin{matrix} V_{j} (w_{j}^{1}, w_{j}^{2}) \\ = & \{\begin{matrix} α (\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j}^{m - 1} s_{i}) w_{j}^{1} + (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j}^{m - 1} s_{i}) w_{j}^{2} \\ + \frac{α (\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) {(μ_{j}^{1})}^{2}}{4 (\sum_{m = j + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{1}} + \frac{(1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) {(μ_{j}^{2})}^{2}}{4 (\sum_{m = j + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{2}} \\ + α (\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) [c_{j} - δ_{j} ({\hat{q}}_{j}) - {\hat{q}}_{j} {\tilde{μ}}_{j}] \\ - α (\sum_{m = j + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{j} {({\hat{q}}_{j})}^{2} + κ_{j + 1} \\ + (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) [δ_{j} ({\hat{q}}_{j}) - (1 - {\hat{q}}_{j}) {\tilde{μ}}_{j}] \\ - (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{j} {(1 - {\hat{q}}_{j})}^{2} \end{matrix}\} \\ = & α (\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j}^{m - 1} s_{i}) w_{j}^{1} + (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j}^{m - 1} s_{i}) w_{j}^{2} + κ_{j}, \end{matrix} \end{matrix}

(A16)

\begin{matrix} \begin{matrix} f_{j, τ} (w_{j}^{1}) & = & \prod_{i = j}^{τ - 1} s_{i} w_{j}^{1} + \frac{(\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) {(μ_{j}^{1})}^{2}}{2 (\sum_{m = j + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{1}} + \prod_{i = j + 1}^{τ - 1} s_{i} [c_{j} - δ_{j} ({\hat{q}}_{j}) - {\hat{q}}_{j} {\tilde{μ}}_{j}] + γ_{j + 1, τ} \\ = & \prod_{i = j}^{τ - 1} s_{i} w_{j}^{1} + γ_{j, τ}, \end{matrix} \end{matrix}

(A17)

\begin{matrix} \begin{matrix} g_{j, τ} (w_{j}^{2}) & = & \prod_{i = j}^{τ - 1} s_{i} w_{j}^{2} + \frac{(\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) {(μ_{j}^{2})}^{2}}{2 (\sum_{m = j + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{2}} + \prod_{i = j + 1}^{τ - 1} s_{i} [δ_{j} ({\hat{q}}_{j}) - (1 - {\hat{q}}_{j}) {\tilde{μ}}_{j}] + ρ_{j + 1, τ} \\ = & \prod_{i = j}^{τ - 1} s_{i} w_{j}^{2} + ρ_{j, τ} . \end{matrix} \end{matrix}

(A18)

Comparing the two expressions in each of (A16) to (A18), we obtain the following recursions:

\{\begin{matrix} κ_{j} = & κ_{j + 1} + \frac{α (\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) {(μ_{j}^{1})}^{2}}{4 (\sum_{m = j + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{1}} + \frac{(1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) {(μ_{j}^{2})}^{2}}{4 (\sum_{m = j + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{2}} \\ + α (\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) [c_{j} - δ_{j} ({\hat{q}}_{j}) - {\hat{q}}_{j} {\tilde{μ}}_{j}] - α (\sum_{m = j + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{j} {({\hat{q}}_{j})}^{2} \\ + (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) [δ_{j} ({\hat{q}}_{j}) - (1 - {\hat{q}}_{j}) {\tilde{μ}}_{j}] \\ - (1 - α) (\sum_{m = j + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{j} {(1 - {\hat{q}}_{j})}^{2}, \\ γ_{j, τ} = & γ_{j + 1, τ} + \frac{(\sum_{m = j + 1}^{T} ξ_{m}^{1} \prod_{i = j + 1}^{m - 1} s_{i}) {(μ_{j}^{1})}^{2}}{2 (\sum_{m = j + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{1}} + \prod_{i = j + 1}^{τ - 1} s_{i} [c_{j} - δ_{j} ({\hat{q}}_{j}) - {\hat{q}}_{j} {\tilde{μ}}_{j}], \\ ρ_{j, τ} = & ρ_{j + 1, τ} + \frac{(\sum_{m = j + 1}^{T} ξ_{m}^{2} \prod_{i = j + 1}^{m - 1} s_{i}) {(μ_{j}^{2})}^{2}}{2 (\sum_{m = j + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = j + 1}^{m - 1} {(s_{i})}^{2}) σ_{j}^{2}} + \prod_{i = j + 1}^{τ - 1} s_{i} [δ_{j} ({\hat{q}}_{j}) - (1 - {\hat{q}}_{j}) {\tilde{μ}}_{j}] . \end{matrix}

(A19)

This indicates that:

\{\begin{matrix} κ_{j} = & \sum_{k = j}^{T} [\frac{α (\sum_{m = k + 1}^{T} ξ_{m}^{1} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{1})}^{2}}{4 (\sum_{m = k + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{1}} + \frac{(1 - α) (\sum_{m = k + 1}^{T} ξ_{m}^{2} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{2})}^{2}}{4 (\sum_{m = k + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{2}}] \\ + α \sum_{k = j}^{T} [(\sum_{m = k + 1}^{T} ξ_{m}^{1} \prod_{i = k + 1}^{m - 1} s_{i}) [c_{k} - δ_{k} ({\hat{q}}_{k}) - {\hat{q}}_{k} {\tilde{μ}}_{k}]] \\ - α \sum_{k = j}^{T} [(\sum_{m = k + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{k} {({\hat{q}}_{k})}^{2}] \\ + (1 - α) \sum_{k = j}^{T} [(\sum_{m = k + 1}^{T} ξ_{m}^{2} \prod_{i = k + 1}^{m - 1} s_{i}) [δ_{k} ({\hat{q}}_{k}) - (1 - {\hat{q}}_{k}) {\tilde{μ}}_{k}]] \\ - (1 - α) \sum_{k = j}^{T} [(\sum_{m = k + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) {\tilde{σ}}_{k} {(1 - {\hat{q}}_{k})}^{2}], \\ γ_{j, τ} = & \sum_{k = j}^{τ} [\frac{(\sum_{m = k + 1}^{T} ξ_{m}^{1} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{1})}^{2}}{2 (\sum_{m = k + 1}^{T} ξ_{m}^{1} η_{m}^{1} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{1}}] + \sum_{k = j}^{τ} [\prod_{i = k + 1}^{τ - 1} s_{i} [c_{k} - δ_{k} ({\hat{q}}_{k}) - {\hat{q}}_{k} {\tilde{μ}}_{k}]], \\ ρ_{j, τ} = & \sum_{k = j}^{τ} [\frac{(\sum_{m = k + 1}^{T} ξ_{m}^{2} \prod_{i = k + 1}^{m - 1} s_{i}) {(μ_{k}^{2})}^{2}}{2 (\sum_{m = k + 1}^{T} ξ_{m}^{2} η_{m}^{2} \prod_{i = k + 1}^{m - 1} {(s_{i})}^{2}) σ_{k}^{2}}] + \sum_{k = j}^{τ} [\prod_{i = k + 1}^{τ - 1} s_{i} [δ_{k} ({\hat{q}}_{k}) - (1 - {\hat{q}}_{k}) {\tilde{μ}}_{k}]] . \end{matrix}

(A20)


Figure 1. Time-consistent investment strategies for the insurer and the reinsurer (T = 100).

Figure 2. Time-consistent investment strategy for the insurer (α = 0.6 and T = 100).

Figure 3. Time-consistent reinsurance strategy under the expected value principle (α = 0.4 and T = 100).

Figure 4. Time-consistent reinsurance strategy under the expected value principle (α = 0.6 and T = 100).

Figure 5. Time-consistent reinsurance strategy under the variance value principle (α = 0.4 and T = 100).

Figure 6. Time-consistent reinsurance strategy under the variance value principle (α = 0.6 and T = 100).

Figure 7. Time-consistent reinsurance strategy under the expected value principle (α = 0.4 and T = 100).

Figure 8. Time-consistent reinsurance strategy under the expected value principle (α = 0.6 and T = 100).

Figure 9. Time-consistent reinsurance strategy under the variance value principle (α = 0.4 and T = 100).

Figure 10. Time-consistent reinsurance strategy under the variance value principle (α = 0.6 and T = 100).

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
