IDP-Core: Novel Cooperative Solution for Differential Games

Petrosian, Ovanes; Zakharov, Victor

doi:10.3390/math8050721

Open AccessArticle

IDP-Core: Novel Cooperative Solution for Differential Games

by

Ovanes Petrosian

^1,2,*

and

Victor Zakharov

²

¹

College of Mathematics and Computer Science, Yanan University, Yan’an 716000, China

²

Faculty of Applied Mathematics and Control Processes, St. Petersburg University, Universitetskaya naberezhnaya 7-9, 199034 St. Petersburg, Russia

^*

Author to whom correspondence should be addressed.

Mathematics 2020, 8(5), 721; https://doi.org/10.3390/math8050721

Submission received: 2 April 2020 / Revised: 28 April 2020 / Accepted: 29 April 2020 / Published: 4 May 2020

(This article belongs to the Special Issue Game Theory)

Download

Browse Figures

Versions Notes

Abstract

:

IDP-core is a new cooperative solution for dynamic and differential games. A novel approach of constructing solutions for dynamic and differential games was employed in which the time consistency property was used as the main axiom property for the cooperative solution. Another new and important approach used for constructing IDP-core is the IDP dominance, which allows to select undominated imputation distribution procedures and construct the cooperative solution or imputation set. This approach shows the potential of using the time consistency property as the main axiom for solutions in various fields such as Social Choice and Mechanism Design. The overall procedure for defining the cooperative solution is also new since IDP-core was constructed using imputation distribution procedures but not by using imputations directly.

Keywords:

differential games; cooperative differential games; Time Consistency; IDP-core; IDP dominance

1. Introduction

The theory of cooperative games examines how optimal parameters of cooperative and strategic agreements are to be determined. The main problem in the theory of cooperative games with transferable utilities is to determine the allocation procedure for total payoff in cases when all players cooperate. The rule of how to allocate cooperative payoff among the players is called the imputation. In the theory of cooperative games with non-transferable utilities, the main problem is to define agreement on strategies or a game outcome favorable to all players.

Within the framework of classical cooperative game theory with transferable utilities, numerous cooperative solutions or allocation rules were studied. One of them is the Core. The concept of Core was proposed by D. Gillis [1], which is a generalization of the contractual Edgeworth curve [2]. Edgeworth described a market with two products and two participantsp; here, the Core is defined as a part of the Pareto front. The Core is the set of undominated imputations, each of which can be used as a solution in the game.

It is important to study the non-emptiness property of a cooperative solution, which is to determine the conditions under which the cooperative solution is not empty since its applicability depends on the wideness of the class of games to which this solution can be applied. G. Scarf [3] showed that the Core is not empty for the class of convex games in characteristic function form. Characteristic function is a function of coalition or subset of players in the game, which shows the profit of coalition. Generalization of Scharf results can be found in the paper of L. Biller [4] and Shapley [5]. Necessary and sufficient conditions for the non-emptiness of Core were formulated by Bondareva [6] and Shapley [7], where the main role of proof is the concept of a balanced game. Unfortunately, based on this concept it is impossible to apply a constructive method for choosing the specific imputations from the Core.

V. Zakharov in [8] proposed the necessary and sufficient conditions for the non-emptiness of Core, which simplify the test for a single-point solution (imputation), such as the Shapley value, Banzhaf power index and others, whenever they belong to the Core. In [9,10] based on of this approach, geometric properties for several cooperative solutions were investigated. This approach implies that the non-emptiness property of Core can be formulated by a linear programming problem constructed using the values of the characteristic function.

It is also important to construct cooperative solutions for a class of dynamic and differential games. Solutions for such models can be used for modeling cooperative and strategic agreements where conditions are defined over a long time interval [11,12]. The theory of differential games was developed as a separate class of applied mathematics in the 1950s. One of the first works in the field of differential games is the work of R. Isaacs [13] in which the notions of state, controls, and the problem of aircraft interception by a guided missile were formulated, and a fundamental equation for defining the solution derived. A comprehensive description of dynamic cooperative games is presented in [14].

A natural approach for researching cooperative differential games is an attempt at transferring the results of classical static cooperative theory [15] to the theory of differential games. However, in order to use the results of classical theory, it is additionally necessary to study the time consistency and strong time consistency properties of cooperative solutions. Time consistency of cooperative solution is the property that shows that for the players it is not beneficial to deviate from the chosen cooperative solution during the game. The use of time-inconsistent cooperative solutions in the field of economics, ecology, and management makes these solutions unfeasible because players might find it profitable to reconsider the cooperative solution. The notion of the cooperative solution’s time consistency was first formulated mathematically by L.A. Petrosyan in 1977 [16]. In [17] a method was proposed to construct time-consistent cooperative solutions using a special payment scheme, called the imputation distribution procedure (IDP). The notion of strong time consistency was formulated in [18]. Recent papers [19,20,21,22] are devoted to the study of the time consistency property of cooperative solutions.

In order to solve the time inconsistency problem in a classical cooperative solution, an imputation distribution procedure should be used. However, there exists another, rather new approach that allows for constructing time-consistent cooperative solutions. This approach uses the time consistency property as a basic axiomatic property for defining the cooperative solution. This approach is the subject of this paper and carries an innovative character. It is important to notice that the further use of time consistency property for dynamic cooperative games, Social Choice, and Mechanisms Design, as an axiom, is promising. Another important property considered in this paper is the IDP dominance property. According to this property, the corresponding cooperative solution is constructed using the imputation distribution procedures, which are undominated. We say that the IDP is undominated by coalition S if there does not exist another IDP, coalition S, and time instant such that the instant payments corresponding to IDP are higher for players from coalition S at a given time instant than in the current IDP.

In the paper [23] the notion of a strong time-consistent subset of the Core was introduced. Their authors constructed a new cooperative solution using the geometric approach and proved that it was a subset of Core and possesses a strong time consistency property. Later on, this solution was called the IDP-core and it can be constructed using a system of linear constraints for imputation distribution procedures. These conditions are defined for each time instant of a differential game. From the non-emptiness of a set described by these constraints, the non-emptiness of the corresponding set of IDPs at each time instant, it follows that the IDP-core is not empty. In the paper [24] we apply the technique proposed in [8] to study the non-emptiness of IDP-core for each time instant, and if it is non-empty, we conclude that IDP-core is non-empty. Obtained results can be used for the construction of IDP-core and verification of its non-emptiness as a numerical example. Moreover, a special case of this approach is presented for 3-player differential games. It is possible to analytically construct conditions for non-emptiness of IDP-core depending on the characteristic function. Furthermore, it is possible to define an analytical formula for selectors of IDP-core, in particular, the formula for imputation distribution procedures of IDP-core selectors.

The paper is structured as follows. Section 2 contains preliminary information, including the definition of a cooperative solution and time consistency property. Section 3 is devoted to the description of IDP-dominance, to the definition of IDP-core and corresponding necessary and sufficient conditions. Section 4 is devoted to studying the non-emptiness of IDP-core using linear programming methods. Section 5 presents the differential game model of resource extraction, IDP-core for this model is constructed using the corresponding necessary and sufficient conditions, non-emptiness conditions are studied and conclusions are drawn.

2. Problem Statement and Preliminary Information

2.1. Differential Game Model

In this section, the general description of the differential game model is given. The main concepts of this model are the type of model, payoff functions of players, motion equations, and solution concept. Type of the game model reflects what we intend to do with the model, in this paper we consider the cooperative game model. Here we need to define how to allocate joint cooperative payoff among the players. Payoff functions of players define the objectives of players depending on the state of the game, strategies and are calculated on some specific time interval (in our case closed time interval). Motion equations define of how the state of the game changes according to the strategies of players. In the case of the cooperative game model solution concept defines the exact type of imputation set that will be used to allocate joint payoff among the players.

Consider an n-player differential game

Γ (x_{0}, T - t_{0})

with prescribed duration

T - t_{0}

and initial condition

x_{0}

. Game dynamics are defined by the system of differential equations:

\begin{matrix} \dot{x} = f (x, u_{1}, \dots, u_{n}), x \in R^{n}, u_{i} \in U_{i} \subset comp R^{k}, t \in [t_{0}, T], i = \bar{1, n}, \\ x (t_{0}) = x_{0}, \end{matrix}

(1)

for which the conditions of existence, uniqueness and continuity of solution

x (t)

for any admissible measurable controls

u_{1} (\cdot), \dots, u_{n} (\cdot)

are satisfied. Open-loop control

u_{i} (t)

satisfying the system (1) is a strategy of player i and

comp R^{k}

is the compact set in k-dimension real number space (k is integer).

Let

N = {1, \dots, n}

be the set of players. Payoff of player i is defined in the following way:

K_{i} (x_{0}, T - t_{0}; u_{1}, \dots, u_{n}) = \int_{t_{0}}^{T} h_{i} (x (τ), u_{1} (τ), \dots, u_{n} (τ)) d τ, i = \bar{1, n},

(2)

where

h_{i} (x, u_{1}, \dots, u_{n}) \geq 0

,

i = \bar{1, n}

and

f (x, u_{1}, \dots, u_{n})

are integrable functions,

x (t)

is the solution of system (1) with controls

u (t) = (u_{1} (t), \dots, u_{n} (t))

involved.

2.2. Cooperative Differential Game Model

In the cooperative differential game model with transferable utility there are two problems:

(1): Determination of a strategy set for players which maximizes the sum of their payoffs or determination of strategies corresponding to cooperative behavior. These strategies $u^{*} = (u_{1}^{*}, \dots, u_{n}^{*})$ are called optimal, the corresponding trajectory is called the cooperative trajectory and denoted by $x^{*} (t)$ .
(2): Determination of the allocation rule for the maximum joint payoff of players corresponding to the optimal strategies $u^{*} (t)$ and determination of optimal trajectory $x^{*} (t)$ . Namely, the determination of a cooperative solution as a subset of the imputation set.

Let

u^{*} = (u_{1}^{*}, \dots, u_{n}^{*})

be the vector of optimal strategies (open-loop controls) for players; i.e., a set of controls that maximizes the joint payoff of players:

u^{*} = (u_{1}^{*}, \dots, u_{n}^{*}) = arg max_{u_{1}, \dots, u_{n}} \sum_{i = 1}^{n} K_{i} (x_{0}, T - t_{0}; u_{1}, \dots, u_{n}) .

(3)

Suppose that the maximum in (3) is achieved on the set of admissible strategies.

In order to determine how to allocate the maximum total payoff among players, it is necessary to define the notion of the characteristic function of coalition

S \subseteq N

. The characteristic function shows the strength of a coalition and thus allows the contribution of players to each coalition to be taken into account.

Suppose that in the game

Γ (x_{0}, T - t_{0})

characteristic function

V (S; x_{0}, T - t_{0})

,

S \subseteq N

is constructed in any relevant way (for example, as in [25]). We assume that the superadditivity conditions are satisfied:

\begin{matrix} V (S_{1} \cup S_{2}; x_{0}, T - t_{0}) \geq V (S_{1}; x_{0}, T - t_{0}) + V (S_{2}; x_{0}, T - t_{0}), \\ \forall S_{1}, S_{2} \subseteq N, S_{1} \cap S_{2} = \emptyset . \end{matrix}

Denote by

L (x_{0}, T - t_{0})

the set of imputations [26] in the game

Γ (x_{0}, T - t_{0})

:

\begin{matrix} L (x_{0}, T - t_{0}) = {ξ (x_{0}, T - t_{0}) = (ξ_{1} (x_{0}, T - t_{0}), \dots, ξ_{n} (x_{0}, T - t_{0})) : \\ \sum_{i = 1}^{n} ξ_{i} (x_{0}, T - t_{0}) = V (N; x_{0}, T - t_{0}), \\ ξ_{i} (x_{0}, T - t_{0}) \geq V ({i}; x_{0}, T - t_{0}), i \in N}, \end{matrix}

where

V ({i}; x_{0}, T - t_{0})

is a value of characteristic function

V (S; x_{0}, T - t_{0})

for coalition

S = {i}

.

By

M (x_{0}, T - t_{0})

denote an arbitrary cooperative solution or subset of imputation set

L (x_{0}, T - t_{0})

:

M (x_{0}, T - t_{0}) \subseteq L (x_{0}, T - t_{0}) .

Suppose that at the beginning of game

Γ (x_{0}, T - t_{0})

at the instant

t_{0}

, players agreed to select a subset of

L (x_{0}, T - t_{0})

or some cooperative solution. However, suppose that at some instant

\bar{t}

players decided to reconsider the chosen cooperative solution, or decided to reconsider the allocation rule for a cooperative payoff. In order to model their behavior, it is necessary to define the notion of subgame

Γ (x^{*} (t), T - t)

along the cooperative trajectory

x^{*} (t)

starting at the instant

t \in [t_{0}, T]

.

For each subgame

Γ (x^{*} (t), T - t)

,

t \in [t_{0}, T]

along the trajectory

x^{*} (t)

, we define the superadditive characteristic function

V (S; x^{*} (t), T - t)

,

S \subseteq N

in the same way as it was done for the initial game

Γ (x_{0}, T - t_{0})

:

\begin{matrix} \forall S, A \subseteq N, S \cap A = \emptyset : \\ V (S \cup A; x^{*} (t), T - t) \geq V (S; x^{*} (t), T - t) + V (A; x^{*} (t), T - t) . \end{matrix}

(4)

It is also possible to define the notion of imputation

ξ (x^{*} (t), T - t)

for a subgame

Γ (x^{*} (t), T - t)

along the cooperative trajectory

x^{*} (t)

,

t \in [t_{0}, T]

. The set of all possible imputations in the subgame

Γ (x^{*} (t), T - t)

is denoted by

L (x^{*} (t), T - t)

,

t \in [t_{0}, T]

:

\begin{matrix} L (x^{*} (t), T - t) = {ξ (x^{*} (t), T - t) = (ξ_{1} (x^{*} (t), T - t), \dots, ξ_{n} (x^{*} (t), T - t)) : \\ \sum_{i = 1}^{n} ξ_{i} (x^{*} (t), T - t) = V (N; x^{*} (t), T - t), \\ ξ_{i} (x^{*} (t), T - t) \geq V ({i}; x^{*} (t), T - t), i \in N} . \end{matrix}

(5)

The superaditivity property (4) for characteristic function

V (S; x^{*} (t), T - t)

guarantees the non-emptiness of imputation set

L (x^{*} (t), T - t)

,

t \in [t_{0}, T]

. The cooperative solution of subgame

Γ (x^{*} (t), T - t)

is denoted correspondingly by

M (x^{*} (t), T - t)

.

2.3. Core

In cooperative game theory, the main problem is “fair” allocation of the maximum joint payoff

V (N; x_{0}, T - t_{0})

among the players from grand coalition

N = {1, \dots, n}

.

Suppose that players in the cooperative differential game

Γ (x_{0}, T - t_{0})

(subgame

Γ (x^{*} (t), T - t)

,

t \in [t_{0}, T]

along the cooperative trajectory

x^{*} (t)

) made an agreement on the allocation rule

ξ (x_{0}, T - t_{0})

(imputation

ξ (x^{*} (t), T - t)

), where none of imputations dominates

ξ (x_{0}, T - t_{0})

(

ξ (x^{*} (t), T - t)

) [26]. Such an allocation rule is stable in the sense that there not exists imputation that would be better for each coalition at every time instant

t \in [t_{0}, T]

.

Definition 1.

We call the set of undominated imputations of cooperative differential game

Γ (x^{*} (t), T - t)

by the Core and denote it by

C (x^{*} (t), T - t)

,

t \in [t_{0}, T]

.

The following theorem holds:

Theorem 1.

Imputation

ξ (x^{*} (t), T - t)

belongs to the Core

C (x^{*} (t), T - t)

, if and only if for all

S \subseteq N

the following inequalities are satisfied:

V (S; x^{*} (t), T - t) \leq \sum_{i \in S} ξ_{i} (x^{*} (t), T - t), t \in [t_{0}, T] .

2.4. Non-Emptiness of Core in Static Games

These are the main results concerning the nonemptiness conditions of Core in static games. Necessary and sufficient conditions for non-emptiness of Core were formulated by O. Bondareva [6] and by L. Shapley [7]. These conditions are based on the concept of a balanced game, but the application of this approach for a specific game model is difficult.

In the paper [27] G. Owen showed that in the game

(N, v)

exists a non-empty Core, if and only if the optimal value of the linear programming problem

\begin{matrix} \sum_{i \in N} ξ_{i} ⟶ m i n \\ \sum_{i \in S} ξ_{i} \geq v (S), \forall S \subseteq N, S \neq \emptyset \end{matrix}

is equal to

v (N)

.

The papers [8,9,10] also make use of linear programming problem for Core’s non-emptiness. Consider the following linear programming problem:

\begin{matrix} \sum_{i \in N} ξ_{i} ⟶ m i n \\ \sum_{i \in S} ξ_{i} \geq v (S), \forall S \subseteq N, S \neq N, \emptyset . \end{matrix}

(6)

Suppose that

ξ^{0} = (ξ_{1}^{0}, \dots, ξ_{n}^{0})

is some arbitrary optimal solution of the linear programming problem (6). The set of all optimal solutions of the optimization problem (6) is denoted by

X^{0} (v)

. In [8] it is shown that the necessary and sufficient conditions of non-emptiness of Core can be formalized in the following way:

Theorem 2.

The Core in cooperative game with transferable utility

(N, v)

is nonempty, if and only if the following inequality is satisfied:

\sum_{i \in N} ξ_{i}^{0} \leq v (N),

(7)

where

ξ^{0} \in X^{0} (v)

is a solution of the linear programming problem (6).

2.5. Time-Consistency of Cooperative Solution and Imputation Distribution Procedure

Transferring the results of static cooperative game theory to the field of cooperative differential games brings about the problem of defining the time-consistent cooperative solution. The problem of defining the solution of the differential game with prescribed duration was studied in the papers of L.A. Petrosyan [16,17]. Time consistency of cooperative solution is the property that shows that for the players it is not beneficial to deviate from the chosen cooperative solution during the game.

The main approach for solving the problem of time inconsistency of cooperative solution in the differential game is the imputation distribution procedure, proposed in [17]. In this paper, imputation distribution procedure was defined as a vector function for a fixed imputation. In this paper, we consider another approach that generalizes the notion of IDP.

Assume IDP’s in cooperative differential game

Γ (x_{0}, T - t_{0})

are integrable vector functions that constitute some imputation from the imputation set:

β (t) : \int_{t_{0}}^{T} β (τ) d τ \in L (x_{0}, T - t_{0})

(8)

or

\begin{matrix} \int_{t_{0}}^{T} β_{i} (τ) d τ \geq V ({i}; x_{0}, T - t_{0}), i \in N, \\ \sum_{i \in N} \int_{t_{0}}^{T} β_{i} (τ) d τ = V (N; x_{0}, T - t_{0}) . \end{matrix}

Therefore, in the above definition, IDP is not based on the imputation itself but generates it. We define also the so-called corresponding IDP, the concept of which is close to the initial definition of IDP in the paper [17].

Definition 2.

The integrable function

β (t) = (β_{1} (t), \dots, β_{n} (t))

,

t \in [t_{0}, T]

is called a corresponding imputation distribution procedure (IDP) for

ξ (x_{0}, T - t_{0}) \in L (x_{0}, T - t_{0})

, if the following equalities hold:

ξ_{i} (x_{0}, T - t_{0}) = \int_{t_{0}}^{T} β_{i} (τ) d τ, i \in N .

(9)

Actually, the corresponding IDP

β (t)

depends on

ξ (x_{0}, T - t_{0})

and is not unique for this imputation. We can represent it in the form

β (t) = β (t, ξ (x_{0}, T - t_{0}))

or

β_{i} (t) = β_{i} (t, ξ_{i} (x_{0}, T - t_{0})), i \in N .

From (9) we have for

t \in [t_{0}, T]

,

i \in N

:

ξ_{i} (x_{0}, T - t_{0}) = \int_{t_{0}}^{t} β_{i} (τ) d τ + \int_{t}^{T} β_{i} (τ) d τ

or

\int_{t}^{T} β_{i} (τ) d τ = ξ_{i} (x_{0}, T - t_{0}) - \int_{t_{0}}^{t} β_{i} (τ) d τ .

That is IDP shares at instant t imputations in two parts: payoffs to player i, which are received in interval

[t_{0}, t]

and in interval

(t, T]

.

Definition 3.

The cooperative solution

M (x_{0}, T - t_{0})

in the game

Γ (x_{0}, T - t_{0})

is called time-consistent, if for each imputation

ξ (x_{0}, T - t_{0}) \in M (x_{0}, T - t_{0})

there exists a corresponding IDP

β (t) = β (t, ξ (x_{0}, T - t_{0}))

such that:

\int_{t}^{T} β (τ) d τ \in M (x^{*} (t), T - t), t \in [t_{0}, T]

(10)

or equivalently

ξ (x_{0}, T - t_{0}) - \int_{t_{0}}^{t} β (τ) d τ \in M (x^{*} (t), T - t), t \in [t_{0}, T] .

(11)

Notice that from condition (10) we have the following equality

\sum_{i \in N} \int_{t}^{T} β_{i} (τ) d τ = V (N; x^{*} (t), T - t), t \in [t_{0}, T] .

(12)

It is obvious that if

M (x^{*} (t), T - t) \neq \emptyset

for

\forall t \in [t_{0}, T]

, then for any differentiable by t function

ξ (x^{*} (t), T - t) \in M (x^{*} (t), T - t)

(

ξ (x^{*} (t_{0}), T - t_{0}) = ξ (x_{0}, T - t_{0})

) IDP

β (t)

can be defined using the formula:

\begin{matrix} β (t) = - \frac{d}{d t} ξ (x^{*} (t), T - t), t \in [t_{0}, T], i \in N, \\ ξ (x^{*} (t_{0}), T - t_{0}) = ξ (x_{0}, T - t_{0}) . \end{matrix}

(13)

Then imputation

ξ (x_{0}, T - t_{0})

is defined by the formula:

ξ (x_{0}, T - t_{0}) = \int_{t_{0}}^{t} β (τ) d τ + ξ (x^{*} (t), T - t), t \in [t_{0}, T] .

Define an imputation in the current cooperative game

Γ (x^{*} (t), T - t)

with characteristic function

V (S; x^{*} (t), T - t)

which corresponds to a given IDP

β (t) = β (t, ξ (x_{0}, T - t_{0}))

as

ξ (x^{*} (t), T - t) = \int_{t}^{T} β (τ) d τ .

(14)

From Definition 3 we have

ξ (x^{*} (t), T - t) \in M (x^{*} (t), T - t) .

(15)

We will call the imputation (14) the dynamic imputation generated by the corresponding IDP

β (t) = β (t, ξ (x_{0}, T - t_{0}))

.

3. IDP-Core and Dominance of Imputation Distribution Procedures

Consider the development of game at instant

t \in (t_{0}, T)

. Suppose that at instant

t_{0}

players agreed to realize imputation

ξ (x_{0}, T - t_{0}) = (ξ_{1} (x_{0}, T - t_{0}), \dots, ξ_{n} (x_{0}, T - t_{0}))

. Then, according to the corresponding IDP

β (t)

, until the instant t, player

i \in N

receives the payoff:

\int_{t_{0}}^{t} β_{i} (τ) d τ .

However, for some players IDP

β (t, ξ (x_{0}, T - t_{0}))

would not be beneficial if there exists another imputation distrubution procedure

\bar{β} (t, ξ (x_{0}, T - t_{0}))

, according to which player i at interval

[t_{0}, t]

receives more payoff:

\int_{t_{0}}^{t} {\bar{β}}_{i} (τ) d τ > \int_{t_{0}}^{t} β_{i} (τ) d τ .

(16)

In such a case IDP

β (t)

may be considered as less beneficial for the player i at least in interval

[t_{0}, t]

. It is important to notice that the notion of IDP-dominance can be applied to imputation distribution procedures, which are not necessarily defined for a unique imputation. As IDP defines how dynamic imputation is to be constructed then it also makes sense to consider the notion of IDP-dominance not only for a fixed imputation.

3.1. Dominance of Imputation Distribution Procedures

In this section we consider the IDP

β (t)

defined by the formula (8). Suppose that the function

V (S; x^{*} (t), T - t)

,

S \subseteq N

is continuously differentiable by

t \in [t_{0}, T]

. Define the function

U (S; x^{*} (t), T - t)

in the following way:

U (S; x^{*} (t), T - t) = - \frac{d}{d t} V (S; x^{*} (t), T - t), t \in [t_{0}, T], S \subseteq N .

(17)

Definition 4.

IDP

β (t)

dominates IDP

\bar{β} (t)

by coalition

S \subseteq N

and at the instant

\bar{t} \in [t_{0}, T]

(denote by

β (t) \overset{S, \bar{t}}{≻} \bar{β} (t)

), if the following inequalities hold:

\begin{matrix} β_{i} (\bar{t}) > {\bar{β}}_{i} (\bar{t}), i \in S, \\ \sum_{i \in S} β_{i} (\bar{t}) \leq U (S; x^{*} (\bar{t}), T - \bar{t}) . \end{matrix}

(18)

Definition 5.

IDP

β (t)

is undominated if at any

\bar{t} \in [t_{0}, T]

there does not exist

\bar{β} (t)

, which dominates

β (t)

by coalition

S \subseteq N

:

\bar{β} (t) \overset{S, \bar{t}}{\neg ≻} β (t), \forall \bar{β} (t), S .

(19)

3.2. IDP-Core

In the paper [23] the authors first introduced and treated a subset of the imputation set in a cooperative differential game which was named subcore. This subset was designed using a set of imputation distribution procedures satisfying the system of inequalities and equalities. This approach is not classical for the theory of differential games since it uses IDP’s for imputations, not vice versa. Based on this subcore notion, we in the paper [24] redefined this notion for the dynamic case, named it IDP-core, and formulated necessary and sufficient conditions of the existence of IDP-core along the cooperative trajectory of the game. In the current paper, we define a solution concept for IDP-core by introducing the notion of IDP dominance and using the time consistency properties or axioms defined above. It is proved that IDP-core has the necessary and sufficient conditions for a dynamic imputation when defined by the system of inequalities introduced in the paper [23,28].

Suppose that players in the game

Γ (x_{0}, T - t_{0})

agreed on the allocation rule for total payoff of grand coalition N (imputation

ξ (x_{0}, T - t_{0})

) using the cooperative solution of IDP-core:

Definition 6.

By the dynamic

I D P - c o r e (x^{*} (t), T - t)

along the cooperative trajectory

x^{*} (t)

,

t \in [t_{0}, T]

(

I D P - c o r e (x_{0}, T - t_{0})

), we call the solution in cooperative differential game

Γ (x^{*} (t), T - t)

(

Γ (x_{0}, T - t_{0})

), which includes all time-consistent imputations generated by undominated IDPs

β (τ)

,

τ \in [t, T]

(8),

t \in [t_{0}, T]

(

β (t)

,

t \in [t_{0}, T]

):

\begin{matrix} I D P - c o r e (x^{*} (t), T - t) = {ξ (x^{*} (t), T - t) = \int_{t}^{T} β (τ) d τ : \\ ξ (x^{*} (t), T - t) and corresponding β (τ) satisfies (10), \\ \exists \bar{β} (t), S, \bar{t} : \bar{β} (t) \overset{S, \bar{t}}{≻} β (t)} . \end{matrix}

(20)

We note that

I D P - c o r e (x^{*} (t), T - t)

includes such imputations from the Core

C (x^{*} (t), T - t)

of cooperative game

Γ (x_{0}, T - t_{0})

for which there exists corresponding undominated IDP and this IDP generates a dynamic imputation belonging

C (x^{*} (t), T - t)

for each

t \in [t_{0}, T]

.

Theorem 3.

Let

C (x^{*} (t), T - t)

be not empty for any

t \in [t_{0}, T)

. Dynamic imputation

ξ (x^{*} (t), T - t)

in cooperative differential game

Γ (x_{0}, T - t_{0})

belongs to the dynamic

I D P - c o r e (x^{*} (t), T - t)

, if and only if for corresponding

β (t) = β (t, ξ (x_{0}, T - t_{0}))

the following conditions are satisfied

\forall t \in [t_{0}, T]

:

\begin{matrix} \sum_{i \in S}^{} β_{i} (t) \geq U (S; x^{*} (t), T - t), \forall S \subset N, \end{matrix}

(21)

\begin{matrix} \sum_{i \in N}^{} β_{i} (t) = U (N; x^{*} (t), T - t) . \end{matrix}

(22)

Proof.

Sufficiency. Let for corresponding IDP

β (t) = β (t, ξ (x_{0}, T - t_{0}))

conditions (21) and (22) hold at any

t \in [t_{0}, T]

. By integrating (21) and (22) in interval we obtain

\begin{matrix} \sum_{i \in S}^{} ξ (x^{*} (t), T - t) \geq V (S; x^{*} (t), T - t), \forall S \subset N, \\ \sum_{i \in N}^{} ξ (x^{*} (t), T - t) = V (N; x^{*} (t), T - t) . \end{matrix}

(23)

This means the imputation

ξ (x_{0}, T - t_{0}) = \int_{t_{0}}^{T} β (t) d t \in C (x^{*} (t_{0}), T - t_{0}) = C (x_{0}, T - t_{0}) .

Let us show that

β (t) = β (t, ξ (x_{0}, T - t_{0}))

is undominated. It is proven by contradiction. Suppose for some

\bar{S} \subset N

there exists

\bar{t} \in [t_{0}, T]

and

\bar{β} (t)

such that

\begin{matrix} ξ (x_{0}, T - t_{0}) = \int_{t_{0}}^{T} \bar{β} (t) d t \in L (x_{0}, T - t_{0}), \\ {\bar{β}}_{i} (\bar{t}) > β_{i} (\bar{t}), i \in \bar{S}, \\ \sum_{i \in \bar{S}}^{} {\bar{β}}_{i} (\bar{t}) \leq U (\bar{S}; x^{*} (\bar{t}), T - \bar{t}) . \end{matrix}

(24)

Therefore

\sum_{i \in \bar{S}}^{} β_{i} (\bar{t}) < U (\bar{S}; x^{*} (\bar{t}), T - \bar{t}) .

This inequality contradicts to (21). Thus

β (t) = β (t, ξ (x_{0}, T - t_{0}))

is undominated on the interval

[t_{0}, T]

. Notice that

β (t)

is undominated on any subinterval

[τ, T]

,

τ \in [t_{0}, T]

, in subgame

Γ (x^{*} (τ), T - τ)

.

For

S = {i}

condition (21) is represented in the form

β_{i} (t) \geq U ({i}; x^{*} (t), T - t), i \in N .

Integrating these inequalities and equality (22) and taking into account (9), (17) we obtain

\begin{matrix} ξ_{i} (x^{*} (t), T - t) = \int_{t}^{T} β_{i} (τ) d τ \geq V ({i}; x^{*} (t), T - t), i \in N, \\ \sum_{i \in N}^{} ξ_{i} (x^{*} (t), T - t) = \sum_{i \in N}^{} \int_{t}^{T} β_{i} (τ) d τ = V (N; x^{*} (t), T - t) . \end{matrix}

(25)

Thus dynamic payoff

ξ (x^{*} (t), T - t) = \int_{t}^{T} β (τ) d τ

is the imputation in current game

Γ (x^{*} (t), T - t)

generated by corresponding IDP

β (t) = β (t, ξ (x_{0}, T - t_{0}))

.

Due to the non-emptiness of

C (x^{*} (t), T - t)

for any

t \in [t_{0}, T]

the IDP

β (t) = β (t, ξ (x_{0}, T - t_{0}))

which satisfies (21) and (22) generates the payoff vector

ξ (x^{*} (t), T - t) = \int_{t}^{T} β (τ) d τ,

which belongs to

C (x^{*} (t), T - t)

. Thus

ξ (x^{*} (t), T - t)

satisfies (10) and therefore imputation

ξ (x_{0}, T - t_{0})

is time-consistent and lies in

I D P - c o r e (x_{0}, T - t_{0})

.

Necessity. Let imputation

ξ (x_{0}, T - t_{0})

in the cooperative differential game

Γ (x_{0}, T - t_{0})

belong to

I D P - c o r e (x_{0}, T - t_{0})

. Therefore by Definition 6 the imputation

ξ (x_{0}, T - t_{0})

generated by undominated IDP

β (t) = β (t, ξ (x_{0}, T - t_{0}))

belongs to

C (x_{0}, T - t_{0})

and is time-consistent. Due to Definition 6 the time-consistency of imputation

ξ (x_{0}, T - t_{0})

means that there exists IDP

β (t)

such that

ξ (x^{*} (t), T - t) = \int_{t}^{T} β (τ) d τ \in C (x^{*} (t), T - t), t \in [t_{0}, T] .

Let us show that this inclusion takes place for the undominated

β (t) = β (t, ξ (x_{0}, T - t_{0}))

corresponding to imputation

ξ (x_{0}, T - t_{0})

. Suppose this is not the fact. Then there exists coalition

\bar{S} \subset N

and some instant

\bar{t} \in [t_{0}, T]

such that the following inequality holds

\sum_{i \in \bar{S}} \int_{\bar{t}}^{T} β (t) d t < V (\bar{S}; x^{*} (\bar{t}), T - \bar{t}) .

(26)

Notice that

β (\bar{t}) = β (\bar{t}, ξ (x_{0}, T - t_{0}))

belongs to the set of undominated imputations in a cooperative game with the characteristic function

U (S; x^{*} (\bar{t}), T - \bar{t})

,

S \subseteq N

. Therefore as follows from Theorem 1 for

\bar{S} \subset N

the following has to be fulfilled

\sum_{i \in \bar{S}} β_{i} (\bar{t}) d t \geq U (\bar{S}; x^{*} (\bar{t}), T - \bar{t}) .

(27)

Integrating this inequality in interval

[\bar{t}, T]

we receive

\sum_{i \in \bar{S}} \int_{\bar{t}}^{T} β_{i} (t) d t \geq V (\bar{S}; x^{*} (\bar{t}), T - \bar{t}),

(28)

which contradicts (27). Thus the Theorem is proved. □

Proposition 1.

If

C (x^{*} (t), T - t)

is empty for some

t = \bar{t} \in [t_{0}, T]

, then

I D P - c o r e (x^{*} (t_{0}), T - t_{0})

is empty.

Proof.

Suppose we can find the time-consistent imputation

ξ (x_{0}, T - t_{0}) \in I D P - c o r e (x^{*} (t_{0}), T - t_{0})

and corresponding undominated IDP

β (t)

, which satisfies (21) and (22). Integrating inequalities (21) and (22) we obtain

\begin{matrix} \sum_{i \in S}^{} \int_{\bar{t}}^{T} β_{i} (τ) d τ \geq V (S; x^{*} (\bar{t}), T - \bar{t}), S \subset N, \\ \sum_{i \in N}^{} \int_{\bar{t}}^{T} β_{i} (τ) d τ = V (N; x^{*} (\bar{t}), T - \bar{t}) . \end{matrix}

(29)

Thus we receive

ξ (x^{*} (\bar{t}), T - \bar{t}) = \int_{\bar{t}}^{T} β (τ) d τ \in C (x^{*} (\bar{t}), T - \bar{t}) .

(30)

It contradicts with emptiness of

C (x^{*} (\bar{t}), T - \bar{t})

. Therefore,

I D P - c o r e (x^{*} (t_{0}), T - t_{0})

is empty. The Proposition is proved. □

Proposition 2.

If

C (x^{*} (t), T - t)

is not empty for any

t \in [t_{0}, T]

then

I D P - c o r e (x^{*} (t_{0}), T - t_{0}) = C (x^{*} (t_{0}), T - t_{0})

.

Proof.

Consider an imputation

ξ (x_{0}, T - t_{0}) \in C (x^{*} (t_{0}), T - t_{0})

that does not belong to

I D P - c o r e (x^{*} (t_{0}), T - t_{0})

. That is

ξ (x_{0}, T - t_{0})

belongs to

C (x^{*} (t_{0}), T - t_{0})

, but is time inconsistent. According to the time consistency imputation definition

ξ (x_{0}, T - t_{0})

is time inconsistent if there does not exist IDP

β (t) = β (t, ξ (x_{0}, T - t_{0}))

such that at any

t \in [t_{0}, T]

dynamic imputation

ξ (x^{*} (t), T - t)

generated by this IDP belongs to core

C (x^{*} (t), T - t)

.

But as follows from Theorem 3, if the corresponding IDP

β (t) = β (t, ξ (x_{0}, T - t_{0}))

satisfies conditions (21) and (22), then

ξ (x_{0}, T - t_{0})

belongs to

I D P - c o r e (x_{0}, T - t_{0})

. That is

ξ (x_{0}, T - t_{0})

is time-consistent by Definition 6, that is the following inclusion holds for any

t \in [t_{0}, T]

ξ (x^{*} (t), T - t) = \int_{t}^{T} β (τ) d τ \in C (x^{*} (t), T - t) .

The Proposition is proved. □

Remark 1.

Proposition 2 states that if the Core is not empty, then IDP-core and Core coincide in the current game or equivalently imputations from the Core coincide with the imputations from the IDP-core. The system of inequalities (21) and (22) allows extracting the subset from the set of imputation distribution procedures which provides time-consistency and IDP-nondominance of imputations from the Core. For other subsets of the IDP set, this kind of result generally speaking is not true. Note also that the set of imputation distribution procedures (21) and (22) can be empty. In the next section consider the approach to check nonemptiness and construction of IDP’s from the IDP-core.

Suppose that the characteristic function

V (S; x^{*} (t), T - t)

,

t \in [t_{0}, T]

is defined in some relevant way (for example, as in [25]). Suppose that it is a strictly monotonically decreasing function for any

t \in [t_{0}, T]

, decreasing faster than the linear law. Construct the Core

C (x_{0}, T - t_{0})

for the initial instant

t = t_{0}

. Afterwards choose imputation from the Core

ξ (x_{0}, T - t_{0}) \in C (x_{0}, T - t_{0})

. According to the Definition 2 we choose the corresponding imputation distribution procedure

β (t) = β (t, ξ (x_{0}, T - t_{0}))

(IDP) for

ξ (x_{0}, T - t_{0})

in some relevant way. As it follows from Proposition 2 for the imputations

ξ (x_{0}, T - t_{0}) \in I D P - c o r e (x^{*} (t_{0}), T - t_{0}) = C (x^{*} (t_{0}), T - t_{0})

we can always find IDP

β^{'} (t)

satisfying the conditions (21) and (22) such that:

\int_{t_{0}}^{T} β^{'} (t) d t = ξ (x_{0}, T - t_{0}) .

On the other hand

β (t) = β (t, ξ (x_{0}, T - t_{0}))

could not satisfy conditions (21) and (22). It could be the case if

β (t) = β (t, ξ (x_{0}, T - t_{0}))

is chosen in the following way

\begin{matrix} β (t) = c = c o n s t, t \in [t_{0}, t^{'}), (t^{'} \in (t_{0}, T)), \\ β (t) = \frac{ξ (x_{0}, T - t_{0}) - c (t^{'} - t_{0})}{T - t^{'}}, t \in [t^{'}, T] \end{matrix}

and the derivative of

V (S; x^{*} (t), T - t)

U (S; x^{*} (t), T - t) > 0, t \in [t_{0}, T] .

This case will be demonstrated on the model example in Section 5.

Proposition 3.

If

C (x^{*} (t), T - t)

is not empty for any

t \in [t_{0}, T]

, then all imputations of

C (x^{*} (t_{0}), T - t_{0})

are time-consistent.

Proof.

Consider am imputation

ξ (x_{0}, T - t_{0})

from Core

C (x^{*} (t_{0}), T - t_{0})

. According to the Proposition 2

I D P - c o r e (x^{*} (t_{0}), T - t_{0}) = C (x^{*} (t_{0}), T - t_{0})

and therefore

ξ (x_{0}, T - t_{0}) \in I D P - c o r e (x^{*} (t_{0}), T - t_{0})

. From the definition of the IDP-core it follows that

ξ (x_{0}, T - t_{0})

is time-consistent. The Proposition is proved. □

4. Application of Linear Programming Methods for Nonemptiness Properties

In this section, we consider the linear programming problem described in Section 2.4, for the non-emptiness properties of Core. IDP-core can be constructed using a system of linear constraints for the imputation distribution procedures. These constraints are defined for each instant in the game. From the nonemptiness of the set described by these constraints, it follows that the IDP-core is not empty.

Consider the following linear programming problem for a fixed t:

\begin{matrix} \sum_{i \in N} β_{i} ⟶ m i n \\ \sum_{i \in S} β_{i} \geq U (S; x^{*} (t), T - t), \forall S \subseteq N, S \neq N, S \neq ⌀ . \end{matrix}

(31)

Suppose that

β_{i}^{0} = (β_{1}^{0}, \dots, β_{n}^{0})

is an optimal solution of linear programming problem (31) with fixed t. The set of optimal solutions of problem (31) we denote by

Y^{0}

.

Then the following theorem is true:

Theorem 4.

The set of IDPs satisfying the conditions (25),

t \in [t_{0}, T)

is not empty, if and only if

\forall t \in [t_{0}, T)

the following condition is satisfied:

\sum_{i \in N} β_{i}^{0} \leq U (N; x^{*} (t), T - t),

(32)

where

β^{0} \in Y^{0}

is any solution of the linear programming problem (31).

Proof.

Start the proof with the sufficient condition. Suppose that the condition (32) is satisfied, then according to (31) for any

t \in [t_{0}, T]

there exists

{\hat{β}}^{0}

such that for

β_{i} = β_{i}^{0} + \frac{U (N; x^{*} (t), T - t)}{n} - \frac{\sum_{i \in N} β_{i}^{0}}{n}, i \in N

(33)

conditions (21) and (22) are satisfied for any fixed

t \in [t_{0}, T]

. If it is true, then we can compose the integrable function

{\hat{β}}^{0} (t)

as a function of time, for which the conditions (21) and (22) will be satisfied.

Proof of the necessity condition. If the IDP-core is not empty, then there exists at least one integrable function

β (t)

satisfying the conditions (21) and (22). As a result for the solution of (31) condition (32) should be satisfied. □

5. Differential Game Model of Resource Extraction

Consider a game-theoretical model of non-renewable resource extraction with asymmetric players [29,30]. The amount of resource depends on the rates of extraction which are chosen by the players. The game involves n asymmetric players, with utility functions depending on the current amount of resource and rates of extraction.

Denote by

x (t) \in R^{1}

the amount of resource at instant t and by

u_{i} (t, x)

resource extraction rate chosen by player i at instant t. As a class of strategies we will consider a class of feedback strategies, where the strategies are the functions of time t and state x. We assume that

\forall t

,

u_{i} (t, x) \geq 0

, and

x (t) = 0

implies

u_{i} (t, x) = 0

. The amount of the resource

x (t)

as a function of t depends in the following way on

u_{i} (t, x)

:

\begin{matrix} \dot{x} = - \sum_{i = 1}^{n} a_{i} u_{i} (t, x), a_{i} > 0, i = 1, \dots, n . \\ x (t_{0}) = x_{0} . \end{matrix}

(34)

Payoff function representing the income of player i:

K_{i} (x_{0}, T - t_{0}) = \int_{t_{0}}^{T} log (u_{i} (τ, x)) d τ, i = 1, \dots, n .

(35)

5.1. Cooperative Strategies and Cooperative Trajectory

Consider the cooperative version of a non-renewable resource extraction game [30]. Here, players unite in a grand coalition and maximize total utility, acting as one player. The corresponding optimal control problem is formalized in the following way:

\sum_{i = 1}^{n} K_{i} (x_{0}, T - t_{0}) = \sum_{i = 1}^{n} \int_{t_{0}}^{T} log (u_{i} (τ, x)) d τ \to max_{u_{i}, i = \bar{1, n}}

(36)

subject to

\begin{matrix} \dot{x} = - \sum_{i = 1}^{n} a_{i} u_{i} (t, x), \\ x (t_{0}) = x_{0} > 0, \\ u (t, x) \geq 0 . \end{matrix}

(37)

To solve the optimization problem (36), (37), we use the dynamic programming principle proposed by Bellman. To do this we define the Bellman function as the maximum value of the total payoff of players (35) in the subgame

Γ (x, T - t)

starting at the instant t in the position x:

W (t, x) = max_{u_{i}, i = \bar{1, n}} \{\sum_{i = 1}^{n} K_{i} (x, T - t)\} = max_{u_{i}, i = \bar{1, n}} \{\sum_{i = 1}^{n} \int_{t}^{T} log u (τ, x) d τ\}

(38)

subject to Equation (37), when

x_{0} = x

and

t_{0} = t

.

It is proved that if there exists a continuously differentiable function

W (t, x)

that satisfies the Hamilton-Jacobi-Bellman equation

\begin{matrix} - W_{t} (t, x) = max_{u_{i}, i = \bar{1, n}} \{\sum_{i = 1}^{n} log u_{i} (t, x) - W_{x} (t, x) (\sum_{i = 1}^{n} a_{i} u_{i} (t, x))\}, \\ lim_{t \to T - 0} W (t, x) = 0, \end{matrix}

(39)

then strategies

u_{i}^{*} (t, x)

defined by maximizing the right hand side (39) deliver the maximum to the functional in the optimization problem (36), (37).

From the first order extremum condition of (39), we obtain:

u_{i}^{*} = \frac{1}{a_{i} W_{x} (t, x)},

then substituting to (39):

\begin{matrix} W_{t} (t, x) = n log W_{x} (t, x) + log A^{[N]} + n, A^{[N]} = \prod_{i = 1}^{n} a_{i} \\ lim_{t \to T - 0} W (t, x) = 0 . \end{matrix}

(40)

We will consider a Bellman function as a function of the form:

W (t, x) = A (t) log x + B (t),

then, by substituting in (40), we obtain:

\begin{matrix} \dot{A} log x + \dot{B} = n log A - n log x + log A^{[N]} + n, \\ lim_{t \to T - 0} A (t) = lim_{t \to T - 0} B (t) = 0 . \end{matrix}

(41)

The solution of (41) are the functions:

\begin{matrix} A (t) & = n (T - t), \\ B (t) & = - (T - t) (log A^{[N]} + n log n (T - t)) . \end{matrix}

(42)

By substituting

A (t)

and

B (t)

into the Bellman function we obtain:

W (t, x) = n (T - t) log \frac{x}{n (T - t)} - (T - t) log A^{[N]}, t \in [t_{0}, T) .

(43)

The corresponding form of optimal control or cooperative strategy:

u_{i}^{*} (t, x) = \frac{1}{a_{i} W_{x} (t, x)} = \frac{x}{a_{i} n (T - t)}, t \in [t_{0}, T) .

(44)

Substituting the optimal control into the motion Equation (37), we obtain the differential equation for the trajectory corresponding to the optimal control:

\begin{matrix} \dot{x} = - \frac{x}{T - t}, \\ x (t_{0}) = x_{0} . \end{matrix}

(45)

The solution has the form:

x^{*} (t) = x_{0} \frac{T - t}{T - t_{0}}, t \in [t_{0}, T) .

(46)

Trajectory

x^{*} (t)

and strategy (control)

u^{*} (t, x)

we will call cooperative.

In order to determine the value of players’ maximum total payoff that corresponds to the optimization problem (36), (37) in the subgame along the cooperative trajectory

x^{*} (t)

(46), it is necessary to substitute the expression for the cooperative trajectory by the expression for the Bellman function (43):

W (t, x^{*} (t)) = n (T - t) log \frac{x_{0}}{n (T - t_{0})} - (T - t) log A^{[N]}, t \in [t_{0}, T) .

(47)

5.2. Characteristic Function

To construct the rule for allocating the maximum joint payoff among players, it is necessary to define the characteristic function for each coalition

S \subseteq N

:

V (S; x, T - t) = \{\begin{matrix} \sum_{i = 1}^{n} K_{i} (x, T - t), & S = N \\ W_{S} (t, x), & S \subset N, \\ 0, & S = \emptyset, \end{matrix}

(48)

where

W_{S} (t, x)

is defined as the maximum joint payoff of coalition S given that the players from coalition

N ∖ S

use strategies from a fixed Nash equilibrium

u^{N E} = (u_{1}^{N E}, \dots, u_{n}^{N E})

in the initial game.

It can be shown that in the case of a non-cooperative game, Nash equilibrium strategies are

u_{i}^{N E} (t, x) = \frac{x}{a_{i} (T - t)}, i \in N .

(49)

Consider a case of coalition

S \subset N

. We introduce the Bellman function

W_{S} (t, x)

, as the maximum total payoff of players from coalition S in the subgame

Γ (x, T - t)

starting at the instant t in the position x:

\begin{matrix} W_{S} (t, x) & = max_{u_{i}, i \in S} \sum_{i \in S} \{\int_{t}^{T} log u_{i} d τ\} \end{matrix}

(50)

\begin{matrix} subject to \dot{x} (τ) & = - \sum_{i \in N} a_{i} u_{i} \end{matrix}

(51)

\begin{matrix} u_{i} & = u_{i}^{N E}, i \in N ∖ S . \end{matrix}

(52)

The Hamilton-Jacobi-Bellman equation for this problem has the form:

\begin{matrix} - \frac{\partial W_{S} (t, x)}{\partial t} = max_{u_{i}, i \in S} \{\sum_{i \in S} log u_{i} (t, x) - \frac{\partial W_{S} (t, x)}{\partial x} (\sum_{j = 1}^{n} a_{j} u_{j} (t, x))\}, \\ lim_{t \to T - 0} W_{S} (t, x) = 0 . \end{matrix}

(53)

From the first order extremum condition for (53) we obtain

u_{i}^{*} = \frac{1}{a_{i} \frac{\partial W_{S} (t, x)}{\partial x}},

(54)

substitute in (53):

\begin{matrix} \frac{\partial W_{S}}{\partial t} = k log \frac{\partial W_{S}}{\partial x} + log A^{[S]} + k + \frac{\partial W_{S}}{\partial x} \sum_{j \in N ∖ S} \frac{x}{T - t}, A^{[S]} = \prod_{i = 1}^{n} a_{i} \\ lim_{t \to T - 0} W_{S} (t, x) = 0, \end{matrix}

(55)

where

k = | S |

,

n = | N |

. Consider the following form of the Bellman function:

W_{S} (t, x) = A (t) log x + B (t),

then by substituting in (55) we obtain:

\begin{matrix} \dot{A} log x + \dot{B} = k log A - k log x + log A^{[S]} + k + (n - k) \frac{A}{T - t}, \\ lim_{t \to T - 0} A (t) = lim_{t \to T - 0} B (t) = 0 . \end{matrix}

(56)

The solution of (56) are the functions:

\begin{matrix} A (t) & = k (T - t), \\ B (t) & = - k (T - t) (\frac{log A^{[S]}}{k} + log k (T - t) + n - k) . \end{matrix}

(57)

Solution of the optimization problem (50):

W_{S} (t, x) = k (T - t) [log \frac{x}{T - t} - log k - \frac{log A^{[S]}}{k} - n + k] .

(58)

According to the definition, we obtain the characteristic function for the coalition

S \neq N

:

V (S, x, T - t) = W_{S} (t, x) .

In order to determine the way to allocate the maximum joint payoff of players (47) among them along the cooperative trajectory

x^{*} (t)

(46), namely, for the subgame starting at the instant t on the cooperative trajectory

x^{*} (t)

(46) it is necessary to define the characteristic function along the cooperative trajectory. Let us substitute the expression for

x^{*} (t)

(46) into the expression for characteristic function

V (S, T - t, x)

,

S \subset N

(58).

W_{S} (t, x^{*} (t)) = k (T - t) [log \frac{x_{0}}{T - t_{0}} - log k - \frac{log A^{[S]}}{k} - n + k] .

(59)

For the case when

S = N

, the characteristic function is calculated in accordance with (47).

5.3. IDP-Core

Suppose that all players unite in grand coalition N, then they can guarantee themselves joint payoff equal to

V (N; x^{*} (t), T - t)

. In order to determine how to allocate the maximum joint payoff among players, we use the notion of imputations

ξ (x, T - t)

. In particular, we will use IDP-core as a cooperative solution in the game. According to Theorem 3, IDP-core can be constructed using the conditions for IDP’s

β_{i} (t)

,

i \in N

:

\begin{matrix} \sum_{i \in S}^{} β_{i} (t) \geq - k t [log \frac{x_{0}}{T - t_{0}} - log k - \frac{log A^{[S]}}{k} - n + k], \forall S \subset N, \\ \sum_{i \in N}^{} β_{i} (t) = - n t log \frac{x_{0}}{n (T - t_{0})} - (T - t) log A^{[N]}, \forall t \in [t_{0}, T] . \end{matrix}

(60)

5.4. Non-Emptiness of IDP-Core

In order to study non-emptiness conditions we solve the linear programming problem, as presented in the paper [24], for

t \in [t_{0}, T]

with a fixed step

Δ t

. As a result, the vector function

β^{0} = (β_{1}^{0}, \dots, β_{n}^{0})

is obtained using the numerical methods and corresponding conditions are to be verified in order for the IDP-core to be non-empty:

\sum_{i \in N} β_{i}^{0} \leq U (N; x^{*} (t), T - t) .

(61)

We construct IDP

{\hat{β}}^{0} (t)

using

β^{0} (t)

and show that it satisfies the conditions (25):

{\hat{β}}_{i}^{0} (t) = β_{i}^{0} (t) + \frac{U (N; x^{*} (t), T - t) - \sum_{i \in N} β_{i}^{0} (t)}{n} .

(62)

5.5. Core and IDP-Core

According to the Theorem 3, the imputation that corresponds to the IDP

{\hat{β}}^{0} (t)

ξ (x_{0}, T - t_{0}) = \int_{t_{0}}^{T} {\hat{β}}^{0} (t) d t

(63)

belongs to the Core

C (x_{0}, T - t_{0})

because, for given parameters IDP

{\hat{β}}^{0} (t)

,

t \in [t_{0}, T]

satisfies conditions (21) and (22) or

ξ (x_{0}, T - t_{0})

belongs to

I D P - c o r e (x_{0}, T - t_{0})

. But if we use the Core

C (x_{0}, T - t_{0})

instead of

I D P - c o r e (x_{0}, T - t_{0})

as a cooperative solution in the game, then we can use any IDP for the imputation (63), such as

\begin{matrix} β (t) = c = c o n s t, t \in [t_{0}, t^{'}), (t^{'} \in (t_{0}, T)), \\ β (t) = \frac{ξ (x_{0}, T - t_{0}) - c (t^{'} - t_{0})}{T - t^{'}}, t \in [t^{'}, T], \end{matrix}

(64)

but this does not necessarily satisfy conditions (21) and (22) at some instant and therefore it appears not to be undominated and corresponding to this IDP imputation

ξ (x_{0}, T - t_{0})

is time-inconsistent.

On Figure 1 the set defined by the system of constrains (25) shown, the solid line is the solution

β^{0} (t)

of corresponding linear programming problem (31) as a function of time, the dashed line is IDP

{\hat{β}}^{0} (t)

and IDP

β (t)

(64) corresponding to the imputation (63).

Function

{\hat{β}}_{i}^{0} (t)

satisfies the constrains (25). It can be seen that the IDP-core in this game model is not empty and conditions (32) of Theorem 4 are satisfied.

Using Figure 2 it is possible to verify the non-emptiness conditions (32) of Theorem 6, the solid line shows the sum of values

β_{i}^{0} (t)

,

i = \bar{1, 3}

:

S_{β^{0}} (t) = β_{1}^{0} (t) + β_{2}^{0} (t) + β_{3}^{0} (t),

(65)

the dashed line in the Figure 2 shows the value of characteristic function for a grand coalition

U (N; x^{*} (t), T - t) = U ({1, 2, 3}; x^{*} (t), T - t),

where

U ({1, 2, 3}; x^{*} (t), T - t)

is defined in (17). In the Figure 2 it can be seen that

S_{β^{0}} (t) \leq U ({1, 2, 3}; x^{*} (t), T - t) \forall t \in [t_{0}, T] .

6. Conclusions

This paper examines a new approach for defining a cooperative solution for differential games. Our approach uses the time consistency property as a basic axiom for constructing the cooperative solution. It is important to notice that the further use of the time consistency property as the axiom for the theory of dynamic cooperative games, the theory of social choice, and mechanism design is promising. The approach also defines the notion of IDP-dominance, which allows for selecting undominated imputation distribution procedures. Properties of time consistency and IDP-dominance are the key properties for constructing a new cooperative solution, namely IDP-core. The necessary and sufficient conditions for the IDP-core defining geometric properties of this solution are presented. It is also proved that the set of imputations that corresponds to the Core and to IDP-core coincides, but, as the simulation demonstrates, the IDPs that would be naturally proposed for use sometimes might not appear to be undominated and therefore lead to the time inconsistency of the corresponding imputations they generate.

Author Contributions

Methodology and formal analysis, O.P.; supervision, V.Z. All authors have read and agreed to the published version of the manuscript.

Funding

Research was supported by a grant from the Russian Science Foundation (Project No 18-71-00081).

Acknowledgments

Great thanks to Sergei Pogozhev who helped in preparing the Matlab project for depicting complex Figures in the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Gillies, D.B. Some Theorems on n Person Games. Ph.D. Thesis, Princeton University, Princeton, NJ, USA, 1953. [Google Scholar]
Edgeworth, F.Y. Mathematical Physics; Kegan Paul: London, UK, 1881. [Google Scholar]
Scarf, H.E. The core of an n person game. Economica 1967, 35, 50–69. [Google Scholar] [CrossRef]
Billera, L.J. Some theorems on the core of n person game. Siam J. Appl. Math. 1970, 18, 567–579. [Google Scholar] [CrossRef]
Shapley, L.S. On balanced games without side payments. Math. Program. 1972, 261–290. [Google Scholar] [CrossRef]
Bondareva, O.N. Some applications of linear programming methods to the theory of cooperative games. Probl. Cybern. 1963, 10, 119–140. (In Russian) [Google Scholar]
Shapley, L.S. On balanced sets and cores. Nav. Res. Logist. Q. 1967, 14, 453–460. [Google Scholar] [CrossRef]
Zakharov, V.; Kwon, O.-H. Linear programming approach in cooperative games. J. Korean Math. Soc. 1997, 34, 423–435. [Google Scholar]
Zakharov, V.; Dementieva, M. Multistage Cooperative Games and Problem of Time Consistency. Int. Game Theory Rev. 2004, 6, 157–170. [Google Scholar] [CrossRef]
Zakharov, V.; Akimova, A. Geometric Properties of the Core, Subcore, Nucleolus. Game Theory Appl. 2002, 8, 279–289. [Google Scholar]
Kleimenov, A.F. To the Cooperative Theory of Non-Coalition Positional Games; Reports of the USSR Academy of Sciences; USSR Academy of Sciences: Moscow, Russia, 1990. [Google Scholar]
Kleimenov, A.F. Cooperative solutions in the position differential game of many individuals with continuous payment functions. Appl. Math. Mech. 1990, 54, 389–394. [Google Scholar]
Isaacs, R. Differential Games; John Wiley and Sons: New York, NY, USA, 1965. [Google Scholar]
Yeung, D.; Petrosyan, L. Subgame Consistent Economic Optimization: An Advanced Cooperative Dynamic Game Analysis; Springer: New York, NY, USA, 2012. [Google Scholar]
Von Neumann, J.; Morgenstern, O. Theory of Games and Economic Behavior; Princeton University Press: Princeton, NJ, USA, 1970. [Google Scholar]
Petrosyan, L. Time-consistency of solutions in multi-player differential games. Astronomy 1977, 4, 46–52. [Google Scholar]
Petrosyan, L.A.; Danilov, N.N. Stability of solutions in non-zero sum differential games with transferable payoffs. Astronomy 1979, 1, 52–59. [Google Scholar]
Petrosyan, L. Strongly time consistent differential optimality principles. Astronomy 1993, 26, 40–46. [Google Scholar]
Gao, H.; Petrosyan, L.; Qiao, H.; Sedakov, A. Cooperation in two-stage games on undirected networks. J. Syst. Sci. Complex. 2017, 30, 680–693. [Google Scholar] [CrossRef]
Petrosyan, L.A.; Danilov, N.N. Cooperative Differential Games and Their Applications; Publishing House of Tomsk University: Tomsk, Russia, 1985. [Google Scholar]
Parilina, E.; Zaccour, G. Node-Consistent Shapley Value for Games Played over Event Trees with Random Terminal Time. J. Optim. Theory Appl. 2017, 175, 236–254. [Google Scholar] [CrossRef]
Parilina, E.; Zaccour, G. Node-consistent core for games played over event trees. Automatica 2015, 53, 304–311. [Google Scholar] [CrossRef]
Petrosian, O.L.; Gromova, E.V.; Pogozhev, S.V. Strong time-consistent subset of core in cooperative differential games with finite time horizon. Autom. Remote Control 2018, 79, 1912–1928. [Google Scholar] [CrossRef]
Wolf, D.A.; Zakharov, V.V.; Petrosian, O.L. On the existence of IDP-core in cooperative differential games. Math. Theory Games Appl. 2017, 9, 18–38. [Google Scholar]
Gromova, E.V.; Petrosyan, L.A. Strongly dynamically stable cooperative solution in one differential game of harmful emissions management. Manag. Large Syst. 2015, 55, 140–159. (In Russian) [Google Scholar]
Vorob’ev, N.N. Game Theory; Lectures for Economists and Systems Scientists; Springer: New York, NY, USA, 1977. [Google Scholar]
Owen, G. Game Theory; Academic Press: New York, NY, USA, 1982. [Google Scholar]
Petrosian, O.L.; Gromova, E.V.; Pogozhev, S.V. Strong time-consistent subset of core in cooperative differential games with finite time horizon. Math. Theory Games Appl. 2016, 8, 79–106. [Google Scholar] [CrossRef]
Breton, M.; Zaccour, G.; Zahaf, M. A differential game of joint implementation of environmental projects. Automatica 2005, 41, 1737–1749. [Google Scholar] [CrossRef]
Dockner, E.; Jorgensen, S.; van Long, N.; Sorger, G. Differential Games in Economics and Management Science; Cambridge University Press: Cambridge, UK, 2001. [Google Scholar]

Figure 1. Axes:

β_{1}

,

β_{3}

, t.

β_{2}

can be found using the equality in (25).

Figure 1. Axes:

β_{1}

,

β_{3}

, t.

β_{2}

can be found using the equality in (25).

Figure 2.

U ({1, 2, 3}; x^{*} (t), T - t)

(17) is a dashed line,

S_{β^{0}} (t)

(65) is a solid line.

Figure 2.

U ({1, 2, 3}; x^{*} (t), T - t)

(17) is a dashed line,

S_{β^{0}} (t)

(65) is a solid line.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Petrosian, O.; Zakharov, V. IDP-Core: Novel Cooperative Solution for Differential Games. Mathematics 2020, 8, 721. https://doi.org/10.3390/math8050721

AMA Style

Petrosian O, Zakharov V. IDP-Core: Novel Cooperative Solution for Differential Games. Mathematics. 2020; 8(5):721. https://doi.org/10.3390/math8050721

Chicago/Turabian Style

Petrosian, Ovanes, and Victor Zakharov. 2020. "IDP-Core: Novel Cooperative Solution for Differential Games" Mathematics 8, no. 5: 721. https://doi.org/10.3390/math8050721

APA Style

Petrosian, O., & Zakharov, V. (2020). IDP-Core: Novel Cooperative Solution for Differential Games. Mathematics, 8(5), 721. https://doi.org/10.3390/math8050721

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

IDP-Core: Novel Cooperative Solution for Differential Games

Abstract

1. Introduction

2. Problem Statement and Preliminary Information

2.1. Differential Game Model

2.2. Cooperative Differential Game Model

2.3. Core

2.4. Non-Emptiness of Core in Static Games

2.5. Time-Consistency of Cooperative Solution and Imputation Distribution Procedure

3. IDP-Core and Dominance of Imputation Distribution Procedures

3.1. Dominance of Imputation Distribution Procedures

3.2. IDP-Core

4. Application of Linear Programming Methods for Nonemptiness Properties

5. Differential Game Model of Resource Extraction

5.1. Cooperative Strategies and Cooperative Trajectory

5.2. Characteristic Function

5.3. IDP-Core

5.4. Non-Emptiness of IDP-Core

5.5. Core and IDP-Core

6. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI