Article

Vulnerability and Defence: A Case for Stackelberg Game Dynamics

1 School of Computer and Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
2 Defence Science and Technology Group, P.O. Box 1500, Edinburgh, SA 5111, Australia
* Author to whom correspondence should be addressed.
Games 2024, 15(5), 32; https://doi.org/10.3390/g15050032
Submission received: 27 July 2024 / Revised: 13 September 2024 / Accepted: 14 September 2024 / Published: 18 September 2024

Abstract

This paper examines the tactical interaction between drones and tanks in modern warfare through game theory, particularly focusing on Stackelberg equilibrium and backward induction. It describes a high-stakes conflict between two teams: one using advanced drones for attack, and the other defending using tanks. The paper conceptualizes this as a sequential game, illustrating the complex strategic dynamics similar to Stackelberg competition, where moves and countermoves are carefully analyzed and predicted.

1. Introduction

More than a century before John Nash formalized the concept of equilibrium in game theory [1,2,3], Antoine Cournot [4] had already introduced a similar idea through his duopoly model, which became a cornerstone in the study of industrial organization [5]. In economics, an oligopoly refers to a market structure in which a small number of firms ($n \ge 2$) supply a particular product. A duopoly, a specific case where $n = 2$, is the scenario to which Cournot's model applies. In this model, two firms simultaneously produce and sell a homogeneous product. Cournot identified an equilibrium quantity for each firm, where the optimal strategy for each participant is to follow a specific rule if the other firm adheres to it. This idea of equilibrium in a duopoly anticipated Nash's more general concept of equilibrium points in non-cooperative games.
In 1934, Heinrich von Stackelberg [6,7] introduced a dynamic extension to Cournot’s model by allowing for sequential moves rather than simultaneous ones. In the Stackelberg model, one firm, the leader, moves first, while the second, the follower, reacts accordingly. A well-known example of such strategic behavior is General Motors’ leadership in the early U.S. automobile industry, with Ford and Chrysler often acting as followers.
The Stackelberg equilibrium, derived through backward induction, represents the optimal outcome in these sequential-move games. This equilibrium is often considered more robust than Nash equilibrium (NE) in such settings, as sequential games can feature multiple NEs, but only one corresponds to the backward-induction outcome [1,2,3].

2. Related Work

Stackelberg games have been significantly influential in security and military research applications [8,9,10,11,12,13,14,15,16,17,18,19]. These games, based on the Stackelberg competition model, have been successfully applied in a wide range of real-world scenarios. They are particularly notable for their deployment in contexts where security decisions are critical, such as in protecting infrastructure and managing military operations.
The sequential setup of Stackelberg games is particularly relevant in military contexts where strategic decisions often involve anticipating and responding to an adversary’s actions. The applications of such games in military settings are diverse, ranging from optimizing resource allocation for defence to strategizing offensive maneuvers.
This paper considers the strategic interplay between drones and tanks through the lens of the Stackelberg equilibrium and the principles of backwards induction. In a military operation setting, we consider two types of agents, namely, the attacker (Red team) and the defender (Blue team). The attacker might utilize mobile threats to attack or reduce the number of static, immobile assets belonging to the defender. In response, the defender will employ countermeasures to reduce the number of enemy attackers. This complex pattern of strategic moves and countermoves is explored as a sequential game, drawing on the concept of Stackelberg competition to illuminate the dynamics at play.
While focusing on developing a game-theoretical analysis, we have presented a hypothetical strategic scenario involving tanks and drones to illustrate our point. Naturally, this scenario may only loosely reflect the realities of such encounters, which evolve rapidly and are subject to constant change.
This paper's contribution consists of obtaining an analytical solution to a Stackelberg competition in a military setting. To obtain such a solution, we limit the number of available strategic moves to a small set that is nevertheless sufficient to demonstrate the dynamics of a sequential strategic military operation.

3. The Game Definition

We consider a scenario in which two teams, Blue ( B ) and Red ( R ), are engaged in a military operation. The B team comprises ground units, specifically tanks, while the R team operates aerial units, namely drones. The strategy involves the R team’s drones targeting the B team’s tanks. Meanwhile, the B team not only has the capability to shoot down these drones but also provides defensive cover for their tanks, creating a complex interplay of offensive and defensive maneuvers in this combat scenario.
We assume that the B team consists of $n$ tanks, where $n \in \mathbb{N}$, and represent the set of tanks as $T = \{T_1, T_2, \ldots, T_n\}$. Let $S = \{S_1, S_2, \ldots, S_m\}$ be a set of resources that are at the disposal of the B team to protect the tanks. It is assumed that the R team's pure strategy is to attack one of the tanks from the set $T$. The R team's mixed strategy is then defined as a vector $A_T$, where $A_T$ is the probability of attacking the tank $T$ and $\sum_{T=1}^{n} A_T = 1$. The B team's mixed strategy is also a vector $D_T$, where $D_T$ is the marginal probability of protecting the tank $T$. Note that a marginal probability is obtained by summing (or integrating) over the distribution of the variables that are being disregarded, and these disregarded variables are said to have been marginalized out.

3.1. Marginal Probability of Protection for Tanks

We consider the case when there are five resources from the set $\{S_1, S_2, \ldots, S_5\}$ available to protect the four tanks from $T = \{T_1, T_2, \ldots, T_4\}$, where one or more of the resources can be used to protect a tank $T_i$ from the set $T$. For this case, the marginal probabilities $D_T$ to protect the tank $T$ are determined as follows:
$$\begin{array}{c|ccccc|c}
 & S_1 & S_2 & S_3 & S_4 & S_5 & D_T \\ \hline
T_1 & \Pr(T_1,S_1) & \Pr(T_1,S_2) & \Pr(T_1,S_3) & \Pr(T_1,S_4) & \Pr(T_1,S_5) & D_1 = \frac{1}{5}\sum_{i=1}^{5}\Pr(T_1,S_i) \\
T_2 & \Pr(T_2,S_1) & \Pr(T_2,S_2) & \Pr(T_2,S_3) & \Pr(T_2,S_4) & \Pr(T_2,S_5) & D_2 = \frac{1}{5}\sum_{i=1}^{5}\Pr(T_2,S_i) \\
T_3 & \Pr(T_3,S_1) & \Pr(T_3,S_2) & \Pr(T_3,S_3) & \Pr(T_3,S_4) & \Pr(T_3,S_5) & D_3 = \frac{1}{5}\sum_{i=1}^{5}\Pr(T_3,S_i) \\
T_4 & \Pr(T_4,S_1) & \Pr(T_4,S_2) & \Pr(T_4,S_3) & \Pr(T_4,S_4) & \Pr(T_4,S_5) & D_4 = \frac{1}{5}\sum_{i=1}^{5}\Pr(T_4,S_i)
\end{array} \quad (1)$$
with rows indexed by the tanks and columns by the resources,
where
$$\sum_{i=1}^{4}\Pr(T_i,S_1) = \sum_{i=1}^{4}\Pr(T_i,S_2) = \sum_{i=1}^{4}\Pr(T_i,S_3) = \sum_{i=1}^{4}\Pr(T_i,S_4) = \sum_{i=1}^{4}\Pr(T_i,S_5) = 1. \quad (2)$$
For instance, $\Pr(T_3, S_2)$ is the probability that the resource $S_2$ is used to give protection to the tank $T_3$. This means $D_i \le 1$ and $\sum_{i=1}^{4} D_i = 1$. In case the set of resources consists of $m$ elements, i.e., $S = \{S_1, S_2, \ldots, S_m\}$, and the number of tanks to be protected is $n$, the marginal probability $D_j$ to protect the $j$-th tank is obtained as $D_j = \frac{1}{m}\sum_{i=1}^{m}\Pr(T_j, S_i)$, where $1 \le j \le n$.
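As a concrete illustration of this averaging, the short sketch below builds a joint allocation matrix $\Pr(T_j, S_i)$ for four tanks and five resources and marginalizes over the resource columns; the matrix entries are illustrative assumptions, not values taken from the paper.

```python
import numpy as np

# Illustrative joint allocation probabilities Pr(T_j, S_i): rows are tanks T_1..T_4,
# columns are resources S_1..S_5. Each column sums to 1 (every resource is allocated
# across the tanks), as required by Equation (2). The entries are assumed for this
# sketch only.
P = np.array([
    [0.4, 0.1, 0.3, 0.2, 0.1],
    [0.2, 0.4, 0.2, 0.3, 0.2],
    [0.3, 0.3, 0.1, 0.3, 0.3],
    [0.1, 0.2, 0.4, 0.2, 0.4],
])
assert np.allclose(P.sum(axis=0), 1.0)  # each resource column is a distribution over tanks

# Marginal protection probability of tank j: D_j = (1/m) * sum_i Pr(T_j, S_i).
m = P.shape[1]
D = P.sum(axis=1) / m
print(D, D.sum())  # e.g. [0.22 0.26 0.26 0.26] 1.0 -- the D_j sum to 1, as in the text
```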

3.2. Defining the Reward Functions

Let $R_B(T)$ be the reward to team B if the attacked tank $T$ is protected using resources from the set $S = \{S_1, S_2, \ldots, S_m\}$, $C_B(T)$ be the cost to team B if the attacked tank $T$ is unprotected, $R_R(T)$ be the reward to team R if the attacked tank $T$ is unprotected, and $C_R(T)$ be the cost to team R if the attacked tank $T$ is protected. Note that $D_T$ is the marginal probability of protecting the tank $T$ using the resources from the set $S$.
The quantity $D_T R_B(T) - (1 - D_T) C_B(T)$ then describes the payoff to the B team when tank $T$ is attacked. Similarly, the quantity $(1 - D_T) R_R(T) - D_T C_R(T)$ describes the payoff to the R team when the tank $T$ is attacked. However, the probability that the tank $T$ is attacked is $A_T$, and we can take this into consideration to define the quantities $A_T\{D_T R_B(T) - (1 - D_T) C_B(T)\}$ and $A_T\{(1 - D_T) R_R(T) - D_T C_R(T)\}$. These are the contributions to the payoffs to the B and R teams, respectively, when the tank $T$ is attacked with the probability $A_T$.
As the vector $A_T$ describes the R team's (mixed) attacking strategy whereas the vector $D_T$ describes the B team's (mixed) protection strategy, the players' strategy profiles are given as $\{D_T, A_T\}$. For a set of tanks $T$, the expected payoffs [16,17] to the B and R teams, respectively, can then be written as
$$\Pi_B\{D_T, A_T\} = \sum_{T \in T} A_T\{D_T R_B(T) - (1 - D_T) C_B(T)\}, \qquad \Pi_R\{D_T, A_T\} = \sum_{T \in T} A_T\{(1 - D_T) R_R(T) - D_T C_R(T)\}. \quad (3)$$
We note from these payoffs that if the attack probability for a tank $T$ is zero, the rewards to both the B and R teams for that tank are also zero; the payoff functions for either team depend only on the attacked tanks; and if the B and R teams move simultaneously, the solution is a Nash equilibrium.
Note that, with reference to the reward functions defined in Equation (3), for the imagined strategy profile (not equilibrium) where the defender protects the first tank by all elements from the set $\{S_1, S_2, \ldots, S_5\}$, i.e., $\Pr(T_1, S_i) = 1$ for $1 \le i \le 5$, we obtain $D_1 = \frac{1}{5}\sum_{i=1}^{5}\Pr(T_1, S_i) = 1$ and thus $D_{2,3,4} = 0$. If the attacker decides to attack the first tank, i.e., $A_1 = 1$, we obtain $\Pi_B\{D_T, A_T\} = R_B(T_1)$ and $\Pi_R\{D_T, A_T\} = -C_R(T_1)$.
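A direct transcription of the payoffs in Equation (3) is sketched below; the reward and cost numbers in the usage line are placeholders chosen only to exercise the formulas and to reproduce the special profile just discussed.

```python
import numpy as np

def payoffs(D, A, R_B, C_B, R_R, C_R):
    """Expected payoffs of Equation (3) for the B (defender) and R (attacker) teams.

    D, A     : marginal protection and attack probabilities over the tanks.
    R_B, C_B : defender's reward (attacked tank protected) and cost (unprotected).
    R_R, C_R : attacker's reward (attacked tank unprotected) and cost (protected).
    """
    D, A = np.asarray(D, float), np.asarray(A, float)
    pi_B = float(np.sum(A * (D * np.asarray(R_B) - (1 - D) * np.asarray(C_B))))
    pi_R = float(np.sum(A * ((1 - D) * np.asarray(R_R) - D * np.asarray(C_R))))
    return pi_B, pi_R

# Placeholder values (assumptions, not the paper's table): protect only tank 1 and
# attack only tank 1, which returns (R_B(T_1), -C_R(T_1)).
print(payoffs(D=[1, 0, 0, 0], A=[1, 0, 0, 0],
              R_B=[5, 4, 3, 2], C_B=[2, 2, 2, 2],
              R_R=[3, 3, 3, 3], C_R=[1, 1, 1, 1]))   # -> (5.0, -1.0)
```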

4. Leader-Follower Interaction and Stackelberg Equilibrium

We consider a three-step strategic game between the B and R teams, also called the leader-follower interaction. As the leader, the B team chooses an action consisting of a protection strategy $D_T$. The R team observes $D_T$ and then chooses an action consisting of its attack strategy given by the vector $A_T$. Knowing the rational response $A_T$ of the R team, the B team takes this into account and, as the leader, optimizes its own action. The payoffs to the two teams are $\Pi_B\{D_T, A_T\}$ and $\Pi_R\{D_T, A_T\}$.
This game is an example of the dynamic games of complete and perfect information [2]. Key features of this game are (a) the moves occur in sequence, (b) all previous moves are known before the next move is chosen, and (c) the players' payoffs are common knowledge. This framework allows for strategic decision-making based on the actions and expected reactions of the other players, typical of Stackelberg competition scenarios. In many real-world scenarios, especially in complex military environments, the assumption that players' payoffs are common knowledge does not hold, and complete information about the payoffs of other players is rarely available.
Given the action $D_T$ previously chosen by the B team, at the second stage of the game, when the R team gets the move, it faces the problem:
$$\max_{A_T} \Pi_R\{D_T, A_T\}. \quad (4)$$
Assume that for each $D_T$, the R team's optimization problem (4) has a unique solution $S_R(D_T)$, which is known as the best response of the R team. The B team can also solve the R team's optimization problem by anticipating the R team's response to each action $D_T$ that the B team might take, so that the B team faces the problem:
$$\max_{D_T} \Pi_B\{D_T, S_R(D_T)\}. \quad (5)$$
Suppose this optimization problem also has a unique solution for the B team, denoted by $D_T^*$. The solution $(D_T^*, S_R(D_T^*))$ is the backwards-induction outcome of this game.
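On a finite grid of defender strategies, this backward-induction outcome can be approximated numerically: for each candidate $D_T$, compute the follower's best response, then let the leader keep the $D_T$ whose induced response gives the leader the highest payoff. The sketch below does this with pure attacks as follower responses (the follower's problem (4) is linear in $A_T$, so some pure attack is always optimal); it is a coarse brute-force counterpart to the analytical treatment that follows, and all numeric inputs are assumptions.

```python
import itertools
import numpy as np

def follower_best_response(D, R_R, C_R):
    """Attacker's best pure response to a protection profile D (problem (4)).
    Pi_R is linear in A_T, so a pure attack on some tank is always optimal."""
    per_tank = (1 - D) * R_R - D * C_R          # attacker's payoff from each tank
    A = np.zeros_like(D)
    A[int(np.argmax(per_tank))] = 1.0
    return A

def leader_payoff(D, A, R_B, C_B):
    return float(np.sum(A * (D * R_B - (1 - D) * C_B)))

def stackelberg_by_grid(R_B, C_B, R_R, C_R, step=0.05):
    """Coarse grid search over defender marginals with sum(D) = 1 (problem (5))."""
    best_val, best_D, best_A = -np.inf, None, None
    grid = np.arange(0.0, 1.0 + 1e-9, step)
    for d1, d2, d3 in itertools.product(grid, repeat=3):
        d4 = 1.0 - d1 - d2 - d3
        if d4 < -1e-9:
            continue
        D = np.array([d1, d2, d3, max(d4, 0.0)])
        A = follower_best_response(D, R_R, C_R)
        val = leader_payoff(D, A, R_B, C_B)
        if val > best_val:
            best_val, best_D, best_A = val, D, A
    return best_val, best_D, best_A

# Assumed illustrative parameters (not the paper's):
print(stackelberg_by_grid(R_B=np.array([5., 4., 3., 2.]), C_B=np.array([2., 2., 2., 2.]),
                          R_R=np.array([3., 3., 3., 3.]), C_R=np.array([1., 1., 1., 1.])))
```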
To address this, we consider the above simplified case, i.e., when $T = \{T_1, T_2, \ldots, T_4\}$. Expanding Equation (3), we obtain:
$$\Pi_R\{D_T, A_T\} = \sum_{i=1}^{4} A_i\{(1 - D_i) R_R(T_i) - D_i C_R(T_i)\}. \quad (6)$$
Now, as $\sum_{T=1}^{4} A_T = 1$, we take as an arbitrary choice $A_3 = 1 - A_1 - A_2 - A_4$ in Equation (6) to obtain
$$\Pi_R\{D_T, A_T\} = \sum_{\substack{i=1 \\ i \ne 3}}^{4} A_i\{(1 - D_i) R_R(T_i) - D_i C_R(T_i)\} + (1 - A_1 - A_2 - A_4)\{(1 - D_3) R_R(T_3) - D_3 C_R(T_3)\}, \quad (7)$$
and this re-expresses the R team's reward function in terms of only three variables $A_1$, $A_2$, and $A_4$ defining its attack strategy $A_T$. When expanded, the above equation becomes
$$\begin{aligned}
\Pi_R\{D_T, A_T\} ={}& A_1\{(1 - D_1) R_R(T_1) - D_1 C_R(T_1) - (1 - D_3) R_R(T_3) + D_3 C_R(T_3)\} \\
&+ A_2\{(1 - D_2) R_R(T_2) - D_2 C_R(T_2) - (1 - D_3) R_R(T_3) + D_3 C_R(T_3)\} \\
&+ A_4\{(1 - D_4) R_R(T_4) - D_4 C_R(T_4) - (1 - D_3) R_R(T_3) + D_3 C_R(T_3)\} \\
&+ (1 - D_3) R_R(T_3) - D_3 C_R(T_3). \quad (8)
\end{aligned}$$
As a rational player, the B team knows that the R team would maximize its reward function with respect to its strategic variables and this is expressed as
$$\frac{\partial \Pi_R\{D_T, A_T\}}{\partial A_1} = \frac{\partial \Pi_R\{D_T, A_T\}}{\partial A_2} = \frac{\partial \Pi_R\{D_T, A_T\}}{\partial A_4} = 0, \quad (9)$$
where $A_1, A_2, A_4 \in [0, 1]$ and $\sum_{T=1}^{4} A_T = 1$. This results in obtaining
$$\begin{aligned}
D_3\{R_R(T_3) + C_R(T_3)\} &= D_1\{R_R(T_1) + C_R(T_1)\} - R_R(T_1) + R_R(T_3), \\
D_3\{R_R(T_3) + C_R(T_3)\} &= D_2\{R_R(T_2) + C_R(T_2)\} - R_R(T_2) + R_R(T_3), \\
D_3\{R_R(T_3) + C_R(T_3)\} &= D_4\{R_R(T_4) + C_R(T_4)\} - R_R(T_4) + R_R(T_3), \quad (10)
\end{aligned}$$
and this leads us to denote the sum of the reward and the cost to the B and R teams for protecting or attacking the tank T , respectively, by new symbols
$$\begin{aligned}
\Omega_1^R &= R_R(T_1) + C_R(T_1), & \Omega_2^R &= R_R(T_2) + C_R(T_2), & \Omega_3^R &= R_R(T_3) + C_R(T_3), & \Omega_4^R &= R_R(T_4) + C_R(T_4), \\
\Omega_1^B &= R_B(T_1) + C_B(T_1), & \Omega_2^B &= R_B(T_2) + C_B(T_2), & \Omega_3^B &= R_B(T_3) + C_B(T_3), & \Omega_4^B &= R_B(T_4) + C_B(T_4). \quad (11)
\end{aligned}$$
As $0 \le D_i \le 1$ and $\sum_{i=1}^{4} D_i = 1$, we substitute $D_3 = (1 - D_1 - D_2 - D_4)$ in Equation (10), along with the substitutions (11), to obtain
$$(1 - D_2 - D_4)\,\Omega_3^R = D_1(\Omega_1^R + \Omega_3^R) - R_R(T_1) + R_R(T_3), \quad (12)$$
$$(1 - D_1 - D_4)\,\Omega_3^R = D_2(\Omega_2^R + \Omega_3^R) - R_R(T_2) + R_R(T_3), \quad (13)$$
$$(1 - D_1 - D_2)\,\Omega_3^R = D_4(\Omega_4^R + \Omega_3^R) - R_R(T_4) + R_R(T_3). \quad (14)$$
Using Equations (12)–(14), we now express $D_2$ and $D_4$ in terms of $D_1$. For this, we subtract Equation (13) from Equation (12),
$$(D_1 - D_2)\,\Omega_3^R = D_1(\Omega_1^R + \Omega_3^R) - D_2(\Omega_2^R + \Omega_3^R) - R_R(T_1) + R_R(T_2),$$
which gives
$$D_2 = \frac{D_1 \Omega_1^R - R_R(T_1) + R_R(T_2)}{\Omega_2^R}. \quad (15)$$
Similarly, subtracting Equation (14) from (12) results in
$$D_4 = \frac{D_1 \Omega_1^R + R_R(T_4) - R_R(T_1)}{\Omega_4^R}. \quad (16)$$
Using Equations (15) and (16), the marginal $D_3$ can then be expressed in terms of the marginal $D_1$ as
$$D_3 = 1 - D_1\left[1 + \Omega_1^R\left(\frac{1}{\Omega_2^R} + \frac{1}{\Omega_4^R}\right)\right] - \frac{R_R(T_2) - R_R(T_1)}{\Omega_2^R} - \frac{R_R(T_4) - R_R(T_1)}{\Omega_4^R}. \quad (17)$$
Equations (15)–(17) represent the rational behaviour of the R team, which the B team can now exploit to optimize its defence strategy $D_T$.
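Transcribed directly, these follower-rationality relations give the remaining defender marginals as functions of $D_1$; a minimal sketch follows, with illustrative attacker rewards and costs that are assumptions rather than the paper's values.

```python
def induced_marginals(D1, R_R, C_R):
    """D_2, D_4 from Equations (15)-(16) and D_3 from the normalization
    D_3 = 1 - D_1 - D_2 - D_4 (Equation (17)). Lists are indexed 0..3 for T_1..T_4."""
    omega_R = [r + c for r, c in zip(R_R, C_R)]            # Omega_i^R of Equation (11)
    D2 = (D1 * omega_R[0] - R_R[0] + R_R[1]) / omega_R[1]  # Equation (15)
    D4 = (D1 * omega_R[0] + R_R[3] - R_R[0]) / omega_R[3]  # Equation (16)
    D3 = 1.0 - D1 - D2 - D4                                # Equation (17)
    return D2, D3, D4

# Assumed illustrative attacker rewards/costs:
print(induced_marginals(0.1, R_R=[2.0, 3.0, 4.0, 3.0], C_R=[2.0, 3.0, 2.0, 3.0]))
# -> (0.2333..., 0.4333..., 0.2333...): all marginals stay non-negative at D_1 = 0.1
```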
From Equation (3), the payoff function of the B team can be expressed as
$$\Pi_B\{D_T, A_T\} = \sum_{i=1}^{4}\{D_i \Omega_i^B - C_B(T_i)\} A_i, \quad (18)$$
and with the substitution $D_3 = (1 - D_1 - D_2 - D_4)$ this results in
$$\begin{aligned}
\Pi_B\{D_T, A_T\} ={}& D_1\{\Omega_1^B A_1 - \Omega_3^B A_3\} - C_B(T_1) A_1 + D_2\{\Omega_2^B A_2 - \Omega_3^B A_3\} - C_B(T_2) A_2 \\
&+ D_4\{\Omega_4^B A_4 - \Omega_3^B A_3\} - C_B(T_4) A_4 + \Omega_3^B A_3 - C_B(T_3) A_3. \quad (19)
\end{aligned}$$
Now, substituting from Equations (15) and (16) into Equation (19), along with the substitutions (11), we obtain
$$\begin{aligned}
\Pi_B\{D_T, A_T\} ={}& D_1\left\{\Omega_1^B A_1 - \Omega_3^B A_3 + \frac{\Omega_1^R[\Omega_2^B A_2 - \Omega_3^B A_3]}{\Omega_2^R} + \frac{\Omega_1^R[\Omega_4^B A_4 - \Omega_3^B A_3]}{\Omega_4^R}\right\} \\
&+ \frac{[R_R(T_2) - R_R(T_1)][\Omega_2^B A_2 - \Omega_3^B A_3]}{\Omega_2^R} + \frac{[R_R(T_4) - R_R(T_1)][\Omega_4^B A_4 - \Omega_3^B A_3]}{\Omega_4^R} \\
&+ \Omega_3^B A_3 - [C_B(T_1) A_1 + C_B(T_2) A_2 + C_B(T_3) A_3 + C_B(T_4) A_4] \\
={}& D_1 \Delta_1 + \Delta_2, \quad (20)
\end{aligned}$$
where
$$\Delta_1 = \Omega_1^B A_1 - \Omega_3^B A_3 + \frac{\Omega_1^R[\Omega_2^B A_2 - \Omega_3^B A_3]}{\Omega_2^R} + \frac{\Omega_1^R[\Omega_4^B A_4 - \Omega_3^B A_3]}{\Omega_4^R}, \quad (21)$$
$$\Delta_2 = \frac{[R_R(T_2) - R_R(T_1)][\Omega_2^B A_2 - \Omega_3^B A_3]}{\Omega_2^R} + \frac{[R_R(T_4) - R_R(T_1)][\Omega_4^B A_4 - \Omega_3^B A_3]}{\Omega_4^R} + \Omega_3^B A_3 - [C_B(T_1) A_1 + C_B(T_2) A_2 + C_B(T_3) A_3 + C_B(T_4) A_4], \quad (22)$$
appear as the new parameters of the considered sequential strategic interaction. This completes the backwards-induction process of obtaining the optimal response of the B team in view of its encounter with the rational behaviour of the R team.
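Equations (21) and (22) can be evaluated directly once the per-tank rewards, costs, and the attack probabilities are given; the following sketch mirrors the symbols of the text (the numbers in the usage line are illustrative assumptions).

```python
def deltas(A, R_B, C_B, R_R, C_R):
    """Delta_1 and Delta_2 of Equations (21)-(22). All lists are indexed 0..3
    for the tanks T_1..T_4; A holds the attack probabilities A_1..A_4."""
    oB = [r + c for r, c in zip(R_B, C_B)]   # Omega_i^B, Equation (11)
    oR = [r + c for r, c in zip(R_R, C_R)]   # Omega_i^R, Equation (11)
    delta1 = (oB[0] * A[0] - oB[2] * A[2]
              + oR[0] * (oB[1] * A[1] - oB[2] * A[2]) / oR[1]
              + oR[0] * (oB[3] * A[3] - oB[2] * A[2]) / oR[3])
    delta2 = ((R_R[1] - R_R[0]) * (oB[1] * A[1] - oB[2] * A[2]) / oR[1]
              + (R_R[3] - R_R[0]) * (oB[3] * A[3] - oB[2] * A[2]) / oR[3]
              + oB[2] * A[2]
              - sum(c * a for c, a in zip(C_B, A)))
    return delta1, delta2

# Illustrative (assumed) rewards, costs, and a uniform attack:
print(deltas(A=[0.25, 0.25, 0.25, 0.25],
             R_B=[5.0, 4.0, 3.0, 2.0], C_B=[2.0, 2.0, 2.0, 2.0],
             R_R=[2.0, 3.0, 4.0, 3.0], C_R=[2.0, 3.0, 2.0, 3.0]))
```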

5. Optimal Response of the B Team

From Equations (21) and (22) we note that $\Delta_{1,2}$ depend on the values assigned to the two teams' rewards and costs variables, i.e., $R_B(T)$, $C_B(T)$, $R_R(T)$, $C_R(T)$, as well as on the R team's attack probabilities $A_i$ ($1 \le i \le 4$). Three cases, therefore, emerge in view of Equation (20); these are described below.

5.1. Case $\Delta_1 > 0$

After observing the attack probabilities $A_i$ ($1 \le i \le 4$), the B team obtains $\Delta_1$ using Equation (21), along with the rewards and costs variables, i.e., $R_B(T)$, $C_B(T)$, $R_R(T)$, $C_R(T)$, and Equation (11). If the B team finds that $\Delta_1 > 0$, then its payoff $\Pi_B\{D_T, A_T\}$ is maximized at the maximum value of $D_1$, irrespective of the value of $\Delta_2$. Note that at this maximum value of $D_1$, the corresponding values of $D_2$, $D_3$, $D_4$, as expressed in terms of $D_1$ and given by Equations (15)–(17), must remain non-negative, and that the maximum value obtained for $D_1$ can still be less than $D_2$ or $D_3$ or $D_4$.
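In this case the leader's problem reduces to pushing $D_1$ as high as the non-negativity of $D_2$, $D_3$, $D_4$ allows; a simple scan over $D_1 \in [0, 1]$ (sketch below, with assumed illustrative parameters) finds that cutoff.

```python
def max_feasible_D1(R_R, C_R, steps=10_000):
    """Largest D_1 in [0, 1] for which the induced D_2, D_3, D_4 of
    Equations (15)-(17) all remain non-negative (case Delta_1 > 0)."""
    oR = [r + c for r, c in zip(R_R, C_R)]
    best = None
    for k in range(steps + 1):
        D1 = k / steps
        D2 = (D1 * oR[0] - R_R[0] + R_R[1]) / oR[1]   # Equation (15)
        D4 = (D1 * oR[0] + R_R[3] - R_R[0]) / oR[3]   # Equation (16)
        D3 = 1.0 - D1 - D2 - D4                        # Equation (17)
        if min(D2, D3, D4) >= 0.0:
            best = (D1, D2, D3, D4)                    # overwrites: keeps the largest feasible D_1
    return best

# Assumed illustrative attacker rewards/costs:
print(max_feasible_D1(R_R=[2.0, 3.0, 4.0, 3.0], C_R=[2.0, 3.0, 2.0, 3.0]))
# -> D_1 ~ 0.2857, at which D_3 has just reached zero
```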

5.2. Case $\Delta_1 < 0$

As $0 \le D_i \le 1$ and $\sum_{i=1}^{4} D_i = 1$, therefore, in view of the attack probabilities $A_i$ ($1 \le i \le 4$), if the B team finds that $\Delta_1 < 0$, then the reward is maximized to the value of $\Delta_2$ with $D_1 = 0$, and we then have
$$D_2 = \frac{R_R(T_2) - R_R(T_1)}{\Omega_2^R}, \qquad D_3 = 1 - \frac{R_R(T_2) - R_R(T_1)}{\Omega_2^R} - \frac{R_R(T_4) - R_R(T_1)}{\Omega_4^R}, \qquad D_4 = \frac{R_R(T_4) - R_R(T_1)}{\Omega_4^R}. \quad (23)$$
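The closed form of Equation (23) is equally direct to transcribe; the numbers below are again assumed for illustration only.

```python
def protection_when_delta1_negative(R_R, C_R):
    """Defender marginals (D_1, D_2, D_3, D_4) of Equation (23), the case Delta_1 < 0."""
    oR = [r + c for r, c in zip(R_R, C_R)]
    D2 = (R_R[1] - R_R[0]) / oR[1]
    D4 = (R_R[3] - R_R[0]) / oR[3]
    return 0.0, D2, 1.0 - D2 - D4, D4

# Assumed illustrative attacker rewards/costs:
print(protection_when_delta1_negative(R_R=[2.0, 3.0, 4.0, 3.0], C_R=[2.0, 3.0, 2.0, 3.0]))
# -> (0.0, 0.1666..., 0.6666..., 0.1666...)
```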

5.3. Case $\Delta_1 = 0$

If the B team finds that $\Delta_1 = 0$, then its reward becomes $\Delta_2$, as defined by Equation (22), and is independent of the value assigned to $D_1$ and, via Equations (15)–(17), also independent of $D_2$, $D_3$, $D_4$.

6. Example Instantiation

As an example, we consider the set of arbitrarily assigned values for the two teams' rewards and costs as in the table below.
[Table (24): the assigned rewards $R_B(T_i)$, $R_R(T_i)$ and costs $C_B(T_i)$, $C_R(T_i)$ for the tanks $T_1$–$T_4$; reproduced as an image in the original.]
for which we have
$$\Omega_1^B = 16, \quad \Omega_1^R = 6, \quad \Omega_2^B = 15, \quad \Omega_2^R = 11, \quad \Omega_3^B = 13, \quad \Omega_3^R = 10, \quad \Omega_4^B = 6, \quad \Omega_4^R = 6, \quad (25)$$
and also for which using Equations (21) and (22) we obtain
$$\Delta_1 = 16 A_1 + 8.181 A_2 - 33.090 A_3 + 6 A_4, \quad (26)$$
$$\Delta_2 = -7 A_1 - 4.272 A_2 + 2.469 A_3 - A_4. \quad (27)$$
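The sketch below evaluates Equations (21) and (22) for one assignment of rewards and costs that is consistent with the $\Omega$ values of Equation (25); the specific assignment is an assumption made for the sketch, but with it the coefficients quoted in Equations (26) and (27) are recovered.

```python
# One rewards/costs assignment consistent with Equation (25) (an assumption for this
# sketch; the paper's table is reproduced only as an image and other assignments with
# the same Omega values exist).
R_B = [9.0, 8.0, 7.0, 4.0]; C_B = [7.0, 7.0, 6.0, 2.0]
R_R = [4.0, 6.0, 7.0, 5.0]; C_R = [2.0, 5.0, 3.0, 1.0]

oB = [r + c for r, c in zip(R_B, C_B)]   # -> [16, 15, 13, 6], matching Equation (25)
oR = [r + c for r, c in zip(R_R, C_R)]   # -> [6, 11, 10, 6],  matching Equation (25)

def deltas(A):
    """Delta_1 and Delta_2 of Equations (21)-(22) for this assignment."""
    d1 = (oB[0] * A[0] - oB[2] * A[2]
          + oR[0] * (oB[1] * A[1] - oB[2] * A[2]) / oR[1]
          + oR[0] * (oB[3] * A[3] - oB[2] * A[2]) / oR[3])
    d2 = ((R_R[1] - R_R[0]) * (oB[1] * A[1] - oB[2] * A[2]) / oR[1]
          + (R_R[3] - R_R[0]) * (oB[3] * A[3] - oB[2] * A[2]) / oR[3]
          + oB[2] * A[2] - sum(c * a for c, a in zip(C_B, A)))
    return d1, d2

# Reading off the coefficients of A_1..A_4 at the unit vectors recovers
# Equations (26)-(27): ~ (16, -7), (8.18, -4.27), (-33.09, 2.47), (6, -1).
for k in range(4):
    A = [0.0] * 4
    A[k] = 1.0
    print(f"A_{k + 1}:", deltas(A))
```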

6.1. Case $\Delta_1 > 0$

Now, assume that, while knowing the attack probabilities $A_i$ ($1 \le i \le 4$), the B team uses Equation (26) to find that $\Delta_1 > 0$. As discussed above, its payoff is maximized at the maximum value of $D_1$, irrespective of the value of $\Delta_2$. Using Equations (15) and (16), along with the entries in the table (24), the B team now determines the maximum value for $D_1$ at which $D_2$, $D_3$, $D_4$, obtained from Equations (15)–(17), respectively, all have non-negative values. Table (24) gives
$$D_2 = \frac{6 D_1 + 2}{11}, \qquad D_3 = 1 - 10.272 D_1 - 0.3485, \qquad D_4 = \frac{6 D_1 + 1}{6}, \quad (28)$$
and a table of values is then obtained as
[Table of values of $D_2$, $D_3$, and $D_4$ for increasing $D_1$; reproduced as an image in the original.]
and $D_1 = 0.0634$ emerges as the maximum value at which $D_2$, $D_3$, $D_4$ remain non-negative. The plots of $D_2$, $D_3$, and $D_4$ vs. $D_1$ (Range: 0.06 to 0.0636) appear in Figure 1.
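The cutoff can also be read off directly from Equation (28): $D_2$ and $D_4$ are positive for every $D_1 \ge 0$, so the binding constraint is $D_3 \ge 0$; a short check (sketch) follows.

```python
# From Equation (28), D_2 and D_4 are already positive at D_1 = 0 and grow with D_1,
# so the binding constraint is D_3 = 1 - 10.272*D_1 - 0.3485 >= 0.
D1_max = (1.0 - 0.3485) / 10.272
D2, D4 = (6 * D1_max + 2) / 11, (6 * D1_max + 1) / 6
print(round(D1_max, 4), round(D2, 3), round(D4, 3))   # -> 0.0634 0.216 0.23
```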
The B team’s protection strategy is therefore obtained from Equation (28) as
$$D_T^* = (D_1^*, D_2^*, D_3^*, D_4^*) = (0.0634,\ 0.216,\ 2.24 \times 10^{-4},\ 0.23), \quad (30)$$
and from Equations (20), (26) and (27), along with the table (24), the B team's payoff then becomes
$$\Pi_B\{D_T^*, A_T\} = -5.986 A_1 - 3.753 A_2 + 0.371 A_3 - 0.62 A_4, \quad (31)$$
which, in view of the fact that $\sum_{i=1}^{4} A_i = 1$, can also be expressed as
$$\Pi_B\{D_T^*, A_T\} = -6.357 A_1 - 4.124 A_2 - 0.991 A_4 + 0.371. \quad (32)$$
A plot of the payoff $\Pi_B\{D_T^*, A_T\}$ for the values of $A_1$, $A_2$, $A_4$ that satisfy the constraints $0 \le A_1, A_2, A_4 \le 1$ and $0 \le A_1 + A_2 + A_4 \le 1$ is given in Figure 2.
From Equation (32), the payoff $\Pi_B\{D_T^*, A_T\}$ is maximized at the value of 0.371 for $A_3 = 1$, and therefore $A_{1,2,4} = 0$. The payoff to the R team is then obtained from Equation (8) as
$$\Pi_R\{D_T^*, A_T\} = (1 - D_3^*) R_R(T_3) - D_3^* C_R(T_3), \quad (33)$$
where from Equation (30) we have $D_3^* \approx 0$ and, using the table (24), we obtain $\Pi_R\{D_T^*, A_T\} = R_R(T_3) = 7$.
Now we consider the reaction of the R team after the B team has determined its protection strategy $D_T^*$ while following the backwards induction in the case above. For the case when the attack probabilities are such that $\Delta_1 > 0$ in Equation (26), we re-express the R team's payoff given by Equation (8) by substituting the B team's protection strategy described by Equation (30). Using $\sum_{i=1}^{4} A_i = 1$, the R team's payoff is then expressed in terms of the attack probabilities $A_1$, $A_2$, $A_4$ as
$$\Pi_R\{D_T^*, A_T\} = -3.38(A_1 + A_2 + A_4) + 7. \quad (34)$$
Now, in Figure 3 below, a plot is obtained comparing the B and R teams' payoffs given by Equations (32) and (34), respectively, when these are considered implicit functions of $A_1$, $A_2$, $A_4$ and with the constraints that $0 \le A_1, A_2, A_4 \le 1$ and $0 \le A_1 + A_2 + A_4 \le 1$. For most of the allowed values of the attack probabilities, as represented by the blue shade, and for $\Delta_1 > 0$, the R team remains significantly better off than the B team.
In view of the reward table (24), the R team's payoffs attain the maximum value of 7 when $A_1 + A_2 + A_4 = 0$, or when $A_3 = 1$. However, when this is the case, using Equation (32) the payoff to the B team then becomes 0.371.

6.2. Case $\Delta_1 \le 0$

Consider the case when, by using Equation (26), the B team finds that $\Delta_1 \le 0$. As $0 \le A_1 + A_2 + A_4 \le 1$ in Equation (26), the condition $\Delta_1 \le 0$ can be realized for some values of the attack probabilities.
A plot of the payoff $\Pi_B\{D_T^*, A_T\}$ for the values of $A_1$, $A_2$, $A_4$ that satisfy the constraints $0 \le A_1, A_2, A_4 \le 1$ and $0 \le A_1 + A_2 + A_4 \le 1$ is given in Figure 4.
Now, in view of Equation (20), the B team's reward is maximized to the value of $\Delta_2$ when $D_1 = 0$. In this case, using Equation (23) and the table (24), the B team's protection strategy is therefore obtained as
$$D_T^* = (D_1^*, D_2^*, D_3^*, D_4^*) = (0,\ 0.181,\ 0.651,\ 0.167), \quad (35)$$
and, as before, using Equations (20), (26) and (27), along with the table (24), the B team's payoff then becomes $\Pi_B\{D_T^*, A_T\} = \Delta_2$, i.e.,
$$\Pi_B\{D_T^*, A_T\} = -7 A_1 - 4.272 A_2 + 2.47 A_3 - A_4, \quad (36)$$
which, in view of the fact that $\sum_{i=1}^{4} A_i = 1$, can also be expressed as
$$\Pi_B\{D_T^*, A_T\} = -9.47 A_1 - 6.742 A_2 - 3.47 A_4 + 2.47, \quad (37)$$
and similarly for the R team
$$\Pi_R\{D_T^*, A_T\} = 3.51(A_1 + A_2 + A_4) + 0.49. \quad (38)$$

7. Discussion

We consider the case that the B team moves first and commits to a protection strategy $D_T$. The R team observes the protection strategy $D_T$ and decides its attack strategy given by the vector $A_T$. The B team knows that the R team is a rational decision maker and how it will react to its protection strategy $D_T$. The leader-follower interaction, resulting in the consideration of the Stackelberg equilibrium, looks into finding the B team's best protection strategy $D_T^*$ while knowing that the R team is going to act rationally in view of a protection strategy $D_T$ committed to by the B team. The R team's mixed strategy is given by the vector $A_T$ of the attack probabilities $A_1$, $A_2$, $A_3$, $A_4$ on the four tanks.
The vector $D_T$ describing the B team's allocation of its resources depends crucially on the parameter $\Delta_1$ as defined in Equation (21), which is obtained from the values assigned in the table (24) to the rewards and the costs of the two teams. If $\Delta_1 > 0$, then the reward to the B team is maximized at the maximum value of $D_1$ for which $D_2$, $D_3$, $D_4$, as expressed in terms of $D_1$ by Equations (15)–(17), remain non-negative. That is, the maximum value obtained for $D_1$ can still be less than $D_2$ or $D_3$ or $D_4$.
We note that for the case $\Delta_1 > 0$ and for most situations encountered by the two teams, represented by the area covered by the blue shade in Figure 3, the reward for the B team remains between $-6$ and 0.5, whereas the reward for the R team remains between 0.5 and 7.
However, in the case $\Delta_1 \le 0$ and for most situations encountered by the two teams, represented now by the area covered by the blue shade in Figure 5, the reward for the B team remains between $-6.5$ and 2.5, whereas the reward for the R team remains between 0.5 and 4.
That is, for most of the allowed values of the attack probabilities, the B team can receive a higher reward when $\Delta_1 \le 0$ relative to the case when $\Delta_1 > 0$. However, for most of the allowed values of the attack probabilities, the R team can receive less reward when $\Delta_1 \le 0$ relative to the case when $\Delta_1 > 0$. Therefore, the situation $\Delta_1 \le 0$ is more favourable to the B team than it is to the R team. Similarly, the situation $\Delta_1 > 0$ turns out to be more favourable to the R team than it is to the B team. Note that these results are specific to the particular values assigned in the considered example to the parameters $R_B(T)$, $C_B(T)$, $R_R(T)$, $C_R(T)$ for the four tanks.

8. Conclusions

The Stackelberg equilibrium in this scenario is reached when the drones have optimized their attack patterns in view of the protections provided to the tanks, and the tanks have subsequently optimized their protections in light of the drones' best responses. The dynamic interplay of strategic decision-making, under the principles of Stackelberg equilibrium and backwards induction, highlights the intricate nature of modern warfare involving drones and tanks, where brains and brawn are equally pivotal. A natural extension of this work is the case when the set of resources consists of $m$ elements, i.e., $S = \{S_1, S_2, \ldots, S_m\}$, and the set of tanks is given as $T = \{T_1, T_2, \ldots, T_n\}$.

Author Contributions

Conceptualization, A.I.; Methodology, A.I.; Software, I.H.; Validation, E.T. and R.B.; Formal analysis, A.I.; Investigation, E.T., A.P. and R.B.; Writing—original draft, A.I.; Writing—review & editing, A.I. and C.S.; Visualization, A.P. and G.P.; Supervision, C.S.; Project administration, C.S. All authors have read and agreed to the published version of the manuscript.

Funding

The work in this paper was carried out under a Research Agreement between the Defence Science and Technology Group, Department of Defence, Australia, and the University of Adelaide, Contract No. UA216424-S27.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Binmore, K. Game Theory: A Very Short Introduction; Oxford University Press: Oxford, UK, 2007.
2. Rasmusen, E. Games and Information: An Introduction to Game Theory, 3rd ed.; Blackwell Publishers Ltd.: Oxford, UK, 2001.
3. Osborne, M.J. An Introduction to Game Theory; Oxford University Press: Oxford, UK, 2003.
4. Cournot, A. Researches into the Mathematical Principles of the Theory of Wealth; Bacon, N., Ed.; Macmillan: New York, NY, USA, 1897.
5. Tirole, J. The Theory of Industrial Organization; MIT: Cambridge, MA, USA, 1988.
6. von Stackelberg, H. Marktform und Gleichgewicht; Julius Springer: Vienna, Austria, 1934.
7. Gibbons, R. Game Theory for Applied Economists; Princeton University Press: Princeton, NJ, USA, 1992.
8. Korzhyk, D.; Yin, Z.; Kiekintveld, C.; Conitzer, V.; Tambe, M. Stackelberg vs. Nash in Security Games: An Extended Investigation of Interchangeability, Equivalence, and Uniqueness. J. AI Res. (JAIR) 2011, 41, 297–327.
9. Bustamante-Faúndez, P.; Bucarey, L.V.; Labbé, M.; Marianov, V.; Ordoñez, F. Playing Stackelberg Security Games in perfect formulations. Omega 2024, 126, 103068.
10. Hunt, K.; Zhuang, J. A review of attacker-defender games: Current state and paths forward. Eur. J. Oper. Res. 2024, 313, 401–417.
11. Chen, X.; Xiao, L.; Feng, W.; Ge, N.; Wang, X. DDoS Defense for IoT: A Stackelberg Game Model-Enabled Collaborative Framework. IEEE Int. Things J. 2022, 9, 9659–9674.
12. Bansal, G.; Sikdar, B. Security Service Pricing Model for UAV Swarms: A Stackelberg Game Approach. In Proceedings of the IEEE INFOCOM 2021—IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Vancouver, BC, Canada, 10–13 May 2021; pp. 1–6.
13. Li, H.; Zheng, Z. Optimal Timing of Moving Target Defense: A Stackelberg Game Model. In Proceedings of the MILCOM 2019—2019 IEEE Military Communications Conference (MILCOM), Norfolk, VA, USA, 12–14 November 2019; pp. 1–6.
14. Feng, Z.; Ren, G.; Chen, J.; Zhang, X.; Luo, Y.; Wang, M.; Xu, Y. Power Control in Relay-Assisted Anti-Jamming Systems: A Bayesian Three-Layer Stackelberg Game Approach. IEEE Access 2019, 7, 14623–14636.
15. Kar, D.; Nguyen, T.H.; Fang, F.; Brown, M.; Sinha, A.; Tambe, M.; Jiang, A.X. Trends and Applications in Stackelberg Security Games. In Handbook of Dynamic Game Theory; Basar, T., Zaccour, G., Eds.; Springer: Cham, Switzerland, 2016.
16. Tambe, M. Security and Game Theory: Algorithms, Deployed Systems, Lessons Learned; Cambridge University Press: Cambridge, MA, USA, 2011.
17. Paruchuri, P.; Pearce, J.; Marecki, J.; Tambe, M.; Ordonez, F.; Kraus, S. Playing Games for Security: An Efficient Exact Algorithm for Solving Bayesian Stackelberg Games. In Proceedings of the International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), Estoril, Portugal, 12–16 May 2008; pp. 895–902.
18. Hohzaki, R.; Nagashima, S. A Stackelberg equilibrium for a missile procurement problem. Eur. J. Oper. Res. 2009, 193, 238–249.
19. Sinha, A.; Fang, F.; An, B.; Kiekintveld, C.; Tambe, M. Stackelberg Security Games: Looking Beyond a Decade of Success. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI-18), Stockholm, Sweden, 13–19 July 2018; pp. 5494–5501.
Figure 1. Plots of $D_2$, $D_3$, and $D_4$ vs. $D_1$ (Range: 0.06 to 0.0636).
Figure 2. The B team's payoff as given by Equation (32) when $\Delta_1 > 0$ and for the values of $A_1$, $A_2$, $A_4$ that satisfy the constraints $0 \le A_1, A_2, A_4 \le 1$ and $0 \le A_1 + A_2 + A_4 \le 1$. The payoff is maximized at the value of 0.371 for $A_{1,2,4} = 0$.
Figure 3. The plot between $\Pi_B$ and $\Pi_R$ when $\Delta_1 > 0$ and for values of $A_1$, $A_2$, $A_4$ that satisfy the constraints $0 \le A_1, A_2, A_4 \le 1$ and $0 \le A_1 + A_2 + A_4 \le 1$.
Figure 4. The B team's payoff as given by Equation (37) when $\Delta_1 \le 0$ and for the values of $A_1$, $A_2$, $A_4$ that satisfy the constraints $0 \le A_1, A_2, A_4 \le 1$ and $0 \le A_1 + A_2 + A_4 \le 1$. The payoff is maximized at the value of 2.47 for $A_{1,2,4} = 0$.
Figure 5. The plot between $\Pi_B$ and $\Pi_R$ when $\Delta_1 \le 0$ and for values of $A_1$, $A_2$, $A_4$ that satisfy the constraints $0 \le A_1, A_2, A_4 \le 1$ and $0 \le A_1 + A_2 + A_4 \le 1$.

