1. Introduction
As strategic-form games, bimatrix games are used for analyzing the strategic interaction between risk-neutral players. Different types of strategies, including mixed strategies and pure strategies, are often adopted to optimize the payoffs of two risk-neutral players. The payoff matrices in a bimatrix game are determined by the two players’ strategic profiles. Bimatrix games can be easily generalized to n-person non-cooperative games, which means that bimatrix games can exactly model the real decision-making problem. In particular, bimatrix games play key roles in numerous decision-making scenarios when bimatrix games are applied to real-game situations. For each player, it is not easy to exactly know the payoffs of the other player in a bimatrix game. Therefore, it is necessary to explore bimatrix games under uncertain environments.
Classical bimatrix games were investigated based on the expected utility (ET) ([
1,
2,
3,
4]). Although the ET dominates the analysis of bimatrix games, a large amount of evidence shows that the ET cannot adequately characterize or predict the behavior of humans ([
5,
6,
7]). Prospect theory (PT), proposed by [
8], is regarded as a substitute approach to the ET. PT confirms risk aversion for gains. For instance, 80% of respondents would rather choose a certain outcome of three thousand dollars than opt for an 80% chance of four thousand dollars and a 20% chance of nothing ([
6]). This phenomenon is called risk aversion. Nevertheless, scholars have paid little attention to the effects of risk aversion in the game-theoretic field, except for bargaining theory.
In classical game theory, all payoffs in bimatrix games are common knowledge to the two players ([
9,
10]). In real-game problems, however, players usually lack sufficient information about the payoffs for themselves and other players ([
11]) because of the complex relationships between differently strategic selections and their complicated impacts on the payoffs. This results in the phenomenon of uncertainty in the payoffs. The uncertainties in the payoffs are presented as the following two types: random payoffs (RPs) as well as fuzzy payoffs (FPs). Ref. [
12] introduced a Bayesian game model (BGM) to address the issues with random payoffs in the game-theoretic field. Ref. [
13] investigated zero-sum game problems with RPs by using chance-constrained programming. Ref. [
14] examined zero-sum games with RPs by adopting non-convex mathematical programming. Ref. [
15] proposed a disordered systems-based model for investigating bimatrix games with random payoffs. Ref. [
16] developed statistical mechanics of neural networks for solving zero-sum games with RPs. Ref. [
17] applied Cauchy distribution to explore coordination games with RPs. The above works investigated zero-sum games with both players under the hypothesis that plenty of available historical data can be used to adequately characterize RPs. However, BGM fails to adequately model the situation in which few historical data are available in game problems. As a result, the fuzzy game (FG) was introduced by adopting fuzzy set theory developed by [
18]. Ref. [
19] investigated zero-sum games with FPs. Ref. [
20] considered a bimatrix game with FPs by using possibility and necessity measures. Ref. [
21] investigated matrix games with FPs and multi-objectives. Ref. [
22] adopted minimax equilibria to examine zero-sum games with FPs. Ref. [
23] presented an equilibrium to solve a zero-sum game with FPs. Ref. [
24] investigated a zero-sum game with FPs in an asymmetric fuzzy environment. Refs. [
24,
25,
26,
27,
28,
29] presented an equilibrium in a zero-sum game with triangular FPs, interval FPs, triangular intuitionistic FPs, and trapezoidal intuitionistic FPs, respectively. Ref. [
30] examined the PN equilibrium of matrix games with FPs based on possibility and necessity measures. Differing from the above works that focused on matrix games with risk-neutral players, our work concentrates on bimatrix games with risk-aversion players.
Here, we examine the impact of risk aversion on bimatrix games with symmetric triangular FPs (STFPs). Our work explores the relationship between the (
α,
β)-risk aversion Nash equilibrium ((
α,
β)-RANE) and mixed-strategy Nash equilibrium (MSNE) in bimatrix games with certain payoffs. In a bimatrix game with STFPs, let player 2 be the risk-averse one and player 1 be the risk-neutral one. We suppose that there exists the same support for the MSNE in the two cases—one with a risk-neutral player 2 and the other with a risk-averse player 2. This assumption has been used by [
9]. As a result, player 2’s equilibrium strategy as well as player 1’s expected payoff cannot be changed. This implies the fact that a player in an MSNE is indifferent between the pure strategies played at a positive probability in a bimatrix game with certain payoffs. Finally, the impacts of risk aversion are explored in a 2 × 2 bimatrix game with STFPs.
The rest of our work is arranged as follows. In
Section 2, related definitions as well as notations are reviewed briefly. In
Section 3, we introduce the solution concept of the (
α,
β)-RANE, which is reviewed in bimatrix games with FPs, and explore the relationship between this equilibrium and the MSNE in a bimatrix game with certain payoffs. In
Section 4, the impacts of risk aversion are explored in a 2 × 2 bimatrix game with STFPs.
Section 5 concludes the work.
2. Preliminaries
For the vectors and , we have (or ) if and only if (or ), where Rk is the Euclidian k-space and i = 1, 2, …, k.
Definition 1. Let denote a fuzzy number and R denote the real number space; then, is a fuzzy subset F of R with a membership function, i.e., : R→[0, 1], where the membership function satisfies the following three conditions:
- (i)
has a center that is denoted by a real number c, then ;
- (ii)
is upper semi-continuous and quasi-concave;
- (iii)
the support supp() of is compact, where .
Definition 2 ([
22])
. Given the real number m as well as the positive number h, a fuzzy number is referred to as a symmetric triangular fuzzy number (STFN) if its membership function satisfies the following:where m is the center parameter of and h is the deviation parameter of . Any STFN is denoted by .
Definition 3 ([
22])
. Given a real number and two STFNs, and , then and , where Definition 4 ([
22])
. Given an STFN and a real number , then the set is referred to as an α-cut of . And , where and .
For any STFN
, by using the operations of intervals that are introduced by [
6], then
Given the STFNs
and
, by adopting the extension principles that are developed by [
18,
31], then
and
if
c > 0.
Definition 5 ([
32])
. Given the STFNs and , then ; therefore, there exist the following relationships between and :
To describe the inequality relationships between any two STFNs, [
33] introduced two indices, as shown in Lemma 1. The reason why our work adopts the indices developed by [
33] is that [
33] has been cited by a large number of scholars. The number of citations amounts to 53 (see Google Scholar). And [
33] has been cited by top journals in the fields of operations research and decision making, such as
Fuzzy Sets and Systems and
The European Journal of Operational Research. These have proven the effectiveness of the indices developed by [
33,
34].
Lemma 1 ([
33,
34])
. Given the STFNs and , then 3. Fuzzy Bimatrix Games
For a bimatrix game with players
I and
J, the pure strategy set for player
I is denoted by
L′ = {1,2,…,
l} and player
J’s is denoted by
K = {1, 2,…,
k}. Then, player
I’s mixed strategy set is as follows:
and player
I’s mixed strategy set is as follows:
Let any two STFNs and denote the payoffs of both players when playing the pure strategy profile (i, j), respectively; then, and are the payoff matrices of players I and J, respectively. A bimatrix game with STFPs is denoted by .
Definition 6 ([
20])
. A strategy profile is referred to as an (α, β)-Nash equilibrium ((α, β)-NE) in G1 if satisfies the following: and a point is referred to as an ()-equilibrium expected payoff vector. Let G3 denote .
Lemma 2 shows a sufficient and necessary condition (SNC) that there exists an (α, β)-NE in G1.
Lemma 2 ([
20])
. For any , in G1 there exist at least one ()-NE. Moreover, in G1, is an (α, β)-NE if and only if there exists the optimal solution in the programming model, as follows:where and . Let and .
From [
35], we derive Lemma 3, which indicates the relationships between the (
α,
β)-NE in a bimatrix game with STFPs and the MSNE in a bimatrix game with certain payoffs.
Lemma 3. Given any , is an (α, β)-NE in G1 if and only if it is an MSNE in the game .
In many situations, however, players often show risk-averse preferences. To characterize such preferences, let φ be a strictly increasing and concave function on [min{}, max{}]. Then, the matrix with element φ () is denoted by φ (). Therefore, there exists a new game . In G3, player 2 is more risk averse compared with player 2 in G2. Let (, y*) denote the MSNE in G3. To perform a meaningful comparison, there exists a same support for x* and , i.e., ⇔ .
Definition 7. A strategy profile is referred to as an (α, β)-risk aversion NE ((α, β)-RANE) in G3 if satisfies the following:and a point is referred to as an ()-equilibrium expected payoff vector. From Lemma 2, we can present an SNC that there exists an (α, β)-RANE in G3.
Corollary 1. For any in G1 with risk aversion, there exist at least one ()-RANE. Moreover, in G3, is an (α, β)-RANE if and only if there exists the optimal solution in the programming model, as follows:where and . Let and .
Similar to Lemma 3, we need to indicate the relationships between the (α, β)-RANE in a bimatrix game with STFPs and the MSNE in a bimatrix game with risk aversion for player 2 and certain payoffs, as shown in Corollary 2.
Corollary 2. Given any , is an (α, β)-RANE in G1 with risk aversion if and only if it is the MSNE in the game .
4. The (α, β)-NE in 2 × 2 Fuzzy Bimatrix Games with Risk Aversion
When
l =
k = 2, given a bimatrix game with STFPs,
, where
For i, j = 1, 2, let and .
Given the real numbers , following Lemma 2, there exists the ()-NE in G1 by deriving the MSNE in the game , where and . In G2, there exists a mixed-strategy profile (x, y), where and . Let (x*, y*) denote the MSNE in G2, where and .
When player 2 is risk averse, let φ be a strictly increasing and concave function on [min{}, max{}]. Then, the matrix with element φ () is denoted by φ (). Therefore, there exists new game . Given the real numbers , following Corollary 1, there exists the ()-RANE in G1 by deriving the MSNE in the game . In G3, there exists a mixed-strategy profile (, y), where and . Let (, y*) denote the MSNE in G3, where and .
Player 1 knows that player 2’s utility function depends on a concave and increasing transformation (i.e., player 2’s risk-aversion level). That is, one can regard the payoff matrices and φ () as utilities of monetary outcomes. The monetary outcomes of player 2 are known, but their utilities are not known by player 1.
To explore the impacts of risk aversion in
G1, following [
35], for player
J, there exist the following assumptions:
and
It follows from (19) and Lemma 1 that
From (20) and Lemma 1, we have that
And from (21) and Lemma 1, we have that
This leaves the following three exhaustive cases:
- Case i.
;
- Case ii.
and ;
- Case iii.
.
Let (
x*,
y*) be an (
α,
β)-NE in
G1. Then, according to Lemma 2, (
x*,
y*) is the MSNE in
G2, where
Theorem 1. Let (x*, y*) be an (α, β)-NE in G1 and an MSNE in G2, and let (, y*) be an (α, β)-RANE in G1 with risk aversion and an MSNE in G3. Then, player 2 benefits from risk aversion at (x*, y*) and (, y*) if .
Proof. For Case i, since
, we obtain the following from Lemma 1:
Combining (16)–(18), we then obtain the following:
From Definition 5, it follows that
The function
φ is normalized such that
φ (
) =
and
φ (
) =
. Since
φ is a strictly increasing and concave function, then (
) ≥
and
φ (
) ≥
. Then, for (
,
y*) in
G3, we have
Since
, then
Thus, by (20) and (21), we obtain the following:
This implies that player 2 benefits from risk aversion in Case i. □
Theorem 2. Let (x*, y*) be the (α, β)-NE in G1 and MSNE in G2, and let (, y*) be the (α, β)-RANE in G1 with risk aversion and the MSNE in G3. Then, player 2 benefits from risk aversion at (x*, y*) and (, y*) if .
Proof. For Case ii, since
and
, from Lemma 1, we obtain the following:
Combining (16), (17), and (23), then
From Definition 5, it follows that
The function
φ is normalized such that
φ (
) =
and
φ (
) =
. Since
φ is a strictly increasing and concave function, then
φ (
) ≥
and
φ (
) ≥
. Then, for (
,
y*) in
G3, we have
Since
, then
Thus, from (25) and (26), then
This implies that player 2 benefits from risk aversion in Case ii. □
Theorem 3. Let (x*, y*) be an (α, β)-NE in G1 and MSNE in G2, and let (, y*) be an (α, β)-RANE in G1 with risk aversion and the MSNE in G3.
- (I)
For the games G2 and G3, where φ (x) = x for all x ≤ and , then player 2 benefits from risk aversion at (x*, y*) and (, y*) if ;
- (II)
For the games G2 and G3, where φ (x) = x for all x ≥ and , then player 2 is hurt by risk aversion at (x*, y*) and (, y*) if .
Proof. For Case iii, since
, by Lemma 1, we obtain the following:
Combining (16), (17), and (28), then
From Definition 5, it follows that
For the function φ, two cases are shown as follows:
- (i)
φ satisfies φ (x) = x for all x ≤ and ;
- (ii)
φ satisfies φ (x) = x for all x ≥ and .
For (i) and for (
,
y*) in
G3, then
For (ii) and for (
,
y*) in
G3, then
On the other hand, since , the two cases are as follows:
- (a)
();
- (b)
.
On the other hand, for Case iii, player 2 benefits from risk aversion or is hurt by risk aversion, which depends on
y*;
is shown as follows:
If
, then, by (30) and (a), we obtain the following:
This implies that risk aversion is conducive to player 2.
If
, then, by (31) and (b), we obtain the following:
which means that player 2 is hurt by risk aversion. □
Although the method of reasoning in our work is similar to [
35], one of the differences is that our work examines how risk aversion affects player 2 and not player 1, while [
35] explores the impacts of loss aversion on both players. The reasons are as follows. To perform a meaningful comparison, we suppose that there exists the same support of the MSNE in the following two cases—one with a risk-neutral player 2 and the other with a risk-averse player 2. As result, player 2’s equilibrium strategy as well as player 1’s expected payoff cannot be changed. This implies the fact that a player in an MSNE is indifferent between the pure strategies played at a positive probability in a bimatrix game with certain payoffs.