Next Article in Journal
On Compressed Sensing of Binary Signals for the Unsourced Random Access Channel
Next Article in Special Issue
Reducing Cognitive Effort in Scoring Negotiation Space Using the Fuzzy Clustering Model
Previous Article in Journal
Biases and Variability from Costly Bayesian Inference
Previous Article in Special Issue
Efficiency of Classical and Quantum Games Equilibria
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Non-Classical Rules in Quantum Games

by
Piotr Frąckiewicz
Institute of Exact and Technical Sciences, Pomeranian University in Słupsk, ul. Arciszewskiego 22d, 76-200 Słupsk, Poland
Entropy 2021, 23(5), 604; https://doi.org/10.3390/e23050604
Submission received: 3 April 2021 / Revised: 30 April 2021 / Accepted: 5 May 2021 / Published: 13 May 2021
(This article belongs to the Special Issue Decision Making, Classical and Quantum Optimization Methods)

Abstract

:
Over the last twenty years, quantum game theory has given us many ideas of how quantum games could be played. One of the most prominent ideas in the field is a model of quantum playing bimatrix games introduced by J. Eisert, M. Wilkens and M. Lewenstein. The scheme assumes that players’ strategies are unitary operations and the players act on the maximally entangled two-qubit state. The quantum nature of the scheme has been under discussion since the article by Eisert et al. came out. The aim of our paper was to identify some of non-classical features of the quantum scheme.

1. Introduction

The scheme defined by J. Eisert, M. Wilkens and M. Lewenstein [1] was one of the first formal protocols of playing quantum games, and is definitely one of the most used schemes for quantum games. This conclusion is confirmed by the number of citations of the article (around 500 citations according to Web of Knowledge). The scheme generalizes a 2 × 2 game in the sense that the game generated by the Eisert–Wilkens–Lewenstein (EWL) scheme with unitary strategies restricted to some type of one-parameter operators is equivalent to the classical game. The seminal paper [1] and the subsequent papers [2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18] are just a very smart part of the huge literature devoted to the EWL scheme. It was shown in [1] that a quantum way of playing the Prisoner’s Dilemma game can lead to a reasonable and Pareto efficient outcome. Further research has shown, for example, that players can benefit from the use of quantum strategies in symmetric 2 × 2 games [3]. The Eisert–Wilkens–Lewenstein (EWL) scheme can also be extended to consider extensive-form games [4]. It was also shown that the EWL scheme can be implemented with a quantum computer [5,6].
Despite the significance of the scheme in the development of quantum game theory, doubts arise as to quantum nature of the EWL game. These concerns include the following:
  • Does the quantum solution provided by the EWL scheme really solve the input classical game? (to what extent the quantum solution solves the underlying classical game).
  • Can the quantum solution be obtained in a classical game? (to what extent the quantum solution is really quantum mechanical in that it cannot be achieved classically).
These questions were raised in [19]. By considering the Prisoner’s Dilemma game, the authors come to the conclusion that the EWL scheme does not imply a quantum mechanical game. Moreover, according to [19], the solution (Nash equilibrium) resulting from playing the EWL game does not appear to solve the original game.
Recently, there have been discussions about van Enk and Pike’s arguments. It is claimed in [20] that the EWL approach to the Hawk–Dove game enables the players to obtain a game result that is not achievable in the classical game. As a result, it was concluded in [20] that a quantum game cannot be fully modeled by the classical game. Shortly after appearing [20], B. Groisman [21] suggested that the scheme used by N. Vyas and C. Benjamin changes the rules of the original game. Hence, the author stated that the solution provided in [20] cannot be treated as a quantum extension of the classical game.
In light of the above, it can be seen that the problem of quantumness of the EWL scheme is not resolved. The purpose of this article is, on the one hand, to show that the form of the scheme considered in [18,19,21] does not fully describe the EWL scheme, on the other hand, to draw attention to other non-classical properties of the scheme.

2. Preliminaries on Game Theory

This section is based on [22,23]. We review relevant material connected with the notion of strategic-form games and payoff regions in those games.
The basic model of games studied in game theory is a game in strategic form.
Definition 1
([22]). A game in strategic form (or in normal form) is an ordered triple
( N , ( S i ) i N , ( u i ) i N ) ,
in which
  • N = { 1 , 2 , , n } is a finite set of players;
  • S i is the set of strategies of player i, for every player i N ;
  • u i : S 1 × S 2 S n R is a function associating each vector of strategies s = ( s i ) i N with the payoff u i ( s ) to player i, for every player i N .
In the case of a finite two-person game, i.e., N = { 1 , 2 } , S 1 = { 0 , 1 , , m 1 } , S 2 = { 0 , 1 , , r 1 } , the game can be written as a bimatrix with entries ( u 1 ( s ) , u 2 ( s ) ) ,
    0 1 r 1 0 1 m 1 ( ( a 00 , b 00 ) ( a 10 , b 10 ) ( a m 1 , 0 , b m 1 , 0 ) ( a 01 , b 01 ) ( a 11 , b 11 ) ( a m 1 , 1 , b m 1 , 1 ) ( a 0 , r 1 , b 0 , r 1 ) ( a 1 , r 1 , b 1 , r 1 ) ( a m 1 , r 1 , b m 1 , r 1 ) ) .
The elements of S i are called the pure strategies of player i. The set of pure strategy vectors (profiles) is i = 1 n S i . A mixed strategy of player i is a probability distribution over S i . We denote the set of mixed strategies of player i by Δ ( S i ) . The set of mixed strategy profiles is i = 1 n Δ ( S i ) . In particular, if S i = { s 0 i , s 1 i } , player i’s set of mixed strategies will be denoted by
{ [ p 1 ( s 0 i ) , p 2 ( s 1 i ) ] : p 1 0 , p 2 0 , p 1 + p 2 = 1 } .
A correlated strategy is a probability distribution over i = 1 n S i . The set of correlated strategies is denoted by Δ ( i = 1 n S i ) .
Let u i : i = 1 n S i R be the payoff function of player i in ( N , ( S i ) i N , ( u i ) i N ) . Then the payoff functions u i : i = 1 n Δ ( S i ) R and u i : Δ ( i = 1 n S i ) R are defined by the expected values of u i : i = 1 n S i R determined by mixed strategies σ i = 1 n Δ ( S i ) and probability distributions over i = 1 n S i , respectively. Let us define the vector-valued payoff function u : i = 1 n S i R n by u ( s ) = ( u 1 ( s ) , , u n ( s ) ) , s i = 1 n S i .
Definition 2
([23]). Let ( N , ( S i ) i N , ( u i ) i N ) be a finite strategic-form game. The ranges
R pu = u i = 1 n S i , R nc = u i = 1 n Δ ( S i ) , R co = u Δ i = 1 n S i
are called the pure-payoff region, the non-cooperative payoff region and the cooperative payoff region, respectively.
The notion of Nash equilibrium is one of the most important solution concepts in non-cooperative game theory. It defines a strategy vector at which each strategy is a best reply to the strategies of the other players.
Definition 3
([22]). A strategy vector s * = ( s 1 * , s 2 * , , s n * ) is a Nash equilibrium if for each player i N and each strategy s i S i the following is satisfied:
u i ( s * ) u i ( s i , s i * ) ,
where s i * = ( s 1 * , , s i 1 * , s i + 1 * , , s n * ) .
In particular, if a strategic form game is described in bimatrix form, Nash equilibrium can be defined as follows:
Definition 4.
A position ( i , j ) in a bimatrix game (2) is a Nash equilibrium if
a i j a k j for all k { 0 , 1 , , m 1 }
and
b i j b i l for all l { 0 , 1 , , r 1 } .

3. The Eisert–Wilkens–Lewenstein Scheme

The Eisert–Wilkens–Lewenstein (EWL) scheme is a model of a normal-form framework. It concerns bimatrix 2 × 2 games—two person strategic form games with two-element sets of strategies that can be written as
    s 0 2 s 1 2 s 0 1 s 1 1 ( ( a 00 , b 00 ) ( a 10 , b 10 ) ( a 01 , b 01 ) ( a 11 , b 11 ) ) .
In the EWL scheme, players’ strategies are unitary operators that each of two players acts on a maximally entangled quantum state. In the literature there are a few descriptions of the EWL scheme that are strategically equivalent. In what follows, we recall the general n-person scheme we adapted for the purpose of our research. For more details of the schematic description, see Figure 1 in [24].
Definition 5
([13]). Let us consider a strategic game Γ = ( N , ( S i ) i N , ( u i ) i N ) with S i = { s 0 i , s 1 i } for each i N . The Eisert–Wilkens–Lewenstein approach to game Γ is defined by a triple Γ E W L = ( N , ( D i ) i N , ( v i ) i N ) , where
  • N = { 1 , 2 , n } is the set of players.
  • D i is a set of unitary operators from SU ( 2 ) . A possible parameterization of U SU ( 2 ) is
    U i ( θ i , α i , β i ) = e i α i cos θ i 2 i e i β i sin θ i 2 i e i β i sin θ i 2 e i α i cos θ i 2 , θ i [ 0 , π ] , α i , β i [ 0 , 2 π ) .
  • v i : D 1 D 2 D n R is a payoff function given by
    v i i = 1 n U i ( θ i , α i , β i ) = tr ( | Ψ Ψ | M i ) ,
    where
    | Ψ = J i = 1 n U i ( θ i , α i , β i ) J | 0 n , J = ( 𝟙 n + i σ x n ) / 2 , M i = j 1 , , j n { 0 , 1 } a j 1 , , j n i | j 1 , , j n j 1 , , j n | ,
    and a j 1 , , j n i R are payoffs of player i in Γ given by equation a j 1 , , j n i = u i ( s j 1 1 , , s j n n ) .
In particular, the EWL approach to a 2 × 2 game (2) results in the following vector-valued payoff functions:
v ( U 1 ( θ 1 , α 1 , β 1 ) , U 2 ( θ 2 , α 2 , β 2 ) ) = ( a 00 , b 00 ) cos ( α 1 + α 2 ) cos θ 1 2 cos θ 2 2 + sin ( β 1 + β 2 ) sin θ 1 2 sin θ 2 2 2 + ( a 01 , b 01 ) sin ( α 2 β 1 ) sin θ 1 2 cos θ 2 2 + cos ( α 1 β 2 ) cos θ 1 2 sin θ 2 2 2 + ( a 10 , b 10 ) cos ( α 2 β 1 ) sin θ 1 2 cos θ 2 2 + sin ( α 1 β 2 ) cos θ 1 2 sin θ 2 2 2 + ( a 11 , b 11 ) cos ( β 1 + β 2 ) sin θ 1 2 sin θ 2 2 sin ( α 1 + α 2 ) cos θ 1 2 cos θ 2 2 2 .

4. Problem of Classical Strategies in the EWL Scheme

The EWL scheme constitutes a generalization of the classical way of playing the game. It is known that the EWL game becomes equivalent to the classical one by restricting the unitary strategy sets of the players. In the case of a bimatrix game (2), the scheme
Γ E W L = ( { 1 , 2 } , ( D i ) i { 1 , 2 } , ( v i ) i { 1 , 2 } )
is equivalent to (2) if
D 1 = D 2 = { U ( θ , 0 , 0 ) θ [ 0 , π ] } .
If the players choose U 1 ( 2 arccos p , 0 , 0 ) , U 2 ( 2 arccos q , 0 , 0 ) { U ( θ , 0 , 0 ) θ [ 0 , π ] } , then the resulting payoff vector is of the form
v ( U 1 ( θ 1 , α 1 , β 1 ) , U 2 ( θ 2 , α 2 , β 2 ) ) = ( a 00 , b 00 ) p q + ( a 01 , b 01 ) p ( 1 q ) + ( a 10 , b 10 ) ( 1 p ) q + ( a 11 , b 11 ) ( 1 p ) ( 1 q ) .
This is the same as the payoff vector corresponding to a profile of classical mixed strategies
[ p ( s 0 1 ) , ( 1 p ) ( s 1 1 ) ] , [ q ( s 0 2 ) , ( 1 q ) ( s 1 2 ) ] .
On the other hand, player 1 and player 2’s classical mixed strategies in the EWL scheme can also be modeled by quantum operations
C p ( ρ ) = p 𝟙 ρ 𝟙 + ( 1 p ) U ( π , 0 , 0 ) ρ U ( π , 0 , 0 ) , C q ( ρ ) = q 𝟙 ρ 𝟙 + ( 1 q ) U ( π , 0 , 0 ) ρ U ( π , 0 , 0 ) ,
where ρ stands for a 2 × 2 density matrix. In other words, playing 𝟙 and U ( π , 0 , 0 ) with probability p and 1 p by player 1, and q and 1 q by player 2 results also in (15). Both ways (14) and (17) turn the EWL game into the classical one. However, the problem becomes more complex if at least one of the players has access to other unitary operations. The following examples show that the limitation to the probability distributions over the counterparts of classical pure strategies 𝟙 and U ( π , 0 , 0 ) and considering the EWL game as a 3 × 3 bimatrix game lose some of the non-classical features of the EWL scheme.
Example 1.
Let us consider the Matching Pennies game in terms of the EWL scheme. A common bimatrix form of that game is as follows:
      s 0 2 s 1 2 M P = s 0 1 s 1 1 ( ( 1 , 1 ) ( 1 , 1 ) ( 1 , 1 ) ( 1 , 1 ) ) .
One can easily show that game (18) has the unique mixed Nash equilibrium ( σ 1 * , σ 2 * ) , where σ 1 * = [ ( 1 / 2 ) ( s 0 1 ) , ( 1 / 2 ) ( s 1 1 ) ] and σ 2 * = [ ( 1 / 2 ) ( s 0 2 ) , ( 1 / 2 ) ( s 1 2 ) ] . Let us now extend game (18) to include the strategy U ( π / 2 , 0 , π / 2 ) for each player. By substituting θ 1 = θ 2 = π / 2 , α 1 = α 2 = 0 and β 1 = β 2 = π / 2 into (12), we get
v U π 2 , 0 , π 2 , U π 2 , 0 , π 2 = ( 0 , 0 ) .
The corresponding bimatrix is of the form
    𝟙 i X U π 2 , 0 , π 2 𝟙 i X U π 2 , 0 , π 2 ( ( 1 , 1 ) ( 1 , 1 ) ( 0 , 0 ) ( 1 , 1 ) ( 1 , 1 ) ( 0 , 0 ) ( 0 , 0 ) ( 0 , 0 ) ( 0 , 0 ) ) .
Among the Nash equilibria are the classical mixed Nash equilibrium
[ ( 1 / 2 ) ( 𝟙 ) + ( 1 / 2 ) ( i X ) ] , [ ( 1 / 2 ) ( 𝟙 ) + ( 1 / 2 ) ( i X ) ]
and non-classical Nash equilibria
U π 2 , 0 , π 2 , U π 2 , 0 , π 2 ,
[ ( 1 / 2 ) ( 𝟙 ) + ( 1 / 2 ) ( i X ) ] , U π 2 , 0 , π 2 ,
U π 2 , 0 , π 2 , [ ( 1 / 2 ) ( 𝟙 ) + ( 1 / 2 ) ( i X ) ] .
Let us now consider the EWL scheme with unitary strategies
D 1 = D 2 = { U ( θ , 0 , 0 ) : θ [ 0 , π ] } U π 2 , 0 , π 2 .
Combining (12) with (25) yields
v 1 ( U 1 U 2 ) = cos θ 1 cos θ 2 i f U 1 U 2 = U 1 ( θ 1 , 0 , 0 ) U 2 ( θ 2 , 0 , 0 ) , sin θ 1 i f U 1 U 2 = U 1 ( θ 1 , 0 , 0 ) U 2 π 2 , 0 , π 2 , sin θ 2 i f U 1 U 2 = U 1 π 2 , 0 , π 2 U 2 ( θ 2 , 0 , 0 ) , 0 i f U 1 U 2 = U 1 π 2 , 0 , π 2 U 2 π 2 , 0 , π 2 ,
and
v 2 ( U 1 U 2 ) = v 1 ( U 1 U 2 ) .
One can show that among (22)–(24), only strategy profile (23) is a Nash equilibrium in the game determined by (25)–(27). In the case of both profiles (22) and (24), player 2 obtains the payoff of 0, and she will get the payoff of 1 by choosing U ( π / 2 , 0 , 0 ) ,
v 2 U π 2 , 0 , π 2 , U π 2 , 0 , 0 = 1 .
In general, there is no pure Nash equilibrium in the game given by (25)–(27). Let us first note that the strategy profile U ( π / 2 , 0 , 0 ) U ( π / 2 , 0 , 0 ) is not a Nash equilibrium. Player 2 can benefit by a unilateral deviation:
1 = v 2 U π 2 , 0 , 0 , U π 2 , 0 , π 2 > v 2 U π 2 , 0 , 0 , U π 2 , 0 , 0 = 0 .
Since there is no other possible Nash equilibria in the set { U 1 ( θ 1 , 0 , 0 ) U 2 ( θ 2 , 0 , 0 ) } , a strategy profile in the form U 1 ( θ 1 , 0 , 0 ) U 2 ( θ 2 , 0 , 0 ) cannot be a Nash equilibrium in the set (25).
The last step is to show that neither U 1 ( θ 1 , 0 , 0 ) U 2 π 2 , 0 , π 2 nor U 1 π 2 , 0 , π 2 U 2 ( θ 2 , 0 , 0 ) constitute a Nash equilibrium. Player 1’s best reply to the strategy U 2 ( π / 2 , 0 , π / 2 ) is U 1 ( 0 , 0 , 0 ) or U 1 ( π , 0 , 0 ) when restricted to the set { U 1 ( θ 1 , 0 , 0 ) : θ 1 [ 0 , π ] } . Then player 2’s best reply to U 1 ( 0 , 0 , 0 ) and U 1 ( π , 0 , 0 ) is U 2 ( π , 0 , 0 ) and U 2 ( 0 , 0 , 0 ) , respectively. Therefore, a strategy profile U 1 ( θ 1 , 0 , 0 ) U 2 π 2 , 0 , π 2 is not a Nash equilibrium. The same conclusion can be drawn for U 1 π 2 , 0 , π 2 U 2 ( θ 2 , 0 , 0 ) . This shows that the 3 × 3 bimatrix form used to present the EWL scheme is not equivalent to the original scheme.
Example 2.
An equally interesting example is the Prisoner’s Dilemma game in the form studied in [1]:
    s 0 2 s 1 2 s 0 1 s 1 2 ( ( 3 , 3 ) ( 5 , 0 ) ( 0 , 5 ) ( 1 , 1 ) ) .
Let us extend the game in the same manner as (20). This gives
    𝟙 i X U 2 π 2 , 0 , π 2 𝟙 i X U 1 π 2 , 0 , π 2 ( ( 3 , 3 ) ( 5 , 0 ) ( 3 2 , 4 ) ( 0 , 5 ) ( 1 , 1 ) ( 3 2 , 4 ) ( 4 , 3 2 ) ( 4 , 3 2 ) ( 9 4 , 9 4 ) ) .
Adding U i π / 2 , 0 , π / 2 to the strategy sets of the players in game (30) results in two non-classical equilibria
( U 1 ( π / 2 , 0 , π / 2 ) , i X ) a n d ( i X , U 2 ( π / 2 , 0 , π / 2 ) ) .
Game (31) is not equivalent to one defined by strategy sets (25). We find that the strategy profiles (32) are no longer Nash equilibria in (25). We have
5 = v 1 U 1 π 2 , 0 , 0 , U 2 π 2 , 0 , π 2 > v 1 i X , U 2 π 2 , 0 , π 2 = 4
and
5 = v 2 U 1 π 2 , 0 , π 2 , U 2 π 2 , 0 , 0 > v 2 U 1 π 2 , 0 , π 2 , i X = 4 .
The above examples demonstrate that adding a single unitary strategy to the bimatrix-form game does not fully reflect non-classical features of the EWL scheme. The idea of replacing strategy sets of the form { U ( θ , 0 , 0 ) } with { 𝟙 , i X } written with the use of bimatrix form works if strategy set of each player is restricted to the one parameter set. Then a unitary strategy U ( 2 arccos p , 0 , 0 ) is outcome-equivalent to the mixed strategy [ p ( 𝟙 ) , ( 1 p ) ( i X ) ] . In general, when other unitary strategies are available, the equivalence does not hold. For example, since v 1 ( 𝟙 , U 2 ( π / 2 , 0 , π / 2 ) ) = v 1 ( i X , U 2 ( π / 2 , 0 , π / 2 ) ) = ( a 00 + a 10 ) / 2 for every bimatrix-form game (2), it follows that
v 1 [ p ( 𝟙 ) , ( 1 p ) ( i X ) ] , U 2 π 2 , 0 , π 2 = a 00 + a 10 2 .
In other words, playing any classical mixed strategy against U 2 π / 2 , 0 , π / 2 always results in the same payoff outcome. In the case of the strategy profile U 1 2 arccos p , 0 , 0 , U 2 ( π / 2 , 0 , π / 2 ) , we have
v 1 U 1 2 arccos p , 0 , 0 , U 2 π 2 , 0 , π 2 = 1 2 + p 1 p a 00 + 1 2 p 1 p a 10 .
A quick look at Equation (36) shows the interference terms ± p 1 p that are not part of the payoff function (35). That is the reason why we obtain different results depending on whether we use strategies of the form [ p ( 𝟙 ) , ( 1 p ) ( i X ) ] or the one parameter unitary operations extended with some type of two-parameter operator.

5. The EWL Scheme and the IBM Quantum Experience

In what follows, we provide the EWL approach implemented on the IBM quantum experience platform for strategy profiles ( U 1 ( π / 2 , 0 , π / 2 ) , U 2 ( π / 2 , 0 , 0 ) ) , ( U 1 ( π / 2 , 0 , π / 2 ) , 𝟙 ) and ( U 1 ( π / 2 , 0 , π / 2 ) , i X ) . The quantum circuits are adapted from [6]. First, we express unitary operators U 1 ( π / 2 , 0 , π / 2 ) and U 2 ( π / 2 , 0 , 0 ) ) in terms of the parameterization of unitary operators used in the IBM quantum circuit composer. Recall that the gates provided by IBM are defined as follows:
U 3 Q C ( θ , ϕ , λ ) = cos θ 2 e i λ sin θ 2 e i ϕ sin θ 2 e i ( λ + ϕ ) sin θ 2 , U 2 Q C ( ϕ , λ ) = U 3 Q C π 2 , ϕ , λ , U 1 Q C ( λ ) = U 3 Q C ( 0 , 0 , λ ) .
Thus,
U π 2 , 0 , π 2 = U 2 Q C ( π , π ) , U π 2 , 0 , 0 = U 2 Q C π 2 , π 2
According to [6], the entangling operator J and the disentangling operator J can be expressed in the form
J = CNOT · U 2 ( π / 2 , π / 2 ) · CNOT , J = CNOT · U 2 ( π / 2 , π / 2 ) · CNOT .
The quantum circuit is presented in Figure 1.
Although, it generates small errors, the IBM quantum computer (ibmq_vigo) outputs | 01 with probability close to one in the case of playing the strategy vector ( U ( π / 2 , 0 , π / 2 ) , U ( π / 2 , 0 , 0 ) ) or equivalently ( U 2 Q C ( π , π ) , U 2 Q C ( π / 2 , π / 2 ) ) (see Figure 2).
Comparing the histograms in Figure 2 with ones in Figure 3 and Figure 4 shows that U ( π / 2 , 0 , 0 ) has no counterpart in any probability distribution over 𝟙 and X. As a result of playing 𝟙 or X against U ( π / 2 , 0 , π / 2 ) , the final state | Ψ is | 00 or | 01 with equal probability.

6. Payoff Region of the EWL Quantum Game

Another advantage that makes the difference between the classical game and the EWL approach is possibility of obtaining payoff profiles which are in the complement of the non-cooperative payoff region. The Prisoner’s Dilemma game (PD) examined repeatedly with the use of the EWL scheme does not allow one to see that feature. The non-cooperative payoff region in the PD game is equal to the cooperative one (see Figure 5).
The players by using mixed strategies can obtain each payoff vector from the convex hull of the pure-payoff vectors. In general, it is clear that R p u R n c R c o (see Definition 2). As we show below, the extension of the classical strategies to unitary operators (9) makes the sets R p u , R n c , R c o equal in the EWL scheme. The Battle of the Sexes game is a typical example of inequality between the non-cooperative and cooperative payoff regions. Its bimatrix form can be written as
      s 0 2 s 1 2 B o S = s 0 1 s 1 1 ( ( 4 , 2 ) ( 0 , 0 ) ( 0 , 0 ) ( 2 , 4 ) ) .
In this case, the cooperative payoff region is a convex polygon determined by points ( 4 , 2 ) , ( 2 , 4 ) and ( 0 , 0 ) , and there is no mixed strategy profile from Δ ( S 1 ) × Δ ( S 2 ) that would determine the payoff outcome (3,3). The non-cooperative and cooperative payoff regions of (40) are shown in Figure 6.
The outcome (3, 3) can be easily achieved by the EWL scheme. From (12), it follows that
v U 1 0 , π 8 , U 2 0 , π 8 = 1 2 ( ( a 00 , b 00 ) + ( a 11 , b 11 ) ) = ( 3 , 3 ) .
In general, the cooperative payoff region of any 2 × 2 game can be already determined by pure strategy profiles of the two-parameter unitary strategies. We will prove this fact by using the well-known Carathéodory’s Theorem for convex hulls.
Theorem 1
(Carathéodory’s Theorem for convex hulls). Let A be a subset in R d . Suppose that x conv ( A ) . Then there exists a subset B of A of cardinality at most d + 1 such that x conv ( B ) .
In our case, Carathéodory’s Theorem states that every payoff vector from conv ( { ( a i j , b i j ) : i , j = 0 , 1 } ) can be represented as a convex combination of at most three payoff vectors from the pure-payoff region. That observation enables us to prove the following proposition:
Proposition 1.
The pure-payoff region in EWL approach
Γ E W L = ( { 1 , 2 } , ( U i ( θ 1 , α i , 0 ) ) i { 1 , 2 } , ( v i ) i { 1 , 2 } )
to a general 2 × 2 game is equal to the cooperative payoff region, i.e., R p u = R n c = R c o .
Proof. 
It is clear that the pure-payoff region of the classical game can be obtained in the EWL game since (10) coincides with the payoff function of the classical game if the unitary strategies are restricted to the set { U ( 0 , 0 , 0 ) , U ( π , 0 , 0 ) } .
Let us consider 0 λ 00 , λ 01 , λ 10 , λ 11 < 1 such that λ 00 + λ 01 + λ 10 + λ 11 = 1 . Then there are unitary strategy profiles that depend on λ i j and imply a general convex combination of any three pure-payoff profiles. Using (12) and assuming one of λ i j = 0 , we obtain
u U 1 0 , arccos λ 01 1 λ 00 , 0 , U 2 2 arccos λ 00 , arccos λ 01 1 λ 00 , 0   = λ 00 ( a 00 , b 00 ) + λ 01 ( a 01 , b 01 ) + λ 10 ( a 10 , b 10 ) .
u U 1 ( 0 , 0 , 0 ) , U 2 2 arccos 1 λ 01 , arccos λ 00 1 λ 01 , 0   = λ 00 ( a 00 , b 00 ) + λ 01 ( a 01 , b 01 ) + λ 11 ( a 11 , b 11 ) .
u U 1 0 , π 2 , 0 , U 2 2 arccos 1 λ 10 , arccos λ 11 1 λ 10 , 0   = λ 00 ( a 00 , b 00 ) + λ 10 ( a 10 , b 10 ) + λ 11 ( a 11 , b 11 ) .
u U 1 ( π , 0 , 0 ) , U 2 2 arccos 1 λ 11 , arccos λ 10 1 λ 11 , 0   = λ 01 ( a 00 , b 00 ) + λ 10 ( a 10 , b 10 ) + λ 11 ( a 11 , b 11 ) .
It follows from Theorem 1 that any payoff profile from conv ( { ( a i j , b i j ) : i , j = 0 , 1 } ) is achievable by the players’ pure strategies. In other words, the two-parameter pure strategies in the EWL scheme imply the cooperative payoff region of the corresponding 2 × 2 game. □

7. The EWL Scheme in Relation to van Pike–Enk’s Arguments

According to van Enk–Pike [19], the games written in the form (20) and (31) should not be seen as quantum games. They simply describe a 3 × 3 bimatrix game resulting from the addition of the third pure strategy to the original game. We showed in Section 4 that bimatrix form cannot fully describe the EWL game since strategies of the form { U ( θ , 0 , 0 ) θ [ 0 , π ] } are not equivalent to probability distributions over 𝟙 and U ( π , 0 , 0 ) . As a result, van Pike–Enk’s criticism, in fact, does not relate to the original EWL scheme (with continuum of strategies) but merely to a 3 × 3 bimatrix game with the payoffs calculated by the EWL scheme.
Still, it was noted in [19,21] that adding of another strategy to the classical game changes the rules of the game. Therefore, the outcome resulting from the new game cannot be treated as a solution of the original game. Now, we are going to show that not every extension of strategy sets of the players means changing the rules of the game, in particular, one conducted by unitary strategies in the EWL scheme. A typical example is a mixed extension of the game in which the players can choose probability distributions over their own sets of pure strategies. Let us recall the formal definition of mixed extension of a strategic-form game [22].
Definition 6.
Let G = ( N , ( S i ) i N ) be a strategic-form game (1) with finite strategy sets. Denote by S = S 1 × S 2 × × S n the set of pure strategy vectors. The mixed extension of G is the game
Γ = ( N , ( Σ i ) i N , ( u i ) i N ) ,
in which, for each i N , player i’s set of strategies is
Σ i = σ i : S i [ 0 , 1 ] : s i S i σ i ( s i ) = 1 ,
and her payoff function is the function
u i : Σ 1 × Σ 2 × × Σ n R ,
which associates each strategy vector σ = ( σ 1 , , σ n ) , σ i Σ i with the payoff
u i ( σ ) = ( s 1 , , s n ) S u i ( s 1 , , s n ) σ 1 ( s 1 ) σ 2 ( s 2 ) σ n ( s n ) .
Nash equilibrium is guaranteed in the mixed extension defined above [25]. Thus, mixed strategies enable the players to obtain a rational outcome that is not achievable in the set of pure strategy vectors. By using a mixed strategy, a player gets a better payoff in terms of the expected payoff (50). Although, it must be assumed that the payoff functions in G satisfy the von Neumann–Morgenstern axioms (see [22])—their payoff functions are linear in probabilities, it has nothing to do with breaking the rules of the game G. The result of the game G is always a pure strategy vector of G.
Similarly to the mixed extension, the EWL scheme can also be treated as an extension of G. The game generated by (13) is outcome-equivalent to the mixed extension of a 2 × 2 game if the unitary strategies are restricted to (14), and a wider range of unitary operators makes (13) a nontrivial generalization of (47). Both extensions require using additional resources to be implemented. One would require using some random device to play a mixed strategy. It could be a coin or dice in the case of simple mixed strategies and a random number generator in general. The unitary strategies, in turn, require using a quantum device. It is also worth noting that Formulas (10) and (50) are just the expected payoff functions. They are associated with specific probability distributions that are generated by the player’s mixed strategies and the final state | Ψ . By choosing mixed or unitary strategy, the players create a specific probability distribution over the pure outcomes. However, it is worth emphasizing that a mixed extension as well as the EWL approach always result in a pure strategy outcome of G. In the case of the EWL approach to a 2 × 2 game, the result of the quantum measurement on the final state (determined by the unitary strategies) is one of the four payoff outcomes related to the four pure strategy vectors of the classical game. As stated in [19], it would be perfect if the quantum scheme left the classical game unchanged and solved it using quantum operations. In our view, the EWL scheme meets this requirement.
Mixed and the EWL extensions of an n-person strategic-form game (with two-element strategy sets for the players) are summarized in the following table to point out the similarities of two ways of playing the game G.
Mixed extension Γ = ( N , ( Σ i ) i N , ( u i ) i N ) of a 2 × 2 × × 2 game
N = { 1 , 2 , , n }
Σ i = σ i : s 0 i , s 1 i [ 0 , 1 ] : σ i ( s 0 i ) + σ i ( s 1 i ) = 1
u i ( σ 1 , σ 2 , , σ n ) = j 1 , , j n { 0 , 1 } u i s j 1 1 , s j 2 2 , , s j n n σ 1 s j 1 1 σ 2 s j 2 2 σ n s j n n
The EWL extension Γ E W L = ( N , ( D i ) i N , ( v i ) i N ) of a 2 × 2 × × 2 game
N = { 1 , 2 , , n }
D i SU ( 2 ) = U ( θ , α , β ) : θ [ 0 , π ] , α , β [ 0 , 2 π )
v i ( U 1 , U 2 , , U n ) = j 1 , , j n { 0 , 1 } u i s j 1 1 , s j 2 2 , , s j n n | Ψ | j 1 , , j n | 2
To sum up, it is not obvious that playing the quantum game really changes the rules of the game if we look at a unitary operator as an extension of a mixed strategy. If so, it might as well state that using classical mixed strategies violate the rules of the game. The bimatrix games 3 × 3 in the form of (20) or (31) combine outcomes associated with classical pure strategies with one unitary strategy profile determined by the expected payoff function. This way differs significantly from the original scheme presented in [1] and cannot be used as an argument against the EWL scheme.

8. Conclusions

The work [1] was one of the first papers that launched the quantum game theory. From that moment on, the idea of [1] has been developed to cover other game theory problems that go beyond simple 2 × 2 games. The scheme introduced in [1] enables the players to obtain the expected payoff outcomes that are often not available when the classical mixed strategies are used. Still, there are doubts if a solution given by the EWL scheme is really of the quantum nature. Among a few comments, it was postulated that the EWL approach to a given game changes the rules of the game. For that reason, the solution provided by the EWL game should not concern the classical game under study.
In our opinion, the form of the EWL scheme presented in [1] can be regarded as a further generalization of the mixed extension of the game. In a particular case, the EWL approach coincides with the mixed extension since the type of one-parameter unitary operations can be viewed as a counterpart of a mixed strategy. Mixed and the EWL extensions of a game have many features in common that support our view. They both enable the players to obtain a specific probability mixtures of the outcomes and as a result, they generate expected payoff outcomes far beyond the pure-payoff region. Non-cooperative payoff region is associated with the mixed extension, and the full convex hull of pure-payoff vectors (i.e., a cooperative payoff region) is available when the players play the EWL extension of the game. At the same time, the result of the game from playing mixed and unitary strategies is always an outcome from pure-payoff region. Another thing is that both extensions have the same structure of strategic-form game. They are both defined by a set of players, sets of players’ strategies and the expected payoff functions.
We think that the EWL scheme does not change the rules of the bimatrix game. As in the case of mixed extension, the EWL extension allows the players to get new possibilities for choosing strategies in the classical game.

Funding

This research received no external funding.

Acknowledgments

This research was funded by the Pomeranian University in Słupsk. We thank the IBM Quantum team for making the IBM Quantum Experience.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Eisert, J.; Wilkens, M.; Lewenstein, M. Quantum games and quantum strategies. Phys. Rev. Lett. 1999, 83, 3077. [Google Scholar]
  2. Du, J.; Li, H.; Xu, X.; Zhou, X.; Han, R. Entanglement enhanced multiplayer quantum games. Phys. Lett. A 2002, 302, 229. [Google Scholar] [CrossRef] [Green Version]
  3. Flitney, A.P.; Hollenberg, L.C.L. Nash equilibria in quantum games with generalized two-parameter strategies. Phys. Lett. A 2007, 363, 381. [Google Scholar] [CrossRef] [Green Version]
  4. Frąckiewicz, P. Quantum information approach to normal representation of extensive games. Int. J. Quantum Inform. 2012, 10, 1250048. [Google Scholar] [CrossRef]
  5. Prevedel, R.; Stefanov, A.; Walther, P.; Zeilinger, A. Experimental realization of a quantum game on a one-way quantum computer. New J. Phys. 2007, 9, 205. [Google Scholar] [CrossRef]
  6. Narula, H.; Islam, M.d.S.; Behera, B.K.; Panigrahi, P.K. Designing Circuits for Quantum Games with IBM’s Quantum Experience. 2019. Available online: https://www.researchgate.net/publication/338165193_Designing_Circuits_for_Quantum_Games_with_IBM%27s_Quantum_Experience (accessed on 8 May 2021).
  7. Du, J.; Li, H.; Xu, X.; Zhou, X.; Han, R. Phase-transition-like behaviour of quantum games. J. Phys. A Math. Gen. 2003, 36, 6551. [Google Scholar] [CrossRef] [Green Version]
  8. Nawaz, A.; Toor, A.H. Generalized quantization scheme for two-person non-zero sum games. J. Phys. A Math. Gen. 2004, 37, 11457. [Google Scholar] [CrossRef]
  9. Kay, R.; Johnson, N.F.; Benjamin, S.C. Evolutionary quantum game. J. Phys. A Math. Gen. 2001, 34, L547. [Google Scholar] [CrossRef] [Green Version]
  10. Landsburg, S.E. Nash equilibria in quantum games. Proc. Am. Math. Soc. 2011, 139, 4423. [Google Scholar] [CrossRef]
  11. Chen, K.Y.; Hogg, T. How Well Do People Play a Quantum Prisoner’s Dilemma? Quantum Inf. Process. 2006, 5, 43–67. [Google Scholar] [CrossRef]
  12. Li, Q.; Iqbal, A.; Chen, M.; Abbott, D. Quantum strategies win in a defector-dominated population. Physical A 2012, 391, 3316. [Google Scholar] [CrossRef] [Green Version]
  13. Frąckiewicz, P. Strong isomorphism in Eisert-Wilkens-Lewenstein type quantum game. Adv. Math. Phys. 2016, 2016, 4180864. [Google Scholar] [CrossRef]
  14. Frąckiewicz, P. Quantum games with unawareness. Entropy 2018, 20, 555. [Google Scholar] [CrossRef] [Green Version]
  15. Flitney, A.P.; Schlosshauer, M.; Schmid, C.; Laskowski, W.; Hollenberg, L.C.L. Equivalence between Bell inequalities and quantum minority games. Phys. Lett. A 2009, 373, 521–524. [Google Scholar] [CrossRef] [Green Version]
  16. Zhang, P.; Zhou, X.Q.; Wang, Y.L.; Liu, B.H.; Shadbolt, P.; Zhang, Y.S.; Gao, H.; Li, F.L.; O’Brien, J.L. Quantum gambling based on Nash-equilibrium. NPJ Quantum Inf. 2017, 3, 24. [Google Scholar] [CrossRef] [Green Version]
  17. Cheng, H.M.; Luo, M.X. Tripartite Dynamic Zero-Sum Quantum Games. Entropy 2021, 23, 154. [Google Scholar] [CrossRef] [PubMed]
  18. Bolonek-Lasoń, K.; Kosiński, P. Mixed Nash equilibria in Eisert-Lewenstein-Wilkens (ELW) games. J. Phys. Conf. Ser. 2017, 804, 012007. [Google Scholar] [CrossRef] [Green Version]
  19. Van Enk, S.J.; Pike, R. Classical rules in quantum games. Phys. Rev. A 2002, 66, 024306. [Google Scholar] [CrossRef] [Green Version]
  20. Vyas, N.; Benjamin, C. Negating van Enk-Pike’s assertion on quantum games OR Is the essence of a quantum game captured completely in the original classical game? arXiv 2017, arXiv:1701.08573v2. [Google Scholar]
  21. Groisman, B. When quantum games can be played classically: In support of van Enk-Pike’s assertion. arXiv 2018, arXiv:1802.00260. [Google Scholar]
  22. Maschler, M.; Solan, E.; Zamir, S. Game Theory; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
  23. Tu, Y.S.; Juang, W.T. The payoff region of a strategic game and its extreme points. arXiv 2017, arXiv:1705.01454. [Google Scholar]
  24. Szopa, M. Efficiency of classical and quantum games equilibria. Entropy 2021, 23, 506. [Google Scholar] [CrossRef] [PubMed]
  25. Nash, J. Non-cooperative games. Ann. Math. 1951, 54, 286. [Google Scholar] [CrossRef]
Figure 1. Quantum circuit for the EWL scheme. Qubits q [ 1 ] and q [ 0 ] are identified with the first and second qubit, respectively. Player 1 acts on q [ 1 ] with U 2 Q C ( π , π ) , player 2 acts on q [ 0 ] with U 2 Q C ( π / 2 , π / 2 ) , which corresponds to the strategy profile ( U ( π / 2 , 0 , π / 2 ) , U ( π / 2 , 0 , 0 ) ) in the EWL approach.
Figure 1. Quantum circuit for the EWL scheme. Qubits q [ 1 ] and q [ 0 ] are identified with the first and second qubit, respectively. Player 1 acts on q [ 1 ] with U 2 Q C ( π , π ) , player 2 acts on q [ 0 ] with U 2 Q C ( π / 2 , π / 2 ) , which corresponds to the strategy profile ( U ( π / 2 , 0 , π / 2 ) , U ( π / 2 , 0 , 0 ) ) in the EWL approach.
Entropy 23 00604 g001
Figure 2. The histogram showing the result of the quantum measurement (backend: ibmq_vigo) corresponding to ( U ( π / 2 , 0 , π / 2 ) , U ( π / 2 , 0 , 0 ) ) in the EWL approach.
Figure 2. The histogram showing the result of the quantum measurement (backend: ibmq_vigo) corresponding to ( U ( π / 2 , 0 , π / 2 ) , U ( π / 2 , 0 , 0 ) ) in the EWL approach.
Entropy 23 00604 g002
Figure 3. The histogram showing the result of the quantum measurement (backend: ibmq_vigo) corresponding to ( U ( π / 2 , 0 , π / 2 ) , 𝟙 ) in the EWL approach.
Figure 3. The histogram showing the result of the quantum measurement (backend: ibmq_vigo) corresponding to ( U ( π / 2 , 0 , π / 2 ) , 𝟙 ) in the EWL approach.
Entropy 23 00604 g003
Figure 4. The histogram showing the result of the quantum measurement (backend: ibmq_vigo) corresponding to ( U ( π / 2 , 0 , π / 2 ) , X ) in the EWL approach.
Figure 4. The histogram showing the result of the quantum measurement (backend: ibmq_vigo) corresponding to ( U ( π / 2 , 0 , π / 2 ) , X ) in the EWL approach.
Entropy 23 00604 g004
Figure 5. Non-cooperative payoff region in the Prisoner’s Dilemma game that coincides with the cooperative one.
Figure 5. Non-cooperative payoff region in the Prisoner’s Dilemma game that coincides with the cooperative one.
Entropy 23 00604 g005
Figure 6. Non-cooperative payoff region (left) and cooperative payoff region (right) of the Battle of the Sexes game that coincides with the cooperative one.
Figure 6. Non-cooperative payoff region (left) and cooperative payoff region (right) of the Battle of the Sexes game that coincides with the cooperative one.
Entropy 23 00604 g006
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Frąckiewicz, P. Non-Classical Rules in Quantum Games. Entropy 2021, 23, 604. https://doi.org/10.3390/e23050604

AMA Style

Frąckiewicz P. Non-Classical Rules in Quantum Games. Entropy. 2021; 23(5):604. https://doi.org/10.3390/e23050604

Chicago/Turabian Style

Frąckiewicz, Piotr. 2021. "Non-Classical Rules in Quantum Games" Entropy 23, no. 5: 604. https://doi.org/10.3390/e23050604

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop