2. The Game
A set of players is tasked with the division of a perfectly divisible unit good. At each period , in a rotating order , a player becomes the proposer; he/she gets to make a proposal or offer z with for all and . The other players, the responders, simultaneously decide whether to accept or reject the proposer’s offer. Whenever a time t proposal z is unanimously accepted by all responders, the proposed partition of the good is implemented, and the game ends; in such a case, each player i’s resulting time t utility is given by . Whenever a player’s time t proposal is not unanimously accepted, it is rejected. In that case, the game continues to time , where the next player in the above-described rotating order becomes the proposer. With probability (), all players in N remain in the game as the game moves to time , and with probability , the time t proposer, say player i, is eliminated, receiving zero utility at time . If i is eliminated, the game continues at as described above with the player set , and player i is henceforth skipped in the proposer order. An important distinction with the classic version of this game is that the expected utility of perpetual disagreement is well-defined, and not equal to zero.
2.1. Preliminaries
A time thistory, denoted , specifies for all periods s up until time t, the proposals that are made, and the coalition of players that survive period s. Specifically, where . Furthermore, let be the empty history. The set of all time t histories is denoted .
A strategy for player i is a set of functions that specifies player i’s actions at each subgame, where he/she is still an active player. Let be the set of proposals available to the players in —that is, —and let . When i is the proposer, ; that is, based on the history of play up until time t, player i’s strategy prescribes which proposal he/she should make to the surviving coalition. When i is a responder, ; based on the play of the game up until time t, the current proposal on the table, and the time t coalition of active players, player i’s strategy specifies whether he/she should accept (Y) or reject (N). A tuple specifying each player’s strategy is called a strategy profile. Given a strategy profile f, denotes the tuple of strategies played by players in .
The (expected) payoff a player realizes when the strategy profile f is played is denoted as . The payoff he/she realizes in the subgame that results from history is denoted .
A Nash equilibrium is a strategy profile f such that for all , for all strategies . A subgame perfect equilibrium (SPE) is a strategy profile f that is a Nash equilibrium in each subgame. More precisely, for all strategies and histories .
2.2. The One-Shot Deviation Principle
Since payoffs are not discounted, it is not immediate that the one-shot deviation principle applies. To see that it does, let
f and
be two strategy profiles that coincide for periods 0 to
. Then, for all
and for all
, the following holds:
If under the strategy profile
f, with some probability, a terminal node is reached prior to time
t, then since
coincides with
f prior to time
t, this same terminal node is reached under
with the same probability, and players realize the same physical payoffs. Hence, the expected payoff a player
i realizes from all terminal nodes prior to
t is the same under
f as it is under
, and in Equation (
1), the two cancel each other out. Furthermore, the difference between the continuation values implied by two different strategy profiles is bounded from above by 1 at each subgame. From (
1), it follows that for any
, there is a
t such that for all strategy profiles
coinciding for the first
t periods,
. This condition, called
continuity at infinity, is sufficient for the one-shot deviation principle to apply. That is, in verifying whether a strategy profile
f is SPE, we may restrict attention to one-period deviations from these strategies.
3. The Two-Player Case
In order to construct an SPE, assume that player 1 always proposes
x and 2 always proposes
y, and that there is immediate agreement. If player 2 were to reject player 1’s proposal
x, then in the next round, he/she obtains
in case player 1 is not eliminated from the game (an event that occurs with probability
) and the entire pie if player 1
is eliminated from the game (an event that occurs with probability
). Similarly, if player 1 were to reject player 2’s proposal
y, then he/she realizes
in case player 2 survives this rejection, and the entire pie otherwise.
Solving with equality yields the following:
This allows for the construction of an SPE. Consider the strategy profile , defined as follows:
- :
As proposer, propose x; as responder, accept v iff .
- :
As proposer, propose y; as responder, accept v iff .
Note that in this strategy profile, it is better to be a responder than a proposer.
Proposition 1. The strategy pair is an SPE.
Proof. It is sufficient to show that is optimal against since the other case is analogous. Suppose, thus, that player 2 plays strategy . If player 1 is the proposer, following yields . If player 1 deviates by offering player 2 less than , player 2 rejects and player 1’s payoff is . If player 1 deviates by offering player 2 more than , player 2 accepts, and player 1’s payoff is strictly below . Hence, at even t, player 1 cannot profitably deviate from .
Consider an odd time t at which player 1 is responding to a proposal v by player 2. By accepting, player 1 obtains , while rejecting yields a payoff of . Hence, it is optimal for player 1 to accept player 2’s proposal v if , and to reject it otherwise, as prescribed by . □
It is clear that the value from perpetual disagreement in this model is not zero, like in the classical model. As it turns out, the players’ expected payoffs from perpetual disagreement are given by the solution to system (
2).
Lemma 1. Player i’s () time-t expected payoff from perpetual disagreement is if t is even, and if t is odd.
Proof. Note first that the values of perpetual disagreement in two identical subgames are the same. Thus, let
be the players’ values of perpetual disagreement at even
t, and
players’ values of perpetual disagreement at odd
t. Observe that for player 2, the expected payoff from perpetual disagreement when he/she is the responder is equal to the expected payoff of his/her value of perpetual disagreement in the next round when he/she is the proposer. That is,
Similarly, . Further note that the players’ payoffs at every terminal node are either given by or . Thus, for some probability p, which implies that . Similarly, . Thus, and , as desired. □
An immediate implication of Lemma 1 is that is not the only SPE in this game. It even implies that perpetual disagreement can be supported in SPE. To see this, consider the profile , defined as follows:
- :
As proposer, propose x; as responder, accept v iff .
- :
As proposer, propose y; as responder, accept v iff .
Evidently, this strategy profile induces perpetual disagreement, but even so, it is an SPE.
Proposition 2. The strategy pair is an SPE.
Proof. Again, it is sufficient to show that is optimal against . Suppose that player 1 is the proposer. If he/she follows , the result is perpetual disagreement, which by Lemma 1 yields . If player 1 offers player 2 strictly less than , player 2 rejects, leading to perpetual disagreement, and player 1’s expected payoff continues to be . If he/she offers player 2 strictly more than , player 2 will accept, and player 1’s payoff is strictly below . Thus, as proposer, player 1 cannot profitably deviate from .
Suppose on the other hand that player 2 is the proposer, and that he/she offers v. Rejecting v again leads to perpetual disagreement. By Lemma 1, player 1’s corresponding payoff is . Thus, it is optimal for player 1 to accept v if exceeds , and to reject it otherwise, as prescribed by . □
Remark 1. It can be verified that and are SPE as well. We may even consider more intricate SPE , where player 1 starts out by playing , and at some time t switches to so that the game concludes with an acceptance after several rounds of delay.
Remark 2. To see where the multiplicity of equilibria comes from, observe that when player 1 proposes x, player 2 is indifferent between accepting and rejecting. More precisely, accepting yields , while rejecting yields . So what if, instead of accepting x, player 2 rejects? This can still be supported in SPE since player 1 has zero scope to make player 2 accept. Specifically, suppose that player 1 deviates by offering player 2 an over and above ; player 2 will then certainly accept, but player 1’s corresponding payoff is , which is strictly below the he/she receives when player 2 rejects.
Fortunately, we can say something about the payoffs that players realize in SPE, which for our purposes is sufficient.
Proposition 3. In SPE, player i’s expected time-t payoff is if t is even, and if t is odd.
Proof. For , let and , respectively, be the supremum and the infimum over all SPE of player i’s continuation value in subgames where both players are in the game and i is the proposer. Similarly, for let and respectively be the supremum and the infimum over all SPE of player i’s continuation value in subgames where both players are in the game and i is the responder.
First note that at any time t, player 1 can commit to rejecting all of player 2’s proposals when she is responding, and demanding when she is proposing. Thus, by Lemma 1, she can unilaterally secure an (expected) payoff of when she is the proposer, and of when she is the responder. Therefore, and . Similarly, and .
In SPE, player 1 is never offered more than when 2 is the proposer. Whether player 2’s proposal is accepted or rejected, player 1’s payoff in SPE is, thus, at most , that is, .
If player 1 is the proposer, player 2’s payoff is bounded from below by
. Hence, player 1 can receive at most
from an accepted offer. He/she can secure at most
if his/her offer is rejected. Thus,
Then, either , or , which also implies . Thus, in either case, . Then . Thus, . By similar reasoning, and . □
The question is whether these results carry over to the multilateral case. Consider for a moment the classic model, where is interpreted as a discount factor so that a payoff z realized at time t yields utility . Shaked showed that if , then any partition of the good can be sustained in SPE in the three-player game. Let be the vector that gives one to player i, and zero to both . Then the game is as follows.
In state , each player i proposes y, and accepts a proposal x if and only if . If a player i proposes x with , then the game moves immediately, prior to the responses of the other players, to the state , where is the player with the lowest index for whom . Note that there is at least one such player. To see that this is an SPE, consider first a player i who responds to an offer y; rejecting yields , so accepting a proposal x if and only if is optimal. Consider next a player i who proposes. The correct offer yields . If i proposes x with , the game switches to state . Since , player j rejects, and player i obtains a zero payoff.
To see that in our version of the game, this strategy profile is not an SPE, assume that the game is in state and that player 2 is the proposer. Then, player 3 is correctly offered 0, and receives zero payoff if he/she accepts. Rejecting, on the other hand, yields , which is strictly positive. Hence, for player 3, in this subgame, the above-described strategy is not optimal, and as such, the profile is not an SPE.
4. The -Player Case ()
In order to derive an SPE in the general game, note first that by symmetry, a player’s (expected) payoff in the game is fully determined by his/her position, i.e., how many rounds it would take to become the proposer. The payoff is further completely independent from his/her identity. Let , be the (expected) payoff a player realizes in the game with players when he/she is k rounds away from being the proposer. We assume that these are known quantities. Let be the corresponding payoffs in the n-person game.
If a player is one period away from being the proposer, the (expected) payoff he/she realizes is
, which should be at least as good as going to the next round. With probability
, he/she becomes the proposer in the
n-person game in this next period, in which case the (expected) payoff is
, while with probability
, he/she becomes the proposer in the
-person game that results from the proposer being eliminated from the game. In that case, the (expected) payoff is known and given by
. Hence,
. By similar reasoning,
for
, and
. Evaluating with equality, we obtain the following system:
We next show that this system has a unique solution, its components sum up to one, and if tends to one, the equal split is implemented. We further show that, even in the n-person game, there is proposer disadvantage. Specifically, the further a player is from being the proposer in the rotating order, the higher a payoff he/she realizes.
Proposition 4. The system (3) has a unique solution , for which , , and for all k. Proof. Note that for the two-person case , , , and = . Assume that satisfies the same properties:
- A1.
.
- A2.
.
- A3.
.
- A4.
For all , .
Note that any solution to the system (
3) trivially satisfies A3. Let
be a possible solution to system (
3), and let
. By A1, summing up the equations yields
, and thus
. Hence, any solution to the system (
3) satisfies A1. By A1 and the first
equations of the system, we can uniquely determine
:
Hence, system (
3) has a solution, and this solution is unique. Since
satisfies A4, the
terms converge to finite values so that
. From this and, again, the fact that the
terms converge to finite values,
for all
k.
To see that A2 is satisfied, assume, contrary to what we want, that
. Observe that
Since
satisfies A2,
. Then by the initial assumption that
,
. By continuing this reasoning,
,
, …,
so that
Since
, we arrive at a contradiction. Thus,
. Then since
,
, and thus by A3,
. Since
, this implies
. Since
it follows that
. Continuing this logic,
, as desired. □
Based on the solution to (
3), construct the following strategy profile
:
A proposer claims for himself/herself, and offers to the player that is k periods away from becoming the proposer.
A responder i, k periods away from becoming the proposer, accepts a proposal x if and only if .
Note that the proposals are feasible and efficient.
Proposition 5. The profile is an SPE.
Proof. To see that the proposer rule is optimal, note that a rejected offer yields , and that the best a proposer can do from an accepted proposal is .
Consider a responder who is k periods from being the proposer. If all other players accept the proposer’s offer, then rejecting yields . Then, the responder rule is optimal. □
Lemma 2. If at time t a player is k periods away from being the proposer, then their expected time t payoff from perpetual disagreement is .
Proof. Note first that a player’s continuation values of perpetual disagreement in two identical subgames are the same. In addition, by symmetry of the players, a player’s value of perpetual disagreement is fully determined by how many periods away they are from being the proposer. Thus, for , let be the value of perpetual disagreement of a player k rounds from being the proposer. Note that the expected value of perpetual disagreement in the current round must be equal to the expected value of perpetual disagreement in the next round, i.e., and if . Then, for all k, as desired. □
This result again implies that perpetual disagreement can be sustained in SPE. Let be the same as , except that the inequality in the accept/reject rule is strict. Such a profile leads to perpetual disagreement but is still an SPE.
Proposition 6. The profile is an SPE.
As before, this has the undesirable implication that there are a great many different SPEs. Not only is optimal against , and vice versa, there are also more intricate SPEs where each player i switches back and forth between and . However, using similar techniques as before, it is still the case that the payoffs that players realize in SPE are always the same.
Theorem 1. If a player is k periods from being the proposer, their (expected) payoff in SPE is given by .
Proof. Let and respectively be the supremum and the infimum over all SPE of a player’s payoff when they are k periods from becoming the proposer, and all players are in the game. Let be the (expected) payoff a player realizes in the subgame where he/she is k periods from becoming the proposer, and one player has been eliminated from the game. By induction, we know that the equilibrium payoffs w are unique in the game with players. Based on this assumption, we will argue that the equilibrium payoffs in the game with n players are unique as well. In other words, for all .
Suppose that at time t a player is k periods from being the proposer, and all players are in the game. Since this player can commit to henceforth rejecting all proposals from his/her opponents and always demanding when they themselves are the proposer, a strategy that yields by Lemma 2 is .
Since the payoffs of all responders are bounded from below by
, an accepted proposal cannot give a proposer more than
. Furthermore,
. Hence,
If the proposal is rejected, the proposer’s expected payoff is bounded from above by the expected payoff of receiving
in the next round. That is,
. Thus, the following holds:
Suppose a player is
k periods from being a proposer. The expected payoff from moving to the next round is
, so he/she will never be offered more than that in the current round. Thus, whether the current proposal is accepted or rejected, this is an upperbound on the payoff that they realize in SPE in the current round. In other words,
Since
is the unique solution to the system (
3), it follows from (
4) and (
5) that
. Then since
,
.
Let
, and assume that
. Then, by (
5),
. Since
,
. □