The Evolution of Cooperation and Diversity under Integrated Indirect Reciprocity

Sasaki, Tatsuya; Uchida, Satoshi; Okada, Isamu; Yamamoto, Hitoshi

doi:10.3390/g15020015

¹ Department of Community Development, Koriyama Women’s College, Fukushima 963-8503, Japan

² Research Center for Ethi-Culture Studies, RINRI Institute, Tokyo 102-8561, Japan

³ High-Tech Research Center, Kokushikan University, Tokyo 154-8515, Japan

⁴ Department of Business Administration, Soka University, Tokyo 192-8577, Japan

⁵ Faculty of Business Administration, Rissho University, Tokyo 141-8602, Japan

* Author to whom correspondence should be addressed.

Games 2024, 15(2), 15; https://doi.org/10.3390/g15020015

Submission received: 28 February 2024 / Revised: 31 March 2024 / Accepted: 16 April 2024 / Published: 18 April 2024

(This article belongs to the Section Cooperative Game Theory and Bargaining)

Abstract

Indirect reciprocity is one of the major mechanisms driving the evolution of cooperation in human societies. There are two types of indirect reciprocity: upstream and downstream reciprocity. Cooperation in downstream reciprocity follows the pattern ‘You helped someone, and I will help you’, while the direction of cooperation is reversed in upstream reciprocity, which follows the pattern ‘You helped me, and I will help someone else’. These two types of indirect reciprocity often occur in combination. However, upstream and downstream reciprocity have mostly been theoretically studied in isolation. In this study, we propose a new model that integrates both types of reciprocity. In particular, we apply the standard giving-game framework of indirect reciprocity and analyze the three-strategy model including reciprocal altruists, indiscriminate altruists, and free riders using evolutionary game theory. We show that the model allows reciprocal altruists and free riders to coexist stably in well-mixed populations. We also find that by accounting for inattention in the assessment rule, the stability of this mixed equilibrium can be strengthened to prevent the invasion of infamous indiscriminate altruists and can even be made globally stable.

Keywords:

evolutionary game; evolution of cooperation; indirect reciprocity; downstream reciprocity; upstream reciprocity; pay it forward

1. Introduction

Reciprocal cooperation is an indispensable component of a sustainable society. Even nearly half a century after the seminal work of Trivers [1] on reciprocal altruism, the exploration of game-theoretic models for the evolution of cooperation through reciprocity remains at the forefront of evolutionary biology and the social sciences. As helping is costly, self-interested individuals will free-ride on others and, so, unconditional cooperation is unlikely to evolve. Therefore, the standard paradigm for the evolution of cooperation is a type of cooperation that is conditional on the degree of the other party’s cooperativeness, as is the case in reciprocal cooperation.

To succeed in competition with free riders, reciprocal altruists require sufficient cognitive capacity to effectively process information to discriminate non-free riders from free riders. When an interaction consists of iterated rounds between the same pair of individuals, reciprocity often occurs in the form of direct reciprocity [1,2,3]. Direct reciprocity is expressed as follows: A helps B, and then B helps A. Direct reciprocity requires memorizing what the co-player and one’s self did for each other in the past rounds of the iteration. In the absence of such iterations, as in the case of generalized exchange [4], reciprocity should be indirect [5,6,7,8,9]. Indirect reciprocity extends closed pairwise interactions to relationships involving external third parties. Implementing indirect reciprocity thus requires knowing what the involved players did to others or had done to them by others in the past, for example, through observing the co-player(s) directly or using reputation systems.

There are two types of indirect reciprocity: upstream and downstream reciprocity [7]. Downstream reciprocity, on one hand, can be expressed as follows: B helps C, and then A helps B (Figure 1b). In other words, the response to B helping C was not C helping B directly, but B being helped by a third party A, who observed B helping C and consequently evaluated B positively. This led to B being helped by A or another party who was influenced by B’s positive evaluation, for instance, through gossip or reputation [10,11,12]. This is called ‘rewarding reputation’ [13]. Therefore, downstream reciprocity uses reputation to identify partners with whom to cooperate. Thus, the motivation for such a reputational mechanism in downstream reciprocity is often described as follows: ‘If I help you, then I will be deemed good, and then someone will help me.’ This is called ‘reputational giving’ [14].

Upstream reciprocity (also known as generalized reciprocity) is expressed as follows: A helps B, and then B helps C (Figure 1a). This type of reciprocity is characterized by the logic of not choosing the partners with whom to cooperate. This differs from the logic behind downstream reciprocity, which is based on conditional cooperation. Upstream reciprocity involves a chain of altruistic behaviors [15], called ‘paying it forward’ [16,17,18], which is driven by forces such as gratitude [13,18,19,20,21,22] or a sense of indebtedness [22], rather than by the expectation of direct or indirect reward. However, in the eyes of a third party, emotional behavior can be viewed as a kind of reputational behavior (and vice versa). These motivations for reciprocity can thus easily become intertwined when evaluated.

Upstream and downstream reciprocity are commonly observed behaviors in experimental settings and field research [23,24,25,26,27,28,29,30]. Notably, different types of reciprocal mechanisms can be applied in tandem to promote cooperation [14]. However, upstream and downstream reciprocity have been theoretically studied mostly in isolation, and a comprehensive study combining both types of reciprocity is still missing.

In this study, we present a new model that integrates both types of indirect reciprocity. Our model demonstrates that a stable coexistence of reciprocal altruists and free riders is attainable through a strategy which is solely reliant on indirect reciprocity, without the need for additional mechanisms such as direct or spatial reciprocity. Spatial reciprocity, like direct reciprocity, is a mechanism of reciprocity [31] which continues to be extensively studied [32,33,34,35]. Spatial reciprocity is characterized by a spatial or subgroup structure—or relatedness—among interacting individuals, which can increase cooperation among like-minded individuals. Existing studies [36,37,38,39] have shown that integrating direct or spatial reciprocity into upstream reciprocity could promote the evolution of the latter, suggesting that such outcomes may be a byproduct of the primary mechanism [31]. Generally, reciprocity mechanisms enhance the likelihood of cooperative engagements with peers over encounters with all-out defectors [33]. However, our integrated model leads to a unique dynamic in which interactions with all-out defectors may even be encouraged, in contrast to the traditional case of reciprocity.

We specifically implement the interplay of upstream and downstream reciprocity within our framework [13], which is detailed as follows (see Figure 1c). Let B be the modeled integrated reciprocator, who can act as either an upstream or downstream reciprocator. First, assume that D helps E; witnessing this, the integrated reciprocator B deems D to be good and rewards them by helping them as a downstream reciprocator. Furthermore, if A is another integrated reciprocator who has already deemed B good, they will try to reward B by helping them as well. Then, B will forward the help received to someone else (C) as an upstream reciprocator. This may lead to B being rewarded by another witnessing integrated reciprocator. Subsequently, the chain of forwarding and rewarding help may continue in the same manner.

It should be recalled that the chain of unconditional helping by upstream reciprocators is easily terminated when facing a free rider [37]. The reactivation of helping requires waiting for the event that a new chain begins. In contrast, it is expected that helping is more likely to lead to reactivation due to the intervention of selective rewarding in the model, as depicted above. This point has been overlooked to date, and may provide an important starting point for a comprehensive study of upstream and downstream reciprocity.

In the subsequent section, we articulate the integrated reciprocator model through incorporating forwarding and rewarding behaviors into the action rules of individuals, subsequently analyzing the framework according to evolutionary game theory [40,41]. This method diverges from conventional game models through accounting for the bounded rationality of players and aligns with biological evolution, in which advantageous strategies are naturally selected for. Our methodology facilitates dynamic tracking of the stability of a strategy across evolutionary processes—not just at static equilibria—marking a significant methodological advance over classical game-theoretic analyses. Specifically, our model examines the global stability of the mixed equilibrium state brought about by the newly proposed strategy, that is, whether all the interior orbits of the dynamical system meet at that point, regardless of their starting conditions [41].

2. Results

2.1. The Setup

We build the model on the basis of the giving game in a well-mixed population. We assume that, given any interaction event, two players are randomly selected from the population and then interact with each other in only one round. The role of the donor or recipient is determined through a coin toss. To simplify the analysis, we assume that a player acts as both a donor and a recipient in each round [40]. When acting as a donor, the player is offered the option to help (C) or not (D). Helping leads to benefits $b$ for the recipient and costs $c$ for the donor, where $b > c > 0$. Not helping has no effect on either the donor or recipient. This yields an example of the well-known prisoner’s dilemma game [2]. We also consider the probability of failing to implement an intended action, whether or not to help, denoted by $ϵ$ [42].

We then apply the standard framework to study the evolution of indirect reciprocity based on the giving game [43,44,45]. The player’s strategy is described using an action rule and an assessment rule. The action rule prescribes whether a player helps or not. After every round, each player acting as a donor is assigned a binary image of ‘good’ (G) or ‘bad’ (B) following the assessment rule. Note that the player’s image when acting as a recipient is assumed to remain unchanged. In this study, we consider public assessment, under which a representative observer monitors each game, enforces an assessment rule for updating images, and broadcasts information about the population. We allow each player to perfectly know the information of co-players regarding their actions and images.

2.2. Modeling Integrated Reciprocators

To study the interplay between upstream and downstream reciprocity, we allow for the circulation of forwarded and rewarded help, as shown in Figure 1c. In this study, we examine reciprocators that conditionally help based on the integrated action rule (Table 1a), as follows: those who received help in the previous round will help a potential recipient, regardless of the recipient’s image, while those who did not receive help in the previous round will help a potential recipient only if the recipient’s image is good. In the following section, we analyze a minimalistic setting in which each individual can choose one of three strategies: unconditional cooperator (X), unconditional defector (Y), and integrated reciprocator (Z). Unconditional cooperators and defectors always intend to help and not to help, respectively. The relative frequencies of the three strategies are denoted by $x$, $y$, and $z$, respectively, where $x + y + z = 1$. We assume that, in the learning process, strategies that earn a higher payoff are more likely to be imitated in the population. We studied this simple process using replicator dynamics [41] (see Section 4 for details). In the following, we present the results obtained with the baseline model (Model I) and the refined model (Model II).
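The three action rules described above can be written as a small decision function. The following is a minimal Python sketch, not the authors' implementation; the function name `intends_to_help` and the string encodings of strategies ("X", "Y", "Z") and images ("G", "B") are assumptions made here for illustration. It returns the intended action only, before any implementation error is applied.

```python
def intends_to_help(strategy: str, received_help_last_round: bool,
                    recipient_image: str) -> bool:
    """Return the donor's intended action (True = help, C; False = not help, D).

    Hypothetical encoding: strategy is "X", "Y", or "Z"; image is "G" or "B".
    """
    if strategy == "X":   # unconditional cooperator: always intends to help
        return True
    if strategy == "Y":   # unconditional defector: never intends to help
        return False
    if strategy == "Z":   # integrated reciprocator (Table 1a)
        # Upstream part: forward help unconditionally after receiving help.
        # Downstream part: otherwise help only recipients with a good image.
        return received_help_last_round or recipient_image == "G"
    raise ValueError(f"unknown strategy: {strategy!r}")
```

For example, an integrated reciprocator who received no help in the previous round refuses a bad-image recipient, whereas one who did receive help forwards it regardless of the recipient's image.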

2.3. Model I: Stable Coexistence of the Good and the Bad

We first developed Model I by considering the simplest assessment rule: those who help are deemed good and those who do not are deemed bad (Table 1b), denoted by $(g(C, C), g(C, D), g(D, C), g(D, D)) = \mathrm{GBGB}$, where each component corresponds to an assessment outcome; for example, $g(C, C)$, associated with ‘G’, indicates that the donor is evaluated as good when they receive help (C) and subsequently provide help (C). This is the well-known scoring rule [46,47,48]. As shown in Figure 2, Model I can stabilize the intermediate level of cooperation in a mixed state of reciprocators and defectors (at P in Figure 2a,b). When maintaining this coexistence, while unconditionally forwarding help through the upstream reciprocation part of the action rule (the upper row of Table 1a) can be exploited by defectors, this is compensated for by conditional rewarding in the downstream reciprocation part (the bottom row of Table 1a). Thus, the problem regarding the evolution of upstream reciprocity [7,36,37,38,39,40] can be resolved through indirect reciprocity.

Figure 2 shows further details of the evolution of the three strategies. The phase portraits present a continuum of fixed points in the interior of the simplex $\Delta = \{(x, y, z) : x + y + z = 1\}$. Notably, dimorphic dynamics can be observed between integrated reciprocators and defectors, seen along the edge YZ given by $x = 0$. In the case without errors (Figure 2a), the edge YZ generally consists of a segment RZ, which is a basin of the attractor P, and another segment YR, which is a continuum of boundary fixed points. The attractor P: $z = z_{0}$ is given by

$$ z_{0} = \frac{b - 2 c}{b - c}. \tag{1} $$

The location of the attractor P asymptotically approaches the node Z ($z = 1$) as the benefit–cost ratio $b / c$ increases. At the attractor P, the population average of the probability of helping is $- z_{0}^{2} + 3 z_{0} - 1$. We see that the curve PQ, which is a continuum of interior fixed points connecting the points P and Q, divides the simplex. Turning to the other boundaries, the dynamics between integrated reciprocators and cooperators along the edge XZ are neutral, and the dynamics between cooperators and defectors along the edge XY are dominated by defectors. Therefore, in the long run, random fluctuations can lead the population to drift neutrally along the lines of equilibria (i.e., PQ and RY), eventually bringing it into the vicinity of the node Y (the 100% defector state).
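To make the location of P concrete, the following Python sketch evaluates Equation (1) and the Model I helping rate quoted above for given $b$ and $c$. The function names are hypothetical and chosen here for illustration. Note that, from Equation (1) itself, $z_{0}$ is positive only when $b > 2c$; the sketch does not enforce this.

```python
def coexistence_point(b: float, c: float) -> float:
    """z0 from Equation (1): frequency of integrated reciprocators at P."""
    if not b > c > 0:
        raise ValueError("the giving game assumes b > c > 0")
    return (b - 2 * c) / (b - c)


def helping_rate_model_one(z0: float) -> float:
    """Population-average probability of helping at P in Model I (no errors)."""
    return -z0 ** 2 + 3 * z0 - 1


z0 = coexistence_point(b=3.0, c=1.0)
print(z0, helping_rate_model_one(z0))   # 0.5 and 0.25 for b = 3, c = 1
print(coexistence_point(b=100.0, c=1.0))  # close to 1 for a large b/c ratio
```

With $b = 3$ and $c = 1$, half of the population consists of integrated reciprocators at P, and a quarter of all donation opportunities result in help.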

In the case with errors (Figure 2b), an attractor P and a repeller Q can appear along the edge YZ. While the continua of boundary fixed points disappear, those of the interior fixed points along PQ remain. The dynamics between integrated reciprocators and cooperators become dominated by the former. Besides these changes, the evolutionary fate of the population in the long run remains similar, even more definitely converging to the 100% defector state (see Section 4 for details).

While Model I succeeds in inducing the attractor between reciprocators and defectors, the induced equilibrium is not asymptotically stable [41] against the invasion of cooperators. Therefore, regardless of the presence or absence of errors, considering random perturbation, the population leaves the coexistence state in the long run. This is similar to the evolution of indirect reciprocity through scoring [42,49,50]. The lack of stability of the coexistence state can be explained as follows: the definition of goodness in Model I is based only on whether to help or not, thus giving rise to the infamous problem of ‘unjustified defection’ [9,51,52] when reciprocators refuse to help those who are deemed bad. In this case, the image of reciprocators becomes bad and the chance of being rewarded by other reciprocators decreases. When such a chain reaction of unjustified defection and image downgrading occurs, the advantage of being a reciprocator rather than a cooperator is lost.

2.4. Model II: Robustness against the Invasion of Cooperators

To strengthen the stability of the coexistence state, we propose Model II with a refined assessment rule (Table 1c): $(g(C, C), g(C, D), g(D, C), g(D, D)) = \mathrm{GBKK}$. Under the new rule, only those who implement upstream reciprocity should be rewarded by those who follow the action rule. Indeed, when receiving help in the previous round, those who help are deemed good, and those who do not are deemed bad; furthermore, when receiving no help in the previous round, the donor’s image remains unchanged (denoted as K in Table 1c), regardless of whether they help or not in the current round. The new assessment rule is a sort of staying rule [53,54], which was invented as a reward to focus on upstream reciprocation.
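The four-letter codes for the assessment rules lend themselves to a lookup table keyed by what the donor received and what the donor then gave. The Python sketch below is an illustration under that reading; the helper names `parse_rule` and `update_image` are hypothetical, and the sketch ignores implementation errors.

```python
# Order of components: (g(C,C), g(C,D), g(D,C), g(D,D)),
# i.e. (received, gave) for the donor being assessed.
COMPONENTS = [("C", "C"), ("C", "D"), ("D", "C"), ("D", "D")]


def parse_rule(code: str) -> dict:
    """Turn a code such as 'GBGB' or 'GBKK' into a lookup table."""
    if len(code) != 4 or set(code) - set("GBK"):
        raise ValueError(f"invalid assessment code: {code!r}")
    return dict(zip(COMPONENTS, code))


def update_image(rule: dict, received: str, gave: str, current_image: str) -> str:
    """Return the donor's new image; 'K' keeps the current image unchanged."""
    outcome = rule[(received, gave)]
    return current_image if outcome == "K" else outcome


GBGB = parse_rule("GBGB")  # Model I (scoring)
GBKK = parse_rule("GBKK")  # Model II (staying when no help was received)

# A good donor who received no help and then refuses to help:
print(update_image(GBGB, "D", "D", "G"))  # B
print(update_image(GBKK, "D", "D", "G"))  # G
```

The last two lines show the difference between the models: under GBGB the refusal costs the donor their good image, whereas under GBKK the image is kept.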

The rationale for constructing Model II is multifaceted. Initially, in Model I governed by GBGB, the probability of reciprocators maintaining a good image $g_{Z}$ varies with the fraction of reciprocators $z$, as derived in Equation (14). In contrast, the probabilities that cooperators and defectors are perceived as good remain constant at $g_{X} = 1$ and $g_{Y} = 0$, respectively. From this, through preserving $g_{Y}$ at the minimum and elevating $g_{Z}$ to its maximum, we aim to enhance the chances for reciprocators to obtain benefits $b$, equalizing them with cooperators. Consequently, in the presence of defectors, the probability that reciprocators (that is, those who cooperate conditionally) will incur costs $c$ is reliably lower than that of unconditional cooperators. It is thereby anticipated that reciprocators will, on average, realize a net payoff surpassing that of the cooperators, leading the mixed equilibrium P to be stable against the invasion of rare mutant cooperators. Accordingly, the design of Model II omits (i.e., ‘ignores’) the case where the focal player received no help (D) in the assessment, while exclusively assessing the case where the focal player received help (C). This realizes the goodness probability of reciprocators, $g_{Z}$, which takes the largest value of 1. As a result, the third and fourth bits of GBGB in Model I are set to K, and Model II can be represented as GBKK.

Model II can result in the coexistence of reciprocators and defectors, which does not allow cooperators to invade. In fact, in striking contrast to Model I, the dynamics for Model II have no interior equilibria (with or without errors). Figure 3 shows that all interior orbits converge to the boundary of the simplex, particularly the edge YZ. If the rate of the implementation error, $ϵ$, is sufficiently small, integrated reciprocators are better off than cooperators (i.e., $P_{Z} - P_{X} > 0$ holds). In the case without errors (Figure 3a), along the edge YZ, there exists a unique fixed point P: $z = z_{0}$, with the same coordinates as in Model I, and the node Y is a saddle. At the attractor P, the cooperation rate (the probability to perform C) over the population is given by $- z_{0}^{3} + 2 z_{0}^{2}$. The dynamics on the other edges, XZ and XY, remain unchanged, as was the case in Model I. Thus, P is even the global attractor. Then, turning to the case with errors (Figure 3b), the edge YZ can exhibit an attractor P and a repeller Q; thus, the population dynamics can be bistable, evolving either to the mixed state P or the 100% defector state Y (see Section 4 for details).
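The location of P in the error-free case can be checked by hand along the edge YZ. The following is a sketch, not the authors' derivation: it assumes $x = 0$, $ϵ = 0$, and the goodness probabilities $g_{Y} = 0$ and $g_{Z} = 1$ that the text reports for GBKK without errors (so that $g = z$), and it uses Equations (8)–(10) of Section 4.

```latex
\begin{aligned}
u_{Z}(C) &= z \left[ u_{Z}(C) + u_{Z}(D)\, g_{Z} \right] = z, \\
u_{Y}(C) &= z \left[ u_{Z}(C) + u_{Z}(D)\, g_{Y} \right] = z^{2}, \\
v_{Z}(C) &= u_{Z}(C) + u_{Z}(D)\, g = z + (1 - z)\, z = 2z - z^{2}, \qquad v_{Y}(C) = 0, \\
P_{Z} - P_{Y} &= b \left( z - z^{2} \right) - c \left( 2z - z^{2} \right)
              = z \left[ (b - 2c) - (b - c)\, z \right].
\end{aligned}
```

The bracket vanishes at $z = (b - 2c)/(b - c)$, in agreement with Equation (1), and it is positive below and negative above that value, which is consistent with P attracting along the edge. The population-wide helping rate on this edge, $z\, v_{Z}(C) = 2 z^{2} - z^{3}$, matches the cooperation rate quoted above.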

The stability of the attractor P against the invasion of cooperators can be understood as follows: Assuming that the integrated reciprocator received no help in the previous round, even if they interacted with a co-player with a bad image, the reciprocator’s image would not change due to the staying element (K) of the assessment rule in Model II (Table 1c). Thus, the occurrence of unjustifiable defection is prevented. This means that a reciprocator with a good image can keep that image and, thus, continue to be rewarded by other reciprocators.

Upon closer examination of the GBKK assessment rule, it is conceivable that an individual labeled good after reciprocating help (receiving prior C and giving C) might also be justly perceived as good when they choose to help (C), despite not receiving prior help (D). This more lenient variation is captured by the GBGK assessment rule. Our analysis indicates that its effects align closely with those under the GBKK rule. A key point in understanding this is that the $g(D, C) = G$ component in GBGK can only raise the probability of an individual attaining a good image ($g_{Z}$) relative to $g(D, C) = K$ in GBKK, so that $g_{Z}$ attains its maximum value of 1 when errors are absent, just as in GBKK. The evolutionary dynamics are dictated by the expected payoffs of strategies, which fully hinge on the goodness probabilities $g_{X}$, $g_{Y}$, and $g_{Z}$. With no errors present, GBKK and GBGK both yield probabilities of 1, 0, and 1, correspondingly, thus resulting in a consistent evolutionary trajectory for the population.

2.5. Cooperator, Defector, Upstream Reciprocator, and Downstream Reciprocator

The results above can be compared with those considering the evolution of the four strategies: unconditional cooperators, unconditional defectors, upstream reciprocators, and downstream reciprocators. Indeed, our study shows that the replicator dynamics for the four strategies can only result in the bistable fate of the population, as in the evolution of downstream reciprocity. The state space is divided into two distinct regions by a continuum of stable and unstable fixed points, given by $z = c / ((1 - ϵ) b)$ (with $c / ((1 - ϵ) b) < 1$), that is, the planar set (Figure 4). Considering random perturbations, this causes the population to end up in the 100% defector state. This reveals that the simple extension of the strategy space to upstream reciprocators has no effect on improving the stability of cooperation (see Section 4 for details).

3. Discussion

This study ventures into unexplored territory within the evolution of indirect reciprocity. Unlike traditional models, which often pair upstream reciprocity with direct or spatial reciprocity while overlooking downstream reciprocity, our model reveals that an asymptotically stable global attractor which sustains high levels of upstream reciprocation is achievable when integrated with downstream reciprocation, even in the absence of errors. Notably, in this attractor, the integrated reciprocators coexist with all-out defectors, providing evolutionary dynamics unprecedented in previous models within the challenging confines of the one-shot prisoner’s dilemma game in a well-mixed population. In contrast, finding an attractor between conditional and unconditional cooperators has been intensively studied [43,55,56,57,58,59].

The role of errors has been central in previous models: conditional strategies fostering cooperation are ironically at risk of erosion due to unconditional cooperation once a fully cooperative state is attained. To stabilize conditional cooperation, most models have introduced errors that result in conditional cooperators denying help to unconditional ones, thereby securing an advantage. As such, errors have hitherto been essential in stabilizing conditional cooperation [44,57,60,61,62]. Our model departs from this convention, showing that the integration of upstream and downstream reciprocity can naturally stabilize altruists in the presence of free riders without the forced incorporation of error dynamics.

A broad aspect of our framework involves the interaction with errors in perception, such as when a player misinterprets an opponent’s reputation. Initially, we assumed a perfect public assessment condition, under which the model’s design focuses on errors in implementation, thus bypassing errors in perception. This simplification, while not detracting from the principal outcome—namely, that stable coexistence is attainable sans errors—implies that the inclusion of errors in perception could yield different results under varying model assumptions, such as the action and assessment rules.

Model I’s GBGB assessment rule confines the mixed equilibrium point P to a limited basin of attraction and exposes it to neutral stability disruptions due to rare mutant cooperators (Figure 2a). The introduction of inattention in Model II effectively addresses these concerns through allowing integrated reciprocators to infiltrate the homogeneous defector state (node Y), maintaining their good image through the mechanism of inattention. Furthermore, inattention elevates the perceived goodness of integrated reciprocators to the maximum probability of 1, equivalent to cooperators. This advancement thwarts the invasion of cooperators at the point P who, unlike reciprocators, are subject to exploitation by defectors. Consequently, this boosts the stability of the point P (Figure 3a) without overly escalating the model’s complexity.

A task left for future work is to address the fact that the stable coexistence established in Model II can become unstable under invasion by ‘pure’ upstream reciprocators (Table 2a). Pure upstream reciprocators can free ride on the costly rewarding by the integrated reciprocators. A conceivable countermeasure would be to update the assessment rule so as to downgrade pure upstream reciprocators. We also remark on another type of free rider: those who only employ downstream reciprocity (Table 2b) and, thus, can free ride on the costly unconditional forwarding of help. Our analysis suggests that the coexistence state can be stable against the invasion of pure downstream reciprocators (see Section 4 for details).

Our research underscores the necessity to investigate robustness with respect to the inclusion of all possible action rules and to expand to systematically encompass concepts such as negative reciprocity or forwarding ‘greed’ strategies [63,64]. The interplay with more complex downstream reciprocity, exemplified by the leading eight norms [45,65,66], warrants comprehensive examination. The application of private assessment in the norm ecosystem [30,67,68,69,70,71,72,73], combined with other types of reciprocity [36,37,38,39,74,75] and sanctioning mechanisms [76,77], remains a significant area for future research. The relevance of spatial reciprocity can also be revisited in light of findings suggesting that certain network structures, such as directed triangular cycles [34], can indeed promote cooperative behaviors, contrary to earlier beliefs [7].

The phenomenon of different cooperation levels coexisting in human societies—which is often observed but less understood—opens a new realm of inquiry into the conditions fostering such polymorphism [78,79,80]. Our analysis of the demanding one-shot prisoner’s dilemma-like environment in a well-mixed population, independent of additional coordination factors such as repeat interactions or spatial structuring, introduces a stable polymorphism of reciprocal altruists and free riders through the sole mechanism of indirect reciprocity. This theoretical foundation enriches the dialogue on the evolution of cooperation and diversity, beckoning further scholarly exploration into this complex and fascinating domain.

4. Materials and Methods

4.1. Evolutionary Dynamics and Image Dynamics

We analyzed the model using evolutionary game theory and investigated the replicator dynamics for a set of strategies considered. Thus, we assumed an infinitely large population and its slow evolution, such that the composition of the population may be supposed to remain constant without changes in consecutive rounds. In general, the replicator dynamics are given by $d s / d t = s (P_{S} - P)$, where $s$ denotes the relative frequency of individuals who employ strategy $S$, $P_{S}$ is the expected payoff per round for strategy $S$ ($P_{S}$ is determined after playing an infinitely large number of rounds), and $P$ is the average payoff over the population, given by $\sum_{S} s P_{S}$.
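As an illustration of the replicator equation above, the following Python sketch advances the strategy frequencies by one forward-Euler step. The Euler discretization, the step size `dt`, and the function name `replicator_step` are assumptions introduced here; the payoffs are passed in as given numbers rather than computed from the model.

```python
def replicator_step(freqs: dict, payoffs: dict, dt: float = 0.01) -> dict:
    """One forward-Euler step of ds/dt = s (P_S - P) for every strategy S.

    freqs   : relative frequencies, expected to sum to 1
    payoffs : expected payoff P_S per strategy (same keys as freqs)
    """
    average = sum(freqs[s] * payoffs[s] for s in freqs)  # P = sum_S s P_S
    return {s: freqs[s] + dt * freqs[s] * (payoffs[s] - average) for s in freqs}


freqs = {"X": 0.2, "Y": 0.3, "Z": 0.5}
payoffs = {"X": 1.0, "Y": 2.0, "Z": 3.0}  # illustrative numbers only
print(replicator_step(freqs, payoffs))
```

Because the growth terms $s (P_{S} - P)$ sum to zero whenever the frequencies sum to one, the step keeps the state on the simplex up to rounding; strategies with above-average payoff grow and the others shrink.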

In the first step, we investigated the dynamics for three strategies: unconditional cooperator (X), unconditional defector (Y), and integrated reciprocator (Z). We denote these relative frequencies by $x$, $y$, and $z$, respectively. Thus, $x + y + z = 1$ and $P = x P_{X} + y P_{Y} + z P_{Z}$. We also denote the relative frequency of those who have a good image within each strategy subpopulation by $g_{S}$, with $S \in \{X, Y, Z\}$. We denote the frequency of the good over the whole population as $g = x g_{X} + y g_{Y} + z g_{Z}$.

We introduce a minimalistic framework that can deal with the interplay of upstream and downstream reciprocity using the generalized first-order action and assessment rule. The generalized first-order assessment rule is given by the following matrix:

$$ \begin{array}{l|cc} & \text{give } C & \text{give } D \\ \hline \text{received } C & g(C, C) & g(C, D) \\ \text{received } D & g(D, C) & g(D, D) \end{array} \tag{2} $$

where each element $g(a, b)$ denotes the probability that the focal player who received action $a$ in the previous round and then gave action $b$ in the current round, with $a, b \in \{C, D\}$, is deemed good. This matrix acts as a function of what the focal player does and what was done to the focal player and, thus, can cover the first-order assessment rules such as scoring (Table 1b).

In the equilibrium state (attained by starting from the state in which all have a good image, $g_{S} = 1$), the frequency of the good for each strategy should satisfy the following:

$$ \begin{aligned} g_{S} &= \sum_{a, b \in \{C, D\}} u_{S}(a)\, v_{S}(b \mid a)\, g(a, b) \\ &= u_{S}(C)\, v_{S}(C \mid C)\, g(C, C) + u_{S}(C)\, v_{S}(D \mid C)\, g(C, D) \\ &\quad + u_{S}(D)\, v_{S}(C \mid D)\, g(D, C) + u_{S}(D)\, v_{S}(D \mid D)\, g(D, D), \end{aligned} \tag{3} $$

where $u_{S}(i)$ and $v_{S}(j \mid i)$ denote the probabilities that the focal player with strategy $S$ receives action $i$ and that the focal player with strategy $S$ gives action $j$ if they have most recently received action $i$ in a given round, respectively. Therefore, $u_{S}(D) = 1 - u_{S}(C)$, $v_{S}(D \mid C) = 1 - v_{S}(C \mid C)$, and $v_{S}(D \mid D) = 1 - v_{S}(C \mid D)$.

We provide the generalized first-order action rule using the following matrix:

$$ \begin{array}{l|cc} & \text{good (G)} & \text{bad (B)} \\ \hline \text{received } C & p_{S}(C, G) & p_{S}(C, B) \\ \text{received } D & p_{S}(D, G) & p_{S}(D, B) \end{array} \tag{4} $$

where each element $p_{S}(a, i)$ denotes the probability that the focal player who received action $a \in \{C, D\}$ in the previous round and is then given an opponent with image $i \in \{G, B\}$ implements action $C$ as the potential donor in the current round. This framework covers the fundamental action rules of integrated reciprocity (Table 1a), upstream reciprocity (Table 2a), and downstream reciprocity (Table 2b).

Using the notation in Equation (4), the probability that a donor with strategy

S

implements C to (or helps) a recipient with strategy

T

is given by

\begin{matrix} u (S, T) = \sum_{a \in {C, D}, i \in {G, B}} u_{S} (a) g_{T} (i) p_{S} (a, i) \\ = u_{S} (C) g_{T} (G) p_{S} (C, G) + u_{S} (C) g_{T} (B) p_{S} (C, B) + u_{S} (D) g_{T} (G) p_{S} (D, G) + u_{S} (D) g_{T} (B) p_{S} (D, B), \end{matrix}

(5)

in which

g_{T} (G) := g_{T}

and, thus,

g_{T} (B) = 1 - g_{T} (G)

. This yields

u_{S} (C) = \sum_{S^{'} \in \{X, Y, Z\}} s^{'} u (S^{'}, S),

(6)

and

v_{S} (C) = \sum_{S^{'} \in \{X, Y, Z\}} s^{'} u (S, S^{'}),

(7)

where

s^{'}

denotes the relative frequency of strategy

S^{'}

.
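Equations (5)–(7) can be sketched in a few lines of code. The function names below are hypothetical, the numerical values of `u_C` and `g` are arbitrary placeholders rather than equilibrium values, and the error model (intended C succeeds with probability 1 - ϵ, intended D becomes C with probability ϵ) is an assumption inferred from Equation (8).

```python
def help_probability(u_donor_C, g_recipient, p):
    """Equation (5): probability that a donor gives C to a recipient.

    u_donor_C   -- probability that the donor most recently received C
    g_recipient -- probability that the recipient has a good image
    p           -- the donor's action rule p_S(a, i), keyed by (a, i)
    """
    received = {"C": u_donor_C, "D": 1.0 - u_donor_C}
    image = {"G": g_recipient, "B": 1.0 - g_recipient}
    return sum(received[a] * image[i] * p[(a, i)] for a in "CD" for i in "GB")


def receive_probability(S, freqs, u_C, g, rules):
    """Equation (6): u_S(C) = sum over S' of s' * u(S', S)."""
    return sum(freqs[T] * help_probability(u_C[T], g[S], rules[T]) for T in freqs)


def give_probability(S, freqs, u_C, g, rules):
    """Equation (7): v_S(C) = sum over S' of s' * u(S, S')."""
    return sum(freqs[T] * help_probability(u_C[S], g[T], rules[S]) for T in freqs)


def constant_rule(prob):
    """Action rule that ignores both the received action and the image."""
    return {(a, i): prob for a in "CD" for i in "GB"}


eps = 0.1
rules = {
    "X": constant_rule(1.0 - eps),                       # unconditional cooperator
    "Y": constant_rule(eps),                             # unconditional defector
    "Z": {("C", "G"): 1.0 - eps, ("C", "B"): 1.0 - eps,  # integrated reciprocator
          ("D", "G"): 1.0 - eps, ("D", "B"): eps},
}
freqs = {"X": 0.2, "Y": 0.3, "Z": 0.5}
u_C = {"X": 0.6, "Y": 0.4, "Z": 0.6}   # placeholder values, not an equilibrium
g = {"X": 0.9, "Y": 0.1, "Z": 0.9}     # placeholder values, not an equilibrium

v_Z = give_probability("Z", freqs, u_C, g, rules)
u_Z = receive_probability("Z", freqs, u_C, g, rules)
print(v_Z, u_Z)
```

Because the integrated rule depends on the recipient only through the image, summing over recipients replaces each g_T by the population average g, which is why only g appears in the expression for v_Z (C) in Equation (9).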

Therefore, for the minimalistic setting with the strategy space

\{X, Y, Z\}

, we have

\begin{matrix} u_{X} (C) = x (1 - ϵ) + y ϵ + z [u_{Z} (C) (1 - ϵ) + u_{Z} (D) (g_{X} (G) (1 - ϵ) + g_{X} (B) ϵ)], \\ u_{Y} (C) = x (1 - ϵ) + y ϵ + z [u_{Z} (C) (1 - ϵ) + u_{Z} (D) (g_{Y} (G) (1 - ϵ) + g_{Y} (B) ϵ)], \\ u_{Z} (C) = x (1 - ϵ) + y ϵ + z [u_{Z} (C) (1 - ϵ) + u_{Z} (D) (g_{Z} (G) (1 - ϵ) + g_{Z} (B) ϵ)], \end{matrix}

(8)

and

\begin{array}{l} v_{X} (C) = 1 - ϵ, \\ v_{Y} (C) = ϵ, \\ v_{Z} (C) = u_{Z} (C) (1 - ϵ) + u_{Z} (D) [g (1 - ϵ) + (1 - g) ϵ] . \end{array}

(9)

By solving Equations (3), (8) and (9), we can obtain

g_{S} (G)

,

u_{S} (C)

, and

v_{S} (C)

for each point

(x, y, z)

of the state space

∆

.

We assume that the image dynamics in Equations (3) and (5) are so fast that the replicator dynamics can be determined according to the expected payoffs, which depend on

u_{S} (C)

and

v_{S} (C)

in the equilibrium state of the image dynamics. We also assume that the image dynamics start from a situation in which all individuals have a good image. The expected payoffs for the strategies are given by

P_{S} = b u_{S} (C) - c v_{S} (C) .

(10)
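To make the link between Equation (10) and the replicator dynamics concrete, here is a minimal sketch. It assumes the standard replicator form, in which the growth rate of each strategy is its payoff minus the population mean payoff; the function names and the numerical inputs are illustrative only and are not taken from the article.

```python
def payoffs(u_C, v_C, b, c):
    """Equation (10): P_S = b * u_S(C) - c * v_S(C) for every strategy S."""
    return {S: b * u_C[S] - c * v_C[S] for S in u_C}


def replicator_rhs(freqs, pay):
    """Right-hand side of the standard replicator equation.

    ds/dt = s * (P_S - mean payoff), with the mean taken over the population.
    """
    mean = sum(freqs[S] * pay[S] for S in freqs)
    return {S: freqs[S] * (pay[S] - mean) for S in freqs}


# Illustrative inputs (placeholders, not equilibrium values of the image dynamics).
freqs = {"X": 0.2, "Y": 0.3, "Z": 0.5}
u_C = {"X": 0.6, "Y": 0.5, "Z": 0.6}
v_C = {"X": 1.0, "Y": 0.0, "Z": 0.8}

pay = payoffs(u_C, v_C, b=5.0, c=1.0)
rhs = replicator_rhs(freqs, pay)
print(pay)
print(rhs)
```

The components of the right-hand side sum to zero, so the frequencies stay on the simplex; in the full model, `u_C` and `v_C` would be recomputed from the image dynamics at every state before each evaluation.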

4.2. Model I

From the assessment rule that those who help are deemed good (Table 1b), we have

g_{S} = v_{S} .

(11)

Thus, substituting Equation (11) into Equation (8) yields

\begin{matrix} P_{Z} - P_{Y} = (g_{Z} (G) - g_{Y} (G)) [P_{X} - P_{Y}], \\ P_{Z} - P_{X} = (g_{Z} (G) - g_{X} (G)) [P_{X} - P_{Y}], \end{matrix}

(12)

in which

P_{X} - P_{Y} = b z (1 - u_{Z} (C)) - c

(13)

holds. The zero set of

P_{X} - P_{Y}

as a function of

(x, y, z)

provides a continuum of fixed points for the replicator dynamics in the interior of the two-dimensional state space

Δ

. This is what the interior curve PQ describes in Figure 2a,b. First, we focus on the case without errors (Figure 2a). From Equations (12) and (13), we see that, for the segment ZR on the edge YZ with

(3 - \sqrt{5}) / 2 < z \leq 1

, the fraction of the good converges to

g_{Z} (G) = - \frac{z^{2} - 3 z + 1}{z},

(14)

or, otherwise, for the segment RY, with

0 \leq z < (3 - \sqrt{5}) / 2

, to

g_{Z} = 0

. Hence, the fraction of the good over the entire population (i.e., the frequency of those who cooperate) is

g = (1 - z_{0}) g_{Y} (G) + z_{0} g_{Z} (G) = - z_{0}^{2} + 3 z_{0} - 1

. Substituting Equation (14) into Equation (13) yields the zero set of Equation (13) on the segment ZR, which is given by

z_{0} = (b - 2 c) / (b - c)

in Equation (1). It follows that the point P with

z = z_{0}

is an attractor with a basin ZR. In contrast, the segment RY consists exclusively of fixed points, along which

g_{Z} = g_{Y} = 0

yields

P_{Z} = P_{Y} = 0

. Turning to the dynamics along the edge XZ, we have

g_{Z} = g_{X} = 1

and, thus,

P_{Z} = P_{X}

. Hence, the dynamics between integrated reciprocators and cooperators are neutral along this edge. On the edge XY, it is obvious that

z = 0

yields

P_{X} - P_{Y} = - c < 0

and, thus, defectors dominate cooperators.
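The error-free results on the edge YZ can be checked numerically. The sketch below iterates the image dynamics for Model I with x = 0 and ϵ = 0, using Equations (8), (9), and (11) specialised to that edge (u_Z (C) = z [u_Z (C) + u_Z (D) g_Z (G)] and g_Z (G) = v_Z (C) = u_Z (C) + u_Z (D) z g_Z (G)). The function name, the iteration count, and the simultaneous-update scheme are assumptions made for illustration.

```python
def model1_edge_yz(z, iters=2000):
    """Iterate the error-free Model I image dynamics on the edge YZ (x = 0).

    Starts from the state in which everyone is good and has received C,
    and returns (u_Z(C), g_Z(G)) after `iters` simultaneous updates.
    """
    u, g_z = 1.0, 1.0
    for _ in range(iters):
        u, g_z = (z * (u + (1.0 - u) * g_z),       # Equation (8) with x = 0
                  u + (1.0 - u) * z * g_z)         # Equations (9) and (11), g = z * g_Z
    return u, g_z


b, c = 5.0, 1.0                                    # parameters of Figure 2
z0 = (b - 2.0 * c) / (b - c)                       # candidate attractor P

u, g_z = model1_edge_yz(z0)
closed_form = -(z0 ** 2 - 3.0 * z0 + 1.0) / z0     # Equation (14)
payoff_gap = b * z0 * (1.0 - u) - c                # Equation (13) at z = z0

print(z0, g_z, closed_form, z0 * g_z, payoff_gap)
print(model1_edge_yz(0.3))                         # below (3 - sqrt(5)) / 2
```

For b = 5 and c = 1 this gives z_0 = 0.75, and the iterated value of g_Z (G) can be compared directly with Equation (14); for z below (3 - \sqrt{5}) / 2 the iteration decays towards zero, in line with the segment RY.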

We then examined the case with errors (Figure 2b). Using numerical simulations, we observed that an attractor P and a repeller Q can appear along the edge YZ in general. As the error rate is non-zero, the fraction of the good among integrated reciprocators,

g_{Z} (G)

, can always take a non-zero value. Similarly,

g_{Z} (G)

never attains its full value of 1. As a result, no continuum of boundary fixed points appears along the boundary of the state space. In contrast, Equations (12) and (13) hold regardless of the presence or absence of errors and, thus, a continuum of interior fixed points remains. When considering neutral drift or random perturbations, particularly in the case with errors, the population in the long run converges to the 100% defector state (node Y). Interestingly, the global dynamics for Model I are similar to those for scoring [40].

4.3. Model II

Using the staying element in the assessment rule (Table 1c) for the equilibrium state of the image dynamics, we have the following equations:

\begin{array}{l} g_{X} (G) = u_{X} (C) (1 - ϵ) + u_{X} (D) g_{X} (G), \\ g_{Y} (G) = u_{Y} (C) ϵ + u_{Y} (D) g_{Y} (G), \\ g_{Z} (G) = u_{Z} (C) (1 - ϵ) + u_{Z} (D) g_{Z} (G), \end{array}

(15)

which obviously lead to the following constant values:

\begin{array}{l} g_{X} (G) = 1 - ϵ, \\ g_{Y} (G) = ϵ, \\ g_{Z} (G) = 1 - ϵ . \end{array}

(16)

In striking contrast to Model I, the replicator dynamics for Model II have no interior equilibrium in the state space, and we can thus see that all interior orbits converge to the boundary of the state space (Figure 3a,b). Indeed, the payoff difference between reciprocators and cooperators is given by

P_{Z} - P_{X} = c (1 - u_{Z} (C)) (1 - g) (1 - 2 ϵ),

(17)

in which g denotes the fraction of the good over the entire population, as in Equation (9), and

(1 - u_{Z} (C)) (1 - g) \neq 0

holds in the interior state space, yielding

P_{Z} - P_{X} > 0

for sufficiently small errors with

ϵ < 1 / 2

.

Next, let us examine the dynamics between integrated reciprocators and defectors along the edge YZ. For

x = 0

, we have that

P_{Z} - P_{Y} = b z (1 - u_{Z} (C)) (1 - 2 ϵ) - c [u_{Z} (C) + u_{Z} (D) g]

(18)

and, furthermore, in the case without errors (

ϵ

= 0),

P_{Z} - P_{Y} = - z^{2} (b - c) + z (b - 2 c) .

(19)

Thus, for

b > 2 c

, the point P with

z = z_{0}

as in Equation (1) becomes an attractor along the edge YZ. We also note that the dynamics along the edges XZ and XY remain unchanged from those for Model I.
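As a worked check, Equation (19) can be recovered step by step from Equations (8)–(10) and (16). The derivation below is a sketch that assumes x = 0 and ϵ = 0 throughout, and writes g for the fraction of the good over the entire population.

```latex
\begin{aligned}
% error-free images on the edge YZ, from Equation (16) with \epsilon = 0
g_{Z}(G) &= 1, \quad g_{Y}(G) = 0, \quad g = z, \\
% probabilities of receiving C, from Equation (8) with x = 0
u_{Z}(C) &= z\,[u_{Z}(C) + u_{Z}(D)] = z, \\
u_{Y}(C) &= z\,u_{Z}(C) = z^{2}, \\
% probabilities of giving C, from Equation (9)
v_{Z}(C) &= u_{Z}(C) + u_{Z}(D)\,g = z(2 - z), \quad v_{Y}(C) = 0, \\
% payoff difference, from Equation (10)
P_{Z} - P_{Y} &= b\,(z - z^{2}) - c\,z(2 - z) = -z^{2}(b - c) + z(b - 2c).
\end{aligned}
```

The right-hand side factors as z [(b - 2 c) - z (b - c)], so its zeros on the edge are z = 0 and z = (b - 2 c) / (b - c). For b > 2 c the expression is positive below the second zero and negative above it, which is what makes that point attracting along YZ.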

For these reasons, in the case without errors, it follows that all interior orbits will converge to P; thus, P is the global attractor (Figure 3a). Using Equation (9), we have that the probability that integrated reciprocators give C,

v_{Z} (C)

, is equal to

z_{0} (2 - z_{0})

. Thus, its population average is

z_{0}^{2} (2 - z_{0})

.
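The convergence to P in the error-free case can be illustrated with a short simulation. This is a sketch under stated assumptions, not the authors' code: the closed-form expressions in `payoffs` are obtained by setting ϵ = 0 in Equations (8), (9), and (16) (so that g_X (G) = g_Z (G) = 1, g_Y (G) = 0, and g = 1 - y), the replicator equation is taken in its standard form, and the Euler step size, number of steps, and initial state are arbitrary illustrative choices.

```python
def payoffs(x, y, z, b, c):
    """Error-free Model II payoffs from Equation (10).

    With eps = 0: u_X(C) = u_Z(C) = x + z, u_Y(C) = x + z * (x + z),
    v_X(C) = 1, v_Y(C) = 0 and v_Z(C) = 1 - y**2.
    """
    u_xz = x + z
    p_x = b * u_xz - c
    p_y = b * (x + z * u_xz)
    p_z = b * u_xz - c * (1.0 - y * y)
    return p_x, p_y, p_z


def simulate(x, y, z, b=5.0, c=1.0, dt=0.01, steps=40000):
    """Integrate the replicator dynamics with the explicit Euler method."""
    for _ in range(steps):
        p_x, p_y, p_z = payoffs(x, y, z, b, c)
        mean = x * p_x + y * p_y + z * p_z
        x, y, z = (x + dt * x * (p_x - mean),
                   y + dt * y * (p_y - mean),
                   z + dt * z * (p_z - mean))
    return x, y, z


b, c = 5.0, 1.0                           # parameters of Figure 3a
z0 = (b - 2.0 * c) / (b - c)              # Equation (1)
x, y, z = simulate(1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0, b, c)

print("final state:", x, y, z)
print("z0:", z0)
print("cooperation level at P:", z0 ** 2 * (2.0 - z0))
```

Starting from the centre of the simplex, the cooperator frequency should shrink towards zero while z approaches z_0; other interior starting points can be tried by changing the arguments of `simulate`.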

In the case with errors, it turns out that an attractor P and repeller Q appear simultaneously on the edge YZ. Hence, the replicator dynamics have only two local attractors, P and the node Y. As a result, the global dynamics are bistable: the population converges to either P or Y (Figure 3b).

4.4. Stability of the Attractor P against Invasion of Pure Downstream Reciprocators in Model II

Here, we prove that a rare mutant of pure downstream reciprocators (W) is worse off than the resident population consisting of defectors (Y) and reciprocators (Z) in the case without errors. Consider pure downstream reciprocators (PDR), who employ the action rule in Table 2b and the assessment rule in Table 1c. We first note that

g_{W} (G) = u_{W} (C) g + u_{W} (D) g_{W} (G)

and, thus,

g_{W} (G) = z

on the edge YZ. Using this, we calculated the probability for PDR to receive C as

u_{W} (C) = (1 - z) \cdot 0 + z (u_{Z} (C) + u_{Z} (D) g_{W} (G)) = z (u_{Z} (C) + u_{Z} (D) z)

, and the probability for PDR to give C as

v_{W} (C) = (1 - z) \cdot 0 + z g_{Z} (G) = z

.

Similarly, the probability for integrated reciprocators to receive C is given by

u_{Z} (C) = (1 - z) \cdot 0 + z (u_{Z} (C) + u_{Z} (D) g_{Z} (G)) = z > u_{W} (C)

, and, from Equation (9), the probability for integrated reciprocators to give C is

v_{Z} (C) = u_{Z} (C) + u_{Z} (D) g = z (2 - z) \geq v_{W} (C)

. Therefore, the expected payoffs for mutant PDRs,

P_{W} = b u_{W} (C) - c v_{W} (C)

, and for resident integrated reciprocators,

P_{Z} = b u_{Z} (C) - c v_{Z} (C)

, differ by

P_{Z} - P_{W} = z (1 - z) [b (1 - z) - c]

, which is positive for 0 < z < 1 - c / b. At P, where 1 - z_{0} = c / (b - c), this difference equals z_{0} (1 - z_{0}) c^{2} / (b - c) > 0. In other words, the mutants are not selected for among the residents at P.

4.5. Cooperator, Defector, Upstream Reciprocator, and Downstream Reciprocator

We also explored the evolution of four strategies: unconditional cooperators, unconditional defectors, upstream reciprocators, and downstream reciprocators. Downstream reciprocators intend to help the recipient if the recipient helped someone else in the previous round; if the recipient did not help, downstream reciprocators intend not to do so (Table 2b). Upstream reciprocators intend to help the recipient, regardless of the recipient’s image, if upstream reciprocators received help in the previous round. Otherwise, the upstream reciprocator intends not to help (Table 2a).

We denote by

x

,

y

,

v

, and

w

the relative frequencies of unconditional cooperators (X), unconditional defectors (Y), upstream reciprocators (V), and downstream reciprocators (W), respectively. Thus,

x + y + v + w = 1

and

P = x P_{X} + y P_{Y} + v P_{V} + w P_{W}

. The frequency of the good over the entire population is given by

g = x g_{X} + y g_{Y} + v g_{V} + w g_{W}

. Then, as in Equation (8), we can obtain the following equations to recursively define

u_{S} (C)

:

\begin{aligned} u_{X} (C) &= x (1 - ϵ) + y ϵ + v [u_{V} (C) (1 - ϵ) + u_{V} (D) ϵ] + w [g_{X} (G) (1 - ϵ) + g_{X} (B) ϵ], \\ u_{Y} (C) &= x (1 - ϵ) + y ϵ + v [u_{V} (C) (1 - ϵ) + u_{V} (D) ϵ] + w [g_{Y} (G) (1 - ϵ) + g_{Y} (B) ϵ], \\ u_{V} (C) &= x (1 - ϵ) + y ϵ + v [u_{V} (C) (1 - ϵ) + u_{V} (D) ϵ] + w [g_{V} (G) (1 - ϵ) + g_{V} (B) ϵ], \\ u_{W} (C) &= x (1 - ϵ) + y ϵ + v [u_{V} (C) (1 - ϵ) + u_{V} (D) ϵ] + w [g_{W} (G) (1 - ϵ) + g_{W} (B) ϵ] . \end{aligned}

(20)

By solving Equations (3), (9), and (20), we obtain

g_{S} (G)

,

u_{S} (C),

and

v_{S} (C)

. Substituting these values into Equation (10) allows us to calculate the payoffs and, thus, the replicator dynamics. For Model II, the probability

v_{S} (C)

that a player with strategy

S

gives C is given by

\begin{array}{l} v_{X} (C) = 1 - ϵ, \\ v_{Y} (C) = ϵ, \\ v_{V} (C) = u_{V} (C) (1 - ϵ) + u_{V} (D) ϵ, \\ v_{W} (C) = g (1 - ϵ) + (1 - g) ϵ . \end{array}

(21)
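Although Equation (20) defines u_S (C) recursively, the recursion is linear in u_V (C) and can be solved directly once the images are known. The sketch below takes the images g_S (G) as given inputs (they would come from the assessment rule via Equation (3), which is not implemented here); the function names and the numerical values are placeholders chosen for illustration.

```python
def receive_probs(freqs, g, eps):
    """Solve Equation (20) for u_S(C), S in {X, Y, V, W}, given the images g[S]."""
    x, y, v, w = (freqs[s] for s in "XYVW")
    k = 1.0 - 2.0 * eps
    base = x * (1.0 - eps) + y * eps                   # help from X and Y
    down = {s: w * (eps + k * g[s]) for s in "XYVW"}   # help from W, depends on image
    denom = 1.0 - v * k
    if denom == 0.0:
        raise ValueError("u_V(C) is undetermined when v = 1 and eps = 0")
    # The row for V is self-referential: solve it for u_V(C) first.
    u_v = (base + v * eps + down["V"]) / denom
    up = v * (eps + k * u_v)                           # help from V, same for everyone
    return {s: base + up + down[s] for s in "XYVW"}


def give_probs(u, g_mean, eps):
    """Equation (21): probability that each strategy gives C."""
    k = 1.0 - 2.0 * eps
    return {"X": 1.0 - eps, "Y": eps,
            "V": eps + k * u["V"], "W": eps + k * g_mean}


eps = 0.1
freqs = {"X": 0.1, "Y": 0.4, "V": 0.2, "W": 0.3}
g = {"X": 0.9, "Y": 0.1, "V": 0.9, "W": 0.5}           # placeholder images
g_mean = sum(freqs[s] * g[s] for s in freqs)

u = receive_probs(freqs, g, eps)
v_give = give_probs(u, g_mean, eps)
print(u)
print(v_give)
```

In a full computation the images would themselves depend on u_S (C) and v_S (C), so this solver would sit inside an outer fixed-point iteration over g_S (G).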

Figure 4 describes the evolution of the four strategies based on the replicator dynamics. Figure 4a shows the boundary dynamics on each face. On the X-Y-V plane, defectors are dominant. For the other three faces (X-Y-W, X-W-V, and Y-W-V), there can exist a continuum of interior fixed points if

c / ((1 - ϵ) b) < 1

. We also see that the edge dynamics between downstream reciprocators and upstream reciprocators are neutral. Therefore, a random shock can eventually bring the population to the node Y, which is the homogeneous state for defectors.

Figure 4b shows the interior dynamics. If

c / ((1 - ϵ) b) < 1

, there exists an intersection of the plane (planar continuum of fixed points) and the 3D simplex

Δ_{4} = \{(x, y, v, w) : x + y + v + w = 1\}

; otherwise, there is no interior fixed point in

Δ_{4}

. Figure 4b shows that the intersection consists of stable and unstable fixed points. Depending on the initial conditions, the population may first evolve to a stable point within the plane. Regardless of the initial conditions, random perturbations can still cause the population to eventually converge to the node Y.

Author Contributions

Conceptualization, T.S., S.U., I.O. and H.Y.; methodology, T.S., S.U., I.O. and H.Y.; software, T.S. and S.U.; validation, S.U.; formal analysis, T.S. and S.U.; writing—original draft preparation, T.S.; writing—review and editing, T.S., S.U., I.O. and H.Y.; visualization, T.S. and S.U.; project administration, T.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by JSPS KAKENHI Grant Numbers JP23K21017 (IO, HY), JP21KK0027 (IO, HY), JP23K25160 (HY, IO), and JP23K05943 (TS). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Data Availability Statement

The authors confirm that the article has no data.

Acknowledgments

We are grateful to Å. Brännström, U. Dieckmann, Y. Nakai, H. Ohtsuki, and N. Takahashi for their comments.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

1. Trivers, R.L. The evolution of reciprocal altruism. Q. Rev. Biol. 1971, 46, 35–57.
2. Axelrod, R.; Hamilton, W.D. The evolution of cooperation. Science 1981, 211, 1390–1396.
3. Press, W.H.; Dyson, F.J. Iterated prisoner’s dilemma contains strategies that dominate any evolutionary opponent. Proc. Natl. Acad. Sci. USA 2012, 109, 10409–10413.
4. Yamagishi, T.; Cook, K.S. Generalized exchange and social dilemmas. Soc. Psychol. Q. 1993, 56, 235–248.
5. Sugden, R. The Economics of Rights, Co-Operation and Welfare; Blackwell: Chichester, UK, 1986.
6. Alexander, R. The Biology of Moral Systems; Aldine de Gruyter: Berlin, Germany, 1987.
7. Boyd, R.; Richerson, P.J. The evolution of indirect reciprocity. Soc. Netw. 1989, 11, 213–236.
8. Kandori, M. Social norms and community enforcement. Rev. Econ. Stud. 1992, 59, 63–80.
9. Nowak, M.A.; Sigmund, K. Evolution of indirect reciprocity. Nature 2005, 437, 1291–1298.
10. Milinski, M.; Semmann, D.; Krambeck, H.J. Reputation helps solve the ‘tragedy of the commons’. Nature 2002, 415, 424–426.
11. Simpson, B.; Willer, R. Altruism and indirect reciprocity: The interaction of person and situation in prosocial behavior. Soc. Psychol. Q. 2008, 71, 37–52.
12. Yoeli, E.; Hoffman, M.; Rand, D.G.; Nowak, M.A. Powering up with indirect reciprocity in a large-scale field experiment. Proc. Natl. Acad. Sci. USA 2013, 110, 10424–10429.
13. Baker, W.E.; Bulkley, N. Paying it forward vs. rewarding reputation: Mechanisms of generalized reciprocity. Organ. Sci. 2014, 25, 1493–1510.
14. Melamed, D.; Simpson, B.; Abernathy, J. The robustness of reciprocity: Experimental evidence that each form of reciprocity is robust to the presence of other forms of reciprocity. Sci. Adv. 2020, 6, eaba0504.
15. Fowler, J.H.; Christakis, N.A. Cooperative behavior cascades in human social networks. Proc. Natl. Acad. Sci. USA 2010, 107, 5334–5338.
16. Bearman, P. Generalized exchange. Am. J. Sociol. 1997, 102, 1383–1415.
17. Malinowski, B. Argonauts of the Western Pacific: An Account of Native Enterprise and Adventure in the Archipelagoes of Melanesian New Guinea; Routledge: London, UK, 1922.
18. Uehara, E. Dual exchange theory, social networks, and informal social support. Am. J. Sociol. 1990, 96, 521–557.
19. Bartlett, M.Y.; DeSteno, D. Gratitude and prosocial behavior: Helping when it costs you. Psychol. Sci. 2006, 17, 319–325.
20. DeSteno, D.; Bartlett, M.Y.; Baumann, J.; Williams, L.A.; Dickens, L. Gratitude as moral sentiment: Emotion-guided cooperation in economic exchange. Emotion 2010, 10, 289–293.
21. Tsvetkova, M.; Macy, M.W. The social contagion of generosity. PLoS ONE 2014, 9, e87275.
22. Simpson, B.; Harrell, A.; Melamed, D.; Heiserman, N.; Negraia, D.V. The roots of reciprocity: Gratitude and reputation in generalized exchange systems. Am. Sociol. Rev. 2018, 83, 88–110.
23. Rutte, C.; Taborsky, M. Generalized reciprocity in rats. PLoS Biol. 2007, 5, e196.
24. Stanca, L. Measuring indirect reciprocity: Whose back do we scratch? J. Econ. Psychol. 2009, 30, 190–202.
25. Yoshikawa, K.; Wu, H.C.; Lee, H.J. Generalized exchange orientation: Conceptualization and scale development. J. Appl. Psychol. 2020, 105, 294.
26. Eriksson, T.; Ferreira, C. Who pays it forward the most? Examining organizational citizenship behavior in the workplace. J. Theor. Soc. Psychol. 2021, 5, 215–228.
27. Romano, A.; Saral, A.S.; Wu, J. Direct and indirect reciprocity among individuals and groups. Curr. Opin. Psychol. 2022, 43, 254–259.
28. Bolton, G.E.; Katok, E.; Ockenfels, A. Cooperation among strangers with limited information about reputation. J. Public Econ. 2005, 89, 1457–1468.
29. Swakman, V.; Molleman, L.; Ule, A.; Egas, M. Reputation-based cooperation: Empirical evidence for behavioral strategies. Evol. Hum. Behav. 2016, 37, 230–235.
30. Gaudeul, A.; Keser, C.; Müller, S. The evolution of morals under indirect reciprocity. Games Econ. Behav. 2022, 126, 251–277.
31. Rand, D.G.; Nowak, M.A. Human cooperation. Trends Cogn. Sci. 2013, 17, 413–425.
32. Allen, B.; Lippner, G.; Chen, Y.T.; Fotouhi, B.; Momeni, N.; Yau, S.T.; Nowak, M.A. Evolutionary dynamics on any population structure. Nature 2017, 544, 227–230.
33. Su, Q.; McAvoy, A.; Mori, Y.; Plotkin, J.B. Evolution of prosocial behaviours in multilayer populations. Nat. Hum. Behav. 2022, 6, 338–348.
34. Su, Q.; Allen, B.; Plotkin, J.B. Evolution of cooperation with asymmetric social interactions. Proc. Natl. Acad. Sci. USA 2022, 119, e2113468118.
35. Huang, Y.; Wan, S.; Zheng, J.; Liu, W. Evolution of cooperation in spatial public goods game with interactive diversity. Phys. A 2023, 621, 128794.
36. Pfeiffer, T.; Rutte, C.; Killingback, T.; Taborsky, M.; Bonhoeffer, S. Evolution of cooperation by generalized reciprocity. Proc. R. Soc. B 2005, 272, 1115–1120.
37. Nowak, M.A.; Roch, S. Upstream reciprocity and the evolution of gratitude. Proc. R. Soc. B 2007, 274, 605–610.
38. Rankin, D.J.; Taborsky, M. Assortment and the evolution of generalized reciprocity. Evolution 2009, 63, 1913–1922.
39. Barta, Z.; McNamara, J.M.; Huszar, D.B.; Taborsky, M. Cooperation among non-relatives evolves by state-dependent generalized reciprocity. Proc. R. Soc. B 2011, 278, 843–848.
40. Sigmund, K. The Calculus of Selfishness; Princeton Univ. Press: Princeton, NJ, USA, 2010.
41. Hofbauer, J.; Sigmund, K. Evolutionary Games and Population Dynamics; Cambridge University Press: Cambridge, UK, 1998.
42. Brandt, H.; Sigmund, K. The good, the bad and the discriminator—Errors in direct and indirect reciprocity. J. Theor. Biol. 2006, 239, 183–194.
43. Brandt, H.; Sigmund, K. The logic of reprobation: Assessment and action rules for indirect reciprocation. J. Theor. Biol. 2004, 231, 475–486.
44. Ohtsuki, H.; Iwasa, Y. How should we define goodness?—Reputation dynamics in indirect reciprocity. J. Theor. Biol. 2004, 231, 107–120.
45. Ohtsuki, H.; Iwasa, Y. The leading eight: Social norms that can maintain cooperation by indirect reciprocity. J. Theor. Biol. 2006, 239, 435–444.
46. Nowak, M.A.; Sigmund, K. Evolution of indirect reciprocity by image scoring. Nature 1998, 393, 573–577.
47. Nowak, M.A.; Sigmund, K. The dynamics of indirect reciprocity. J. Theor. Biol. 1998, 194, 561–574.
48. Berger, U. Learning to cooperate via indirect reciprocity. Games Econ. Behav. 2011, 72, 30–37.
49. Leimar, O.; Hammerstein, P. Evolution of cooperation through indirect reciprocity. Proc. R. Soc. B 2001, 268, 745–753.
50. Panchanathan, K.; Boyd, R. A tale of two defectors: The importance of standing for evolution of indirect reciprocity. J. Theor. Biol. 2003, 224, 115–126.
51. Okada, I. A review of theoretical studies on indirect reciprocity. Games 2020, 11, 27.
52. Yamamoto, H.; Suzuki, T.; Umetani, R. Justified defection is neither justified nor unjustified in indirect reciprocity. PLoS ONE 2020, 15, e0235137.
53. Sasaki, T.; Okada, I.; Nakai, Y. The evolution of conditional moral assessment in indirect reciprocity. Sci. Rep. 2017, 7, 41870.
54. Okada, I.; Yamamoto, H.; Sato, Y.; Uchida, S.; Sasaki, T. Experimental evidence of selective inattention in reputation-based cooperation. Sci. Rep. 2018, 8, 14813.
55. Fishman, M.A. Indirect reciprocity among imperfect individuals. J. Theor. Biol. 2003, 225, 285–292.
56. Mohtashemi, M.; Mui, L. Evolution of indirect reciprocity by social information: The role of trust and reputation in evolution of altruism. J. Theor. Biol. 2003, 223, 523–531.
57. Brandt, H.; Sigmund, K. Indirect reciprocity, image scoring, and moral hazard. Proc. Natl. Acad. Sci. USA 2005, 102, 2666–2670.
58. Ohtsuki, H.; Iwasa, Y. Global analyses of evolutionary dynamics and exhaustive search for social norms that maintain cooperation by reputation. J. Theor. Biol. 2007, 244, 518–531.
59. Okada, I.; Sasaki, T.; Nakai, Y. Tolerant indirect reciprocity can boost social welfare through solidarity with unconditional cooperators in private monitoring. Sci. Rep. 2017, 7, 9737.
60. Lotem, A.; Fishman, M.A.; Stone, L. Evolution of cooperation between individuals. Nature 1999, 400, 226–227.
61. Sherratt, T.N.; Roberts, G. The role of phenotypic defectors in stabilizing reciprocal altruism. Behav. Ecol. 2001, 12, 313–317.
62. Takahashi, N.; Mashima, R. The importance of subjectivity in perceptual errors on the emergence of indirect reciprocity. J. Theor. Biol. 2006, 243, 418–436.
63. Gray, K.; Ward, A.F.; Norton, M.I. Paying it forward: Generalized reciprocity and the limits of generosity. J. Exp. Psychol. 2012, 143, 247.
64. Kim, J.E.; Tsvetkova, M. Cheating in online gaming spreads through observation and victimization. Netw. Sci. 2021, 9, 425–442.
65. Martinez-Vaquero, L.A.; Cuesta, J.A. Evolutionary stability and resistance to cheating in an indirect reciprocity model based on reputation. Phys. Rev. E 2013, 87, 052810.
66. Santos, F.P.; Santos, F.C.; Pacheco, J.M. Social norm complexity and past reputations in the evolution of cooperation. Nature 2018, 555, 242–245.
67. Uchida, S. Effect of private information on indirect reciprocity. Phys. Rev. E 2010, 82, 036111.
68. Uchida, S.; Sigmund, K. The competition of assessment rules for indirect reciprocity. J. Theor. Biol. 2010, 263, 13–19.
69. Okada, I.; Sasaki, T.; Nakai, Y. A solution for private assessment in indirect reciprocity using solitary observation. J. Theor. Biol. 2018, 455, 7–15.
70. Yamamoto, H.; Okada, I.; Uchida, S.; Sasaki, T. A norm knockout method on indirect reciprocity to reveal indispensable norms. Sci. Rep. 2017, 7, 44146.
71. Uchida, S.; Yamamoto, H.; Okada, I.; Sasaki, T. A theoretical approach to norm ecosystems: Two adaptive architectures of indirect reciprocity show different paths to the evolution of cooperation. Front. Phys. 2018, 6, 14.
72. Hilbe, C.; Schmid, L.; Tkadlec, J.; Chatterjee, K.; Nowak, M.A. Indirect reciprocity with private, noisy, and incomplete information. Proc. Natl. Acad. Sci. USA 2018, 115, 12241–12246.
73. Krellner, M.; Han, T.A. Pleasing enhances indirect reciprocity under private assessment. Artif. Life 2021, 27, 246–276.
74. Reiter, J.G.; Hilbe, C.; Rand, D.G.; Chatterjee, K.; Nowak, M.A. Crosstalk in concurrent repeated games impedes direct reciprocity and requires stronger levels of forgiveness. Nat. Commun. 2018, 9, 555.
75. Schmid, L.; Chatterjee, K.; Hilbe, C.; Nowak, M.A. A unified framework of direct and indirect reciprocity. Nat. Hum. Behav. 2021, 5, 1292–1302.
76. Ohtsuki, H.; Iwasa, Y.; Nowak, M.A. Indirect reciprocity provides only a narrow margin of efficiency for costly punishment. Nature 2009, 457, 79–82.
77. Podder, S.; Righi, S.; Pancotto, F. Reputation and punishment sustain cooperation in the optional public goods game. Phil. Trans. R. Soc. B 2021, 376, 20200293.
78. Doebeli, M.; Hauert, C.; Killingback, T. The evolutionary origin of cooperators and defectors. Science 2004, 306, 859–862.
79. Archetti, M.; Scheuring, I. Coexistence of cooperation and defection in public goods games. Evolution 2011, 65, 1140–1148.
80. Hauert, C.; Doebeli, M. Spatial social dilemmas promote diversity. Proc. Natl. Acad. Sci. USA 2021, 118, e2105252118.

Figure 1. Three types of indirect reciprocity. Each panel illustrates a different reciprocity mechanism. The subscript

i

in the label

t_{i}

of each arrow represents the order in which the help actions occur in that direction. (a) Upstream reciprocity: first A helps B, and then B (upstream reciprocator) forwards the help received to C. (b) Downstream reciprocity: first B helps C, and then A (downstream reciprocator) rewards B by helping. (c) Integrated reciprocity: first D helps E, then B (integrated reciprocator) rewards D by helping; next, given that the other integrated reciprocator, A, rewards B by helping them, B forwards the help received to C. Another integrated reciprocator may also subsequently reward B, who is moved by this and again forwards the help received to someone else.

Figure 2. Evolution of integrated reciprocity for Model I. Panels (a,b) depict phase portraits of the replicator dynamics for the unconditional cooperator X, unconditional defector Y, and integrated reciprocator Z without and with errors, respectively. The triangles describe a simplex of the state space

Δ = \{(x, y, z) : x + y + z = 1\}

. Each node (X, Y, or Z) of the triangle corresponds to the homogeneous state of each strategy (

x

,

y

, or

z = 1

, respectively). Moreover, filled and empty circles denote stable and unstable fixed points, respectively. (a) Without errors, the simplex has three continua of fixed points: XZ, RY, and PQ, among which only PQ remains when errors are assumed in (b). Whether or not errors are present, under random shocks to the population composition, the continuum of interior fixed points, PQ, prevents the population from staying at the boundary attractor P. (b) In particular, with errors, the population will eventually converge to the node Y (100% defector state). Parameters:

c = 1

,

b = 5

, (a)

ϵ = 0

, and (b)

ϵ = 0.05

.

Figure 3. Evolution of integrated reciprocity for Model II. Panels (a) and (b) depict phase portraits of the replicator dynamics for the unconditional cooperator X, unconditional defector Y, and integrated reciprocator Z, as in Figure 2. (a) Without errors, the dynamics show the global attractor P along the edge YZ. At P, integrated reciprocators and defectors stably coexist. (b) With errors, the global dynamics become bistable: the population will eventually converge to one of the local attractors, P or Y (100% defector state). Parameters:

c = 1

,

b = 5

, (a)

ϵ = 0

, and (b)

ϵ = 0.1

.

Figure 4. Evolution of four traditional strategies. Panels (a) and (b) depict phase portraits of the replicator dynamics for the unconditional cooperator X, unconditional defector Y, upstream reciprocator V, and downstream reciprocator W, on the surfaces and in the interior space, respectively. The tetrahedral simplex in (b) denotes the state space

Δ_{4} = \{(x, y, v, w) : x + y + v + w = 1\}

. Each node (X, Y, V, or W) corresponds to the homogeneous state for each strategy. The tetrahedral simplex intersects a planar set that consists of stable and unstable fixed points. The red solid lines in (a) denote continua of boundary fixed points, among which PQ, QR, and RP are the intersections between each triangular surface and the planar set. In (b), the global dynamics show that the blue interior orbits converge to points on the plane, whereas the red interior orbits tend to the node Y, which indicates the local stability of the node Y. Because of the planar set, under random fluctuations, the evolution can end up in the 100% defector state at the node Y. Parameters:

c = 1

,

b = 5

, and

ϵ = 0.1

.

Table 1. Action and assessment rules for integrated reciprocity. Integrated reciprocators (as donors) act following the action rule and then are evaluated based on the assessment rule. In the assessment rules, G and B denote the donor’s image changing to good and bad, respectively; and K means that no change occurs in the donor’s image.

a. Action rule for integrated reciprocity
Focal donor in the previous round	Recipient's image G	Recipient's image B
Received C	C	C
Received D	C	D

b. Assessment rule for Model I
Focal donor in the previous round	Gives C in the current round	Gives D in the current round
Received C	G	B
Received D	G	B

c. Assessment rule for Model II
Focal donor in the previous round	Gives C in the current round	Gives D in the current round
Received C	G	B
Received D	K	K
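
The rules in Table 1 can be read as small lookup functions. The Python sketch below is an illustrative encoding only: the function names (`integrated_action`, `assess_model_1`, `assess_model_2`) and the single-letter string encoding of actions and images are assumptions made for this example and do not come from the paper.

```python
def integrated_action(received: str, recipient_image: str) -> str:
    """Table 1a: the donor defects only if it received D in the
    previous round AND the recipient's image is B; otherwise it gives C."""
    if received == "D" and recipient_image == "B":
        return "D"
    return "C"


def assess_model_1(received: str, action: str, current_image: str) -> str:
    """Table 1b: the donor's new image depends only on the current action
    (give C -> G, give D -> B), whatever it received before."""
    return "G" if action == "C" else "B"


def assess_model_2(received: str, action: str, current_image: str) -> str:
    """Table 1c: if the donor received D in the previous round, its image
    is kept unchanged (K); otherwise it is assessed as in Model I."""
    if received == "D":
        return current_image  # K: no change in the donor's image
    return "G" if action == "C" else "B"


if __name__ == "__main__":
    # Enumerate all four cells of the action rule.
    for received in ("C", "D"):
        for image in ("G", "B"):
            print(received, image, integrated_action(received, image))
```

Representing K by returning the donor's current image unchanged is a design choice of this sketch; it keeps both assessment functions returning a plain image value (G or B).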

Table 2. Action rules for upstream and downstream reciprocity. In the model, upstream and downstream reciprocators (as donors) act following the corresponding action rule below.

a. Action rule for upstream reciprocity
Focal donor in the previous round	Recipient's image G	Recipient's image B
Received C	C	C
Received D	D	D

b. Action rule for downstream reciprocity
Focal donor in the previous round	Recipient's image G	Recipient's image B
Received C	C	D
Received D	C	D
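
As a reading aid, the two action rules in Table 2 can be written as one-line functions and compared with the integrated action rule of Table 1. The following Python sketch uses hypothetical names (`upstream_action`, `downstream_action`, `either_cooperates`, `INTEGRATED_TABLE`) and copies the Table 1a entries by hand. The comparison it makes is an observation drawn from the tables in this example, not a statement quoted from the paper.

```python
def upstream_action(received: str, recipient_image: str) -> str:
    """Table 2a: give C if and only if the donor received C in the
    previous round; the recipient's image is ignored."""
    return "C" if received == "C" else "D"


def downstream_action(received: str, recipient_image: str) -> str:
    """Table 2b: give C if and only if the recipient's image is G;
    what the donor received in the previous round is ignored."""
    return "C" if recipient_image == "G" else "D"


# Table 1a (integrated action rule), keyed by (received, recipient_image).
INTEGRATED_TABLE = {
    ("C", "G"): "C",
    ("C", "B"): "C",
    ("D", "G"): "C",
    ("D", "B"): "D",
}


def either_cooperates(received: str, recipient_image: str) -> str:
    """Give C whenever at least one of the two pure rules gives C."""
    up = upstream_action(received, recipient_image)
    down = downstream_action(received, recipient_image)
    return "C" if "C" in (up, down) else "D"


# True if combining the two rules reproduces every cell of Table 1a.
matches = all(
    either_cooperates(received, image) == action
    for (received, image), action in INTEGRATED_TABLE.items()
)
```

In all four cells, the integrated rule gives C exactly when the upstream rule or the downstream rule would give C, so `matches` evaluates to `True`.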

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Sasaki, T.; Uchida, S.; Okada, I.; Yamamoto, H. The Evolution of Cooperation and Diversity under Integrated Indirect Reciprocity. Games 2024, 15, 15. https://doi.org/10.3390/g15020015
