1. Introduction
Consider a repeated market for betting where two agents wage on the outcomes of a binary event. Agent behavior is described by generic betting strategies that depend on prevailing odds. If the odds are fixed, the strategy that guarantees optimal wealth growth is the Kelly rule [
1,
2]. If the odds depend on agents’ bets via the parimutuel procedure, in a population of Kelly bettors, the one with the most accurate beliefs accrues all the wealth and asymptotically dominates the market [
3]. This is a particular case of the results derived in the equivalent setting of inter-temporal general equilibrium models with short-lived securities [
4,
5]. In a similar framework, an agent adopting the Kelly strategy and having correct beliefs will surely retain a positive wealth share, that is he/she survives, when trading against price-independent rules [
6]. The global behavior of the Kelly strategy with perfect information being established, it remains to understand what happens when agents do not bet according to Kelly and/or are not perfectly informed. In a market populated by utility maximizers, the agent who trades knowing the correct probabilities always realizes a non-negative expected profit [
7]. In the case of bettors using the fractional Kelly rule, a generalization of the Kelly rule that includes a risk-aversion parameter, sufficient and, apart from hairline cases, necessary conditions for strategy dominance or survival has been derived [
8,
9]. These conditions generalize and correct previous tentative results based on numerical simulations [
10]. In intertemporal equilibrium models, when more general strategies are adopted, one can observe path-dependent cases, in which the agent who dominates depends on the sequence of realized events [
11]. In general equilibrium models, a necessary and sufficient condition for a trader to vanish, i.e., to lose everything almost surely, can be obtained by approximating prices with a convex combination of traders’ discounted beliefs [
12]. In the same framework, using arguments similar to those presented in this paper, one can obtain general conditions to study the long-run dynamics of relative consumption and, eventually, agent survival. When agents’ utility is not time separable, several long-run selection outcomes may occur, included cases in which path dependency emerges [
13].
In the present paper, we propose criteria, based on the signs of the differences of relative entropies, that can be applied to generic strategies depending on prevailing market odds. The criteria are simple and abstract from strategy-specific details. It is sufficient to know the odds bettors consider fair and the amount each bettor is willing to bet when the odds are equal to the ones the other bettor believes fair to understand if one bettor will eventually dominate the market or, conversely, if the two bettors will asymptotically retain a finite, and fluctuating, amount of wealth. Interestingly, by considering generic strategies, one recovers the traditional role of luck in the game of chance, often neglected in some of the previously-mentioned studies. Indeed, in our setting, it is generically possible that the ultimate fate of a bettor is not only decided by the adopted strategy, but also by the specific realized sequence of binary events. As examples, we apply the new criteria to the case of Constant Relative Risk Averse (CRRA) bettors and to the case of agents following logit quantal response betting strategies [
14,
15].
2. Model
Consider two agents who make a sequence of consecutive bets against each other on binary events. The rounds of betting are indexed by
, and in each
t, the outcome of the event
is an independent Bernoulli trial with success probability
:
means that the event occurs, while
that it does not. In each round
t, agent
has to choose the fraction of wealth to be wagered
and the side of the bet
, where one means betting on the occurrence of the event, while zero betting against it. We assume that the amount bet is redistributed among the winners according to the parimutuel procedure, that is proportionally to how much they have bet, without any house-take. Let
be the prevailing inverse odds ratio at round
t for the occurrence of the event. Thus, if
, the agent betting on the occurrence of the event receives
times the amount bet, while if
, the agent betting against the occurrence of the event receives
times the amount bet. Agents’ betting strategies are based on prevailing odds, and they try to maximize their gain by increasing their bet when they perceive favorable opportunities. Following [
10], we assume the following:
for each agent i, there exists a constant “fair” inverse odds ;
each agent i chooses the side of the bet comparing prevailing odds with those she/he believes fair: she/he bets on the occurrence of the event () if the odds are higher than those she/he believes fair (), while she/he bets against the occurrence of the event () if the odds are lower than those she/he believes fair ();
for each agent i, there exists a continuous betting function such that:
- (a)
: agent i is willing to bet nothing when she/he considers prevailing odds fair,
- (b)
when : agent i cannot bet more than what she/he owns. The possibility that she/he bets all her wealth is ruled out as it would lead to wealth zero almost surely.
Without loss of generality we set
. Thus, if
is the wealth of agent
before the event at time
t is realized, the prevailing inverse odds
are set by the equation:
being always
and
. We require that the functions
are such that (
1) admits one and only one solution. This is for instance the case if they are monotonic, strictly concave, or strictly convex on the set of attainable prices. The amount of wealth that is not bet is invested in a risk-less asset that pays no interest. Hence, after the event at round
t is realized, the wealth of agents is updated according to
with:
where
is the Kronecker delta and
represents the gross return of wealth of agent
i, conditional or prevailing odds
p, and realized outcome
s. Since the house takes no fee, the aggregate wealth is constant, and we set
such that
and
if and only if
.
3. Long-Run Selection
The dynamics of wealth described by (
2) can lead to two different outcomes: either a single agent accrues all the wealth and dominates the market or both agents indefinitely survive, each with a positive, and fluctuating, fraction of wealth. In general, the fate of an agent could depend on the specific sequence of realizations of the random variable
. The behavior of the system can be described using the logarithm of the relative wealth of Agent 2 with respect to Agent 1,
. Indeed, the asymptotic behavior of
summarizes all the relevant information about the agents’ fate:
diverges toward
if and only if
and
;
diverges toward
if and only if
and
;
does not diverge if and only if both agents maintain a strictly positive wealth share in the long-run.
Consider the conditional odds-adjusted wealth growth rate for agent
i obtained by dividing the realized wealth growth rate by the odds of the realized outcome:
Due to individual budget constraints and market clearing condition (
1), one has
. Those quantities can be thought of as the wealth shares the agent allocates to the possible realizations of the Bernoulli trial [
13,
16]. Since in every
t, each agent
i keeps a fraction
of her/his wealth in the risk-less security, this is
as if she/he is constantly allocating a share
of her/his wealth on the realization of the event and a share
against the realization of the event. To those fractions, Agent 1 adds
against the event, while Agent 2 adds
on the occurrence of the event. Then, we define the relative entropy of the odds-adjusted growth rate with respect to the true probability of the outcome:
Under the wealth shares interpretation of
,
can be interpreted as a measure of how different from the best possible allocation agent
i’s one is [
11]. We know from the literature [
3,
4,
5,
10] that the best allocation is the Kelly rule, which corresponds to allocating wealth to each possible realization of the Bernoulli trial according to the true probabilities,
and
.
It is immediate to verify that the drift of the process
conditional on prevailing odds is just the difference of the relative entropy (
3) of the odds-adjusted growth rates of the two agents:
The agent who gains wealth in expectation is the agent whose odds-adjusted growth rates have the lowest relative entropy with respect to the true probabilities at prevailing odds, or, equivalently, the one whose wealth shares allocated to the realizations of the Bernoulli trial have the lowest relative entropy with respect to the Kelly rule at prevailing odds. Studying the details of the trajectory of the agents’ relative wealth would require detailed knowledge of the betting strategies. Nonetheless, in order to characterize the long-run behavior of the model, such full knowledge is not necessary.
Proposition 1. Let denote a realization of the Bernoulli process, and let be the associated sequence of agent i’s wealth. If agents’ betting strategies satisfy the requirements of Section 2, then: if and , then almost surely Agent 2 dominates: she/he accrues all the wealth, and ;
if and , then almost surely Agent 1 dominates: she/he accrues all the wealth, and ;
if and , then almost surely both agents survive: they retain a positive amount of wealth, for , and prevailing odds fluctuate in ;
if and , then either or depending on the realization of the Bernoulli process σ.
Proof. Define the (conditional) increment
when
and
. From (
2), remembering that, by hypothesis,
and
cannot be both zero for the same
p and are continuous, it is immediate to see that:
where
. Thus, the increments
g are finite and bounded, and Theorems 2.2, 3.1, and 3.2 of [
17] can be applied to the process
. Notice that
and
. Thus, if
, then
, whence (
. If
, then
, whence (
. If
and
, then there exists a finite interval
A such that
almost surely for any
t, whence (
. If
and
, then on any Bernoulli sequence, either
or
, whence (
. □
In order to decide the survival or dominance of agents, it is not generically necessary to know all the details of the investment strategies, but simply the Bernoulli probability
, the inverse odds considered fair by the two agents,
and
, and two positive numbers,
and
, representing the fraction of wealth one agent bets if the odds are equal to those the other agent would consider fair. These quantities are sufficient to compute the relative entropy
with
that appears in Proposition 1. An informationally-constrained external observer, who knows the true probabilities driving the occurrence of events, but does not have perfect knowledge of individual behaviors, can thoroughly infer long-run selection outcomes exploiting only a very limited amount of information about the agents’ strategies [
14,
15]. A similar result, based on similar techniques, can be obtained in a general equilibrium model with complete markets, provided agent preferences satisfy certain assumptions [
13].
It is immediate to see that if
, we are in Case (
, while if
, we are in Case (
, recovering a result in [
10]. The definitions of survival and dominance in [
10] are weaker than the ones adopted here. Given the relative simplicity of the considered process, however, their conclusions are still valid. Notice that the dominance of any agent in Case (
is not only realized on peculiar zero-measure sequences, like the sequence of all ones or the sequence of all zeros, but on sets of sequences with finite probability. This is where luck enters into the picture: both agents might dominate and accrue all wealth, but only Fortuna will decide who.
4. Example with CRRA Bettors
The betting strategies introduced in
Section 2 are flexible enough to accommodate several behavioral prescriptions. As an illustrative example, in this section, we consider the case in which agents bet to maximize the expected utility of wealth using a power utility function with Constant Relative Risk Aversion (CRRA). Call
the relative risk aversion coefficient of agent
i and
the subjective probability (belief) that agent
i assigns to the realization of the event, which is precisely the inverse odds that agent
i would consider fair. Assuming
, for
, Agent 1 bets against the occurrence of the event a fraction of wealth
that maximizes
to obtain:
Conversely, Agent 2 bets in favor of the realization of the event a fraction of wealth
that maximizes
to obtain:
The positive risk aversion implies that agents never bet the totality of their wealth.
Figure 1 provides two examples of how agents’ betting strategies vary depending on the inverse odds ratio. In the effective price support, betting strategies are always continuous and strictly concave, so that the equilibrium market odds always exist and are unique. If we set
for
, we recover the case of Kelly betting: agents maximize the expected log-growth rate of their wealth. In this case, previous contributions [
3,
4,
5,
10] showed that the agent whose beliefs had a lower relative entropy with respect to the truth dominates in the long-run. In the other cases, instead, the selection dynamics are richer.
Figure 2 reports the long-run selection outcomes inferred using the conditions from Proposition 1. Depending on agents’ risk aversion and beliefs, any case of Proposition 1 may generically occur. Notice how low risk aversion and asymmetric beliefs enhance the role of luck in deciding the ultimate winner. This is in line with previous findings about path-dependent, long-run selection outcomes [
11,
13].
5. Example with Logit Quantal Response Bettors
In this section, we use our criteria to assess long-run selection outcomes of a repeated betting market where agents’ behavior is described by a quantal response function [
14,
15]. Suppose that agent
i wants to bet a fraction of wealth
against the realization of the event and a fraction
on the realization of the event. As in the previous example, the agent assigns a subjective probability
to the realization of the event. Her/his expected payoff for round
t is:
The agent maximizes the expected payoff under the constraints
and
, where
is the entropy of the portfolio
and
is the minimum level of portfolio entropy, that is the maximum level of information attainable by the agent. Notice that if
, full information, the solution is either
or
depending on
and
, that is a boundary solution. The problem becomes:
and given the constraints, we obtain the solutions:
where
is a monotonic decreasing function of
and
when
. The more the agent is informationally constrained, the smaller the
. Notice that agent investment shares
are equivalent to those derived under a multinomial random utility model [
18]. If
, then the agent takes a net position against the event, hence
and
. If
, then the agent takes a net position in favor of the event, hence
and
. If
, then the agent takes a risk-less position and
. It follows that
, as in the previous example. Moreover, it can be easily shown that the betting functions as defined above are monotonic.
Figure 3 shows how two examples of logit quantal response betting functions vary with respect to
p.
Figure 4 reports the long-run selection outcomes inferred using the conditions of Proposition 1. As one can notice, low values of
and
, associated with agents with strong informational constraints, allow cases to emerge in which both agents survive, while the occurrence of path-dependent scenarios becomes very likely when the
’s are large.