A Note on Universal Bilinear Portfolios

Garivaltis, Alex

doi:10.3390/ijfs9010011


Department of Economics, School of Public and Global Affairs, College of Liberal Arts and Sciences, Northern Illinois University, 514 Zulauf Hall, DeKalb, IL 60115, USA

Int. J. Financial Stud. 2021, 9(1), 11; https://doi.org/10.3390/ijfs9010011

Submission received: 8 January 2021 / Revised: 5 February 2021 / Accepted: 10 February 2021 / Published: 24 February 2021


Abstract

This note provides a neat and enjoyable expansion and application of the magnificent Ordentlich-Cover theory of “universal portfolios”. I generalize Cover’s benchmark of the best constant-rebalanced portfolio (or 1-linear trading strategy) in hindsight by considering the best bilinear trading strategy determined in hindsight for the realized sequence of asset prices. A bilinear trading strategy is a mini two-period active strategy whose final capital growth factor is linear separately in each period’s gross return vector for the asset market. I apply Thomas Cover’s ingenious performance-weighted averaging technique to construct a universal bilinear portfolio that is guaranteed (uniformly for all possible market behavior) to compound its money at the same asymptotic rate as the best bilinear trading strategy in hindsight. Thus, the universal bilinear portfolio asymptotically dominates the original (1-linear) universal portfolio in the same technical sense that Cover’s universal portfolios asymptotically dominate all constant-rebalanced portfolios and all buy-and-hold strategies. In fact, like so many Russian dolls, one can get carried away and use these ideas to construct an endless hierarchy of ever more dominant H-linear universal portfolios.

Keywords:

on-line portfolio selection; universal portfolios; robust procedures; model uncertainty; constant-rebalanced portfolios; asymptotic capital growth; Kelly criterion

JEL Classification:

D81; D83; G11

We first investigate what a natural goal might be for the growth of wealth for arbitrary market sequences. For example, a natural goal might be to outperform the best buy-and-hold strategy, thus beating an investor who is given a look at a newspaper n days in the future. We propose a more ambitious goal.
—Thomas M. Cover, Universal Portfolios, 1991

In 1988, out of the blue, Paul Samuelson wrote a letter to Stanford information theorist Thomas Cover. Samuelson had been sent one of Cover’s papers on portfolio theory for review. “If I did use some of your procedures,” Samuelson wrote, “I would not let that … bias my portfolio choice toward choices my alien cousin with log utility would make”. He chides Kelly, Latané, Markowitz, and “various Ph.D’s who appear with Poisson-distribution probabilities most Junes”.
—William Poundstone, Fortune’s Formula, 2005

With four parameters I can fit an elephant, and with five I can make him wiggle his trunk.
—John von Neumann

1. Introduction; Literature Review

This note contains a nice application and extension of the elegant universal portfolio theory that was established by Thomas Cover (1991); Cover and Ordentlich (1996); and Ordentlich and Cover (1998).

Universal portfolio theory is the on-line analogue of the log-optimal portfolio theory (that is, the theory of asymptotic capital growth), whose brilliant simplicity came down to us from such illustrious thinkers as John Kelly (1956), Henry Latané (1959), Leo Breiman (1961), and card-counter Edward O. Thorp (1969). Under laboratory conditions where the investor or gambler knows in advance the precise distribution of the profit-and-loss outcomes on which he is betting, the tea leaves say (cf. MacLean et al. (2011)) that log-optimal portfolios (or growth-optimal portfolios) enjoy tremendous optimality properties, quite apart from the fact that they maximize a very specific type of expected utility, as pointed out so many times by Samuelson (1963, 1969, 1979).

Leo Breiman (1961) gave the first substantial results in this direction, namely, that the so-called Kelly gambler will, under general conditions, asymptotically outperform any “essentially different strategy” almost surely by an exponential factor. He also demonstrated that, for the sake of goal-based investing, the Kelly criterion minimizes the expected waiting time with respect to hitting a distant high-water mark.

In a pair of beautiful articles, Bell and Cover (1980, 1988) established that, actually, the Kelly rule also possesses very strong short-term competitive optimality properties, even for a single period’s fluctuation of a betting or investment market. They considered a static, zero-sum investment ϕ-game whose payoff kernel is equal to the expected value of an arbitrary increasing function

ϕ (•)

of the ratio of one trader’s wealth to that of another. Subject to the proviso that, prior to the actual portfolio choice, each contestant is permitted to make a fair randomization of his initial dollar (by exchanging it for any random capital whose mean is at most 1), the saddle point of the game amounts to each player using the log-optimal portfolio, together with fair randomizations that depend only on the criterion

ϕ (•)

, and not on any particular characteristic of the underlying investment opportunities.

Garivaltis (2018a) showed that the Bell-Cover theorem holds equally well for stochastic differential investment

ϕ

-games in continuous time that exhibit state-dependent drift and diffusion; Garivaltis (2019a) generalized this result even further, so as to cover levered investment

ϕ

-games over continuous time markets whereby the asset prices follow jump-diffusion processes with compactly-supported jump returns. Some recent work by Curatola (2019) investigates the strategic interaction of two large traders whose transactions affect not just each other, but also the expected returns of the entire stock market. For an illuminating discussion of competitive optimality as it relates to evolutionary contingencies in mathematical biology, consult Tal and Tran (2020).

Cover’s universal portfolio theory, which began in earnest with his empirical Bayes stock portfolio (Cover and Gluss (1986)), takes its cue from the fact that for stock markets with iid returns, the log-optimal portfolio amounts to a certain constant-rebalanced portfolio (CRP); this consists in fixing the correct (growth-optimal) target percentages of wealth for each asset, and continuously executing rebalancing trades so as to counteract allocation drift. However, in the presence of model uncertainty (e.g., for actual stock markets), this particular CRP is completely unknown to the practitioner.

Inspired by the analogies with information theory, Thomas Cover had the brilliant insight that one should benchmark his on-line investment performance relative to that of the best constant-rebalanced portfolio determined in hindsight for the actual (realized) sequence of asset prices. The hindsight-optimized wealth can be interpreted as a financial derivative that is susceptible of exact pricing and replication in the (complete) continuous time market of Black and Scholes (1973). On that score, Ordentlich and Cover (1998) priced the rebalancing option at time-0 for unlevered hindsight optimization over a single risky asset; their work sat unfinished for twenty years, until it was completed by Garivaltis (2019b), who demonstrated how to price and replicate Cover’s (levered) rebalancing option at any time t, for any number of correlated stocks in geometric Brownian motion.

In discrete time, the empirical Bayes stock portfolio (Cover and Gluss (1986)), the Dirichlet-weighted universal portfolio (Cover and Ordentlich (1996)), and the minimax universal portfolio (Ordentlich and Cover (1998)) are all notable in that they guarantee to achieve a high percentage of the final wealth of the best constant-rebalanced portfolio in hindsight, uniformly for all possible sequences of asset prices. On account of the fact that this percentage (or competitive ratio) converges to zero at a slow (polynomial) rate, the excess compound (logarithmic) growth rate of the best CRP in hindsight (over and above that of the on-line portfolio) converges uniformly to zero. Thus, universal portfolios succeed in matching the performance of the best CRP in hindsight “to first order in the exponent”.

The original universal portfolios (inspired as they were by iid stock markets) suffer from the defect that they fail to recognize and exploit even very simple types of serial dependence in the individual sequence of asset returns. For example, consider a two-asset market whereby asset 2 is cash (that pays no interest), and asset 1 is a “hot stock” whose price alternately doubles in odd periods and gets cut in half in even periods. Naturally, one should hope that his portfolio selection algorithm is capable of detecting such a trivial pattern, thereby learning to (asymptotically) double its capital every two periods. But the original universal portfolios, when applied to this particular sequence of asset prices, merely learn to use the constant-rebalanced portfolio that puts

50 %

of its wealth into the stock and holds the rest in cash at the start of each investment period; this generates asymptotic capital growth at a rate of

\log (9 / 8) = 11.8 %

every two periods, compounded continuously—a far cry from the

\log 2 = 69.3 %

that accrues to perfect trading.
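The arithmetic of this example can be checked with a short sketch. The function name `two_period_growth` and the grid resolution are illustrative choices that do not appear in the original note; the script searches over the constant-rebalanced weight c held in the hot stock and compares the best two-period log growth with that of perfect trading.

```python
import math


def two_period_growth(c: float) -> float:
    """Two-period growth factor of the CRP that keeps fraction c in the hot stock.

    The stock doubles in odd periods and halves in even periods; cash returns 1.
    """
    up = c * 2.0 + (1.0 - c) * 1.0    # odd period: the stock doubles
    down = c * 0.5 + (1.0 - c) * 1.0  # even period: the stock is cut in half
    return up * down


# Grid search over c in [0, 1]; the resolution is an arbitrary illustrative choice.
grid = [k / 1000 for k in range(1001)]
best_c = max(grid, key=two_period_growth)
best_growth = two_period_growth(best_c)

crp_log_growth = math.log(best_growth)  # log(9/8) per two periods
perfect_log_growth = math.log(2.0)      # log 2 per two periods

print(best_c, best_growth, crp_log_growth, perfect_log_growth)
```

On this grid the maximizer is c = 0.5 with a two-period growth factor of 9/8, in agreement with the figures quoted above.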

One way out of this conundrum is to use the universal portfolio with side information (Cover and Ordentlich (1996)) along with a “signal” that indicates, say, whether or not the current period is odd. The obvious objection here is that the efficacy of this particular signal (as opposed to any other piece of side information) will only ever become apparent in hindsight. Accordingly, this paper tackles the problem differently: we consider an expanded parametric family of mini 2-period active trading strategies called bilinear portfolios, which explicitly generalize the constant-rebalanced portfolios (here called 1-linear portfolios). We then apply the Ordentlich-Cover techniques to design a universal bilinear portfolio that compounds its money at the same asymptotic rate as the best bilinear trading strategy in hindsight (thereby learning to trade perfectly in the motivating example). Thus, the universal bilinear portfolio will be shown to asymptotically dominate the universal 1-linear portfolio in the same technical sense (cf. Cover and Thomas (2006)) that the universal 1-linear portfolio asymptotically dominates all constant-rebalanced portfolios and all buy-and-hold strategies. Once this is done, it will become readily apparent just how one can go about constructing an endless hierarchy of ever more dominant universal H-linear portfolios, for all possible mini-horizons

H \in \{1, 2, 3, . . .\}

.

2. Bilinear Trading Strategies

We start by defining the concept of a bilinear trading strategy (or bilinear portfolio), which is a simple 2-period active strategy that generalizes the notion of a constant-rebalanced portfolio (CRP). To this end, we assume that there are m assets called

i, j \in \{1, . . ., m\}

; we let

x_{i} \geq 0

denote the gross return1 of a

$ 1

investment in asset i in period 1, and similarly we let

y_{j} \geq 0

denote the gross return of asset j in period 2. We let

x : = {(x_{1}, . . ., x_{m})}^{'} \in R_{+}^{m} - \{0\}

denote the gross return vector in period 1, and in the same vein,

y : = {(y_{1}, . . ., y_{m})}^{'} \in R_{+}^{m} - \{0\}

is the gross return vector in period 2.

Definition 1.

A bilinear trading strategy is a square matrix

B : = {[b_{i j}]}_{m \times m}

of non-negative weights that sum to one. After two investment periods, the bilinear trading strategy B multiplies the initial dollar by a factor of

\begin{matrix} \text{Two-period capital growth factor} : = x^{'} B y = \sum_{i = 1}^{m} \sum_{j = 1}^{m} b_{i j} x_{i} y_{j} . \end{matrix}

(1)

The set of all bilinear trading strategies is denoted

B : = \{B \in {Mat}_{m, m} (R) : B \geq 0 and 1^{'} B 1 = 1\},

(2)

where

1 : = {(1, . . ., 1)}^{'}

is an

m \times 1

vector of ones.

Proposition 1.

The bilinear2 final wealth

x^{'} B y

is uniquely replicated by the following 2-period active trading strategy: in period 1, we use the initial portfolio

p : = {(p_{1}, . . ., p_{m})}^{'} = B 1

, where

p_{i} = \sum_{j = 1}^{m} b_{i j}

is the initial fraction of wealth that will be invested in asset i; in period 2, we must use the portfolio

\begin{matrix} q (x) : = {(q_{1} (x), . . ., q_{m} (x))}^{'} = \frac{B^{'} x}{p^{'} x} = \frac{B^{'} x}{x^{'} B 1}, \end{matrix}

(3)

i.e.,

q_{j} (x) = \frac{\sum_{i = 1}^{m} b_{i j} x_{i}}{\sum_{i = 1}^{m} \sum_{k = 1}^{m} b_{i k} x_{i}} .

(4)

Proof.

We start with the functional equation

(p^{'} x) \cdot (q {(x)}^{'} y) = x^{'} B y,

(5)

i.e., the two-period growth factor is equal to the product of the individual growth factors that were achieved in periods 1 and 2. To start, we substitute

y : = 1 = {(1, . . ., 1)}^{'}

and

x : = e_{i} = {(0, . . ., 0, \underset{i}{1}, 0, . . ., 0)}^{'},

which is the

i th

unit basis vector for

R^{m}

. This yields

p_{i} = e_{i}^{'} B 1 = \sum_{j = 1}^{m} b_{i j}

, as promised. Next, in the identity

q {(x)}^{'} y = \frac{x^{'} B y}{p^{'} x},

(6)

we put

y : = e_{j}

. This leaves us with

q_{j} (x) = \frac{\sum_{i = 1}^{m} b_{i j} x_{i}}{\sum_{i = 1}^{m} \sum_{k = 1}^{m} b_{i k} x_{i}},

(7)

which is the desired result. In order to be logically complete, we must substitute our expressions for p and

q (x)

into Equation (5) so as to verify that they turn it into an identity. Here you go:

{(B 1)}^{'} x \cdot {(\frac{B^{'} x}{x^{'} B 1})}^{'} y = x^{'} B y .

(8)

□
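Proposition 1 lends itself to a quick numerical check. The sketch below is a minimal illustration in plain Python; the helper names (`initial_portfolio`, `second_portfolio`, `bilinear_growth`) and the sample matrix and returns are hypothetical and chosen only for the demonstration. It computes p = B1 and q(x) = B'x / (p'x), and confirms that the product of the two single-period growth factors reproduces x'By.

```python
def initial_portfolio(B):
    """p = B 1: the row sums of B give the period-1 portfolio."""
    return [sum(row) for row in B]


def second_portfolio(B, x):
    """q(x) = B'x / (p'x): the period-2 portfolio after observing x."""
    m = len(B)
    p = initial_portfolio(B)
    wealth_after_period_1 = sum(p[i] * x[i] for i in range(m))
    return [sum(B[i][j] * x[i] for i in range(m)) / wealth_after_period_1
            for j in range(m)]


def bilinear_growth(B, x, y):
    """Two-period capital growth factor x' B y."""
    m = len(B)
    return sum(B[i][j] * x[i] * y[j] for i in range(m) for j in range(m))


# Hypothetical 2-asset example: the weights and gross returns are made up.
B = [[0.1, 0.4],
     [0.3, 0.2]]
x = [1.2, 0.9]
y = [0.8, 1.1]

p = initial_portfolio(B)
q = second_portfolio(B, x)
period_1 = sum(pi * xi for pi, xi in zip(p, x))  # p'x
period_2 = sum(qj * yj for qj, yj in zip(q, y))  # q(x)'y

print(period_1 * period_2, bilinear_growth(B, x, y))
```

Both p and q(x) are valid portfolios (non-negative and summing to one), and the two printed numbers agree up to floating-point rounding.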

Example 1.

Every constant-rebalanced portfolio (cf. with Thomas Cover (1991))

c : = {(c_{1}, . . ., c_{m})}^{'}

amounts to a bilinear trading strategy that is represented by the outer product

B : = c c^{'}

, i.e.,

b_{i j} : = c_{i} c_{j}

for all

i, j \in \{1, . . ., m\}

. Here, the constant-rebalanced portfolio c resolves to maintain the constant fraction

c_{i}

of wealth in each asset i at all times3, where

c_{i} \geq 0

and

\sum_{i = 1}^{m} c_{i} = 1

.

Example 2.

More generally, consider the trading strategy that always uses the portfolio

c : = {(c_{1}, . . ., c_{m})}^{'} \in Δ_{m}

in period 1 and then always uses the portfolio

d : = {(d_{1}, . . ., d_{m})}^{'} \in Δ_{m}

in period 2 (regardless of the observed value of x), where

Δ_{m} : = \{c \in R_{+}^{m} : \sum_{i = 1}^{m} c_{i} = 1\}

denotes the unit portfolio simplex in

R_{+}^{m}

. This scheme is a bilinear trading strategy that corresponds to the outer product

B : = c d^{'}

, i.e.,

b_{i j} : = c_{i} d_{j}

for all

i, j \in \{1, . . ., m\}

.

Example 3.

Every buy-and-hold strategy (that buys some initial portfolio

c : = {(c_{1}, . . ., c_{m})}^{'}

and holds it for two periods, without rebalancing) amounts to a bilinear trading strategy that is represented by the diagonal matrix

B : = diag (c_{1}, . . ., c_{m})

.
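Examples 1 to 3 can be illustrated with a small sketch. The helpers `outer`, `diag`, `dot`, and `bilinear_growth`, as well as the sample portfolios and returns, are hypothetical names and values introduced only for this demonstration; the script builds the three matrices and compares x'By with the growth factor one would expect from each trading scheme.

```python
def outer(c, d):
    """Outer product c d' as a nested list."""
    return [[ci * dj for dj in d] for ci in c]


def diag(c):
    """Diagonal matrix diag(c_1, ..., c_m)."""
    return [[c[i] if i == j else 0.0 for j in range(len(c))]
            for i in range(len(c))]


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def bilinear_growth(B, x, y):
    """Two-period capital growth factor x' B y."""
    m = len(B)
    return sum(B[i][j] * x[i] * y[j] for i in range(m) for j in range(m))


# Hypothetical portfolios and gross returns (made up for illustration).
c = [0.6, 0.4]
d = [0.25, 0.75]
x = [1.3, 0.8]
y = [0.9, 1.2]

crp = bilinear_growth(outer(c, c), x, y)        # Example 1: B = c c'
two_stage = bilinear_growth(outer(c, d), x, y)  # Example 2: B = c d'
buy_hold = bilinear_growth(diag(c), x, y)       # Example 3: B = diag(c)

print(crp, dot(c, x) * dot(c, y))
print(two_stage, dot(c, x) * dot(d, y))
print(buy_hold, sum(ci * xi * yi for ci, xi, yi in zip(c, x, y)))
```

Each pair of printed values coincides: rebalancing to c (or to d) multiplies the single-period factors, whereas buy-and-hold compounds each asset separately.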

Inspired by Ordentlich and Cover (1998) and Cover and Thomas (2006), we note that the concept of a bilinear trading strategy admits the following simple and lucid interpretation. Let an extremal strategy4 be defined by the simple trading scheme: in period 1, we put

100 %

of wealth into asset i, and then in period 2, we take all the proceeds and roll them over into asset j. Hence, there are

m^{2}

different extremal strategies

(i, j) \in \{1, . . ., m\} \times \{1, . . ., m\}

; since the

{(i, j)}^{t h}

extremal strategy yields a capital growth factor of

x_{i} y_{j}

, it therefore amounts to the bilinear trading strategy

B : = e_{i} e_{j}^{'}

, which is an extreme point of

B

. The general bilinear portfolio

B : = {[b_{i j}]}_{m \times m}

is uniquely representable as a convex combination

B = \sum_{i = 1}^{m} \sum_{j = 1}^{m} b_{i j} e_{i} e_{j}^{'}

(9)

of extremal strategies; this means that the practitioner of B has elected to invest the fraction

b_{i j}

of his initial dollar into each extremal strategy

(i, j)

. Thus, after the elapse of two periods, the investor’s total wealth will be equal to

\sum_{i = 1}^{m} \sum_{j = 1}^{m} b_{i j} x_{i} y_{j} = x^{'} B y .
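As an illustration of this interpretation, the following sketch applies it to the hot-stock market of the Introduction, under the assumption that each two-period block has x = (2, 1) and y = (1/2, 1) with asset 1 the stock and asset 2 cash. Indices in the code are 0-based, and the matrix B is a made-up example. The extremal strategy "stock, then cash" doubles the initial dollar, and a general B earns the b-weighted average of the four extremal growth factors.

```python
from itertools import product


def extremal_growth(i, j, x, y):
    """Growth factor x_i * y_j of the extremal strategy (i, j)."""
    return x[i] * y[j]


# Assumed two-period block of the hot-stock example: index 0 is the stock, 1 is cash.
x = [2.0, 1.0]  # first half: the stock doubles
y = [0.5, 1.0]  # second half: the stock is cut in half
m = 2

growth = {(i, j): extremal_growth(i, j, x, y)
          for i, j in product(range(m), repeat=2)}
best = max(growth, key=growth.get)

# A hypothetical bilinear portfolio, viewed as a mixture of extremal strategies.
B = [[0.1, 0.4],
     [0.3, 0.2]]
mixed = sum(B[i][j] * growth[(i, j)] for (i, j) in growth)

print(best, growth[best], mixed)
```

Here `best` is (0, 1), which corresponds to the bilinear portfolio e_1 e_2' in the 1-based notation of the text.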

3. Universal Bilinear Portfolios

We now consider the on-line learning of the asymptotically dominant (or growth-optimal) bilinear portfolio. To this end, we assume that there are T basic investment periods

t \in \{1, . . ., T\}

, each of which is divided into a “first half” (during which the gross return vector is

x_{t}

) and a “second half” (during which the gross return vector is

y_{t}

). We let

x^{t} : = (x_{1}, . . ., x_{t}) \in {(R_{+}^{m} - \{0\})}^{t}

denote the history of returns in the first halves of periods

1, . . ., t

, and, likewise, we let

y^{t} : = (y_{1}, . . ., y_{t}) \in {(R_{+}^{m} - \{0\})}^{t}

denote the return history for the latter halves of periods

1, . . ., t

. Thus, we have the transition laws

x^{t + 1} : = (x^{t}, x_{t + 1})

and

y^{t + 1} : = (y^{t}, y_{t + 1})

, where

x^{0}

and

y^{0}

denote empty histories. We let

\begin{matrix} W_{B} (x^{t}, y^{t}) : = \prod_{s = 1}^{t} x_{s}^{'} B y_{s} \end{matrix}

(10)

denote the final wealth function5 of the bilinear trading strategy B against the return history

(x^{t}, y^{t})

; similarly, we write

W_{B} (x^{t}, y^{t - 1}) : = (\prod_{s = 1}^{t - 1} x_{s}^{'} B y_{s}) \times (x_{t}^{'} B 1) = W_{B} (x^{t}, (y^{t - 1}, 1))

(11)

if period t has only been half-completed. We will consider sequential investment strategies

\hat{B} (•, •)

that, at the start of each period t, select some bilinear portfolio

\hat{B} (x^{t - 1}, y^{t - 1}) : = {[{\hat{b}}_{i j} (x^{t - 1}, y^{t - 1})]}_{m \times m}

that is conditioned on the observed return history

(x^{t - 1}, y^{t - 1})

; this bilinear portfolio will be used for the entire duration of period t. The capital growth factor achieved by an investment scheme

\hat{B} (•, •)

against the history

(x^{t}, y^{t})

is equal to

\hat{W} (x^{t}, y^{t}) : = \prod_{s = 1}^{t} x_{s}^{'} \hat{B} (x^{s - 1}, y^{s - 1}) y_{s},

(12)

and, if period t is only half-finished, we write

\hat{W} (x^{t}, y^{t - 1}) : = [\prod_{s = 1}^{t - 1} x_{s}^{'} \hat{B} (x^{s - 1}, y^{s - 1}) y_{s}] \times [x_{t}^{'} \hat{B} (x^{t - 1}, y^{t - 1}) 1] = \hat{W} (x^{t}, (y^{t - 1}, 1)) .

(13)

Within a given period t, the on-line behavior of

\hat{B} (•, •)

amounts to the portfolio vectors

\hat{p} (x^{t - 1}, y^{t - 1}) : = \hat{B} (x^{t - 1}, y^{t - 1}) 1

and

\hat{q} (x^{t}, y^{t - 1}) : = \frac{\hat{B} {(x^{t - 1}, y^{t - 1})}^{'} x_{t}}{x_{t}^{'} \hat{B} (x^{t - 1}, y^{t - 1}) 1} .

(14)

In order to have a practical benchmark for the on-line performance of

\hat{B} (•, •)

after the elapse of t complete investment periods, we will consider the best bilinear trading strategy in hindsight for the individual sequence

(x^{t}, y^{t})

:

\begin{matrix} B^{*} (x^{t}, y^{t}) : = \underset{B \in B}{\arg \max} W_{B} (x^{t}, y^{t}) \end{matrix}

(15)

and

B^{*} (x^{t}, y^{t - 1}) : = \underset{B \in B}{\arg \max} W_{B} (x^{t}, y^{t - 1}) = B^{*} (x^{t}, (y^{t - 1}, 1)) .

(16)

The final wealth that accrues to

B^{*} (x^{t}, y^{t})

is a path-dependent financial derivative, with payoff

\begin{matrix} D (x^{t}, y^{t}) : = \max_{B \in B} W_{B} (x^{t}, y^{t}) = W_{B^{*} (x^{t}, y^{t})} (x^{t}, y^{t}) \end{matrix}

(17)

and

D (x^{t}, y^{t - 1}) : = \max_{B \in B} W_{B} (x^{t}, y^{t - 1}) = D (x^{t}, (y^{t - 1}, 1)) .

(18)

Proposition 2.

The final wealth function

W_{B} (x^{T}, y^{T})

is a multilinear form in the vectors

x_{1}, y_{1},

x_{2}, y_{2}, . . ., x_{T}, y_{T}

, i.e., it is linear separately in each vector

x_{t}

and also in each vector

y_{t}

, for

1 \leq t \leq T .

Consequently, the hindsight-optimized final wealth

D (x^{T}, y^{T})

is convex and positively homogeneous separately in each

x_{t}

and also in each

y_{t}

.

Proof.

The multi-linearity of

W_{B} (•, •)

follows easily from the definition, e.g.,

W_{B} (x^{T}, y^{T}) = (\prod_{s = 1}^{t - 1} x_{s}^{'} B y_{s}) \cdot (x_{t}^{'} B y_{t}) \cdot (\prod_{s = t + 1}^{T} x_{s}^{'} B y_{s})

is clearly additive and homogeneous in

x_{t}

and also in

y_{t}

. If we write

D (x_{t})

and view

D (•, •)

as a function of

x_{t}

alone, then the convexity and homogeneity with respect to

x_{t}

(or with respect to

y_{t}

) follow from the fact that the mapping

x_{t} \mapsto D (x_{t})

is a pointwise maximum of a family of linear functions, namely,

{(W_{B} (x_{t}))}_{B \in B}

. □

For obvious reasons, the hindsight-optimized payoff

D (•, •)

is not achievable by any causal (or non-anticipating) investment strategy

\hat{B} (•, •)

; however, it is possible to achieve6 any average

\hat{W} (x^{t}, y^{t}) : = \int_{B \in B} W_{B} (x^{t}, y^{t}) f (B) d B,

(19)

where

f (•)

is a continuous density function over

B

. That is, inspired by Thomas Cover (1991) and Cover and Ordentlich (1996), we make the following definition.

Definition 2.

The universal bilinear portfolio (that corresponds to the prior density

f (•)

) is a performance-weighted average of all bilinear trading strategies:

\begin{matrix} \hat{B} (x^{t}, y^{t}) : = \frac{\int_{B \in B} B \cdot W_{B} (x^{t}, y^{t}) f (B) d B}{\int_{B \in B} W_{B} (x^{t}, y^{t}) f (B) d B} = \frac{E_{f} [B \cdot W_{B} (x^{t}, y^{t})]}{E_{f} [W_{B} (x^{t}, y^{t})]} . \end{matrix}

(20)

So-defined, the matrix

\hat{B} (•, •)

is indeed a valid bilinear portfolio, on account of the fact that

\hat{B} (x^{t}, y^{t}) \geq 0

and

1^{'} \hat{B} (x^{t}, y^{t}) 1 = 1

. The initial bilinear portfolio

\hat{B} (x^{0}, y^{0})

is equal to the center of mass

\int_{B \in B} B f (B) d B = E_{f} [B]

that is induced by the prior density

f (•)

.

Proposition 3.

After T complete investment periods, the universal wealth

\hat{W} (x^{T}, y^{T})

is equal to the average value

\begin{matrix} \hat{W} (x^{T}, y^{T}) = \int_{B \in B} W_{B} (x^{T}, y^{T}) f (B) d B = E_{f} [W_{B} (x^{T}, y^{T})] . \end{matrix}

(21)

Proof.

The gross return of the universal bilinear portfolio in period t is given by

\begin{matrix} x_{t}^{'} \hat{B} (x^{t - 1}, y^{t - 1}) y_{t} & = \frac{\int_{B \in B} (x_{t}^{'} B y_{t}) \cdot W_{B} (x^{t - 1}, y^{t - 1}) f (B) d B}{\int_{B \in B} W_{B} (x^{t - 1}, y^{t - 1}) f (B) d B} \\ & = \frac{\int_{B \in B} W_{B} (x^{t}, y^{t}) f (B) d B}{\int_{B \in B} W_{B} (x^{t - 1}, y^{t - 1}) f (B) d B} . \end{matrix}

(22)

Taking the (telescopic) product of both sides of Equation (22) for

t : = 1, . . ., T

, and bearing in mind that

W_{B} (x^{0}, y^{0}) = 1 = \int_{B \in B} f (B) d B

, we arrive at the desired result:

\hat{W} (x^{T}, y^{T}) = \int_{B \in B} W_{B} (x^{T}, y^{T}) f (B) d B

. □
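The telescoping argument can be seen at work in a discretized sketch. The assumptions here are mine, not the note's: the prior is replaced by a finite sample of bilinear portfolios drawn uniformly from the simplex (via normalized exponential variates), the return history is made up, and the names `sample_bilinear` and `universal_portfolio` are hypothetical. For such a finite mixture, the on-line wealth of the performance-weighted average equals the sample average of the individual wealths, which mirrors Equation (21).

```python
import random


def sample_bilinear(m, rng):
    """Draw B uniformly from the simplex of m*m non-negative weights summing to one."""
    e = [rng.expovariate(1.0) for _ in range(m * m)]
    s = sum(e)
    return [[e[i * m + j] / s for j in range(m)] for i in range(m)]


def growth(B, x, y):
    """Two-period capital growth factor x' B y."""
    m = len(B)
    return sum(B[i][j] * x[i] * y[j] for i in range(m) for j in range(m))


def universal_portfolio(samples, wealths):
    """Performance-weighted average of the sampled strategies (discretized Eq. (20))."""
    m = len(samples[0])
    total = sum(wealths)
    return [[sum(w * B[i][j] for B, w in zip(samples, wealths)) / total
             for j in range(m)] for i in range(m)]


rng = random.Random(0)
m, n_samples = 2, 500
samples = [sample_bilinear(m, rng) for _ in range(n_samples)]

# Hypothetical return history of (x_t, y_t) pairs.
history = [([2.0, 1.0], [0.5, 1.0]),
           ([1.1, 1.0], [0.9, 1.0]),
           ([2.0, 1.0], [0.5, 1.0])]

wealths = [1.0] * n_samples  # W_B(x^0, y^0) = 1 for every sampled B
online_wealth = 1.0
for x, y in history:
    B_hat = universal_portfolio(samples, wealths)  # depends only on past returns
    online_wealth *= growth(B_hat, x, y)
    wealths = [w * growth(B, x, y) for B, w in zip(samples, wealths)]

average_wealth = sum(wealths) / n_samples
print(online_wealth, average_wealth)
```

The two printed values agree up to rounding. A finite sample is only a stand-in for the integral over all bilinear portfolios, so the numbers approximate, rather than equal, the wealth of the continuous-prior strategy.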

Following Cover (1991) and Cover and Thomas (2006), the intuition behind the universal bilinear portfolio is just this: we distribute the initial dollar (according to

f (•)

) among all the bilinear trading strategies

B \in B

, whereby the bilinear portfolios in the neighborhood of a given B receive

f (B) d B

dollars to manage (from now until kingdom come). After the elapse of t complete investment periods, the bilinear strategies in this locale have grown their bankroll to

W_{B} (x^{t}, y^{t}) f (B) d B

; the investor’s aggregate wealth is thereby equal to

\int_{B \in B} W_{B} (x^{t}, y^{t}) f (B) d B .

With this intuition in hand, the formula for

\hat{B} (x^{t}, y^{t})

can be written down immediately, on account of the fact that the locale of a given B is responsible for managing the fraction

ϕ (B) d B : = W_{B} (x^{t}, y^{t}) f (B) d B / \int_{B \in B} W_{B} (x^{t}, y^{t}) f (B) d B

of the aggregate wealth.

Hence, the overall bilinear portfolio is just the convex combination

\hat{B} = \int_{B \in B} B \cdot ϕ (B) d B

. Over long periods of time, the bilinear trading strategies in the neighborhood of

B^{*} (x^{t}, y^{t})

will come to control an ever-greater share of the aggregate wealth, on account of their superior exponential growth rate, namely

(1 / t) \log D (x^{t}, y^{t})

. Thus, the aggregate bankroll will (asymptotically) compound itself at this same rate; that is, we have the relation

\begin{matrix} \lim_{t \to \infty} (\text{Excess Growth Rate of the Best Bilinear Portfolio in Hindsight}) \\ = \lim_{t \to \infty} [\frac{\log D (x^{t}, y^{t})}{t} - \frac{\log \hat{W} (x^{t}, y^{t})}{t}] = 0, \end{matrix}

(23)

regardless7 of the individual return sequence

ω : = {(x_{t}, y_{t})}_{t = 1}^{\infty}

. The remainder of the paper is concerned with fleshing out the necessary details. On that score, we make the definition:

Definition 3.

The competitive ratio

R (x^{T}, y^{T})

measures the percentage of hindsight-optimized bilinear wealth that was actually achieved by the universal bilinear portfolio, i.e.,

R (x^{T}, y^{T}) : = \frac{\hat{W} (x^{T}, y^{T})}{D (x^{T}, y^{T})} = \frac{\text{average value of } W_{B} (x^{T}, y^{T})}{\text{maximum value of } W_{B} (x^{T}, y^{T})} .

(24)

Lemma 1.

The competitive ratio

R (•, •)

is always

\leq 1

; it is homogeneous of degree 0 and quasi-concave separately in each vector

x_{t}

and also in each vector

y_{t}

.

Proof.

The fact that

R (x^{T}, y^{T}) \leq 1

follows immediately from the fact that any convex combination (or weighted average) of the numbers

{(W_{B} (x^{T}, y^{T}))}_{B \in B}

cannot exceed their maximum. The homogeneity of degree 0 follows from the fact that

W_{B} (•, •)

and

D (•, •)

are both linearly homogeneous (of degree 1) in each vector

x_{t}

or

y_{t}

. The multi-quasi-concavity obtains from the fact that, when viewed as a function

R (x_{t})

of

x_{t}

alone (or of

y_{t}

alone), we are dealing with the ratio of a positive linear function (namely,

\hat{W} (x_{t})

) to a positive convex function (viz.,

D (x_{t})

). That is, if we consider the upper contour sets

U_{α} : = \{x_{t} \in R_{+}^{m} : R (x_{t}) \geq α\} = \{x_{t} \in R_{+}^{m} : \hat{W} (x_{t}) - α D (x_{t}) \geq 0\},

(25)

then we see that

U_{α}

is a convex set for all

α \in R

. For, if

α \leq 0

, then

U_{α} = R_{+}^{m}

, which is convex; if

α \geq 0

, then

U_{α}

is convex because it is an upper contour set of the concave function

x_{t} \mapsto \hat{W} (x_{t}) - α D (x_{t})

. □

On account of the (multi-) homogeneity of degree 0, the competitive ratio only cares about the directions of the vectors

x_{t}

or

y_{t}

—their lengths do not affect the relative performance of the universal bilinear portfolio. Thus, we are free to scale each

x_{t}

(resp.

y_{t}

) by a factor of

λ : = 1 / | | x_{t} {| |}_{1}

(resp.

1 / | | y_{t} {| |}_{1}

), so that the coordinates of

x_{t}

(resp.

y_{t}

) sum to one, i.e., we may assume that each

x_{t}

or

y_{t}

belongs to the unit simplex

Δ_{m}

. Hence, we have the relation

\underline{R} (x^{T}, y^{T}) : = \inf_{(x^{T}, y^{T}) \in {(R_{+}^{m} - \{0\})}^{2 T}} R (x^{T}, y^{T}) = \min_{(x^{T}, y^{T}) \in Δ_{m}^{2 T}} R (x^{T}, y^{T}),

(26)

i.e., the worst-case8 relative performance

\underline{R} (x^{T}, y^{T})

is achieved over the product of simplices

Δ_{m}^{2 T}

. Even better, since

R (•, •)

is multi-quasi-concave, its minimum value must in fact be realized at some extreme point

(x^{T}, y^{T}) \in \{e_{1}, . . ., e_{m}\}^{2 T}

, i.e., a return history whereby all

x_{t}, y_{t}

are unit basis vectors. This happens on account of the fact that when

R (•, •)

is viewed as a function solely of

x_{t} \in Δ_{m}

(or solely of

y_{t} \in Δ_{m}

), we have

R (x_{t}) = R (x_{t 1} e_{1} + \cdot \cdot \cdot + x_{t m} e_{m}) \geq \min \{R (e_{1}), . . ., R (e_{m})\} = R (e_{i^{*}}),

(27)

so that the competitive ratio can always be reduced by replacing any

x_{t}

or

y_{t}

by an appropriate unit basis vector

e_{i^{*}}

.

In what follows, we will consider sequences of unit basis vectors

x^{T} : = (e_{i_{1}}, . . ., e_{i_{T}})

and

y^{T} : = (e_{j_{1}}, . . ., e_{j_{T}})

, where

i^{T} : = (i_{1}, . . ., i_{T}) \in \{1, . . ., m\}^{T}

and

j^{T} : = (j_{1}, . . ., j_{T}) \in \{1, . . ., m\}^{T}

. For the sake of simplicity, we will abuse notation by writing the (self-evident) expressions

R (i^{T}, j^{T})

,

\hat{W} (i^{T}, j^{T})

, and

D (i^{T}, j^{T})

. Sequences of unit basis vectors will hereby be referred to as extremal sequences, or Kelly horse race sequences, on account of the fact that they correspond to betting markets (say, horse races or prediction markets) whereby only one of the m assets has a positive gross return. For a given Kelly sequence

(i^{T}, j^{T})

, we will require the counts (absolute frequencies)

n_{i j} (i^{T}, j^{T}) : = (\text{number of times } (i_{t}, j_{t}) = (i, j)) = \sum_{\{t : (i_{t}, j_{t}) = (i, j)\}} 1,

(28)

so that

n_{i j} \geq 0

and

\sum_{i = 1}^{m} \sum_{j = 1}^{m} n_{i j} = T

.

Lemma 2.

For any Kelly sequence

(i^{T}, j^{T})

, the final wealth of the best bilinear trading strategy in hindsight is equal to

D ({[n_{i j}]}_{i, j = 1}^{m}) = \prod_{(i, j) : n_{i j} > 0} {(n_{i j} / T)}^{n_{i j}}

; the universal wealth

\hat{W} (i^{T}, j^{T})

admits the minorant

\hat{W} (i^{T}, j^{T}) \geq \frac{\underline{f}}{(T + m^{2} - 1)!} \prod_{i = 1}^{m} \prod_{j = 1}^{m} n_{i j}!,

(29)

where

\underline{f} : = \min_{B \in B} f (B)

is the minimum weight assigned to any bilinear portfolio by the prior density

f (•)

.

Proof.

Against the Kelly sequence

(i^{T}, j^{T})

, the final wealth of the bilinear trading strategy B is given by

W_{B} (i^{T}, j^{T}) = \prod_{t = 1}^{T} b_{i_{t} j_{t}} = \prod_{(i, j) : n_{i j} > 0} b_{i j}^{n_{i j} (i^{T}, j^{T})} .

(30)

Maximization of this quantity with respect to B amounts to a standard Cobb-Douglas optimization problem over the unit simplex in

R_{+}^{m^{2}}

. Lagrange’s multipliers yield the solution

b_{i j}^{*} = n_{i j} / T

, so that

D (i^{T}, j^{T}) = \prod_{(i, j) : n_{i j} > 0} {(n_{i j} / T)}^{n_{i j}}

.

The stated minorant for

\hat{W} (i^{T}, j^{T})

will be gotten by direct integration of

W_{B} (i^{T}, j^{T})

over the set of bilinear trading strategies. To this end, we will identify

B

with the solid region

\{(b_{11}, . . ., b_{1 m}, b_{21}, . . ., b_{2 m}, . . ., b_{m 1}, . . ., b_{m, m - 1}) \in R_{+}^{m^{2} - 1} : b_{11} + \cdot \cdot \cdot + b_{m, m - 1} \leq 1\},

(31)

where

b_{m m} = 1 - b_{11} - \cdot \cdot \cdot - b_{m, m - 1}

is not a free variable. Thus, we must evaluate the

(m^{2} - 1)

-fold integral

\int_{b_{11} = 0}^{1} \int_{b_{12} = 0}^{1 - b_{11}} \cdot \cdot \cdot \int_{b_{m, m - 1} = 0}^{1 - b_{11} - \cdot \cdot \cdot - b_{m, m - 2}} [\prod_{(i, j) \neq (m, m)} b_{i j}^{n_{i j}}] {[1 - \sum_{(i, j) \neq (m, m)} b_{i j}]}^{n_{m m}} f (B) d b_{m, m - 1} \cdot \cdot \cdot d b_{11} .

(32)

Using the fact that

f (B) \geq \underline{f}

, and recalling the general identity9

\begin{matrix} \int_{z_{1} = 0}^{1} \int_{z_{2} = 0}^{1 - z_{1}} \cdot \cdot \cdot \int_{z_{k - 1} = 0}^{1 - z_{1} - \cdot \cdot \cdot - z_{k - 2}} z_{1}^{α_{1}} z_{2}^{α_{2}} \cdot \cdot \cdot z_{k - 1}^{α_{k - 1}} {(1 - z_{1} - z_{2} - \cdot \cdot \cdot - z_{k - 1})}^{α_{k}} d z_{k - 1} \cdot \cdot \cdot d z_{2} d z_{1} \\ = \frac{Γ (α_{1} + 1) Γ (α_{2} + 1) \cdot \cdot \cdot Γ (α_{k} + 1)}{Γ (α_{1} + α_{2} + \cdot \cdot \cdot + α_{k} + k)}, \end{matrix}

(33)

where

Γ (•)

is the gamma function, we put

k : = m^{2}

and obtain

\hat{W} (i^{T}, j^{T}) \geq \underline{f} \cdot \frac{\prod_{i = 1}^{m} \prod_{j = 1}^{m} Γ (n_{i j} + 1)}{Γ (m^{2} + \sum_{i = 1}^{m} \sum_{j = 1}^{m} n_{i j})} = \underline{f} \cdot \frac{\prod_{i = 1}^{m} \prod_{j = 1}^{m} n_{i j}!}{(T + m^{2} - 1)!},

(34)

as promised. □
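The quantities in Lemma 2 are easy to evaluate exactly. The sketch below uses hypothetical counts for m = 2 and assumes a uniform prior, for which the density is the constant 3! = 6 (the reciprocal of the volume of the three-dimensional solid simplex), so that the minorant (29) actually holds with equality. The function names are illustrative. A midpoint-rule estimate of the k = 2 case of identity (33) is included as a sanity check.

```python
from fractions import Fraction
from math import factorial


def hindsight_wealth(counts):
    """D = product of (n_ij / T)^(n_ij) over the cells with n_ij > 0."""
    T = sum(counts)
    D = Fraction(1)
    for n in counts:
        if n > 0:
            D *= Fraction(n, T) ** n
    return D


def wealth_lower_bound(counts, f_min):
    """Right-hand side of (29): f_min * prod n_ij! / (T + k - 1)!, with k = m^2 cells."""
    T, k = sum(counts), len(counts)
    numerator = 1
    for n in counts:
        numerator *= factorial(n)
    return Fraction(f_min) * Fraction(numerator, factorial(T + k - 1))


def beta_integral_midpoint(a, b, steps=20000):
    """Midpoint-rule estimate of the k = 2 case of (33): integral of z^a (1-z)^b on [0, 1]."""
    h = 1.0 / steps
    return sum(((s + 0.5) * h) ** a * (1.0 - (s + 0.5) * h) ** b
               for s in range(steps)) * h


# Hypothetical counts for m = 2, flattened as (n_11, n_12, n_21, n_22); here T = 10.
counts = [3, 5, 0, 2]
D = hindsight_wealth(counts)

# Assumption: uniform prior on the 3-dimensional solid simplex, so f is constant at 3! = 6.
bound = wealth_lower_bound(counts, f_min=6)

print(float(D), float(bound), float(bound / D))
print(beta_integral_midpoint(3, 5), factorial(3) * factorial(5) / factorial(9))
```

The ratio `bound / D` is the competitive ratio for this particular Kelly sequence under the uniform prior; it lies between the uniform lower bound of Corollary 1 and 1.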

Corollary 1.

The competitive ratio has the following (uniform) bounds, for all

x^{T}, y^{T}

:

1 \geq R (x^{T}, y^{T}) \geq \frac{\underline{f}}{(T + 1) (T + 2) \cdot \cdot \cdot (T + m^{2} - 1)} \sim \frac{\underline{f}}{T^{m^{2} - 1}} .

(35)

Here, the relation ∼ signifies that the two sequences are asymptotically equivalent, e.g.,

a_{n} \sim b_{n}

means that

\lim_{n \to \infty} a_{n} / b_{n} = 1

.

Hence, the excess continuously-compounded per-period growth rate10 of the best bilinear portfolio in hindsight (namely,

- (1 / T) \log R (x^{T}, y^{T})

) is sandwiched by

0 \leq \text{Excess Growth Rate} \leq \frac{\log (1 / \underline{f})}{T} + \frac{1}{T} \sum_{j = 1}^{m^{2} - 1} \log (T + j) .

(36)

That is, at worst, the excess growth rate is asymptotically equivalent to the quantity

(m^{2} - 1) \log (T) / T

.
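To get a feel for how quickly the bound (36) decays, the short Python sketch below evaluates its right-hand side alongside the leading term (m^2 - 1) log(T) / T. It is only an illustration: the function names are hypothetical, and any numerical values chosen for m and for the density lower bound (for example m = 2 and a lower bound of 6, as under a uniform prior on two assets) are assumptions made for the sake of the example.

```python
from math import log


def excess_growth_upper_bound(T, m, f_lower):
    """Right-hand side of (36): log(1/f_lower)/T + (1/T) * sum_{j=1}^{m^2-1} log(T + j)."""
    return (log(1.0 / f_lower) + sum(log(T + j) for j in range(1, m * m))) / T


def leading_term(T, m):
    """The asymptotically equivalent quantity (m^2 - 1) * log(T) / T."""
    return (m * m - 1) * log(T) / T
```

Calling both functions for T = 10, 100, 1000, ... shows the two quantities shrinking together, with their ratio approaching one only slowly (the gap is of order 1 / log T).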

Proof.

For any Kelly sequence

(i^{T}, j^{T})

, Lemma 1 implies that

R (i^{T}, j^{T}) = \frac{\hat{W} (i^{T}, j^{T})}{\prod_{(i, j) : n_{i j} > 0} {(n_{i j} / T)}^{n_{i j}}} \geq \underline{f} \cdot \frac{T^{T}}{(T + m^{2} - 1)!} \prod_{i = 1}^{m} \prod_{j = 1}^{m} \frac{n_{i j}!}{n_{i j}^{n_{i j}}},

(37)

where the right-hand side makes use of the convention that

0^{0} : = 1

. Now, note that the integer program

\min_{\{[n_{i j}] \geq 0 : \sum_{i = 1}^{m} \sum_{j = 1}^{m} n_{i j} = T\}} \prod_{i = 1}^{m} \prod_{j = 1}^{m} \frac{n_{i j}!}{n_{i j}^{n_{i j}}}

(38)

is solved by setting any entry of the matrix

{[n_{i j}]}_{m \times m}

to T and setting all the other entries to zero, i.e., we have the well-known inequality (cf. Cover and Ordentlich (1996))

\prod_{i = 1}^{m} \prod_{j = 1}^{m} \frac{n_{i j}!}{n_{i j}^{n_{i j}}} \geq \frac{T!}{T^{T}} .

(39)

Hence, we obtain

R (x^{T}, y^{T}) \geq \min_{(i^{T}, j^{T}) \in \{1, . . ., m\}^{2 T}} R (i^{T}, j^{T}) \geq \frac{\underline{f}}{(T + 1) (T + 2) \cdot \cdot \cdot (T + m^{2} - 1)} .

(40)

□
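The integer program (38) can be checked by brute force for small instances. The sketch below (an illustration with hypothetical function names, not the author's code) enumerates every non-negative integer matrix with entries summing to T, flattened to a tuple of m^2 counts, and evaluates the objective exactly. It relies on Python evaluating `0 ** 0` as 1, which matches the convention used in (37).

```python
from fractions import Fraction
from math import factorial


def compositions(total, parts):
    """Yield all tuples of `parts` non-negative integers that sum to `total`."""
    if parts == 1:
        yield (total,)
        return
    for first in range(total + 1):
        for rest in compositions(total - first, parts - 1):
            yield (first,) + rest


def objective(counts):
    """Product of n! / n^n over the entries, with the convention 0^0 = 1."""
    value = Fraction(1)
    for n in counts:
        value *= Fraction(factorial(n), n ** n)
    return value


def brute_force_minimum(T, m):
    """Minimum of the objective in (38) over all m-by-m count matrices summing to T."""
    return min(objective(c) for c in compositions(T, m * m))
```

For small T and m, the minimum returned agrees with T! / T^T, the value attained by concentrating all T counts in a single cell, as inequality (39) asserts.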

Theorem 1.

The universal bilinear portfolio asymptotically dominates the original (1-linear) universal portfolio in precisely the same technical sense that the universal 1-linear portfolio asymptotically dominates all constant-rebalanced portfolios and all buy-and-hold strategies.

If it turns out that the best bilinear trading strategy in hindsight sustains a higher asymptotic capital growth rate than the best constant-rebalanced portfolio in hindsight, then the universal bilinear portfolio will asymptotically outperform the universal 1-linear portfolio by an exponential factor.

Proof.

We let

\hat{S} (x^{t}, y^{t}) : = \int_{c \in Δ_{m}} [\prod_{s = 1}^{t} (c^{'} x_{s}) (c^{'} y_{s})] g (c) d c = E_{g} [\prod_{s = 1}^{t} (c^{'} x_{s}) (c^{'} y_{s})]

(41)

denote the wealth of the universal 1-linear portfolio (cf. with Thomas Cover (1991) and Cover and Ordentlich (1996)) after the elapse of t complete investment periods, where

Δ_{m}

is the unit portfolio simplex in

R_{+}^{m}

and

g (•)

is a prior density over

Δ_{m}

. The final wealth of the best constant-rebalanced portfolio in hindsight will be denoted

S^{*} (x^{t}, y^{t}) : = \max_{c \in Δ_{m}} \prod_{s = 1}^{t} (c^{'} x_{s}) (c^{'} y_{s}) .

(42)

On account of the lower bound

\frac{\hat{W} (x^{t}, y^{t})}{\hat{S} (x^{t}, y^{t})} = \frac{\hat{W} (x^{t}, y^{t})}{D (x^{t}, y^{t})} \cdot \frac{D (x^{t}, y^{t})}{S^{*} (x^{t}, y^{t})} \cdot \frac{S^{*} (x^{t}, y^{t})}{\hat{S} (x^{t}, y^{t})} \geq \frac{\underline{f}}{\prod_{j = 1}^{m^{2} - 1} (t + j)} \cdot \frac{D (x^{t}, y^{t})}{S^{*} (x^{t}, y^{t})} \cdot 1,

(43)

we can minorize the asymptotic excess growth rate (of the universal bilinear portfolio relative to the universal 1-linear portfolio) as follows:

\begin{matrix} \underset{t \to \infty}{\lim \inf} [\frac{\log \hat{W} (x^{t}, y^{t})}{t} - \frac{\log \hat{S} (x^{t}, y^{t})}{t}] \\ \geq \underset{t \to \infty}{\lim \inf} (1 / t) \log (\underline{f} / \prod_{j = 1}^{m^{2} - 1} (t + j)) + \underset{t \to \infty}{\lim \inf} [\frac{\log D (x^{t}, y^{t})}{t} - \frac{\log S^{*} (x^{t}, y^{t})}{t}] \\ = \underset{t \to \infty}{\lim \inf} [\frac{\log D (x^{t}, y^{t})}{t} - \frac{\log S^{*} (x^{t}, y^{t})}{t}] \geq 0, \end{matrix}

(44)

where we have made use of the fact that the relations

S^{*} (x^{t}, y^{t}) \geq \hat{S} (x^{t}, y^{t})

and

D (x^{t}, y^{t}) \geq S^{*} (x^{t}, y^{t})

hold for all

x^{t}

and all

y^{t}

.

Thus, we have shown that even the smallest subsequential limit of the excess growth rate

(1 / t) \log (\hat{W} / \hat{S})

is non-negative; if the best bilinear trading strategy in hindsight happens to achieve a higher asymptotic growth rate than the best constant-rebalanced portfolio in hindsight11 (in the sense that the smallest subsequential limit of

(1 / t) \log (D / S^{*})

is strictly positive), then the universal bilinear portfolio will asymptotically outperform the universal 1-linear portfolio by an exponential factor. □

Resolution of the Motivating Example

To close out the paper, this subsection provides exact formulas for the behavior of the universal bilinear portfolio in the context of our original motivating example (as discussed in the introduction) for the case of

m : = 2

assets. Accordingly, we will assume that asset 2 is cash (which pays no interest) and that asset 1 is a “hot stock” that always doubles in the first half of each investment period and then loses

50 %

of its value in the latter half of each investment period. Thus, we have the individual return sequence defined by

x_{t} : \equiv {(2, 1)}^{'}

and

y_{t} : \equiv {(1 / 2, 1)}^{'}

. The set of all bilinear trading strategies is now a family of

2 \times 2

matrices

B : = \{(b_{11}, b_{12}, b_{21}, b_{22}) \in R_{+}^{4} : b_{11} + b_{12} + b_{21} \leq 1, b_{22} = 1 - b_{11} - b_{12} - b_{21}\},

(45)

where the variable

b_{22}

is bound by the relation

b_{22} : = 1 - b_{11} - b_{12} - b_{21}

. As depicted in Figure 1, this set of matrices amounts to a tetrahedron in

R_{+}^{3}

.

Analogous to Thomas Cover (1991), we will use the uniform prior density

f (b_{11}, b_{12}, b_{21}) \equiv 6

, since the volume of the tetrahedron

B

is given by

Volume (B) = \int_{b_{11} = 0}^{1} \int_{b_{12} = 0}^{1 - b_{11}} \int_{b_{21} = 0}^{1 - b_{11} - b_{12}} d b_{21} d b_{12} d b_{11} = \frac{1}{6} .

(46)

During each (complete) investment period, the (intra-period) capital growth factor achieved by the bilinear trading strategy B amounts to

x_{t}^{'} B y_{t} = [\begin{matrix} 2 & 1 \end{matrix}] [\begin{matrix} b_{11} & b_{12} \\ b_{21} & 1 - b_{11} - b_{12} - b_{21} \end{matrix}] [\begin{matrix} 1 / 2 \\ 1 \end{matrix}] = 1 + b_{12} - \frac{b_{21}}{2},

(47)

so that

W_{B} (x^{t}, y^{t}) = {(1 + b_{12} - b_{21} / 2)}^{t}

. Thus, the universal wealth

\hat{W} (x^{t}, y^{t})

that obtains after the elapse of t complete investment periods is found by evaluating the triple integral

\begin{matrix} 6 \int_{b_{11} = 0}^{1} \int_{b_{12} = 0}^{1 - b_{11}} \int_{b_{21} = 0}^{1 - b_{11} - b_{12}} {(1 + b_{12} - \frac{b_{21}}{2})}^{t} d b_{21} d b_{12} d b_{11} \\ = \frac{2^{t + 5} - 12 (t + 2) - 2^{1 - t}}{(t + 1) (t + 2) (t + 3)} \sim \frac{32}{t^{3}} \cdot 2^{t} . \end{matrix}

(48)
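The closed form in (48) can be cross-checked exactly. The sketch below (an illustration, with hypothetical function names) expands the integrand multinomially and integrates each monomial with the Dirichlet identity (33), taking k = 4 and the uniform density f ≡ 6; it then compares the result with the stated closed form using rational arithmetic.

```python
from fractions import Fraction
from math import factorial


def uniform_moment(b, c):
    """E[b12^b * b21^c] under the uniform density f = 6 on the tetrahedron.

    This is 6 times the Dirichlet integral (33) with k = 4 and exponents (0, b, c, 0).
    """
    return Fraction(6 * factorial(b) * factorial(c), factorial(b + c + 3))


def universal_wealth_exact(t):
    """Expand (1 + b12 - b21/2)^t multinomially and integrate term by term."""
    total = Fraction(0)
    for b in range(t + 1):
        for c in range(t - b + 1):
            a = t - b - c  # exponent of the constant term 1
            coefficient = Fraction(factorial(t), factorial(a) * factorial(b) * factorial(c))
            total += coefficient * Fraction(-1, 2) ** c * uniform_moment(b, c)
    return total


def universal_wealth_closed_form(t):
    """Closed form (48): [2^(t+5) - 12(t+2) - 2^(1-t)] / [(t+1)(t+2)(t+3)]."""
    numerator = 2 ** (t + 5) - 12 * (t + 2) - Fraction(2, 2 ** t)
    return numerator / ((t + 1) * (t + 2) * (t + 3))
```

At t = 1 both functions return 9/8, which is simply the average one-period growth factor 1 + 1/4 - 1/8 under the uniform prior.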

The best bilinear trading strategy in hindsight is obviously

B^{*} (x^{t}, y^{t}) \equiv [\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix}],

(49)

i.e., the extremal strategy that bets the ranch on the stock in the first half of each investment period, and then cashes out completely in the latter half of each investment period. This (perfect trading) yields the hindsight-optimized wealth

D (x^{t}, y^{t}) = D (x^{t}, y^{t - 1}) = 2^{t}

, which corresponds to the asymptotic growth rate

\lim_{t \to \infty} (1 / t) \log D (x^{t}, y^{t}) = \log 2 = 69.3 %

per complete investment period, compounded continuously. The competitive ratio after t full periods is equal to

R (x^{t}, y^{t}) = \frac{32 - 12 (t + 2) 2^{- t} - 2^{1 - 2 t}}{(t + 1) (t + 2) (t + 3)} \sim \frac{32}{t^{3}} .

(50)

Note well that Corollary 1 promised us the minorant

R (x^{t}, y^{t}) \geq \frac{6}{(t + 1) (t + 2) (t + 3)},

(51)

which is indeed correct; we of course have

\lim_{t \to \infty} (1 / t) \log R (x^{t}, y^{t}) = 0

, so that the universal bilinear portfolio compounds its money at the same asymptotic rate as the best bilinear trading strategy in hindsight.
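The comparison between (50) and (51) is easy to reproduce numerically. The following Python sketch (illustrative only; the function names are hypothetical) evaluates the competitive ratio exactly, checks it against the minorant promised by Corollary 1, and computes the per-period excess growth rate -(1/t) log R.

```python
from fractions import Fraction
from math import log


def competitive_ratio(t):
    """Exact value of R(x^t, y^t) from (50)."""
    numerator = 32 - 12 * (t + 2) * Fraction(1, 2 ** t) - Fraction(2, 4 ** t)
    return numerator / ((t + 1) * (t + 2) * (t + 3))


def corollary_minorant(t):
    """Lower bound (51), i.e., Corollary 1 with m = 2 and density lower bound 6."""
    return Fraction(6, (t + 1) * (t + 2) * (t + 3))


def excess_growth_rate(t):
    """-(1/t) log R(x^t, y^t): growth shortfall per period relative to hindsight (t >= 1)."""
    return -log(float(competitive_ratio(t))) / t
```

The two expressions coincide at t = 0 (both equal 1), after which the exact ratio sits above the minorant by a factor that approaches 32/6.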

Against this individual return sequence, the universal bilinear portfolio finds its expression in the triple integral

\frac{6}{\hat{W} (x^{t}, y^{t})} \int_{b_{11} = 0}^{1} \int_{b_{12} = 0}^{1 - b_{11}} \int_{b_{21} = 0}^{1 - b_{11} - b_{12}} {(1 + b_{12} - \frac{b_{21}}{2})}^{t} [\begin{matrix} b_{11} & b_{12} \\ b_{21} & 1 - b_{11} - b_{12} - b_{21} \end{matrix}] d b_{21} d b_{12} d b_{11} .

(52)

With some effort, one can explicitly evaluate the on-line bilinear weights, as follows:

{\hat{b}}_{11} (x^{t}, y^{t}) = \frac{2 \cdot 4^{t + 2} - 3 \cdot 2^{t} (t^{2} + 5 t + 10) + 1}{(t + 4) [4^{t + 2} - 3 \cdot 2^{t + 1} (t + 2) - 1]} \sim \frac{2}{t} \to 0,

(53)

{\hat{b}}_{12} (x^{t}, y^{t}) = \frac{2^{t + 4} (3 t - 4) + 18 (t + 4) + 2^{- t}}{3 (t + 4) [2^{t + 4} - 6 (t + 2) - 2^{- t}]} \to 1,

(54)

{\hat{b}}_{21} (x^{t}, y^{t}) = \frac{2^{t + 6} - 36 (t + 1) - 2^{- t} (3 t + 19)}{3 (t + 4) [2^{t + 4} - 6 (t + 2) - 2^{- t}]} \sim \frac{4}{3 t} \to 0,

(55)

{\hat{b}}_{22} (x^{t}, y^{t}) = {\hat{b}}_{11} (x^{t}, y^{t}) \sim \frac{2}{t} \to 0 .

(56)

Notice that the

(1, 1)

and

(2, 2)

extremal strategies (which both amount to buy-and-hold strategies) are assigned equal weights by the universal bilinear portfolio (in the sense that

{\hat{b}}_{22} = {\hat{b}}_{11}

); this happens on account of the fact that both assets produce identical results for a buy-and-hold investor over any complete investment period.
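The closed forms (53) through (56) can be evaluated exactly to confirm that the four weights always sum to one and that the weight on the (1, 2) extremal strategy tends to one. The sketch below is an illustration with a hypothetical function name; it simply transcribes the formulas using rational arithmetic.

```python
from fractions import Fraction


def universal_bilinear_weights(t):
    """Evaluate the closed forms (53)-(56) exactly; returns a 2x2 nested list."""
    inv = Fraction(1, 2 ** t)  # the quantity 2^(-t)
    common = 3 * (t + 4) * (2 ** (t + 4) - 6 * (t + 2) - inv)
    b11 = Fraction(
        2 * 4 ** (t + 2) - 3 * 2 ** t * (t * t + 5 * t + 10) + 1,
        (t + 4) * (4 ** (t + 2) - 3 * 2 ** (t + 1) * (t + 2) - 1),
    )
    b12 = (2 ** (t + 4) * (3 * t - 4) + 18 * (t + 4) + inv) / common
    b21 = (2 ** (t + 6) - 36 * (t + 1) - inv * (3 * t + 19)) / common
    b22 = b11  # equation (56)
    return [[b11, b12], [b21, b22]]
```

At t = 0 every entry equals 1/4 (the mean of the uniform prior), and at t = 1 the weights are 11/45, 13/45, 10/45 and 11/45.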

Thus, the universal bilinear portfolio learns to trade perfectly inasmuch as

\lim_{t \to \infty} \hat{B} (x^{t}, y^{t}) = [\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix}] .

(57)

The same cannot be said for the universal 1-linear portfolio, which achieves the capital growth factor12

\begin{matrix} \hat{S} (x^{t}, y^{t}) & = \int_{c = 0}^{1} {(1 + c)}^{t} {(1 - c / 2)}^{t} d c \\ & = \sum_{k_{1} = 0}^{t} \sum_{k_{2} = 0}^{t - k_{1}} \binom{t}{k_{1}, k_{2}, t - k_{1} - k_{2}} \frac{{(- 1)}^{k_{2}}}{2^{k_{1} + k_{2}} (k_{1} + 2 k_{2} + 1)} . \end{matrix}

(58)

After t complete investment periods, the best constant-rebalanced portfolio in hindsight is equal to

{(1 / 2, 1 / 2)}^{'}

, which corresponds to the (sub-optimal) bilinear trading strategy

B = [\begin{matrix} 1 / 4 & 1 / 4 \\ 1 / 4 & 1 / 4 \end{matrix}] .

The final wealth of the best constant-rebalanced portfolio in hindsight is thereby

S^{*} (x^{t}, y^{t}) = {(9 / 8)}^{t}

. Thus, the excess asymptotic growth rate of the universal bilinear portfolio (over and above that of the universal 1-linear portfolio) is

\log 2 - \log (9 / 8) = 57.5 %

per (complete) investment period, compounded continuously.
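For readers who wish to verify the double sum in (58), the sketch below (illustrative, with hypothetical function names) evaluates it exactly and compares it against an independent calculation: expanding the polynomial (1 + c/2 - c^2/2)^t by repeated multiplication and integrating it term by term over [0, 1]. It also reproduces the quoted growth-rate gap log 2 - log(9/8).

```python
from fractions import Fraction
from math import factorial, log


def multinomial(t, k1, k2):
    """Multinomial coefficient t! / (k1! k2! (t - k1 - k2)!)."""
    return factorial(t) // (factorial(k1) * factorial(k2) * factorial(t - k1 - k2))


def universal_1linear_wealth(t):
    """Double sum on the right-hand side of (58), under the uniform prior g = 1."""
    total = Fraction(0)
    for k1 in range(t + 1):
        for k2 in range(t - k1 + 1):
            total += Fraction(multinomial(t, k1, k2) * (-1) ** k2,
                              2 ** (k1 + k2) * (k1 + 2 * k2 + 1))
    return total


def wealth_by_polynomial_expansion(t):
    """Integrate ((1 + c)(1 - c/2))^t = (1 + c/2 - c^2/2)^t over [0, 1] coefficient by coefficient."""
    coeffs = [Fraction(1)]
    factor = [Fraction(1), Fraction(1, 2), Fraction(-1, 2)]
    for _ in range(t):
        product = [Fraction(0)] * (len(coeffs) + 2)
        for i, a in enumerate(coeffs):
            for j, b in enumerate(factor):
                product[i + j] += a * b
        coeffs = product
    return sum(c / (power + 1) for power, c in enumerate(coeffs))


# Asymptotic growth-rate gap between the bilinear and 1-linear benchmarks.
EXCESS_GROWTH = log(2) - log(9 / 8)
```

Since the integrand never exceeds (9/8)^t, the value returned by `universal_1linear_wealth(t)` always lies below the hindsight wealth of the best constant-rebalanced portfolio.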

For the sake of visualization, Figure 2 plots the bankroll of the universal bilinear portfolio in comparison to that of the universal 1-linear portfolio and the wealth achieved by a perfect trader. The lower panel illustrates the parameter learning that obtains from the performance-weighted average of all bilinear trading strategies.

4. Summary and Conclusions

In this note, we constructed a neat application and extension of the brilliantly lucid Ordentlich-Cover theory of “universal portfolios”. The original (1-linear) universal portfolios guarantee to achieve a high percentage of the final wealth that would have accrued to the best constant-rebalanced portfolio in hindsight for the actual (realized) sequence of asset prices.

The constant-rebalanced portfolios constitute a very simple parametric family of active trading strategies, where the “activity” amounts to continuously executing rebalancing trades so as to restore the portfolio to a given target allocation. Inspired by the fact that a constant-rebalanced portfolio is a (horizon-1) trading strategy whose capital growth factor in any given period is a linear function of the market’s gross return vector, we decided to consider the wider class of bilinear trading strategies (or bilinear portfolios), which are mini 2-period active strategies whose capital growth factors are linear separately in the two gross return vectors.

Accordingly, we hit upon the more powerful benchmark of the best bilinear trading strategy in hindsight for the actual sequence of asset prices. This led us to apply Cover’s ingenious (Cover 1991) performance-weighted averaging technique to this new situation, i.e., the universal bilinear portfolio is a performance-weighted average of all possible bilinear trading strategies.

Applying Cover and Ordentlich’s elegant (Cover and Ordentlich 1996) methodology, we showed that for any financial market with m assets13, at worst, the percentage of hindsight-optimized wealth achieved by the universal bilinear portfolio will tend to zero like the quantity

T^{- (m^{2} - 1)}

as

T \to \infty

, where T denotes the number of complete (bipartite) investment periods. Consequently, the universal bilinear portfolio succeeds in matching the performance of the best bilinear trading strategy in hindsight to “first order in the exponent,” i.e., the excess continuously-compounded per-period capital growth rate of the best bilinear trading strategy in hindsight converges (uniformly) to zero, regardless of the individual sequence of asset prices.

Thus, we showed that the universal bilinear portfolio asymptotically dominates the universal 1-linear portfolio in the same technical sense that the universal 1-linear portfolio asymptotically dominates all constant-rebalanced portfolios and all buy-and-hold strategies. The universal bilinear portfolio will beat the universal 1-linear portfolio by an exponential factor, provided that the individual sequence of asset prices enjoys the property that the best bilinear trading strategy in hindsight achieves an asymptotic growth rate that is strictly greater than that of the best constant-rebalanced portfolio in hindsight.

Analogously, we can get carried away and define the concept of a trilinear trading strategy

B : = {(b_{i j k})}_{i, j, k = 1}^{m}

, whose (horizon-3) capital growth factor in any (tripartite) period t is equal to the trilinear form

{〈 x_{t}, y_{t}, z_{t} 〉}_{B} : = \sum_{i = 1}^{m} \sum_{j = 1}^{m} \sum_{k = 1}^{m} b_{i j k} x_{t i} y_{t j} z_{t k},

(59)

where

b_{i j k} \geq 0

and

\sum_{i = 1}^{m} \sum_{j = 1}^{m} \sum_{k = 1}^{m} b_{i j k} = 1

. This leads to a universal trilinear portfolio whose worst-case competitive ratio behaves like

T^{- (m^{3} - 1)}

as

T \to \infty

. In general, an H-linear trading strategy (cf. with Garivaltis (2018b)) divides each period t into H sub-periods, wherein the gross return vectors are denoted

(x_{t}^{1}, x_{t}^{2}, . . ., x_{t}^{h}, . . ., x_{t}^{H}) = {(x_{t}^{h})}_{h = 1}^{H}

. Intra-period capital growth is now generated by the H-linear form (cf. with Serge Lang (1987))

{〈 x_{t}^{1}, . . ., x_{t}^{H} 〉}_{B} : = \sum_{(i_{1}, . . ., i_{H}) \in \{1, . . ., m\}^{H}} \{B (i_{1}, . . ., i_{H}) \prod_{h = 1}^{H} x_{t i_{h}}^{h}\},

(60)

where

B (i_{1}, . . ., i_{H}) \geq 0

and

\sum_{(i_{1}, . . ., i_{H}) \in \{1, . . ., m\}^{H}} B (i_{1}, . . ., i_{H}) = 1

; the attendant universal H-linear portfolio asymptotically achieves, at worst, the fraction

T^{- (m^{H} - 1)}

of the final wealth of the best H-linear trading strategy in hindsight.
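The H-linear form (60) is straightforward to evaluate directly. The Python sketch below is an illustration under stated assumptions: the strategy B is stored as a dictionary from 0-based index tuples to weights (a representation chosen here for convenience), and the helper `crp_as_h_linear` is a hypothetical name for the product-form tensor induced by a constant-rebalanced portfolio.

```python
from itertools import product
from math import prod


def h_linear_form(B, returns):
    """Evaluate the H-linear form (60).

    B maps 0-based index tuples (i_1, ..., i_H) to non-negative weights summing to one;
    returns is a list of H gross-return vectors, one per sub-period.
    """
    H, m = len(returns), len(returns[0])
    total = 0.0
    for idx in product(range(m), repeat=H):
        weight = B.get(idx, 0.0)
        if weight:
            total += weight * prod(returns[h][idx[h]] for h in range(H))
    return total


def crp_as_h_linear(c, H):
    """H-linear strategy obtained by holding the constant-rebalanced portfolio c in every sub-period."""
    m = len(c)
    return {idx: prod(c[i] for i in idx) for idx in product(range(m), repeat=H)}
```

With H = 2, the returns x = (2, 1) and y = (1/2, 1) of the motivating example, and all weight on the index pair (0, 1), the form evaluates to 2; the portfolio c = (1/2, 1/2) maps to the matrix with all entries 1/4 and yields 9/8.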

Hence, one can use this method to construct an endless hierarchy of ever more dominant universal portfolios. If the horizon

H_{2}

is an integer multiple of the horizon

H_{1}

, say

H_{2} : = q \cdot H_{1}

, then the act of repeating a given

H_{1}

-linear portfolio B for q times in succession constitutes a special type of

H_{2}

-linear portfolio; the universal

H_{2}

-linear portfolio thereby asymptotically outperforms the universal

H_{1}

-linear portfolio “to first order in the exponent,” à la Cover.
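One way to make the embedding concrete (an interpretation offered here, not a construction spelled out in the text) is to represent the q-fold repetition of an H-linear strategy B as its q-fold tensor power, whose weight on a concatenated index tuple is the product of the weights of the q blocks. The sketch below uses hypothetical function names and a dictionary representation of B; it lets one check that the resulting growth factor equals the product of the q block-wise growth factors.

```python
from itertools import product
from math import prod


def multilinear_form(B, returns):
    """Sum of B[idx] * prod_h returns[h][idx[h]] over the index tuples stored in B."""
    return sum(w * prod(r[i] for r, i in zip(returns, idx)) for idx, w in B.items())


def repeat_strategy(B, q):
    """Tensor power: the (q*H)-linear strategy that applies the H-linear strategy B q times in a row."""
    repeated = {}
    for blocks in product(B.items(), repeat=q):
        # Concatenate the q index tuples and multiply their weights.
        idx = tuple(i for block_idx, _ in blocks for i in block_idx)
        repeated[idx] = prod(w for _, w in blocks)
    return repeated
```

Because the repeated strategy has non-negative weights that again sum to one, it is a legitimate member of the larger family, so the best strategy in hindsight over the longer horizon can do no worse than the best repeated one.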

Disclosures

This paper is solely the work of the author, who declares that he has no conflicts of interest; the work was funded entirely through his regular academic appointment at Northern Illinois University.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares that he has no conflicts of interest.

References

Bell, R. M., and T. M. Cover. 1980. Competitive Optimality of Logarithmic Investment. Mathematics of Operations Research 5: 161–66.
Bell, R. M., and T. M. Cover. 1988. Game-Theoretic Optimal Portfolios. Management Science 34: 724–33.
Black, Fischer, and Myron Scholes. 1973. The pricing of options and corporate liabilities. Journal of Political Economy 81: 637–54.
Breiman, Leo. 1961. Optimal Gambling Systems for Favorable Games. Paper presented at the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, June 20–July 30; vol. 1, pp. 63–68.
Cover, Thomas M. 1991. Universal Portfolios. Mathematical Finance 1: 1–29.
Cover, Thomas M., and David H. Gluss. 1986. Empirical Bayes Stock Market Portfolios. Advances in Applied Mathematics 7: 170–81.
Cover, Thomas M., and Erik Ordentlich. 1996. Universal Portfolios With Side Information. IEEE Transactions on Information Theory 42: 348–63.
Cover, Thomas M., and Joy A. Thomas. 2006. Elements of Information Theory. Hoboken: John Wiley & Sons.
Curatola, Giuliano. 2019. Portfolio Choice of Large Investors Who Interact Strategically. Working Paper. Siena: University of Siena.
Garivaltis, Alex. 2018a. Game-Theoretic Optimal Portfolios in Continuous Time. Economic Theory Bulletin 7: 235–43.
Garivaltis, Alex. 2018b. Multilinear Superhedging of Lookback Options. Working Paper. DeKalb: Northern Illinois University.
Garivaltis, Alex. 2019a. Game-Theoretic Optimal Portfolios for Jump Diffusions. Games 10: 8.
Garivaltis, Alex. 2019b. Exact Replication of the Best Rebalancing Rule in Hindsight. The Journal of Derivatives 26: 35–53.
Kelly, John L. 1956. A New Interpretation of Information Rate. The Bell System Technical Journal 35: 917–26.
Lang, Serge. 1987. Linear Algebra. New York: Springer.
Latané, Henry Allen. 1959. Criteria for Choice Among Risky Ventures. Journal of Political Economy 67: 144–55.
MacLean, Leonard C., Edward O. Thorp, and William T. Ziemba. 2011. The Kelly Capital Growth Investment Criterion: Theory and Practice. Hackensack: World Scientific Publishing Company.
Ordentlich, Erik, and Thomas M. Cover. 1998. The Cost of Achieving the Best Portfolio in Hindsight. Mathematics of Operations Research 23: 960–82.
Samuelson, Paul A. 1963. Risk and Uncertainty: A Fallacy of Large Numbers. Scientia 6: 153–58.
Samuelson, Paul A. 1969. Lifetime Portfolio Selection by Dynamic Stochastic Programming. Review of Economics and Statistics 51: 239–46.
Samuelson, Paul A. 1979. Why We Should not Make Mean Log of Wealth Big Though Years to Act are Long. Journal of Banking and Finance 2: 305–7.
Tal, O., and T. D. Tran. 2020. Adaptive Bet-Hedging Revisited: Considerations of Risk and Time Horizon. Bulletin of Mathematical Biology 82: 1–32.
Thorp, Edward O. 1969. Optimal Gambling Systems for Favorable Games. Revue de l’Institut International de Statistique 37: 273–93.
Widder, David Vernon. 1989. Advanced Calculus. New York: Dover Publications.

1.	e.g., if $x_{i} : = 1.05$ then asset i appreciated $5 %$ in period 1; if $x_{i} : = 0.98$ , then asset i lost $2 %$ of its value in period 1, etc.
2.	Bilinearity (cf. with Serge Lang (1987)) refers to the fact that the capital growth factor $x^{'} B y$ is linear separately in each of the vectors x and y. When viewed jointly as a function of $(x, y)$ , the bilinear form $x^{'} B y$ is a homogeneous quadratic polynomial in the $2 m$ variables $x_{1}, . . ., x_{m}, y_{1}, . . ., y_{m}$ .
3.	On account of allocation drift, e.g., the fact that some constituent assets will outperform the portfolio each period (and some assets will underperform), a CRP must generally trade each period so as to restore the target allocation $c : = {(c_{1}, . . ., c_{m})}^{'}$ .
4.	Literally, an extreme point of $B$ .
5.	The initial monetary deposit into B is equal to the empty product $W_{B} (x^{0}, y^{0}) : = \$1$ .
6.	By the way, if a discrete-time payoff $D (x^{t}, y^{t}) = \hat{W} (x^{t}, y^{t})$ can be exactly replicated (or hedged) by some causal (non-anticipating) trading strategy, then that strategy is necessarily unique. We have encountered this phenomenon already vis-à-vis the bilinear payoff $x^{'} B y$ .
7.	Not just almost everywhere; but everywhere, for all possible $ω \in {({(R_{+}^{m} - \{0\})}^{2})}^{N}$ .
8.	Come what may—for all possible market behavior $(x^{T}, y^{T})$ .
9.	This identity follows by direct evaluation of the iterated integral (33). In order to accomplish this, one must repeatedly invoke the special case $k : = 2$ , e.g., $\int_{z = 0}^{1} z^{α} {(1 - z)}^{β} d z = Γ (α + 1) Γ (β + 1) / Γ (α + β + 2),$ which is the beta function, or Euler integral of the first kind (cf. with David Widder (1989)).
10.	That is, per complete investment period (both halves).
11.	The practitioner of the universal bilinear portfolio must hope against hope that the individual return sequence $ω : = {(x_{t}, y_{t})}_{t = 1}^{\infty}$ has this pleasant feature.
12.	Here, we have used the uniform prior density $g (c) \equiv 1$ over the unit interval $[0, 1]$ .
13.	One of which can be cash, or a risk-free bond.

Figure 1. Geometric depiction of the set

B

of all possible bilinear trading strategies

B : = {[b_{i j}]}_{2 \times 2}

over two assets. The defining relations are

B \geq 0; b_{11} + b_{12} + b_{21} \leq 1; and b_{22} : = 1 - b_{11} - b_{12} - b_{21} .

The volume of this tetrahedron is

1 / 6

.

Figure 2. Superior performance of the universal bilinear portfolio against the individual return sequence

x_{t} : \equiv {(2, 1)}^{'}

and

y_{t} : \equiv {(0.5, 1)}^{'}

. Asset 2 is cash (that pays no interest); asset 1 is a “hot stock” that doubles in the first half of each investment period and loses

50 %

of its value in the latter half of each investment period. Note that in the bottom plot, we have

\lim_{t \to \infty} {\hat{b}}_{12} (x^{t}, y^{t}) = 1

and

{\hat{b}}_{11} (x^{t}, y^{t}) \equiv {\hat{b}}_{22} (x^{t}, y^{t}) \sim 2 / t \to 0

.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
