1. Introduction
Hedging a bond-like obligation with a bond portfolio is a classical topic in finance, first presented in [
1]. We only give several references here to sketch the research context. Fisher and Weil [
2] introduced the duration based immunization against parallel shifts in the term structure of interest rates. Cooper [
3] proposed to use a parametric function to estimate the term structure and considered partial derivatives with respect to function parameters; however, his choice of parametrization was not particularly helpful. Other authors [
4,
5,
6] used the Nelson–Siegel parametric model [
7] to estimate the term structure and considered partial derivatives with respect to Nelson–Siegel parameters. Almeida and Lund [
8] considered a richer parametric model for the term structure. Barber and Copper [
9] estimated the dominant term structure change patterns via principal component analysis for the purpose of bond portfolio immunization.
Another branch of research, which is non-parametric by nature, started with Fong and Vasicek [
10] and was continued by Nawalkha et al. [
11,
12]. They considered a Taylor approximation to the change in the portfolio price in terms of the interest rate term structure changes.
Research in this area has been concentrated on testing and comparing various methods in various settings since then.
In contrast to developed markets where it is customary to estimate interest rate term structure from derivative instruments such as interest rate swaps (IRS) or overnight indexed swaps (OIS), in many developing markets, these instruments are unavailable or relatively illiquid. This leaves government bonds as the only viable source of interest rate term structure information. Furthermore, even in developed markets, bonds can have much longer maturities than swaps. Finally, when the obligation is itself bond-like, it makes sense to price it using the bond-implied term structure to avoid the necessity to estimate the basis between swap and bond zero-coupon rates.
This makes the term structure endogenous and the problem of hedging an obligation with bonds different from what was studied in the literature.
To overcome this problem, we adopt a different approach. We assume that the interest rate term structure for pricing the portfolio is estimated from prices of a set of benchmark bonds via a nonparametric term structure estimation method with smoothing, i.e., by solving some variational problem. We derive the necessary hedging equations from this variational problem formulation.
To gain some intuition about the result, we also investigate some particular cases, in which we recover existing hedging approaches like the conventional duration based and convexity based hedging and the key rate duration approach introduced by Ho [
13].
The remainder of the paper is structured as follows. We present the problem and its mathematical formulation in
Section 2. Then, we provide the solution in
Section 3.
Section 4 shows that several conventional immunization methods (duration based, convexity based, and key rate durations) can be considered special cases of our general approach.
Section 5 concludes.
2. The Hedging Problem
We consider the following hedging problem. An obligation to be hedged is described by its cash flow amounts row vector corresponding to the cash flow moments vector , i.e., the cash flow at time is . The obligation is supposed to be regularly repriced by marking to model, by using a pricing operator , where is the estimated market term structure of interest rates to be defined later, i.e., a vector of market zero-coupon rates for the corresponding terms to maturity t. We consider a relatively short term hedge, until the next repricing. For simplicity, ignore the time related features of investing in a bond portfolio such as changes in durations, receiving cash coupon payments and bond redemptions, etc. These only matter for a long term hedge.
The pricing operator is assumed to be linear in
to allow hedging. This assumption is shown to be not restrictive even for risky coupon bearing bonds [
14]. Thus, the pricing operator can be represented as:
where
is the vector of discount factors for the corresponding terms to maturity
t. We assume that
is a function of
and
only; however, the exact formula depends on the interest rate compounding convention chosen. In a more general framework, the discount factor could also be a function of credit quality, liquidity, eligibility for repo operations with the centralized counterparty, etc. However, all these parameters can be assumed constant in the short run and thus not sources of additional risks. For a long term hedge, they could be important.
Example 1. Assuming continuously compounded interest rates, we get
As for the term structure r, we assume that it is estimated from a set of K benchmark risk-free (e.g., government) bonds with the cash flow matrix , where is the cash flow for bond k at term . Note that the vector of payment terms t can be considered common for the obligation and all the benchmark bonds. We can always introduce zero cash flows where necessary without any loss of generality.
The estimation procedure is assumed to be nonparametric, fitting the benchmark bond model prices
to their observed market prices
while requiring some sort of smoothness as follows:
where
is a function to be estimated belonging to some functional space
H, e.g., Sobolev–Hilbert space
,
E is a linear functional from
H to some
, and
is the smoothing (regularization) parameter. The most popular nonparametric methods of estimating the term structure of interest rates assume problems of this form [
15,
16,
17,
18].
Example 2. Two common choices for E are the differentiation operator D (with H being ) and (with ); however, more elaborate choices might be appropriate in developing markets, as discussed in [19]. It is well known that D yields a piecewise linear term structure, while yields a smoothing cubic spline as solutions of (2). A hedging portfolio is chosen from among the same set of benchmark bonds; let be the corresponding investment amounts. To simplify the exposition, we assume no trading restrictions.
The hedging problem can be formulated as follows. Given
, find
w so that the sensitivities of the hedged obligation price to price changes should be zero, which corresponds to minimizing the risk associated with the zero-coupon yield curve changes:
Here and in what follows, we use the notation for the transpose of A, for the Moore–Penrose inverse of A, for the null space of A, for the range of A, for the orthogonal complement of the subspace V, and for the orthogonal projection of A onto the subspace V.
3. Results
We now present the main result regarding the solution of the hedging problem (
3). It gives the explicit expression for the hedging coefficients
w.
Theorem 1. The hedging coefficients w can be found via weighted least squares regression by regressing the vector of the obligation point sensitivities onto the corresponding point sensitivities of the benchmark bonds :with the error covariance matrix of ε equal to , where: Alternatively, the hedging coefficients w solve where columns of the matrix are the elementary term structure perturbations against which we hedge.
Note that the covariance matrix
is not singular if the solution of (
2) is unique; this will be proven later. We cannot guarantee that this is indeed the case as it depends on the choice of the regularization functional
; but since the overall idea of regularization includes guaranteeing the uniqueness of the solution, this can be safely assumed. We also assume that
F has full rank, which means that benchmark bond payment schedules are linearly independent.
Example 3. For , it is always not singular because only consists of constant functions , and for constant , where is a diagonal matrix. For the continuously compounded interest rates, is given by .
Proof of Theorem 1. Note that even though the problem (
2) is infinite-dimensional, the first term only depends on the finite-dimensional projection of
, namely on the values
. Therefore, we can consider the following hierarchical optimization problem:
which can be written in a more compact way as:
where
r is a vector of zero-coupon rates at corresponding terms
t, and
□
Example 4. When , the optimal f is actually a piecewise linear function connecting the known values at points ; therefore, . Similar reasoning applies for higher derivatives with also being a quadratic form in r.
A first order condition for an extremum in (
7) is:
Now, differentiate this with respect to
P:
where:
Therefore,
for
.
Let us prove that
is not singular, i.e.,
. Assume that this is not the case and there is a nonzero
. Note that
would mean that the regularization term
is invariant to the perturbation by
f:
for any
r. Furthermore,
would mean that
for every
i for which there is nonzero cash flow for at least one benchmark bond (this easily follows from the structure of
A), which in turn implies that
. Thus, adding
f to the optimal solution
of (
7) would not change the functional as
and
. Therefore, the existence of such
f immediately implies the non-uniqueness of the solution of (
7) or equivalently of the solution of (
2), which we assumed to be unique.
In what follows, we denote the matrix
as
M or
if the dependence on
is important. Using the linearity of the pricing operator, we can rewrite (
3) as:
or in terms of
M,
In what follows, we assume that F has full row rank, i.e., there are no bonds with linearly dependent payoff schedules. This is always the case in practice. Then, is invertible because has full rank, , where is diagonal and not singular regardless of the chosen discounting convention.
Therefore, we can solve for
w:
where
.
The expression (
15) is exactly the weighted least squares estimator for the linear regression:
with the error covariance matrix of
given by
[
20].
Note that the elements of
and
B can be thought of as point sensitivities of the corresponding bond prices with respect to the spot rate
for the respective term to maturity
:
Thus, the hedging coefficients can be found by regressing the vector of the obligation point sensitivities onto the corresponding point sensitivities of the benchmark bonds B. The error covariance matrix depends on both the set of benchmark bonds and the smoothing functional.
As we show in
Section 4, this result can be seen as a generalization of duration based, convexity based, and key rate duration hedging.
4. Special Cases
To develop some intuition about the solution, we now consider two special cases, as the amount of smoothing goes to zero and to infinity. We show that in these special cases, we can recover the conventional duration based and convexity based immunization approaches, as well as key rate duration hedging.
4.1. Infinite Smoothing:
First, let us show that with a suitable choice of regularization functional , we can recover conventional duration based and convexity based approaches.
Theorem 2. Let be the vector of hedging coefficients for the smoothing parameter α. Then:where satisfies: Example 5. If is the first derivative operator, then the hedging coefficients provide a duration based hedge assuming continuous compounding, i.e.,where:is the dollar Fisher–Weil duration of the obligation and is the row vector of the corresponding benchmark bond dollar Fisher–Weil durations. Example 6. If is the second derivative operator, then the hedging coefficients provide a hedge based on duration and convexity assuming continuous compounding, i.e.,where in addition:is the dollar convexity of the obligation and C is the row vector of the corresponding bond dollar convexities. Proof. It is known [
21] that for any matrices
M and
L of suitable dimensions,
where
is the orthogonal projector onto
. However, in the expression for
(
12), we have an additional convexity term
A inside the brackets, which is symmetric, but not nonnegative definite; thus, a separate proof is in order.
Factor
onto
:
Now, let
, where
,
and
. Therefore, by Theorem 2 of [
22],
where
and
. Note that
and
, both because
J has full column rank and thus
. Therefore,
and:
Since , has full rank as a mapping and is zero otherwise. Therefore, .
Finally, (
15) can be written as
, where
. For
, we have:
To observe the particular cases, note that for , , where 1 is a vector of ones and . Therefore, the condition reduces to .
In terms of our regression, this means that:
Now,
exactly equal to the dollar Fisher–Weil duration of the obligation, while by the same logic,
is the row vector of dollar Fisher–Weil durations of the benchmark bonds. The condition (
29) thus requires us to equate the dollar Fisher–Weil durations of the hedge and the obligation.
For
, we similarly have
. Thus, in addition to (
30), we have:
where
is the dollar Fisher–Weil convexity of the obligation, while by the same logic,
is the row vector of dollar Fisher–Weil convexities of the benchmark bonds times
. The condition (
29) thus requires us to equate both dollar Fisher–Weil durations and convexities of the hedge and the obligation. □
4.2. Virtually No Smoothing:
Now, we establish the limit of the hedging portfolio composition as , which corresponds to estimating the term structure with non-smoothing splines.
Proposition 1. As , , where is a constant diagonal matrix.
Proof. From (
8) and from the fact that
B has full column rank, we have
. First note that it follows from (
2) that
as
, where:
Then:
because the convergence can easily be shown to be uniform with respect to the vector
r. □
Thus, as .
Theorem 3. As , the optimal hedging coefficients , where: Proof. The following could be perceived as overly complicated. If we could assert that
is not only symmetric, but also non-negative definite (and therefore could be represented as
), a much simpler proof would follow right away along the lines of
Section 4.3.
To deal with the general case, we apply the same matrix inversion approach to
, but with:
Note that as
,
,
, and
since
U does not change its rank in the limit.
where
and
. One can easily see that
, and thus,
. We now demonstrate that the limit of
exists.
Let us calculate
as
. First of all, note that since
B has full rank,
where
.
where:
Now, plug this into
, and using
:
4.3. Key Rate Durations
This result requires more simplifying assumptions and is itself rather simple. However, it is illustrative to see where the industry best practice approach of matching key rate durations is placed within our framework, to understand where it is not.
Proposition 2. Let the model term structure be estimated from known key rates at predefined key terms to maturity via linear interpolation and constant extrapolation. Then, the hedging weights satisfy:where the columns of the matrix M correspond to the key rate durations loadings, i.e., they are: equal to one for one key term to maturity ;
equal to zero for all other key terms to maturity;
are linearly interpolated for maturities between key terms;
are constantly extrapolated outside key terms to maturity.
Proof. Note that
where
M is the linear interpolation and constant extrapolation operator defined above. The hedging equation becomes:
which can in turn be simplified using
to:
□
We now present two lemmas characterizing the limit behavior of the optimal solution when partially relaxing the strict assumptions of Proposition 2.
Lemma 1. In the setup of Proposition 2, let the model term structure be determined via regularized curve fitting as follows:where D is the differentiation operator, is the projection operator:and is the vector of observed market yields at key terms to maturity . Then, as , the hedging portfolio composition approaches the portfolio chosen via the key rate duration approach.
Proof. It is easy to see that Theorem 1 still holds with
consisting only of ones and zeroes and
. In our setting, (
15) can be written as:
where:
It is known that:
where
is the
-weighted generalized pseudoinverse of
K and
. Thus,
because
.
Let
be the optimal solution to (
49). Then,
.
and
uniformly over
as
does not depend on
at all. Obviously,
; the linearly interpolated key yields, so
. Therefore,
, and
coincides with the optimal hedging weight
w from Proposition 2. □
Lemma 2. Let the term structure be estimated from:where is the vector of benchmark bond yields to maturity. Furthermore, let the benchmark bonds be coupon-bearing with coupon rates uniformly decreasing towards zero and terms to maturity . If in addition, , then the hedging portfolio composition approaches the portfolio chosen via the key rate duration approach. Proof. Theorem 1 holds for this case with
and:
First, let us calculate
B.
where:
Note that as the coupon rate of the benchmark bonds approaches zero, and all other , where is the index of the ith bond term to maturity .
Since at the same time , as the coupon rate decreases to zero, , where is the corresponding unit vector with one at the coordinate and zero everywhere else.
Using the same logic, we get that as the coupon rate of the benchmark bonds decreases to zero, , which means .
Finally, to get the joint convergence, observe that the convergences above is uniform in since it only depends on r and as . □
5. Conclusions
The de facto industry standard of immunizing a bond-like obligation with a portfolio of bonds is based on key rate durations. It is nonparametric in its nature, but relies on having an exogenous reference term structure of interest rates determined via piecewise linear interpolation from a set of observed benchmark zero-coupon rates (e.g., interest rate swap rates).
We presented a general treatment of the immunization problem in a usual setting for developing markets, when the reference term structure of interest rates is endogenous, i.e., estimated via a smoothing spline approach from the same set of bonds (usually government bonds), which is being used for hedging.
We showed that this framework was general enough to contain both the standard duration based (Example 5) and convexity based (Example 6) hedges, as well as the key rate duration approach (Proposition 2 with the subsequent lemmas) as special cases.
This could give rise to more general practical implementations of bond portfolio immunization in developing markets. Our model was limited to short term time frames (no time effects and no changes in bond payment schedule), risk-free bonds, and perfect frictionless markets, so future research is desired to introduce the effects of time, heterogeneous credit quality, liquidity, and market frictions.
The practical implications of this work for developing markets are as follows. In the relatively widely spread case of estimating the term structure via linear interpolation of benchmark bonds’ yields to maturity, the key rate duration approach could be more or less justified if the coupon rates of the benchmark bonds are small (with the key rates being equal to the benchmark bonds’ terms to maturity). However, for coupon-bearing bonds or for higher orders of smoothing, this is generally not the case, and a specially designed immunization approach as described in this paper could be more desirable.