1. Introduction
In commodity markets, typical products include directional trades such as futures and forwards, which establish an obligation to purchase or sell an underlying commodity in the future (
Clark 2014). As essential tools for managing risks from these contracts, which may consist of multiple underlying assets, there are various options contracts that provide a right to trade the underlying commodity under a specified condition. In this study, we focus on early-exercisable options on multiple underlying assets in commodity markets, i.e., multivariate Bermudan commodity options.
Solving Bermudan commodity option pricing problems with multiple underlying assets and factors is challenging because computational efforts grow exponentially in tandem with the problem dimension in general, which is determined by the number of assets and factors. However, the improvement of algorithms and the rapid growth of computational power have led to a remarkable surge of interest in computational science in recent years. Currently, a wide variety of machine learning algorithms, such as deep learning and neural networks, are successfully employed for classification, regression, clustering, or dimensionality reduction tasks and are applied for large-sized and high-dimensional data in various areas. In this study, we develop a new Bermudan commodity option algorithm via multi-layered neural networks and show its efficiency and effectiveness based on the multi-asset commodity market model with stochastic volatility, wherein significant changes in volatility may be observed according to the demand–supply relationship and geopolitical conditions in commodity markets.
An important feature of Bermudan options is that they can be exercised early, with their value being determined by whether or not they are exercised before maturity. In other words, the option holder must decide whether to continue holding the option or immediately exercise it at a prespecified period. In this situation, it is crucial to determine the continuation value, i.e., the value of holding the option until the next exercise window. Such a continuation value may be given as the discounted conditional expectation of the remaining option value on one step ahead under a risk-neutral probability measure, which generally has no explicit boundary conditions. Moreover, the conditional expectation providing the continuation value is an unknown (possibly nonlinear and complex) multivariate function whose dimensions depend on the number of underlying factors; hence, an exact (yet still approximate) computation involves high-dimensional discrete grids concerning state variables and is quite difficult to solve.
To price early-exercisable options with estimations of continuation value functions,
Longstaff and Schwartz (
2001) proposed a simple yet powerful numerical method involving a regression-based functional estimation using simulated sample paths known as the least-squares Monte Carlo (LSMC) method. Since then, several studies have examined the application of neural networks (or machine learning methods) for estimating continuation value functions in option pricing based on the LSMC method. For American options’ pricing,
Haugh and Kogan (
2004) applied a neural network with one hidden layer for valuation, whereas
Kohler et al. (
2010) proved price consistency and convergence with multiple payoff types.
Lapeyre and Lelong (
2021) gave several numerical examples of Bermudan options and proved convergence. There are additional examples, e.g., the 5000 assets rainbow option (
Becker et al. 2021) and expected exposures (
Andersson and Oosterlee 2021). Furthermore, other machine learning algorithms have been used for early-exercisable options, e.g., radial basis functions (
Ballestra and Pacelli 2013), nearest-neighbor (
Feng et al. 2013), deep learning (
Becker et al. 2020;
Liang et al. 2021), unsupervised learning (
Salvador et al. 2020), and reinforcement learning (
Li 2022), as well as the support vector machine (
Lin and Almeida 2021). Moreover, numerical approaches have also been used, including stochastic kriging metamodels (
Ludkovski 2018), high-dimensional partial differential equations (
Sirignano and Spiliopoulos 2018), and backward stochastic differential equations (
Chen and Wan 2021). Furthermore, there are other applications of neural networks in the finance field, e.g., extending the feature set (
Montesdeoca and Niranjan 2016), the calculation of implied volatilities (
Liu et al. 2019), and decision-making (
Puka et al. 2021). A comprehensive review of these methods was conducted by
Ruf and Wang (
2020).
Although this study shares the same ideas as the aforementioned studies—in particular, as in
Lapeyre and Lelong (
2021), given that a multi-layer perceptron (MLP) is applied in the neural network—it is worthwhile to mention that our study may be considered novel in several aspects: We illustrate an algorithm for estimating the continuation values of multi-asset Bermudan commodity options with stochastic volatility features, whereby a smooth activation function, such as the sigmoid function, is applied in the MLP to reflect the smoothness of conditional expectations regarding state variables. The smoothness of functions to represent conditional expectations is key in this study. In the Markovian setting, the conditional probability density functions are usually smooth given state variables; thus, conditional expectations are smooth functions. Therefore, the target continuation value function is smooth, and we can expect a better fit to the target function by using a smooth activation function in the MLP. This is in contrast with more commonly used piecewise linear functions such as the leaky ReLU function applied in the numerical experiments by
Lapeyre and Lelong (
2021), wherein the fitting function may not be smooth but only piecewise smooth. (Note that, in a similar context with the smoothness of estimated functions,
Yamada (
2012,
2017) applied the generalized additive model to calculate smooth functions for conditional expectations in multivariate hedging problems with European and Bermudan options.) Additionally, we applied more sophisticated techniques such as the resampling procedure and early stopping to improve the computational efficiency and avoid possible biases in pricing or overfitting for optimal learning (see
Section 3.2 and the numerical experiments).
While several neural network/machine learning models for option pricing exist, we believe that, for this type of research, the current methodologies, along with developed computational algorithms, need to be combined with existing techniques using the currently available computational environment. In this context, the choice of problem and methodology choice is important, as is the approach to the problem and how to perform the numerical experiments. The combination of multi-asset commodity options with stochastic volatility and recently developed neural network techniques (including the computational environment and software) is meaningful since commodity markets are largely volatile, and this volatility may change over time. Moreover, the multi-asset Bermudan commodity options with stochastic volatility, to the best of our knowledge, have not been previously considered despite the problem’s importance. It should become more challenging in numerical calculations to configure multiple underlying assets that recognize mean-reverting dynamics and solve boundary conditions with stochastic factors (see, e.g.,
Hahn and Dyer 2008 and
Ball and Roma 1994).
The present study implements a multi-layered neural network and examines its efficiency and effectiveness for multi-asset Bermudan commodity option pricing problems with stochastic volatility. First, we formulate the multi-asset commodity market with stochastic volatility, wherein individual asset price dynamics are expressed as a two-factor model by combining a well-known commodity model by
Schwartz (
1997) with Heston’s stochastic volatility model (
Heston 1993). Next, we apply MLPs in the neural network to represent the continuation value functions in Bermudan option pricing, whereby the entire neural network is trained using LSMC simulations. We perform numerical experiments to compare the continuation value function accuracy in response to changing the exercise dates, the number of layers/neurons, and the dimension of the problem. We also compare the relationship between the continuation values and network configurations.
The outline of this article is as follows.
Section 2 gives an introduction to the commodity option structure adopted in this study and the formulation of the multi-dimensional Bermudan option problem.
Section 3 describes the configuration of neural networks, a multi-dimensional asset model with stochastic volatility, and a Bermudan options pricing procedure for learning and valuation via Monte Carlo sample paths.
Section 4 presents the numerical results of the Bermudan option prices and compares the accuracy of the continuation value surfaces.
Section 5 summarizes the analysis results and discussions. Lastly,
Section 6 concludes this study.
2. Pricing Multi-Asset Bermudan Commodity Options with Stochastic Volatility
In this section, we introduce early-exercisable commodity options and formulate the problem of pricing multi-asset Bermudan commodity options with stochastic volatility.
2.1. Early-Exercisable Commodity Options
As stated earlier, in commodity markets, typical products include plain directional trades such as future and forward contracts, which establish an obligation to buy or sell a particular commodity asset at a specified price in the future (
Clark 2014). Depending on the terminal values of commodity assets, holding these contracts may lead to a loss or profit for the contractor, while a large loss is particularly undesirable for the holder; furthermore, the possibility of a large profit may be pursued. Such opportunities are realized using options contracts, giving a right to purchase or sell an underlying commodity asset with a prespecified strike price in the future.
Among the many types of options used as hedging tools in commodity markets, early-exercisable options provide additional flexibility regarding exercise timing and are considered useful for hedgers, in practice. Traditionally, such options are characterized as American options; however, in the context of exotic options, Bermudan options have similar flexibility. These options allow holders to exercise them early, although only on specific dates before maturity; thus, the option holder must decide whether to continue holding the option or immediately exercise it during the exercisable period. However, such continuation value is usually unknown because it depends on future option values on specific exercisable dates. Therefore, it is paramount to determine the continuation values for Bermudan options. The objective of this study is to evaluate computational performance (including the accuracy of continuation value estimation) for pricing multi-asset Bermudan commodity options via multi-layered neural networks.
2.2. Multi-Dimensional Bermudan Option Pricing Problem
In this subsection, we describe the multi-dimensional Bermudan option pricing problem, following
Lapeyre and Lelong (
2021). Given a complete filtered probability space (
,
,
,
) with a finite time horizon
, we assume that a set of underlying assets is modeled via a multifactored process
adapted to the filtration,
, and that
is an associated risk-neutral measure. We consider a Bermudan option with exercise dates
and a discrete-time payoff process
if exercised at times
, where
is specified as a function of
. Then, Bermudan option prices
are computed using the following recursive equation:
where
denotes the expectation under the risk-neutral probability measure
with the risk-free interest rate
and the interval between
and
as
. Furthermore, assuming that
is a multi-dimensional Markov process, there exists a measurable function
, such that:
Herein, we refer to as a continuation value function in this paper.
Note that finding the exact
is difficult; alternatively, one may identify a function
to minimize the following quantity,
over a parametrized set of functions
𝕭. If all (real-valued) square-integrable measurable functions are searched to minimize Equation (3), it turns out that the function
providing the conditional expectation in Equation (2) is achieved via an optimizer. However, there is a trade-off between the generality of a set of functions and the efficiency of computation. Additionally, computational tractability depends on the methodology to solve the optimization problem.
2.3. Multi-Asset Commodity Market Model with Stochastic Volatility
This study employs a multivariate commodity market model consisting of multiple underlying assets with stochastic volatility for the Bermudan option problem. To this end, we adopt a stochastic volatility model for the mean-reverting commodity dynamics (
Schwartz 1997) and expand it to the multi-asset case.
Consider the Bermudan option problem with
underlying assets at time
,
, the
-th price dynamics of which are governed by the following two-dimensional stochastic differential equations (SDEs):
Herein, and are correlated Brownian motions with appropriate correlation parameters, while the magnitude of the speed coefficient measures the degree of mean reversion to the long-run mean , including the market price of risk in the underlying asset price processes. The second term characterizes the -th volatility process, , where indicates a degree of mean reversion toward long-term volatility , and is the volatility of volatility.
Since each of the underlying asset price dynamics in (4) follows a two-dimensional Markov process, the state variables at time
, denoted by
, corresponding to the input features
of the MLP in the previous subsection, may be described as
The dimension in (5) depends on the number of state variables and is given by . Note that the SDEs of the underlying assets are used to generate sample paths of the LSMC method in the Bermudan option pricing problem.
3. Application of Neural Networks with MLP
When pricing Bermudan commodity options using a model with multi-dimensional factors—including the multi-asset stochastic volatility model introduced in the previous section—it is crucial to determine continuation values at each exercisable date. In order to identify a continuation value function in the multi-asset Bermudan option pricing problem with stochastic volatility, this study takes a neural network approach with MLP, similar to
Lapeyre and Lelong (
2021). First, we introduce the neural network architecture considered in this study, which generates a continuation value in Bermudan commodity options pricing. Second, we present the underlying assets model with multi-dimensional factors, which has multi-asset and stochastic volatility. Finally, we provide algorithms for learning the entire network and option pricing procedure.
3.1. Continuation Value Functions via MLPs
First, we explain the configuration of an MLP to express a general multi-dimensional function and to approximate the continuation value function in the multi-dimensional Bermudan option problem.
The basic configuration of the MLP is shown in
Figure 1, where
is an input vector and
is an output of the entire neural network. Each neuron is called a “perceptron” that defines a mapping of input/output signals with appropriate dimensions, being dependent on the number of neurons at input/output layers. For example, if
denotes an input signal of a perceptron with a weight matrix
and a bias vector
, then, an output signal
from the perceptron is given by
where
is a component-wise activation function. Typical choices of activation functions in neurons are as follows:
In the case where the MLP is applied for a regression, all the weight matrices and bias vectors in the MLP are computed to minimize the sum of squared errors between the actual dependent variable, denoted by
, and the predicted dependent variable
given the training datasets of
and
(expressed using the MLP in
Figure 1).
Note that the MLP can express a continuous and complex nonlinear surface in entire networks by sequentially performing a linear and nonlinear transformation on inputs
to the compiled layer output
. The properties of the MLP function derive from the universal approximation theorem proposed by
Cybenko (
1989) and the Kolmogorov–Arnold representation theorem put forward by
Kolmogorov (
1957) and
Arnold (
2009), in which any function can be approximated if the input size and network are infinite. In this sense, functions expressed by the MLP are generally considered suitable for a problem with complicated interactions because of the adjustable basis functions (see
Choon et al. 2008).
For the randomly generated sample paths of
, we can apply the LSMC method (see
Appendix A and
Appendix B) combined with the MLP, whereby the continuation value function is modeled at each step using a function given by the MLP. Let
and
be the simulated
sample paths of
. Since the discrete-time payoff process
(if exercised at times
) is specified as a function of
for a Bermudan option and
, the training data of the output variable,
, in the first step of the LSMC method, are computed as
, corresponding to the payoffs of the Bermudan option at maturity along the sample path of
. The MLP in the first step is constructed for the training data of
, given as
, together with those of
. Then, we obtain an approximation of the continuation value function, denoted by
, and the continuation values along the sample path,
In the second step, the training data of the output variable,
, are computed using (1), as
as well as the training data of
, given as
. Using these training datasets, the MLP is constructed to find an approximation of the continuation value function, denoted by
, and the continuation values along the sample paths,
We then repeat the same procedure until
3.2. Learning Networks and Option Pricing
For learning neural networks, we generate Monte Carlo sample paths using the SDEs in
Section 3.2 based on a similar idea to that of the ordinary LSMC method introduced by
Longstaff and Schwartz (
2001). Herein, we apply nonlinear functions of the MLPs instead of polynomial functions for the basis of the continuation value functions. Additionally, we introduce techniques such as early stopping to improve the fitted continuation functions and avoid possible overfitting or biases in learning and pricing. We also introduce the resampling procedure to avoid a possible bias caused by using the same random samples between learning and valuation, and we regenerate Monte Carlo sample paths for the valuation of Bermudan option prices (see
Appendix A for pricing details).
Herein, we summarize a learning procedure, as described in Algorithm 1 below, where the underlying price and the volatility vector are denoted by
and
, while
denotes a payoff function of
. Given simulation the sample paths generated by the multi-asset stochastic volatility models in (4), we provide a Bermudan option pricing procedure involving the algorithm for estimating the continuation values using neural networks. This algorithm operates to find MLP
as a continuation value function satisfying Equation (8) in the previous section and gives the Bermudan option price.
Algorithm 1. Bermudan option pricing with learning networks. |
Require: Initiate paths , , , |
1:Let be the patience and be the maximum number of epochs |
2:Put for all |
3:for from to do |
4: Let and for all |
5: if on exercisable periods then |
6: Perform learning on to obtain network with to be |
7: |
8: |
9: while do |
10: Train on and |
11: if improved then |
12: |
13: else |
14: |
15: end if |
16: if then |
17: Break |
18: end if |
19: |
20: end while |
21: Calculate the continuation value for all |
22: for from to do |
23: if then |
24: |
25: end if |
26: end for |
27: end if |
28:end for |
29:return mean of |
Note This study does not use the selection technique, which performs regression using only the in-the-money paths proposed by Longstaff and Schwartz (2001), for the purpose of constructing a versatile algorithm. |
It is noted that one cycle of training with the complete training data is known as an epoch and is repeated for learning purposes for each continuation value function in Algorithm 1. In general, the larger the number of epochs, the better learning of the training data. However, a large number of epochs usually requires a long computational time, even with large computer resources, and sometimes leads to overfitting of the training data. To prevent such situations, we introduce an early stopping rule for the learning procedure given a specified integer in Algorithm 1. Under the early stopping rule, the objective function (3) is monitored for improvement, and the number of iterations (i.e., the number of epochs) without improvement (compared with the previous epoch), denoted by , are counted. If this number reaches , the iteration stops and the learning procedure of the continuation value function terminates; otherwise, the iteration continues as long as the iteration index is less than , where is the maximum number of epochs specified at the beginning of Algorithm 1. Note that the introduction of the early stopping rule not only decreases the computational time but also prevents overfitting/underfitting for the MLP.
Based on the network configuration of the neural networks, the computational complexity of Algorithm 1 is given by the number of iterations for parameter estimation of the MLP. This number of iterations depends on the maximum number of epochs and the number of exercisable dates,
and
. Once these values are specified, the maximum number of iterations is
, which is the total number of epochs applied in Algorithm 1. In addition, the computational complexity of each epoch in the MLP depends on the network configuration (see
Serpen and Gao 2014).
To price the Bermudan commodity options using the continuation functions estimated in Algorithm 1, we regenerate different sample paths from those used in the learning procedure for computing the continuation values and Bermudan commodity option prices, given the neural networks in Algorithm 1, i.e., we separate the learning and the valuation procedures, and Algorithm 1 may be applied without learning (i.e., given the estimated neural networks) for the valuation procedure. The merit of this resampling is that it avoids a price bias, which results from overfitting using the same sample paths. Accordingly, this study adopts the following procedure:
Generate the sample paths of the underlying assets for the MLP learning of Algorithm 1.
Find the MLP network parameters via learning in Algorithm 1 using the sample paths in Step 1.
Given the estimated neural networks, regenerate a different set of sample paths and apply Algorithm 1 (without learning) to compute the continuation values and the initial prices of the Bermudan options.
Repeat Step 2 and calculate statistical values such as the mean and the standard deviation of the Bermudan option prices.
In the above, it is key that learning and pricing (i.e., valuation) utilize the different simulation sample paths set in Steps 1 and 3.
Figure 2 shows a flowchart of the entire procedure for learning and valuation.
4. Numerical Experiments
The objective of this section is to execute numerical experiments based on the learning and valuation procedure explained in the previous section and make comparisons of the Bermudan option pricing between the MLP and the benchmark polynomial regression (i.e., the standard (naïve) LSMC method by
Longstaff and Schwartz 2001).
4.1. Problem Setting and Preliminary Experiment
Herein, we consider Bermudan commodity options with early-exercisable dates in discretized periods until maturity , i.e., , the payoffs of which are given by when exercised. We define several settings for different dimensions of Bermudan options, (i.e., the number of state variables in (5)), exercisable dates, and payoff functions. We also introduce a constant volatility model as a one-dimensional problem to perform a preliminary experiment.
For the exercisable dates of the Bermudan commodity options, we consider two cases as depicted in
Figure 3. One is a two-period problem, wherein the Bermudan commodity option is issued at time
and can be exercised at
and maturity
The other is a case with multiple exercisable dates, wherein we choose ten exercisable timings before maturity. In both cases, the options can be exercised after a half-year period to compare the continuation value surfaces at time
between different methodologies, and the values of the options are evaluated at the initial time period,
.
Moreover, the payoff functions for Bermudan basket put options are given as
Then, an upper limit price
is introduced to the payoff function as
for Bermudan capped put options. Note that the upper limit,
, provides an additional complexity of the payoff functions.
The parameters of the underlying assets and neural networks are set as shown in
Table 1 and
Table 2 below.
4.2. Low-Dimensional Case: Single-Asset Bermudan Options with Stochastic Volatility and Constant Volatility
We begin with the simplest valuation based on
Schwartz’s (
1997) single-asset and constant volatility model in, compared with the standard LSMC method using polynomial regression and the finite difference method (FDM) detailed by
Tavella and Randall (
2000). In the MLP, we applied Algorithm 1 for learning neural networks with Monte Carlo simulations and then repeated the valuation procedure (using Algorithm 1 without learning) 100 times. The MLP in this experiment contains two hidden layers with sixty-four neurons of sigmoid activate functions. In the standard LSMC method, we use a quintic function for one-dimensional problems (i.e.,
) and a multi-dimensional quadratic function for two or more higher-dimensional problems (i.e.,
) and apply the same procedure (i.e., the learning and valuation procedure). The FDM is based on the Crank–Nicolson scheme with discretized 2000/200/50 grids in the time/asset/volatility directions.
Table 3 compares the means and standard deviations of the option prices obtained with the MLP and the standard LSMC methods. Considering the option price of the FDM as a proxy for the value of the Bermudan commodity option price, we see that both the MLP and the standard LSMC method almost achieve the Bermudan option price value, i.e., the gap between the three prices is sufficiently small for this one-dimensional problem. We implemented Algorithm 1 using the MLP and the standard LSMC on Python, using a machine learning package based on TensorFlow (
Abadi et al. 2015) and the PolynomialFeatures toolbox of the scikit-learn library (
Pedregosa et al. 2011). All our numerical experiments were run using Google Colaboratory (
Google 2022) with 36 GB of RAM and a dual-core CPU of 2.3 GHz.
When considering a single-asset stochastic model, the corresponding Bermudan commodity option problem becomes two-dimensional, i.e.,
, we can observe a slight difference between the MLP and the LSMC methods, compared with the approximate solution of the FDM, as shown in
Table 4. First, we see that there is no significant difference between the 3 cases for the Bermudan put option with exercisable dates
. However, the gap between the LSMC and FDM prices becomes slightly wider, compared with that between the MLP and FDM prices with exercisable dates
. For the Bermudan capped put option, there is a slightly larger difference vis-à-vis the FDM price for both the MLP and LSMC prices, whereas a slight improvement was achieved by using the MLP in terms of the gap from the FDM price, as illustrated in the box plots in
Figure 4.
4.3. Higher-Dimensional Case: Multi-Asset Bermudan Options with Stochastic Volatility
In the case of higher-dimensional Bermudan commodity options with multi-asset stochastic volatility (e.g., two asset problems with ), it is difficult (or unrealistic) to obtain an approximate Bermudan option price with high accuracy using the FDM. Thus, we compare option prices obtained with the MLP with the benchmark polynomials (i.e., the standard LSMC method) only, where we set , i.e., five-asset stochastic volatility, in the numerical experiments. Then, we will discuss the source of the difference in view of the continuation value functions for both methods in the next section. Additionally, we will compare the accuracy of estimated continuation values by considering a two-asset problem with and two exercisable dates .
Table 5 shows our numerical results, which compare the mean and standard deviation between the LSMC and the MLP prices for
and
, obtained via the learning and valuation procedure described in
Section 3. In the case of the Bermudan put option for
, we see that there is no significant difference between the two methods, as with the case with a low-dimensional problem,
. However, the gap between the two increased for the Bermudan capped put options and the case with
, as shown in the box plots in
Figure 5. In other words, we see that the differences between the MLP and the LSMC are emphasized by introducing additional complexity to the payoff function or by increasing exercisable dates.
Additionally, we demonstrated the same numerical experiments above but increased the number of assets in the stochastic volatility model to 10 and 20 assets, respectively (i.e.,
and
) and compared the estimated Bermudan commodity option prices between the MLP and the LSMC. To observe the effects of increasing the number of exercisable dates more clearly, we added
between
and
to obtain the estimation values shown in
Table 6 and
Table 7 (corresponding to the cases of
and
, respectively).
Table 6 shows the means, the standard deviations, and the gaps between the estimated prices for the Bermudan put options and Bermudan capped put options with
, and
Table 7 shows those with
.
Similar to the previous cases, the gap between the MLP and LMSC increases for the Bermudan put prices given a larger number of exercisable dates but decreases for the Bermudan capped put options when for both and . It is possible that the choice of regression function has a weaker effect for higher dimensional Bermudan capped put options with a larger number of exercisable dates, i.e., the continuation value functions of the capped put options may become flatter or smoother when the number of exercisable dates increases and can be fitted with less sophisticated functions. This phenomenon should be investigated in more detail in a future study.
4.4. Comparison of Continuation Value Surfaces
In the previous subsection, we observed that there are some differences between the MLP and the LSMC regarding the estimated prices and that these differences were more notable for Bermudan capped put options and/or increased exercisable dates. Herein, we discuss the possible reason for this price difference by visualizing the continuation value surfaces for both the MLP and the LSMC methods. For visualization purposes, we consider single-asset Bermudan capped put options with stochastic volatility and constant volatility, i.e., the low-dimensional cases with
and
introduced in
Section 4.2.
The left-hand plot of
Figure 6 illustrates the continuation value function estimated at
in the one-dimensional Bermudan capped-put option problem (corresponding to the single-asset constant volatility model) using the LSMC, whereas the right-hand plot shows the problem using the MLP. We first observe that the continuation value function of the MLP monotonically decreases with the underlying price, whereas the one obtained from the LSMC method is a nonmonotonic function. Since the payoff function of the Bermudan capped put option is piecewise linear and it monotonically decreases with the underlying price, the monotonicity of the continuation value function is more consistent with the payoff structure of the Bermudan capped-put option. In this sense, we see that the continuation value function of the MLP reflects the monotonic property more appropriately than that of the LSMC method in the one-dimensional single-asset problem.
For the two-dimensional case with single-asset stochastic volatility for Bermudan capped put options, the continuation value functions have three-dimensional surfaces, as shown in
Figure 7, wherein the left-hand and right-hand plots depict the continuation values with respect to volatility and underlying asset price directions for the LSMC method and the MLP, respectively. As in the one-dimensional problem, the payoff function is piecewise linear with respect to the underlying asset price direction and is flat when the underlying asset price exceeds the strike price
or is less than a certain value related to the capped rate
. Since continuation value functions are supposed to approximate the payoff function at the maturity
, given the information up to time
, their surfaces are expected to have similar shapes, i.e., continuation values are approximately zero or close to the capped rate for larger or smaller values of the underlyings, respectively. In view of this payoff structure for the Bermudan capped put options, the continuation value surface of the MLP seems to approximate the payoff function more accurately than that of the LSMC.
Remark 1. In general, the visualization of nonparametric methods provides an intuitive interpretation of the estimated functions. We have observed that “the continuation value function of the MLP monotonically decreases with the underlying price, whereas the one obtained from the LSMC method is a non-monotonic function,” and that “since the payoff function of the Bermudan capped-put option is piecewise linear and it monotonically decreases with the underlying price, the monotonicity of the continuation value function is more consistent with the payoff structure of the Bermudan capped-put option,” as stated earlier in this section. Such a visualization helps in understanding the valuation structure for the applied method in the middle of the process for Bermudan option pricing, but the effect of the approximation error may be weakened in the total procedure. However, we should be able to understand the approximation errors of estimated continuation value functions intuitively in the middle of the process from such a visualization.
4.5. Comparison of Accuracy in Continuation Values
In the previous subsection, we observed that the continuation value surfaces of the MLP may approximate the payoff functions more accurately than those of the LSMC by visualizing the continuation value surfaces. To further investigate the estimated continuation value surfaces in higher-dimension problems, we next measure the accuracy of the continuation values using a four-dimensional problem of the Bermudan capped put basket option with two exercisable dates (i.e., and ) for both the MLP and the LSMC method.
Consider a problem of estimating the continuation values at
, as shown in
Figure 8. Since Bermudan options with exercisable dates
become simple European options if not exercised at
, the continuation values at
may be estimated via European option prices expiring at
, given the state variables at
, i.e.,
. Therefore, we can calculate the accuracy of the estimated continuation values at
by measuring the differences between the estimated continuation values and the European option prices at
by specifying different state variables as input values of the European options.
We first solved the Bermudan option problem with
and
using the same parameter settings as those in the previous subsections by applying the MLP and the LSMC methods; subsequently, we calculated the continuation values at
, given the state variables specified in
Table 8. Note that
Table 8 defines the discretized domain of the state variables, wherein each state variable is discretized in the interval between the minimum and maximum values so that the number of grid points in 1 dimension becomes
. Then, the total number of grid points is given by 83,521 (
). Similarly, we applied the standard Monte Carlo simulation to compute the European option price on each grid and repeated this procedure 83,521 times to estimate the surface of the European option prices. This surface of the European option prices may be considered to provide an approximation of the theoretical continuation value (see notes in
Table 8), and we can measure the accuracy of the continuation values using the difference between the estimated continuation values with the MLP and the LSMC methods and the simulation-based (theoretical) surface.
Additionally, we can change the number of hidden layers/neurons and the type of activation function in the MLP to verify their effects on its accuracy. In this study, we evaluate the size of accuracy in terms of the following normalized root-mean-squared error (NRMSE) for each methodology:
where
is the total number of grid points (i.e.,
), and
and
are the
-continuation value and the corresponding European option price on the same grid point. In Equation (11), the root-mean-squared error is normalized by the difference between
and
, which are the minimum and maximum values of European option prices over the entire grid.
We computed the NRMSEs for different settings of neural networks in the case of the MLP, as shown in
Table 9, wherein we changed the number of hidden layers/neurons and applied two types of activation functions, i.e., the ReLU and sigmoid functions. Note that the NRMSE with the LSMC method is also computed, as shown in the bottom row of the table, while
Figure 9 compares the same NRMSE with respect to a different number of neurons for
and
using bar graphs. In
Table 9, we first observe that the MLP almost always provided better accuracy in terms of NRMSEs compared with the LSMC. Second, when comparing the types of activation functions, the MLP with the sigmoid function was always better than the MLP with the ReLU function for estimating continuation value surfaces when the number of hidden layers/neurons was fixed. This may be explained by the smoothness of the sigmoid function; the continuation value functions are expected to be smooth with respect to the state variables and can be fitted via smooth functions (e.g., the sigmoid function) better than non-smooth functions, such as the ReLU function. Third, an increase in the number of hidden layers is effective for a few hidden layers but does not necessarily improve the NRMSE when the number of hidden layers is three or larger for both MLPs with the ReLU and sigmoid functions. However, in any case, we obtained better NRMSEs by using the MLP with the sigmoid function.
5. Discussion of Robustness and Computational Costs
In this section, we discuss the robustness and computational costs of our experiment for the pricing algorithm of multi-asset Bermudan commodity options using the MLP.
First, we discuss the learning adaptability of the MLP applied to our multi-asset Bermudan commodity option problems with stochastic volatility.
Figure 10 depicts changes in the mean and standard deviation of learning rates with respect to the number of epochs in the MLP. In this figure, we see that training and validation losses remain close, which indicates no overfitting. Furthermore, the learning rate decreases rapidly until the number of epochs is 10 and stays at sufficiently good levels thereafter.
Next, we estimated the computational costs of the learning and valuation procedure when the problem dimension is increased.
Table 10 compares the computational costs with respect to the dimensions of
, wherein the same numerical experiment as that of the previous section was repeated 100 times and computed the mean and standard deviation of the computational time for each algorithm. Furthermore, the average numbers of epochs in learning are also computed in the MLP. Although the LSMC generally performs much better in terms of computational costs when the dimension is particularly low, its computational time grows exponentially in tandem with the size of the dimension. This is because when using a polynomial regression in the LSMC, the number of terms in the polynomial function increases combinatorially with the number of variables, even though its maximum order is fixed.
In contrast with the LSMC, the computational cost in the MLP is mostly unaffected by the size of the dimension but is directly proportional to the average number of epochs in learning. The computational cost of learning depends on how often the networks are updated during training, but the computation cost per one cycle of training data (i.e., epoch) remains the same when the size of the network is fixed. In the numerical experiments, we applied two hidden layers with sixty-four neurons using sigmoid activation functions, whereby the computational cost per epoch remained almost the same regardless of dimensions; the computational time in learning is determined by the total number of epochs. Although the computational cost per epoch slightly increases as the number of features in the input layer increases by dimension, the average computational time decreases even for large dimensions with the reduction in the average number of epochs due to the early stopping rule. This is the benefit of introducing the early stopping rule in Algorithm 1, which is particularly effective for higher-dimensional problems to avoid unnecessarily increasing training iterations (and overfitting). Note that the average computational time for both learning and valuation became smaller for the MLP than the LSMC when .
6. Conclusions
In this study, we detailed the use of a neural network for pricing multi-asset Bermudan option problems with stochastic volatility in commodity markets and illustrated its effectiveness. First, we employed the MLP to estimate continuation values in the multi-dimensional Bermudan commodity option problem, whereby we formulated the multi-asset stochastic volatility model and generated Monte Carlo simulation sample paths for learning continuation value functions using the MLP. Then, in the applied algorithm, we introduced early stopping into the learning of the MLP to avoid unnecessarily increasing training iterations and overfitting. The early stopping rule was activated by counting the number of epochs without improvement compared with the previous epoch. We also introduced a resampling process, and a valuation procedure was applied for the estimated neural networks by generating a different set of simulation sample paths. We executed numerical experiments to evaluate the accuracy of the continuation values and the initial price of the option using different settings of networks, problem dimension, and exercisable dates, whereby two types of payoff functions for Bermudan commodity options were considered, namely Bermudan put options and Bermudan capped put options. From our numerical analysis, we clarified the following observations:
No significant difference was observed between the MLP and the standard LSMC method when solving Bermudan put option problems with a few exercisable dates. However, there was a slight difference for the Bermudan capped put options; this difference was emphasized when the number of excisable dates increased. A similar tendency was observed for higher-dimensional cases, but the gap narrowed between the mean values of the two methods for Bermudan capped put options, as shown in our additional numerical experiments.
While it turned out that the MLP was not much better than the standard LSMC from the numerical experiments in
Section 4.3 for high-dimensional cases, it is meaningful to show how the accuracy and computational time can be achieved using the current computational resources. In addition, we expect that the MLP has the potential to achieve much better accuracy due to its generality and flexibility. Moreover, if computational power is increased, the MLP should become more efficient since computational effort grows slower than that of polynomial regressions in the standard LSMC for higher-dimensional problems, as illustrated in the numerical experiments in
Section 5.
From the perspective that the continuation value function is expected to approximate the payoff function (given state variables) one step before maturity, the shape of the continuation values from the MLP reflected the structure of payoff functions more accurately than the LSMC method.
Based on the fact that the continuation values of Bermudan options one step before maturity can be computed as European option prices, we measured the accuracy of the estimated continuation values and examined the effects of different network configurations in the MLP, changing the number of hidden layers/neurons and the choice of activation functions. We observed that the MLP almost always provided better accuracy in terms of NRMSEs compared with the LSMC; furthermore, when comparing the types of activation functions, the MLP with the sigmoid function was always better than the MLP with the ReLU function for estimating continuation value surfaces. An increase in the number of hidden layers was effective for a few network layers but did not necessarily improve the accuracy when the number of hidden layers was three or larger.
We computed the learning rate by epochs to show the learning adaptability of our proposed algorithm using the MLP, which indicated no overfitting and achieved sufficiently good levels of learning rates at approximately 10 epochs. Additionally, we showed that although the LSMC generally performs significantly better in terms of computational costs when the dimension is particularly low, its computational time grows exponentially with the size of the dimension due to the combinatorial characterization with respect to the number of terms in the polynomial functions. Conversely, the computational costs of the MLP were mostly unaffected by the size of the dimension or even decreased for large dimensions due to the introduction of the early stopping rule.
Essentially, the use of neural networks for option pricing has the advantage of recognizing sizable input features and generating flexible output features in a unified framework. Nevertheless, there are some drawbacks: high computational effort and resources are required for learning networks, especially for exotic options, including the Bermudan commodity options considered in this study. Additionally, it is necessary to frequently re-learn the networks in response to market conditions. However, we observed that the neural network approach using the MLP reached an appropriate level of learning rates at around 10 epochs, even in high-dimensional cases, as illustrated in our numerical experiments. Therefore, this approach is expected to reduce learning costs if network configurations are developed appropriately.
Although this study chose a relatively simple structure of multi-layered networks, there are other types of network structures such as a recursive structure and unsupervised learning, as discussed in various fields, including pattern recognition and time-series prediction. In finance, although some examples of recursive neural networks for time-series analyses exist, to the best of our knowledge, their use for option pricing has not been considered sufficiently. Moreover, it is important to use empirical data and demonstrate the practicability and applications for risk management in actual commodity market businesses. Such further investigation is interesting and could be considered potential topics for future study.