Radu Briciu
BSc Finance, PgDip Quantitative Finance
See testing results in the example outputs direcotry
In this paper we will explore a proposed data generation method which will construct the feature matrix for a multi-layer perceptron model designed to estimate the pricing functional of path-dependent financial derivatives. We will explore the theoretical framework and specifications for barrier and Asian option feature generation along with their respective multi-layer perceptron architectures and model performance metrics. We will demonstrate how the proposed models can retain pricing errors below one percent while reducing the computation time by up to 99.8%.
- Introduction
- Pricing Model
- Data Generation Method
- Application to Path-Dependent Options
- Model Training
- Model Testing
- Concluding Remarks
- Software Repositories
In this paper we will explore a proposed method of pricing path-dependent index options via multi-layer perceptron approximations derived from the simulation of a multidimensional space representing a contract's price as a functional form of its features. Index options can generally be defined as financial derivative contracts facilitating a contingent claim on the value of a stock market index which is designed to aggregate performance across a sector or economy.
It is therefore a complicated matter to evaluate the price of path-dependent option counterparts which introduce further non-linearities in their pricing functions. Path-dependent options often require statistical simulation to obtain the most accurate price, resulting in long computation times.
We propose a method to generate possible values of a function which will proceed the optimization of a multi-layer perceptron model, effectively proxying the stochastic non-linear functional form between the output and its input features. To generate a representative sample space, we calibrate historical Heston (1993) parameters using market observed risk-free and dividend rates accompanied by live options trade data, thereby effectively simulating, in the case of this paper, the SPX index options market.
The use of Heston’s stochastic volatility model as a pricing function for the underlying index allows for discrete monitoring of volatility and pricing of various market scenarios and contract types. This paper serves as a framework and demonstration of a generalized estimation process for barrier and Asian options along with model specifications and retraining analyses.
We use the Heston (1993) model to describe the log-price of the underlying, via the SDE system:
where
-
$v_0$ = initial variance, -
$\theta$ = long-run variance, -
$\rho$ = correlation between log-price and variance process, -
$\kappa$ = mean reversion speed, -
$\eta$ = vol of variance process, -
$B_t, W_t$ = Brownian motions.
Heston (1993) extends Black-Scholes (1973) by allowing stochastic volatility, making it suitable for pricing path-dependent options.
We construct a dataset of historical parameter sets by calibrating to live SPX options trades. This calibration yields ~1600 sets between 2012–2024.
Filtering ensures each volatility surface has strikes both above and below the spot price (at least 2 each side, with ≥5 contracts) to allow reliable Heston calibration. Calibration is performed using the Levenberg–Marquardt algorithm as implemented in QuantLib.
We start with an initial feature matrix
For each row
The Cartesian product of these candidate values produces the row-specific feature block:
Stacking every
Based on
A Barrier option modifies the European call payoff
We generalize the barrier price functional:
Benchmark pricing is via Monte Carlo with control variates; finite-difference methods are used for computational savings during dataset generation.
The payoff is based on the average of the underlying over fixing dates:
Arithmetic average payoff:
Geometric average payoff:
Definitions of averages:
Generalized pricing function:
In order to simulate the multidimensional space representing a Barrier option's price as a function of its features, we begin by iterating through the row dimension of
where
-
$K_{S}$ is a set of strikes spread around the spot, -
$T$ is a set of maturities, -
$B$ is a set of barrier levels, -
$R$ is a set of rebates, which for the purposes of this study is a set consisting of only the element$0$ (zero), -
$F^{\text{call/put}}$ is a set of categorical variables representing the type of underlying European option, and -
$F^{\text{Out/In}}$ is a set of categorical variables representing the Barrier option type payoff.
It is to be noted that the only feasible combinations are Down options with
The subset feature matrix
This essentially allows us to generate
In the case of Asian options, the feature matrix generation process is not as straightforward. In order to simulate an evenly distributed sample space, we need to take into account the relationship between an Asian option's maturity and its fixing dates. This will require careful construction of a maturities vector
which precedes the creation of
where
-
$\tau$ is the iterated maturity in$T^{\text{Asian}}$ , -
$K_{S}$ is a set of strikes spread around the spot price$S$ , -
$A_{\tau}$ are factors of$\tau$ which will determine the number of fixing dates applicable to the contract, -
$P$ is a set of past fixings, which for the purposes of this study is a set consisting of only the element$0$ (zero), -
$F^{\text{call/put}}$ is a set of categorical variables representing the type of underlying European option, and -
$F^{\text{arithmetic/geometric}}$ is a set of categorical variables representing the contract's averaging type.
The subset feature matrix
essentially allowing us to generate
We train a multi-layer perceptron (MLP) on the feature matrix
Hyperparameter tuning across 6480 configurations ensures in-sample error ~0.5% and out-of-sample error <5% (Asian options).
See testing results in the example outputs direcotry
The model was tested with a negligible amount of data and tested for over 10 years of out of sample data.
Figure: Barrier options out-of-sample errors
Figure: Asian options out-of-sample errors
Errors increase without retraining, due to unseen parameter combinations.
The neural network preserves distributional shapes of option prices vs Heston parameters:
Figure: Barrier option price/parameter joint distribution
Figure: Asian option price/parameter joint distribution
We proposed a data generation routine to produce parsimonious training sets for exotic option pricing.
- Barrier options priced via advection (approximate).
- Asian options priced via Monte Carlo.
- Neural networks approximate the Heston functional form.
- Achieved >99% reduction in computation time with in-sample error <1%.






