Unsupervised Learning-based Calibration Scheme for Rough Volatility Models
Changqing Teng, Guanglian Li
TL;DR
The paper tackles calibrating rough volatility models where non-Markovian dynamics hinder PDE-based pricing. It introduces an unsupervised Deep BSDE scheme that treats model parameters as trainable weights and solves the associated BSPDE/BSDE pricing problem forward in time, using terminal mispricing as the loss. A key theoretical result bounds the pricing discrepancy $F(\theta)$ by the learning loss $\mathcal{L}(\theta;\nu)$, and neural networks' universal approximation ensures this loss can be driven arbitrarily close to zero. Numerical experiments on simulated data and historical S&P 500 data with the rBergomi model demonstrate accurate calibration and substantial computational efficiency gains over traditional Monte Carlo methods, supporting online market-fitting applications and suggesting extensions to time-dependent parameters.
Abstract
Existing deep learning-based calibration scheme for rough volatility models predominantly rely on supervised learning frameworks, which incur significant computational costs due to the necessity of generating massive synthetic training datasets. In this work, we propose a novel unsupervised learning-based calibration scheme for rough volatility models that eliminates the data generation bottleneck. Our approach leverages the backward stochastic differential equation (BSDE) representation of the pricing function derived by Bayer et al. \cite{bayer2022pricing}. By treating model parameters as trainable variables, we simultaneously approximate the BSDE solution and optimize the parameters within a unified neural network training process, with the terminal misfit as the loss. We theoretically establish that the mean squared error between the model-implied prices and market data is bounded by the loss function. Furthermore, we prove that the loss can be minimized to an arbitary degree, depending on the model's market fitting capacity and the universal approximation capability of neural networks. Numerical experiments for both simulated and historical S\&P 500 data based on rough Bergomi (rBergomi) model demonstrate the efficiency and accuracy of the proposed scheme.
