An efficient Wasserstein-distance approach for reconstructing jump-diffusion processes using parameterized neural networks
Mingtao Xia, Xiangting Li, Qijing Shen, Tom Chou
TL;DR
This work tackles the inverse problem of reconstructing multidimensional jump-diffusion processes from data by leveraging Wasserstein-distance based losses. It introduces a temporally decoupled squared $W_2$ distance, $\tilde{W}_2^2(\mu, \hat{\mu}) = \int_0^T W_2^2(\mu(t), \hat{\mu}(t)) dt$, which is efficiently computable from finite-sample trajectories and provides both upper and lower bounds on reconstruction errors of the drift $\bm{f}$, diffusion $\bm{\sigma}$, and jump $\bm{\beta}$ when approximating the true process with a neural-network parameterized model. Theoretical results establish that $W_p(\mu,\hat{\mu})$ lower-bounds the aggregate coefficient discrepancies while the temporally decoupled form offers practical finite-sample estimability, with well-defined finite-time projections ensuring convergence properties. Numerical experiments demonstrate that minimizing the temporally decoupled $W_2$ loss yields more accurate reconstructions than standard losses (e.g., MSE, MMD, $W_1$, $W_2$, WGAN), and that incorporating prior information on the drift can substantially improve diffusion and jump-function recovery. The approach holds promise for efficient, accurate inference of complex stochastic systems in finance, biology, and beyond, while pointing to future work on higher dimensions and more general noise types.
Abstract
We analyze the Wasserstein distance ($W$-distance) between two probability distributions associated with two multidimensional jump-diffusion processes. Specifically, we analyze a temporally decoupled squared $W_2$-distance, which provides both upper and lower bounds associated with the discrepancies in the drift, diffusion, and jump amplitude functions between the two jump-diffusion processes. Then, we propose a temporally decoupled squared $W_2$-distance method for efficiently reconstructing unknown jump-diffusion processes from data using parameterized neural networks. We further show its performance can be enhanced by utilizing prior information on the drift function of the jump-diffusion process. The effectiveness of our proposed reconstruction method is demonstrated across several examples and applications.
