Efficient Training of Neural Stochastic Differential Equations by Matching Finite Dimensional Distributions
Jianxin Zhang, Josh Viktorov, Doosan Jung, Emily Pitler
TL;DR
This work addresses the inefficiencies of training Neural SDEs with signature-kernel or adversarial objectives by introducing Finite Dimensional Matching (FDM), a strictly proper scoring-rule framework for continuous Markov processes. By converting a strictly proper scoring rule on $bR^{2d}$ into a process-level rule via averaging over two-time marginals, FDM enables an objective that scales as $O(D)$ per epoch, avoiding the PDEs and double integrals that burden prior methods. The authors prove the core scoring rule extension is strictly proper and provide rigorous sample-complexity and sensitivity analyses, complemented by empirical results across diverse financial and synthetic datasets where FDM consistently yields superior generative quality and faster training. The approach offers a principled, scalable alternative to GANs and signature-based methods, with practical impact for mesh-free time-series modeling in finance, physics, and biology. The work also envisions extensions to non-continuous or non-Markov settings, including càdlàg processes and hidden-Markov structures.
Abstract
Neural Stochastic Differential Equations (Neural SDEs) have emerged as powerful mesh-free generative models for continuous stochastic processes, with critical applications in fields such as finance, physics, and biology. Previous state-of-the-art methods have relied on adversarial training, such as GANs, or on minimizing distance measures between processes using signature kernels. However, GANs suffer from issues like instability, mode collapse, and the need for specialized training techniques, while signature kernel-based methods require solving linear PDEs and backpropagating gradients through the solver, whose computational complexity scales quadratically with the discretization steps. In this paper, we identify a novel class of strictly proper scoring rules for comparing continuous Markov processes. This theoretical finding naturally leads to a novel approach called Finite Dimensional Matching (FDM) for training Neural SDEs. Our method leverages the Markov property of SDEs to provide a computationally efficient training objective. This scoring rule allows us to bypass the computational overhead associated with signature kernels and reduces the training complexity from $O(D^2)$ to $O(D)$ per epoch, where $D$ represents the number of discretization steps of the process. We demonstrate that FDM achieves superior performance, consistently outperforming existing methods in terms of both computational efficiency and generative quality.
