Table of Contents
Fetching ...

DynGMA: a robust approach for learning stochastic differential equations from data

Aiqing Zhu, Qianxiao Li

TL;DR

This paper tackles learning fully unknown drift $f$ and diffusion $\sigma$ in stochastic differential equations from discrete trajectories. It introduces DynGMA, a dynamical Gaussian mixture density that leverages multi-step Gaussian approximations and cubature to enable likelihood-based training with moderate-to-large time steps and variable sampling intervals, while supporting invariant-distribution computation. The approach provides theoretical error nuances via an asymptotic Gaussian density expansion and practical neural-network parameterizations for $f_\theta$ and $\sigma_\theta$, yielding accurate reconstructions across low- and high-dimensional SDEs, as well as data from Gillespie stochastic simulations. The method demonstrates robustness to measurement noise, scalability to 10D systems, and improved estimation of invariant structures, with potential for integrating partial physics-informed terms and extending to non-Gaussian noise in future work.

Abstract

Learning unknown stochastic differential equations (SDEs) from observed data is a significant and challenging task with applications in various fields. Current approaches often use neural networks to represent drift and diffusion functions, and construct likelihood-based loss by approximating the transition density to train these networks. However, these methods often rely on one-step stochastic numerical schemes, necessitating data with sufficiently high time resolution. In this paper, we introduce novel approximations to the transition density of the parameterized SDE: a Gaussian density approximation inspired by the random perturbation theory of dynamical systems, and its extension, the dynamical Gaussian mixture approximation (DynGMA). Benefiting from the robust density approximation, our method exhibits superior accuracy compared to baseline methods in learning the fully unknown drift and diffusion functions and computing the invariant distribution from trajectory data. And it is capable of handling trajectory data with low time resolution and variable, even uncontrollable, time step sizes, such as data generated from Gillespie's stochastic simulations. We then conduct several experiments across various scenarios to verify the advantages and robustness of the proposed method.

DynGMA: a robust approach for learning stochastic differential equations from data

TL;DR

This paper tackles learning fully unknown drift and diffusion in stochastic differential equations from discrete trajectories. It introduces DynGMA, a dynamical Gaussian mixture density that leverages multi-step Gaussian approximations and cubature to enable likelihood-based training with moderate-to-large time steps and variable sampling intervals, while supporting invariant-distribution computation. The approach provides theoretical error nuances via an asymptotic Gaussian density expansion and practical neural-network parameterizations for and , yielding accurate reconstructions across low- and high-dimensional SDEs, as well as data from Gillespie stochastic simulations. The method demonstrates robustness to measurement noise, scalability to 10D systems, and improved estimation of invariant structures, with potential for integrating partial physics-informed terms and extending to non-Gaussian noise in future work.

Abstract

Learning unknown stochastic differential equations (SDEs) from observed data is a significant and challenging task with applications in various fields. Current approaches often use neural networks to represent drift and diffusion functions, and construct likelihood-based loss by approximating the transition density to train these networks. However, these methods often rely on one-step stochastic numerical schemes, necessitating data with sufficiently high time resolution. In this paper, we introduce novel approximations to the transition density of the parameterized SDE: a Gaussian density approximation inspired by the random perturbation theory of dynamical systems, and its extension, the dynamical Gaussian mixture approximation (DynGMA). Benefiting from the robust density approximation, our method exhibits superior accuracy compared to baseline methods in learning the fully unknown drift and diffusion functions and computing the invariant distribution from trajectory data. And it is capable of handling trajectory data with low time resolution and variable, even uncontrollable, time step sizes, such as data generated from Gillespie's stochastic simulations. We then conduct several experiments across various scenarios to verify the advantages and robustness of the proposed method.
Paper Structure (17 sections, 5 theorems, 65 equations, 12 figures, 4 tables, 2 algorithms)

This paper contains 17 sections, 5 theorems, 65 equations, 12 figures, 4 tables, 2 algorithms.

Key Result

Theorem 3.1

Consider an SDE with a small parameter: Suppose elements of the drift $f$ and diffusion $\sigma$ have bounded partial derivatives up to order $2$, then there exists a Gaussian process $X_{GP}(t) = X_0(t) + \varepsilon X_1(t)$ such that Here, $X_0(t)$ and $X_1(t)$ satisfy the following differential equations: where $J_{f } = \frac{\partial f }{\partial x}$ denotes the Jacobian of $f$, and $\math

Figures (12)

  • Figure 1: Illustration of the proposed DynGMA with $3$ evaluation points. (Black box section) We discretize the time interval $[0, \Delta t]$ into $K$ sub-intervals with a step size of $h$, and then use Gaussian mixture model to approximate the $k$-step transition density (see \ref{['DynGMA']} for details). (Red box section) At each step of size $h$, we solve (\ref{['eq: gauss discretization2']}) to determine each Gaussian density within the Gaussian mixture model (see \ref{['sec:gaussian']} for details).
  • Figure 2: Comparison of approximations and the exact density for Beneš SDE.
  • Figure 3: Learned drift functions and generalized potential of the learned dynamics for the two-dimensional system (\ref{['eq:2d']}).
  • Figure 4: Time evolution of the distribution (Top), the mean and the standard deviation (Bottom) of the learned and exact dynamics for the two-dimensional system.
  • Figure 5: Errors in the learned diffusion coefficients and the generalized potential of the learned dynamics for multi-step loss
  • ...and 7 more figures

Theorems & Definitions (10)

  • Remark 3.1
  • Theorem 3.1
  • Corollary 3.1
  • Theorem 3.2
  • Theorem 3.3
  • Theorem 3.4
  • proof : Proof of \ref{['cor:gauss']}
  • proof : Proof of \ref{['the:dis con']}
  • proof : Proof of \ref{['the:dis con2']}
  • proof : Proof of \ref{['the:dis con3']}