Identifying Drift, Diffusion, and Causal Structure from Temporal Snapshots
Vincent Guan, Joseph Janssen, Hossein Rahmani, Andrew Warren, Stephen Zhang, Elina Robeva, Geoffrey Schiebinger
TL;DR
This work addresses the challenge of learning drift and diffusion parameters of an SDE from temporal marginals when trajectories are unobserved. It establishes a sharp identifiability criterion: for linear additive-noise SDEs, drift $A$ and diffusion $H=GG^\top$ are identifiable from marginals if and only if the initial distribution $X_0$ is not auto-rotationally invariant, and it shows that the causal graph can be recovered from these parameters. The paper then introduces APPEX, an algorithm that alternates trajectory inference via anisotropic Schrödinger-bridge OT (AEOT) with maximum-likelihood parameter estimation, provably reducing KL divergence at each step and converging to the true SDE under suitable conditions. Empirical results on simulated data demonstrate that APPEX accurately recovers drift, diffusion, and causal structure and outperforms Waddington-OT, even in high dimensions and in the presence of latent confounders, highlighting its practical potential for systems biology and related domains.
Abstract
Stochastic differential equations (SDEs) are a fundamental tool for modelling dynamic processes, including gene regulatory networks (GRNs), contaminant transport, financial markets, and image generation. However, learning the underlying SDE from data is a challenging task, especially if individual trajectories are not observable. Motivated by burgeoning research in single-cell datasets, we present the first comprehensive approach for jointly identifying the drift and diffusion of an SDE from its temporal marginals. Assuming linear drift and additive diffusion, we prove that these parameters are identifiable from marginals if and only if the initial distribution lacks any generalized rotational symmetries. We further prove that the causal graph of any SDE with additive diffusion can be recovered from the SDE parameters. To complement this theory, we adapt entropy-regularized optimal transport to handle anisotropic diffusion, and introduce APPEX (Alternating Projection Parameter Estimation from $X_0$), an iterative algorithm designed to estimate the drift, diffusion, and causal graph of an additive noise SDE, solely from temporal marginals. We show that APPEX iteratively decreases Kullback-Leibler divergence to the true solution, and demonstrate its effectiveness on simulated data from linear additive noise SDEs.
