Gradient-flow SDEs have unique transient population dynamics
Vincent Guan, Joseph Janssen, Nicolas Lanzetti, Antonio Terpin, Geoffrey Schiebinger, Elina Robeva
TL;DR
This work tackles the hard problem of identifying both drift and diffusion in gradient-flow SDEs from observed marginals. It proves that joint identifiability is possible if and only if the process is observed away from equilibrium, and further shows that three distinct marginals suffice to identify the true SDE from any countable candidate set. Building on this theory, it introduces nn-APPEX, a tri-level Schrödinger Bridge-based algorithm that jointly learns the gradient-flow drift $- abla \Psi$ and diffusivity $\sigma^2$ by iteratively inferring trajectories, updating drift via neural networks, and re-estimating diffusion from the inferred paths. Empirical results across multiple potentials demonstrate that learning diffusion is critical to unbiased drift estimation, with nn-APPEX outperforming prior SB methods and closely approaching the true SDE from marginals, offering a principled framework for population dynamics inference when diffusion is unknown.
Abstract
Identifying the drift and diffusion of an SDE from its population dynamics is a notoriously challenging task. Researchers in machine learning and single cell biology have only been able to prove a partial identifiability result: for potential-driven SDEs, the gradient-flow drift can be identified from temporal marginals if the Brownian diffusivity is already known. Existing methods therefore assume that the diffusivity is known a priori, despite it being unknown in practice. We dispel the need for this assumption by providing a complete characterization of identifiability: the gradient-flow drift and Brownian diffusivity are jointly identifiable from temporal marginals if and only if the process is observed outside of equilibrium. Given this fundamental result, we propose nn-APPEX, the first Schrödinger Bridge-based inference method that can simultaneously learn the drift and diffusion of gradient-flow SDEs solely from observed marginals. Extensive numerical experiments show that nn-APPEX's ability to adjust its diffusion estimate enables accurate inference, while previous Schrödinger Bridge methods obtain biased drift estimates due to their assumed, and likely incorrect, diffusion.
