Score matching for bridges without learning time-reversals
Elizabeth L. Baker, Moritz Schauer, Stefan Sommer
TL;DR
The paper addresses sampling from bridges of diffusion processes conditioned on an endpoint, where the score term $\nabla_x \log p(t, x; T, y)$ is generally intractable. It proposes a score-matching approach based on adjoint diffusions to learn the forward bridge score directly, avoiding the need to learn time-reversals. The authors derive a KL-based loss that ensures the learned drift produces a bridged process and show that this yields training-time savings and competitive performance relative to time-reversal-based methods across linear and nonlinear SDEs; they also provide code. The approach broadens the applicability of diffusion-bridge methods to non-linear dynamics and end-point distributions, with potential impact for shape analysis and related domains.
Abstract
We propose a new algorithm for learning bridged diffusion processes using score-matching methods. Our method relies on reversing the dynamics of the forward process and using this to learn a score function, which, via Doob's $h$-transform, yields a bridged diffusion process; that is, a process conditioned on an endpoint. In contrast to prior methods, we learn the score term $\nabla_x \log p(t, x; T, y)$ directly, for given $t, y$, completely avoiding first learning a time-reversal. We compare the performance of our algorithm with existing methods and see that it outperforms using the (learned) time-reversals to learn the score term. The code can be found at https://github.com/libbylbaker/forward_bridge.
