Schrödinger Bridge Samplers
Espen Bernton, Jeremy Heng, Arnaud Doucet, Pierre E. Jacob
TL;DR
This work introduces Schrödinger bridge samplers, a regression-augmented IPF approach to approximate target distributions by iteratively reshaping Markov kernels so their T-step marginals converge to the target. It reframes Schrödinger bridges as forward/backward half-bridge KL projections and leverages approximate dynamic programming to learn policy refinements, enabling feasible sampling in continuous spaces without requiring conjugacy. The authors extend the method to sequential, multi-marginal settings (SSB), apply it to discretized Langevin dynamics, and demonstrate substantial variance reductions in normalizing-constant estimation and improved marginal fits in high dimensions and Bayesian regression. The framework connects to optimal transport, flow transport, and shortcuts to adiabaticity, offering a versatile toolkit for diffusion-based sampling and inference in challenging settings.
Abstract
Consider a reference Markov process with initial distribution $π_{0}$ and transition kernels $\{M_{t}\}_{t\in[1:T]}$, for some $T\in\mathbb{N}$. Assume that you are given distribution $π_{T}$, which is not equal to the marginal distribution of the reference process at time $T$. In this scenario, Schrödinger addressed the problem of identifying the Markov process with initial distribution $π_{0}$ and terminal distribution equal to $π_{T}$ which is the closest to the reference process in terms of Kullback--Leibler divergence. This special case of the so-called Schrödinger bridge problem can be solved using iterative proportional fitting, also known as the Sinkhorn algorithm. We leverage these ideas to develop novel Monte Carlo schemes, termed Schrödinger bridge samplers, to approximate a target distribution $π$ on $\mathbb{R}^{d}$ and to estimate its normalizing constant. This is achieved by iteratively modifying the transition kernels of the reference Markov chain to obtain a process whose marginal distribution at time $T$ becomes closer to $π_T = π$, via regression-based approximations of the corresponding iterative proportional fitting recursion. We report preliminary experiments and make connections with other problems arising in the optimal transport, optimal control and physics literatures.
