Soft-constrained Schrodinger Bridge: a Stochastic Control Approach
Jhanvi Garg, Xianyang Zhang, Quan Zhou
TL;DR
This work introduces the soft-constrained Schrödinger bridge (SSB), which replaces the hard terminal constraint of the classical Schrödinger bridge with a penalized KL term controlled by $\beta$, enabling robust diffusion-based sampling when the target is uncertain or data-limited. The authors derive the optimal drift as $u_t^* = \sigma^2 \nabla \log h(X_t^{u^*}, t)$ with $h$ defined via conditional expectations, and show that the optimal terminal law is a geometric mixture between the target $\mu_T$ and the uncontrolled terminal distribution. They extend the framework to time-series targets, provide an existence/uniqueness theory via a generalized Schrödinger system, and develop a score-matching with importance sampling algorithm to learn geometric mixtures. Experiments on MNIST illustrate how moderate values of $\beta$ yield high-quality samples by leveraging large, high-fidelity reference data while maintaining fidelity to noisy targets. Overall, SSB offers a flexible, theoretically grounded route to robust generative diffusion and time-series data synthesis with controllable regularization through $\beta$.
Abstract
Schrödinger bridge can be viewed as a continuous-time stochastic control problem where the goal is to find an optimally controlled diffusion process whose terminal distribution coincides with a pre-specified target distribution. We propose to generalize this problem by allowing the terminal distribution to differ from the target but penalizing the Kullback-Leibler divergence between the two distributions. We call this new control problem soft-constrained Schrödinger bridge (SSB). The main contribution of this work is a theoretical derivation of the solution to SSB, which shows that the terminal distribution of the optimally controlled process is a geometric mixture of the target and some other distribution. This result is further extended to a time series setting. One application is the development of robust generative diffusion models. We propose a score matching-based algorithm for sampling from geometric mixtures and showcase its use via a numerical example for the MNIST data set.
