Mirror Bridges Between Probability Measures
Leticia Mattos Da Silva, Silvia Sellán, Francisco Vargas, Justin Solomon
TL;DR
The paper proposes mirror Schrödinger bridges, a self-mapping variant of the Schrödinger bridge problem, to generate in-distribution variations proximal to a given input. By exploiting time symmetry, it develops a drift-averaging-based AMP that reduces computation (single drift model, halved iterations) while maintaining proximity control through a parameter $\sigma$. The authors provide theoretical insights, including a first-order equivalence between the reverse KL projection and the averaged drift, and demonstrate empirical gains on Gaussian, 2D, and image datasets with controllable sample proximity and improved efficiency. This approach offers a principled, scalable tool for conditional resampling and data augmentation within the data distribution.
Abstract
Resampling from a target measure whose density is unknown is a fundamental problem in mathematical statistics and machine learning. A setting that dominates the machine learning literature consists of learning a map from an easy-to-sample prior, such as the Gaussian distribution, to a target measure. Under this model, samples from the prior are pushed forward to generate a new sample on the target measure, which is often difficult to sample from directly. Of particular interest is the problem of generating a new sample that is proximate to or otherwise conditioned on a given input sample. In this paper, we propose a new model called mirror bridges to solve this problem of conditional resampling. Our key observation is that solving the Schrödinger bridge problem between a distribution and itself provides a natural way to produce new samples from conditional distributions, giving in-distribution variations of an input data point. We demonstrate how to efficiently estimate the solution to this largely overlooked version of the Schrödinger bridge problem, and we prove that under mild conditions, the difference between our estimate and the true Schrödinger bridge can be controlled explicitly. We show that our proposed method leads to significant algorithmic simplifications over existing alternatives, in addition to providing control over in-distribution variation. Empirically, we demonstrate how these benefits can be leveraged to produce proximal samples in a number of application domains.
