Tight Bounds for Schrödinger Potential Estimation in Unpaired Data Translation
Nikita Puchkin, Denis Suchkov, Alexey Naumov, Denis Belomestny
TL;DR
This work studies learning Schrödinger potentials for unpaired data translation under an Ornstein–Uhlenbeck reference process. It defines population and empirical risks using only iid samples from the initial and target marginals and proves a non-asymptotic, high-probability bound on the KL divergence between the true optimal coupling π^* and its empirical estimator, capturing fast rates when the approximation error Δ is small. The key technical contribution is a nearly parametric rate for the KL error, driven by OU mixing properties and a Bernstein-type concentration framework, with explicit dependence on data geometry and the time horizon T. The approach is validated through numerical experiments on Gaussian mixtures, single-cell data, and unpaired image-to-image translation, showing improvements over baseline LightSB and illustrating practical applicability to high-dimensional generative tasks.
Abstract
Modern methods of generative modelling and unpaired data translation based on Schrödinger bridges and stochastic optimal control theory aim to transform an initial density to a target one in an optimal way. In the present paper, we assume that we only have access to i.i.d. samples from initial and final distributions. This makes our setup suitable for both generative modelling and unpaired data translation. Relying on the stochastic optimal control approach, we choose an Ornstein-Uhlenbeck process as the reference one and estimate the corresponding Schrödinger potential. Introducing a risk function as the Kullback-Leibler divergence between couplings, we derive tight bounds on generalization ability of an empirical risk minimizer in a class of Schrödinger potentials including Gaussian mixtures. Thanks to the mixing properties of the Ornstein-Uhlenbeck process, we almost achieve fast rates of convergence up to some logarithmic factors in favourable scenarios. We also illustrate performance of the suggested approach with numerical experiments.
