Wasserstein Convergence Guarantees for a General Class of Score-Based Generative Models
Xuefeng Gao, Hoang M. Nguyen, Lingjiong Zhu
TL;DR
<3-5 sentence high-level summary>This work develops a rigorous convergence theory for a broad class of score-based generative models (SGMs) implemented via stochastic differential equations (SDEs). By assuming a strongly log-concave data distribution and accurate score estimation, the authors derive a 2-Wasserstein ($\mathcal{W}_{2}$) contraction bound for the reverse SDE, and decompose the error into initialization, discretization, and score-matching components to obtain iteration-complexity guarantees. The theory is specialized to Variance-Exploding (VE) and Variance-Preserving (VP) forward processes, including novel polynomial and exponential noise schedules, and is complemented by lower bounds showing limits of the approach in Gaussian and general settings. Numerical experiments on CIFAR-10 confirm that forward-process choices with lower predicted iteration complexity yield better sample quality, aligning with the theoretical predictions and highlighting the practical impact of forward-process design in SGMs.
Abstract
Score-based generative models (SGMs) is a recent class of deep generative models with state-of-the-art performance in many applications. In this paper, we establish convergence guarantees for a general class of SGMs in 2-Wasserstein distance, assuming accurate score estimates and smooth log-concave data distribution. We specialize our result to several concrete SGMs with specific choices of forward processes modelled by stochastic differential equations, and obtain an upper bound on the iteration complexity for each model, which demonstrates the impacts of different choices of the forward processes. We also provide a lower bound when the data distribution is Gaussian. Numerically, we experiment SGMs with different forward processes, some of which are newly proposed in this paper, for unconditional image generation on CIFAR-10. We find that the experimental results are in good agreement with our theoretical predictions on the iteration complexity, and the models with our newly proposed forward processes can outperform existing models.
