Score-based generative models are provably robust: an uncertainty quantification perspective
Nikiforos Mimikos-Stamatopoulos, Benjamin J. Zhang, Markos A. Katsoulakis
TL;DR
This work develops a PDE-regularity–driven uncertainty-quantification framework for score-based generative models (SGMs). By deriving the Wasserstein Uncertainty Propagation (WUP) theorem, it shows how $L^2$-norm errors in the learned score translate into integral probability metric deviations (e.g., $\mathbf{d}_1$ and TV) of the generated distribution under the diffusion flow. It provides explicit generalization bounds for both denoising score matching (DSM) and denoising score matching with early stopping (DSM-ESM), relying on minimal assumptions and without requiring manifold-density assumptions. The analysis leverages Kolmogorov backward equations and Bernstein-type gradient estimates for Hamilton-Jacobi-Bellman PDEs to obtain computable bounds, clarifying how error trade-offs, early stopping, and reference measures impact robustness. The results pave the way for principled uncertainty quantification, connections to likelihood-free inference, and distributionally robust optimization in SGMs.
Abstract
Through an uncertainty quantification (UQ) perspective, we show that score-based generative models (SGMs) are provably robust to the multiple sources of error in practical implementation. Our primary tool is the Wasserstein uncertainty propagation (WUP) theorem, a model-form UQ bound that describes how the $L^2$ error from learning the score function propagates to a Wasserstein-1 ($\mathbf{d}_1$) ball around the true data distribution under the evolution of the Fokker-Planck equation. We show how errors due to (a) finite sample approximation, (b) early stopping, (c) score-matching objective choice, (d) score function parametrization expressiveness, and (e) reference distribution choice, impact the quality of the generative model in terms of a $\mathbf{d}_1$ bound of computable quantities. The WUP theorem relies on Bernstein estimates for Hamilton-Jacobi-Bellman partial differential equations (PDE) and the regularizing properties of diffusion processes. Specifically, PDE regularity theory shows that stochasticity is the key mechanism ensuring SGM algorithms are provably robust. The WUP theorem applies to integral probability metrics beyond $\mathbf{d}_1$, such as the total variation distance and the maximum mean discrepancy. Sample complexity and generalization bounds in $\mathbf{d}_1$ follow directly from the WUP theorem. Our approach requires minimal assumptions, is agnostic to the manifold hypothesis and avoids absolute continuity assumptions for the target distribution. Additionally, our results clarify the trade-offs among multiple error sources in SGMs.
