Convergence Analysis of Probability Flow ODE for Score-based Generative Models
Daniel Zhengyu Huang, Jiaoyang Huang, Zhengjiang Lin
TL;DR
This work analyzes the convergence of deterministic samplers based on probability flow ODEs for score-based generative models, focusing on how score estimation error and time discretization affect sampling fidelity. It establishes a continuous-time total-variation bound of O($d^{3/4}\delta^{1/2}$) under mild assumptions, and a discretized bound combining score error and RK discretization error as O($d^{3/4}\delta^{1/2} + d\cdot(dh)^p$). An iteration complexity of O($d^{1+1/p}\varepsilon^{-1/p}$) is derived for achieving a target TV accuracy, and the theory is validated through numerical experiments on Gaussian mixtures up to 128 dimensions. The results leverage transport along characteristics, interpolation-based discretization analysis, and dimension-free Gagliardo–Nirenberg inequalities, with extensions to general transport equations. Overall, the paper provides both theoretical guarantees and empirical evidence for the stability and efficiency of probability-flow-based score models under nonideal score estimates.
Abstract
Score-based generative models have emerged as a powerful approach for sampling high-dimensional probability distributions. Despite their effectiveness, their theoretical underpinnings remain relatively underdeveloped. In this work, we study the convergence properties of deterministic samplers based on probability flow ODEs from both theoretical and numerical perspectives. Assuming access to $L^2$-accurate estimates of the score function, we prove the total variation between the target and the generated data distributions can be bounded above by $\mathcal{O}(d^{3/4}δ^{1/2})$ in the continuous time level, where $d$ denotes the data dimension and $δ$ represents the $L^2$-score matching error. For practical implementations using a $p$-th order Runge-Kutta integrator with step size $h$, we establish error bounds of $\mathcal{O}(d^{3/4}δ^{1/2} + d\cdot(dh)^p)$ at the discrete level. Finally, we present numerical studies on problems up to 128 dimensions to verify our theory.
