Provable Acceleration for Diffusion Models under Minimal Assumptions
Gen Li, Changxiao Cai
TL;DR
This work tackles the slow sampling of score-based diffusion models by proposing a training-free accelerated sampler that operates under minimal assumptions: only $L^2$-accurate score estimates and finite second moments of the target distribution. The approach injects randomness and employs a momentum-like, second-order correction derived from higher-order approximations to the probability flow ODE, combined with clipping to control large errors. The authors prove that the sampler achieves $\varepsilon$-accuracy in total variation within $\widetilde{O}(d^{5/4}/\sqrt{\varepsilon})$ iterations (up to a burn-in cost), improving over the standard $\widetilde{O}(d/\varepsilon)$ rate for $\varepsilon \le 1/\sqrt{d}$, without requiring distributional or high-order smoothness assumptions. This represents the first provable acceleration for score-based samplers under such relaxed conditions, offering practical speedups for high-dimensional diffusion-model sampling.
Abstract
Score-based diffusion models, while achieving minimax optimality for sampling, are often hampered by slow sampling speeds due to the high computational burden of score function evaluations. Despite the recent remarkable empirical advances in speeding up the score-based samplers, theoretical understanding of acceleration techniques remains largely limited. To bridge this gap, we propose a novel training-free acceleration scheme for stochastic samplers. Under minimal assumptions -- namely, $L^2$-accurate score estimates and a finite second-moment condition on the target distribution -- our accelerated sampler provably achieves $\varepsilon$-accuracy in total variation within $\widetilde{O}(d^{5/4}/\sqrt{\varepsilon})$ iterations, thereby significantly improving upon the $\widetilde{O}(d/\varepsilon)$ iteration complexity of standard score-based samplers for $\varepsilon\leq 1/\sqrt{d}$. Notably, our convergence theory does not rely on restrictive assumptions on the target distribution or higher-order score estimation guarantees.
