Faster Diffusion Models via Higher-Order Approximation
Gen Li, Yuchen Zhou, Yuting Wei, Yuxin Chen
TL;DR
The paper tackles slow sampling in diffusion models by proposing HEROISM, a training-free, high-order ODE-based sampler that uses Lagrange interpolation over multiple time points and a successive-refinement scheme. It proves provable convergence in total variation to the target distribution under mild assumptions and shows a favorable iteration complexity scaling as $\tilde{O}(d^{1+2/K}/\varepsilon^{1/K})$ for fixed $K$, with near-linear dependence on the data dimension when $K$ is held constant. The theory accommodates inexact score estimates, revealing graceful degradation with score/Jacobian errors, and improves upon prior deterministic and stochastic acceleration results by relaxing smoothness and log-concavity requirements and reducing dependence on the data support size. Overall, the work provides a solid theoretical framework for higher-order accelerated samplers in diffusion models and suggests directions for scaling and stochastic extensions.
Abstract
In this paper, we explore provable acceleration of diffusion models without any additional retraining. Focusing on the task of approximating a target data distribution in $\mathbb{R}^d$ to within $\varepsilon$ total-variation distance, we propose a principled, training-free sampling algorithm that requires only the order of $$ d^{1+2/K} \varepsilon^{-1/K} $$ score function evaluations (up to log factor) in the presence of accurate scores, where $K>0$ is an arbitrary fixed integer. This result applies to a broad class of target data distributions, without the need for assumptions such as smoothness or log-concavity. Our theory is robust vis-a-vis inexact score estimation, degrading gracefully as the score estimation error increases -- without demanding higher-order smoothness on the score estimates as assumed in previous work. The proposed algorithm draws insight from high-order ODE solvers, leveraging high-order Lagrange interpolation and successive refinement to approximate the integral derived from the probability flow ODE. More broadly, our work develops a theoretical framework towards understanding the efficacy of high-order methods for accelerated sampling.
