ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion Probabilistic Models
Shengming Li, Luping Liu, Runnan Li, Xu Tan
TL;DR
ERA-Solver tackles slow diffusion-model sampling by casting denoising as a diffusion ODE and introducing an error-robust implicit Adams solver. It replaces fixed-predictor coefficients with a Lagrange interpolation-based predictor and an adaptive noise-basis selection mechanism to accommodate diverse error patterns from pretrained models, all without retraining. The authors show third-order local error and second-order convergence for suitable interpolation order, plus robustness guarantees for basis selection, and demonstrate substantial FID improvements on CIFAR-10, CelebA, LSUN-Church, and ImageNet 64×64 with as few as 10 function evaluations. This training-free, model-agnostic approach significantly accelerates diffusion-based generation while preserving or enhancing sample quality, enabling practical deployment across tasks.
Abstract
Though denoising diffusion probabilistic models (DDPMs) have achieved remarkable generation results, the low sampling efficiency of DDPMs still limits further applications. Since DDPMs can be formulated as diffusion ordinary differential equations (ODEs), various fast sampling methods can be derived from solving diffusion ODEs. However, we notice that previous fast sampling methods with fixed analytical form are not able to robust with the various error patterns in the noise estimated from pretrained diffusion models. In this work, we construct an error-robust Adams solver (ERA-Solver), which utilizes the implicit Adams numerical method that consists of a predictor and a corrector. Different from the traditional predictor based on explicit Adams methods, we leverage a Lagrange interpolation function as the predictor, which is further enhanced with an error-robust strategy to adaptively select the Lagrange bases with lower errors in the estimated noise. The proposed solver can be directly applied to any pretrained diffusion models, without extra training. Experiments on Cifar10, CelebA, LSUN-Church, and ImageNet 64 x 64 (conditional) datasets demonstrate that our proposed ERA-Solver achieves 3.54, 5.06, 5.02, and 5.11 Frechet Inception Distance (FID) for image generation, with only 10 network evaluations.
