Diffusion Sampling Correction via Approximately 10 Parameters
Guangyi Wang, Wei Peng, Lijiang Li, Wenyu Chen, Yuren Cai, Songzhi Su
TL;DR
Diffusion Probabilistic Models suffer slow sampling, and training-based distillation methods add substantial cost and can disrupt interpolation between modes. PAS introduces a PCA-driven, plug-and-play correction that learns a tiny set of coordinates in a low-dimensional sampling subspace to adjust sampling directions, combined with an adaptive search that targets high-curvature regions of the trajectory. The approach achieves significant improvements (e.g., CIFAR10 FID from 15.69 to 4.37 at $\text{NFE}=10$) with roughly $4$–$12$ parameters and minutes of training, while preserving the underlying ODE trajectory. PAS demonstrates strong, dataset- and solver-agnostic gains across unconditional and conditional pre-trained DPMs, offering a practical route to fast, high-quality diffusion sampling. The work also provides theoretical and empirical insights into why diffusion trajectories lie in low-dimensional subspaces and how consistent geometric structure across samples can be leveraged for efficient correction.
Abstract
While powerful for generation, Diffusion Probabilistic Models (DPMs) face slow sampling challenges, for which various distillation-based methods have been proposed. However, they typically require significant additional training costs and model parameter storage, limiting their practicality. In this work, we propose PCA-based Adaptive Search (PAS), which optimizes existing solvers for DPMs with minimal additional costs. Specifically, we first employ PCA to obtain a few basis vectors to span the high-dimensional sampling space, which enables us to learn just a set of coordinates to correct the sampling direction; furthermore, based on the observation that the cumulative truncation error exhibits an ``S"-shape, we design an adaptive search strategy that further enhances the sampling efficiency and reduces the number of stored parameters to approximately 10. Extensive experiments demonstrate that PAS can significantly enhance existing fast solvers in a plug-and-play manner with negligible costs. E.g., on CIFAR10, PAS optimizes DDIM's FID from 15.69 to 4.37 (NFE=10) using only 12 parameters and sub-minute training on a single A100 GPU. Code is available at https://github.com/onefly123/PAS.
