On the Trajectory Regularity of ODE-based Diffusion Sampling
Defang Chen, Zhenyu Zhou, Can Wang, Chunhua Shen, Siwei Lyu
TL;DR
The paper investigates the trajectory regularity of ODE-based diffusion sampling, revealing a boomerang-shaped, largely straight sampling path governed by an implicit denoising trajectory. By modeling data with kernel density estimation, it derives a closed-form denoising solution linked to annealed mean shift and demonstrates that the sampling length scales as $\sigma_T\sqrt{d}$ with a nearly constant vector-field magnitude. Leveraging this regularity, it introduces Geometry-Inspired Time Scheduling (GITS), a dynamic-programming approach that re-allocates time steps using a small warmup, yielding substantial FID gains at few function evaluations with minimal overhead. The method provides theoretical and empirical insights into the sampling mechanism, offering a practical, fast route to higher-quality image generation on standard benchmarks. Overall, the work connects trajectory structure, KDE-based denoising, and DP-based scheduling to deliver efficient, principled acceleration for diffusion samplers.
Abstract
Diffusion-based generative models use stochastic differential equations (SDEs) and their equivalent ordinary differential equations (ODEs) to establish a smooth connection between a complex data distribution and a tractable prior distribution. In this paper, we identify several intriguing trajectory properties in the ODE-based sampling process of diffusion models. We characterize an implicit denoising trajectory and discuss its vital role in forming the coupled sampling trajectory with a strong shape regularity, regardless of the generated content. We also describe a dynamic programming-based scheme to make the time schedule in sampling better fit the underlying trajectory structure. This simple strategy requires minimal modification to any given ODE-based numerical solvers and incurs negligible computational cost, while delivering superior performance in image generation, especially in $5\sim 10$ function evaluations.
