Optimizing for the Shortest Path in Denoising Diffusion Model
Ping Chen, Xingpeng Zhang, Zhaoxiang Liu, Huan Hu, Xiang Liu, Kai Wang, Min Wang, Yanlin Qian, Shiguo Lian
TL;DR
This work addresses the computational bottleneck of diffusion-based generative models by reframing denoising as a shortest-path problem over a reverse-step graph. ShortDF optimizes initial residuals and propagates them through a residual-path relaxation, using a multi-state training setup to stabilize learning and edge-weight estimation. The method achieves substantial inference-speedups (reducing steps from around $1000$ to below $20$) while preserving or improving sample fidelity, as demonstrated on CIFAR-10, CelebA, and LSUN Churches with favorable FID scores and speed metrics. This graph-theoretic approach enables efficient, high-quality diffusion suitable for interactive or real-time applications, and provides a foundation for end-to-end optimization of diffusion samplers and generators.
Abstract
In this research, we propose a novel denoising diffusion model based on shortest-path modeling that optimizes residual propagation to enhance both denoising efficiency and quality. Drawing on Denoising Diffusion Implicit Models (DDIM) and insights from graph theory, our model, termed the Shortest Path Diffusion Model (ShortDF), treats the denoising process as a shortest-path problem aimed at minimizing reconstruction error. By optimizing the initial residuals, we improve the efficiency of the reverse diffusion process and the quality of the generated samples. Extensive experiments on multiple standard benchmarks demonstrate that ShortDF significantly reduces diffusion time (or steps) while enhancing the visual fidelity of generated samples compared to prior arts. This work, we suppose, paves the way for interactive diffusion-based applications and establishes a foundation for rapid data generation. Code is available at https://github.com/UnicomAI/ShortDF.
