Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance
Cunzheng Wang, Ziyuan Guo, Yuxuan Duan, Huaxia Li, Nemo Chen, Xu Tang, Yao Hu
TL;DR
Diffusion models offer high-quality generation but are slow, and existing consistency-distillation methods suffer from blur and detail loss because of naive target-timestep choices. Target-Driven Distillation (TDD) addresses this by selecting a restricted, strategically chosen set of target timesteps, using decoupled guidance during training, and supporting optional non-equidistant sampling and x0 clipping to improve few-step results. The approach yields state-of-the-art performance in 4–8 step generation across standard benchmarks, with stable training and a guidance scale that can still be tuned at inference. This combination provides a practical pathway to fast, high-fidelity diffusion generation applicable to large-scale models and diverse prompts.
Abstract
Consistency distillation methods have demonstrated significant success in accelerating the generative tasks of diffusion models. However, because previous consistency distillation methods use simple, straightforward strategies for selecting target timesteps, the images they generate often suffer from blur and loss of detail. To address these limitations, we introduce Target-Driven Distillation (TDD), which (1) adopts a careful selection strategy for target timesteps, increasing training efficiency; (2) uses decoupled guidance during training, leaving TDD open to post-tuning of the guidance scale at inference time; and (3) can optionally be equipped with non-equidistant sampling and x0 clipping, enabling more flexible and accurate image sampling. Experiments verify that TDD achieves state-of-the-art performance in few-step generation, offering a better choice among consistency distillation models.
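
To make the two optional sampling components concrete, the following is a minimal NumPy sketch of what non-equidistant sampling and x0 clipping could look like. It is not the authors' implementation: the abstract does not specify the schedule, so the power-law spacing, the `gamma` parameter, the 1000-step training horizon, the clipping bound of 1.0, and all function names are assumptions chosen for illustration.

```python
import numpy as np


def equidistant_timesteps(num_steps, num_train_timesteps=1000):
    """Baseline: evenly spaced sampling timesteps, from high noise to low noise."""
    step = num_train_timesteps // num_steps
    return np.arange(num_train_timesteps - 1, -1, -step)[:num_steps]


def non_equidistant_timesteps(num_steps, num_train_timesteps=1000, gamma=2.0):
    """Hypothetical non-equidistant schedule (assumed power-law spacing).

    With gamma > 1 the later steps are packed closer together at low noise
    levels; gamma = 1 recovers (approximately) uniform spacing.
    """
    u = np.linspace(1.0, 0.0, num_steps, endpoint=False)
    return np.round((num_train_timesteps - 1) * u ** gamma).astype(int)


def clip_x0(x0_pred, bound=1.0):
    """Clip the predicted clean sample to [-bound, bound].

    The bound is an assumption: 1.0 suits data normalized to [-1, 1];
    a latent-space model would need a bound matched to its latent range.
    """
    return np.clip(x0_pred, -bound, bound)


if __name__ == "__main__":
    print(equidistant_timesteps(4))      # evenly spaced baseline
    print(non_equidistant_timesteps(4))  # denser toward low noise
    print(clip_x0(np.array([-1.7, 0.3, 2.4])))
```

In a sampler, a schedule like this would replace the uniform timestep list, and `clip_x0` would be applied to each intermediate clean-sample prediction before noise is re-added for the next step. Which spacing and bound work best is an empirical question that this sketch does not answer.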
