ETC: training-free diffusion models acceleration with Error-aware Trend Consistency
Jiajian Xie, Hubery Yin, Chen Li, Zhou Zhao, Shengyu Zhang
TL;DR
Diffusion models deliver high-quality generation but suffer from slow, multi-step sampling. The paper proposes ETC, a training-free diffusion acceleration framework that enforces trajectory consistency via a Consistent Trend Predictor and calibrates error tolerance with a model-specific threshold search. By aggregating historical denoise trends into future direction estimates and adaptively expanding the approximation window within model tolerance, ETC achieves a $2.65$ speedup over FLUX with negligible SSIM degradation of $0.074$. It demonstrates strong gains across image, video, and audio tasks, outperforming state-of-the-art training-free baselines in both speed and consistency. A noted limitation is the conservative cap on per-iteration steps, with future work aimed at dynamic estimation of the maximum feasible approximations.
Abstract
Diffusion models have achieved remarkable generative quality but remain bottlenecked by costly iterative sampling. Recent training-free methods accelerate diffusion process by reusing model outputs. However, these methods ignore denoising trends and lack error control for model-specific tolerance, leading to trajectory deviations under multi-step reuse and exacerbating inconsistencies in the generated results. To address these issues, we introduce Error-aware Trend Consistency (ETC), a framework that (1) introduces a consistent trend predictor that leverages the smooth continuity of diffusion trajectories, projecting historical denoising patterns into stable future directions and progressively distributing them across multiple approximation steps to achieve acceleration without deviating; (2) proposes a model-specific error tolerance search mechanism that derives corrective thresholds by identifying transition points from volatile semantic planning to stable quality refinement. Experiments show that ETC achieves a 2.65x acceleration over FLUX with negligible (-0.074 SSIM score) degradation of consistency.
