Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion
Songwei Liu, Chao Zeng, Chenqian Yan, Xurui Peng, Xing Wang, Fangmin Chen, Xing Mei
TL;DR
This work addresses the problem that post-training quantization (PTQ) introduces cumulative errors in diffusion models during iterative denoising, degrading image fidelity. It develops a theoretical framework linking per-step quantization errors, cumulative error, and their propagation, deriving a closed-form solution for cumulative error under a DDIM-like sampler. Building on this, the authors propose TCEC, a timestep-aware online compensation scheme that reconstructs per-step errors via a learned channel-wise scaling (K_t) and applies a final correction term during generation, with computationally light approximations (m=1) to keep overhead minimal. Empirically, TCEC improves SOTA PTQ methods (e.g., SVDQuant) across SDXL, SDXL-Turbo, and PixArt backbones, achieving better PSNR and perceptual metrics while reducing memory footprint by ~3.5× and incurring less than 0.5% end-to-end latency, thereby enabling high-quality, low-bit quantized diffusion generation on commodity hardware.
Abstract
Diffusion models have transformed image synthesis by establishing unprecedented quality and creativity benchmarks. Nevertheless, their large-scale deployment faces challenges due to computationally intensive iterative denoising processes. Although post-training quantization (PTQ) provides an effective pathway for accelerating sampling, the iterative nature of diffusion models causes stepwise quantization errors to accumulate progressively during generation, inevitably compromising output fidelity. To address this challenge, we develop a theoretical framework that mathematically formulates error propagation in Diffusion Models (DMs), deriving per-step quantization error propagation equations and establishing the first closed-form solution for cumulative error. Building on this theoretical foundation, we propose a timestep-aware cumulative error compensation scheme. Extensive experiments on multiple image datasets demonstrate that our compensation strategy effectively mitigates error propagation, significantly enhancing existing PTQ methods. Specifically, it achieves a 1.2 PSNR improvement over SVDQuant on SDXL W4A4, while incurring only an additional $<$ 0.5\% time overhead.
