FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers

Ruichen Chen; Keith G. Mills; Di Niu

FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers

Ruichen Chen, Keith G. Mills, Di Niu

TL;DR

FP4DiT tackles the practical deployment of diffusion transformers by applying floating-point post-training quantization (FPQ) to DiTs, including PixArt and Hunyuan, achieving W4A6 while outperforming INT PTQ baselines on CLIP, ImageReward and HPSv2. The method combines optimized FP formats within DiT blocks, scale-aware AdaRound for FP weight calibration, and token-wise online activation quantization to handle patch-level activation dynamics. Empirical results on MS-COCO and HPSv2 across multiple DiT backbones demonstrate superior quantitative and human-preference performance with minimal hardware overhead. This work suggests FPQ as a promising direction for efficient, high-quality diffusion-based image synthesis on edge devices.

Abstract

Diffusion Models (DM) have revolutionized the text-to-image visual generation process. However, the large computational cost and model footprint of DMs hinders practical deployment, especially on edge devices. Post-training quantization (PTQ) is a lightweight method to alleviate these burdens without the need for training or fine-tuning. While recent DM PTQ methods achieve W4A8 \blue{(i.e., 4-bit weights and 8-bit activations)} on integer-based PTQ, two key limitations remain: First, while most existing DM PTQ methods evaluate on classical DMs like Stable Diffusion XL, 1.5 or earlier, which use convolutional U-Nets, newer Diffusion Transformer (DiT) models like the PixArt series, Hunyuan and others adopt fundamentally different transformer backbones to achieve superior image synthesis. Second, integer (INT) quantization is prevailing in DM PTQ but does not align well with the network weight and activation distribution, while Floating-Point Quantization (FPQ) is still under-investigated, yet it holds the potential to better align the weight and activation distributions in low-bit settings for DiT. In this paper, we introduce FP4DiT, a PTQ method that leverages FPQ to achieve W4A6 quantization. Specifically, we extend and generalize the Adaptive Rounding PTQ technique to adequately calibrate weight quantization for FPQ and demonstrate that DiT activations depend on input patch data, necessitating robust online activation quantization techniques. Experimental results demonstrate that FP4DiT achieves higher CLIP, ImageReward and HPSv2 performance compared to integer-based PTQ at the W4A6 and W4A8 precision levels while generating convincing visual content on PixArt-$α$, PixArt-$Σ$ and Hunyuan.

FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers

TL;DR

Abstract

FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (15)

Theorems & Definitions (4)