How to Parameterize Asymmetric Quantization Ranges for Quantization-Aware Training
Jaeseong You, Minseop Park, Kyunggeun Lee, Seokjun An, Chirag Patel, Markus Nage
TL;DR
The paper analyzes three asymmetric QAT parameterizations—scale/offset, min/max, and beta/gamma—through controlled toy experiments and real-world large language model quantization to understand how learnable ranges respond to bit width and learning rate. It finds that scale/offset is prone to instability and poor convergence, especially under challenging bit widths, whereas min/max demonstrates greater robustness but slower convergence; beta/gamma enables fast, per-channel range learning and, when used without a sigmoid, often yields faster training and lower loss. The authors conclude with practical guidance: use min/max with appropriately scaled learning rates, and employ sigmoid-free beta/gamma for robust, rapid QAT, while noting beta/gamma can dynamically adapt to true min/max values and offer per-channel flexibility. They also discuss potential future directions, including distribution-aware and mixed-parameterization QAT to further optimize quantization for diverse weights and activations in large models.
Abstract
This paper investigates three different parameterizations of asymmetric uniform quantization for quantization-aware training: (1) scale and offset, (2) minimum and maximum, and (3) beta and gamma. We perform a comprehensive comparative analysis of these parameterizations' influence on quantization-aware training, using both controlled experiments and real-world large language models. Our particular focus is on their changing behavior in response to critical training hyperparameters, bit width and learning rate. Based on our investigation, we propose best practices to stabilize and accelerate quantization-aware training with learnable asymmetric quantization ranges.
