Oscillations Make Neural Networks Robust to Quantization
Jonathan Wenshøj, Bob Pepin, Raghavendra Selvan
TL;DR
The paper reframes weight oscillations during QAT from a detrimental side-effect to a core mechanism enabling quantization robustness. Through a univariate toy model, it reveals an oscillation-driving gradient tied to pushing weights toward quantization thresholds and introduces an oscillation-inducing regularizer that replicates this behavior in neural networks. Empirical results on CIFAR-10 and Tiny ImageNet with ResNet-18 and Tiny ViT show that oscillations induced during training can recover QAT-level accuracy at 3–4 bits and offer strong cross-bit robustness, sometimes outperforming QAT in unseen bit-widths. These findings provide a deeper understanding of QAT dynamics and suggest oscillations as a constructive tool for efficient, robust quantization across varying bit widths and quantizers.
Abstract
We challenge the prevailing view that weight oscillations observed during Quantization Aware Training (QAT) are merely undesirable side-effects and argue instead that they are an essential part of QAT. We show in a univariate linear model that QAT results in an additional loss term that causes oscillations by pushing weights away from their nearest quantization level. Based on the mechanism from the analysis, we then derive a regularizer that induces oscillations in the weights of neural networks during training. Our empirical results on ResNet-18 and Tiny Vision Transformer, evaluated on CIFAR-10 and Tiny ImageNet datasets, demonstrate across a range of quantization levels that training with oscillations followed by post-training quantization (PTQ) is sufficient to recover the performance of QAT in most cases. With this work we provide further insight into the dynamics of QAT and contribute a novel insight into explaining the role of oscillations in QAT which until now have been considered to have a primarily negative effect on quantization.
