Tilt and Average : Geometric Adjustment of the Last Layer for Recalibration
Gyusang Cho, Chan-Hyun Youn
TL;DR
This work tackles neural network miscalibration by proposing Tilt and Average (Tna), a geometric recalibration method that operates on the last-layer weights rather than calibration maps. It generates tilted class vectors through an $n$-dimensional rotation built from Givens rotations, controlled by a mean Rotation over Classes $\text{mRC}$, and then averages multiple tilted weights to maintain accuracy. The approach improves calibration (ECE and AdaECE) while preserving nearly unchanged accuracy and can complement traditional calibration maps; it is demonstrated across CIFAR and ImageNet with multiple architectures, and code is released for reproducibility. Overall, Tna offers a data-efficient, plug-in recalibration option that expands the space of post-hoc calibration techniques by leveraging angular geometry in the final layer.
Abstract
After the revelation that neural networks tend to produce overconfident predictions, the problem of calibration, which aims to align confidence with accuracy to enhance the reliability of predictions, has gained significant importance. Several solutions based on calibration maps have been proposed to address the problem of recalibrating a trained classifier using additional datasets. In this paper, we offer an algorithm that transforms the weights of the last layer of the classifier, distinct from the calibration-map-based approach. We concentrate on the geometry of the final linear layer, specifically its angular aspect, and adjust the weights of the corresponding layer. We name the method Tilt and Average(\textsc{Tna}), and validate the calibration effect empirically and theoretically. Through this, we demonstrate that our approach, in addition to the existing calibration-map-based techniques, can yield improved calibration performance. Code available : https://github.com/GYYYYYUUUUU/TNA_Angular_Scaling.
