Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration
Jiani Ni, He Zhao, Jintong Gao, Dandan Guo, Hongyuan Zha
TL;DR
BalCAL rethinks calibration by balancing a standard learnable classifier with a fixed Simplex ETF classifier derived from Neural Collapse. A confidence-tunable module and a dynamic adjustment mechanism regulate the ETF’s influence, allowing the model to combat both overconfidence and underconfidence without sacrificing accuracy. The approach demonstrates superior calibration (lower ECE/AECE) and improved robustness under distribution shifts and OOD scenarios across CIFAR-10/100, SVHN, and Tiny-ImageNet, with consistent gains when integrated with existing calibration methods. By exploiting the ETF’s scaling properties and a fusion of outputs, BalCAL provides a flexible, deployable calibration regularizer that generalizes across architectures and datasets.
Abstract
In recent years, deep neural networks (DNNs) have demonstrated state-of-the-art performance across various domains. However, despite their success, they often face calibration issues, particularly in safety-critical applications such as autonomous driving and healthcare, where unreliable predictions can have serious consequences. Recent research has started to improve model calibration from the view of the classifier. However, the exploration of designing the classifier to solve the model calibration problem is insufficient. Let alone most of the existing methods ignore the calibration errors arising from underconfidence. In this work, we propose a novel method by balancing learnable and ETF classifiers to solve the overconfidence or underconfidence problem for model Calibration named BalCAL. By introducing a confidence-tunable module and a dynamic adjustment method, we ensure better alignment between model confidence and its true accuracy. Extensive experimental validation shows that ours significantly improves model calibration performance while maintaining high predictive accuracy, outperforming existing techniques. This provides a novel solution to the calibration challenges commonly encountered in deep learning.
