Parametric $ρ$-Norm Scaling Calibration
Siyuan Zhang, Linbo Xie
TL;DR
The paper tackles unreliable uncertainty calibration in modern high-capacity models by proposing a post-hoc calibrator based on Parametric $\rho$-Norm Scaling, which regulates output magnitude to mitigate overconfidence without sacrificing accuracy. It introduces a multi-level optimization objective that combines bin-level Square Calibration Error with an instance-level KL-divergence regularization to preserve distributional properties of the pre-calibration outputs. The key contributions include (1) a new $\rho$-Norm Scaling calibration model with theoretical properties like decision invariance, (2) a joint bin- and instance-level objective for calibrator optimization, and (3) extensive empirical validation across multiple datasets showing state-of-the-art calibration performance in post-processing settings. This approach yields more reliable confidence estimates while maintaining classifier performance, with practical impact for deploying calibrated models in real-world decision-making tasks. Mathematical formulations such as $g_c(z) = \frac{e^{r_c}}{\sum_j e^{r_j}}$ and $r_j(z) = \frac{z_j}{\gamma \|z\|_\rho + \beta}$ are central to controlling output magnitude and shaping the calibrated distribution.
Abstract
Output uncertainty indicates whether the probabilistic properties reflect objective characteristics of the model output. Unlike most loss functions and metrics in machine learning, uncertainty pertains to individual samples, but validating it on individual samples is unfeasible. When validated collectively, it cannot fully represent individual sample properties, posing a challenge in calibrating model confidence in a limited data set. Hence, it is crucial to consider confidence calibration characteristics. To counter the adverse effects of the gradual amplification of the classifier output amplitude in supervised learning, we introduce a post-processing parametric calibration method, $ρ$-Norm Scaling, which expands the calibrator expression and mitigates overconfidence due to excessive amplitude while preserving accuracy. Moreover, bin-level objective-based calibrator optimization often results in the loss of significant instance-level information. Therefore, we include probability distribution regularization, which incorporates specific priori information that the instance-level uncertainty distribution after calibration should resemble the distribution before calibration. Experimental results demonstrate the substantial enhancement in the post-processing calibrator for uncertainty calibration with our proposed method.
