Distilling Calibration via Conformalized Credal Inference
Jiayi Huang, Sangwoo Park, Nicola Paoletti, Osvaldo Simeone
TL;DR
This work tackles reliable decision-making for edge AI under tight resource constraints by distilling calibration information from a high-capacity cloud model. It introduces Conformalized Distillation for Credal Inference (CD-CI), which forms credal sets $\Gamma(x)$ around a small-edge predictor using an offline divergence threshold to guarantee, with probability $1-\epsilon$, that the cloud model’s predictive distribution $p^*(\cdot|x)$ lies within $\Gamma(x)$. A single predictive distribution is then obtained from the credal set via an intersection-probability construction, offering a robust alternative to standard low-complexity Bayesian post-processing and achieving improved ECE with negligible accuracy loss, demonstrated on CIFAR-10 and SNLI. The method leverages conformal prediction and imprecise probabilities to deliver reliable edge predictions and has practical implications for edge deployments where computational budgets prevent full Bayesian ensembling. Overall, CD-CI provides a scalable calibration mechanism that aligns edge-model outputs with cloud-model reliability guarantees while maintaining efficiency for real-world applications.
Abstract
Deploying artificial intelligence (AI) models on edge devices involves a delicate balance between meeting stringent complexity constraints, such as limited memory and energy resources, and ensuring reliable performance in sensitive decision-making tasks. One way to enhance reliability is through uncertainty quantification via Bayesian inference. This approach, however, typically necessitates maintaining and running multiple models in an ensemble, which may exceed the computational limits of edge devices. This paper introduces a low-complexity methodology to address this challenge by distilling calibration information from a more complex model. In an offline phase, predictive probabilities generated by a high-complexity cloud-based model are leveraged to determine a threshold based on the typical divergence between the cloud and edge models. At run time, this threshold is used to construct credal sets -- ranges of predictive probabilities that are guaranteed, with a user-selected confidence level, to include the predictions of the cloud model. The credal sets are obtained through thresholding of a divergence measure in the simplex of predictive probabilities. Experiments on visual and language tasks demonstrate that the proposed approach, termed Conformalized Distillation for Credal Inference (CD-CI), significantly improves calibration performance compared to low-complexity Bayesian methods, such as Laplace approximation, making it a practical and efficient solution for edge AI deployments.
