Sparse Activations as Conformal Predictors

Margarida M. Campos; João Calém; Sophia Sklaviadis; Mário A. T. Figueiredo; André F. T. Martins

TL;DR

Conformal prediction provides distribution-free uncertainty sets for predictions. This paper establishes a formal link between conformal prediction and sparse activation functions, specifically the $\gamma$-entmax family, by introducing non-conformity scores whose calibration corresponds to temperature scaling. At test time, the prediction sets coincide with the nonzero support of the $\gamma$-entmax outputs and so inherit the coverage guarantees of conformal prediction. Empirical results on vision and text benchmarks show competitive coverage, efficiency, and adaptiveness compared to standard softmax-based conformal predictors. This approach enables sparse, interpretable set predictions with theoretical guarantees and flexible calibration.

Abstract

Conformal prediction is a distribution-free framework for uncertainty quantification that replaces point predictions with sets, offering marginal coverage guarantees (i.e., ensuring that the prediction sets contain the true label with a specified probability, in expectation). In this paper, we uncover a novel connection between conformal prediction and sparse softmax-like transformations, such as sparsemax and $\gamma$-entmax (with $\gamma > 1$), which may assign nonzero probability only to a subset of labels. We introduce new non-conformity scores for classification that make the calibration process correspond to the widely used temperature scaling method. At test time, applying these sparse transformations with the calibrated temperature leads to a support set (i.e., the set of labels with nonzero probability) that automatically inherits the coverage guarantees of conformal prediction. Through experiments on computer vision and text classification benchmarks, we demonstrate that the proposed method achieves competitive results in terms of coverage, efficiency, and adaptiveness compared to standard non-conformity scores based on softmax.
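
To make the idea of a support set used as a prediction set concrete, here is a minimal Python sketch for the sparsemax case ($\gamma = 2$) only. The abstract does not spell out the non-conformity score, so the `gap_score` function below is an assumption: it is one score for which membership in the sparsemax support reduces to a threshold test, and it may differ from the scores defined in the paper. The names `sparsemax`, `gap_score`, `calibrate_beta`, and `prediction_set` are illustrative, and the quantile rule is the standard split-conformal one.

```python
import numpy as np


def sparsemax(z):
    """Euclidean projection of the logits z onto the probability simplex."""
    z = np.asarray(z, dtype=float)
    z_sorted = np.sort(z)[::-1]                 # logits in decreasing order
    cumsum = np.cumsum(z_sorted)
    k = np.arange(1, z.size + 1)
    in_support = 1.0 + k * z_sorted > cumsum    # standard sparsemax support test
    k_max = k[in_support][-1]                   # size of the support
    tau = (cumsum[k_max - 1] - 1.0) / k_max     # threshold subtracted from z
    return np.maximum(z - tau, 0.0)


def gap_score(z, y):
    """Assumed score: total gap between label y and every higher-scoring label."""
    z = np.asarray(z, dtype=float)
    return float(np.sum(np.maximum(z - z[y], 0.0)))


def calibrate_beta(cal_logits, cal_labels, alpha):
    """Split-conformal quantile of the scores, returned as an inverse temperature."""
    scores = np.sort([gap_score(z, y) for z, y in zip(cal_logits, cal_labels)])
    n = scores.size
    rank = int(np.ceil((n + 1) * (1.0 - alpha)))
    if rank > n:
        raise ValueError("calibration set too small for this alpha")
    q_hat = scores[rank - 1]
    if q_hat <= 0.0:
        raise ValueError("degenerate quantile: every calibration score is zero")
    return 1.0 / q_hat


def prediction_set(z, beta):
    """Labels that receive nonzero probability under sparsemax(beta * z)."""
    return np.flatnonzero(sparsemax(beta * np.asarray(z, dtype=float)) > 0).tolist()


# Lowering beta (raising the temperature) enlarges the support.
z = [2.0, 1.0, 0.0]
print(prediction_set(z, beta=1.0))   # [0]
print(prediction_set(z, beta=0.5))   # [0, 1]
```

In this sketch a label `y` lies in the support of `sparsemax(beta * z)` exactly when `beta * gap_score(z, y) < 1`, so choosing `beta` from a quantile of the calibration scores is a form of temperature scaling. The support uses a strict inequality, so a label whose score equals the calibrated threshold exactly is left out; with continuous logits such ties are not expected to matter.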
