Conformalized Credal Set Predictors
Alireza Javanmardi, David Stutz, Eyke Hüllermeier
TL;DR
This work introduces conformal credal set predictors, combining credal-set representations of uncertainty with conformal prediction to obtain validity guarantees for classification under first-order supervision. It presents two learning paradigms—a first-order probabilistic predictor and a second-order (Dirichlet) predictor—both calibrated via nonconformity scores to produce credal sets that contain the true distribution with high probability, even in the presence of label noise. The approach is evaluated on ChaosNLI with multiple human annotations and synthetic data, demonstrating valid coverage across miscoverage levels and illustrating how nonconformity choices influence efficiency. The framework offers a principled, uncertainty-aware extension to learning under ambiguity, with practical applicability to tasks where multiple interpretations or annotations per instance are available.
Abstract
Credal sets are sets of probability distributions that are considered as candidates for an imprecisely known ground-truth distribution. In machine learning, they have recently attracted attention as an appealing formalism for uncertainty representation, in particular due to their ability to represent both the aleatoric and epistemic uncertainty in a prediction. However, the design of methods for learning credal set predictors remains a challenging problem. In this paper, we make use of conformal prediction for this purpose. More specifically, we propose a method for predicting credal sets in the classification task, given training data labeled by probability distributions. Since our method inherits the coverage guarantees of conformal prediction, our conformal credal sets are guaranteed to be valid with high probability (without any assumptions on model or distribution). We demonstrate the applicability of our method to natural language inference, a highly ambiguous natural language task where it is common to obtain multiple annotations per example.
