From Conformal Predictions to Confidence Regions
Charles Guille-Escuret, Eugene Ndiaye
TL;DR
CCR extends conformal prediction to the parameter space by aggregating finite-sample CP intervals for noise-free outputs, yielding finite-sample valid confidence regions for model parameters under minimal noise assumptions. It constructs Θ_k via aggregation over individual θ-sets Θ(X_i) with voting-based constraints, and provides bounds under fully black-box, randomized Markov, and split-conformal regimes, including a MILP-compatible formulation for linear models. The method handles heteroskedastic and non-Gaussian noise, offers PAC-type guarantees, and enables practical downstream tasks such as robust optimization and regression abstention. Empirically, CCR demonstrates competitive coverage and tighter coordinate intervals relative to existing conformal-based approaches, while maintaining finite-sample validity and enabling hypothesis testing of linearity.
Abstract
Conformal prediction methodologies have significantly advanced the quantification of uncertainties in predictive models. Yet, the construction of confidence regions for model parameters presents a notable challenge, often necessitating stringent assumptions regarding data distribution or merely providing asymptotic guarantees. We introduce a novel approach termed CCR, which employs a combination of conformal prediction intervals for the model outputs to establish confidence regions for model parameters. We present coverage guarantees under minimal assumptions on noise and that is valid in finite sample regime. Our approach is applicable to both split conformal predictions and black-box methodologies including full or cross-conformal approaches. In the specific case of linear models, the derived confidence region manifests as the feasible set of a Mixed-Integer Linear Program (MILP), facilitating the deduction of confidence intervals for individual parameters and enabling robust optimization. We empirically compare CCR to recent advancements in challenging settings such as with heteroskedastic and non-Gaussian noise.
