Conformal Prediction Adaptive to Unknown Subpopulation Shifts
Nien-Shao Wang, Duygu Nur Yaldiz, Yavuz Faruk Bakman, Sai Praneeth Karimireddy
TL;DR
This work extends conformal prediction to unknown subpopulation shifts by leveraging (i) domain classifiers to weight calibration data, and (ii) domain-agnostic strategies based on embedding similarities and conformal risk control. The authors establish theoretical guarantees for marginal coverage under various relaxations of domain knowledge, including Bayes-optimal, multicalibrated, and multiaccurate classifiers, and demonstrate scalability to high-dimensional vision and language tasks. Empirical results on diverse benchmarks show that the proposed methods maintain coverage across numerous test environments, outperforming standard conformal prediction which can fail under shifts. The practical impact lies in robust uncertainty quantification for real-world AI systems, including reliable LLM hallucination detection under distribution changes and improved risk management in high-stakes applications.
Abstract
Conformal prediction is widely used to equip black-box machine learning models with uncertainty quantification, offering formal coverage guarantees under exchangeable data. However, these guarantees fail when faced with subpopulation shifts, where the test environment contains a different mix of subpopulations than the calibration data. In this work, we focus on unknown subpopulation shifts where we are not given group-information i.e. the subpopulation labels of datapoints have to be inferred. We propose new methods that provably adapt conformal prediction to such shifts, ensuring valid coverage without explicit knowledge of subpopulation structure. While existing methods in similar setups assume perfect subpopulation labels, our framework explicitly relaxes this requirement and characterizes conditions where formal coverage guarantees remain feasible. Further, our algorithms scale to high-dimensional settings and remain practical in realistic machine learning tasks. Extensive experiments on vision (with vision transformers) and language (with large language models) benchmarks demonstrate that our methods reliably maintain coverage and effectively control risks in scenarios where standard conformal prediction fails.
