Counterfactually Fair Conformal Prediction
Ozgur Guldogan, Neeraj Sarna, Yuanyuan Li, Michael Berger
TL;DR
This work addresses the challenge of ensuring individual-level counterfactual fairness for prediction sets, not just point predictions. It introduces Counterfactually Fair Conformal Prediction (CF-CP), a training-free procedure that symmetrizes conformity scores across protected-attribute interventions and feeds them into split conformal prediction to yield counterfactually fair sets with guaranteed marginal coverage. The authors prove that, under an invertible structural causal model and exchangeability, CF-CP achieves set-level counterfactual fairness while preserving the CP coverage guarantee, and they demonstrate empirically that CF-CP reduces counterfactual set disparity with only a modest increase in average set size across regression and classification tasks on synthetic and real data (Law School, Bios). This approach provides a simple, practical uncertainty quantification method that enforces individual-level fairness without retraining or reliance on the protected-attribute distribution, with strong potential for fair decision-making under uncertainty.
Abstract
While counterfactual fairness of point predictors is well studied, its extension to prediction sets--central to fair decision-making under uncertainty--remains underexplored. On the other hand, conformal prediction (CP) provides efficient, distribution-free, finite-sample valid prediction sets, yet does not ensure counterfactual fairness. We close this gap by developing Counterfactually Fair Conformal Prediction (CF-CP) that produces counterfactually fair prediction sets. Through symmetrization of conformity scores across protected-attribute interventions, we prove that CF-CP results in counterfactually fair prediction sets while maintaining the marginal coverage property. Furthermore, we empirically demonstrate that on both synthetic and real datasets, across regression and classification tasks, CF-CP achieves the desired counterfactual fairness and meets the target coverage rate with minimal increase in prediction set size. CF-CP offers a simple, training-free route to counterfactually fair uncertainty quantification.
