Conformal Prediction via Regression-as-Classification
Etash Guha, Shlok Natarajan, Thomas Möllenhoff, Mohammad Emtiyaz Khan, Eugene Ndiaye
TL;DR
This work extends conformal prediction to challenging regression settings by reframing regression as classification through binning of the output into $K$ bins and applying CP for classification. A novel ordinal-aware loss with an entropy regularizer encourages probability mass on neighboring bins while avoiding overconfidence, yielding a linearly interpolated density $\bar{q}_\theta(y|x)$ used to form regression CP sets. The approach, named Regression-to-Classification CP (R2CCP), maintains finite-sample coverage and often produces shorter prediction intervals than diverse CP baselines, particularly in heteroscedastic and bimodal scenarios. Empirical results on synthetic and real datasets demonstrate robust interval efficiency and adaptability to complex label distributions, with practical code available via "r2ccp".
Abstract
Conformal prediction (CP) for regression can be challenging, especially when the output distribution is heteroscedastic, multimodal, or skewed. Some of the issues can be addressed by estimating a distribution over the output, but in reality, such approaches can be sensitive to estimation error and yield unstable intervals.~Here, we circumvent the challenges by converting regression to a classification problem and then use CP for classification to obtain CP sets for regression.~To preserve the ordering of the continuous-output space, we design a new loss function and make necessary modifications to the CP classification techniques.~Empirical results on many benchmarks shows that this simple approach gives surprisingly good results on many practical problems.
