Conformal Prediction Under Generalized Covariate Shift with Posterior Drift
Baozhen Wang, Xingye Qiao
TL;DR
This work develops a weighted conformal prediction framework for multiclass classification under generalized covariate shift with posterior drift (g-CSPD). By leveraging both source and target labeled data, the proposed WCC-CSPD method estimates class posteriors from the source and calibrates thresholds using a combined calibration set, with weights computed via Radon-Nikodym derivatives and accelerated by Newton's identities. The authors establish a target-domain coverage guarantee $\mathbb{Q}(Y_T\in\hat{C}(X_T)) \ge 1-\alpha$ under CSPD or g-CSPD, and prove a convergence rate for the calibration error that improves with larger calibration samples that include target data. Empirical results on simulations and semi-synthetic maternal health data demonstrate robust target-coverage performance under distribution shifts, outperforming standard weighted conformal prediction and split conformal baselines while maintaining efficient prediction-set lengths. Overall, the paper provides a practical, theoretically-justified approach for reliable uncertainty quantification under transfer learning with covariate and posterior drift.
Abstract
In many real applications of statistical learning, collecting sufficiently many training data is often expensive, time-consuming, or even unrealistic. In this case, a transfer learning approach, which aims to leverage knowledge from a related source domain to improve the learning performance in the target domain, is more beneficial. There have been many transfer learning methods developed under various distributional assumptions. In this article, we study a particular type of classification problem, called conformal prediction, under a new distributional assumption for transfer learning. Classifiers under the conformal prediction framework predict a set of plausible labels instead of one single label for each data instance, affording a more cautious and safer decision. We consider a generalization of the \textit{covariate shift with posterior drift} setting for transfer learning. Under this setting, we propose a weighted conformal classifier that leverages both the source and target samples, with a coverage guarantee in the target domain. Theoretical studies demonstrate favorable asymptotic properties. Numerical studies further illustrate the usefulness of the proposed method.
