Correcting the Coverage Bias of Quantile Regression
Isaac Gibbs, John J. Cherian, Emmanuel J. Candès
TL;DR
This work tackles the problem that quantile regression can fail to achieve target coverage in high-dimensional regimes where the ratio $d/n$ remains positive. It introduces three deterministic debiasing methods—level adjustment, additive adjustment, and fixed dual thresholding—that calibrate predictions to achieve asymptotically exact coverage, leveraging a novel link between leave-one-out coverage and the quantile regression dual to enable efficient computation. Theoretical results establish high-dimensional consistency and LOOCov-duality, while empirical studies on simulated data and real datasets demonstrate robust coverage and practical tradeoffs between interval length and computation. The methods offer model-agnostic calibration for quantile predictions with potential impact on risk assessment and tail-bound analyses in high-dimensional settings.
Abstract
We develop a collection of methods for adjusting the predictions of quantile regression to ensure coverage. Our methods are model agnostic and can be used to correct for high-dimensional overfitting bias with only minimal assumptions. Theoretical results show that the estimates we develop are consistent and facilitate accurate calibration in the proportional asymptotic regime where the ratio of the dimension of the data and the sample size converges to a constant. This is further confirmed by experiments on both simulated and real data. A key component of our work is a new connection between the leave-one-out coverage and the fitted values of variables appearing in a dual formulation of the quantile regression problem. This facilitates the use of cross-validation in a variety of settings at significantly reduced computational costs.
