Guaranteed Coverage Prediction Intervals with Gaussian Process Regression
Harris Papadopoulos
TL;DR
This work tackles the problem of miscalibrated uncertainty in Gaussian Process Regression under model misspecification. It introduces GPR-CP, a transductive conformal predictor that uses a normalized nonconformity measure based on predictive variance to guarantee finite-sample coverage at level $1-\\delta$ under exchangeability, while preserving GPR's strengths. Empirical results on synthetic data and UCI benchmarks across several covariance functions show well-calibrated prediction intervals with competitive, often tighter widths compared to standard GPR and existing conformal or recalibration methods. The approach delivers robust uncertainty quantification for regression in settings with limited or misspecified models and points to future work on sparse GP implementations and further normalization schemes.
Abstract
Gaussian Process Regression (GPR) is a popular regression method, which unlike most Machine Learning techniques, provides estimates of uncertainty for its predictions. These uncertainty estimates however, are based on the assumption that the model is well-specified, an assumption that is violated in most practical applications, since the required knowledge is rarely available. As a result, the produced uncertainty estimates can become very misleading; for example the prediction intervals (PIs) produced for the 95% confidence level may cover much less than 95% of the true labels. To address this issue, this paper introduces an extension of GPR based on a Machine Learning framework called, Conformal Prediction (CP). This extension guarantees the production of PIs with the required coverage even when the model is completely misspecified. The proposed approach combines the advantages of GPR with the valid coverage guarantee of CP, while the performed experimental results demonstrate its superiority over existing methods.
