Weighted Leave-One-Out Cross Validation
Luc Pronzato, Maria-João Rendas
TL;DR
The paper addresses estimating the Integrated Squared Prediction Error $ISE(\eta_n)=\int_{\mathscr X} \varepsilon_n^2(\mathbf{x})\,\mu(d\mathbf{x})$ for a predictor that is linear in observed GP data, by introducing a weighted LOOCV approach based on the Best Linear Predictor of squared errors. By exploiting Gaussian process moments, the authors derive a BLFP estimator $\widehat{\mathsf{ISE}}_{BLP}(\eta_n)$ that weights squared LOOCV residuals to yield substantially more accurate ISE estimates than standard LOOCV, while also addressing covariate shift. The framework includes a BLUP-specific variant, a bias-corrected version, and a nugget-augmented extension for noisy observations; they also analyze independent and flat kernel limits and demonstrate robustness to kernel misspecification through extensive numerical experiments on environmental and piston models. The work provides a practical tool for reliable predictive performance assessment and model selection in GP-based computer experiments, with broad applicability to space-filling designs and GP-based predictors.
Abstract
We present a weighted version of Leave-One-Out (LOO) cross-validation for estimating the Integrated Squared Error (ISE) when approximating an unknown function by a predictor that depends linearly on evaluations of the function over a finite collection of sites. The method relies on the construction of the best linear estimator of the squared prediction error at an arbitrary unsampled site based on squared LOO residuals, assuming that the function is a realization of a Gaussian Process (GP). A theoretical analysis of performance of the ISE estimator is presented, and robustness with respect to the choice of the GP kernel is investigated first analytically, then through numerical examples. Overall, the estimation of ISE is significantly more precise than with classical, unweighted, LOO cross validation. Application to model selection is briefly considered through examples.
