Scale estimation and rate-unbiasedness for Gaussian processes under smoothness misspecification
Toni Karvonen, François Bachoc
TL;DR
This work analyzes Gaussian process regression under misspecification of smoothness, introducing rate-unbiasedness as a relaxed criterion for uncertainty quantification. It shows that, when observation locations are quasi-uniform and the model oversmooths, estimating the scale parameter via maximum likelihood or cross-validation yields MSE with the correct order of magnitude without optimising the smoothness. The key theoretical contribution is deriving the asymptotic growth for scale estimators, $\mathbb{E}[\hat{\sigma}_n^2] \asymp n^{2(\nu-\nu_0)/d}$, which combined with $\lVert \mathrm{MSE}_n \rVert_{L^p(D)} \asymp n^{-2\nu/d}$ and $\lVert \mathrm{MSE}_n^* \rVert_{L^p(D)} \asymp n^{-2\nu_0/d}$ yields rate-unbiasedness for $\nu\ge\nu_0$. The paper also extends results to periodic Sobolev kernels and provides extensive numerical experiments showing how smoothness can be inferred from scale-growth and when cross-validation is preferable to ML under misspecification. These findings offer practical guidance for scalable GP uncertainty quantification in misspecified settings, without expensive optimisation over the smoothness parameter.
Abstract
Gaussian process regression is used throughout statistics and machine learning for prediction and uncertainty quantification. A Gaussian process is specified by its mean and covariance functions. Many covariance functions, including Matérns, have a smoothness parameter that is notoriously difficult to specify correctly or estimate from the data. In practice, the smoothness parameter is often selected more or less arbitrarily. We introduce rate-unbiasedness, a relaxed notion of asymptotic optimality which requires that the expected ratio of the mean-square error presumed by a potentially misspecified model and the true, but unknown, mean-square error remain bounded away from zero and infinity as more data are obtained. A rate-unbiased model provides uncertainty quantification that is of correct order of magnitude. We then prove that scale estimation suffices for rate-unbiasedness in a variety of common settings. As estimation of the scale of a Gaussian process is routine and requires no optimisation, rate-unbiasedness can be achieved in many applications.
