Practical Global and Local Bounds in Gaussian Process Regression via Chaining
Junyi Liu, Stanley Kok
TL;DR
Addressing the lack of global uncertainty control in Gaussian Process Regression, the paper introduces a chaining-based framework to bound the expected global maximum without relying on input features or posterior estimates. It develops Talagrand-style chaining bounds on the kernel-induced metric, with kernel-specific refinements for RBF and Matérn that tighten constants and avoid relaxations. It also adds a locally adaptive uncertainty quantification method based on partition diameters to provide input-specific bounds without posterior variance scaling. Empirical results on synthetic and real datasets show tighter, more reliable bounds and favorable comparisons to state-of-the-art baselines, including improved Coverage Width-Based Criterion.
Abstract
Gaussian process regression (GPR) is a popular nonparametric Bayesian method that provides predictive uncertainty estimates and is widely used in safety-critical applications. While prior research has introduced various uncertainty bounds, most existing approaches require access to specific input features, and rely on posterior mean and variance estimates or the tuning of hyperparameters. These limitations hinder robustness and fail to capture the model's global behavior in expectation. To address these limitations, we propose a chaining-based framework for estimating upper and lower bounds on the expected extreme values over unseen data, without requiring access to specific input features. We provide kernel-specific refinements for commonly used kernels such as RBF and Matérn, in which our bounds are tighter than generic constructions. We further improve numerical tightness by avoiding analytical relaxations. In addition to global estimation, we also develop a novel method for local uncertainty quantification at specified inputs. This approach leverages chaining geometry through partition diameters, adapting to local structures without relying on posterior variance scaling. Our experimental results validate the theoretical findings and demonstrate that our method outperforms existing approaches on both synthetic and real-world datasets.
