Adaptation using spatially distributed Gaussian Processes
Botond Szabo, Amine Hadji, Aad van der Vaart
TL;DR
The paper develops a theoretically grounded framework for spatially distributed Gaussian Process posteriors to enable scalable nonparametric regression while preserving posterior contraction properties. It proves rate-adaptive contraction for aggregated GP posteriors under mild conditions, using local priors based on rescaled integrated Brownian motion or Matérn processes and a fully Bayesian mechanism to learn length scales. A novel aggregation scheme improves continuity across regional boundaries and demonstrates strong empirical performance on synthetic and real data (including superconductivity), with substantial speedups. Overall, the work shows that spatially distributed GP methods can adapt to local regularities and potentially outperform standard GPs in accuracy and uncertainty quantification, while providing theoretical guarantees.
Abstract
We consider the accuracy of an approximate posterior distribution in nonparametric regression problems by combining posterior distributions computed on subsets of the data defined by the locations of the independent variables. We show that this approximate posterior retains the rate of recovery of the full data posterior distribution, where the rate of recovery adapts to the smoothness of the true regression function. As particular examples we consider Gaussian process priors based on integrated Brownian motion and the Matérn kernel augmented with a prior on the length scale. Besides theoretical guarantees we present a numerical study of the methods both on synthetic and real world data. We also propose a new aggregation technique, which numerically outperforms previous approaches. Finally, we demonstrate empirically that spatially distributed methods can adapt to local regularities, potentially outperforming the original Gaussian process.
