Taking the GP Out of the Loop

Mehul Bafna; Siddhant anand Jadhav; David Sweet

Taking the GP Out of the Loop

Mehul Bafna, Siddhant anand Jadhav, David Sweet

TL;DR

This paper tackles the scalability bottleneck of Bayesian optimization when many observations are cheap and plentiful, by replacing the Gaussian-process surrogate with Epistemic Nearest Neighbors (ENN). ENN provides both a mean predictor and an uncertainty estimate with linear-time fitting and querying, enabling the TuRBO-ENN framework to achieve $O(N)$ proposal time as opposed to the GP's $O(N^2)$–$O(N^3)$ scaling. The authors present noise-aware and noise-free variants, including a fitting-free acquisition for deterministic cases, and demonstrate up to 1–2 orders of magnitude speedups across diverse simulation tasks with up to $N=50{,}000$ observations while maintaining competitive solution quality. They further justify convergence under the Pseudo-Bayesian Optimization (PBO) framework, reinforcing the method's practical appeal for large-scale BO in engineering and simulation contexts.

Abstract

Bayesian optimization (BO) has traditionally solved black-box problems where function evaluation is expensive and, therefore, observations are few. Recently, however, there has been growing interest in applying BO to problems where function evaluation is cheaper and observations are more plentiful. In this regime, scaling to many observations $N$ is impeded by Gaussian-process (GP) surrogates: GP hyperparameter fitting scales as $\mathcal{O}(N^3)$ (reduced to roughly $\mathcal{O}(N^2)$ in modern implementations), and it is repeated at every BO iteration. Many methods improve scaling at acquisition time, but hyperparameter fitting still scales poorly, making it the bottleneck. We propose Epistemic Nearest Neighbors (ENN), a lightweight alternative to GPs that estimates function values and uncertainty (epistemic and aleatoric) from $K$-nearest-neighbor observations. ENN scales as $\mathcal{O}(N)$ for both fitting and acquisition. Our BO method, TuRBO-ENN, replaces the GP surrogate in TuRBO with ENN and its Thompson-sampling acquisition with $\mathrm{UCB} = μ(x) + σ(x)$. For the special case of noise-free problems, we can omit fitting altogether by replacing $\mathrm{UCB}$ with a non-dominated sort over $μ(x)$ and $σ(x)$. We show empirically that TuRBO-ENN reduces proposal time (i.e., fitting time + acquisition time) by one to two orders of magnitude compared to TuRBO at up to 50,000 observations.

Taking the GP Out of the Loop

TL;DR

proposal time as opposed to the GP's

–

scaling. The authors present noise-aware and noise-free variants, including a fitting-free acquisition for deterministic cases, and demonstrate up to 1–2 orders of magnitude speedups across diverse simulation tasks with up to

observations while maintaining competitive solution quality. They further justify convergence under the Pseudo-Bayesian Optimization (PBO) framework, reinforcing the method's practical appeal for large-scale BO in engineering and simulation contexts.

Abstract

is impeded by Gaussian-process (GP) surrogates: GP hyperparameter fitting scales as

(reduced to roughly

in modern implementations), and it is repeated at every BO iteration. Many methods improve scaling at acquisition time, but hyperparameter fitting still scales poorly, making it the bottleneck. We propose Epistemic Nearest Neighbors (ENN), a lightweight alternative to GPs that estimates function values and uncertainty (epistemic and aleatoric) from

-nearest-neighbor observations. ENN scales as

for both fitting and acquisition. Our BO method, TuRBO-ENN, replaces the GP surrogate in TuRBO with ENN and its Thompson-sampling acquisition with

. For the special case of noise-free problems, we can omit fitting altogether by replacing

with a non-dominated sort over

and

. We show empirically that TuRBO-ENN reduces proposal time (i.e., fitting time + acquisition time) by one to two orders of magnitude compared to TuRBO at up to 50,000 observations.

Taking the GP Out of the Loop

TL;DR

Abstract

Taking the GP Out of the Loop

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (15)