Pseudo-Bayesian Optimization
Haoxian Chen, Henry Lam
TL;DR
The paper addresses the gap between theory and practice in Bayesian optimization by proposing Pseudo-Bayesian Optimization (PseudoBO), an axiomatic framework that ensures convergence for exploration-based black-box optimization beyond Gaussian processes. It decomposes the algorithm into a surrogate predictor $\hat f$, an uncertainty quantifier $\hat\sigma$, and an acquisition function $g_n$, each with local consistency, SNEB, and the improvement property. The authors show that a simple recipe using local regression as SP, a randomized-prior-based UQ, and EI as AF achieves convergence guarantees and competitive performance across synthetic, hyperparameter-tuning, and robotics tasks. This work broadens BO theory beyond GP and provides a practical blueprint for designing convergent, high-performing optimization algorithms.
Abstract
Bayesian Optimization is a popular approach for optimizing expensive black-box functions. Its key idea is to use a surrogate model to approximate the objective and, importantly, quantify the associated uncertainty that allows a sequential search of query points that balance exploitation-exploration. Gaussian process (GP) has been a primary candidate for the surrogate model, thanks to its Bayesian-principled uncertainty quantification power and modeling flexibility. However, its challenges have also spurred an array of alternatives whose convergence properties could be more opaque. Motivated by these, we study in this paper an axiomatic framework that elicits the minimal requirements to guarantee black-box optimization convergence that could apply beyond GP-based methods. Moreover, we leverage the design freedom in our framework, which we call Pseudo-Bayesian Optimization, to construct empirically superior algorithms. In particular, we show how using simple local regression, and a suitable "randomized prior" construction to quantify uncertainty, not only guarantees convergence but also consistently outperforms state-of-the-art benchmarks in examples ranging from high-dimensional synthetic experiments to realistic hyperparameter tuning and robotic applications.
