Table of Contents
Fetching ...

Enhancing Gaussian Process Surrogates for Optimization and Posterior Approximation via Random Exploration

Hwanwoo Kim, Daniel Sanz-Alonso

TL;DR

This work addresses the efficiency gap in noise-free Bayesian optimization by introducing two random-exploration-enhanced algorithms, GP-UCB+ and EXPLOIT+, which improve surrogate accuracy and accelerate convergence while maintaining implementation simplicity. It also extends to Bayesian inference by using optimization iterates to build GP surrogates for unnormalized log-posterior densities, and derives rigorous bounds on the Hellinger distance between true and surrogate posteriors. Theoretical results establish near-optimal simple-regret rates for Matérn kernels and exponential-type bounds for squared-exponential kernels, alongside practical performance gains on high-dimensional benchmarks, hyperparameter tuning, and differential-equation inference. The proposed approach provides a versatile experimental-design framework that yields accurate local posterior behavior around modes without sacrificing global fidelity, with concrete benefits for problems with expensive or intractable likelihoods. Overall, the methods offer a principled, scalable path to faster optimization and reliable posterior approximation in complex scientific and engineering domains.

Abstract

This paper proposes novel noise-free Bayesian optimization strategies that rely on a random exploration step to enhance the accuracy of Gaussian process surrogate models. The new algorithms retain the ease of implementation of the classical GP-UCB algorithm, but the additional random exploration step accelerates their convergence, nearly achieving the optimal convergence rate. Furthermore, to facilitate Bayesian inference with an intractable likelihood, we propose to utilize optimization iterates for maximum a posteriori estimation to build a Gaussian process surrogate model for the unnormalized log-posterior density. We provide bounds for the Hellinger distance between the true and the approximate posterior distributions in terms of the number of design points. We demonstrate the effectiveness of our Bayesian optimization algorithms in non-convex benchmark objective functions, in a machine learning hyperparameter tuning problem, and in a black-box engineering design problem. The effectiveness of our posterior approximation approach is demonstrated in two Bayesian inference problems for parameters of dynamical systems.

Enhancing Gaussian Process Surrogates for Optimization and Posterior Approximation via Random Exploration

TL;DR

This work addresses the efficiency gap in noise-free Bayesian optimization by introducing two random-exploration-enhanced algorithms, GP-UCB+ and EXPLOIT+, which improve surrogate accuracy and accelerate convergence while maintaining implementation simplicity. It also extends to Bayesian inference by using optimization iterates to build GP surrogates for unnormalized log-posterior densities, and derives rigorous bounds on the Hellinger distance between true and surrogate posteriors. Theoretical results establish near-optimal simple-regret rates for Matérn kernels and exponential-type bounds for squared-exponential kernels, alongside practical performance gains on high-dimensional benchmarks, hyperparameter tuning, and differential-equation inference. The proposed approach provides a versatile experimental-design framework that yields accurate local posterior behavior around modes without sacrificing global fidelity, with concrete benefits for problems with expensive or intractable likelihoods. Overall, the methods offer a principled, scalable path to faster optimization and reliable posterior approximation in complex scientific and engineering domains.

Abstract

This paper proposes novel noise-free Bayesian optimization strategies that rely on a random exploration step to enhance the accuracy of Gaussian process surrogate models. The new algorithms retain the ease of implementation of the classical GP-UCB algorithm, but the additional random exploration step accelerates their convergence, nearly achieving the optimal convergence rate. Furthermore, to facilitate Bayesian inference with an intractable likelihood, we propose to utilize optimization iterates for maximum a posteriori estimation to build a Gaussian process surrogate model for the unnormalized log-posterior density. We provide bounds for the Hellinger distance between the true and the approximate posterior distributions in terms of the number of design points. We demonstrate the effectiveness of our Bayesian optimization algorithms in non-convex benchmark objective functions, in a machine learning hyperparameter tuning problem, and in a black-box engineering design problem. The effectiveness of our posterior approximation approach is demonstrated in two Bayesian inference problems for parameters of dynamical systems.
Paper Structure (31 sections, 4 theorems, 51 equations, 10 figures, 4 tables, 3 algorithms)

This paper contains 31 sections, 4 theorems, 51 equations, 10 figures, 4 tables, 3 algorithms.

Key Result

Theorem 3.1

Let $f \in \mathcal{H}_k(\mathcal{X})$, where $k$ is a squared exponential kernel. GP-UCB with noise-free observations and $\beta_t := \|f\|_{\mathcal{H}_k}^2$ satisfies the cumulative regret bound which yields the convergence rate of

Figures (10)

  • Figure 1: Simple regret vs number of noise-free observations.
  • Figure 2: Cumulative test error vs number of noise-free observations.
  • Figure 3: Cumulative test error averaged over 20 experiments.
  • Figure 4: Maximum attained value of the garden sprinkler objective function vs number of noise-free observations.
  • Figure 5: Maximum attained value of the garden sprinkler objective function averaged over 30 experiments.
  • ...and 5 more figures

Theorems & Definitions (22)

  • Theorem 3.1
  • Remark 3.2
  • Remark 3.3
  • Remark 3.4
  • Remark 4.1
  • Remark 4.2
  • Remark 4.3
  • Theorem 4.4
  • Remark 4.5
  • Remark 4.6
  • ...and 12 more