Improving sample efficiency of high dimensional Bayesian optimization with MCMC
Zeji Yi, Yunyue Wei, Chu Xin Cheng, Kaibo He, Yanan Sui
TL;DR
This paper addresses the inefficiency of Bayesian optimization in high dimensions by introducing MCMC-BO, a framework that uses Metropolis-Hastings or Langevin dynamics to sample from an approximated GP-TS posterior and concentrate search in promising regions. By tracking only a batch of $m$ points per round, the method decouples computational complexity from the discretization size and provides a regret bound that scales with the information gain without requiring large stored grids. Theoretical results establish stationary convergence properties for the approximate posterior and a regret bound that scales with dimension and kernel information gain, while experiments on high-dimensional synthetic functions and Mujoco tasks show superior performance over state-of-the-art high-dimensional BO baselines. The approach offers a flexible, memory-efficient, and theoretically grounded path to boost sample efficiency in high-dimensional Bayesian optimization with potential for parallelization and integration with existing BO techniques.
Abstract
Sequential optimization methods are often confronted with the curse of dimensionality in high-dimensional spaces. Current approaches under the Gaussian process framework are still burdened by the computational complexity of tracking Gaussian process posteriors and need to partition the optimization problem into small regions to ensure exploration or assume an underlying low-dimensional structure. With the idea of transiting the candidate points towards more promising positions, we propose a new method based on Markov Chain Monte Carlo to efficiently sample from an approximated posterior. We provide theoretical guarantees of its convergence in the Gaussian process Thompson sampling setting. We also show experimentally that both the Metropolis-Hastings and the Langevin Dynamics version of our algorithm outperform state-of-the-art methods in high-dimensional sequential optimization and reinforcement learning benchmarks.
