Accelerating Look-ahead in Bayesian Optimization: Multilevel Monte Carlo is All you Need
Shangda Yang, Vitaly Zankin, Maximilian Balandat, Stefan Scherer, Kevin Carlberg, Neil Walton, Kody J. H. Law
TL;DR
The paper tackles the computational bottleneck of multi-step look-ahead Bayesian optimization caused by nested Monte Carlo in acquisition-function evaluation. It introduces Multilevel Monte Carlo (MLMC) to construct telescoping estimators that couple inexpensive coarse simulations with accurate fine simulations, achieving the canonical $O( ext{ε}^{-2})$ cost for nested MC and mitigating dimension dependence. The authors provide a rigorous decomposition of SAA error into variance and bias, establish MLMC estimators for both the acquisition function and its maximizer, and prove convergence rates with improved costs; they also show that antithetic coupling can boost variance decay to $eta \,\approx \,1.5$, further improving efficiency. Numerical experiments on a 1D toy problem and BO benchmarks confirm substantial reductions in computational effort for a given accuracy and illustrate practical implementation considerations. The work lays foundations for extensions to multi-index MC, randomized MLMC, and quasi-Monte Carlo variants within Bayesian optimization.
Abstract
We leverage multilevel Monte Carlo (MLMC) to improve the performance of multi-step look-ahead Bayesian optimization (BO) methods that involve nested expectations and maximizations. Often these expectations must be computed by Monte Carlo (MC). The complexity rate of naive MC degrades for nested operations, whereas MLMC is capable of achieving the canonical MC convergence rate for this type of problem, independently of dimension and without any smoothness assumptions. Our theoretical study focuses on the approximation improvements for twoand three-step look-ahead acquisition functions, but, as we discuss, the approach is generalizable in various ways, including beyond the context of BO. Our findings are verified numerically and the benefits of MLMC for BO are illustrated on several benchmark examples. Code is available at https://github.com/Shangda-Yang/MLMCBO .
