Unexpected Improvements to Expected Improvement for Bayesian Optimization
Sebastian Ament, Samuel Daulton, David Eriksson, Maximilian Balandat, Eytan Bakshy
TL;DR
This work identifies a fundamental numerical issue in improvement-based Bayesian optimization, where acquisition function values and gradients vanish in large portions of the domain, hindering gradient-based optimization. To address this, the authors introduce LogEI, a family of acquisition functions whose optima align with the canonical counterparts but which are numerically stable; they extend the approach to constrained and multi-objective settings (LogCEI, LogEHVI) and to parallel batch variants (qLogEI, qLogEHVI) using log-space formulations and fat-tailed smooth approximations. Empirically, LogEI variants consistently outperform their EI counterparts across single-objective, constrained, high-dimensional, parallel, and multi-objective benchmarks, often approaching or surpassing state-of-the-art baselines with no added computational burden. The results underscore the importance of numerically robust acquisition optimization, showing joint batch optimization can compete with sequential greedy strategies and that these numerical reforms can meaningfully expand the applicability of Bayesian optimization in practice.
Abstract
Expected Improvement (EI) is arguably the most popular acquisition function in Bayesian optimization and has found countless successful applications, but its performance is often exceeded by that of more recent methods. Notably, EI and its variants, including for the parallel and multi-objective settings, are challenging to optimize because their acquisition values vanish numerically in many regions. This difficulty generally increases as the number of observations, dimensionality of the search space, or the number of constraints grow, resulting in performance that is inconsistent across the literature and most often sub-optimal. Herein, we propose LogEI, a new family of acquisition functions whose members either have identical or approximately equal optima as their canonical counterparts, but are substantially easier to optimize numerically. We demonstrate that numerical pathologies manifest themselves in "classic" analytic EI, Expected Hypervolume Improvement (EHVI), as well as their constrained, noisy, and parallel variants, and propose corresponding reformulations that remedy these pathologies. Our empirical results show that members of the LogEI family of acquisition functions substantially improve on the optimization performance of their canonical counterparts and surprisingly, are on par with or exceed the performance of recent state-of-the-art acquisition functions, highlighting the understated role of numerical optimization in the literature.
