Variational Entropy Search for Adjusting Expected Improvement
Nuojin Cheng, Stephen Becker
TL;DR
The paper reveals that EI can be viewed as a special case of MES when viewed through variational inference, unifying information-theoretic acquisition functions under the Variational Entropy Search (VES) framework. It introduces ESLB as an ELBO-like lower bound and extends the variational family from exponential to Gamma distributions, giving rise to the VES-Gamma algorithm that balances exploration and exploitation. Empirically, VES-Gamma outperforms EI and MES on standard 2D test functions and on real-world hyperparameter-tuning tasks for XGBoost, indicating improved robustness to multi-modality and over-exploitation. The work suggests future directions include exploring broader variational families (e.g., generalized Gamma) to further enhance Bayesian optimization performance.
Abstract
Bayesian optimization is a widely used technique for optimizing black-box functions, with Expected Improvement (EI) being the most commonly utilized acquisition function in this domain. While EI is often viewed as distinct from other information-theoretic acquisition functions, such as entropy search (ES) and max-value entropy search (MES), our work reveals that EI can be considered a special case of MES when approached through variational inference (VI). In this context, we have developed the Variational Entropy Search (VES) methodology and the VES-Gamma algorithm, which adapts EI by incorporating principles from information-theoretic concepts. The efficacy of VES-Gamma is demonstrated across a variety of test functions and read datasets, highlighting its theoretical and practical utilities in Bayesian optimization scenarios.
