Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds
Shion Takeno, Yu Inatsu, Masayuki Karasuyama, Ichiro Takeuchi
TL;DR
This work studies Bayesian optimization with a Gaussian process prior to derive tighter Bayesian regret bounds for posterior-sampling-based strategies. It establishes sub-linear BCR results for Thompson sampling and extends the analysis to a new hyperparameter-free acquisition, PIMS, showing it achieves the same BCR rate as TS and IRGP-UCB while mitigating practical issues like over-exploration and manual hyperparameter tuning. The paper also details a rigorous regret analysis for both finite and continuous input domains, including a discretization-free approach for continuous spaces, and demonstrates through extensive experiments that PIMS offers robust performance across synthetic, benchmark, and real-world emulators. Overall, the results suggest that randomized acquisition with data-driven confidence parameters can attain strong theoretical guarantees and improved practical performance in BO. These findings have implications for scalable, hyperparameter-free BO in settings with large or continuous domains.
Abstract
Among various acquisition functions (AFs) in Bayesian optimization (BO), Gaussian process upper confidence bound (GP-UCB) and Thompson sampling (TS) are well-known options with established theoretical properties regarding Bayesian cumulative regret (BCR). Recently, it has been shown that a randomized variant of GP-UCB achieves a tighter BCR bound compared with GP-UCB, which we call the tighter BCR bound for brevity. Inspired by this study, this paper first shows that TS achieves the tighter BCR bound. On the other hand, GP-UCB and TS often practically suffer from manual hyperparameter tuning and over-exploration issues, respectively. Therefore, we analyze yet another AF called a probability of improvement from the maximum of a sample path (PIMS). We show that PIMS achieves the tighter BCR bound and avoids the hyperparameter tuning, unlike GP-UCB. Furthermore, we demonstrate a wide range of experiments, focusing on the effectiveness of PIMS that mitigates the practical issues of GP-UCB and TS.
