Fast, Precise Thompson Sampling for Bayesian Optimization
David Sweet
TL;DR
Bayesian optimization with Thompson sampling tends to underperform popular acquisition functions in continuous domains. The authors introduce Stagger Thompson Sampler (STS), a Hit-and-Run–based sampler that initializes at $\tilde{x}_*$ and uses a log-uniform perturbation length to focus search on high-density regions of the maximizer distribution $p_*(x)$, while maintaining computational efficiency. STS integrates with Minimal Terminal Variance (MTV) for batch design, enabling effective batch selection by drawing arms from $p_*(x)$ and optimizing over the continuous space with a lightweight Metropolis-style acceptance. Empirical results across nine functions and dimensions up to 300 show that STS outperforms TS, PSS, EI, UCB, and CMA-ES, and MTV+STS matches or surpasses previous batching approaches. The work provides a scalable, practical approach for fast, precise Bayesian optimization in high-dimensional, batched settings.
Abstract
Thompson sampling (TS) has optimal regret and excellent empirical performance in multi-armed bandit problems. Yet, in Bayesian optimization, TS underperforms popular acquisition functions (e.g., EI, UCB). TS samples arms according to the probability that they are optimal. A recent algorithm, P-Star Sampler (PSS), performs such a sampling via Hit-and-Run. We present an improved version, Stagger Thompson Sampler (STS). STS more precisely locates the maximizer than does TS using less computation time. We demonstrate that STS outperforms TS, PSS, and other acquisition methods in numerical experiments of optimizations of several test functions across a broad range of dimension. Additionally, since PSS was originally presented not as a standalone acquisition method but as an input to a batching algorithm called Minimal Terminal Variance (MTV), we also demon-strate that STS matches PSS performance when used as the input to MTV.
