Gaussian Process Thompson Sampling via Rootfinding

Taiwo A. Adebiyi; Bach Do; Ruda Zhang

Gaussian Process Thompson Sampling via Rootfinding

Taiwo A. Adebiyi, Bach Do, Ruda Zhang

TL;DR

An efficient global optimization strategy for GP-TS that carefully selects starting points for gradient-based multi-start optimizers and optimizes the posterior sample using a differentiable, decoupled representation is introduced.

Abstract

Thompson sampling (TS) is a simple, effective stochastic policy in Bayesian decision making. It samples the posterior belief about the reward profile and optimizes the sample to obtain a candidate decision. In continuous optimization, the posterior of the objective function is often a Gaussian process (GP), whose sample paths have numerous local optima, making their global optimization challenging. In this work, we introduce an efficient global optimization strategy for GP-TS that carefully selects starting points for gradient-based multi-start optimizers. It identifies all local optima of the prior sample via univariate global rootfinding, and optimizes the posterior sample using a differentiable, decoupled representation. We demonstrate remarkable improvement in the global optimization of GP posterior samples, especially in high dimensions. This leads to dramatic improvements in the overall performance of Bayesian optimization using GP-TS acquisition functions, surprisingly outperforming alternatives like GP-UCB and EI.

Gaussian Process Thompson Sampling via Rootfinding

TL;DR

Abstract

Gaussian Process Thompson Sampling via Rootfinding

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (2)