Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

Shion Takeno; Yu Inatsu; Masayuki Karasuyama

Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

Shion Takeno, Yu Inatsu, Masayuki Karasuyama

TL;DR

Improved randomized GP-UCB is analyzed, which uses the confidence parameter generated from the shifted exponential distribution, and it is shown that randomization plays a key role in avoiding an increase in confidence parameter by showing that GP-UCB using a constant confidence parameter can incur linearly growing expected cumulative regret.

Abstract

Gaussian process upper confidence bound (GP-UCB) is a theoretically established algorithm for Bayesian optimization (BO), where we assume the objective function $f$ follows a GP. One notable drawback of GP-UCB is that the theoretical confidence parameter $β$ increases along with the iterations and is too large. To alleviate this drawback, this paper analyzes the randomized variant of GP-UCB called improved randomized GP-UCB (IRGP-UCB), which uses the confidence parameter generated from the shifted exponential distribution. We analyze the expected regret and conditional expected regret, where the expectation and the probability are taken respectively with $f$ and noise and with the randomness of the BO algorithm. In both regret analyses, IRGP-UCB achieves a sub-linear regret upper bound without increasing the confidence parameter if the input domain is finite. Furthermore, we show that randomization plays a key role in avoiding an increase in confidence parameter by showing that GP-UCB using a constant confidence parameter can incur linearly growing expected cumulative regret. Finally, we show numerical experiments using synthetic and benchmark functions and real-world emulators.

Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

TL;DR

Abstract

Gaussian process upper confidence bound (GP-UCB) is a theoretically established algorithm for Bayesian optimization (BO), where we assume the objective function

follows a GP. One notable drawback of GP-UCB is that the theoretical confidence parameter

increases along with the iterations and is too large. To alleviate this drawback, this paper analyzes the randomized variant of GP-UCB called improved randomized GP-UCB (IRGP-UCB), which uses the confidence parameter generated from the shifted exponential distribution. We analyze the expected regret and conditional expected regret, where the expectation and the probability are taken respectively with

and noise and with the randomness of the BO algorithm. In both regret analyses, IRGP-UCB achieves a sub-linear regret upper bound without increasing the confidence parameter if the input domain is finite. Furthermore, we show that randomization plays a key role in avoiding an increase in confidence parameter by showing that GP-UCB using a constant confidence parameter can incur linearly growing expected cumulative regret. Finally, we show numerical experiments using synthetic and benchmark functions and real-world emulators.

Paper Structure (30 sections, 20 theorems, 108 equations, 4 figures, 1 algorithm)

This paper contains 30 sections, 20 theorems, 108 equations, 4 figures, 1 algorithm.

Introduction
Related Work
Background
Bayesian Optimization
Regularity Assumptions
Regret
Maximum information gain
GP-UCB
Algorithm
Regret Analysis
Expected Regret Bounds
Conditional Expected Regret Bounds
High-Probability Regret Bounds
Expected Regret of GP-UCB with Constant Confidence Parameter
Experiments
...and 15 more sections

Key Result

Lemma 4.1

Suppose that $f$ is a sample path from a GP with zero mean and a positive semidefinite kernel $k$, and ${\cal X}$ is finite. Pick $\delta \in (0, 1)$ and $t \geq 1$. Then, for any given ${\cal D}_{t-1}$, where $\beta_{\delta} = 2 \log (|{\cal X}| /( 2 \delta))$.

Figures (4)

Figure 1: This figure shows the confidence parameters of GP-UCB-based methods. For GP-UCB, the solid line represents $\beta_t$. For RGP-UCB and IRGP-UCB, the solid lines and shaded areas represent the expectations $\mathbb{E}[\zeta_t]$ and $95 \%$ credible intervals, respectively.
Figure 2: This figure shows the average and standard errors of simple regrets.
Figure 3: Average and standard errors of simple regret on benchmark functions.
Figure 4: Average and standard errors of simple regret on real-world datasets.

Theorems & Definitions (39)

Definition 2.1: Maximum information gain
Lemma 4.1
Lemma 4.2
proof
Theorem 4.1: BCR bound for finite domain
Theorem 4.2: BCR bound for infinite domain
Remark 4.1
Theorem 4.3
proof
Theorem 4.4
...and 29 more

Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

TL;DR

Abstract

Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (39)