Sample-Efficient Bayesian Optimization with Transfer Learning for Heterogeneous Search Spaces

Aryan Deshwal; Sait Cakmak; Yuhou Xia; David Eriksson

Sample-Efficient Bayesian Optimization with Transfer Learning for Heterogeneous Search Spaces

Aryan Deshwal, Sait Cakmak, Yuhou Xia, David Eriksson

TL;DR

The first approach leverages a Gaussian process (GP) model with a conditional kernel to transfer information between different search spaces and treats the missing parameters as hyperparameters of the GP model that can be inferred jointly with the other GP hyperparameters or set to fixed values.

Abstract

Bayesian optimization (BO) is a powerful approach to sample-efficient optimization of black-box functions. However, in settings with very few function evaluations, a successful application of BO may require transferring information from historical experiments. These related experiments may not have exactly the same tunable parameters (search spaces), motivating the need for BO with transfer learning for heterogeneous search spaces. In this paper, we propose two methods for this setting. The first approach leverages a Gaussian process (GP) model with a conditional kernel to transfer information between different search spaces. Our second approach treats the missing parameters as hyperparameters of the GP model that can be inferred jointly with the other GP hyperparameters or set to fixed values. We show that these two methods perform well on several benchmark problems.

Sample-Efficient Bayesian Optimization with Transfer Learning for Heterogeneous Search Spaces

TL;DR

Abstract

Paper Structure (23 sections, 2 equations, 3 figures, 1 algorithm)

This paper contains 23 sections, 2 equations, 3 figures, 1 algorithm.

Introduction
Related Work
Transfer Learning in Bayesian Optimization (BO)
BO with Heterogeneous Search Spaces
Background
Bayesian Optimization
Bayesian Optimization with Transfer Learning
BO with Transfer Learning for Heterogeneous Search Spaces
MTGP with Conditional Kernels
Tree-Structured Representation of Heterogeneous Search Spaces
Conditional kernel
MTGP with Conditional Kernels
MTGP with Imputed Values
Experiments
Methods
...and 8 more sections

Figures (3)

Figure 1: The tree-based representation and the corresponding kernels of the search spaces $\mathcal{X}_{1}, \mathcal{X}_{2}$ and $\mathcal{X}_{3}$ given in Sec. \ref{['sec:background']}.
Figure 2: (Left) The Learned Imputed MTGP performs the best on the 6D Hartmann problem. (Middle/Right) The MTGP-based methods outperform Vanilla BO and Random Search on the 11D Ranger and 6D Rpart problems. Each experiment employs 30 source task trials.
Figure 3: Ablation study for varying the number of trials for each source task on the 11D Ranger problem. We observe that the MTGP with conditional kernels performs the best when only a small number of source trials are available. As more source data is available, the Learned Imputed MTGP and MTGP on common parameters perform the best.

Sample-Efficient Bayesian Optimization with Transfer Learning for Heterogeneous Search Spaces

TL;DR

Abstract

Sample-Efficient Bayesian Optimization with Transfer Learning for Heterogeneous Search Spaces

Authors

TL;DR

Abstract

Table of Contents

Figures (3)