Multi-Fidelity Bayesian Optimization With Across-Task Transferable Max-Value Entropy Search

Yunchuan Zhang; Sangwoo Park; Osvaldo Simeone

Multi-Fidelity Bayesian Optimization With Across-Task Transferable Max-Value Entropy Search

Yunchuan Zhang, Sangwoo Park, Osvaldo Simeone

TL;DR

The paper tackles sequential, costly black-box optimization under multi-fidelity cost structures across related tasks. It extends max-value entropy search by introducing shared GP parameters $\boldsymbol{\theta}$ that encapsulate transferable knowledge, and it uses particle-based variational updates (SVGD) to propagate this information across tasks. The proposed Multi-Fidelity Transferable MES (MFT-MES) augments the current-task information gain with a term that captures learning about $\boldsymbol{\theta}$, enabling faster convergence on future tasks. Theoretical regret bounds show how transferring knowledge reduces cumulative regret as the number of tasks grows, and empirical results on synthetic benchmarks and radio-resource management demonstrate substantial gains over single-task MF-MES and non-transfer baselines as more tasks are processed.

Abstract

In many applications, ranging from logistics to engineering, a designer is faced with a sequence of optimization tasks for which the objectives are in the form of black-box functions that are costly to evaluate. Furthermore, higher-fidelity evaluations of the optimization objectives often entail a larger cost. Existing multi-fidelity black-box optimization strategies select candidate solutions and fidelity levels with the goal of maximizing the information about the optimal value or the optimal solution for the current task. Assuming that successive optimization tasks are related, this paper introduces a novel information-theoretic acquisition function that balances the need to acquire information about the current task with the goal of collecting information transferable to future tasks. The proposed method transfers across tasks distributions over parameters of a Gaussian process surrogate model by implementing particle-based variational Bayesian updates. Theoretical insights based on the analysis of the expected regret substantiate the benefits of acquiring transferable knowledge across tasks. Furthermore, experimental results across synthetic and real-world examples reveal that the proposed acquisition strategy that caters to future tasks can significantly improve the optimization efficiency as soon as a sufficient number of tasks is processed.

Multi-Fidelity Bayesian Optimization With Across-Task Transferable Max-Value Entropy Search

TL;DR

The paper tackles sequential, costly black-box optimization under multi-fidelity cost structures across related tasks. It extends max-value entropy search by introducing shared GP parameters

that encapsulate transferable knowledge, and it uses particle-based variational updates (SVGD) to propagate this information across tasks. The proposed Multi-Fidelity Transferable MES (MFT-MES) augments the current-task information gain with a term that captures learning about

, enabling faster convergence on future tasks. Theoretical regret bounds show how transferring knowledge reduces cumulative regret as the number of tasks grows, and empirical results on synthetic benchmarks and radio-resource management demonstrate substantial gains over single-task MF-MES and non-transfer baselines as more tasks are processed.

Abstract

Paper Structure (27 sections, 1 theorem, 59 equations, 7 figures, 2 algorithms)

This paper contains 27 sections, 1 theorem, 59 equations, 7 figures, 2 algorithms.

Introduction
Context and Scope
Related Work
Main Contributions
Organization
Problem Definition And Preliminaries
Sequential Multi-Task Black-Box Optimization
Gaussian Process
Multi-fidelity Gaussian Process
Single-Task Multi-fidelity Bayesian Optimization
Multi-Fidelity Max-Value Entropy Search
Optimizing the Parameter Vector $\boldsymbol{\theta}$
Sequential Multi-fidelity Bayesian Optimization With Transferable Max-Value Entropy Search
Multi-Fidelity Transferable Max-Value Entropy Search
Bayesian Learning for the Parameter Vector $\boldsymbol{\theta}$
...and 12 more sections

Key Result

Theorem 1

Under Assumptions assump:regret_cond and lemma_3, denoting $\varepsilon=\varepsilon_1+\varepsilon_2$, the cumulative expected regret of the two-phase MFT-MES scheme satisfies the inequality where $\nu_{n,t}=\min_\mathbf{x}\tilde{r}_{n,t}(\mathbf{x}|\boldsymbol{\theta}^*)$ is the minimum normalized expected regret eq: normalized regret at time $t$; $t_n^* = \arg\max_{t \in \{1,...,c\cdot T\}} \nu_

Figures (7)

Figure 1: This paper studies a sequential multi-task optimization setting with multi-fidelity approximations of expensive-to-evaluate black-box objective functions. For any current task $n$, over time index $t=1,2,...,T_n$, the optimizer selects a pair of query point $\mathbf{x}_{n,t}$ and fidelity level $m_{n,t}$, requiring an approximation cost $\lambda^{(m)}$. As a result, the optimizer receives noisy feedback $y_{n,t}^{(m)}$ about target objective value $f_n(\mathbf{x}_{n,t})$. We wish to approach the global optimal solution $\mathbf{x}_n^*$ of objective $f_n(\mathbf{x})$ while abiding by a total simulation cost budget $\Lambda$.
Figure 2: Comparison between selected prior works and the proposed MFT-MES method.
Figure 3: Synthetic optimization tasks: Simple regret \ref{['eq: simple regret']} against the number of tasks, $n$, for MF-MES, Continual MF-MES $(\beta=0)$ with $V=5$ and $V=10$ particles, and MFT-MES ($\beta=1.2$) with $V=5$ and $V=10$ particles.
Figure 4: Synthetic optimization tasks: Log-simple regret against weight parameter $\beta$ and the number of tasks, $n$, for MF-MES (black dash-dotted line), Continual MF-MES (red solid line), and MFT-MES (blue surface). The optimal values of $\beta$ at the corresponding number of tasks $n$ are labeled as gold stars. The number of particles for Continual MF-MES and MFT-MES is set to $V=10$.
Figure 5: Radio resource management for wireless systems zhang2023bayesian: Optimality ratio \ref{['eq: optimality ratio']} against the number of tasks, $n$, for MF-MES, Continual MF-MES ($\beta=0$), and MFT-MES ($\beta=1.6$) with $V=10$ particles.
...and 2 more figures

Theorems & Definitions (3)

Definition 1: $\varepsilon$-sharpness
Definition 2: $\varepsilon$-stability
Theorem 1: Cumulative Expected Regret Bound of MFT-MES

Multi-Fidelity Bayesian Optimization With Across-Task Transferable Max-Value Entropy Search

TL;DR

Abstract

Multi-Fidelity Bayesian Optimization With Across-Task Transferable Max-Value Entropy Search

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (3)