None To Optima in Few Shots: Bayesian Optimization with MDP Priors

Diantong Li; Kyunghyun Cho; Chong Liu

None To Optima in Few Shots: Bayesian Optimization with MDP Priors

Diantong Li, Kyunghyun Cho, Chong Liu

TL;DR

This work tackles the challenge of optimizing expensive black-box functions with very few evaluations. It introduces ProfBO, a Bayesian optimization framework that uses MDP priors to model optimization trajectories from related source tasks, embodied in a PFN surrogate and adapted via MAML for fast target-task performance, enabling effective search with $T \le 20$. The key contributions are the trajectory-prior modeling via MDPs, the integration of PFNs with meta-learning, and comprehensive demonstrations on real-world drug-discovery benchmarks and hyperparameter optimization showing superior few-shot performance and practical efficiency. The findings suggest that leveraging optimization-trajectory priors can substantially accelerate scientific discovery and engineering optimization in high-cost domains.

Abstract

Bayesian Optimization (BO) is an efficient tool for optimizing black-box functions, but its theoretical guarantees typically hold in the asymptotic regime. In many critical real-world applications such as drug discovery or materials design, where each evaluation can be very costly and time-consuming, BO becomes impractical for many evaluations. In this paper, we introduce the Procedure-inFormed BO (ProfBO) algorithm, which solves black-box optimization with remarkably few function evaluations. At the heart of our algorithmic design are Markov Decision Process (MDP) priors that model optimization trajectories from related source tasks, thereby capturing procedural knowledge on efficient optimization. We embed these MDP priors into a prior-fitted neural network and employ model-agnostic meta-learning for fast adaptation to new target tasks. Experiments on real-world Covid and Cancer benchmarks and hyperparameter tuning tasks demonstrate that ProfBO consistently outperforms state-of-the-art methods by achieving high-quality solutions with significantly fewer evaluations, making it ready for practical deployment.

None To Optima in Few Shots: Bayesian Optimization with MDP Priors

TL;DR

Abstract

None To Optima in Few Shots: Bayesian Optimization with MDP Priors

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)