Wasserstein Distributionally Robust Bayesian Optimization with Continuous Context
Francesco Micheli, Efe C. Balta, Anastasios Tsiamis, John Lygeros
TL;DR
The paper tackles sequential decision-making under uncertain, continuously distributed contexts by introducing Wasserstein Distributionally Robust Bayesian Optimization (WDRBO). It develops a tractable Lipschitz-based reformulation that avoids context discretization and provides two settings: General WDRBO with a specified ambiguity set and Data-Driven WDRBO built from past observations, both achieving sublinear regret; the data-driven variant leverages finite-sample Wasserstein concentration to avoid strict decay assumptions on the ambiguity radius. The authors establish regret bounds that match state-of-the-art results in RKHS-based BO, and show empirical robustness and computational efficiency across synthetic and real-world tasks. This work broadens robust optimization in BO to continuous-context scenarios with practical guarantees and scalable performance, enabling reliable optimization under distributional context shifts.
Abstract
We address the challenge of sequential data-driven decision-making under context distributional uncertainty. This problem arises in numerous real-world scenarios where the learner optimizes black-box objective functions in the presence of uncontrollable contextual variables. We consider the setting where the context distribution is uncertain but known to lie within an ambiguity set defined as a ball in the Wasserstein distance. We propose a novel algorithm for Wasserstein Distributionally Robust Bayesian Optimization that can handle continuous context distributions while maintaining computational tractability. Our theoretical analysis combines recent results in self-normalized concentration in Hilbert spaces and finite-sample bounds for distributionally robust optimization to establish sublinear regret bounds that match state-of-the-art results. Through extensive comparisons with existing approaches on both synthetic and real-world problems, we demonstrate the simplicity, effectiveness, and practical applicability of our proposed method.
