Relaxing the Additivity Constraints in Decentralized No-Regret High-Dimensional Bayesian Optimization
Anthony Bardou, Patrick Thiran, Thomas Begin
TL;DR
This paper tackles high-dimensional Bayesian Optimization (BO) by relaxing the common assumption that additive decompositions have a low Maximum Factor Size (MFS). It introduces DuMBO, a decentralized, message-passing BO algorithm that models $f$ as a sum of independent factor GPs over a factor graph, maximizes a tighter GP-UCB-style acquisition function in a decentralized fashion, and uses ADMM to enforce consistency. Theoretical results establish asymptotic no-regret performance, with a high-probability immediate regret bound and a KL-based convergence argument. Empirically, DuMBO matches or surpasses state-of-the-art methods on synthetic and real-world tasks, especially when the true objective has a large MFS or an unknown additive structure, which suggests that it scales well to high-dimensional, expensive optimization problems.
Abstract
Bayesian Optimization (BO) is typically used to optimize an unknown function $f$ that is noisy and costly to evaluate, by exploiting an acquisition function that must be maximized at each optimization step. Although provably asymptotically optimal BO algorithms are efficient at optimizing low-dimensional functions, scaling them to high-dimensional spaces remains an open problem, often tackled by assuming an additive structure for $f$. In doing so, BO algorithms typically introduce additional restrictive assumptions on the additive structure that reduce their domain of applicability. This paper makes two main contributions: (i) we relax the restrictive assumptions on the additive structure of $f$ without weakening the maximization guarantees of the acquisition function, and (ii) we address the over-exploration problem for decentralized BO algorithms. To these ends, we propose DuMBO, an asymptotically optimal decentralized BO algorithm that achieves very competitive performance against state-of-the-art BO algorithms, especially when the additive structure of $f$ comprises high-dimensional factors.
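To make the notion of an additive structure with high-dimensional factors concrete, the following is a minimal sketch, not taken from the paper. It assumes a toy 5-dimensional objective written as a sum of three factors, each depending on a subset of the input dimensions, and computes the size of the largest factor. The names `FACTORS`, `FACTOR_FUNCS`, `max_factor_size`, and `additive_objective`, as well as the factor functions themselves, are hypothetical and chosen only for illustration.

```python
import numpy as np

# Hypothetical decomposition of a 5-dimensional input (an assumption, not the
# paper's benchmark): each tuple lists the input dimensions one factor uses.
# Factors are allowed to share dimensions (here, dimension 1).
FACTORS = [(0, 1), (1, 2, 3), (4,)]

# Toy factor functions, one per tuple above (illustrative assumptions only).
FACTOR_FUNCS = [
    lambda z: -np.sum((z - 0.5) ** 2),
    lambda z: -np.sum(np.abs(z)),
    lambda z: float(np.cos(z[0])),
]


def max_factor_size(factors):
    """Largest number of input dimensions that any single factor depends on."""
    return max(len(dims) for dims in factors)


def additive_objective(x, factors=FACTORS, funcs=FACTOR_FUNCS):
    """Evaluate f(x) as the sum of each factor applied to its own sub-vector."""
    x = np.asarray(x, dtype=float)
    return float(sum(g(x[list(dims)]) for dims, g in zip(factors, funcs)))
```

In this toy setting, `max_factor_size(FACTORS)` is the quantity that methods assuming a low maximum factor size require to be small. The abstract's first contribution is to relax that kind of restriction, so larger factors such as `(1, 2, 3)` are the regime of interest.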
