Bayesian Optimization with Formal Safety Guarantees via Online Conformal Prediction
Yunchuan Zhang, Sangwoo Park, Osvaldo Simeone
TL;DR
The paper addresses constrained black-box zero-th order optimization where a safety constraint $q(\mathbf{x})\ge0$ must be respected while maximizing $f(\mathbf{x})$. It introduces SAFE-BOCP, combining Bayesian optimization with online conformal prediction to produce an adaptive, pessimistic safe set that guarantees safety irrespective of the constraint function, at the cost of a non-zero target safety violation rate $\alpha$. Two algorithms are proposed: deterministic D-SAFE-BOCP and probabilistic P-SAFE-BOCP, each with formal safety guarantees (the latter under noise) and a data-driven tail-bound mechanism for practical resilience. The methods are validated on synthetic benchmarks and real-world tasks (e.g., movie recommendations and chemical reactor optimization), demonstrating competitive optimization performance while controlling safety risk and without strong model assumptions about $q$. This work broadens the applicability of safe BO to safety-critical domains by providing assumption-free safety guarantees and practical probability bounds.
Abstract
Black-box zero-th order optimization is a central primitive for applications in fields as diverse as finance, physics, and engineering. In a common formulation of this problem, a designer sequentially attempts candidate solutions, receiving noisy feedback on the value of each attempt from the system. In this paper, we study scenarios in which feedback is also provided on the safety of the attempted solution, and the optimizer is constrained to limit the number of unsafe solutions that are tried throughout the optimization process. Focusing on methods based on Bayesian optimization (BO), prior art has introduced an optimization scheme -- referred to as SAFEOPT -- that is guaranteed not to select any unsafe solution with a controllable probability over feedback noise as long as strict assumptions on the safety constraint function are met. In this paper, a novel BO-based approach is introduced that satisfies safety requirements irrespective of properties of the constraint function. This strong theoretical guarantee is obtained at the cost of allowing for an arbitrary, controllable but non-zero, rate of violation of the safety constraint. The proposed method, referred to as SAFE-BOCP, builds on online conformal prediction (CP) and is specialized to the cases in which feedback on the safety constraint is either noiseless or noisy. Experimental results on synthetic and real-world data validate the advantages and flexibility of the proposed SAFE-BOCP.
