A Continuous Relaxation for Discrete Bayesian Optimization
Richard Michael, Simon Bartels, Miguel González-Duque, Yevgen Zainchkovskyy, Jes Frellsen, Søren Hauberg, Wouter Boomsma
TL;DR
This work tackles discrete sequence optimization under strict, expensive evaluation budgets by reframing the problem in a continuous probability-space. It introduces Continuously Relaxed Bayesian Optimization (CoRel), which places a Gaussian process prior over a relaxed objective $ar f$ computed as the expectation under a distribution over sequences, and uses a weighted Hellinger kernel to incorporate prior knowledge. The approach enables acquisition optimization via discrete, continuous, or manifold methods and is instantiated with a product-kernel model over latent subsets; empirical results on GFP and RFP tasks show improved performance in cold-start, low-budget settings compared to state-of-the-art baselines. The study highlights the value of priors and surrogate choices in discrete BO and provides practical software for researchers in protein sequence design and related domains.
Abstract
To optimize efficiently over discrete data and with only few available target observations is a challenge in Bayesian optimization. We propose a continuous relaxation of the objective function and show that inference and optimization can be computationally tractable. We consider in particular the optimization domain where very few observations and strict budgets exist; motivated by optimizing protein sequences for expensive to evaluate bio-chemical properties. The advantages of our approach are two-fold: the problem is treated in the continuous setting, and available prior knowledge over sequences can be incorporated directly. More specifically, we utilize available and learned distributions over the problem domain for a weighting of the Hellinger distance which yields a covariance function. We show that the resulting acquisition function can be optimized with both continuous or discrete optimization algorithms and empirically assess our method on two bio-chemical sequence optimization tasks.
