Batched Bayesian optimization by maximizing the probability of including the optimum
Jenna Fromer, Runzhong Wang, Mrunali Manjrekar, Austin Tripp, José Miguel Hernández-Lobato, Connor W. Coley
TL;DR
The paper tackles batched Bayesian optimization in discrete design spaces, where non-additive batch acquisitions hinder optimal batch selection. It introduces qPO (multipoint Probability of Optimality), an exploitative batch construction that maximizes the probability that the true global optimum lies in the acquired batch, expressible as a sum of per-candidate scores $\Pr(x^* \in \mathcal{X}_{acq}) = \sum_{x_i \in \mathcal{X}_{acq}} \Pr(x^* = x_i) = \sum_i \alpha_i$. Alpha_i are estimated via Monte Carlo from the joint posterior, with practical accelerations by approximating the posterior as a multivariate Gaussian; low-probability events are handled via fallback strategies. The method inherently captures diversity through model covariance and remains competitive with state-of-the-art batched BO approaches in antibiotic discovery and QM9-based molecular design, while offering a simple, deterministic alternative to randomization-based strategies. Limitations include reliance on Monte Carlo estimation and potential degradation under high observation noise, suggesting avenues for analytical approximations and robustness studies.
Abstract
Batched Bayesian optimization (BO) can accelerate molecular design by efficiently identifying top-performing compounds from a large chemical library. Existing acquisition strategies for batch design in BO aim to balance exploration and exploitation. This often involves optimizing non-additive batch acquisition functions, necessitating approximation via myopic construction and/or diversity heuristics. In this work, we propose an acquisition strategy for discrete optimization that is motivated by pure exploitation, qPO (multipoint Probability of Optimality). qPO maximizes the probability that the batch includes the true optimum, which is expressible as the sum over individual acquisition scores and thereby circumvents the combinatorial challenge of optimizing a batch acquisition function. We differentiate the proposed strategy from parallel Thompson sampling and discuss how it implicitly captures diversity. Finally, we apply our method to the model-guided exploration of large chemical libraries and provide empirical evidence that it is competitive with and complements other state-of-the-art methods in batched Bayesian optimization.
