Cost-aware Stopping for Bayesian Optimization
Qian Xie, Linda Cai, Alexander Terenin, Peter I. Frazier, Ziv Scully
TL;DR
The paper addresses when to stop expensive Bayesian optimization in cost-varying environments by introducing a principled stopping rule linked to Pandora's Box/Gittins index and the LogEIPC EI-based criterion. The authors prove a theoretical bound on the cost-adjusted simple regret when the stopping rule is paired with either the PBGI or LogEIPC acquisition function, and they show the rule is equivalent to an EI-cost condition, enabling a unified framework. Empirically, the method competes with or outperforms baselines across synthetic Bayesian regret tasks and real-world hyperparameter and neural-architecture searches, especially under high evaluation costs, while offering practical guidance for unknown costs and budget constraints. The work delivers a tuning-free, theoretically grounded approach to cost-aware stopping in Bayesian optimization with broad applicability and strong practical impact.
Abstract
In automated machine learning, scientific discovery, and other applications of Bayesian optimization, deciding when to stop evaluating expensive black-box functions in a cost-aware manner is an important but underexplored practical consideration. A natural performance metric for this purpose is the cost-adjusted simple regret, which captures the trade-off between solution quality and cumulative evaluation cost. While several heuristic or adaptive stopping rules have been proposed, they lack guarantees ensuring stopping before incurring excessive function evaluation costs. We propose a principled cost-aware stopping rule for Bayesian optimization that adapts to varying evaluation costs without heuristic tuning. Our rule is grounded in a theoretical connection to state-of-the-art cost-aware acquisition functions, namely the Pandora's Box Gittins Index (PBGI) and log expected improvement per cost (LogEIPC). We prove a theoretical guarantee bounding the expected cost-adjusted simple regret incurred by our stopping rule when paired with either acquisition function. Across synthetic and empirical tasks, including hyperparameter optimization and neural architecture size search, pairing our stopping rule with PBGI or LogEIPC usually matches or outperforms other acquisition-function--stopping-rule pairs in terms of cost-adjusted simple regret.
