Dynamic Priors in Bayesian Optimization for Hyperparameter Optimization
Lukas Fehring, Marcel Wever, Maximilian Spliethöver, Leona Hennig, Henning Wachsmuth, Marius Lindauer
TL;DR
DynaBO introduces dynamic priors for Bayesian optimization in hyperparameter optimization, enabling users to inject priors at any point during the optimization and stacking multiple priors with time-decayed influence. It integrates a prior-weighted acquisition function and a prior rejection mechanism to safeguard against misleading inputs, preserving BO convergence guarantees while offering potential acceleration from informative priors. The approach is theoretically grounded, proving almost sure convergence and robustness to adversarial priors, and empirically demonstrates speedups over vanilla BO and piBO across diverse benchmarks. This work advances human-in-the-loop AutoML by enabling online collaboration between users and optimization algorithms, with practical robustness and scalability considerations.
Abstract
Hyperparameter optimization (HPO), for example, based on Bayesian optimization (BO), supports users in designing models well-suited for a given dataset. HPO has proven its effectiveness on several applications, ranging from classical machine learning for tabular data to deep neural networks for computer vision and transformers for natural language processing. However, HPO still sometimes lacks acceptance by machine learning experts due to its black-box nature and limited user control. Addressing this, first approaches have been proposed to initialize BO methods with expert knowledge. However, these approaches do not allow for online steering during the optimization process. In this paper, we introduce a novel method that enables repeated interventions to steer BO via user input, specifying expert knowledge and user preferences at runtime of the HPO process in the form of prior distributions. To this end, we generalize an existing method, $π$BO, preserving theoretical guarantees. We also introduce a misleading prior detection scheme, which allows protection against harmful user inputs. In our experimental evaluation, we demonstrate that our method can effectively incorporate multiple priors, leveraging informative priors, whereas misleading priors are reliably rejected or overcome. Thereby, we achieve competitiveness to unperturbed BO.
