Hyperparameter Optimization via Interacting with Probabilistic Circuits
Jonas Seng, Fabrizio Ventola, Zhongjie Yu, Kristian Kersting
TL;DR
This work tackles interactive hyperparameter optimization by replacing acquisition-based guidance with a tractable probabilistic-circuit surrogate that supports exact conditioning. IBO-HPC uses conditional sampling to generate candidates that honor user beliefs, optionally weighting them with a user prior, and avoids inner-loop optimization of acquisition functions. The authors prove that the interactive policy is feedback-adhering and globally convergent, and they provide an EI-based convergence analysis under Gaussian-leaf PCs. Empirically, IBO-HPC is competitive with strong HPO baselines and often outperforms interactive baselines when user knowledge is provided, achieving substantial speed-ups and demonstrating robustness to misleading feedback. The approach promises practical gains for human-in-the-loop AutoML, while outlining limitations and directions for future work such as transfer learning across tasks.
Abstract
Despite the growing interest in designing truly interactive hyperparameter optimization (HPO) methods, to date, only a few allow to include human feedback. Existing interactive Bayesian optimization (BO) methods incorporate human beliefs by weighting the acquisition function with a user-defined prior distribution. However, in light of the non-trivial inner optimization of the acquisition function prevalent in BO, such weighting schemes do not always accurately reflect given user beliefs. We introduce a novel BO approach leveraging tractable probabilistic models named probabilistic circuits (PCs) as a surrogate model. PCs encode a tractable joint distribution over the hybrid hyperparameter space and evaluation scores. They enable exact conditional inference and sampling. Based on conditional sampling, we construct a novel selection policy that enables an acquisition function-free generation of candidate points (thereby eliminating the need for an additional inner-loop optimization) and ensures that user beliefs are reflected accurately in the selection policy. We provide a theoretical analysis and an extensive empirical evaluation, demonstrating that our method achieves state-of-the-art performance in standard HPO and outperforms interactive BO baselines in interactive HPO.
