Multi-objective Hyperparameter Optimization in the Age of Deep Learning
Soham Basu, Frank Hutter, Danny Stoll
TL;DR
This work tackles multi-objective hyperparameter optimization for deep learning by introducing PriMO, the first MO-HPO algorithm that incorporates expert priors over multiple objectives and leverages cheap objective proxies. PriMO uses a MO-prior-augmented Bayesian acquisition with random scalarizations and an aggressive, MO-aware initial design to gain strong early progress, while retaining robustness against misleading priors. Empirically, PriMO achieves state-of-the-art performance in both MO and single-objective settings across eight DL benchmarks and shows resilience to prior strength through ablations and robustness studies. The approach offers practical, budget-efficient HPO for real-world DL tasks, enabling practitioners to navigate Pareto trade-offs more effectively under constrained compute budgets.
Abstract
While Deep Learning (DL) experts often have prior knowledge about which hyperparameter settings yield strong performance, only few Hyperparameter Optimization (HPO) algorithms can leverage such prior knowledge and none incorporate priors over multiple objectives. As DL practitioners often need to optimize not just one but many objectives, this is a blind spot in the algorithmic landscape of HPO. To address this shortcoming, we introduce PriMO, the first HPO algorithm that can integrate multi-objective user beliefs. We show PriMO achieves state-of-the-art performance across 8 DL benchmarks in the multi-objective and single-objective setting, clearly positioning itself as the new go-to HPO algorithm for DL practitioners.
