Hyperparameter Importance Analysis for Multi-Objective AutoML
Daphne Theodorakopoulos, Frederic Stahl, Marius Lindauer
TL;DR
The paper tackles the challenge of assessing hyperparameter importance in multi-objective AutoML by proposing a weighted $HPI$ framework that scalarizes objectives along Pareto-front tradeoffs. It develops two surrogate-based methods, $MO$-$fANOVA$ and $MO$-ablation path analysis, to quantify hyperparameter influence across different weightings, supported by data preparation and a Pareto-based weighting scheme. Through three MO-HPO experiments on MNIST, Adult, and CIFAR-10, the authors demonstrate that hyperparameters can exhibit distinct, tradeoff-dependent importance, and that the two methods offer complementary, actionable insights. The work advances human-centered MO AutoML by enabling more nuanced hyperparameter tuning and interpretability, while acknowledging limitations and outlining directions for broader objective setups and tool integration.
Abstract
Hyperparameter optimization plays a pivotal role in enhancing the predictive performance and generalization capabilities of ML models. However, in many applications, we do not only care about predictive performance but also about additional objectives such as inference time, memory, or energy consumption. In such multi-objective scenarios, determining the importance of hyperparameters poses a significant challenge due to the complex interplay between the conflicting objectives. In this paper, we propose the first method for assessing the importance of hyperparameters in multi-objective hyperparameter optimization. Our approach leverages surrogate-based hyperparameter importance measures, i.e., fANOVA and ablation paths, to provide insights into the impact of hyperparameters on the optimization objectives. Specifically, we compute the a-priori scalarization of the objectives and determine the importance of the hyperparameters for different objective tradeoffs. Through extensive empirical evaluations on diverse benchmark datasets with three different objective pairs, each combined with accuracy, namely time, demographic parity loss, and energy consumption, we demonstrate the effectiveness and robustness of our proposed method. Our findings not only offer valuable guidance for hyperparameter tuning in multi-objective optimization tasks but also contribute to advancing the understanding of hyperparameter importance in complex optimization scenarios.
