A Meta-Level Learning Algorithm for Sequential Hyper-Parameter Space Reduction in AutoML
Giorgos Borboudakis, Paulos Charonyktakis, Konstantinos Paraschakis, Ioannis Tsamardinos
TL;DR
This work tackles the cash-like problem in AutoML by introducing Sequential Hyper-parameter Space Reduction (SHSR), a meta-level learning algorithm that uses past runs to prune discrete hyper-parameter choices. SHSR builds predictive models from dataset meta-features to identify and discard configuration groups that cannot significantly improve performance, recursively reducing the search space while keeping predictive accuracy within a user-defined tolerance. In extensive experiments across 659 datasets (284 classification, 375 regression), SHSR achieves up to about 30% reductions in execution time with less than 0.1% drop in predictive performance, and it remains effective even with incomplete data. The approach is interpretable, compatible with other HPO methods, and offers a practical path to speeding AutoML without sacrificing reliability.
Abstract
AutoML platforms have numerous options for the algorithms to try for each step of the analysis, i.e., different possible algorithms for imputation, transformations, feature selection, and modelling. Finding the optimal combination of algorithms and hyper-parameter values is computationally expensive, as the number of combinations to explore leads to an exponential explosion of the space. In this paper, we present the Sequential Hyper-parameter Space Reduction (SHSR) algorithm that reduces the space for an AutoML tool with negligible drop in its predictive performance. SHSR is a meta-level learning algorithm that analyzes past runs of an AutoML tool on several datasets and learns which hyper-parameter values to filter out from consideration on a new dataset to analyze. SHSR is evaluated on 284 classification and 375 regression problems, showing an approximate 30% reduction in execution time with a performance drop of less than 0.1%.
