Algorithm Selection with Probing Trajectories: Benchmarking the Choice of Classifier Model
Quentin Renau, Emma Hart
TL;DR
This study investigates how the choice of time-series classifier influences algorithm selection when using probing-trajectories from black-box optimisation. By benchmarking 17 classifiers on the BBOB suite with three solvers under LOIO and LOPO cross-validation, it shows that feature-based (Summary) and interval-based (Time Series Forest) models reliably outperform other approaches, including many deep-learning and kernel-based methods. Automated tuning via irace improves performance and these gains transfer across trajectory types, indicating practical robustness. The results suggest that classifier choice is a crucial factor in trajectory-based algorithm selection and provide actionable defaults for practitioners, while highlighting the need to generalise benchmarks to more suites and to consider regression formulations.
Abstract
Recent approaches to training algorithm selectors in the black-box optimisation domain have advocated for the use of training data that is algorithm-centric in order to encapsulate information about how an algorithm performs on an instance, rather than relying on information derived from features of the instance itself. Probing-trajectories that consist of a sequence of objective performance per function evaluation obtained from a short run of an algorithm have recently shown particular promise in training accurate selectors. However, training models on this type of data requires an appropriately chosen classifier given the sequential nature of the data. There are currently no clear guidelines for choosing the most appropriate classifier for algorithm selection using time-series data from the plethora of models available. To address this, we conduct a large benchmark study using 17 different classifiers and three types of trajectory on a classification task using the BBOB benchmark suite using both leave-one-instance out and leave-one-problem out cross-validation. In contrast to previous studies using tabular data, we find that the choice of classifier has a significant impact, showing that feature-based and interval-based models are the best choices.
