Fast Benchmarking of Asynchronous Multi-Fidelity Optimization on Zero-Cost Benchmarks
Shuhei Watanabe, Neeratyoy Mallik, Edward Bergman, Frank Hutter
TL;DR
This work tackles the prohibitive cost of hyperparameter optimization in deep learning by enabling efficient, asynchronous multi-fidelity optimization on zero-cost benchmarks. The authors introduce a filesystem-based wrapper that preserves the exact return order of evaluations across multiple workers without waiting for actual runtimes, using an algorithm that tracks cumulative runtimes $T_{p}^{(N)}$ and allocates the next sample to the free worker with minimal $T_{p}^{(N)}$. The approach is empirically validated through extensive tests and shows compatibility with a range of OSS HPO tools, yielding speedups of up to $1.3 \times 10^{3}$ compared to naïve simulation; they also quantify CO$_2$ savings from reduced runtimes. The work provides a practical, installable tool (pip install mfhpo-simulator) that facilitates fair, large-scale parallel evaluations on zero-cost benchmarks, enabling faster development and evaluation of HPO methods across diverse benchmark suites.
Abstract
While deep learning has celebrated many successes, its results often hinge on the meticulous selection of hyperparameters (HPs). However, the time-consuming nature of deep learning training makes HP optimization (HPO) a costly endeavor, slowing down the development of efficient HPO tools. While zero-cost benchmarks, which provide performance and runtime without actual training, offer a solution for non-parallel setups, they fall short in parallel setups as each worker must communicate its queried runtime to return its evaluation in the exact order. This work addresses this challenge by introducing a user-friendly Python package that facilitates efficient parallel HPO with zero-cost benchmarks. Our approach calculates the exact return order based on the information stored in file system, eliminating the need for long waiting times and enabling much faster HPO evaluations. We first verify the correctness of our approach through extensive testing and the experiments with 6 popular HPO libraries show its applicability to diverse libraries and its ability to achieve over 1000x speedup compared to a traditional approach. Our package can be installed via pip install mfhpo-simulator.
