Automatic model training under restrictive time constraints
Lukas Cironis, Jan Palczewski, Georgios Aivaliotis
TL;DR
AutoBCT tackles hyperparameter tuning under strict time constraints by formulating it as a stochastic control problem with partial information. It represents the unknown score and cost mappings as linear surrogates H(u)=α^Tφ(u) and T(u)=β^Tψ(u) with Gaussian priors updated via Kalman filtering, and solves a dynamic programming problem with a precomputed value function V that guides stopping and next-hyperparameter decisions. The framework is validated on synthetic tasks, CNN training, and large-scale datasets, showing efficient stopping and competitive accuracy within budget; VI/Map-based implementations generally outperform on-the-fly variants when precomputed maps are available. The approach enables meta-learning and reuse of value-function maps across problems, contributing a principled, eco-efficient AutoML paradigm grounded in stochastic control and regression Monte Carlo methods.
Abstract
We develop a hyperparameter optimisation algorithm, Automated Budget Constrained Training (AutoBCT), which balances the quality of a model with the computational cost required to tune it. The relationship between hyperparameters, model quality and computational cost must be learnt and this learning is incorporated directly into the optimisation problem. At each training epoch, the algorithm decides whether to terminate or continue training, and, in the latter case, what values of hyperparameters to use. This decision weighs optimally potential improvements in the quality with the additional training time and the uncertainty about the learnt quantities. The performance of our algorithm is verified on a number of machine learning problems encompassing random forests and neural networks. Our approach is rooted in the theory of Markov decision processes with partial information and we develop a numerical method to compute the value function and an optimal strategy.
