Quantile Learn-Then-Test: Quantile-Based Risk Control for Hyperparameter Optimization
Amirmohammad Farzaneh, Sangwoo Park, Osvaldo Simeone
TL;DR
The paper addresses risk-aware hyperparameter optimization by extending Learn-Then-Test to quantile risk control ($QLTT$). $QLTT$ tests hypotheses on the $q$-th quantile of the test-risk distribution using calibrated p-values and applies family-wise error rate control to select reliable hyperparameters. It demonstrates the approach on a radio access scheduling problem, showing that the selected hyperparameters meet a target $R_q(\\lambda) \\leq \\alpha$ with high probability, and that tail risk is improved relative to standard LTT. This framework enables engineers to guarantee reliability for a specified fraction of problem instances in engineering AI deployments.
Abstract
The increasing adoption of Artificial Intelligence (AI) in engineering problems calls for the development of calibration methods capable of offering robust statistical reliability guarantees. The calibration of black box AI models is carried out via the optimization of hyperparameters dictating architecture, optimization, and/or inference configuration. Prior work has introduced learn-then-test (LTT), a calibration procedure for hyperparameter optimization (HPO) that provides statistical guarantees on average performance measures. Recognizing the importance of controlling risk-aware objectives in engineering contexts, this work introduces a variant of LTT that is designed to provide statistical guarantees on quantiles of a risk measure. We illustrate the practical advantages of this approach by applying the proposed algorithm to a radio access scheduling problem.
