Pareto-Optimality, Smoothness, and Stochasticity in Learning-Augmented One-Max-Search
Ziyad Benomar, Lorenzo Croissant, Vianney Perchet, Spyros Angelopoulos
TL;DR
The paper tackles the online One-Max Search with a prediction of the maximum price, addressing the need for algorithms that are simultaneously Pareto-optimal between consistency and robustness and smoothly responsive to prediction error. It introduces a deterministic, Pareto-optimal, and smooth family of threshold policies $\{\mathsf{A}_r^{\rho}\}$ with thresholds $\Phi_r^{\rho}$, showing that smoothing is optimally achieved at $\rho=1$ and establishing a lower bound on any Pareto-optimal algorithm’s smoothness. It extends the deterministic results to stochastic settings by coupling prices and predictions and employing optimal transport to bound performance under dependent predictions, with explicit instantiations for stochastic predictions, independent marginals, and OT-based dependence. The work is validated by synthetic and real-data experiments (Bitcoin), demonstrating the practical benefit of smoothness in handling prediction uncertainty, and it outlines future extensions to multi-unit variants and broader financial-optimization contexts.
Abstract
One-max search is a classic problem in online decision-making, in which a trader acts on a sequence of revealed prices and accepts one of them irrevocably to maximise its profit. The problem has been studied both in probabilistic and in worst-case settings, notably through competitive analysis, and more recently in learning-augmented settings in which the trader has access to a prediction on the sequence. However, existing approaches either lack smoothness, or do not achieve optimal worst-case guarantees: they do not attain the best possible trade-off between the consistency and the robustness of the algorithm. We close this gap by presenting the first algorithm that simultaneously achieves both of these important objectives. Furthermore, we show how to leverage the obtained smoothness to provide an analysis of one-max search in stochastic learning-augmented settings which capture randomness in both the observed prices and the prediction.
