Near-optimal Swap Regret Minimization for Convex Losses

Lunjia Hu; Jon Schneider; Yifan Wu

TL;DR

The paper resolves an open question on swap regret minimization for sequences of convex, Lipschitz losses on the unit interval $[0,1]$. It gives an efficient online algorithm with expected swap regret $\mathbb E[\mathsf{SR}]=O(\sqrt T\log T)$ and high-probability bounds of $O\big(\sqrt{(T\log T)\log(1/\delta)}\big)$. The core techniques are multi-scale binning and a V-shaped decomposition, which together reduce the problem to base losses and balance sampling error against rounding error. An efficient multi-objective learning framework (AMF) combined with the MsMwC expert algorithm yields a $\mathsf{poly}(T)$-time predictor that achieves this near-optimal regret. The results extend to calibration for elicitable properties, including the mean, median, and quantiles. This advances online calibration and learning in games by providing near-optimal, scalable guarantees for continuous action spaces under adversarial convex losses. The practical impact spans calibration of predictive distributions and downstream decision making that depends on robust, transform-invariant regret guarantees.
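To make the quantity being bounded concrete, the following is a minimal sketch of how empirical swap regret could be computed for a finished sequence of predictions. It uses the standard definition (for each distinct predicted value, compare the loss incurred with the loss of the best single replacement). The names `swap_regret` and `v_loss`, the grid search over replacements, and the choice of absolute-value losses as an example of a V-shaped loss are illustrative assumptions, not the paper's algorithm or notation.

```python
from collections import defaultdict


def swap_regret(preds, losses, grid_size=101):
    """Empirical swap regret of `preds` against per-round `losses` on [0, 1].

    preds  : list of floats in [0, 1], one prediction per round
    losses : list of callables, losses[t](p) is the loss of predicting p at round t
    The best replacement for each predicted value is searched on a uniform
    grid (an approximation chosen for this sketch).
    """
    grid = [i / (grid_size - 1) for i in range(grid_size)]

    # Group the rounds by the value that was predicted.
    groups = defaultdict(list)
    for p, loss in zip(preds, losses):
        groups[p].append(loss)

    total = 0.0
    for p, group in groups.items():
        incurred = sum(loss(p) for loss in group)
        # Include p itself so the regret of each group is never negative.
        candidates = grid + [p]
        best = min(sum(loss(q) for loss in group) for q in candidates)
        total += incurred - best
    return total


def v_loss(y):
    """Hypothetical V-shaped loss p -> |p - y| (convex and 1-Lipschitz)."""
    return lambda p: abs(p - y)


# Predicting 0.5 when outcomes split evenly: no replacement helps.
no_regret = swap_regret([0.5] * 4, [v_loss(y) for y in (0, 0, 1, 1)])

# Predicting 0.2 when the outcome is 1 and 0.8 when it is 0: swapping helps a lot.
high_regret = swap_regret([0.2, 0.2, 0.8, 0.8], [v_loss(y) for y in (1, 1, 0, 0)])
```

In the second example each group of rounds would have been better served by the opposite endpoint, which is exactly the kind of systematic mistake that swap regret penalizes.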

Abstract

We give a randomized online algorithm that guarantees near-optimal $\widetilde O(\sqrt T)$ expected swap regret against any sequence of $T$ adaptively chosen Lipschitz convex losses on the unit interval. This improves the previous best bound of $\widetilde O(T^{2/3})$ and answers an open question of Fishelson et al. [2025b]. In addition, our algorithm is efficient: it runs in $\mathsf{poly}(T)$ time. A key technical idea we develop to obtain this result is to discretize the unit interval into bins at multiple scales of granularity and simultaneously use all scales to make randomized predictions, which we call multi-scale binning and may be of independent interest. A direct corollary of our result is an efficient online algorithm for minimizing the calibration error for general elicitable properties. This result does not require the Lipschitzness assumption of the identification function needed in prior work, making it applicable to median calibration, for which we achieve the first $\widetilde O(\sqrt T)$ calibration error guarantee.
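As a rough illustration of discretizing the unit interval at several granularities at once, the sketch below lists the bin containing a point at each scale. The dyadic widths $2^{-k}$, the half-open bins, and the name `dyadic_bins` are assumptions made for this example; the paper's actual bin construction and its randomized prediction rule are not reproduced here.

```python
def dyadic_bins(x, max_level):
    """Return (level, lo, hi) for the bin containing x at each level 0..max_level.

    Level k splits [0, 1] into 2**k equal bins of width 2**-k.
    This dyadic layout is an assumption of the sketch.
    """
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must lie in [0, 1]")
    bins = []
    for k in range(max_level + 1):
        n = 2 ** k
        # Clamp the index so that x = 1.0 falls into the last bin.
        idx = min(int(x * n), n - 1)
        bins.append((k, idx / n, (idx + 1) / n))
    return bins


# The point 0.3 lies in [0, 1) at level 0, [0, 0.5) at level 1, [0.25, 0.5) at level 2.
bins_for_point = dyadic_bins(0.3, 2)
```

Coarse levels give few, wide bins and fine levels give many, narrow bins; the sketch only shows the nesting of scales, not how the scales are combined when predicting.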

Paper Structure (30 sections, 20 theorems, 91 equations)

Introduction
Our Contributions
Implications to Calibration for Elicitable Properties
Proper Scoring Rules and Elicitable Properties.
Calibration Error for Elicitable Properties.
Technical Overview
V-shaped Decomposition.
Non-constructive Proof via the Minimax Theorem.
Binning.
Truthful Predictor.
Sampling error and rounding error.
Challenge with Fixed Binning.
Multi-scale Binning.
Efficient Algorithm via Multi-Objective Learning.
Related Work
...and 15 more sections

Key Result

Theorem 1.2

For every positive integer $T \ge 2$, there exists a prediction strategy for def:main that guarantees $\mathbb E[\mathsf{SR}] = O(\sqrt T\log T)$. Moreover, there exists such a prediction strategy that runs in $\mathsf{poly}(T)$ time.

Theorems & Definitions (30)

Theorem 1.2
Corollary 1.3
Lemma 1.4
Lemma 2.1: opt-scoring-ruleucal
proof
Definition 3.1
Lemma 3.2
Lemma 3.3
proof: Proof of \ref{lm:width}
Lemma 3.4
...and 20 more
