Fully Unconstrained Online Learning
Ashok Cutkosky, Zakaria Mhammedi
TL;DR
The paper addresses the challenge of fully unconstrained online learning for convex losses, aiming to achieve sublinear regret without prior knowledge of the loss sequence magnitude or the comparator norm. It introduces a parameter-free algorithm that achieves a near-optimal regret bound by combining magnitude hints with a small regularization term cast through an epigraph constraint, and then reducing the general problem to a 1D setting via standard dimensionality reductions. The approach balances a tractable, data-driven regularization schedule with careful reductions to obtain regret that matches the $G\|w_\star\|\sqrt{T}$ rate up to logarithmic factors in all practically interesting cases. The work also explores generalizations to other regularizers, provides lower bounds showing near tightness, and discusses extensions to stochastic optimization and adaptivity, highlighting both the theoretical significance and potential for robust, parameter-free online learning in practice.
Abstract
We provide an online learning algorithm that obtains regret $G\|w_\star\|\sqrt{T\log(\|w_\star\|G\sqrt{T})} + \|w_\star\|^2 + G^2$ on $G$-Lipschitz convex losses for any comparison point $w_\star$ without knowing either $G$ or $\|w_\star\|$. Importantly, this matches the optimal bound $G\|w_\star\|\sqrt{T}$ available with such knowledge (up to logarithmic factors), unless either $\|w_\star\|$ or $G$ is so large that even $G\|w_\star\|\sqrt{T}$ is roughly linear in $T$. Thus, it matches the optimal bound in all cases in which one can achieve sublinear regret, which arguably most "interesting" scenarios.
