Adaptive Bound Optimization for Online Convex Optimization

H. Brendan McMahan; Matthew Streeter

Adaptive Bound Optimization for Online Convex Optimization

H. Brendan McMahan, Matthew Streeter

TL;DR

This work addresses regret minimization in online convex optimization by introducing FTPRL, an algorithm that adaptively selects proximal regularizers centered at the current iterate. The main technical insight is a regret bound $\text{Regret} \le r_{1:T}(\mathring{x}) + \sum_{t=1}^T \|A_t^{-1} g_t\|^2$ with $A_t=(Q_{1:t})^{1/2}$, enabling competitive guarantees relative to the best post-hoc bound $B_R(\vec{Q_T},\vec{g_T})$. The paper develops several adaptive schemes (coordinate-constant, diagonal, and full-matrix regularization) showing improved, problem-dependent regret bounds on structured feasible sets such as hyperrectangles and ellipsoids, and introduces scale-transformations to extend results to general norm-balls via FTPRL-Scale. Practically, diagonal adaptation yields substantial gains for large-scale, sparse problems (e.g., text classification, CTR prediction) where per-coordinate learning rates align with feature frequency. The results highlight when adaptive regularization helps (notably for hyperrectangles) and point to future work on extending competitive guarantees to arbitrary feasible sets and richer regularization families.

Abstract

We introduce a new online convex optimization algorithm that adaptively chooses its regularization function based on the loss functions observed so far. This is in contrast to previous algorithms that use a fixed regularization function such as L2-squared, and modify it only via a single time-dependent parameter. Our algorithm's regret bounds are worst-case optimal, and for certain realistic classes of loss functions they are much better than existing bounds. These bounds are problem-dependent, which means they can exploit the structure of the actual problem instance. Critically, however, our algorithm does not need to know this structure in advance. Rather, we prove competitive guarantees that show the algorithm provides a bound within a constant factor of the best possible bound (of a certain functional form) in hindsight.

Adaptive Bound Optimization for Online Convex Optimization

TL;DR

Abstract

Adaptive Bound Optimization for Online Convex Optimization

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (26)