Table of Contents
Fetching ...

Computing Optimal Regularizers for Online Linear Optimization

Khashayar Gatmiry, Jon Schneider, Stefanie Jegelka

TL;DR

An algorithm is presented that takes as input convex and symmetric action sets and loss sets for a specific OLO instance, and outputs a regularizer such that running FTRL with this regularizer guarantees regret within a universal constant factor of the best possible regret bound.

Abstract

Follow-the-Regularized-Leader (FTRL) algorithms are a popular class of learning algorithms for online linear optimization (OLO) that guarantee sub-linear regret, but the choice of regularizer can significantly impact dimension-dependent factors in the regret bound. We present an algorithm that takes as input convex and symmetric action sets and loss sets for a specific OLO instance, and outputs a regularizer such that running FTRL with this regularizer guarantees regret within a universal constant factor of the best possible regret bound. In particular, for any choice of (convex, symmetric) action set and loss set we prove that there exists an instantiation of FTRL which achieves regret within a constant factor of the best possible learning algorithm, strengthening the universality result of Srebro et al., 2011. Our algorithm requires preprocessing time and space exponential in the dimension $d$ of the OLO instance, but can be run efficiently online assuming a membership and linear optimization oracle for the action and loss sets, respectively (and is fully polynomial time for the case of constant dimension $d$). We complement this with a lower bound showing that even deciding whether a given regularizer is $α$-strongly-convex with respect to a given norm is NP-hard.

Computing Optimal Regularizers for Online Linear Optimization

TL;DR

An algorithm is presented that takes as input convex and symmetric action sets and loss sets for a specific OLO instance, and outputs a regularizer such that running FTRL with this regularizer guarantees regret within a universal constant factor of the best possible regret bound.

Abstract

Follow-the-Regularized-Leader (FTRL) algorithms are a popular class of learning algorithms for online linear optimization (OLO) that guarantee sub-linear regret, but the choice of regularizer can significantly impact dimension-dependent factors in the regret bound. We present an algorithm that takes as input convex and symmetric action sets and loss sets for a specific OLO instance, and outputs a regularizer such that running FTRL with this regularizer guarantees regret within a universal constant factor of the best possible regret bound. In particular, for any choice of (convex, symmetric) action set and loss set we prove that there exists an instantiation of FTRL which achieves regret within a constant factor of the best possible learning algorithm, strengthening the universality result of Srebro et al., 2011. Our algorithm requires preprocessing time and space exponential in the dimension of the OLO instance, but can be run efficiently online assuming a membership and linear optimization oracle for the action and loss sets, respectively (and is fully polynomial time for the case of constant dimension ). We complement this with a lower bound showing that even deciding whether a given regularizer is -strongly-convex with respect to a given norm is NP-hard.

Paper Structure

This paper contains 27 sections, 28 theorems, 148 equations.

Key Result

Theorem 1

Given access to a linear optimization oracle for $\mathcal{L}$, which can minimize any linear function $c^\top x$ over $\mathcal{L}$ up to accuracy $\delta_{\mathrm{lin}}$ in time $\textsc{LinO}_{\mathcal{L}}\left(\delta_{\mathrm{lin}}\right)$, there is a cutting-plane algorithm that runs in time $\ Furthermore, given access to a membership oracle to $\mathcal{X}$ and the regularizer $g$ (which ca

Theorems & Definitions (54)

  • Definition 1
  • Definition 2
  • Definition 3
  • Theorem 1: Algorithmic optimal online linear optimization
  • Theorem 2
  • Theorem 3: Existence of smooth regularizer
  • proof
  • Lemma 1: estimating $f$ by the approximator
  • Definition 4
  • Lemma 2: Convex program feasibility $\rightarrow$ Locality of regularizer $g$
  • ...and 44 more