Table of Contents
Fetching ...

Structured Prediction in Online Learning

Pierre Boudart, Alessandro Rudi, Pierre Gaillard

TL;DR

This work addresses online structured prediction where outputs lack a vector structure by introducing OSKAAR, an online algorithm built on Implicit Loss Embedding (ILE) and KAAR that achieves regret bounds tied to the RKHS effective dimension and interpolation error. It also develops SALAMI, a non-stationary extension that uses expert aggregation and restart mechanisms to handle changing environments, with regret rates depending on measures of distributional variation and under capacity conditions. The paper provides both worst-case and high-probability guarantees, along with online-to-batch conversions yielding competitive excess-risk bounds in the IID setting. Together, these results enable principled, scalable online guarantees for complex structured prediction tasks in non-i.i.d. and adversarial contexts, with practical strategies for tuning and computation.</p>

Abstract

We study a theoretical and algorithmic framework for structured prediction in the online learning setting. The problem of structured prediction, i.e. estimating function where the output space lacks a vectorial structure, is well studied in the literature of supervised statistical learning. We show that our algorithm is a generalisation of optimal algorithms from the supervised learning setting, and achieves the same excess risk upper bound also when data are not i.i.d. Moreover, we consider a second algorithm designed especially for non-stationary data distributions, including adversarial data. We bound its stochastic regret in function of the variation of the data distributions.

Structured Prediction in Online Learning

TL;DR

This work addresses online structured prediction where outputs lack a vector structure by introducing OSKAAR, an online algorithm built on Implicit Loss Embedding (ILE) and KAAR that achieves regret bounds tied to the RKHS effective dimension and interpolation error. It also develops SALAMI, a non-stationary extension that uses expert aggregation and restart mechanisms to handle changing environments, with regret rates depending on measures of distributional variation and under capacity conditions. The paper provides both worst-case and high-probability guarantees, along with online-to-batch conversions yielding competitive excess-risk bounds in the IID setting. Together, these results enable principled, scalable online guarantees for complex structured prediction tasks in non-i.i.d. and adversarial contexts, with practical strategies for tuning and computation.</p>

Abstract

We study a theoretical and algorithmic framework for structured prediction in the online learning setting. The problem of structured prediction, i.e. estimating function where the output space lacks a vectorial structure, is well studied in the literature of supervised statistical learning. We show that our algorithm is a generalisation of optimal algorithms from the supervised learning setting, and achieves the same excess risk upper bound also when data are not i.i.d. Moreover, we consider a second algorithm designed especially for non-stationary data distributions, including adversarial data. We bound its stochastic regret in function of the variation of the data distributions.
Paper Structure (34 sections, 12 theorems, 153 equations, 4 algorithms)

This paper contains 34 sections, 12 theorems, 153 equations, 4 algorithms.

Key Result

Lemma 0

Let $(f_t)_t$ and $(\hat{g}_t)_t$ be defined as in eq:f_t and eq:hat g_t respectively. Then we have

Theorems & Definitions (26)

  • Definition 1: ILE ILE
  • Lemma 0: Online Comparison Inequality
  • Theorem 1: Regret Bound of OSKAAR
  • Theorem 2: Expected Regret Bound
  • Corollary 2: Expected Regret Bound
  • Theorem 3: Expected Regret in a Non-Stationary Environment
  • Lemma 3: Online Comparison Inequality
  • proof
  • proof
  • Theorem 3: Regret Bound of OSKAAR
  • ...and 16 more