Structured Prediction in Online Learning

Pierre Boudart; Alessandro Rudi; Pierre Gaillard

Structured Prediction in Online Learning

Pierre Boudart, Alessandro Rudi, Pierre Gaillard

TL;DR

This work addresses online structured prediction where outputs lack a vector structure by introducing OSKAAR, an online algorithm built on Implicit Loss Embedding (ILE) and KAAR that achieves regret bounds tied to the RKHS effective dimension and interpolation error. It also develops SALAMI, a non-stationary extension that uses expert aggregation and restart mechanisms to handle changing environments, with regret rates depending on measures of distributional variation and under capacity conditions. The paper provides both worst-case and high-probability guarantees, along with online-to-batch conversions yielding competitive excess-risk bounds in the IID setting. Together, these results enable principled, scalable online guarantees for complex structured prediction tasks in non-i.i.d. and adversarial contexts, with practical strategies for tuning and computation.</p>

Abstract

We study a theoretical and algorithmic framework for structured prediction in the online learning setting. The problem of structured prediction, i.e. estimating function where the output space lacks a vectorial structure, is well studied in the literature of supervised statistical learning. We show that our algorithm is a generalisation of optimal algorithms from the supervised learning setting, and achieves the same excess risk upper bound also when data are not i.i.d. Moreover, we consider a second algorithm designed especially for non-stationary data distributions, including adversarial data. We bound its stochastic regret in function of the variation of the data distributions.

Structured Prediction in Online Learning

TL;DR

Abstract

Paper Structure (34 sections, 12 theorems, 153 equations, 4 algorithms)

This paper contains 34 sections, 12 theorems, 153 equations, 4 algorithms.

Introduction
Contributions
Paper outline
Problem Setting and Background
Online Learning Framework
Structured Prediction
Standard Approach
Existing results in the batch statistical framework
Limitations of previous works
A General Algorithm for Online Structured Prediction
Introducing our Algorithm: OSKAAR
Regret Bound of OSKAAR
Computation time
Stochastic Regret Bounds
High probability regret bound
...and 19 more sections

Key Result

Lemma 0

Let $(f_t)_t$ and $(\hat{g}_t)_t$ be defined as in eq:f_t and eq:hat g_t respectively. Then we have

Theorems & Definitions (26)

Definition 1: ILE ILE
Lemma 0: Online Comparison Inequality
Theorem 1: Regret Bound of OSKAAR
Theorem 2: Expected Regret Bound
Corollary 2: Expected Regret Bound
Theorem 3: Expected Regret in a Non-Stationary Environment
Lemma 3: Online Comparison Inequality
proof
proof
Theorem 3: Regret Bound of OSKAAR
...and 16 more

Structured Prediction in Online Learning

TL;DR

Abstract

Structured Prediction in Online Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (26)