Generalization bounds for mixing processes via delayed online-to-PAC conversions

Baptiste Abeles; Eugenio Clerico; Gergely Neu

Generalization bounds for mixing processes via delayed online-to-PAC conversions

Baptiste Abeles, Eugenio Clerico, Gergely Neu

TL;DR

This work addresses the challenge of bounding generalization error when training data come from a stationary mixing process, i.e., non-i.i.d. data with temporal dependence. It develops a general online-to-PAC conversion framework with delayed feedback, combining Yu’s blocking and the Lugosi online-to-PAC approach under a weak $\phi_d$-mixing assumption to yield high-probability generalization bounds. The authors derive explicit bounds for geometrically and algebraically mixing processes, and extend the framework to delayed variants of EWA and FTRL, as well as to dynamic hypotheses whose loss depends on memory. The results provide near-i.i.d.-like rates in several regimes, offer PAC-Bayesian-style guarantees for dependent data, and apply to time-series and dynamical-system predictors, broadening the scope of robust generalization analysis beyond i.i.d. settings.

Abstract

We study the generalization error of statistical learning algorithms in a non-i.i.d. setting, where the training data is sampled from a stationary mixing process. We develop an analytic framework for this scenario based on a reduction to online learning with delayed feedback. In particular, we show that the existence of an online learning algorithm with bounded regret (against a fixed statistical learning algorithm in a specially constructed game of online learning with delayed feedback) implies low generalization error of said statistical learning method even if the data sequence is sampled from a mixing time series. The rates demonstrate a trade-off between the amount of delay in the online learning game and the degree of dependence between consecutive data points, with near-optimal rates recovered in a number of well-studied settings when the delay is tuned appropriately as a function of the mixing time of the process.

Generalization bounds for mixing processes via delayed online-to-PAC conversions

TL;DR

-mixing assumption to yield high-probability generalization bounds. The authors derive explicit bounds for geometrically and algebraically mixing processes, and extend the framework to delayed variants of EWA and FTRL, as well as to dynamic hypotheses whose loss depends on memory. The results provide near-i.i.d.-like rates in several regimes, offer PAC-Bayesian-style guarantees for dependent data, and apply to time-series and dynamical-system predictors, broadening the scope of robust generalization analysis beyond i.i.d. settings.

Abstract

Paper Structure (20 sections, 15 theorems, 37 equations)

This paper contains 20 sections, 15 theorems, 37 equations.

Introduction
Preliminaries
Proving generalization bounds via online learning
Online-to-PAC conversions for i.i.d. data
Online-to-PAC conversions for non-i.i.d. data
New generalization bounds for non-i.i.d. data
Regret bounds for delayed online learning
Geometric and algebraic mixing
Multiplicative weights with delay
Follow the regularized leader with delay
Generalization bounds for dynamic hypotheses
Conclusion
Omitted proofs
The proof of Theorem \ref{['theorem:: regret_decomposition']}
Proof of lemma \ref{['lemma:: d-block bound']}
...and 5 more sections

Key Result

Theorem 1

With the notation introduced above,

Theorems & Definitions (17)

Theorem 1: Theorem 1 in lugosi2023online; see appendix \ref{['proof:regret_decomposition']}
Theorem 2: Corollary $6$ in lugosi2023online
Definition 3: Generalization game with delay
Lemma 4
Proposition 5: Bound in expectation
Theorem 6: Bound in probability
Lemma 7
Lemma 8: weinberger2002delayed
Definition 9
Corollary 10
...and 7 more

Generalization bounds for mixing processes via delayed online-to-PAC conversions

TL;DR

Abstract

Generalization bounds for mixing processes via delayed online-to-PAC conversions

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (17)