Table of Contents
Fetching ...

Online-to-PAC generalization bounds under graph-mixing dependencies

Baptiste Abélès, Eugenio Clerico, Gergely Neu

TL;DR

This work derives generalization bounds leveraging the online-to-PAC framework, by deriving a concentration result and introducing an online learning framework incorporating the graph structure, where dependencies decay with graph distance.

Abstract

Traditional generalization results in statistical learning require a training data set made of independently drawn examples. Most of the recent efforts to relax this independence assumption have considered either purely temporal (mixing) dependencies, or graph-dependencies, where non-adjacent vertices correspond to independent random variables. Both approaches have their own limitations, the former requiring a temporal ordered structure, and the latter lacking a way to quantify the strength of inter-dependencies. In this work, we bridge these two lines of work by proposing a framework where dependencies decay with graph distance. We derive generalization bounds leveraging the online-to-PAC framework, by deriving a concentration result and introducing an online learning framework incorporating the graph structure. The resulting high-probability generalization guarantees depend on both the mixing rate and the graph's chromatic number.

Online-to-PAC generalization bounds under graph-mixing dependencies

TL;DR

This work derives generalization bounds leveraging the online-to-PAC framework, by deriving a concentration result and introducing an online learning framework incorporating the graph structure, where dependencies decay with graph distance.

Abstract

Traditional generalization results in statistical learning require a training data set made of independently drawn examples. Most of the recent efforts to relax this independence assumption have considered either purely temporal (mixing) dependencies, or graph-dependencies, where non-adjacent vertices correspond to independent random variables. Both approaches have their own limitations, the former requiring a temporal ordered structure, and the latter lacking a way to quantify the strength of inter-dependencies. In this work, we bridge these two lines of work by proposing a framework where dependencies decay with graph distance. We derive generalization bounds leveraging the online-to-PAC framework, by deriving a concentration result and introducing an online learning framework incorporating the graph structure. The resulting high-probability generalization guarantees depend on both the mixing rate and the graph's chromatic number.

Paper Structure

This paper contains 17 sections, 6 theorems, 29 equations.

Key Result

Theorem 1

Fix any online strategy $\Pi$ for the generalization game. Any statistical learning algorithm $\widehat{\mathrm{P}}_n=\mathcal{A}(S_n)$ satisfies where $M_{\Pi,n} = \sum_{t=1}^n\langle\pi_t, g_t\rangle$.

Theorems & Definitions (16)

  • Definition 1: Generalization game
  • Theorem 1: Theorem 1, lugosi2023online
  • Corollary 1: Corollary 6, lugosi2023online
  • Definition 2
  • Definition 3
  • Definition 4
  • Definition 5
  • Definition 6
  • Proposition 1
  • Definition 7
  • ...and 6 more