From Average Sensitivity to Small-Loss Regret Bounds under Random-Order Model

Shinsaku Sakaue; Yuichi Yoshida

From Average Sensitivity to Small-Loss Regret Bounds under Random-Order Model

Shinsaku Sakaue, Yuichi Yoshida

TL;DR

This work analyzes online learning under the random-order model by tying offline $(1+\varepsilon)$-approximation with online regret through a batch-to-online transformation governed by average sensitivity. The authors introduce an adaptive schedule for the approximation parameter $\varepsilon_t$, leveraging Fenchel duality to obtain small-loss regret bounds of the form $\tilde{O}(\varphi^{\star}(\mathrm{OPT}_T))$, where $\mathrm{OPT}_T$ is the offline optimum; this avoids requiring smoothness or other regularity on losses. They show this framework recovers and strengthens prior $(1+\varepsilon)$-regret results and applies to online $k$-means, online low-rank approximation, online regression, and online submodular function minimization using sparsifiers, yielding bounds that scale sublinearly with $\mathrm{OPT}_T$. A key technical contribution is the adaptive AdaGrad-like tuning of $\varepsilon_t$, which balances stability and accuracy to achieve first-order small-loss guarantees. The results highlight sparsification and coreset techniques as powerful tools for online learning in random-order settings and delineate the limits of small-loss bounds against adaptive adversaries.

Abstract

We study online learning in the random-order model, where the multiset of loss functions is chosen adversarially but revealed in a uniformly random order. Building on the batch-to-online conversion by Dong and Yoshida (2023), we show that if an offline algorithm admits a $(1+\varepsilon)$-approximation guarantee and the effect of $\varepsilon$ on its average sensitivity is characterized by a function $\varphi(\varepsilon)$, then an adaptive choice of $\varepsilon$ yields a small-loss regret bound of $\tilde O(\varphi^{\star}(\mathrm{OPT}_T))$, where $\varphi^{\star}$ is the concave conjugate of $\varphi$, $\mathrm{OPT}_T$ is the offline optimum over $T$ rounds, and $\tilde O$ hides polylogarithmic factors in $T$. Our method requires no regularity assumptions on loss functions, such as smoothness, and can be viewed as a generalization of the AdaGrad-style tuning applied to the approximation parameter $\varepsilon$. Our result recovers and strengthens the $(1+\varepsilon)$-approximate regret bounds of Dong and Yoshida (2023) and yields small-loss regret bounds for online $k$-means clustering, low-rank approximation, and regression. We further apply our framework to online submodular function minimization using $(1\pm\varepsilon)$-cut sparsifiers of submodular hypergraphs, obtaining a small-loss regret bound of $\tilde O(n^{3/4}(1 + \mathrm{OPT}_T^{3/4}))$, where $n$ is the ground-set size. Our approach sheds light on the power of sparsification and related techniques in establishing small-loss regret bounds in the random-order model.

From Average Sensitivity to Small-Loss Regret Bounds under Random-Order Model

TL;DR

This work analyzes online learning under the random-order model by tying offline

-approximation with online regret through a batch-to-online transformation governed by average sensitivity. The authors introduce an adaptive schedule for the approximation parameter

, leveraging Fenchel duality to obtain small-loss regret bounds of the form

, where

is the offline optimum; this avoids requiring smoothness or other regularity on losses. They show this framework recovers and strengthens prior

-regret results and applies to online

-means, online low-rank approximation, online regression, and online submodular function minimization using sparsifiers, yielding bounds that scale sublinearly with

. A key technical contribution is the adaptive AdaGrad-like tuning of

, which balances stability and accuracy to achieve first-order small-loss guarantees. The results highlight sparsification and coreset techniques as powerful tools for online learning in random-order settings and delineate the limits of small-loss bounds against adaptive adversaries.

Abstract

-approximation guarantee and the effect of

on its average sensitivity is characterized by a function

, then an adaptive choice of

yields a small-loss regret bound of

, where

is the concave conjugate of

is the offline optimum over

rounds, and

hides polylogarithmic factors in

. Our method requires no regularity assumptions on loss functions, such as smoothness, and can be viewed as a generalization of the AdaGrad-style tuning applied to the approximation parameter

. Our result recovers and strengthens the

-approximate regret bounds of Dong and Yoshida (2023) and yields small-loss regret bounds for online

-means clustering, low-rank approximation, and regression. We further apply our framework to online submodular function minimization using

-cut sparsifiers of submodular hypergraphs, obtaining a small-loss regret bound of

, where

is the ground-set size. Our approach sheds light on the power of sparsification and related techniques in establishing small-loss regret bounds in the random-order model.

Paper Structure (61 sections, 19 theorems, 116 equations, 1 algorithm)

This paper contains 61 sections, 19 theorems, 116 equations, 1 algorithm.

Introduction
Our Contribution
Related Work
Small-loss regret bounds.
Random-order model.
Duality in regret analysis.
Preliminaries
Average Sensitivity
Approximation--sensitivity trade-off.
Batch-to-Online Conversion
Small-Loss Regret Bound via Adaptive Control of $\varepsilon_t$
Adaptive Control of $\varepsilon_t$
Small-Loss Regret Bound
Recovering the $(1+\varepsilon)$-regret bound.
Connection to AdaGrad
...and 46 more sections

Key Result

Lemma 3.1

[lemma]lem:onestep Let $t \in \{1,\dots,T-1\}$ and $\theta_{t+1}\in\Theta$ denote the output of $\mathcal{A}$ in alg:bto. Then, it holds that

Theorems & Definitions (38)

Definition 2.1: Total variation
Definition 2.2: Average sensitivity
Lemma 3.1: Single-step bound
Lemma 3.2: Bound on sum of average optimal values
Proposition 3.3: Regret decomposition
Definition 4.1: Concave conjugate
Lemma 4.2: Adaptive control of $\varepsilon_t$
proof : Proof of Lemma \ref{['lem:exact-telescope']}
Theorem 4.3: Small-loss regret bound
proof
...and 28 more

From Average Sensitivity to Small-Loss Regret Bounds under Random-Order Model

TL;DR

Abstract

From Average Sensitivity to Small-Loss Regret Bounds under Random-Order Model

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (38)