Online Linear Regression in Dynamic Environments via Discounting

Andrew Jacobsen; Ashok Cutkosky

Online Linear Regression in Dynamic Environments via Discounting

Andrew Jacobsen, Ashok Cutkosky

TL;DR

This work addresses online linear regression in dynamic, nonstationary environments without any prior knowledge. It introduces a discounted Vovk-Azoury-Warmuth forecaster that forgets old data with factor $\gamma$ and optionally uses hints $\tilde{y}_t$, achieving dynamic regret bounds of the form $R_T(\boldsymbol{u})=O\big(d\log T \;\vee\; \sqrt{d P_T^\gamma(\boldsymbol{u})T}\big)$ and small-loss variants. The authors prove a matching dimension-dependent lower bound, show that the optimal discount factor can be learned on the fly via a grid of experts and a clipping-based meta-algorithm, and extend the guarantees to strongly adaptive bounds that hold on every sub-interval. These contributions establish optimal, prior-knowledge-free guarantees for online regression in dynamic environments and pave the way for robust, nonstationary learning in unbounded domains. The results have potential impact on streaming analytics and adaptive decision-making where data distributions drift over time.

Abstract

We develop algorithms for online linear regression which achieve optimal static and dynamic regret guarantees \emph{even in the complete absence of prior knowledge}. We present a novel analysis showing that a discounted variant of the Vovk-Azoury-Warmuth forecaster achieves dynamic regret of the form $R_{T}(\vec{u})\le O\left(d\log(T)\vee \sqrt{dP_{T}^γ(\vec{u})T}\right)$, where $P_{T}^γ(\vec{u})$ is a measure of variability of the comparator sequence, and show that the discount factor achieving this result can be learned on-the-fly. We show that this result is optimal by providing a matching lower bound. We also extend our results to \emph{strongly-adaptive} guarantees which hold over every sub-interval $[a,b]\subseteq[1,T]$ simultaneously.

Online Linear Regression in Dynamic Environments via Discounting

TL;DR

and optionally uses hints

, achieving dynamic regret bounds of the form

and small-loss variants. The authors prove a matching dimension-dependent lower bound, show that the optimal discount factor can be learned on the fly via a grid of experts and a clipping-based meta-algorithm, and extend the guarantees to strongly adaptive bounds that hold on every sub-interval. These contributions establish optimal, prior-knowledge-free guarantees for online regression in dynamic environments and pave the way for robust, nonstationary learning in unbounded domains. The results have potential impact on streaming analytics and adaptive decision-making where data distributions drift over time.

Abstract

, where

is a measure of variability of the comparator sequence, and show that the discount factor achieving this result can be learned on-the-fly. We show that this result is optimal by providing a matching lower bound. We also extend our results to \emph{strongly-adaptive} guarantees which hold over every sub-interval

simultaneously.

Paper Structure (35 sections, 35 theorems, 157 equations, 3 algorithms)

This paper contains 35 sections, 35 theorems, 157 equations, 3 algorithms.

Online Linear Regression
Related Works
Notations
The Vovk-Azoury-Warmuth Forecaster
Dynamic Regret via Discounting
Small-loss Bounds via Self-confident Predictions
Dimension-dependent Lower Bound
Learning the Optimal Discount Factor
Strongly-Adaptive Guarantees
Conclusion
Proofs for Section \ref{['sec:discounted-vaw']} (Dynamic Regret via Discounting)
Equivalence to FTRL and Mirror Descent
Proof of Theorem \ref{['thm:general-discounted-vaw']}
Proof of Lemma \ref{['lemma:discounted-vaw-decomp']}
Proof of Lemma \ref{['lemma:general-discounted-vaw-divergences']}
...and 20 more sections

Key Result

Theorem 2.1

For any $u\in\mathbb{R}^{d}$ and any sequences $(y_{t})_{t=1}^{T}$ in $\mathbb{R}$ and $(x_{t})_{t=1}^{T}$ in $\mathbb{R}^{d}$, the VAW forecaster guarantees

Theorems & Definitions (60)

Theorem 2.1
Theorem 3.1
Lemma 3.1
Theorem 3.2
Theorem 3.3
Theorem 3.4
Theorem 4.1
Theorem 4.2
Theorem 4.3
Proposition 1.0
...and 50 more

Online Linear Regression in Dynamic Environments via Discounting

TL;DR

Abstract

Online Linear Regression in Dynamic Environments via Discounting

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (60)