The Cost of Learning under Multiple Change Points

Tomer Gafni; Garud Iyengar; Assaf Zeevi

The Cost of Learning under Multiple Change Points

Tomer Gafni, Garud Iyengar, Assaf Zeevi

TL;DR

This paper studies online learning in non-stationary environments with multiple abrupt change points and identifies endogenous confounding as a key challenge that undermines detection when past data are not discarded. It proposes Anytime Tracking CUSUM (ATC), a horizon-free algorithm that selectively restarts to balance rapid adaptation to large shifts with stability during stationary periods. The authors prove a non-asymptotic upper bound on dynamic regret of order $O(\sigma^2 (S+1) \log T)$ and establish a matching information-theoretic lower bound of order $\Omega(\sigma^2 (S+1) \log(T/(S+1)))$, showing near-minimax optimality; they also quantify the confounding effect via SNR degradation and validate results on synthetic and NAB data. The work contributes a principled framework for learning under multiple change points, with implications for real-time demand tracking and adaptive control, and opens avenues for extending to higher dimensions and robustness to variance misspecification.

Abstract

We consider an online learning problem in environments with multiple change points. In contrast to the single change point problem that is widely studied using classical "high confidence" detection schemes, the multiple change point environment presents new learning-theoretic and algorithmic challenges. Specifically, we show that classical methods may exhibit catastrophic failure (high regret) due to a phenomenon we refer to as endogenous confounding. To overcome this, we propose a new class of learning algorithms dubbed Anytime Tracking CUSUM (ATC). These are horizon-free online algorithms that implement a selective detection principle, balancing the need to ignore "small" (hard-to-detect) shifts, while reacting "quickly" to significant ones. We prove that the performance of a properly tuned ATC algorithm is nearly minimax-optimal; its regret is guaranteed to closely match a novel information-theoretic lower bound on the achievable performance of any learning algorithm in the multiple change point problem. Experiments on synthetic as well as real-world data validate the aforementioned theoretical findings.

The Cost of Learning under Multiple Change Points

TL;DR

and establish a matching information-theoretic lower bound of order

, showing near-minimax optimality; they also quantify the confounding effect via SNR degradation and validate results on synthetic and NAB data. The work contributes a principled framework for learning under multiple change points, with implications for real-time demand tracking and adaptive control, and opens avenues for extending to higher dimensions and robustness to variance misspecification.

Abstract

Paper Structure (70 sections, 11 theorems, 224 equations, 11 figures, 1 algorithm)

This paper contains 70 sections, 11 theorems, 224 equations, 11 figures, 1 algorithm.

Introduction
Background
Main contributions
Related literature
Problem formulation
Illustrative Examples of the Model
The Anytime Tracking CUSUM (ATC) algorithm
Algorithm structure
Detection phase:
Prediction phase.
Balancing the stability-adaptivity trade-off
Stability.
Adaptivity.
SNR degradation due to missed detection
Computational considerations.
...and 55 more sections

Key Result

Lemma 3.1

Fix a restart time $r<\tau_{j-1}$. Under the ATC algorithm, for a change at $\tau_j$, and all times $\tau_j < t < \tau_{j+1}$, where $C>0$ is a universal constant.

Figures (11)

Figure 1: Effect of endogenous confounding on detection signal-to-noise ratio (SNR). Top: After a change at $t=20$, accumulating post-change samples increases the SNR of the detection statistic. Bottom: After the second change at $t=40$, failing to discard outdated samples from $f_0$ causes the reference statistic to be computed from a mixture of $f_0$ and $f_1$ (right), reducing statistical separability with respect to $\mu_2$, thereby degrading the SNR for detecting $f_2$.
Figure 2: Example of an online tracking instance with multiple change points. The top panel shows the underlying piecewise-stationary environment, while the bottom panel illustrates the evolution of the detection statistic, the decision thresholds, and the alarms raised by the algorithm.
Figure 3: Synthetic environment and regret scaling for ATC.
Figure 4: Cumulative regret on the NAB CPU dataset.
Figure 5: Dense change points and passive algorithms.
...and 6 more figures

Theorems & Definitions (25)

Lemma 3.1
Theorem 4.1
Theorem 4.2
Lemma 1.1: Regret upper bound decomposition
Lemma 1.2: High‑probability confidence bound
Lemma 1.3: Bias regret upper bound
Lemma 1.4: Expected number of blocks
Lemma 1.5: Variance regret upper bound
proof
proof
...and 15 more

The Cost of Learning under Multiple Change Points

TL;DR

Abstract

The Cost of Learning under Multiple Change Points

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (25)