Asymptotic analysis of shallow and deep forgetting in replay with Neural Collapse
Giulia Lanzillotta, Damiano Meier, Thomas Hofmann
TL;DR
This work analyzes why replay buffers in continual learning preserve internal feature geometry far better than they preserve alignment between the learned head and the population distribution. By extending Neural Collapse to sequential training, it shows that small buffers robustly anchor feature-space geometry (preventing deep forgetting) while shallow forgetting requires much larger buffers, with the gap depending on head architecture (single-head vs multi-head) and the task regime. Introducing an OOD-centered perspective, the authors derive a mixture model for replay that interpolates between NC-like and OOD representations, and prove that any non-zero replay preserves asymptotic separability in the NC subspace. The findings reveal a fundamental replay efficiency gap and suggest that correcting NC-induced statistical artifacts could enable robust performance with minimal replay, informing buffer sizing and training regimens for practical continual-learning systems.
Abstract
A persistent paradox in continual learning (CL) is that neural networks often retain linearly separable representations of past tasks even when their output predictions fail. We formalize this distinction as the gap between deep feature-space and shallow classifier-level forgetting. We reveal a critical asymmetry in Experience Replay: while minimal buffers successfully anchor feature geometry and prevent deep forgetting, mitigating shallow forgetting typically requires substantially larger buffer capacities. To explain this, we extend the Neural Collapse framework to the sequential setting. We characterize deep forgetting as a geometric drift toward out-of-distribution subspaces and prove that any non-zero replay fraction asymptotically guarantees the retention of linear separability. Conversely, we identify that the "strong collapse" induced by small buffers leads to rank-deficient covariances and inflated class means, effectively blinding the classifier to true population boundaries. By unifying CL with out-of-distribution detection, our work challenges the prevailing reliance on large buffers, suggesting that explicitly correcting these statistical artifacts could unlock robust performance with minimal replay.
