Can We Understand Plasticity Through Neural Collapse?
Guglielmo Bonifazi, Iason Chalas, Gian Hess, Jakub Łucki
TL;DR
The paper investigates whether neural collapse (NC) and plasticity loss (PL) co-occur in deep networks under non-stationary objectives. It quantifies NC, especially the NC1 metric defined as $NC1 = Tr(\Sigma_W \Sigma_B^\dag / C)$, across two continual-learning setups: Permuted MNIST with an MLP and warm-starting CIFAR-10 with a ResNet-18. The results show a context-dependent relationship: in Permuted MNIST, NC1 strengthens as tasks accumulate but is strongly negatively correlated with PL (r = -0.94), while in warm-starting, an early NC–PL correlation exists that wanes over time; importantly, NC1 regularization during warm-up can improve both warm-up and full-dataset accuracies. The findings suggest that NC can contribute to PL in some regimes and that targeted NC-based regularization offers a practical way to mitigate PL, informing strategies for continual learning under changing objectives.
Abstract
This paper explores the connection between two recently identified phenomena in deep learning: plasticity loss and neural collapse. We analyze their correlation in different scenarios, revealing a significant association during the initial training phase on the first task. Additionally, we introduce a regularization approach to mitigate neural collapse, demonstrating its effectiveness in alleviating plasticity loss in this specific setting.
