Do Neural Networks Lose Plasticity in a Gradually Changing World?
Tianhui Liu, Lili Mou
TL;DR
This work argues that loss of plasticity in continual learning largely stems from abrupt task shifts rather than intrinsic limitations of neural networks. It introduces a Gradually Changing Environment using input/output interpolation and task sampling to simulate smooth distribution shifts, backed by theoretical analysis under standard smoothness and local convexity assumptions. Empirically, the approach preserves trainability and generalization across vision benchmarks and language tasks, often matching or surpassing traditional abrupt-change mitigations. The findings offer a realistic, robust framework for real-world continual learning with reduced need for extensive hyperparameter tuning and complex regularization strategies.
Abstract
Continual learning has become a trending topic in machine learning. Recent studies have discovered an interesting phenomenon called loss of plasticity, referring to neural networks gradually losing the ability to learn new tasks. However, existing plasticity research largely relies on contrived settings with abrupt task transitions, which often do not reflect real-world environments. In this paper, we propose to investigate a gradually changing environment, and we simulate this by input/output interpolation and task sampling. We perform theoretical and empirical analysis, showing that the loss of plasticity is an artifact of abrupt tasks changes in the environment and can be largely mitigated if the world changes gradually.
