Harnessing Neural Unit Dynamics for Effective and Scalable Class-Incremental Learning
Depeng Li, Tianqi Wang, Junwei Chen, Wei Dai, Zhigang Zeng
TL;DR
This work tackles catastrophic forgetting in class-incremental learning by introducing AutoActivator, a network that dynamically grows only as needed per task under a supervisory mechanism. New neural units are recruited to maximize residual-error reduction, while an activation-threshold scheme enables task-agnostic reactivation during inference to prevent interference. The approach is underpinned by a universal approximation theorem guaranteeing convergence over sequential tasks and offers rehearsal-free, scalable expansion across diverse backbones and datasets. Empirical results across MNIST, FashionMNIST, CIFAR-100, and ImageNet-R show competitive or superior accuracy with minimal memory growth and zero or minimal forgetting, highlighting practical utility for privacy-sensitive and resource-constrained settings.
Abstract
Class-incremental learning (CIL) aims to train a model to learn new classes from non-stationary data streams without forgetting old ones. In this paper, we propose a new kind of connectionist model by tailoring neural unit dynamics that adapt the behavior of neural networks for CIL. In each training session, it introduces a supervisory mechanism to guide network expansion whose growth size is compactly commensurate with the intrinsic complexity of a newly arriving task. This constructs a near-minimal network while allowing the model to expand its capacity when cannot sufficiently hold new classes. At inference time, it automatically reactivates the required neural units to retrieve knowledge and leaves the remaining inactivated to prevent interference. We name our model AutoActivator, which is effective and scalable. To gain insights into the neural unit dynamics, we theoretically analyze the model's convergence property via a universal approximation theorem on learning sequential mappings, which is under-explored in the CIL community. Experiments show that our method achieves strong CIL performance in rehearsal-free and minimal-expansion settings with different backbones.
