Contrastive Consolidation of Top-Down Modulations Achieves Sparsely Supervised Continual Learning
Viet Anh Khoa Tran, Emre Neftci, Willem A. M. Wybo
TL;DR
TMCL tackles continual representation learning under unlabeled data streams with sparse supervision by introducing top-down task modulations that orthogonalize new classes and consolidate representations through view- and modulation-invariant contrastive learning. The two-phase learning—orthogonalization of new class representations via learned modulations and consolidation into a stable, modulation-invariant space—yields strong performance on CIFAR-100 and transfer benchmarks, especially in label-scarce regimes, while maintaining robustness to noisy labels. TMCL extends beyond pretrain-finetune paradigms by using persistent modulations as a memory of task structure, enabling continual plasticity without catastrophic forgetting. The approach highlights the role of top-down modulations in balancing stability and plasticity and offers a bridge between biological learning principles and scalable machine learning for continual tasks.
Abstract
Biological brains learn continually from a stream of unlabeled data, while integrating specialized information from sparsely labeled examples without compromising their ability to generalize. Meanwhile, machine learning methods are susceptible to catastrophic forgetting in this natural learning setting, as supervised specialist fine-tuning degrades performance on the original task. We introduce task-modulated contrastive learning (TMCL), which takes inspiration from the biophysical machinery in the neocortex, using predictive coding principles to integrate top-down information continually and without supervision. We follow the idea that these principles build a view-invariant representation space, and that this can be implemented using a contrastive loss. Then, whenever labeled samples of a new class occur, new affine modulations are learned that improve separation of the new class from all others, without affecting feedforward weights. By co-opting the view-invariance learning mechanism, we then train feedforward weights to match the unmodulated representation of a data sample to its modulated counterparts. This introduces modulation invariance into the representation space, and, by also using past modulations, stabilizes it. Our experiments show improvements in both class-incremental and transfer learning over state-of-the-art unsupervised approaches, as well as over comparable supervised approaches, using as few as 1% of available labels. Taken together, our work suggests that top-down modulations play a crucial role in balancing stability and plasticity.
