Variational Continual Learning
Cuong V. Nguyen, Yingzhen Li, Thang D. Bui, Richard E. Turner
TL;DR
VCL offers a principled, Bayesian framework for continual learning by performing online variational updates and optionally leveraging a small episodic memory (coresets) to guard against forgetting. It generalizes to both discriminative and generative deep models, yielding state-of-the-art results against standard baselines without hyperparameter tuning. The approach preserves uncertainty information and demonstrates strong long-term retention across sequences of tasks, with empirical gains shown on permuted/split MNIST and deep VAEs. The work also clarifies the relationship between online VI, Laplace propagation, and regularization methods, and suggests directions for richer memories and alternative approximate-inference schemes.
Abstract
This paper develops variational continual learning (VCL), a simple but general framework for continual learning that fuses online variational inference (VI) and recent advances in Monte Carlo VI for neural networks. The framework can successfully train both deep discriminative models and deep generative models in complex continual learning settings where existing tasks evolve over time and entirely new tasks emerge. Experimental results show that VCL outperforms state-of-the-art continual learning methods on a variety of tasks, avoiding catastrophic forgetting in a fully automatic way.
