Task Agnostic Continual Learning Using Online Variational Bayes
Chen Zeno, Itay Golan, Elad Hoffer, Daniel Soudry
TL;DR
This work tackles catastrophic forgetting in scenarios where task boundaries are unknown by introducing Bayesian Gradient Descent (BGD), an online variational Bayes method with a diagonal Gaussian weight posterior. BGD updates the weight distribution in a closed-form manner, tying learning rates to parameter uncertainty and leveraging sequential posterior updates without memory of past tasks. The authors provide a formal taxonomy for continual-learning scenarios, introduce the labels trick to improve class-learning, and demonstrate competitive performance across continuous and discrete task-agnostic settings on MNIST and CIFAR datasets. The approach offers a practical, scalable Bayesian framework for task-agnostic continual learning and highlights avenues for extending posterior structure beyond diagonal Gaussians.
Abstract
Catastrophic forgetting is the notorious vulnerability of neural networks to the change of the data distribution while learning. This phenomenon has long been considered a major obstacle for allowing the use of learning agents in realistic continual learning settings. A large body of continual learning research assumes that task boundaries are known during training. However, research for scenarios in which task boundaries are unknown during training has been lacking. In this paper we present, for the first time, a method for preventing catastrophic forgetting (BGD) for scenarios with task boundaries that are unknown during training --- task-agnostic continual learning. Code of our algorithm is available at https://github.com/igolan/bgd.
