Task Agnostic Continual Learning Using Online Variational Bayes

Chen Zeno; Itay Golan; Elad Hoffer; Daniel Soudry

Task Agnostic Continual Learning Using Online Variational Bayes

Chen Zeno, Itay Golan, Elad Hoffer, Daniel Soudry

TL;DR

This work tackles catastrophic forgetting in scenarios where task boundaries are unknown by introducing Bayesian Gradient Descent (BGD), an online variational Bayes method with a diagonal Gaussian weight posterior. BGD updates the weight distribution in a closed-form manner, tying learning rates to parameter uncertainty and leveraging sequential posterior updates without memory of past tasks. The authors provide a formal taxonomy for continual-learning scenarios, introduce the labels trick to improve class-learning, and demonstrate competitive performance across continuous and discrete task-agnostic settings on MNIST and CIFAR datasets. The approach offers a practical, scalable Bayesian framework for task-agnostic continual learning and highlights avenues for extending posterior structure beyond diagonal Gaussians.

Abstract

Catastrophic forgetting is the notorious vulnerability of neural networks to the change of the data distribution while learning. This phenomenon has long been considered a major obstacle for allowing the use of learning agents in realistic continual learning settings. A large body of continual learning research assumes that task boundaries are known during training. However, research for scenarios in which task boundaries are unknown during training has been lacking. In this paper we present, for the first time, a method for preventing catastrophic forgetting (BGD) for scenarios with task boundaries that are unknown during training --- task-agnostic continual learning. Code of our algorithm is available at https://github.com/igolan/bgd.

Task Agnostic Continual Learning Using Online Variational Bayes

TL;DR

Abstract

Task Agnostic Continual Learning Using Online Variational Bayes

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (14)

Theorems & Definitions (3)