Adaptive Variational Continual Learning via Task-Heuristic Modelling

Fan Yang

Adaptive Variational Continual Learning via Task-Heuristic Modelling

Fan Yang

TL;DR

AutoVCL extends GVCL by introducing a self-adjusted $\beta$-ELBO that adapts to each task through difficulty $d_t$ and similarity $s_t$ metrics. It defines $\beta_t = \exp\left(\lambda\left(\max(d_1,...,d_{t-1}) - \frac{d_t}{1+\delta_d(t-1)} + s_t\right)\right)$ to balance reconstruction versus regularization dynamically, and uses a multi-head architecture with shared parameters updated under $\beta$-ELBO. Across Split MNIST, Permuted MNIST, and mixed MNIST-CIFAR tasks, AutoVCL matches or outperforms fixed-$\beta$ GVCL settings, particularly in later tasks where adaptivity helps preserve prior knowledge while accommodating new tasks. This task-heuristics approach offers a practical, hyperparameter-free pathway to robust continual learning in varied task sequences, with avenues for integrating replay scheduling and smarter difficulty estimation in future work.

Abstract

Variational continual learning (VCL) is a turn-key learning algorithm that has state-of-the-art performance among the best continual learning models. In our work, we explore an extension of the generalized variational continual learning (GVCL) model, named AutoVCL, which combines task heuristics for informed learning and model optimization. We demonstrate that our model outperforms the standard GVCL with fixed hyperparameters, benefiting from the automatic adjustment of the hyperparameter based on the difficulty and similarity of the incoming task compared to the previous tasks.

Adaptive Variational Continual Learning via Task-Heuristic Modelling

TL;DR

AutoVCL extends GVCL by introducing a self-adjusted

-ELBO that adapts to each task through difficulty

and similarity

metrics. It defines

to balance reconstruction versus regularization dynamically, and uses a multi-head architecture with shared parameters updated under

-ELBO. Across Split MNIST, Permuted MNIST, and mixed MNIST-CIFAR tasks, AutoVCL matches or outperforms fixed-

GVCL settings, particularly in later tasks where adaptivity helps preserve prior knowledge while accommodating new tasks. This task-heuristics approach offers a practical, hyperparameter-free pathway to robust continual learning in varied task sequences, with avenues for integrating replay scheduling and smarter difficulty estimation in future work.

Abstract

Paper Structure (20 sections, 5 equations, 2 figures, 3 tables)

This paper contains 20 sections, 5 equations, 2 figures, 3 tables.

Introduction
Variational Continual Learning (VCL)
Related work
Continual Learning by modelling Intra-Class Variation (modellingtask)
Generalized Variational Continual Learning (loo2020generalized)
Proposed approach
Theory
Quantification of the metrics
Formulation of $\beta_t$
Experiments
Split MNIST with Custom Targets:
Permuted MNIST:
Mixed Dataset of Split-CIFAR-10 and Split-MNIST
Discussion
Conclusion
...and 5 more sections

Figures (2)

Figure 1: Comparison between AutoVCL and fixed $\beta$ values in three experiments. Split MNIST with Custom Targets, Permuted MNIST, and Mixed Dataset of Split-CIFAR-10 and Split-MNIST respectively represent sequences of similar tasks, different tasks, and alternating difficulty tasks.
Figure 2: Comparison between tasks 0/1 and 2/3 in the split MNIST experiment. Split 2/3 appears to be a harder task than split 0/1 as its prediction accuracy drops quickly.

Adaptive Variational Continual Learning via Task-Heuristic Modelling

TL;DR

Abstract

Adaptive Variational Continual Learning via Task-Heuristic Modelling

Authors

TL;DR

Abstract

Table of Contents

Figures (2)