Metalearning Continual Learning Algorithms
Kazuki Irie, Róbert Csordás, Jürgen Schmidhuber
TL;DR
The paper introduces Automated Continual Learning (ACL), a framework that meta-learns in-context continual-learning algorithms by training self-referential networks (SRWMs) to modify their own learning rules. By formulating continual learning as long-span sequence processing and optimizing a CL-desiderata-based objective, ACL automatically discovers CL algorithms that balance backward and forward transfer without replay memory. Empirical results show ACL can mitigate in-context catastrophic forgetting and outperform several hand-crafted and meta-CL baselines on Split-MNIST, with additional experiments across diverse datasets and task configurations. The work also discusses limitations related to domain generalization, scalability, and interpretability, and highlights the potential of scaling ACL with more diverse data and architectures.
Abstract
General-purpose learning systems should improve themselves in open-ended fashion in ever-changing environments. Conventional learning algorithms for neural networks, however, suffer from catastrophic forgetting (CF), i.e., previously acquired skills are forgotten when a new task is learned. Instead of hand-crafting new algorithms for avoiding CF, we propose Automated Continual Learning (ACL) to train self-referential neural networks to metalearn their own in-context continual (meta)learning algorithms. ACL encodes continual learning (CL) desiderata -- good performance on both old and new tasks -- into its metalearning objectives. Our experiments demonstrate that ACL effectively resolves "in-context catastrophic forgetting," a problem that naive in-context learning algorithms suffer from; ACL-learned algorithms outperform both hand-crafted learning algorithms and popular meta-continual learning methods on the Split-MNIST benchmark in the replay-free setting, and enables continual learning of diverse tasks consisting of multiple standard image classification datasets. We also discuss the current limitations of in-context CL by comparing ACL with state-of-the-art CL methods that leverage pre-trained models. Overall, we bring several novel perspectives into the long-standing problem of CL.
