A Probabilistic Framework for Modular Continual Learning
Lazar Valkov, Akash Srivastava, Swarat Chaudhuri, Charles Sutton
TL;DR
A Probabilistic Framework for Modular Continual Learning (PICLE) addresses the challenge of scalable module-path search in continual learning by introducing probabilistic models that estimate a composition’s fitness without retraining. It combines perceptual/few-shot modeling over input activations with a Gaussian-process-based latent-transfer model to cover diverse transfer scenarios, enabling a constant-training-cost search over module paths. Empirically, PICLE achieves perceptual, few-shot, and latent transfer and scales to large search spaces, outperforming state-of-the-art modular CL baselines on long problem sequences and across compositional benchmarks. The framework thus offers a principled, scalable approach to modular CL with practical implications for AutoML-style search and sequential learning tasks.
Abstract
Modular approaches that use a different composition of modules for each problem are a promising direction in continual learning (CL). However, searching through the large, discrete space of module compositions is challenging, especially because evaluating a composition's performance requires a round of neural network training. We address this challenge through a modular CL framework, PICLE, that uses a probabilistic model to cheaply compute the fitness of each composition, allowing PICLE to achieve both perceptual, few-shot and latent transfer. The model combines prior knowledge about good module compositions with dataset-specific information. We evaluate PICLE using two benchmark suites designed to assess different desiderata of CL techniques. Comparing to a wide range of approaches, we show that PICLE is the first modular CL algorithm to achieve perceptual, few-shot and latent transfer while scaling well to large search spaces, outperforming previous state-of-the-art modular CL approaches on long problem sequences.
