Model Zoo: A Growing "Brain" That Learns Continually
Rahul Ramesh, Pratik Chaudhari
TL;DR
The paper addresses the problem of continual learning under task interdependencies and interference. It proposes Model Zoo, a boosting-inspired ensemble approach that grows across episodes by training small models on current and past tasks and averaging predictions for each task, enabling strong forward and backward transfer. The authors develop a theoretical framework around task relatedness and transfer exponents to characterize when joint learning helps and provide bounds on excess risk; they also demonstrate empirically that capacity splitting across tasks can outperform existing methods across a broad benchmark suite. A striking finding is that simple isolated training can outperform many continual-learning methods in some settings, underscoring the importance of evaluating baselines and the potential of capacity-splitting with data replay. Overall, Model Zoo offers a scalable, effective strategy for continual learning by dynamically expanding learning capacity and leveraging cross-task synergies while mitigating detrimental task competition.
Abstract
This paper argues that continual learning methods can benefit by splitting the capacity of the learner across multiple models. We use statistical learning theory and experimental analysis to show how multiple tasks can interact with each other in a non-trivial fashion when a single model is trained on them. The generalization error on a particular task can improve when it is trained with synergistic tasks, but can also deteriorate when trained with competing tasks. This theory motivates our method named Model Zoo which, inspired from the boosting literature, grows an ensemble of small models, each of which is trained during one episode of continual learning. We demonstrate that Model Zoo obtains large gains in accuracy on a variety of continual learning benchmark problems. Code is available at https://github.com/grasp-lyrl/modelzoo_continual.
