Integrating Present and Past in Unsupervised Continual Learning
Yipeng Zhang, Laurent Charlin, Richard Zemel, Mengye Ren
TL;DR
This work tackles forgetting and representation overlap in unsupervised continual learning by proposing a unifying framework that separately optimizes three objectives—$L_{ ext{current}}$, $L_{ ext{cross}}$, and $L_{ ext{past}}$—to capture plasticity, cross-task consolidation, and stability. It introduces Osiris, a method that uses isolated embedding spaces and multiple projectors to realize these objectives, achieving state-of-the-art performance on standard benchmarks like Split-CIFAR-100 and on newly proposed structured benchmarks that mimic realistic learning environments. The study also analyzes the trade-offs between replay and distillation for stability, and demonstrates that the proposed structured task sequences can even surpass offline iid learning in some settings. Overall, Osiris provides a principled path to more robust continual learners by explicitly integrating present and past learning signals and by highlighting the importance of task structure and normalization choices for UCL.
Abstract
We formulate a unifying framework for unsupervised continual learning (UCL), which disentangles learning objectives that are specific to the present and the past data, encompassing stability, plasticity, and cross-task consolidation. The framework reveals that many existing UCL approaches overlook cross-task consolidation and try to balance plasticity and stability in a shared embedding space. This results in worse performance due to a lack of within-task data diversity and reduced effectiveness in learning the current task. Our method, Osiris, which explicitly optimizes all three objectives on separate embedding spaces, achieves state-of-the-art performance on all benchmarks, including two novel benchmarks proposed in this paper featuring semantically structured task sequences. Compared to standard benchmarks, these two structured benchmarks more closely resemble visual signals received by humans and animals when navigating real-world environments. Finally, we show some preliminary evidence that continual models can benefit from such realistic learning scenarios.
