CIP-Net: Continual Interpretable Prototype-based Network
Federico Di Valerio, Michela Proietti, Alessio Ragno, Roberto Capobianco
TL;DR
CIP-Net introduces an exemplar-free, self-explainable prototype-based network for continual learning that relies on a single shared prototype pool to mitigate catastrophic forgetting while maintaining interpretability. It combines alignment-uniformity self-supervision with targeted prototype regularization to curb interference and explain drift across tasks, achieving state-of-the-art results on CUB and CARS in both task- and class-incremental settings with substantially reduced memory overhead. The model provides both global and local explanations, linking prototypes to classes and local patch activations, and demonstrates stability of explanations across task sequences. Overall, CIP-Net offers a scalable, interpretable solution for continual learning with strong performance and practical memory efficiency, while outlining clear avenues for further refinement in sparsity, regularization, and out-of-distribution handling.
Abstract
Continual learning constrains models to learn new tasks over time without forgetting what they have already learned. A key challenge in this setting is catastrophic forgetting, where learning new information causes the model to lose its performance on previous tasks. Recently, explainable AI has been proposed as a promising way to better understand and reduce forgetting. In particular, self-explainable models are useful because they generate explanations during prediction, which can help preserve knowledge. However, most existing explainable approaches use post-hoc explanations or require additional memory for each new task, resulting in limited scalability. In this work, we introduce CIP-Net, an exemplar-free self-explainable prototype-based model designed for continual learning. CIP-Net avoids storing past examples and maintains a simple architecture, while still providing useful explanations and strong performance. We demonstrate that CIPNet achieves state-of-the-art performances compared to previous exemplar-free and self-explainable methods in both task- and class-incremental settings, while bearing significantly lower memory-related overhead. This makes it a practical and interpretable solution for continual learning.
