Latent Spectral Regularization for Continual Learning
Emanuele Frascaroli, Riccardo Benaglia, Matteo Boschini, Luca Moschella, Cosimo Fiorini, Emanuele Rodolà, Simone Calderara
TL;DR
This work tackles catastrophic forgetting in rehearsal-based continual learning by analyzing how latent-space geometry evolves with sequential tasks. It reveals that replayed points from different classes increasingly mix in the latent space and proposes CaSpeR-IL, a spectral-geometry regularizer that promotes partitioned embeddings by shaping the Laplacian spectrum and maximizing the eigengap $\lambda_{g+1}-\lambda_g$. By integrating a loss term $\ell_{\text{CaSpeR}}=-\lambda_{g+1}+\sum_{j=1}^g\lambda_j$ (with a Monte Carlo approximation $\ell_{\text{CaSpeR}}^*$ for efficiency), CaSpeR-IL can be plugged into any rehearsal-based CL method and consistently improves final accuracy $\bar{A}_F$ while reducing forgetting $\bar{F}^*_F$ across multiple benchmarks. The analysis shows CaSpeR-IL produces more stable latent-space partitions, as evidenced by more diagonal functional maps and lower off-diagonal energy, indicating reduced interference between classes and enhanced transfer of knowledge across tasks.
Abstract
While biological intelligence grows organically as new knowledge is gathered throughout life, Artificial Neural Networks forget catastrophically whenever they face a changing training data distribution. Rehearsal-based Continual Learning (CL) approaches have been established as a versatile and reliable solution to overcome this limitation; however, sudden input disruptions and memory constraints are known to alter the consistency of their predictions. We study this phenomenon by investigating the geometric characteristics of the learner's latent space and find that replayed data points of different classes increasingly mix up, interfering with classification. Hence, we propose a geometric regularizer that enforces weak requirements on the Laplacian spectrum of the latent space, promoting a partitioning behavior. Our proposal, called Continual Spectral Regularizer for Incremental Learning (CaSpeR-IL), can be easily combined with any rehearsal-based CL approach and improves the performance of SOTA methods on standard benchmarks.
