Resolving Conflicts in Lifelong Learning via Aligning Updates in Subspaces
Yueer Zhou, Yichen Wu, Ying Wei
TL;DR
This paper tackles catastrophic forgetting in continual learning for large language and vision models by analyzing parameter shifts within LoRA subspaces. It introduces PS-LoRA, which couples a Parameter Stability Loss that constrains update magnitudes and aligns update directions with historical adapters, with a magnitude-based post-training merging strategy to consolidate tasks efficiently. The approach yields improved stability and accuracy across NLP and CV benchmarks, outperforming baselines and offering memory-efficient, plug-in compatibility with orthogonality-based methods. Overall, PS-LoRA provides a principled, scalable method to balance adaptation and retention in continual learning scenarios.
Abstract
Low-Rank Adaptation (LoRA) enables efficient Continual Learning but often suffers from catastrophic forgetting due to destructive interference between tasks. Our analysis reveals that this degradation is primarily driven by antagonistic directional updates where new task gradients directly oppose the historical weight trajectory. To address this, we propose PS-LoRA (Parameter Stability LoRA), a framework designed to resolve conflicts by aligning updates within the optimization subspace. Our approach employs a dual-regularization objective that penalizes conflicting directions and constrains magnitude deviations to ensure consistency with prior knowledge. Additionally, we implement a magnitude-based merging strategy to consolidate sequential adapters into a robust representation without retraining. Experiments on NLP and Vision benchmarks show that PS-LoRA outperforms state-of-the-art methods by preserving the stability of learned representations while efficiently adapting to new domains.
