LoRA-Based Continual Learning with Constraints on Critical Parameter Changes
Shimou Ling, Liang Zhang, Jiangwei Zhao, Lili Pan, Hongliang Li
TL;DR
The paper tackles catastrophic forgetting in continual learning with pre-trained vision transformers by showing that orthodox orthogonal LoRA tuning alone does not fully stabilize important pre-task parameters. It introduces LoRAC-IPC, a model that combines orthogonal LoRA composition (via QR-based factorization) with Important Parameter Constraints to freeze critical parameter matrices, plus task-adaptive prediction enhancements. The approach yields state-of-the-art results across multiple benchmarks (including Split CIFAR-100, ImageNet-R, DomainNet) and demonstrates strong multi-modal performance, with ablations confirming the contributions of LoRA composition, orthogonality, IPC, and task-ID inference. This work offers a scalable, parameter-efficient path to robust continual learning in decision-critical, real-world tasks while maintaining plasticity for new knowledge.
Abstract
LoRA-based continual learning represents a promising avenue for leveraging pre-trained models in downstream continual learning tasks. Recent studies have shown that orthogonal LoRA tuning effectively mitigates forgetting. However, this work unveils that under orthogonal LoRA tuning, the critical parameters for pre-tasks still change notably after learning post-tasks. To address this problem, we directly propose freezing the most critical parameter matrices in the Vision Transformer (ViT) for pre-tasks before learning post-tasks. In addition, building on orthogonal LoRA tuning, we propose orthogonal LoRA composition (LoRAC) based on QR decomposition, which may further enhance the plasticity of our method. Elaborate ablation studies and extensive comparisons demonstrate the effectiveness of our proposed method. Our results indicate that our method achieves state-of-the-art (SOTA) performance on several well-known continual learning benchmarks. For instance, on the Split CIFAR-100 dataset, our method shows a 6.35\% improvement in accuracy and a 3.24\% reduction in forgetting compared to previous methods. Our code is available at https://github.com/learninginvision/LoRAC-IPC.
