InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning
Yan-Shuo Liang, Wu-Jun Li
TL;DR
InfLoRA tackles interference between sequential tasks in continual learning by fine-tuning the pre-trained weights only within a subspace that is orthogonal to the old-task gradients and aligned with the new-task gradient. It reparameterizes the backbone with one low-rank branch per task, so that updating the injected parameters $\bm{A}_t$ is equivalent to fine-tuning $\bm{W}$ within the subspace $\text{span}\{\bm{b}_1^t,\dots,\bm{b}_r^t\}$. This subspace is constructed via $\hat{\bm{H}}_t$, built from the inputs and the gradient-memory projections from DualGPM, so that interference with prior tasks is minimized. The design thus has two parts, orthogonality to the old-task gradients and alignment with the new-task gradient, and selects $\bm{B}_t$ in a principled way from an SVD of the projected inputs. Empirically, InfLoRA and its 5-block variant substantially outperform state-of-the-art PEFT-based continual-learning methods on ImageNet-R, CIFAR100, and DomainNet while remaining parameter-efficient and fast at inference. The approach is promising for scalable continual learning with large pre-trained transformers in vision applications.
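The subspace argument above can be illustrated with a small NumPy sketch. It is a simplified illustration under stated assumptions, not the authors' implementation: it uses a single linear layer and a squared loss, and it approximates the old-task gradient subspace by the span of the old-task inputs instead of using a DualGPM memory. The helper names `orthonormal_basis` and `design_B` are hypothetical.

```python
import numpy as np

def orthonormal_basis(H, tol=1e-8):
    """Orthonormal basis for the column span of H (d_in x n), via SVD."""
    U, s, _ = np.linalg.svd(H, full_matrices=False)
    return U[:, s > tol * s.max()]

def design_B(H_new, M_old, r):
    """Hypothetical helper: rows of B are the top-r left singular vectors of
    the new-task inputs after removing their component in span(M_old)."""
    H_hat = H_new - M_old @ (M_old.T @ H_new)   # projected inputs
    U, _, _ = np.linalg.svd(H_hat, full_matrices=False)
    return U[:, :r].T                           # shape (r, d_in)

rng = np.random.default_rng(0)
d_in, d_out, r, lr = 16, 8, 4, 0.1

H_old = rng.normal(size=(d_in, 5))    # old-task inputs (assumed stand-in for gradient memory)
H_new = rng.normal(size=(d_in, 20))   # new-task inputs
Y_new = rng.normal(size=(d_out, 20))  # new-task targets

M_old = orthonormal_basis(H_old)
B = design_B(H_new, M_old, r)         # frozen during training

W = rng.normal(size=(d_out, d_in))    # pre-trained weight, frozen
A = np.zeros((d_out, r))              # injected parameters, zero-initialized

# One gradient step on A for the loss 0.5 * ||(W + A B) H_new - Y_new||^2.
G = (W + A @ B) @ H_new - Y_new       # dL/dE
grad_A = G @ (B @ H_new).T
A_new = A - lr * grad_A

delta_W = A_new @ B                   # effective change to the weight
delta_full = -lr * G @ H_new.T        # the step full fine-tuning of W would take

# The low-rank step equals the full step projected onto the row span of B.
same_as_projection = np.allclose(delta_W, delta_full @ B.T @ B)
# Outputs on old-task inputs are unchanged up to floating-point error.
old_shift = np.abs(delta_W @ H_old).max()
# Outputs on new-task inputs do change.
new_shift = np.abs(delta_W @ H_new).max()

print(same_as_projection, old_shift, new_shift)
```

Because the rows of `B` are orthonormal and orthogonal to the old-task inputs, the update to `A` changes the layer's output on new-task inputs while leaving its output on old-task inputs untouched, which is the sense in which the branch is interference-free in this toy setting.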
Abstract
Continual learning requires the model to learn multiple tasks sequentially. In continual learning, the model should possess the ability to maintain its performance on old tasks (stability) and the ability to adapt to new tasks continuously (plasticity). Recently, parameter-efficient fine-tuning (PEFT), which involves freezing a pre-trained model and injecting a small number of learnable parameters to adapt to downstream tasks, has gained increasing popularity in continual learning. Although existing continual learning methods based on PEFT have demonstrated superior performance compared to those not based on PEFT, most of them do not consider how to eliminate the interference of the new task on the old tasks, which inhibits the model from making a good trade-off between stability and plasticity. In this work, we propose a new PEFT method, called interference-free low-rank adaptation (InfLoRA), for continual learning. InfLoRA injects a small number of parameters to reparameterize the pre-trained weights and shows that fine-tuning these injected parameters is equivalent to fine-tuning the pre-trained weights within a subspace. Furthermore, InfLoRA designs this subspace to eliminate the interference of the new task on the old tasks, making a good trade-off between stability and plasticity. Experimental results show that InfLoRA outperforms existing state-of-the-art continual learning methods on multiple datasets.
