SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA
Minrui Luo, Fuhang Kuang, Yu Wang, Zirui Liu, Tianxing He
TL;DR
The paper tackles the challenge of efficiently fine-tuning large language models while preserving pre-trained world knowledge and safety, a key problem in parameter-efficient fine-tuning (PEFT). It introduces Subspace-Constrained LoRA (SC-LoRA), which identifies a low-rank subspace $S$ spanned by the top eigenvectors of $\Delta\mathrm{Cov} = (1-\beta)\mathrm{Cov}_+ - \beta\mathrm{Cov}_-$ and initializes the LoRA adapters so that $B_{init}A_{init}x = \Pi_S(h)$, constraining the adapter output to $S$. The hyperparameter $\beta$ balances learning the fine-tuning data against retaining the knowledge to be preserved, and the method outperforms existing LoRA initializations on world-knowledge and safety-preservation tasks. These results suggest SC-LoRA provides a practical, theory-grounded approach to knowledge-preserving, efficient fine-tuning for LLMs, with broad implications for safe and reliable PEFT deployments.
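The initialization summarized above can be sketched in NumPy. This is a minimal illustration rather than the authors' implementation, and it rests on several assumptions not stated in this summary: $h = W_0 x$ is the output of a frozen linear layer, $\mathrm{Cov}_+$ and $\mathrm{Cov}_-$ are uncentered second moments of $h$ over fine-tuning data and preserved-knowledge data respectively, and $B_{init} = Q$, $A_{init} = Q^\top W_0$ (with $Q$ holding the top-$r$ eigenvectors) is one choice that satisfies $B_{init}A_{init}x = \Pi_S(h)$. The function name `sc_lora_init` and its arguments are hypothetical.

```python
import numpy as np

def sc_lora_init(W0, H_pos, H_neg, rank, beta):
    """Sketch of a subspace-constrained LoRA initialization (assumed form).

    W0:    (d_out, d_in) frozen pre-trained weight.
    H_pos: (n_pos, d_out) layer outputs h on fine-tuning data.
    H_neg: (n_neg, d_out) layer outputs h on preserved-knowledge data.
    rank:  LoRA rank r, the dimension of the subspace S.
    beta:  trade-off in [0, 1]; larger values weight preservation more.
    """
    # Assumption: uncentered second-moment matrices of the outputs.
    cov_pos = H_pos.T @ H_pos / H_pos.shape[0]
    cov_neg = H_neg.T @ H_neg / H_neg.shape[0]

    # Delta Cov = (1 - beta) * Cov_+ - beta * Cov_-
    delta_cov = (1.0 - beta) * cov_pos - beta * cov_neg

    # eigh returns eigenvalues in ascending order for symmetric input,
    # so the last `rank` columns are the top eigenvectors spanning S.
    _, eigvecs = np.linalg.eigh(delta_cov)
    Q = eigvecs[:, -rank:]            # (d_out, r), orthonormal columns

    B_init = Q                        # (d_out, r)
    A_init = Q.T @ W0                 # (r, d_in)
    return B_init, A_init
```

With this choice, $B_{init}A_{init}x = QQ^\top W_0 x$, which is the orthogonal projection of $h$ onto $S$. A practical setup would typically also subtract $B_{init}A_{init}$ from the frozen weight so the model's output is unchanged at the start of training; that step is likewise an assumption here and is omitted from the sketch.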
Abstract
Parameter-Efficient Fine-Tuning (PEFT) methods, particularly Low-Rank Adaptation (LoRA), are indispensable for efficiently customizing Large Language Models (LLMs). However, vanilla LoRA suffers from slow convergence and knowledge forgetting. Recent studies have used carefully designed LoRA initializations either to improve fine-tuning efficiency or to preserve knowledge in the pre-trained LLM, but none of these works addresses both goals at the same time. To this end, we introduce Subspace-Constrained LoRA (SC-LoRA), a novel LoRA initialization framework engineered to navigate the trade-off between efficient fine-tuning and knowledge preservation. We achieve this by constraining the output of the trainable LoRA adapters to a low-rank subspace in which the context information of the fine-tuning data is preserved as much as possible while the context information of the knowledge to be preserved is retained as little as possible, with the two objectives balanced against each other. This constraint enables the trainable weights to focus primarily on the main features of the fine-tuning data while avoiding damage to the features of the preserved knowledge. We provide a theoretical analysis of our method and conduct extensive experiments, covering safety preservation and world knowledge preservation, on various downstream tasks. In our experiments, SC-LoRA delivers superior fine-tuning performance while markedly reducing knowledge forgetting, surpassing contemporary LoRA initialization methods.
