EigenLoRAx: Recycling Adapters to Find Principal Subspaces for Resource-Efficient Adaptation and Inference
Prakhar Kaushik, Ankit Vaidya, Shravan Chaudhari, Alan Yuille
TL;DR
EigenLoRAx recycles pretrained LoRA adapters to identify a shared, task-invariant principal subspace and enables new-task adaptation by learning lightweight coefficients over this subspace. By aggregating LoRA weights and extracting top PCs, it augments the subspace with pseudo-PCs when data is scarce and uses a fixed subspace during training, drastically reducing trainable parameters while maintaining comparable accuracy across vision and language tasks. Theoretical bounds and extensive experiments across ViT, GLUE, and diffusion-based generation validate the existence and practical utility of a shared weight subspace, delivering substantial speedups, memory savings, and improved edge-deployability for large models. This approach promises scalable, equitable access to large foundations models by lowering both compute and memory barriers for adaptation and inference.
Abstract
The rapid growth of large models has raised concerns about their environmental impact and equity in accessibility due to significant computational costs. Low-Rank Adapters (LoRA) offer a lightweight solution for finetuning large models, resulting in an abundance of publicly available adapters tailored to diverse domains. We ask: Can these pretrained adapters be leveraged to further streamline adaptation to new tasks while addressing these challenges? We introduce EigenLoRAx, a parameter-efficient finetuning method that recycles existing adapters to create a principal subspace aligned with their shared domain knowledge which can be further augmented with orthogonal basis vectors in low-resource scenarios. This enables rapid adaptation to new tasks by learning only lightweight coefficients on the principal components of the subspace-eliminating the need to finetune entire adapters. EigenLoRAx requires significantly fewer parameters and memory, improving efficiency for both training and inference. Our method demonstrates strong performance across diverse domains and tasks, offering a scalable for edge-based applications, personalization, and equitable deployment of large models in resource-constrained environments.
