Collaborative and Efficient Fine-tuning: Leveraging Task Similarity
Gagik Magakyan, Amirhossein Reisizadeh, Chanwoo Park, Pablo A. Parrilo, Asuman Ozdaglar
TL;DR
This work addresses data scarcity in fine-tuning foundation models across multiple downstream tasks by proposing CoLoRA, a collaborative and parameter-efficient extension of LoRA. It learns a shared global adapter capturing task similarity while maintaining personalized per-task scalars, achieving parameter complexity of $O(dr + kr^2)$. The authors provide a theoretical analysis via a heterogeneous linear regression model and GRIP-based guarantees for an alternating minimization scheme, showing convergence under sufficient task similarity. Empirically, CoLoRA yields substantial gains over federated baselines, especially when tasks are related, highlighting its practical impact for scalable, distributed model personalization.
Abstract
Adaptability has been regarded as a central feature in the foundation models, enabling them to effectively acclimate to unseen downstream tasks. Parameter-efficient fine-tuning methods such as celebrated LoRA facilitate efficient adaptation of large foundation models using labeled, high-quality and generally scarce task data. To mitigate data scarcity in fine-tuning of foundation models, we propose to leverage task similarity across multiple downstream users. Intuitively, users with similar tasks must be able to assist each other in boosting the effective fine-tuning data size. We propose Collaborative Low-Rank Adaptation, or CoLoRA, which exploits task similarity to collaboratively and efficiently fine-tune personalized foundation models. The main idea in CoLoRA is to train one shared adapter capturing underlying task similarities across all tasks, and personalized adapters tailored to user-specific tasks. We theoretically study CoLoRA on heterogeneous linear regression and provide provable guarantees for ground truth recovery. We also conduct several natural language experiments with varying task similarity, which further demonstrate that when trained together with similar tasks, individual performances are significantly boosted.
