CoSA: Compressed Sensing-Based Adaptation of Large Language Models
Songtao Wei, Yi Li, Bohan Zhang, Zhichun Guo, Ying Huang, Yuede Ji, Miao Yin, Guanpeng Li, Bingzhe Li
TL;DR
CoSA reframes PEFT as a compressed sensing synthesis problem by encoding weight updates with fixed random projections into a low-dimensional core: ΔW = L Y R, with vec(ΔW) = (R^T ⊗ L) vec(Y). The Kronecker-dictionary-based RIP guarantees ensure stable, near-isometric optimization while enabling expressive, multi-directional adaptation with far fewer trainable parameters than traditional low-rank schemes. Theoretical analysis establishes RIP for the Kronecker dictionary and empirical validation across NLP tasks shows CoSA matching or surpassing state-of-the-art PEFT methods on RoBERTa, LLaMA, and Qwen models with substantial memory savings. This offers a scalable, principled approach for efficient LLM adaptation, reducing resource demands without sacrificing performance. The work demonstrates that fixed random projections combined with a compact learnable core can robustly capture task-specific updates across diverse downstream tasks.
Abstract
Parameter-Efficient Fine-Tuning (PEFT) has emerged as a practical paradigm for adapting large language models (LLMs) without updating all parameters. Most existing approaches, such as LoRA and PiSSA, rely on low-rank decompositions of weight updates. However, the low-rank assumption may restrict expressivity, particularly in task-specific adaptation scenarios where singular values are distributed relatively uniformly. To address this limitation, we propose CoSA (Compressed Sensing-Based Adaptation), a new PEFT method extended from compressed sensing theory. Instead of constraining weight updates to a low-rank subspace, CoSA expresses them through fixed random projection matrices and a compact learnable core. We provide a formal theoretical analysis of CoSA as a synthesis process, proving that weight updates can be compactly encoded into a low-dimensional space and mapped back through random projections. Extensive experimental results show that CoSA provides a principled perspective for efficient and expressive multi-scale model adaptation. Specifically, we evaluate CoSA on 10 diverse tasks, including natural language understanding and generation, employing 5 models of different scales from RoBERTa, Llama, and Qwen families. Across these settings, CoSA consistently matches or outperforms state-of-the-art PEFT methods.
