Nonlinearity as Rank: Generative Low-Rank Adapter with Radial Basis Functions
Yihao Ouyang, Shiwei Li, Haozhao Wang, Xiandi Luo, Zhuoqi Hu, Yuetong Song, Qiyu Qin, Yichen Li, Ruixuan Li
TL;DR
GenLoRA tackles the parameter-inefficiency of LoRA by showing that explicit basis vectors contain redundancy that can be captured with lightweight nonlinear generators driven by latent seeds. By mapping latent vectors through RBF-based generators, GenLoRA synthesizes the A and B matrices, boosting effective rank without proportional parameter growth and preserving low-rank structure with bounded gradients. Empirical results across math, commonsense, and code-generation tasks demonstrate consistent accuracy gains and significant parameter savings versus LoRA and its variants, with strong scalability as rank and group size increase. The approach enables near-zero overhead at inference via weight merging and offers a principled, theoretically grounded framework for nonlinear, generative weight updates in PEFT.
Abstract
Low-rank adaptation (LoRA) approximates the update of a pretrained weight matrix using the product of two low-rank matrices. However, standard LoRA follows an explicit-rank paradigm, where increasing model capacity requires adding more rows or columns (i.e., basis vectors) to the low-rank matrices, leading to substantial parameter growth. In this paper, we find that these basis vectors exhibit significant parameter redundancy and can be compactly represented by lightweight nonlinear functions. Therefore, we propose Generative Low-Rank Adapter (GenLoRA), which replaces explicit basis vector storage with nonlinear basis vector generation. Specifically, GenLoRA maintains a latent vector for each low-rank matrix and employs a set of lightweight radial basis functions (RBFs) to synthesize the basis vectors. Each RBF requires far fewer parameters than an explicit basis vector, enabling higher parameter efficiency in GenLoRA. Extensive experiments across multiple datasets and architectures show that GenLoRA attains higher effective LoRA ranks under smaller parameter budgets, resulting in superior fine-tuning performance. The code is available at https://anonymous.4open.science/r/GenLoRA-1519.
