GeLoRA: Geometric Adaptive Ranks For Efficient LoRA Fine-tuning
Abdessalam Ed-dib, Zhanibek Datbayev, Amine Mohamed Aboussalah
TL;DR
The paper addresses the high computational cost of fine-tuning large language models by proposing GeLoRA, a geometry-aware, parameter-efficient fine-tuning method that adapts LoRA ranks per transformer layer based on the intrinsic dimensionality of hidden representations. It provides a theoretical framework linking layer representation idim to the necessary LoRA capacity, and uses the TwoNN estimator to compute per-layer dimensions, yielding a lower bound on the required update rank. Empirically, GeLoRA achieves state-of-the-art or competitive results on GLUE and SQuAD under tight parameter budgets, outperforming existing LoRA variants and adapters while preserving efficiency. The approach offers both practical improvements for model personalization and theoretical insight into why intermediate-task tuning can be effective in certain regimes.
Abstract
Fine-tuning large language models (LLMs) is computationally intensive because it requires updating all parameters. Low-Rank Adaptation (LoRA) improves efficiency by modifying only a subset of weights but introduces a trade-off between expressivity and computational cost: lower ranks reduce resources but limit expressiveness, while higher ranks enhance expressivity at increased cost. Despite recent advances in adaptive LoRA techniques, existing methods fail to provide a theoretical basis for optimizing the trade-off between model performance and efficiency. We propose Geometric Low-Rank Adaptation (GeLoRA), a novel framework that computes the intrinsic dimensionality of hidden state representations to adaptively select LoRA ranks. We demonstrate that the intrinsic dimension provides a lower bound for the optimal rank of LoRA matrices, allowing for a principled selection that balances efficiency and expressivity. GeLoRA dynamically adjusts the rank for each layer based on the intrinsic dimensionality of its input and output representations, recognizing that not all model parameters equally impact fine-tuning. Empirical validation on multiple tasks shows that GeLoRA consistently outperforms recent baselines within the same parameter budget.
