Rank-Accuracy Trade-off for LoRA: A Gradient-Flow Analysis
Michael Rushka, Diego Klabjan
TL;DR
This paper analyzes the rank–accuracy trade-off in LoRA fine-tuning by formulating a gradient-flow dynamics framework for the low-rank update BA to the pretrained weights W0. It first derives gradient-flow ODEs that are invariant to whether A and B are updated simultaneously or sequentially, and then applies them to two objectives: trace-squared and Frobenius norm based low-rank approximation. The main results show zero final loss for the trace-squared objective under LoRA and quantify the relative approximation error scaling as O(r^{-1/2}) under standard spectral initialization, while for general low-rank approximation the optimal rank-r solution aligns with the top r singular values of W0 per the Eckart-Young-Mirsky theorem. Together, these findings illuminate how rank and spectral properties of W0 determine LoRA’s accuracy, offering a principled basis for parameter-efficient fine-tuning in large models and guiding future extensions to more complex losses and initialization schemes.
Abstract
Previous empirical studies have shown that LoRA achieves accuracy comparable to full-parameter methods on downstream fine-tuning tasks, even for rank-1 updates. By contrast, the theoretical underpinnings of the dependence of LoRA's accuracy on update rank remain relatively unexplored. In this work, we compare the accuracy of rank-r LoRA updates against full-parameter updates for fine-tuning tasks from a dynamical systems perspective. We perform gradient flow analysis in both full-rank and low-rank regimes to establish explicit relationships between rank and accuracy for two loss functions under LoRA. While gradient flow equations for LoRA are presented in prior work, we rigorously derive their form and show that they are identical for simultaneous and sequential LoRA parameter updates. We then use the resulting dynamical system equations to obtain closed-form relationships between LoRA rank and accuracy for trace-squared and Frobenius-norm low-rank approximation loss functions.
