ARD-LoRA: Dynamic Rank Allocation for Parameter-Efficient Fine-Tuning of Foundation Models with Heterogeneous Adaptation Needs

Haseeb Ullah Khan Shinwari; Muhammad Usama

ARD-LoRA: Dynamic Rank Allocation for Parameter-Efficient Fine-Tuning of Foundation Models with Heterogeneous Adaptation Needs

Haseeb Ullah Khan Shinwari, Muhammad Usama

TL;DR

ARD-LoRA tackles the inefficiency of fixed-rank PEFT by learning per-head, per-layer rank allocations through differentiable scaling factors optimized under a meta-objective that enforces sparsity and stable rank transitions. The method achieves near-full fine-tuning performance on LLAMA-3.1-70B and PaliGemma-2 while using a tiny fraction of trainable parameters and significantly reducing memory for multimodal adaptation. Theoretical analyses establish convergence, generalization, and stability guarantees, and extensive experiments demonstrate strong empirical gains, including superior cross-domain generalization and meaningful reductions in adaptation overhead. Overall, dynamic, fine-grained rank allocation emerges as a powerful paradigm for efficient, scalable foundation-model adaptation across modalities and tasks.

Abstract

Conventional Low-Rank Adaptation (LoRA) methods employ a fixed rank, imposing uniform adaptation across transformer layers and attention heads despite their heterogeneous learning dynamics. This paper introduces Adaptive Rank Dynamic LoRA (ARD-LoRA), a novel framework that automates rank allocation through learnable scaling factors. These factors are optimized via a meta-objective balancing task performance and parameter efficiency, incorporating $\ell_1$ sparsity for minimal rank and Total Variation regularization for stable rank transitions. ARD-LoRA enables continuous, differentiable, per-head rank adaptation. Experiments on LLAMA-3.1-70B and PaliGemma-2 demonstrate ARD-LoRA's efficacy, achieving up to 99.3% of full fine-tuning performance with only 0.32% trainable parameters, outperforming strong baselines like DoRA and AdaLoRA. Furthermore, it reduces multimodal adaptation memory by 41%. These results establish dynamic, fine-grained rank allocation as a critical paradigm for efficient foundation model adaptation.

ARD-LoRA: Dynamic Rank Allocation for Parameter-Efficient Fine-Tuning of Foundation Models with Heterogeneous Adaptation Needs

TL;DR

Abstract

ARD-LoRA: Dynamic Rank Allocation for Parameter-Efficient Fine-Tuning of Foundation Models with Heterogeneous Adaptation Needs

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (1)