DoRA: Enhancing Parameter-Efficient Fine-Tuning with Dynamic Rank Distribution

Yulong Mao; Kaiyu Huang; Changhao Guan; Ganglin Bao; Fengran Mo; Jinan Xu

TL;DR

Dynamic Low-Rank Adaptation (DoRA) decomposes high-rank LoRA layers into structured single-rank components and prunes those components dynamically during training according to their importance to the specific task, making the most of a limited parameter budget.

Abstract

Fine-tuning large-scale pre-trained models is inherently resource-intensive. While it can enhance the capabilities of the model, it also incurs substantial computational costs, which hinders practical application to downstream tasks. Existing parameter-efficient fine-tuning (PEFT) methods such as Low-Rank Adaptation (LoRA) rely on a bypass framework that ignores the differing parameter budget requirements of individual weight matrices, which may lead to suboptimal fine-tuning outcomes. To address this issue, we introduce the Dynamic Low-Rank Adaptation (DoRA) method. DoRA decomposes high-rank LoRA layers into structured single-rank components and prunes those components dynamically during training according to their importance to the specific task, making the most of the limited parameter budget. Experimental results demonstrate that DoRA can achieve competitive performance compared with LoRA and full model fine-tuning, and outperform various strong baselines with the same storage parameter budget. Our code is available at https://github.com/MIkumikumi0116/DoRA
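
The decomposition mentioned in the abstract rests on a standard linear-algebra identity: a rank-r product of two low-rank matrices equals the sum of r rank-one outer products. The sketch below checks that identity numerically with plain Python lists. The matrix sizes, the names `matmul`, `outer`, and `add`, and the random initialization are illustrative assumptions, not taken from the paper or its repository.

```python
import random

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def outer(u, v):
    """Outer product of two vectors, giving a matrix of rank at most one."""
    return [[ui * vj for vj in v] for ui in u]

def add(X, Y):
    """Element-wise sum of two matrices of equal shape."""
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(X, Y)]

random.seed(0)
d_out, d_in, r = 4, 3, 2  # illustrative sizes (assumption)

# LoRA-style pair: B is d_out x r, A is r x d_in.
B = [[random.gauss(0, 1) for _ in range(r)] for _ in range(d_out)]
A = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(r)]

# Update computed as one rank-r product.
delta_lora = matmul(B, A)

# Same update computed as r single-rank components:
# the i-th column of B times the i-th row of A.
components = [outer([row[i] for row in B], A[i]) for i in range(r)]
delta_sum = components[0]
for comp in components[1:]:
    delta_sum = add(delta_sum, comp)

# Largest element-wise difference between the two formulations.
max_gap = max(
    abs(p - q)
    for row_p, row_q in zip(delta_lora, delta_sum)
    for p, q in zip(row_p, row_q)
)
```

Because the two formulations agree, removing one component is the same as lowering the rank of that layer's update by one, which is what makes per-component pruning a way to redistribute rank.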

Paper Structure (22 sections, 10 equations, 2 figures, 9 tables, 1 algorithm)

Introduction
Background
Dynamic Low-Rank Adaptation
Parameterization
Importance Scoring
Parameter Scheduling and Pruning Strategy
Dimensional Equilibrium Modulator
Experiments
Experimental Setup
Results
Analysis and Discussion
Effectiveness of DEM
Parameter Allocation Preference
Impact of Initial Budget
Related Work
...and 7 more sections

Figures (2)

Figure 1: Panels (a) and (b) illustrate the reparameterization used by LoRA and DoRA, respectively. LoRA adds a pair of low-rank matrices, $A$ and $B$, each of rank $r$, to the weight matrix. In contrast, DoRA adds $r'$ pairs of single-rank matrices, each pair acting as one LoRA component. During training, DoRA evaluates the contribution of each component to the overall performance and prunes the components that contribute least, achieving adaptive allocation of parameters.
Figure 2: Rank distribution under four parameter budgets
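
The pruning step described in the Figure 1 caption can be sketched as a score-and-mask procedure over single-rank components. This page does not state how DoRA scores a component, so the sketch substitutes the Frobenius norm of each rank-one matrix as a stand-in importance score. That choice, the function name `prune_components`, and the toy values are all assumptions made for illustration, not the authors' implementation.

```python
import math

def outer(u, v):
    """Outer product of two vectors, giving a matrix of rank at most one."""
    return [[ui * vj for vj in v] for ui in u]

def frobenius(M):
    """Frobenius norm of a matrix stored as a list of rows."""
    return math.sqrt(sum(x * x for row in M for x in row))

def prune_components(cols_B, rows_A, keep):
    """Return a 0/1 mask that keeps the `keep` highest-scoring components.

    cols_B[i] and rows_A[i] together form the i-th single-rank component.
    The score is a placeholder (assumption), not the paper's criterion.
    """
    scores = [frobenius(outer(b, a)) for b, a in zip(cols_B, rows_A)]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    kept = set(ranked[:keep])
    mask = [1 if i in kept else 0 for i in range(len(scores))]
    return mask, scores

# Three toy components; the middle one contributes far less than the others.
cols_B = [[1.0, 0.0], [0.1, 0.1], [0.0, 2.0]]
rows_A = [[1.0, 1.0], [0.1, 0.0], [0.5, 0.5]]

mask, scores = prune_components(cols_B, rows_A, keep=2)
```

In this toy run the mask drops only the weak middle component. Applying such a mask independently to each weight matrix would leave different matrices with different effective ranks, which is the kind of distribution Figure 2 reports.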
