AROMA: Autonomous Rank-one Matrix Adaptation
Hao Nan Sheng, Zhi-yong Wang, Mingrui Yang, Hing Cheung So
TL;DR
AROMA tackles the prespecified-rank bottleneck of LoRA by introducing adaptive rank growth via a dual-loop architecture that autonomously determines the number of rank-one updates. The method expresses the incremental weight update as $ΔW = ∑_{p=1}^{P} b_p a_p$, with an inner loop optimizing each rank-one component and an outer loop deciding the total count of subspaces. The training employs a Check & Merge & Reinit & Reset protocol to maintain subspace independence and promote exploration while keeping trainable parameters minimal. Empirical results on RoBERTa-base/GLUE and LLaMA3-8B/Commonsense170K show superior accuracy with far fewer trainable parameters than LoRA and AdaLoRA, along with favorable time efficiency. These findings position adaptive rank-growth PEFT as a scalable, effective approach with potential extensions to multimodal tasks and continual learning setups.
Abstract
As large language models continue to grow in size, parameter-efficient fine-tuning (PEFT) has become increasingly crucial. While low-rank adaptation (LoRA) offers a solution through low-rank updates, its static rank allocation may yield suboptimal results. Adaptive low-rank adaptation (AdaLoRA) improves this with dynamic allocation but remains sensitive to initial and target rank configurations. We introduce AROMA, a framework that automatically constructs layer-specific updates by iteratively building up rank-one components with very few trainable parameters that gradually diminish to zero. Unlike existing methods that employ rank reduction mechanisms, AROMA introduces a dual-loop architecture for rank growth. The inner loop extracts information from each rank-one subspace, while the outer loop determines the number of rank-one subspaces, i.e., the optimal rank. We reset optimizer states to maintain subspace independence. AROMA significantly reduces parameters compared to LoRA and AdaLoRA while achieving superior performance on natural language understanding and commonsense reasoning tasks, offering new insights into adaptive PEFT. The code is available at \href{https://github.com/ShuDun23/AROMA}{AROMA}.
