MAP: Revisiting Weight Decomposition for Low-Rank Adaptation
Chongjie Si, Zhiyi Shi, Yadao Wang, Xiaokang Yang, Susanto Rahardja, Wei Shen
TL;DR
MAP recasts weight matrices as flattened vectors and decouples adaptation into direction and magnitude using two learnable scalars, providing a principled, lightweight enhancement to parameter-efficient fine-tuning. The vector-based (and equivalently matrix-based) formulations enable clean, interpretable control of the base weights and their updates, and MAP can be integrated with existing methods such as LoRA to form LoMAP. Across commonsense reasoning, natural language understanding, and generation tasks, MAP-derived configurations consistently outperform strong baselines while maintaining minimal parameter overhead and similar training costs. The work introduces a practical framework with broad compatibility, suggesting MAP as a default design choice for future PEFT methodologies.
Abstract
The rapid development of large language models has revolutionized natural language processing, but their fine-tuning remains computationally expensive, hindering broad deployment. Parameter-efficient fine-tuning (PEFT) methods, such as LoRA, have emerged as solutions. Recent work like DoRA attempts to further decompose weight adaptation into direction and magnitude components. However, existing formulations often define direction heuristically at the column level, lacking a principled geometric foundation. In this paper, we propose MAP, a novel framework that reformulates weight matrices as high-dimensional vectors and decouples their adaptation into direction and magnitude in a rigorous manner. MAP normalizes the pre-trained weights, learns a directional update, and introduces two scalar coefficients to independently scale the magnitude of the base and update vectors. This design enables more interpretable and flexible adaptation, and can be seamlessly integrated into existing PEFT methods. Extensive experiments show that MAP significantly improves performance when coupling with existing methods, offering a simple yet powerful enhancement to existing PEFT methods. Given the universality and simplicity of MAP, we hope it can serve as a default setting for designing future PEFT methods.
