ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation

Dongha Lee; Jinhee Park; Minjun Kim; Junseok Kwon

ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation

Dongha Lee, Jinhee Park, Minjun Kim, Junseok Kwon

TL;DR

ABM-LoRA addresses initialization bottlenecks in Low-Rank Adaptation by aligning adapter activation boundaries with a frozen pretrained model, reducing gradient information loss at step one. By minimizing a boundary loss over a representative batch, ABM creates a initialization that preserves gradient directions, enabling faster convergence across language, vision, and multi-task settings. Empirically, ABM-LoRA improves GLUE results for T5-Base, VTAB-1K performance for ViT-B/16, and WizardLM/LLaMA2-7B tasks, often matching or surpassing full fine-tuning with lower cost. The work also provides thorough ablations and practical guidance on layer selection, margins, and adapter rank, highlighting robust cross-domain benefits.

Abstract

We propose Activation Boundary Matching for Low-Rank Adaptation (ABM-LoRA), a principled initialization strategy that substantially accelerates the convergence of low-rank adapters. While LoRA offers high parameter efficiency, its random initialization restricts gradient updates to a mismatched tangent space, causing significant information loss and hindering early convergence. Our ABM-LoRA addresses this by aligning the adapter's activation boundaries with those of the pretrained model before downstream training, thereby maximizing the projection of full-parameter gradients into the adapter subspace. This alignment sharply reduces information loss at initialization, yields a lower starting loss, and accelerates convergence. We demonstrate ABM-LoRA's effectiveness across diverse architectures and tasks: language understanding (T5-Base on GLUE), dialogue generation (LLaMA2-7B on WizardLM), and vision recognition (ViT-B/16 on VTAB-1K). On VTAB-1K, it achieves the highest accuracy among all methods, with strong gains on structured reasoning tasks requiring geometric understanding.

ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation

TL;DR

Abstract

ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)