Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning
Haobo Song, Hao Zhao, Soumajit Majumder, Tao Lin
TL;DR
CapaBoost introduces a simple, plug-in strategy to increase the effective capacity of parameter-efficient fine-tuning by using multiple parallel, weight-tied updates with deterministic random masks. Theoretical and empirical results show that the approach expands the effective rank of incremental updates, enabling higher performance without increasing trainable parameters or FLOPs. Across NLP and vision benchmarks (GLUE, SQuAD, VTAB), CapaBoost variants (notably LoRA and PAdapter) consistently outperform strong PEFT baselines while maintaining or reducing budgeted parameters, and demonstrate hardware-friendly sparse computation. The work suggests a practical path to scaling PEFT performance for large models and offers insights into rank-driven capacity gains and mask design for future exploration.
Abstract
Fine-tuning large pre-trained foundation models, such as the 175B GPT-3, has attracted more attention for downstream tasks recently. While parameter-efficient fine-tuning methods have been proposed and proven effective without retraining all model parameters, their performance is limited by the capacity of incremental modules, especially under constrained parameter budgets. \\ To overcome this challenge, we propose CapaBoost, a simple yet effective strategy that enhances model capacity by leveraging low-rank updates through parallel weight modules in target layers. By applying static random masks to the shared weight matrix, CapaBoost constructs a diverse set of weight matrices, effectively increasing the rank of incremental weights without adding parameters. Notably, our approach can be seamlessly integrated into various existing parameter-efficient fine-tuning methods. We extensively validate the efficacy of CapaBoost through experiments on diverse downstream tasks, including natural language understanding, question answering, and image classification. Our results demonstrate significant improvements over baselines, without incurring additional computation or storage costs. Our code is available at \url{https://github.com/LINs-lab/CapaBoost}.
