PVeRA: Probabilistic Vector-Based Random Matrix Adaptation

Leo Fillioux; Enzo Ferrante; Paul-Henry Cournède; Maria Vakalopoulou; Stergios Christodoulidis


TL;DR

The paper tackles the challenge of efficiently adapting large foundation models under limited data and compute by introducing PVeRA, a probabilistic extension of VeRA that learns a distribution over low-rank adapters. It leverages reparameterization and KL regularization to enable sampling during training and inference, yielding uncertainty estimates and well-calibrated predictions. Empirically, PVeRA surpasses VeRA and other adapters on VTAB-1k while maintaining strong parameter efficiency and enabling inference-time merging of adapters. The approach also demonstrates uncertainty quantification, out-of-distribution detection, and preliminary NLP applicability, suggesting broad utility across vision and language tasks.
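The two ingredients named here, reparameterized sampling and KL regularization, can be illustrated with a minimal NumPy sketch. It assumes a diagonal Gaussian and a standard normal prior; both are illustrative assumptions, since this summary does not say which quantities PVeRA parameterizes or which prior it uses. The function names are hypothetical.

```python
import numpy as np

def sample_gaussian(mu, log_var, rng):
    """Reparameterization: z = mu + sigma * eps with eps ~ N(0, I).

    The noise is drawn independently of (mu, log_var), so in an autodiff
    framework gradients could flow through mu and log_var.
    """
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def kl_to_standard_normal(mu, log_var):
    """Closed-form KL(N(mu, diag(sigma^2)) || N(0, I)), summed over the last axis.

    Assumption: the prior is a standard normal (not confirmed by the source).
    """
    return 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var, axis=-1)

rng = np.random.default_rng(0)
mu = np.zeros((2, 4))       # hypothetical batch of 2, latent size 4
log_var = np.zeros((2, 4))
z = sample_gaussian(mu, log_var, rng)
kl = kl_to_standard_normal(mu, log_var)  # zero when the distribution equals the prior
```

Drawing several values of `z` for the same input and comparing the resulting predictions is one common way to obtain the kind of uncertainty estimate mentioned above.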

Abstract

Large foundation models have emerged in recent years and are pushing performance boundaries on a variety of tasks. Training or even finetuning such models demands vast datasets and computational resources, which are often scarce and costly. Adaptation methods offer a computationally efficient alternative, allowing such models to be finetuned with small amounts of data and computing power. This is achieved by appending new trainable modules, which hold only a fraction of the parameters, to a frozen backbone and fitting only these modules on novel tasks. Recently, the VeRA adapter was shown to excel at parameter-efficient adaptation by using a pair of frozen random low-rank matrices shared across all layers. In this paper, we propose PVeRA, a probabilistic version of the VeRA adapter, which modifies the low-rank matrices of VeRA in a probabilistic manner. This modification naturally handles inherent ambiguities in the input and allows for different sampling configurations during training and testing. A comprehensive evaluation was performed on the VTAB-1k benchmark across seven adapters, with PVeRA outperforming VeRA and the other adapters. Our code for training models with PVeRA and benchmarking all adapters is available at https://github.com/leofillioux/pvera.
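As a rough illustration of the deterministic VeRA building block described in the abstract, the sketch below applies a frozen weight plus a frozen random low-rank pair rescaled by two small trainable vectors, then checks that the update can be merged into a single weight matrix for inference. The dimensions, initial values, and names (`d_vec`, `b_vec`, `adapted_forward`, `merged_weight`) are illustrative assumptions, and the probabilistic modification that PVeRA introduces is not shown.

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, rank = 8, 6, 4  # hypothetical layer sizes

# Frozen: pretrained weight and the random low-rank pair
# (in VeRA the pair is shared across all adapted layers).
W0 = rng.standard_normal((d_out, d_in))
A = rng.standard_normal((rank, d_in))
B = rng.standard_normal((d_out, rank))

# Trainable: two scaling vectors per layer. The values are arbitrary and
# chosen only so that the update is visibly non-zero in this example.
d_vec = np.full(rank, 0.1)
b_vec = 0.01 * rng.standard_normal(d_out)

def adapted_forward(x):
    """Frozen path plus the rescaled low-rank path: W0 x + b * (B (d * (A x)))."""
    low = d_vec * (A @ x)
    return W0 @ x + b_vec * (B @ low)

def merged_weight():
    """Fold the adapter into one matrix: W0 + diag(b) B diag(d) A."""
    return W0 + (b_vec[:, None] * B) @ (d_vec[:, None] * A)

x = rng.standard_normal(d_in)
trainable = d_vec.size + b_vec.size  # 10 trainable values
frozen_full = W0.size                # 48 entries in the frozen weight
```

For a deterministic adapter of this form, `merged_weight() @ x` matches `adapted_forward(x)`, so the merged layer adds no extra computation at inference time.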


Paper Structure

Table of Contents

Figures (9)