Low-rank variational dropout: Rank selection and uncertainty in adapters

Cooper Doyle; Rebecca Chan; Andy Hu; Anna Leontjeva

TL;DR

BayesLoRA is shown empirically to induce stable, non-arbitrary rank structure aligned with the intrinsic singular directions of the learned updates. It outperforms existing low-rank sparsification methods in accuracy at comparable training cost and delivers substantially improved predictive calibration at negligible additional overhead.

Abstract

Low-rank adaptation methods enable efficient task-specific updates in large neural networks, but provide no principled mechanism for uncertainty estimation or capacity control. We introduce Low-Rank Variational Dropout (LRVD), a Bayesian framework that operates directly in the space of low-rank adaptation. LRVD employs a scale-invariant, sparsity-inducing prior together with a structured variational family that ties uncertainty at the level of latent rank components, inducing rank-wise noise-to-signal ratios for automatic capacity selection. As a concrete instantiation, we apply LRVD to low-rank adaptation and obtain BayesLoRA, which jointly learns predictive uncertainty and the effective adapter rank with only O(r) additional parameters, where r is the adapter rank. We empirically show that BayesLoRA induces stable, non-arbitrary rank structure aligned with the intrinsic singular directions of the learned updates, and outperforms existing low-rank sparsification methods in accuracy at comparable training cost while delivering substantially improved predictive calibration at negligible additional overhead.
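
To make the rank-wise mechanism described in the abstract concrete, here is a minimal sketch in plain Python. It rests on assumptions the abstract does not confirm: the adapter update is taken to be B(z ⊙ Ax), each rank component k has one multiplicative Gaussian gate z_k ~ N(1, alpha_k), and a component is pruned once log alpha_k exceeds a threshold. The names `lowrank_forward` and `active_ranks`, the `log_alpha` parameterisation, and the threshold value 3.0 are hypothetical choices for illustration, not the authors' implementation.

```python
import math
import random


def lowrank_forward(x, A, B, log_alpha, rng=None):
    """Compute B @ (z * (A @ x)) with one multiplicative gate z_k per rank.

    A is r x d_in and B is d_out x r, both as nested lists.
    With rng given, z_k ~ N(1, alpha_k) (assumed noise model);
    with rng=None, z_k is fixed at its mean of 1.
    """
    # Project the input onto the r latent rank components.
    h = [sum(a * xi for a, xi in zip(row, x)) for row in A]
    if rng is not None:
        # One noise draw per rank component: only r extra parameters (log_alpha).
        h = [hk * rng.gauss(1.0, math.sqrt(math.exp(la)))
             for hk, la in zip(h, log_alpha)]
    # Map the (possibly noisy) components back to the output space.
    return [sum(b * hk for b, hk in zip(row, h)) for row in B]


def active_ranks(log_alpha, threshold=3.0):
    """Indices of rank components whose log noise-to-signal ratio is below threshold."""
    return [k for k, la in enumerate(log_alpha) if la < threshold]


# Toy adapter with r = 2, d_in = 3, d_out = 3.
A = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
B = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
log_alpha = [-4.0, 5.0]  # component 0 is low-noise, component 1 is high-noise
x = [2.0, 3.0, 1.0]

mean_out = lowrank_forward(x, A, B, log_alpha)
kept = active_ranks(log_alpha)
samples = [lowrank_forward(x, A, B, log_alpha, rng=random.Random(seed))
           for seed in range(4)]
```

In this toy setup, `kept` contains only component 0, so the effective rank is 1, and the spread across `samples` gives a crude Monte Carlo view of predictive uncertainty. The variational objective, including the prior term, is deliberately left out because the abstract does not specify its form.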
