Learning on LoRAs: GL-Equivariant Processing of Low-Rank Weight Spaces for Large Finetuned Models
Theo Putterman, Derek Lim, Yoav Gelberg, Stefanie Jegelka, Haggai Maron
TL;DR
This work introduces Learning on LoRAs (LoL), a framework for processing low-rank LoRA weight updates with symmetry-aware models. By exploiting GL$(r)$-invariances and developing GL-equivariant layers, the authors build architectures (notably GL-net) that efficiently and universally approximate GL-invariant functions on LoRA inputs. They construct three large LoRA datasets (CelebA-, Imagenette-, and Qwen2-ARC-LoRA) and demonstrate strong predictive performance for CLIP scores, training-data properties, and LM task metrics, while showing good generalization across unseen ranks. The results highlight practical potential for evaluating, editing, and understanding finetuned models solely from their LoRA weights, enabling privacy-aware analysis and rapid model assessment at scale.
Abstract
Low-rank adaptations (LoRAs) have revolutionized the finetuning of large foundation models, enabling efficient adaptation even with limited computational resources. The resulting proliferation of LoRAs presents exciting opportunities for applying machine learning techniques that take these low-rank weights themselves as inputs. In this paper, we investigate the potential of Learning on LoRAs (LoL), a paradigm where LoRA weights serve as input to machine learning models. For instance, an LoL model that takes in LoRA weights as inputs could predict the performance of the finetuned model on downstream tasks, detect potentially harmful finetunes, or even generate novel model edits without traditional training methods. We first identify the inherent parameter symmetries of low rank decompositions of weights, which differ significantly from the parameter symmetries of standard neural networks. To efficiently process LoRA weights, we develop several symmetry-aware invariant or equivariant LoL models, using tools such as canonicalization, invariant featurization, and equivariant layers. We finetune thousands of text-to-image diffusion models and language models to collect datasets of LoRAs. In numerical experiments on these datasets, we show that our LoL architectures are capable of processing low rank weight decompositions to predict CLIP score, finetuning data attributes, finetuning data membership, and accuracy on downstream tasks.
