IPA: An Information-Reconstructive Input Projection Framework for Efficient Foundation Model Adaptation

Yuan Yin; Shashanka Venkataramanan; Tuan-Hung Vu; Andrei Bursuc; Matthieu Cord

IPA: An Information-Reconstructive Input Projection Framework for Efficient Foundation Model Adaptation

Yuan Yin, Shashanka Venkataramanan, Tuan-Hung Vu, Andrei Bursuc, Matthieu Cord

TL;DR

IPA reframes parameter-efficient adaptation by replacing the data-agnostic LoRA down-projection with a feature-aware input projection that preserves input information in a reduced space. It introduces a forward-only, information-reconstruction objective (via a linear P and its decoder Q) and instantiates it with Incremental PCA to pretrain the projector efficiently. Empirically, IPA consistently outperforms random projection baselines across language and vision-language benchmarks and can match full LoRA performance with roughly half the trainable parameters when the projector is frozen. The approach yields robust improvements with modest pretraining overhead and offers a practical path to more efficient foundation-model adaptation.

Abstract

Parameter-efficient fine-tuning (PEFT) methods, such as LoRA, reduce adaptation cost by injecting low-rank updates into pretrained weights. However, LoRA's down-projection is randomly initialized and data-agnostic, discarding potentially useful information. Prior analyses show that this projection changes little during training, while the up-projection carries most of the adaptation, making the random input compression a performance bottleneck. We propose IPA, a feature-aware projection framework that explicitly aims to reconstruct the original input within a reduced hidden space. In the linear case, we instantiate IPA with algorithms approximating top principal components, enabling efficient projector pretraining with negligible inference overhead. Across language and vision benchmarks, IPA consistently improves over LoRA and DoRA, achieving on average 1.5 points higher accuracy on commonsense reasoning and 2.3 points on VTAB-1k, while matching full LoRA performance with roughly half the trainable parameters when the projection is frozen. Code available at https://github.com/valeoai/peft-ipa .

IPA: An Information-Reconstructive Input Projection Framework for Efficient Foundation Model Adaptation

TL;DR

Abstract

IPA: An Information-Reconstructive Input Projection Framework for Efficient Foundation Model Adaptation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)