IterIS: Iterative Inference-Solving Alignment for LoRA Merging

Hongxu Chen; Runshi Li; Bowei Zhu; Zhen Wang; Long Chen

TL;DR

IterIS merges multiple task-specific LoRAs into a single adapter without gradient-based fine-tuning or access to the original training data. It casts LoRA merging as an optimization problem and solves it by alternating two steps: an inference step that runs the current unified adapter to obtain its input features $\tilde{\bm{X}}_i$ for each task $i$, and a solving step that updates the merged weight in closed form, $W^* = ( \sum_i \lambda_i \tilde{\bm{X}}_i \tilde{\bm{X}}_i^T )^{-1} ( \sum_i \lambda_i \tilde{\bm{X}}_i \bm{X}_i^T W_i )$, where $\bm{X}_i$ and $W_i$ are the input features and weights of the $i$-th task-specific LoRA and $\lambda_i$ are adaptive weights. A regularization term reduces the number of unlabeled samples needed to 1-5% of what prior methods require. The method exploits a directed acyclic graph structure to bound the number of iterations and updates layer by layer for efficiency, and it improves over baselines on text-to-image diffusion, vision-language models, and large language models. By using the unified adapter's own input features and iteratively refining the objective, IterIS addresses three weaknesses of prior approaches: rough assumptions about input features, large unlabeled-sample requirements, and unbalanced optimization objectives. The result is privacy-preserving, data-efficient multi-task model composition for parameter-efficient fine-tuning (PEFT).
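As an illustration of the solving step, here is a minimal NumPy sketch of the closed-form update quoted in the TL;DR. It assumes each feature matrix stores one sample per column (shape `d_in x n`) and each weight has shape `d_in x d_out`, so the formula is the stationary point of $\sum_i \lambda_i \| \tilde{\bm{X}}_i^T W - \bm{X}_i^T W_i \|_F^2$. The function name `solve_merged_weight` and the `ridge` parameter are hypothetical; the ridge term is only a generic stabilizer for ill-conditioned Gram matrices and is not claimed to be the regularizer used in the paper.

```python
import numpy as np

def solve_merged_weight(X_tilde, X, W, lam, ridge=0.0):
    """Closed-form solving step for one layer (illustrative sketch).

    X_tilde[i]: unified-adapter input features for task i, shape (d_in, n_i)
    X[i]:       task-specific input features for task i,   shape (d_in, n_i)
    W[i]:       task-specific weight for task i,           shape (d_in, d_out)
    lam[i]:     scalar weight for task i
    ridge:      hypothetical Tikhonov term (assumption, not from the paper)
    """
    d_in = W[0].shape[0]
    gram = ridge * np.eye(d_in)                # accumulates sum_i lam_i Xt_i Xt_i^T
    rhs = np.zeros_like(W[0], dtype=float)     # accumulates sum_i lam_i Xt_i X_i^T W_i
    for Xt_i, X_i, W_i, l_i in zip(X_tilde, X, W, lam):
        gram += l_i * Xt_i @ Xt_i.T
        rhs += l_i * Xt_i @ (X_i.T @ W_i)
    # Solve gram @ W_star = rhs instead of forming the inverse explicitly.
    return np.linalg.solve(gram, rhs)
```

Using `np.linalg.solve` rather than an explicit inverse is a numerical design choice of this sketch. If every task shares the same weight and the unified features equal the task features, the update returns that shared weight, which is a quick sanity check on the formula.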

Abstract

Low-rank adaptations (LoRA) are widely used to fine-tune large models across various domains for specific downstream tasks. While task-specific LoRAs are often available, concerns about data privacy and intellectual property can restrict access to training data, which prevents obtaining a multi-task model through gradient-based training. In response, LoRA merging presents an effective solution by combining multiple LoRAs into a unified adapter while maintaining data privacy. Prior works on LoRA merging primarily frame it as an optimization problem, yet these approaches face several limitations: a rough assumption about the input features used in optimization, massive sample requirements, and an unbalanced optimization objective. These limitations can significantly degrade performance. To address them, we propose a novel optimization-based method, named IterIS: 1) We formulate LoRA merging as an advanced optimization problem to mitigate the rough assumption, and we employ an iterative inference-solving framework that progressively refines the optimization objective for improved performance. 2) We introduce an efficient regularization term that reduces the number of samples required (only 1-5% of the unlabeled samples needed by prior methods). 3) We utilize adaptive weights in the optimization objective to mitigate potential imbalances in the LoRA merging process. Our method demonstrates significant improvements over multiple baselines and state-of-the-art methods in composing tasks for text-to-image diffusion, vision-language models, and large language models. Furthermore, our layer-wise algorithm converges in a small number of steps, ensuring efficiency in both memory and computation.
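The iterative inference-solving loop described in the abstract can be sketched on a toy model. The example below is an assumption-laden illustration, not the authors' implementation: it uses a plain chain of ReLU layers with full weight matrices instead of LoRA-adapted blocks, initializes the merged weights by simple averaging, and adds a small ridge term for numerical stability. All names (`iteris_toy`, `solve_layer`, `forward_inputs`) are hypothetical.

```python
import numpy as np

def relu(a):
    return np.maximum(a, 0.0)

def solve_layer(Xt, X, W, lam, ridge=1e-6):
    """Closed-form solving step for one layer; columns of each X are samples."""
    gram = ridge * np.eye(W[0].shape[0])   # ridge is an assumption of this sketch
    rhs = np.zeros_like(W[0])
    for Xt_i, X_i, W_i, l_i in zip(Xt, X, W, lam):
        gram += l_i * Xt_i @ Xt_i.T
        rhs += l_i * Xt_i @ (X_i.T @ W_i)
    return np.linalg.solve(gram, rhs)

def forward_inputs(weights, x0):
    """Return the input features seen by each layer of a ReLU chain."""
    feats = [x0]
    for W in weights[:-1]:
        feats.append(relu(W.T @ feats[-1]))
    return feats

def iteris_toy(task_weights, task_inputs, lam, n_iters):
    n_layers = len(task_weights[0])
    # Task-specific features X_i are fixed, so compute them once.
    X = [forward_inputs(ws, x0) for ws, x0 in zip(task_weights, task_inputs)]
    # Assumed initialization: average the task weights layer by layer.
    merged = [np.mean([ws[l] for ws in task_weights], axis=0)
              for l in range(n_layers)]
    history = []
    for _ in range(n_iters):
        # Inference step: features under the current merged weights.
        Xt = [forward_inputs(merged, x0) for x0 in task_inputs]
        # Solving step: layer-wise closed-form update.
        merged = [solve_layer([f[l] for f in Xt], [f[l] for f in X],
                              [ws[l] for ws in task_weights], lam)
                  for l in range(n_layers)]
        history.append(merged)
    return history
```

In this toy chain, the merged features entering a layer depend only on earlier layers, so the merged weights stop changing after as many iterations as there are layers. This is meant as an intuition for how an acyclic dependency structure can bound the number of iterations; it is not a reproduction of the paper's argument.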
