SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation
Teng Hu, Jiangning Zhang, Ran Yi, Hongrui Huang, Yabiao Wang, Lizhuang Ma
TL;DR
Diffusion-model fine-tuning remains costly when adapting large pre-trained models to new domains. This work introduces SaRA, a progressive sparse low-rank adaptation that reuses temporarily ineffective parameters—the smallest $|p|$ entries—via sparse updates while preserving priors. A nuclear-norm-based low-rank constraint mitigates overfitting, and a progressive parameter adjustment strategy reselects remaining below-threshold parameters to maximize utilization. Unstructured backpropagation further reduces memory, enabling a one-line-code integration and competitive backwards-compatible performance, outperforming LoRA in prior preservation and often achieving better VLHI across SD variants. Overall, SaRA offers a model-agnostic, memory-efficient, plug-and-play PEFT alternative that enhances downstream generative capabilities while maintaining pre-trained priors.
Abstract
In recent years, the development of diffusion models has led to significant progress in image and video generation tasks, with pre-trained models like the Stable Diffusion series playing a crucial role. Inspired by model pruning which lightens large pre-trained models by removing unimportant parameters, we propose a novel model fine-tuning method to make full use of these ineffective parameters and enable the pre-trained model with new task-specified capabilities. In this work, we first investigate the importance of parameters in pre-trained diffusion models, and discover that the smallest 10% to 20% of parameters by absolute values do not contribute to the generation process. Based on this observation, we propose a method termed SaRA that re-utilizes these temporarily ineffective parameters, equating to optimizing a sparse weight matrix to learn the task-specific knowledge. To mitigate overfitting, we propose a nuclear-norm-based low-rank sparse training scheme for efficient fine-tuning. Furthermore, we design a new progressive parameter adjustment strategy to make full use of the re-trained/finetuned parameters. Finally, we propose a novel unstructural backpropagation strategy, which significantly reduces memory costs during fine-tuning. Our method enhances the generative capabilities of pre-trained models in downstream applications and outperforms traditional fine-tuning methods like LoRA in maintaining model's generalization ability. We validate our approach through fine-tuning experiments on SD models, demonstrating significant improvements. SaRA also offers a practical advantage that requires only a single line of code modification for efficient implementation and is seamlessly compatible with existing methods.
