NP-LoRA: Null Space Projection Unifies Subject and Style in LoRA Fusion
Chuheng Chen, Xiaofei Zhou, Geyuan Zhang, Yong Huang
TL;DR
NP-LoRA presents a training-free, projection-based fusion for combining independently learned content and style LoRAs by enforcing subspace separation in the style subspace. It identifies style principal directions via SVD and projects the content LoRA into the orthogonal null space, with a soft projection controlled by mu to balance fidelity and style. Across SDXL and FLUX backbones and 32 content-style pairs, NP-LoRA achieves superior image fidelity and style coherence compared to strong baselines, while maintaining comparable efficiency and generalizing to unseen backbones. The approach provides a clear geometric interpretation of LoRA fusion, offering robust, plug-in improvements for diffusion model personalization without retraining.
Abstract
Low-Rank Adaptation (LoRA) fusion has emerged as a key technique for reusing and composing learned subject and style representations for controllable generation without costly retraining. However, existing methods rely on weight-based merging, where one LoRA often dominates the other, leading to interference and degraded fidelity. This interference is structural: separately trained LoRAs occupy low-rank high-dimensional subspaces, leading to non-orthogonal and overlapping representations. In this work, we analyze the internal structure of LoRAs and find their generative behavior is dominated by a few principal directions in the low-rank subspace, which should remain free from interference during fusion. To achieve this, we propose Null Space Projection LoRA (NP-LoRA), a projection-based framework for LoRA fusion that enforces subspace separation to prevent structural interference among principal directions. Specifically, we first extract principal style directions via singular value decomposition (SVD) and then project the subject LoRA into its orthogonal null space. Furthermore, we introduce a soft projection mechanism that enables smooth control over the trade-off between subject fidelity and style consistency. Experiments show NP-LoRA consistently improves fusion quality over strong baselines (e.g., DINO and CLIP-based metrics, with human and LLM preference scores), and applies broadly across backbones and LoRA pairs without retraining.
