AuthenLoRA: Entangling Stylization with Imperceptible Watermarks for Copyright-Secure LoRA Adapters
Fangming Shi, Li Li, Kejiang Chen, Guorui Feng, Xinpeng Zhang
TL;DR
AuthenLoRA presents a joint optimization framework that embeds imperceptible, traceable watermarks into LoRA adapters used for diffusion-based stylization, tightly coupling watermarking with image generation. The method introduces a diagonal scaling mechanism, an expanded fine-tuning scope including ResNet blocks, and a two-stage training process with latent watermark pretraining and multi-objective loss to balance style fidelity with watermark reliability. A zero-message regularization strategy and perceptual residual suppression (PRTS) are proposed to minimize false positives and watermark residuals, while robustness is demonstrated against LoRA merging, pruning, and quantization, as well as cross-base-model and generation-parameter variations. The results show AuthenLoRA achieves high-fidelity stylization, near-perfect watermark extraction under benign conditions, and strong robustness to adversarial processing, at the cost of modestly larger model size and perceptible residuals compared with some post-hoc methods. Open-source implementation is provided to enable provenance tracking and copyright enforcement for stylized LoRA adapters.
Abstract
Low-Rank Adaptation (LoRA) offers an efficient paradigm for customizing diffusion models, but its ease of redistribution raises concerns over unauthorized use and the generation of untraceable content. Existing watermarking techniques either target base models or verify LoRA modules themselves, yet they fail to propagate watermarks to generated images, leaving a critical gap in traceability. Moreover, traceability watermarking designed for base models is not tightly coupled with stylization and often introduces visual degradation or high false-positive detection rates. To address these limitations, we propose AuthenLoRA, a unified watermarking framework that embeds imperceptible, traceable watermarks directly into the LoRA training process while preserving stylization quality. AuthenLoRA employs a dual-objective optimization strategy that jointly learns the target style distribution and the watermark-induced distribution shift, ensuring that any image generated with the watermarked LoRA reliably carries the watermark. We further design an expanded LoRA architecture for enhanced multi-scale adaptation and introduce a zero-message regularization mechanism that substantially reduces false positives during watermark verification. Extensive experiments demonstrate that AuthenLoRA achieves high-fidelity stylization, robust watermark propagation, and significantly lower false-positive rates compared with existing approaches. Open-source implementation is available at: https://github.com/ShiFangming0823/AuthenLoRA
