LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution
Danielle Sullivan-Pao, Nicole Tian, Pooya Khorrami
TL;DR
LoRAX addresses the problem of continual deepfake attribution under evolving generative models by introducing per-task LoRA adapters on a frozen ConViT backbone, forming task-specific feature extractors whose outputs are concatenated into a single super feature for attribution. The method leverages a two-term loss with a diversity component to minimize redundancy among adapters, and it employs exemplar memory to mitigate forgetting across tasks. Empirical results on the CDDB benchmark show LoRAX is competitive with or superior to state-of-the-art class incremental learning methods across memory budgets, while dramatically reducing trainable parameters (e.g., ~2.5M vs ~86M for ConViT Base). The approach also demonstrates the importance of backbone choice, with ConViT-based LoRAX delivering strong performance and substantial memory savings, suggesting practical applicability for scalable, continual deepfake attribution in real-world settings.
Abstract
As generative AI image technologies become more widespread and advanced, there is a growing need for strong attribution models. These models are crucial for verifying the authenticity of images and identifying the architecture of their originating generative models-key to maintaining media integrity. However, attribution models struggle to generalize to unseen models, and traditional fine-tuning methods for updating these models have shown to be impractical in real-world settings. To address these challenges, we propose LoRA eXpandable Networks (LoRAX), a parameter-efficient class incremental algorithm that adapts to novel generative image models without the need for full retraining. Our approach trains an extremely parameter-efficient feature extractor per continual learning task via Low Rank Adaptation. Each task-specific feature extractor learns distinct features while only requiring a small fraction of the parameters present in the underlying feature extractor's backbone model. Our extensive experimentation shows LoRAX outperforms or remains competitive with state-of-the-art class incremental learning algorithms on the Continual Deepfake Detection benchmark across all training scenarios and memory settings, while requiring less than 3% of the number of trainable parameters per feature extractor compared to the full-rank implementation. LoRAX code is available at: https://github.com/mit-ll/lorax_cil.
