Generalized Deepfake Attribution
Sowdagar Mahammad Shahid, Sudev Kumar Padhi, Umesh Kashyap, Sk. Subidh Ali
TL;DR
The paper tackles the challenge of attributing GAN-generated images to their underlying architectures when generators are retrained with different seeds or fine-tuned. It introduces Generalized Deepfake Attribution Network (GDA-Net), comprising a Feature Extraction Network (FEN) and a multi-class classifier, with Vanilla-FEN and Denoiser-FEN variants that leverage supervised contrastive learning to extract architecture-dependent fingerprints and reduce content dependency via residuals from a denoising autoencoder. The approach demonstrates robust cross-seed and fine-tuning attribution across DCGAN, WGAN, ProGAN, and SNGAN, outperforming prior methods in generalization. The work has practical impact for forensic analysis and IP protection of GAN architectures, with code released to validate the results.
Abstract
The landscape of fake media creation changed with the introduction of Generative Adversarial Networks (GAN s). Fake media creation has been on the rise with the rapid advances in generation technology, leading to new challenges in Detecting fake media. A fundamental characteristic of GAN s is their sensitivity to parameter initialization, known as seeds. Each distinct seed utilized during training leads to the creation of unique model instances, resulting in divergent image outputs despite employing the same architecture. This means that even if we have one GAN architecture, it can produce countless variations of GAN models depending on the seed used. Existing methods for attributing deepfakes work well only if they have seen the specific GAN model during training. If the GAN architectures are retrained with a different seed, these methods struggle to attribute the fakes. This seed dependency issue made it difficult to attribute deepfakes with existing methods. We proposed a generalized deepfake attribution network (GDA-N et) to attribute fake images to their respective GAN architectures, even if they are generated from a retrained version of the GAN architecture with a different seed (cross-seed) or from the fine-tuned version of the existing GAN model. Extensive experiments on cross-seed and fine-tuned data of GAN models show that our method is highly effective compared to existing methods. We have provided the source code to validate our results.
