Proto-LeakNet: Towards Signal-Leak Aware Attribution in Synthetic Human Face Imagery
Claudio Giusti, Luca Guarnera, Sebastiano Battiato
TL;DR
Proto-LeakNet tackles the challenge of attributing synthetic face imagery by exploiting signal-leak traces embedded in diffusion latents. It introduces a latent-domain framework with temporal attention and a prototype-based attribution head, complemented by density-based open-set evaluation to identify unseen generators without retraining. The method achieves a Macro AUC of 98.13% on closed-set data, remains robust under aggressive post-processing, and reveals interpretable latent geometry through prototypes and feature-wise gates. Representation-level analysis via KDE demonstrates separability between known and unseen generators, validating open-set generalization without supervision. This work provides a scalable, interpretable approach for provenance attribution in synthetic media.
Abstract
The growing sophistication of synthetic image and deepfake generation models has turned source attribution and authenticity verification into a critical challenge for modern computer vision systems. Recent studies suggest that diffusion pipelines unintentionally imprint persistent statistical traces, known as signal-leaks, within their outputs, particularly in latent representations. Building on this observation, we propose Proto-LeakNet, a signal-leak-aware and interpretable attribution framework that integrates closed-set classification with a density-based open-set evaluation on the learned embeddings, enabling analysis of unseen generators without retraining. Acting in the latent domain of diffusion models, our method re-simulates partial forward diffusion to expose residual generator-specific cues. A temporal attention encoder aggregates multi-step latent features, while a feature-weighted prototype head structures the embedding space and enables transparent attribution. Trained solely on closed data and achieving a Macro AUC of 98.13%, Proto-LeakNet learns a latent geometry that remains robust under post-processing, surpassing state-of-the-art methods, and achieves strong separability both between real images and known generators, and between known and unseen ones. The codebase will be available after acceptance.
