Unmasking the Unknown: Facial Deepfake Detection in the Open-Set Paradigm
Nadarasar Bahavan, Sanjay Saha, Ken Chen, Sachith Seneviratne, Sanka Rasnayaka, Saman Halgamuge
TL;DR
This work tackles the limitation of closed-set deepfake detectors by framing detection as an open-set problem, where unseen forgery methods are flagged as unknown. It introduces a three-stage pipeline built on a weighted supervised contrastive learning loss, an SWA-augmented encoder, and a threshold-based open-set detector, demonstrating strong performance on FaceForensics++c23 and competitive cross-dataset results on CelebDFv2. Key contributions include a novel loss formulation that emphasizes real-data clustering, empirical evidence of improved unknown detection (AUROC) and state-of-the-art True Open-Set Classification (TOSC) scores, and explainability analyses via UMAP and Grad-CAM. The approach offers a robust, scalable tool for detecting both known and emerging deepfake techniques, with practical implications for forensic analysis and security. Future work may address domain shift, video-level modeling, and localized priors to further enhance robustness.
Abstract
Facial forgery methods such as deepfakes can be misused for identity manipulation and spreading misinformation. They have evolved alongside advancements in generative AI, leading to new and more sophisticated forgery techniques that diverge from existing 'known' methods. Conventional deepfake detection methods use the closedset paradigm, thus limiting their applicability to detecting forgeries created using methods that are not part of the training dataset. In this paper, we propose a shift from the closed-set paradigm for deepfake detection. In the open-set paradigm, models are designed not only to identify images created by known facial forgery methods but also to identify and flag those produced by previously unknown methods as 'unknown' and not as unforged/real/unmanipulated. In this paper, we propose an open-set deepfake classification algorithm based on supervised contrastive learning. The open-set paradigm used in our model allows it to function as a more robust tool capable of handling emerging and unseen deepfake techniques, enhancing reliability and confidence, and complementing forensic analysis. In open-set paradigm, we identify three groups including the "unknown group that is neither considered known deepfake nor real. We investigate deepfake open-set classification across three scenarios, classifying deepfakes from unknown methods not as real, distinguishing real images from deepfakes, and classifying deepfakes from known methods, using the FaceForensics++ dataset as a benchmark. Our method achieves state of the art results in the first two tasks and competitive results in the third task.
