Hiding Faces in Plain Sight: Defending DeepFakes by Disrupting Face Detection
Delong Zhu, Yuezun Li, Baoyuan Wu, Jiaran Zhou, Zhibo Wang, Siwei Lyu
TL;DR
This paper presents FacePoison, a proactive defense that contaminates input faces to disrupt DNN-based face detectors, thereby thwarting both training and inference of DeepFake models. It introduces VideoFacePoison, a temporal extension using optical flow to propagate perturbations across video frames with reduced computation. By adapting multiple adversarial attacks to attack intermediate detector features and employing an importance-guided pseudo-objective, the method achieves robust disruption across five detectors and eleven DeepFake models, with strong transferability and resilience to common perturbations. The approach is complementary to existing detectors, offering a practical, user-centric defense channel, though it is tailored to face-swap DeepFakes and relies on modern detectors, inviting further work on broader forgery techniques and defenses.
Abstract
This paper investigates the feasibility of a proactive DeepFake defense framework, {\em FacePosion}, to prevent individuals from becoming victims of DeepFake videos by sabotaging face detection. The motivation stems from the reliance of most DeepFake methods on face detectors to automatically extract victim faces from videos for training or synthesis (testing). Once the face detectors malfunction, the extracted faces will be distorted or incorrect, subsequently disrupting the training or synthesis of the DeepFake model. To achieve this, we adapt various adversarial attacks with a dedicated design for this purpose and thoroughly analyze their feasibility. Based on FacePoison, we introduce {\em VideoFacePoison}, a strategy that propagates FacePoison across video frames rather than applying them individually to each frame. This strategy can largely reduce the computational overhead while retaining the favorable attack performance. Our method is validated on five face detectors, and extensive experiments against eleven different DeepFake models demonstrate the effectiveness of disrupting face detectors to hinder DeepFake generation.
