Dodging DeepFake Detection via Implicit Spatial-Domain Notch Filtering

Yihao Huang; Felix Juefei-Xu; Qing Guo; Yang Liu; Geguang Pu

Dodging DeepFake Detection via Implicit Spatial-Domain Notch Filtering

Yihao Huang, Felix Juefei-Xu, Qing Guo, Yang Liu, Geguang Pu

TL;DR

This work introduces DeepNotch, a learning-based pipeline that implicitly performs spatial-domain notch filtering to remove artifact patterns in DeepFake images without sacrificing visual quality. By first adding noise to disrupt periodic artifacts and then applying per-pixel deep image filtering (K predicted by a UNet and reconstruction \\hat{I} = K \\circledast I'), the method achieves detector evasion across three state-of-the-art detectors (GANFingerprint, DCTA, CNNDetector) over 16 DeepFake types. An adversarial-noise-guided variant further localizes noise to semantically meaningful regions, significantly reducing detection accuracy (e.g., up to ~98% in best cases) while maintaining high image fidelity (COSS/PSNR/SSIM near original). The results reveal that existing detectors rely heavily on artifact cues and highlight the need for more robust, artifact-agnostic detection methods; the technique also raises ethical considerations about responsible disclosure and defense enhancement. Overall, DeepNotch demonstrates a powerful evasion strategy and provides a framework to stress-test and improve future DeepFake detectors.

Abstract

The current high-fidelity generation and high-precision detection of DeepFake images are at an arms race. We believe that producing DeepFakes that are highly realistic and 'detection evasive' can serve the ultimate goal of improving future generation DeepFake detection capabilities. In this paper, we propose a simple yet powerful pipeline to reduce the artifact patterns of fake images without hurting image quality by performing implicit spatial-domain notch filtering. We first demonstrate that frequency-domain notch filtering, although famously shown to be effective in removing periodic noise in the spatial domain, is infeasible for our task at hand due to the manual designs required for the notch filters. We, therefore, resort to a learning-based approach to reproduce the notch filtering effects, but solely in the spatial domain. We adopt a combination of adding overwhelming spatial noise for breaking the periodic noise pattern and deep image filtering to reconstruct the noise-free fake images, and we name our method DeepNotch. Deep image filtering provides a specialized filter for each pixel in the noisy image, producing filtered images with high fidelity compared to their DeepFake counterparts. Moreover, we also use the semantic information of the image to generate an adversarial guidance map to add noise intelligently. Our large-scale evaluation on 3 representative state-of-the-art DeepFake detection methods (tested on 16 types of DeepFakes) has demonstrated that our technique significantly reduces the accuracy of these 3 fake image detection methods, 36.79% on average and up to 97.02% in the best case.

Dodging DeepFake Detection via Implicit Spatial-Domain Notch Filtering

TL;DR

Abstract

Dodging DeepFake Detection via Implicit Spatial-Domain Notch Filtering

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)