Table of Contents
Fetching ...

Selective Domain-Invariant Feature for Generalizable Deepfake Detection

Yingxin Lai, Guoqing Yang Yifan He, Zhiming Luo, Shaozi Li

TL;DR

A novel framework Selective Domain-Invariant Feature (SDIF), which reduces the sensitivity to face forgery by fusing content features and styles, is proposed, which reduces the sensitivity to face forgery by fusing content features and styles.

Abstract

With diverse presentation forgery methods emerging continually, detecting the authenticity of images has drawn growing attention. Although existing methods have achieved impressive accuracy in training dataset detection, they still perform poorly in the unseen domain and suffer from forgery of irrelevant information such as background and identity, affecting generalizability. To solve this problem, we proposed a novel framework Selective Domain-Invariant Feature (SDIF), which reduces the sensitivity to face forgery by fusing content features and styles. Specifically, we first use a Farthest-Point Sampling (FPS) training strategy to construct a task-relevant style sample representation space for fusing with content features. Then, we propose a dynamic feature extraction module to generate features with diverse styles to improve the performance and effectiveness of the feature extractor. Finally, a domain separation strategy is used to retain domain-related features to help distinguish between real and fake faces. Both qualitative and quantitative results in existing benchmarks and proposals demonstrate the effectiveness of our approach.

Selective Domain-Invariant Feature for Generalizable Deepfake Detection

TL;DR

A novel framework Selective Domain-Invariant Feature (SDIF), which reduces the sensitivity to face forgery by fusing content features and styles, is proposed, which reduces the sensitivity to face forgery by fusing content features and styles.

Abstract

With diverse presentation forgery methods emerging continually, detecting the authenticity of images has drawn growing attention. Although existing methods have achieved impressive accuracy in training dataset detection, they still perform poorly in the unseen domain and suffer from forgery of irrelevant information such as background and identity, affecting generalizability. To solve this problem, we proposed a novel framework Selective Domain-Invariant Feature (SDIF), which reduces the sensitivity to face forgery by fusing content features and styles. Specifically, we first use a Farthest-Point Sampling (FPS) training strategy to construct a task-relevant style sample representation space for fusing with content features. Then, we propose a dynamic feature extraction module to generate features with diverse styles to improve the performance and effectiveness of the feature extractor. Finally, a domain separation strategy is used to retain domain-related features to help distinguish between real and fake faces. Both qualitative and quantitative results in existing benchmarks and proposals demonstrate the effectiveness of our approach.
Paper Structure (11 sections, 8 equations, 4 figures, 4 tables)

This paper contains 11 sections, 8 equations, 4 figures, 4 tables.

Figures (4)

  • Figure 1: Illustration of how our method differs from previous.
  • Figure 2: The overall framework of our proposed method.
  • Figure 3: Illustrations of our designed Dynamic feature Extractor.
  • Figure 4: Comparative experiments with different balance weight of $\lambda$ from FF++(HQ) to Celeb-DF.