Table of Contents
Fetching ...

Fairness-Aware Deepfake Detection: Leveraging Dual-Mechanism Optimization

Feng Ding, Wenhui Yi, Yunpeng Zhou, Xinan He, Hong Rao, Shu Hu

TL;DR

This work tackles fairness in deepfake detection by addressing demographic bias across gender and race without sacrificing accuracy. It introduces a dual-mechanism framework combining Structural Fairness Decoupling, which removes demographic leakage by decoupling channels correlated with sensitive attributes, and Global Distribution Alignment, which uses optimal transport with a mutual information constraint to align subgroup distributions with global ones. Experiments on FF++ and cross-domain datasets show improved inter-group and intra-group fairness while maintaining or enhancing detection performance, with ablations confirming the complementary value of the two modules. Visualizations and robustness tests further support that the approach focuses on facial cues and generalizes across backbones, indicating practical potential for fairer deepfake detection systems.

Abstract

Fairness is a core element in the trustworthy deployment of deepfake detection models, especially in the field of digital identity security. Biases in detection models toward different demographic groups, such as gender and race, may lead to systemic misjudgments, exacerbating the digital divide and social inequities. However, current fairness-enhanced detectors often improve fairness at the cost of detection accuracy. To address this challenge, we propose a dual-mechanism collaborative optimization framework. Our proposed method innovatively integrates structural fairness decoupling and global distribution alignment: decoupling channels sensitive to demographic groups at the model architectural level, and subsequently reducing the distance between the overall sample distribution and the distributions corresponding to each demographic group at the feature level. Experimental results demonstrate that, compared with other methods, our framework improves both inter-group and intra-group fairness while maintaining overall detection accuracy across domains.

Fairness-Aware Deepfake Detection: Leveraging Dual-Mechanism Optimization

TL;DR

This work tackles fairness in deepfake detection by addressing demographic bias across gender and race without sacrificing accuracy. It introduces a dual-mechanism framework combining Structural Fairness Decoupling, which removes demographic leakage by decoupling channels correlated with sensitive attributes, and Global Distribution Alignment, which uses optimal transport with a mutual information constraint to align subgroup distributions with global ones. Experiments on FF++ and cross-domain datasets show improved inter-group and intra-group fairness while maintaining or enhancing detection performance, with ablations confirming the complementary value of the two modules. Visualizations and robustness tests further support that the approach focuses on facial cues and generalizes across backbones, indicating practical potential for fairer deepfake detection systems.

Abstract

Fairness is a core element in the trustworthy deployment of deepfake detection models, especially in the field of digital identity security. Biases in detection models toward different demographic groups, such as gender and race, may lead to systemic misjudgments, exacerbating the digital divide and social inequities. However, current fairness-enhanced detectors often improve fairness at the cost of detection accuracy. To address this challenge, we propose a dual-mechanism collaborative optimization framework. Our proposed method innovatively integrates structural fairness decoupling and global distribution alignment: decoupling channels sensitive to demographic groups at the model architectural level, and subsequently reducing the distance between the overall sample distribution and the distributions corresponding to each demographic group at the feature level. Experimental results demonstrate that, compared with other methods, our framework improves both inter-group and intra-group fairness while maintaining overall detection accuracy across domains.

Paper Structure

This paper contains 25 sections, 10 equations, 10 figures, 8 tables.

Figures (10)

  • Figure 1: Example illustrating our proposed fairness enhancement strategy. The baseline method may lead to unfair model performance across intra-group and inter-group. Our proposed method effectively addresses this issue by more balanced handling of different sensitive attributes.
  • Figure 2: Overview of our proposed method: (1) Structural Fairness Decoupling: dynamically identify and decouple channels that are highly correlated with sensitive attributes; (2) Global Distribution Alignment: align prediction distributions of real and fake images within each sensitive subgroup with the global distributions. Specifically, this module adjusts the distributions of real images $g_r^i$ and fake images $g_f^i$ for different sensitive groups ($i \in \mathcal{A}$) to align with the global distributions of real images $\mathcal{R}$ and fake images $\mathcal{G}$.
  • Figure 3: Robustness evaluation of proposed methods. We test the robustness of the proposed method against four disturbances of various intensities, and all models were trained on FF++.
  • Figure 4: Grad-CAM visualization of ours and other methods on the intra-domain dataset (F++), and cross-domain datasets (DFDC, Celeb-DF, and DFD).
  • Figure 5: Analysis on the FF++ test set illustrating the impact of different decoupling iterations and decoupling ratios on the fairness performance ($F_{FPR}$) for Xception backnone.
  • ...and 5 more figures