Table of Contents
Fetching ...

Data-Driven Fairness Generalization for Deepfake Detection

Uzoamaka Ezeakunne, Chrisantus Eze, Xiuwen Liu

TL;DR

The paper addresses fairness generalization in deepfake detection, where models exhibit demographic biases and struggle on unseen data. It introduces a data-centric framework that uses synthetic self-balanced images (SBI), a multi-task architecture with dual heads for detection and demographic prediction, and Sharpness-Aware Minimization (SAM) to encourage robust generalization. The approach optimizes a combined loss that balances accuracy and fairness across demographic groups while maintaining balanced real/fake data. Across intra-dataset and cross-dataset experiments, the method achieves comparable detection performance to baselines but substantially reduces demographic disparities, demonstrating the potential of synthetic data for fairness generalization in deepfake detection.

Abstract

Despite the progress made in deepfake detection research, recent studies have shown that biases in the training data for these detectors can result in varying levels of performance across different demographic groups, such as race and gender. These disparities can lead to certain groups being unfairly targeted or excluded. Traditional methods often rely on fair loss functions to address these issues, but they under-perform when applied to unseen datasets, hence, fairness generalization remains a challenge. In this work, we propose a data-driven framework for tackling the fairness generalization problem in deepfake detection by leveraging synthetic datasets and model optimization. Our approach focuses on generating and utilizing synthetic data to enhance fairness across diverse demographic groups. By creating a diverse set of synthetic samples that represent various demographic groups, we ensure that our model is trained on a balanced and representative dataset. This approach allows us to generalize fairness more effectively across different domains. We employ a comprehensive strategy that leverages synthetic data, a loss sharpness-aware optimization pipeline, and a multi-task learning framework to create a more equitable training environment, which helps maintain fairness across both intra-dataset and cross-dataset evaluations. Extensive experiments on benchmark deepfake detection datasets demonstrate the efficacy of our approach, surpassing state-of-the-art approaches in preserving fairness during cross-dataset evaluation. Our results highlight the potential of synthetic datasets in achieving fairness generalization, providing a robust solution for the challenges faced in deepfake detection.

Data-Driven Fairness Generalization for Deepfake Detection

TL;DR

The paper addresses fairness generalization in deepfake detection, where models exhibit demographic biases and struggle on unseen data. It introduces a data-centric framework that uses synthetic self-balanced images (SBI), a multi-task architecture with dual heads for detection and demographic prediction, and Sharpness-Aware Minimization (SAM) to encourage robust generalization. The approach optimizes a combined loss that balances accuracy and fairness across demographic groups while maintaining balanced real/fake data. Across intra-dataset and cross-dataset experiments, the method achieves comparable detection performance to baselines but substantially reduces demographic disparities, demonstrating the potential of synthetic data for fairness generalization in deepfake detection.

Abstract

Despite the progress made in deepfake detection research, recent studies have shown that biases in the training data for these detectors can result in varying levels of performance across different demographic groups, such as race and gender. These disparities can lead to certain groups being unfairly targeted or excluded. Traditional methods often rely on fair loss functions to address these issues, but they under-perform when applied to unseen datasets, hence, fairness generalization remains a challenge. In this work, we propose a data-driven framework for tackling the fairness generalization problem in deepfake detection by leveraging synthetic datasets and model optimization. Our approach focuses on generating and utilizing synthetic data to enhance fairness across diverse demographic groups. By creating a diverse set of synthetic samples that represent various demographic groups, we ensure that our model is trained on a balanced and representative dataset. This approach allows us to generalize fairness more effectively across different domains. We employ a comprehensive strategy that leverages synthetic data, a loss sharpness-aware optimization pipeline, and a multi-task learning framework to create a more equitable training environment, which helps maintain fairness across both intra-dataset and cross-dataset evaluations. Extensive experiments on benchmark deepfake detection datasets demonstrate the efficacy of our approach, surpassing state-of-the-art approaches in preserving fairness during cross-dataset evaluation. Our results highlight the potential of synthetic datasets in achieving fairness generalization, providing a robust solution for the challenges faced in deepfake detection.

Paper Structure

This paper contains 15 sections, 12 equations, 9 figures, 2 tables.

Figures (9)

  • Figure 1: Fairness Generalization Comparison: A detector's accuracy can generalize well to unseen test sets A and B, maintaining consistent detection performance. However, while fairness metrics are preserved on test set B, they fail to generalize to test set A, highlighting the challenge of achieving consistent fairness across different datasets.
  • Figure 2: Sample of Facial Images. The real images (top row) and their corresponding fake/synthetic image (bottom row).
  • Figure 3: An overview of our proposed method. We utilize EfficientNet eff to extract deep features from input images for the feature extraction module. For the classification module, two heads are used: the real/fake head predicts whether the input is real or fake, and the demographic head predicts the demographic group. The demographic classification head outputs probabilities for one of eight demographic categories based on both gender and ethnicity. These categories include Black-Male (B-M), Black-Female (B-F), White-Male (W-M), White-Female (W-F), Asian-Male (A-M), Asian-Female (A-F), Other-Male (O-M), and Other-Female (O-F). We use SAM (Sharpness-Aware Minimization) for the optimization module to flatten the loss landscape and enhance fairness generalization across demographic groups.
  • Figure 4: Intra-dataset evaluation across different methods. Methods are trained and tested on the same distribution as the training set (FF++ ff++): Performance comparison across gender. See Table \ref{['tab:intracomparison']} for exact numerical values
  • Figure 5: Intra-dataset evaluation across different methods. Methods are trained and tested on the same distribution as the training set (FF++ ff++): Performance comparison across race. See Table \ref{['tab:intracomparison']} for exact numerical values
  • ...and 4 more figures