BENet: A Cross-domain Robust Network for Detecting Face Forgeries via Bias Expansion and Latent-space Attention
Weihua Liu, Jianhua Qiu, Said Boumaraf, Chaochao Lin, Pan Liyuan, Lin Li, Mohammed Bennamoun, Naoufel Werghi
TL;DR
BENet tackles deepfake detection under cross-domain shift by combining a bias expansion autoencoder with a multi-scale Latent-space Attention (LSA) and a cross-domain detector. The bias expansion module amplifies forgery cues while preserving real-face reconstructions, and the LSA module emphasizes latent-space inconsistencies across encoder–decoder scales; together they form a discriminative feature space for a binary classifier. Training uses a novel bias expansion loss $L_{be}$ alongside the standard cross-entropy $L_c$, with a balancing parameter $\lambda$ (optimal at 0.5), and inference includes a threshold-based cross-domain verification to handle unseen manipulations. Across intra- and cross-dataset evaluations on FF++, Celeb-DF, DFFD, and DFDC, BENet achieves state-of-the-art performance and demonstrates robustness to unseen perturbations, underscoring its practical potential for real-world deepfake defense.
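As a rough illustration of how the two training terms might be balanced, the sketch below combines a binary cross-entropy $L_c$ with a bias expansion term weighted by $\lambda = 0.5$. The additive form $L = L_c + \lambda L_{be}$ and the margin-based definition of `bias_expansion_loss` are assumptions made for illustration only; the summary above does not state the actual formula for $L_{be}$, and all function and parameter names here are hypothetical.

```python
import math


def cross_entropy(p_fake, label):
    """Binary cross-entropy for one sample; label is 1 for fake, 0 for real."""
    eps = 1e-12
    p = min(max(p_fake, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))


def bias_expansion_loss(recon_error, label, margin=1.0):
    """Hypothetical stand-in for L_be (an assumption, not the paper's definition).

    Real faces (label 0): penalize any reconstruction error, so real faces
    are reconstructed faithfully.
    Fake faces (label 1): penalize reconstruction error that falls below
    `margin`, so the gap between a fake input and its reconstruction widens.
    """
    if label == 0:
        return recon_error
    return max(0.0, margin - recon_error)


def total_loss(p_fake, recon_error, label, lam=0.5):
    """Assumed additive combination L = L_c + lam * L_be."""
    return cross_entropy(p_fake, label) + lam * bias_expansion_loss(recon_error, label)


if __name__ == "__main__":
    # A real face with a small reconstruction error and an uncertain classifier.
    print(total_loss(p_fake=0.5, recon_error=0.2, label=0))
    # A fake face whose reconstruction error already exceeds the margin.
    print(total_loss(p_fake=0.5, recon_error=1.5, label=1))
```

With this stand-in, setting `lam` to 0 recovers plain cross-entropy training, while larger values put more weight on separating real and fake reconstructions.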
Abstract
In response to the growing threat of deepfake technology, we introduce BENet, a Cross-Domain Robust Bias Expansion Network. BENet improves fake-face detection by addressing a limitation of current detectors: their sensitivity to variations across fake-face generation techniques. Here, "cross-domain" refers to the diverse range of these deepfakes, each considered a separate domain. BENet's core feature is a bias expansion module based on autoencoders. This module preserves genuine facial features while amplifying differences in fake reconstructions, creating a reliable bias for detecting fake faces across various deepfake domains. We also introduce a Latent-Space Attention (LSA) module to capture inconsistencies related to fake faces at different scales, ensuring robust defense against advanced deepfake techniques. The enriched LSA feature maps are multiplied with the expanded bias to create a versatile feature space optimized for detecting subtle forgeries. To improve its ability to detect fake faces from unknown sources, BENet integrates a cross-domain detector module that enhances recognition accuracy by verifying the facial domain during inference. We train our network end-to-end with a novel bias expansion loss, adopted for the first time in face forgery detection. Extensive experiments covering both intra- and cross-dataset settings demonstrate BENet's superiority over current state-of-the-art solutions.
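The multiplication of LSA feature maps with the expanded bias can be pictured with a toy example. The sketch below assumes, purely for illustration, that the bias is the element-wise absolute difference between an input and its autoencoder reconstruction and that the attention map holds weights in [0, 1]; the abstract does not specify either detail, and the function names are hypothetical.

```python
def reconstruction_bias(image, reconstruction):
    """Element-wise absolute residual between input and reconstruction.

    Assumption: this residual plays the role of the "bias"; the abstract
    does not give its exact definition.
    """
    return [
        [abs(x - r) for x, r in zip(row_x, row_r)]
        for row_x, row_r in zip(image, reconstruction)
    ]


def modulate(attention, bias):
    """Element-wise product of an attention map and a bias map."""
    return [
        [a * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(attention, bias)
    ]


if __name__ == "__main__":
    # Toy 2x2 "image": the right-hand column is poorly reconstructed.
    image = [[1.0, 0.5], [0.0, 0.25]]
    reconstruction = [[1.0, 0.0], [0.0, 0.75]]
    attention = [[1.0, 0.5], [0.2, 1.0]]

    bias = reconstruction_bias(image, reconstruction)
    features = modulate(attention, bias)
    print(bias)
    print(features)
```

Under these assumptions, regions that are reconstructed well contribute nothing to the resulting feature map, while poorly reconstructed regions are kept in proportion to their attention weight.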
