Semantic Contextualization of Face Forgery: A New Definition, Dataset, and Detection Method
Mian Zou, Baosheng Yu, Yibing Zhan, Siwei Lyu, Kede Ma
TL;DR
The paper reframes face forgery through a semantic lens, defining forgery as alterations to semantic face attributes that cross human perceptual thresholds. It introduces the FFSC dataset with a hierarchical label graph linking global attributes to local regions and two testing protocols to probe generalization across unseen manipulations and attributes. A semantics-oriented detection method is proposed, leveraging a probabilistic label model and bi-level optimization (Auto-$\lambda$) to prioritize the primary real-vs-fake task while propagating semantic and regional cues. Experiments demonstrate FFSC’s effectiveness as a challenging testbed and its utility as a training set to improve generalization, with the proposed method outperforming traditional binary or multi-class detectors and reducing reliance on manipulation-specific cues.
Abstract
In recent years, deep learning has greatly streamlined the process of manipulating photographic face images. Aware of the potential dangers, researchers have developed various tools to spot these counterfeits. Yet none asks the fundamental question: which digital manipulations make a real photographic face image fake, and which do not? In this paper, we put face forgery in a semantic context and define its sources as computational methods that alter semantic face attributes beyond human discrimination thresholds. Following our definition, we construct a large face forgery image dataset, where each image is associated with a set of labels organized in a hierarchical graph. Our dataset enables two new testing protocols to probe the generalizability of face forgery detectors. Moreover, we propose a semantics-oriented face forgery detection method that captures label relations and prioritizes the primary task (i.e., real or fake face detection). We show that the proposed dataset, when used as a test set, successfully exposes the weaknesses of current detectors and, when used as a training set, consistently improves their generalizability. Additionally, we demonstrate the superiority of our semantics-oriented method over traditional binary and multi-class classification-based detectors.
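
To make the idea of labels organized in a hierarchical graph more concrete, the following is a minimal Python sketch. It is not the dataset's actual label set or the paper's label model: the node names (`fake`, `expression`, `identity`, `mouth`, `eyes`, `nose`), the `PARENTS` mapping, and the consistency rule (a positive node needs at least one positive parent) are all assumptions chosen for illustration.

```python
from typing import Dict, List

# Hypothetical hierarchy: root ("fake") -> global attributes -> local regions.
# Node names and edges are illustrative assumptions, not the real label graph.
PARENTS: Dict[str, List[str]] = {
    "expression": ["fake"],
    "identity": ["fake"],
    "mouth": ["expression", "identity"],
    "eyes": ["expression", "identity"],
    "nose": ["identity"],
}


def is_consistent(labels: Dict[str, int]) -> bool:
    """Check an assumed rule: a positive node needs at least one positive parent.

    Nodes missing from `labels` are treated as 0 (not altered).
    """
    for node, parents in PARENTS.items():
        node_on = labels.get(node, 0) == 1
        parent_on = any(labels.get(p, 0) == 1 for p in parents)
        if node_on and not parent_on:
            return False
    return True


# A real image: every label is 0.
real_image = {"fake": 0}
# An expression edit localized to the mouth region.
expression_edit = {"fake": 1, "expression": 1, "mouth": 1}
# Inconsistent: a local region is flagged but nothing above it is.
orphan_region = {"fake": 0, "mouth": 1}

for name, labels in [("real_image", real_image),
                     ("expression_edit", expression_edit),
                     ("orphan_region", orphan_region)]:
    print(name, is_consistent(labels))
```

Under these assumptions, a detector trained on such labels would be discouraged from predicting an altered local region while declaring the whole face real, which is one simple way label relations can support the primary real-or-fake task.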
