Generalized Face Liveness Detection via De-fake Face Generator
Xingming Long, Jie Zhang, Shiguang Shan
TL;DR
This paper tackles the domain generalization (DG) challenge in face anti-spoofing (FAS) by leveraging abundant real-face data. It introduces a De-fake Face Generator (DFG), trained only on real faces, that synthesizes a real-looking version of any input. The residual between the input and this generated real face serves as an anomalous cue, which an Off-real Attention Network (OA-Net) uses to focus on spoof regions. The authors provide a theoretical analysis of why the cues for real and fake faces are distinguishable, and they report state-of-the-art cross-domain performance across nine public datasets, with ablations validating the design choices. The plug-and-play OA-Net can also enhance existing DG-based FAS methods, offering a practical path toward robust liveness detection in diverse real-world settings.
Abstract
Previous Face Anti-spoofing (FAS) methods struggle to generalize to unseen domains, mainly because most existing FAS datasets are relatively small and lack data diversity. Thanks to the development of face recognition over the past decade, numerous real face images are publicly available, yet the existing literature has largely neglected them. In this paper, we propose an Anomalous cue Guided FAS (AG-FAS) method, which effectively leverages large-scale additional real faces to improve model generalization via a De-fake Face Generator (DFG). Specifically, by training on a large-scale real-face-only dataset, the generator learns what a real face should look like and can therefore generate a "real" version of any input face image. Consequently, the difference between the input face and the generated "real" face can be treated as an attention cue for fake feature learning. Building on these ideas, we propose an Off-real Attention Network (OA-Net), which allocates its attention to the spoof region of the input according to the anomalous cue. Extensive experiments on a total of nine public datasets show that our method achieves state-of-the-art results under cross-domain evaluations with unseen scenarios and unknown presentation attacks. In addition, we provide a theoretical analysis demonstrating the effectiveness of the proposed anomalous cues.
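
As a rough illustration of the anomalous-cue idea in the abstract, the sketch below computes the difference between an input face and its generated "real" version and turns it into a spatial attention map. It is not the authors' implementation. The DFG output is passed in as a plain array (simulated by hand in the demo), the function names `anomalous_cue`, `attention_from_cue`, and `apply_attention` are hypothetical, and the absolute difference, channel averaging, and max-normalization are assumptions, since the abstract only speaks of "the difference".

```python
import numpy as np


def anomalous_cue(face, generated_real):
    """Per-pixel absolute difference between the input and its generated
    'real' version. Both arrays are assumed to have shape (H, W, C)."""
    face = np.asarray(face, dtype=np.float64)
    generated_real = np.asarray(generated_real, dtype=np.float64)
    if face.shape != generated_real.shape:
        raise ValueError("input and generated face must have the same shape")
    return np.abs(face - generated_real)


def attention_from_cue(cue, eps=1e-8):
    """Average the cue over channels and rescale by its maximum.
    An all-zero cue (input already looks real) gives zero attention."""
    saliency = cue.mean(axis=-1)  # shape (H, W)
    return saliency / (saliency.max() + eps)


def apply_attention(features, attention):
    """Weight an (H, W, C) feature map by an (H, W) attention map."""
    return features * attention[..., None]


# Toy demo: a flat gray "real" face, and a spoofed input that differs from it
# only inside a small patch. The stand-in for the generator output is the
# clean face itself (assumption: an ideal generator would remove the patch).
clean_face = np.full((8, 8, 3), 0.5)
spoof_input = clean_face.copy()
spoof_input[2:4, 2:4, :] += 0.4  # simulated spoof artifact

cue = anomalous_cue(spoof_input, clean_face)
attention = attention_from_cue(cue)
weighted = apply_attention(np.ones((8, 8, 4)), attention)
```

In this toy setting the attention map is close to 1 inside the altered patch and exactly 0 elsewhere, which mirrors the intuition that the cue highlights regions where the input departs from what a real face should look like. In the paper, a learned attention network plays this role rather than a fixed normalization.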
