Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing
Zhe Kong, Wentian Zhang, Tao Wang, Kaihao Zhang, Yuexiang Li, Xiaoying Tang, Wenhan Luo
TL;DR
This work tackles the limited cross-domain generalization of face anti-spoofing by introducing DTDA, a framework that combines a domain adversarial attack to align domains with unstable DAT training and a dual-teacher knowledge distillation scheme that injects perceptual and generative face priors. The model trains in a multi-task setting, leveraging teachers pretrained on large-scale face tasks and using adversarial inputs for both student and teachers to maximize shared priors. Across cross-dataset and intra-dataset evaluations on public benchmarks, DTDA consistently outperforms state-of-the-art methods while maintaining efficiency (e.g., 95 FPS with ResNet-18 backbone). Ablation and visualization studies corroborate the benefits of combining DAA with dual teachers, demonstrating improved domain-invariant feature learning and more discriminative guidance from priors for live/spoof classification.
Abstract
Face recognition systems have raised concerns due to their vulnerability to different presentation attacks, and system security has become an increasingly critical concern. Although many face anti-spoofing (FAS) methods perform well in intra-dataset scenarios, their generalization remains a challenge. To address this issue, some methods adopt domain adversarial training (DAT) to extract domain-invariant features. However, the competition between the encoder and the domain discriminator can cause the network to be difficult to train and converge. In this paper, we propose a domain adversarial attack (DAA) method to mitigate the training instability problem by adding perturbations to the input images, which makes them indistinguishable across domains and enables domain alignment. Moreover, since models trained on limited data and types of attacks cannot generalize well to unknown attacks, we propose a dual perceptual and generative knowledge distillation framework for face anti-spoofing that utilizes pre-trained face-related models containing rich face priors. Specifically, we adopt two different face-related models as teachers to transfer knowledge to the target student model. The pre-trained teacher models are not from the task of face anti-spoofing but from perceptual and generative tasks, respectively, which implicitly augment the data. By combining both DAA and dual-teacher knowledge distillation, we develop a dual teacher knowledge distillation with domain alignment framework (DTDA) for face anti-spoofing. The advantage of our proposed method has been verified through extensive ablation studies and comparison with state-of-the-art methods on public datasets across multiple protocols.
