BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing

Yingjie Ma; Zitong Yu; Xun Lin; Weicheng Xie; Linlin Shen

BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing

Yingjie Ma, Zitong Yu, Xun Lin, Weicheng Xie, Linlin Shen

TL;DR

BIG-MoE tackles generalization challenges in multimodal Face Anti-Spoofing by introducing an Isolated Gating Mechanism Adapter (IGMA) and a Convolutional Prompt Bypass (CPB) within a Mixture of Experts framework. The approach leverages fine-grained experts, noise-robust gating, and local cue-focused prompts to improve discrimination and gating stability across modalities and domains. Empirical results across four benchmarks show substantial gains in HTER and AUC, with ablations confirming the effectiveness of IGMA and CPB and their synergy. This work advances robust, cross-domain multimodal FAS with a scalable MoE-based design and provides practical guidance for generalization under limited data scenarios.

Abstract

In the domain of facial recognition security, multimodal Face Anti-Spoofing (FAS) is essential for countering presentation attacks. However, existing technologies encounter challenges due to modality biases and imbalances, as well as domain shifts. Our research introduces a Mixture of Experts (MoE) model to address these issues effectively. We identified three limitations in traditional MoE approaches to multimodal FAS: (1) Coarse-grained experts' inability to capture nuanced spoofing indicators; (2) Gated networks' susceptibility to input noise affecting decision-making; (3) MoE's sensitivity to prompt tokens leading to overfitting with conventional learning methods. To mitigate these, we propose the Bypass Isolated Gating MoE (BIG-MoE) framework, featuring: (1) Fine-grained experts for enhanced detection of subtle spoofing cues; (2) An isolation gating mechanism to counteract input noise; (3) A novel differential convolutional prompt bypass enriching the gating network with critical local features, thereby improving perceptual capabilities. Extensive experiments on four benchmark datasets demonstrate significant generalization performance improvement in multimodal FAS task. The code is released at https://github.com/murInJ/BIG-MoE.

BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing

TL;DR

Abstract

BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)