Confidence Aware Learning for Reliable Face Anti-spoofing
Xingming Long, Jie Zhang, Shiguang Shan
TL;DR
The paper tackles the problem that face anti-spoofing models often emit overly confident predictions for unknown attacks or unseen domains, risking security. It introduces Confidence Aware FAS (CA-FAS), which builds Gaussian distributions in feature space for live and known attacks and uses a Mahalanobis-distance framework to measure prediction confidence, with a MD-based triplet loss to jointly learn the feature extractor and distributions. Confidence is computed as $\mathcal{C}(x) = - \min_p MD{ f(x), N(mu_p, Sigma_p) }$, and decisions are made when $\mathcal{C}(x) \ge \theta_c$, otherwise samples are flagged for manual review; a quantile-based threshold adapts to test data. Across eight datasets with various unknown attacks and domain shifts, CA-FAS demonstrates improved reliability by filtering out low-confidence samples, offering a practical approach for secure deployment in real-world systems.
Abstract
Current Face Anti-spoofing (FAS) models tend to make overly confident predictions even when encountering unfamiliar scenarios or unknown presentation attacks, which leads to serious potential risks. To solve this problem, we propose a Confidence Aware Face Anti-spoofing (CA-FAS) model, which is aware of its capability boundary, thus achieving reliable liveness detection within this boundary. To enable the CA-FAS to "know what it doesn't know", we propose to estimate its confidence during the prediction of each sample. Specifically, we build Gaussian distributions for both the live faces and the known attacks. The prediction confidence for each sample is subsequently assessed using the Mahalanobis distance between the sample and the Gaussians for the "known data". We further introduce the Mahalanobis distance-based triplet mining to optimize the parameters of both the model and the constructed Gaussians as a whole. Extensive experiments show that the proposed CA-FAS can effectively recognize samples with low prediction confidence and thus achieve much more reliable performance than other FAS models by filtering out samples that are beyond its reliable range.
