V-MAD: Video-based Morphing Attack Detection in Operational Scenarios
Guido Borghi, Annalisa Franco, Nicolò Di Domenico, Matteo Ferrara, Davide Maltoni
TL;DR
The paper introduces Video-based Morphing Attack Detection (V-MAD) to exploit sequences of frames in operational settings like border control gates, showing that multi-frame information improves robustness to pose and illumination variability. It adapts existing Differential MAD (D-MAD) models to the V-MAD task by computing per-frame scores and fusing them through simple or quality-aware strategies, and explores machine learning fusion with an SVM to further enhance performance. A new real-world operational database is collected to evaluate V-MAD under realistic conditions, including various face image quality metrics (MagFace, CR-FIQA, SER-FIQ) and ISO components (illumination, defocus, pose). Key findings indicate that average/median frame fusion often surpasses single-frame D-MAD, quality information yields additional gains, and ML-based fusion can outperform traditional fusion, though an oracle-like bound remains distant, underscoring the need for more robust V-MAD methods and datasets. Overall, V-MAD represents a significant advancement toward robust morphing attack detection in real-world video acquisition scenarios, with practical implications for ABC gate security and biometric verification systems.
Abstract
In response to the rising threat of the face morphing attack, this paper introduces and explores the potential of Video-based Morphing Attack Detection (V-MAD) systems in real-world operational scenarios. While current morphing attack detection methods primarily focus on a single or a pair of images, V-MAD is based on video sequences, exploiting the video streams often acquired by face verification tools available, for instance, at airport gates. Through this study, we show for the first time the advantages that the availability of multiple probe frames can bring to the morphing attack detection task, especially in scenarios where the quality of probe images is varied and might be affected, for instance, by pose or illumination variations. Experimental results on a real operational database demonstrate that video sequences represent valuable information for increasing the robustness and performance of morphing attack detection systems.
