Weakly Supervised Training for Hologram Verification in Identity Documents
Glen Pouliquen, Guillaume Chiron, Joseph Chazalon, Thierry Géraud, Ahmad Montaser Awal
TL;DR
This work tackles remote verification of Optically Variable Devices (OVDs) in identity documents captured by smartphones. It introduces a weakly supervised contrastive learning framework that uses a triplet loss with a margin $m=1$ and distance $d(x,y)=||x-y||_2$, enabling learning without per-frame labels and producing frame embeddings whose cosine similarities are used for a final Original/Attack decision. The approach is evaluated on MIDV-Holo and MIDV-2020 with ROI-focused regions and cross-validation, achieving leading performance on MIDV-Holo and robust attack detection, including photo replacement attacks. The authors provide an open-source baseline for comparison, extend the MIDV-Holo dataset protocol, and demonstrate the method’s generalization across datasets and backbones, underscoring the potential for scalable, data-efficient remote identity verification on commodity smartphones.
Abstract
We propose a method to remotely verify the authenticity of Optically Variable Devices (OVDs), often referred to as ``holograms'', in identity documents. Our method processes video clips captured with smartphones under common lighting conditions, and is evaluated on two public datasets: MIDV-HOLO and MIDV-2020. Thanks to a weakly-supervised training, we optimize a feature extraction and decision pipeline which achieves a new leading performance on MIDV-HOLO, while maintaining a high recall on documents from MIDV-2020 used as attack samples. It is also the first method, to date, to effectively address the photo replacement attack task, and can be trained on either genuine samples, attack samples, or both for increased performance. By enabling to verify OVD shapes and dynamics with very little supervision, this work opens the way towards the use of massive amounts of unlabeled data to build robust remote identity document verification systems on commodity smartphones. Code is available at https://github.com/EPITAResearchLab/pouliquen.24.icdar
