VIGFace: Virtual Identity Generation for Privacy-Free Face Recognition
Minsoo Kim, Min-Cheol Sagong, Gi Pyo Nam, Junghyun Cho, Ig-Jae Kim
TL;DR
VIGFace tackles privacy concerns in face recognition by pre-assigning virtual identities in the FR feature space and training a diffusion-based generator to produce authentic-looking faces conditioned on five-point landmarks. By keeping virtual prototypes orthogonal to real ones and jointly optimizing with ArcFace losses, the method achieves strong separability while preventing identity leakage. The approach demonstrates that synthetic, privacy-free data can substitute real datasets and also serve as a valuable augmentation to boost FR performance, attaining state-of-the-art results on multiple benchmarks. The work further provides a practical dataset release of virtual identities to help mitigate portrait-right issues in FR research.
Abstract
Deep learning-based face recognition continues to face challenges due to its reliance on huge datasets obtained from web crawling, which can be costly to gather and raise significant real-world privacy concerns. To address this issue, we propose VIGFace, a novel framework capable of generating synthetic facial images. Our idea originates from pre-assigning virtual identities in the feature space. Initially, we train the face recognition model using a real face dataset and create a feature space for both real and virtual identities, where virtual prototypes are orthogonal to other prototypes. Subsequently, we train the diffusion model based on the established feature space, enabling it to generate authentic human face images from real prototypes and synthesize virtual face images from virtual prototypes. Our proposed framework provides two significant benefits. Firstly, it shows clear separability between existing individuals and virtual face images, allowing one to create synthetic images with confidence and without concerns about privacy and portrait rights. Secondly, it ensures improved performance through data augmentation by incorporating real existing images. Extensive experiments demonstrate the superiority of our virtual face dataset and framework, outperforming the previous state-of-the-art on various face recognition benchmarks.
