DiffMI: Breaking Face Recognition Privacy via Diffusion-Driven Training-Free Model Inversion

Hanrui Wang; Shuo Wang; Chun-Shien Lu; Isao Echizen

DiffMI: Breaking Face Recognition Privacy via Diffusion-Driven Training-Free Model Inversion

Hanrui Wang, Shuo Wang, Chun-Shien Lu, Isao Echizen

TL;DR

DiffMI leverages a fixed diffusion model to perform training-free model inversion on embedding-based face recognition, addressing limitations of prior GAN- or diffusion-based approaches in generalization and efficiency. Its three-stage latent-code pipeline—robust initialization, top-N selection, and ranked, fine-grained manipulation—coupled with a confidence-aware stopping criterion enables high identity-recovery across unseen identities and models while maintaining cross-model robustness. Empirical evaluations on multiple recognizers and datasets show strong leakage even against inversion-resistant systems, with user studies confirming perceptual identity matches. The results highlight a practical privacy risk in embedding-based frameworks and suggest the need for defenses that go beyond enhancing the training-time or architectural protections of recognition models.

Abstract

Face recognition poses serious privacy risks due to its reliance on sensitive and immutable biometric data. While modern systems mitigate privacy risks by mapping facial images to embeddings (commonly regarded as privacy-preserving), model inversion attacks reveal that identity information can still be recovered, exposing critical vulnerabilities. However, existing attacks are often computationally expensive and lack generalization, especially those requiring target-specific training. Even training-free approaches suffer from limited identity controllability, hindering faithful reconstruction of nuanced or unseen identities. In this work, we propose DiffMI, the first diffusion-driven, training-free model inversion attack. DiffMI introduces a novel pipeline combining robust latent code initialization, a ranked adversarial refinement strategy, and a statistically grounded, confidence-aware optimization objective. DiffMI applies directly to unseen target identities and face recognition models, offering greater adaptability than training-dependent approaches while significantly reducing computational overhead. Our method achieves 84.42%--92.87% attack success rates against inversion-resilient systems and outperforms the best prior training-free GAN-based approach by 4.01%--9.82%. The implementation is available at https://github.com/azrealwang/DiffMI.

DiffMI: Breaking Face Recognition Privacy via Diffusion-Driven Training-Free Model Inversion

TL;DR

Abstract

DiffMI: Breaking Face Recognition Privacy via Diffusion-Driven Training-Free Model Inversion

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (14)