Privacy-preserving Optics for Enhancing Protection in Face De-identification
Jhon Lopez, Carlos Hinojosa, Henry Arguello, Bernard Ghanem
TL;DR
The paper tackles privacy in face de-identification by shifting from software-only approaches to a hardware-level solution that learns optics to hide identity while preserving geometric cues. It introduces a two-stage framework: Stage I learns a phase-mask-based optical encoder and a heatmap regressor to retain pose and landmark information, while Stage II employs a GAN conditioned on the heatmap and a public reference image to synthesize a new identity. Evaluations on FFHQ/CelebA-HQ datasets and a physical prototype demonstrate improved privacy protection against deconvolution attacks and competitive image quality, with human studies underscoring effective anonymization. Key contributions include end-to-end optics learning using a Zernike-based phase mask, a heatmap-guided geometry preservation approach, and a reference-guided, privacy-aware face synthesis pipeline that mitigates sniffing risks at capture time.
Abstract
The modern surge in camera usage alongside widespread computer vision technology applications poses significant privacy and security concerns. Current artificial intelligence (AI) technologies aid in recognizing relevant events and assisting in daily tasks in homes, offices, hospitals, etc. The need to access or process personal information for these purposes raises privacy concerns. While software-level solutions like face de-identification provide a good privacy/utility trade-off, they present vulnerabilities to sniffing attacks. In this paper, we propose a hardware-level face de-identification method to solve this vulnerability. Specifically, our approach first learns an optical encoder along with a regression model to obtain a face heatmap while hiding the face identity from the source image. We also propose an anonymization framework that generates a new face using the privacy-preserving image, face heatmap, and a reference face image from a public dataset as input. We validate our approach with extensive simulations and hardware experiments.
