Revealing Unintentional Information Leakage in Low-Dimensional Facial Portrait Representations
Kathleen Anderson, Thomas Martinetz
TL;DR
This work analyzes unintentional information leakage from low-dimensional portrait attribute vectors produced by encoders under blackbox access, focusing on $f \in \mathbb{R}^{32}$ or $\mathbb{R}^{40}$. It introduces a reconstruction pipeline that couples a frozen StyleGAN generator with a trainable pre-generator $P$ to map $f$ into the generator’s input, optimizing a loss that combines pixel similarity and FaceNet embedding similarity, plus a distribution term to regularize the latent input. A key innovation is style mixing across StyleGAN layers, enabling ensemble outputs trained on different losses to be fused into distinct generator blocks for improved identity fidelity. Experiments on FFHQ and CelebA demonstrate that recognizable identity and other portrait attributes can be recovered (user studies show up to ~79–80% recognition), highlighting significant privacy risks in seemingly abstract feature representations and the need for cautious handling of such outputs.
Abstract
We evaluate the information that can unintentionally leak into the low dimensional output of a neural network, by reconstructing an input image from a 40- or 32-element feature vector that intends to only describe abstract attributes of a facial portrait. The reconstruction uses blackbox-access to the image encoder which generates the feature vector. Other than previous work, we leverage recent knowledge about image generation and facial similarity, implementing a method that outperforms the current state-of-the-art. Our strategy uses a pretrained StyleGAN and a new loss function that compares the perceptual similarity of portraits by mapping them into the latent space of a FaceNet embedding. Additionally, we present a new technique that fuses the output of an ensemble, to deliberately generate specific aspects of the recreated image.
