Visual Privacy Auditing with Diffusion Models
Kristian Schwethelm, Johannes Kaiser, Moritz Knolle, Sarah Lockfisch, Daniel Rueckert, Alexander Ziller
TL;DR
This work addresses the gap between worst-case differential privacy guarantees and practical reconstruction risk by introducing a diffusion-model (DM) based attack that leverages realistic image priors. The authors show that real-world priors can substantially increase reconstruction success under DP-SGD, challenging existing bounds like $(0,\gamma)$-ReRo and the Ziller bounds that assume limited or no priors. They propose using diffusion models as a visual auditing tool to characterize and communicate privacy leakage to non-technical stakeholders, while also establishing a practical reconstruction pipeline that starts from privatized observations and refines the attack with DM post-processing under a DDIM-style data-consistent process. The findings highlight a notable dependence of reconstruction quality on data priors and distribution shift, suggesting that current theoretical guarantees may over- or under-estimate risk in realistic settings and motivating the development of priors-aware privacy metrics. Overall, the work contributes both a powerful auditing approach and important empirical insights for guiding DP parameter choices and defenses in vision applications.
Abstract
Data reconstruction attacks on machine learning models pose a substantial threat to privacy, potentially leaking sensitive information. Although defending against such attacks using differential privacy (DP) provides theoretical guarantees, determining appropriate DP parameters remains challenging. Current formal guarantees on the success of data reconstruction suffer from overly stringent assumptions regarding adversary knowledge about the target data, particularly in the image domain, raising questions about their real-world applicability. In this work, we empirically investigate this discrepancy by introducing a reconstruction attack based on diffusion models (DMs) that only assumes adversary access to real-world image priors and specifically targets the DP defense. We find that (1) real-world data priors significantly influence reconstruction success, (2) current reconstruction bounds do not model the risk posed by data priors well, and (3) DMs can serve as heuristic auditing tools for visualizing privacy leakage.
