When Privacy Isn't Synthetic: Hidden Data Leakage in Generative AI Models

S. M. Mustaqim; Anantaa Kotal; Paul H. Yi

When Privacy Isn't Synthetic: Hidden Data Leakage in Generative AI Models

S. M. Mustaqim, Anantaa Kotal, Paul H. Yi

TL;DR

This paper reveals that privacy-preserving synthetic data can still leak information about the training data through distributional overlap, even under black-box access. It introduces the Cluster–Medoid Leakage Attack (CMLA), a model-agnostic framework that samples synthetic outputs, encodes them into a shared space, clusters them, and extracts medoids to measure proximity to real data via $d_{ ext{min}}$, ASR(τ), and Cov(τ). Empirical results across healthcare, finance, and other sensitive domains show leakage patterns across a range of generative models, including those with differential privacy, challenging the assumption that synthetic data fully protects privacy. The work provides practical privacy-auditing tools and argues for stronger guarantees that address neighborhood-level leakage, not just memorization, with publicly available code to foster adoption and evaluation.

Abstract

Generative models are increasingly used to produce privacy-preserving synthetic data as a safe alternative to sharing sensitive training datasets. However, we demonstrate that such synthetic releases can still leak information about the underlying training samples through structural overlap in the data manifold. We propose a black-box membership inference attack that exploits this vulnerability without requiring access to model internals or real data. The attacker repeatedly queries the generative model to obtain large numbers of synthetic samples, performs unsupervised clustering to identify dense regions of the synthetic distribution, and then analyzes cluster medoids and neighborhoods that correspond to high-density regions in the original training data. These neighborhoods act as proxies for training samples, enabling the adversary to infer membership or reconstruct approximate records. Our experiments across healthcare, finance, and other sensitive domains show that cluster overlap between real and synthetic data leads to measurable membership leakage-even when the generator is trained with differential privacy or other noise mechanisms. The results highlight an under-explored attack surface in synthetic data generation pipelines and call for stronger privacy guarantees that account for distributional neighborhood inference rather than sample-level memorization alone, underscoring its role in privacy-preserving data publishing. Implementation and evaluation code are publicly available at:github.com/Cluster-Medoid-Leakage-Attack.

When Privacy Isn't Synthetic: Hidden Data Leakage in Generative AI Models

TL;DR

Abstract

When Privacy Isn't Synthetic: Hidden Data Leakage in Generative AI Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)