PrAda-GAN: A Private Adaptive Generative Adversarial Network with Bayes Network Structure
Ke Jia, Yuheng Ma, Yang Li, Feifei Wang
TL;DR
PrAda-GAN addresses differentially private (DP) tabular data synthesis by combining GAN training with an adaptively learned Bayes network structure, using a sequential autoregressive generator to capture dependencies among variables. A group-lasso penalty recovers a sparse conditional dependency graph, and exploiting this sparsity improves convergence and generalization under privacy constraints. Theoretical results bound the parameter estimation error, the variable selection error, and the Wasserstein distance, with sharper rates when the Bayes network is sparse. Empirically, PrAda-GAN outperforms baselines in distributional similarity and downstream ML tasks on synthetic and real datasets, particularly under low privacy budgets. The approach is a scalable DP method for continuous domains that adapts automatically to underlying low-dimensional structure while preserving utility.
Abstract
We revisit the problem of generating synthetic data under differential privacy. To address the core limitations of marginal-based methods, we propose the Private Adaptive Generative Adversarial Network with Bayes Network Structure (PrAda-GAN), which integrates the strengths of both GAN-based and marginal-based approaches. Our method adopts a sequential generator architecture to capture complex dependencies among variables, while adaptively regularizing the learned structure to promote sparsity in the underlying Bayes network. Theoretically, we establish diminishing bounds on the parameter distance, variable selection error, and Wasserstein distance. Our analysis shows that leveraging dependency sparsity leads to significant improvements in convergence rates. Empirically, experiments on both synthetic and real-world datasets demonstrate that PrAda-GAN outperforms existing tabular data synthesis methods in terms of the privacy-utility trade-off.
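
To make the sparsity regularization concrete, the following is a minimal sketch of a group-lasso penalty over candidate parents in a sequential generator. It is an illustration under stated assumptions, not the paper's implementation: it assumes the sub-generator for variable j has a first-layer weight matrix with one column per earlier variable, treats each column as one group, and omits the GAN training and the privacy mechanism entirely. The names `column_norms`, `group_lasso_penalty`, `selected_parents`, and `example_weights` are hypothetical.

```python
import math

def column_norms(weight_rows):
    """L2 norm of each column; one column = one candidate parent (assumed layout)."""
    return [math.sqrt(sum(w * w for w in column)) for column in zip(*weight_rows)]

def group_lasso_penalty(first_layer_weights):
    """Sum of column-wise L2 norms across all per-variable sub-generators."""
    return sum(sum(column_norms(rows)) for rows in first_layer_weights)

def selected_parents(first_layer_weights, tol=1e-8):
    """Indices of earlier variables whose weight group is not (numerically) zero."""
    return [
        [k for k, norm in enumerate(column_norms(rows)) if norm > tol]
        for rows in first_layer_weights
    ]

# Toy weights for three variables and four hidden units (illustrative values).
example_weights = [
    [[], [], [], []],                                      # variable 0: no candidate parents
    [[1.0], [1.0], [1.0], [1.0]],                          # variable 1: candidate parent 0
    [[0.0, 3.0], [0.0, 4.0], [0.0, 0.0], [0.0, 0.0]],      # variable 2: candidates 0 and 1
]

penalty = group_lasso_penalty(example_weights)
parents = selected_parents(example_weights)
```

In this toy example the group for parent 0 of variable 2 is exactly zero, so the recovered graph keeps only the edges 0 → 1 and 1 → 2. Penalizing whole columns rather than individual weights is what lets an entire dependency be dropped at once.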
