Private Synthetic Graph Generation and Fused Gromov-Wasserstein Distance
Leoni Carla Wirth, Gholamali Aminian, Gesine Reinert
TL;DR
This work tackles private synthetic graph generation from complex data by introducing PSGG, which jointly outputs a true attributed graph and an ε-DP synthetic graph under a random connection model. It leverages TV-PSMM to produce a DP attribute measure hat{ν} and uses the fused Gromov-Wasserstein distance to quantify utility between structured graph distributions. The authors provide two rigorous utility guarantees: (i) an accuracy bound in FGW distance between the true and private graphs, and (ii) a distributional bound between the respective graph-generating processes, with rates that depend on the ambient dimension d and privacy level ε. The framework accommodates extensions to directed/weighted graphs and (ε,δ)-DP, highlighting practical implications for privacy-preserving graph data sharing and method development.
Abstract
Networks are popular for representing complex data. In particular, differentially private synthetic networks are much in demand for method and algorithm development. The network generator should be easy to implement and should come with theoretical guarantees. Here we start with complex data as input and jointly provide a network representation as well as a synthetic network generator. Using a random connection model, we devise an effective algorithmic approach for generating attributed synthetic graphs which is $ε$-differentially private at the vertex level, while preserving utility under an appropriate notion of distance which we develop. We provide theoretical guarantees for the accuracy of the private synthetic graphs using the fused Gromov-Wasserstein distance, which extends the Wasserstein metric to structured data. Our method draws inspiration from the PSMM method of \citet{he2023}.
