Generating Differentially Private Networks with a Modified Erdős-Rényi Model
Huaiyuan Rao, Calvin Hawkins, Alexander Benvenuti, Matthew Hale
TL;DR
This work addresses privately generating entire networks under edge $\varepsilon$-differential privacy by pairing a graph exponential mechanism with a computationally efficient modified Erdős-Rényi sampler. The core ideas are to (i) formulate a utility based on edge-set closeness and (ii) implement a scalable sampling procedure that preserves privacy while enabling post-processing to compute arbitrary graph properties, notably the Laplacian spectrum. The main contributions are the formal graph exponential mechanism, an equivalent two-type edge ER implementation with $O(n^2)$ complexity, and empirical evidence that the method yields substantially lower error in graph Laplacian spectra than prior approaches (e.g., a $\approx 49.34\%$ improvement at $\varepsilon=2.50$). Overall, this framework enables privacy-preserving analysis of a broad class of graph properties without bespoke mechanisms and scales to large networks, advancing practical secure analyses in networked systems.
Abstract
Differential privacy has been used to privately calculate numerous network properties, but existing approaches often require the development of a new privacy mechanism for each property of interest. Therefore, we present a framework for generating entire networks in a differentially private way. Differential privacy is immune to post-processing, which allows for any network property to be computed and analyzed for a private output network, without weakening its protections. We consider undirected networks and develop a differential privacy mechanism that takes in a sensitive network and outputs a private network by randomizing its edge set. We prove that this mechanism does provide differential privacy to a network's edge set, though it induces a complex distribution over the space of output graphs. We then develop an equivalent privacy implementation using a modified Erdős-Rényi model that constructs an output graph edge by edge, and it is efficient and easily implementable, even on large complex networks. Experiments implement $\varepsilon$-differential privacy with $\varepsilon=2.5$ when computing graph Laplacian spectra, and these results show the proposed mechanism incurs $49.34\%$ less error than the current state of the art.
