Local Node Differential Privacy
Sofya Raskhodnikova, Adam Smith, Connor Wagaman, Anatoly Zavyalov
TL;DR
This work initiates the study of Local Node Differential Privacy (LNDP*) for graphs, modeling each node as a party that privately reveals its incident edges to an untrusted server. The authors introduce the blurry degree distribution framework, which privately captures the graph's degree distribution with low sensitivity, enabling accurate, private linear queries via the factorization mechanism. They show that LNDP* can achieve central-model–like accuracy for several statistics (e.g., clique size, ER graph parameter, and certain edge counts) while proving tight lower bounds that reveal fundamental limits and a separation from the central model for some problems. The paper also develops novel lower-bound tools tailored to overlapping graph inputs (the splicing method) and structural results distinguishing pure versus approximate LNDP*, as well as unrestricted versus degrees-only LNDP*, with implications for interactive versus noninteractive settings. Overall, the work provides a foundational framework, near-optimal algorithms, and deep insights into the power and limitations of LNDP* for graph data, laying out open questions for finer separations and interaction effects.
Abstract
We initiate an investigation of node differential privacy for graphs in the local model of private data analysis. In our model, dubbed LNDP, each node sees its own edge list and releases the output of a local randomizer on this input. These outputs are aggregated by an untrusted server to obtain a final output. We develop a novel algorithmic framework for this setting that allows us to accurately answer arbitrary linear queries on a blurry approximation of the input graph's degree distribution. For some natural problems, the resulting algorithms match the accuracy achievable with node privacy in the central model, where data are held and processed by a trusted server. We also prove lower bounds on the error required by LNDP that imply the optimality of our algorithms for several fundamental graph statistics. We then lift these lower bounds to the interactive LNDP setting, demonstrating the optimality of our algorithms even when constantly many rounds of interaction are permitted. Obtaining our lower bounds requires new approaches, since those developed for the usual local model do not apply to the inherently overlapping inputs that arise from graphs. Finally, we prove structural results that reveal qualitative differences between local node privacy and the standard local model for tabular data.
