Table of Contents
Fetching ...

PGB: Benchmarking Differentially Private Synthetic Graph Generation Algorithms

Shang Liu, Hao Du, Yang Cao, Bo Yan, Jinfei Liu, Masatoshi Yoshikawa

TL;DR

The paper tackles the lack of a fair, unified framework for evaluating differentially private synthetic graph generation methods. It introduces Private Graph Benchmark (PGB), which structures evaluation around a 4-tuple of elements (mechanisms, datasets, privacy requirements, utilities) and provides a concrete instantiation with open-source implementation and a large-scale empirical study. The study reveals that while some algorithms perform well in certain settings, there is no universally best method; performance varies with graph size, topology, and privacy budget. The work offers design principles, a reproducible platform, and practical guidelines to help researchers choose mechanisms for various scenarios, accelerating progress in private graph analytics. The insights underscore the importance of standardized evaluation to compare privacy-utility trade-offs and guide future algorithm development.

Abstract

Differentially private graph analysis is a powerful tool for deriving insights from diverse graph data while protecting individual information. Designing private analytic algorithms for different graph queries often requires starting from scratch. In contrast, differentially private synthetic graph generation offers a general paradigm that supports one-time generation for multiple queries. Although a rich set of differentially private graph generation algorithms has been proposed, comparing them effectively remains challenging due to various factors, including differing privacy definitions, diverse graph datasets, varied privacy requirements, and multiple utility metrics. To this end, we propose PGB (Private Graph Benchmark), a comprehensive benchmark designed to enable researchers to compare differentially private graph generation algorithms fairly. We begin by identifying four essential elements of existing works as a 4-tuple: mechanisms, graph datasets, privacy requirements, and utility metrics. We discuss principles regarding these elements to ensure the comprehensiveness of a benchmark. Next, we present a benchmark instantiation that adheres to all principles, establishing a new method to evaluate existing and newly proposed graph generation algorithms. Through extensive theoretical and empirical analysis, we gain valuable insights into the strengths and weaknesses of prior algorithms. Our results indicate that there is no universal solution for all possible cases. Finally, we provide guidelines to help researchers select appropriate mechanisms for various scenarios.

PGB: Benchmarking Differentially Private Synthetic Graph Generation Algorithms

TL;DR

The paper tackles the lack of a fair, unified framework for evaluating differentially private synthetic graph generation methods. It introduces Private Graph Benchmark (PGB), which structures evaluation around a 4-tuple of elements (mechanisms, datasets, privacy requirements, utilities) and provides a concrete instantiation with open-source implementation and a large-scale empirical study. The study reveals that while some algorithms perform well in certain settings, there is no universally best method; performance varies with graph size, topology, and privacy budget. The work offers design principles, a reproducible platform, and practical guidelines to help researchers choose mechanisms for various scenarios, accelerating progress in private graph analytics. The insights underscore the importance of standardized evaluation to compare privacy-utility trade-offs and guide future algorithm development.

Abstract

Differentially private graph analysis is a powerful tool for deriving insights from diverse graph data while protecting individual information. Designing private analytic algorithms for different graph queries often requires starting from scratch. In contrast, differentially private synthetic graph generation offers a general paradigm that supports one-time generation for multiple queries. Although a rich set of differentially private graph generation algorithms has been proposed, comparing them effectively remains challenging due to various factors, including differing privacy definitions, diverse graph datasets, varied privacy requirements, and multiple utility metrics. To this end, we propose PGB (Private Graph Benchmark), a comprehensive benchmark designed to enable researchers to compare differentially private graph generation algorithms fairly. We begin by identifying four essential elements of existing works as a 4-tuple: mechanisms, graph datasets, privacy requirements, and utility metrics. We discuss principles regarding these elements to ensure the comprehensiveness of a benchmark. Next, we present a benchmark instantiation that adheres to all principles, establishing a new method to evaluate existing and newly proposed graph generation algorithms. Through extensive theoretical and empirical analysis, we gain valuable insights into the strengths and weaknesses of prior algorithms. Our results indicate that there is no universal solution for all possible cases. Finally, we provide guidelines to help researchers select appropriate mechanisms for various scenarios.
Paper Structure (37 sections, 6 equations, 7 figures, 12 tables)

This paper contains 37 sections, 6 equations, 7 figures, 12 tables.

Figures (7)

  • Figure 1: The common steps for differentially private graph generation algorithms: Representation, Perturbation, and Construction.
  • Figure 2: End-to-end comparison of algorithms under different graph datasets, privacy budget $\varepsilon$ and qureies.
  • Figure 3: Degree Distribution of TmF.
  • Figure 4: Community detection of TmF.
  • Figure 5: Degree Distribution of PrivSKG.
  • ...and 2 more figures

Theorems & Definitions (15)

  • Remark 1
  • Definition 1: Differential Privacy dwork2014algorithmic
  • Definition 2: Node CDP hay2009accurate
  • Definition 3: Edge CDP raskhodnikova2016differentially
  • Definition 4: Edge LDP qin2017generating
  • Remark 2
  • Remark 3
  • Remark 4
  • Definition 5
  • Remark 5
  • ...and 5 more