FairWire: Fair Graph Generation

O. Deniz Kose; Yanning Shen

FairWire: Fair Graph Generation

O. Deniz Kose, Yanning Shen

TL;DR

The work addresses structural bias in graph-based learning, including in synthetic graphs generated for privacy and scalability. It develops a theory-guided fairness regularizer $\mathcal{L}_{\text{FairWire}}$ and a diffusion-based FairWire framework to mitigate bias in both link prediction and graph generation, accommodating non-binary sensitive attributes. The approach leverages a batch-aware regularizer and a diffusion-based denoiser (MPNN) guided by synthetic sensitive attributes to prevent sensitive information leakage. Empirical results on real networks and synthetic graphs show improved fairness metrics $\Delta_{\mathrm{SP}}$ and $\Delta_{\mathrm{EO}}$ with little to no sacrifice in utility, demonstrating practical impact for fair graph analytics and privacy-preserving synthetic data generation.

Abstract

Machine learning over graphs has recently attracted growing attention due to its ability to analyze and learn complex relations within critical interconnected systems. However, the disparate impact that is amplified by the use of biased graph structures in these algorithms has raised significant concerns for the deployment of them in real-world decision systems. In addition, while synthetic graph generation has become pivotal for privacy and scalability considerations, the impact of generative learning algorithms on the structural bias has not yet been investigated. Motivated by this, this work focuses on the analysis and mitigation of structural bias for both real and synthetic graphs. Specifically, we first theoretically analyze the sources of structural bias that result in disparity for the predictions of dyadic relations. To alleviate the identified bias factors, we design a novel fairness regularizer that offers a versatile use. Faced with the bias amplification in graph generation models that is brought to light in this work, we further propose a fair graph generation framework, FairWire, by leveraging our fair regularizer design in a generative model. Experimental results on real-world networks validate that the proposed tools herein deliver effective structural bias mitigation for both real and synthetic graphs.

FairWire: Fair Graph Generation

TL;DR

The work addresses structural bias in graph-based learning, including in synthetic graphs generated for privacy and scalability. It develops a theory-guided fairness regularizer

and a diffusion-based FairWire framework to mitigate bias in both link prediction and graph generation, accommodating non-binary sensitive attributes. The approach leverages a batch-aware regularizer and a diffusion-based denoiser (MPNN) guided by synthetic sensitive attributes to prevent sensitive information leakage. Empirical results on real networks and synthetic graphs show improved fairness metrics

and

with little to no sacrifice in utility, demonstrating practical impact for fair graph analytics and privacy-preserving synthetic data generation.

Abstract

Paper Structure (22 sections, 3 theorems, 25 equations, 1 figure, 7 tables)

This paper contains 22 sections, 3 theorems, 25 equations, 1 figure, 7 tables.

Introduction
Related Work
Preliminaries
Inspection and Mitigation of Structural Bias
Bias Analysis
A Regularizer for Fair Connections
Fair Graph Generation
Diffusion Models and Structural Bias
FairWire: A Fair Graph Generation Framework
Experiments
Datasets and Experimental Setup
Link Prediction Results
Results for Graph Generation
Visualization of Synthetic Graphs
Conclusion
...and 7 more sections

Key Result

Theorem 1

The disparity between the representations of nodes in a sensitive group $\mathcal{S}_{k}$ and the representations of the remaining nodes output by the $l$th GNN layer, $\delta_{h}^{(l+1)}$, can be upper bounded by: where $L$ is the Lipschitz constant of the activation function $\sigma (\cdot)$, $\left\|\mathbf{z}^{l+1}_i- {\rm mean}(\mathbf{z}^{l+1}_j \mid v_{j} \in \mathcal{V})\right\|_{\infty}

Figures (1)

Figure 1: Distribution of the intra-edges (blue) and inter-edges (red) in the synthetic graphs created for Cora dataset by GraphMaker li2023graphmaker (left) and FairWire (right).

Theorems & Definitions (3)

Theorem 1
Proposition 1
Corollary 2

FairWire: Fair Graph Generation

TL;DR

Abstract

FairWire: Fair Graph Generation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (3)