Estimating Fair Graphs from Graph-Stationary Data
Madeline Navarro, Andrei Buciulea, Samuel Rey, Antonio G. Marques, Santiago Segarra
TL;DR
This work tackles the problem of inferring graphs from graph-stationary nodal data while enforcing unbiased edge connections with respect to sensitive attributes. It introduces two fairness notions—group fairness and individual fairness—for dyadic connectivity, and defines spectral-domain bias metrics $R_G(S)$ and $R_N(S)$ to quantify topological bias. The authors propose FairSpectralTemplatel (FairSpecTemp) with two variants: a convex-relaxation approach enforcing commutativity with the sample covariance and a bias constraint, and a shared-eigenbasis variant that implicitly promotes fairness via spectral alignment; both come with high-probability performance guarantees that reveal a conditional fairness-accuracy tradeoff. Empirical results on synthetic and real datasets, including financial investing scenarios, demonstrate that imposing fairness can reduce bias without sacrificing accuracy when the target graph is fair, and that the two FairSpecTemp variants offer complementary strengths depending on sample size and bias level.
Abstract
We estimate fair graphs from graph-stationary nodal observations such that connections are not biased with respect to sensitive attributes. Edges in real-world graphs often exhibit preferences for connecting certain pairs of groups. Biased connections can not only exacerbate but even induce unfair treatment for downstream graph-based tasks. We therefore consider group and individual fairness for graphs corresponding to group- and node-level definitions, respectively. To evaluate the fairness of a given graph, we provide multiple bias metrics, including novel measurements in the spectral domain. Furthermore, we propose Fair Spectral Templates (FairSpecTemp), an optimization-based method with two variants for estimating fair graphs from stationary graph signals, a general model for graph data subsuming many existing ones. One variant of FairSpecTemp exploits commutativity properties of graph stationarity while directly constraining bias, while the other implicitly encourages fair estimates by restricting bias in the graph spectrum and is thus more flexible. Our methods enjoy high probability performance bounds, yielding a conditional tradeoff between fairness and accuracy. In particular, our analysis reveals that accuracy need not be sacrificed to recover fair graphs. We evaluate FairSpecTemp on synthetic and real-world data sets to illustrate its effectiveness and highlight the advantages of both variants of FairSpecTemp.
