A Deep Latent Factor Graph Clustering with Fairness-Utility Trade-off Perspective
Siamak Ghodsi, Amjad Seyedi, Tai Le Quy, Fariba Karimi, Eirini Ntoutsi
TL;DR
This work tackles fair graph clustering by integrating a soft demographic-balance constraint directly into an end-to-end deep nonnegative tri-factorization (DFNMF) framework. The model builds a deep hierarchical representation $\mathbf{\Psi}=\mathbf{H}_1\cdots\mathbf{H}_p$ coupled with a final interaction matrix $\mathbf{W}_p$, and optimizes a unified objective that adds a fairness penalty $\lambda \|\mathbf{F}^\top\mathbf{\Psi}\|_F^2$ to the graph reconstruction term. This yields explicit control over the utility–fairness trade-off with a single parameter $\lambda$, enabling soft, interpretable cluster memberships without post-processing. DFNMF scales near-linearly with the number of edges using CSR-based sparse operations and alternating updates, and experiments on synthetic and real networks show it often dominates state-of-the-art baselines on Pareto fronts for modularity and demographic balance, while remaining interpretable through its hierarchical factors. The approach enables robust, scalable, fairness-aware graph clustering with practical implications for diverse domains where balanced representation is essential.
Abstract
Fair graph clustering seeks partitions that respect network structure while maintaining proportional representation across sensitive groups, with applications spanning community detection, team formation, resource allocation, and social network analysis. Many existing approaches enforce rigid constraints or rely on multi-stage pipelines (e.g., spectral embedding followed by $k$-means), limiting trade-off control, interpretability, and scalability. We introduce \emph{DFNMF}, an end-to-end deep nonnegative tri-factorization tailored to graphs that directly optimizes cluster assignments with a soft statistical-parity regularizer. A single parameter $λ$ tunes the fairness--utility balance, while nonnegativity yields parts-based factors and transparent soft memberships. The optimization uses sparse-friendly alternating updates and scales near-linearly with the number of edges. Across synthetic and real networks, DFNMF achieves substantially higher group balance at comparable modularity, often dominating state-of-the-art baselines on the Pareto front. The code is available at https://github.com/SiamakGhodsi/DFNMF.git.
