Fairness-Aware Dense Subgraph Discovery
Emmanouil Kariotakis, Nicholas D. Sidiropoulos, Aritra Konar
TL;DR
This paper tackles fairness-aware densest subgraph discovery by introducing two tractable formulations, FADSG-I and FADSG-II, that incorporate two distinct regularizers to trade off subgraph density against representation of a protected vertex group. By enabling a regularization path over the parameter λ, the authors can trace Pareto-optimal frontiers between density and fairness and compute the price of fairness (PoF) for targeted representations. They establish polynomial-time solvability (via max-flow constructions) and adopt a scalable Super-Greedy++ based algorithm to approximate solutions, enabling practical analysis on large graphs. Experimental results across real-world datasets show that FADSG-I often achieves lower PoF than prior methods while providing multiple fairness levels, and FADSG-II offers flexible control over protected-vertex inclusion with competitive density. Collectively, the work advances fair subgraph discovery by delivering tractable, tunable formulations and rigorous trade-off analysis, with potential impact on diverse social and network applications.
Abstract
Dense subgraph discovery (DSD) is a key graph mining primitive with myriad applications including finding densely connected communities which are diverse in their vertex composition. In such a context, it is desirable to extract a dense subgraph that provides fair representation of the diverse subgroups that constitute the vertex set while incurring a small loss in terms of subgraph density. Existing methods for promoting fairness in DSD have important limitations - the associated formulations are NP-hard in the worst case and they do not provide flexible notions of fairness, making it non-trivial to analyze the inherent trade-off between density and fairness. In this paper, we introduce two tractable formulations for fair DSD, each offering a different notion of fairness. Our methods provide a structured and flexible approach to incorporate fairness, accommodating varying fairness levels. We introduce the fairness-induced relative loss in subgraph density as a price of fairness measure to quantify the associated trade-off. We are the first to study such a notion in the context of detecting fair dense subgraphs. Extensive experiments on real-world datasets demonstrate that our methods not only match but frequently outperform existing solutions, sometimes incurring even less than half the subgraph density loss compared to prior art, while achieving the target fairness levels. Importantly, they excel in scenarios that previous methods fail to adequately handle, i.e., those with extreme subgroup imbalances, highlighting their effectiveness in extracting fair and dense solutions.
