Fairness without Demographics through Learning Graph of Gradients
Yingtao Luo, Zhixun Li, Qiang Liu, Jun Zhu
TL;DR
This work tackles fairness without access to sensitive demographic data by exploiting the information contained in last-layer gradients. It introduces the Graph of Gradients (GoG), a soft grouping mechanism that connects samples via $K$-nearest gradient neighbors and uses an adversarial network to weight samples during training, optimizing a Rawlsian, worst-off objective with minimal accuracy sacrifice. The authors provide theoretical justifications that gradient signals better capture demographics than input features and demonstrate, across COMPAS, BNP, and MIMIC-III, that GoG achieves superior worst-group fairness and robustness to label noise with competitive overall accuracy. The approach is scalable (linear complexity) and interpretable, offering a privacy-preserving, effective solution for real-world fairness challenges in diverse domains.
Abstract
Machine learning systems are notoriously prone to biased predictions about certain demographic groups, leading to algorithmic fairness issues. Due to privacy concerns and data quality problems, some demographic information may not be available in the training data and the complex interaction of different demographics can lead to a lot of unknown minority subpopulations, which all limit the applicability of group fairness. Many existing works on fairness without demographics assume the correlation between groups and features. However, we argue that the model gradients are also valuable for fairness without demographics. In this paper, we show that the correlation between gradients and groups can help identify and improve group fairness. With an adversarial weighting architecture, we construct a graph where samples with similar gradients are connected and learn the weights of different samples from it. Unlike the surrogate grouping methods that cluster groups from features and labels as proxy sensitive attribute, our method leverages the graph structure as a soft grouping mechanism, which is much more robust to noises. The results show that our method is robust to noise and can improve fairness significantly without decreasing the overall accuracy too much.
