Preserving AUC Fairness in Learning with Noisy Protected Groups
Mingyang Wu, Li Lin, Wenbin Zhang, Xin Wang, Zhenhuan Yang, Shu Hu
TL;DR
This work tackles the challenge of preserving AUC fairness when protected-group labels are noisy. It introduces a distributionally robust optimization (DRO) framework that bounds the total variation distance between clean and noisy group distributions to enforce fairness in AUC across all group pairs, while directly optimizing the AUC objective. A theoretically grounded noise-ratio estimation approach using pre-trained multimodal models (e.g., CLIP) is proposed, enabling practical estimation of the TV-bound without relabeling, paired with a scalable SGDA optimizer enhanced by Sharpness-Aware Minimization. Empirical results on tabular and image benchmarks demonstrate that the proposed method achieves lower AUC fairness violations and competitive AUC across diverse noise levels and backbones, highlighting its robustness and practical impact for fairness in ranking under uncertain group information.
Abstract
The Area Under the ROC Curve (AUC) is a key metric for classification, especially under class imbalance, with growing research focus on optimizing AUC over accuracy in applications like medical image analysis and deepfake detection. This leads to fairness in AUC optimization becoming crucial as biases can impact protected groups. While various fairness mitigation techniques exist, fairness considerations in AUC optimization remain in their early stages, with most research focusing on improving AUC fairness under the assumption of clean protected groups. However, these studies often overlook the impact of noisy protected groups, leading to fairness violations in practice. To address this, we propose the first robust AUC fairness approach under noisy protected groups with fairness theoretical guarantees using distributionally robust optimization. Extensive experiments on tabular and image datasets show that our method outperforms state-of-the-art approaches in preserving AUC fairness. The code is in https://github.com/Purdue-M2/AUC_Fairness_with_Noisy_Groups.
