On the Fairness of Privacy Protection: Measuring and Mitigating the Disparity of Group Privacy Risks for Differentially Private Machine Learning
Zhi Yang, Changwu Huang, Ke Tang, Xin Yao
TL;DR
This work investigates whether differential privacy protections are equitably distributed across demographic groups in ML models. It identifies that average-case membership inference attacks can understate inter-group privacy disparities and introduces an efficient approximate worst-case MIG, PA-ALOOA, to audit per-sample privacy risk and derive a group privacy risk parity metric (GPRP). The authors then exploit a canary-inspired insight to design DP-SGD-S, an adaptive per-group gradient clipping scheme that reduces group-level privacy risk disparities while preserving model utility in many settings. Across multiple datasets and privacy budgets, PA-ALOOA reveals stronger group-level privacy risks than prior auditing methods, and DP-SGD-S demonstrably lowers the disparity parameter $\Delta$ with acceptable accuracy trade-offs. The results support the practical feasibility of fair privacy protection in DPML and highlight future directions for balancing outcome and privacy fairness.
Abstract
While significant progress has been made in conventional fairness-aware machine learning (ML) and differentially private ML (DPML), the fairness of privacy protection across groups remains underexplored. Existing studies have proposed methods to assess group privacy risks, but these are based on the average-case privacy risks of data records. Such approaches may underestimate the group privacy risks, thereby potentially underestimating the disparity across group privacy risks. Moreover, the current method for assessing the worst-case privacy risks of data records is time-consuming, limiting their practical applicability. To address these limitations, we introduce a novel membership inference game that can efficiently audit the approximate worst-case privacy risks of data records. Experimental results demonstrate that our method provides a more stringent measurement of group privacy risks, yielding a reliable assessment of the disparity in group privacy risks. Furthermore, to promote privacy protection fairness in DPML, we enhance the standard DP-SGD algorithm with an adaptive group-specific gradient clipping strategy, inspired by the design of canaries in differential privacy auditing studies. Extensive experiments confirm that our algorithm effectively reduces the disparity in group privacy risks, thereby enhancing the fairness of privacy protection in DPML.
