Revisiting Membership Inference Under Realistic Assumptions
Bargav Jayaraman, Lingxiao Wang, Katherine Knipmeyer, Quanquan Gu, David Evans
TL;DR
This work revisits membership inference under realistic conditions by allowing imbalanced priors and threshold-adaptive adversaries. It introduces a PPV-based leakage metric and two novel attacks, Merlin and Morgan, alongside a threshold-selection procedure to tailor attacks to defender goals. Grounded in f-DP and Gaussian DP theory, the paper provides theoretical bounds and extensive empirical evaluation across four multi-class datasets, with and without differential privacy. The results show persistent privacy risks for non-private models under imbalanced priors, and that DP reduces leakage only at strong budgets, underscoring the need for prior-aware evaluation and threshold-aware defenses.
Abstract
We study membership inference in settings where some of the assumptions typically used in previous research are relaxed. First, we consider skewed priors, to cover cases such as when only a small fraction of the candidate pool targeted by the adversary are actually members and develop a PPV-based metric suitable for this setting. This setting is more realistic than the balanced prior setting typically considered by researchers. Second, we consider adversaries that select inference thresholds according to their attack goals and develop a threshold selection procedure that improves inference attacks. Since previous inference attacks fail in imbalanced prior setting, we develop a new inference attack based on the intuition that inputs corresponding to training set members will be near a local minimum in the loss function, and show that an attack that combines this with thresholds on the per-instance loss can achieve high PPV even in settings where other attacks appear to be ineffective. Code for our experiments can be found here: https://github.com/bargavj/EvaluatingDPML.
