Low-Cost High-Power Membership Inference Attacks
Sajjad Zarifzadeh, Philippe Liu, Reza Shokri
TL;DR
This paper tackles the practical privacy auditing problem of membership inference attacks by introducing RMIA, a low-cost, high-power test that leverages a fine-grained null model and a Bayesian-composed likelihood-ratio framework. RMIA combines information from reference population data and pre-trained reference models to form pairwise likelihood ratios, aggregating them into a single MIA score that can calibrate to any desired false-positive rate. Empirical results across CIFAR-10/100, CINIC-10, ImageNet, and Purchase-100 show RMIA consistently dominates prior attacks, especially under limited reference models and under distribution and architecture shifts, with strong robustness to OOD non-members. The approach enables practical privacy-risk assessment and auditing, reducing computational overhead while maintaining strong leakage detection, even for large-scale models and varied data distributions.
Abstract
Membership inference attacks aim to detect if a particular data point was used in training a model. We design a novel statistical test to perform robust membership inference attacks (RMIA) with low computational overhead. We achieve this by a fine-grained modeling of the null hypothesis in our likelihood ratio tests, and effectively leveraging both reference models and reference population data samples. RMIA has superior test power compared with prior methods, throughout the TPR-FPR curve (even at extremely low FPR, as low as 0). Under computational constraints, where only a limited number of pre-trained reference models (as few as 1) are available, and also when we vary other elements of the attack (e.g., data distribution), our method performs exceptionally well, unlike prior attacks that approach random guessing. RMIA lays the groundwork for practical yet accurate data privacy risk assessment in machine learning.
