Practical Bayes-Optimal Membership Inference Attacks
Marcus Lassila, Johan Östman, Khac-Hoang Ngo, Alexandre Graell i Amat
TL;DR
This paper tackles privacy leaks from membership inference attacks on both i.i.d. and graph-structured data by deriving Bayes-optimal MIA rules for node-level attacks on graph neural networks and introducing practical approximations. It presents G-BASE, a tractable Bayes-optimal MIA for graphs, and BASE, a scalable Bayes-optimal MIA for i.i.d. data, both achieving state-of-the-art or comparable performance to LiRA and RMIA with lower computational cost. A key theoretical contribution is the Bayes-optimal decision rule for graph data, which accounts for neighborhood influence in GNNs, plus sampling strategies to approximate intractable expectations. The work demonstrates that BASE is equivalent to RMIA under a specific threshold, yet BASE achieves similar results with far fewer model queries, and G-BASE delivers superior performance on larger graphs, making the framework a practical privacy auditing tool for contemporary graph and non-graph learning systems. Overall, the paper bridges theory and practice, providing principled, scalable MIAs for privacy auditing across data modalities.
Abstract
We develop practical and theoretically grounded membership inference attacks (MIAs) against both independent and identically distributed (i.i.d.) data and graph-structured data. Building on the Bayesian decision-theoretic framework of Sablayrolles et al., we derive the Bayes-optimal membership inference rule for node-level MIAs against graph neural networks, addressing key open questions about optimal query strategies in the graph setting. We introduce BASE and G-BASE, tractable approximations of the Bayes-optimal membership inference. G-BASE achieves superior performance compared to previously proposed classifier-based node-level MIA attacks. BASE, which is also applicable to non-graph data, matches or exceeds the performance of prior state-of-the-art MIAs, such as LiRA and RMIA, at a significantly lower computational cost. Finally, we show that BASE and RMIA are equivalent under a specific hyperparameter setting, providing a principled, Bayes-optimal justification for the RMIA attack.
