Towards Measuring Membership Privacy

Yunhui Long; Vincent Bindschaedler; Carl A. Gunter

Towards Measuring Membership Privacy

Yunhui Long, Vincent Bindschaedler, Carl A. Gunter

TL;DR

The paper introduces Differential Training Privacy (DTP) as an empirical, per-record privacy risk metric for membership inference in classifiers released via public interfaces. It also defines PDTP, a computable lower bound, and studies how training stability can bound DTP, enabling practical risk assessment when differential privacy is not feasible. Through case studies on NN, NB, and LR across the Adult and Purchase datasets, the authors demonstrate a strong correlation between PDTP and membership attack success, and propose the DTP-1 rule as a publishing guideline. While not a DP guarantee, DTP provides a practical framework for evaluating privacy risks and informs defensive decisions, including potential record-level data curation and further theoretical work on indirect attacks and stability.

Abstract

Machine learning models are increasingly made available to the masses through public query interfaces. Recent academic work has demonstrated that malicious users who can query such models are able to infer sensitive information about records within the training data. Differential privacy can thwart such attacks, but not all models can be readily trained to achieve this guarantee or to achieve it with acceptable utility loss. As a result, if a model is trained without differential privacy guarantee, little is known or can be said about the privacy risk of releasing it. In this work, we investigate and analyze membership attacks to understand why and how they succeed. Based on this understanding, we propose Differential Training Privacy (DTP), an empirical metric to estimate the privacy risk of publishing a classier when methods such as differential privacy cannot be applied. DTP is a measure of a classier with respect to its training dataset, and we show that calculating DTP is efficient in many practical cases. We empirically validate DTP using state-of-the-art machine learning models such as neural networks trained on real-world datasets. Our results show that DTP is highly predictive of the success of membership attacks and therefore reducing DTP also reduces the privacy risk. We advocate for DTP to be used as part of the decision-making process when considering publishing a classifier. To this end, we also suggest adopting the DTP-1 hypothesis: if a classifier has a DTP value above 1, it should not be published.

Towards Measuring Membership Privacy

TL;DR

Abstract

Towards Measuring Membership Privacy

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (13)