On the Privacy Effect of Data Enhancement via the Lens of Memorization
Xiao Li, Qiongxiu Li, Zhanhao Hu, Xiaolin Hu
TL;DR
This paper reframes privacy in data-enhanced machine learning through memorization, arguing that traditional MIAs can misrepresent per-sample risk. By employing LiRA as a memorization-aligned attack, it demonstrates a weaker link between privacy leakage and generalization gap and that adversarial training increases memorization-based leakage, while stronger robustness does not inherently worsen privacy. The study provides extensive experiments across CIFAR datasets and SVHN, highlighting that data augmentation is not a universal defense against MIAs and that memorization-aligned MIAs are necessary for accurate privacy assessment. The findings advocate for a shift toward memorization-based privacy evaluation in ML systems and reveal nuanced interactions between privacy, generalization, and robustness with practical implications for secure model design.
Abstract
Machine learning poses severe privacy concerns as it has been shown that the learned models can reveal sensitive information about their training data. Many works have investigated the effect of widely adopted data augmentation and adversarial training techniques, termed data enhancement in the paper, on the privacy leakage of machine learning models. Such privacy effects are often measured by membership inference attacks (MIAs), which aim to identify whether a particular example belongs to the training set or not. We propose to investigate privacy from a new perspective called memorization. Through the lens of memorization, we find that previously deployed MIAs produce misleading results as they are less likely to identify samples with higher privacy risks as members compared to samples with low privacy risks. To solve this problem, we deploy a recent attack that can capture individual samples' memorization degrees for evaluation. Through extensive experiments, we unveil several findings about the connections between three essential properties of machine learning models, including privacy, generalization gap, and adversarial robustness. We demonstrate that the generalization gap and privacy leakage are less correlated than those of the previous results. Moreover, there is not necessarily a trade-off between adversarial robustness and privacy as stronger adversarial robustness does not make the model more susceptible to privacy attacks.
