Impact of Dataset Properties on Membership Inference Vulnerability of Deep Transfer Learning
Marlon Tobaben, Hibiki Ito, Joonas Jälkö, Yuan He, Antti Honkela
TL;DR
This work reveals a power-law relationship between membership inference vulnerability and the number of examples per class, $S$, in deep transfer learning when attacking a fine-tuned Vision Transformer head with LiRA. It develops a theoretical framework using a simplified fine-tuning model and a high-dimensional sphere-based model to derive a closed-form form for per-example and average-case vulnerability, showing $\log(\textsc{tpr}-\textsc{fpr})$ scales as $-\tfrac{1}{2}\log S$ plus data-dependent terms. The authors corroborate the theory with extensive experiments across diverse datasets and backbones, demonstrate a practical dataset-vulnerability predictor, and analyze per-sample vulnerability, illustrating that large $S$ can substantially mitigate risk but may still be insufficient to meet DP-style guarantees unless data requirements are extreme. They also map MIA vulnerabilities to DP bounds to illustrate the practical gap between empirical privacy risk and formal guarantees, highlighting the importance of DP in settings where strong protection is required. Overall, the work provides a quantitative link between dataset properties and privacy risk under non-DP transfer learning, with implications for privacy budgeting and data collection strategies in sensitive applications.
Abstract
Membership inference attacks (MIAs) are used to test practical privacy of machine learning models. MIAs complement formal guarantees from differential privacy (DP) under a more realistic adversary model. We analyse MIA vulnerability of fine-tuned neural networks both empirically and theoretically, the latter using a simplified model of fine-tuning. We show that the vulnerability of non-DP models when measured as the attacker advantage at a fixed false positive rate reduces according to a simple power law as the number of examples per class increases. A similar power-law applies even for the most vulnerable points, but the dataset size needed for adequate protection of the most vulnerable points is very large.
