Representation Magnitude has a Liability to Privacy Vulnerability
Xingli Fang, Jung-Eun Kim
TL;DR
The paper identifies representation magnitude in bottleneck features as a key factor in privacy leakage from membership inference attacks. It proposes the Saturn Rings Classifier Module (SRCM), a plug-in that constrains representation space using a Saturn Rings Activation Function (SR) and a Magnitude Normalized Linear Layer (LinearNorm) to align training and evaluation behavior. Empirical results across CIFAR-10/100 and Purchase100 show SRCM improves privacy protection, particularly on larger models, and can further enhance defense when combined with methods like RelaxLoss, while incurring minimal inference overhead. This work provides a practical, architecture-level strategy to reduce membership leakage without sacrificing generalization, highlighting the value of regulating representation magnitude in secure ML deployment.
Abstract
The privacy-preserving approaches to machine learning (ML) models have made substantial progress in recent years. However, it is still opaque in which circumstances and conditions the model becomes privacy-vulnerable, leading to a challenge for ML models to maintain both performance and privacy. In this paper, we first explore the disparity between member and non-member data in the representation of models under common training frameworks. We identify how the representation magnitude disparity correlates with privacy vulnerability and address how this correlation impacts privacy vulnerability. Based on the observations, we propose Saturn Ring Classifier Module (SRCM), a plug-in model-level solution to mitigate membership privacy leakage. Through a confined yet effective representation space, our approach ameliorates models' privacy vulnerability while maintaining generalizability. The code of this work can be found here: \url{https://github.com/JEKimLab/AIES2024_SRCM}
