Membership Inference Risks in Quantized Models: A Theoretical and Empirical Study
Eric Aubinais, Philippe Formont, Pablo Piantanida, Elisabeth Gassiat
TL;DR
This work analyzes how quantization affects membership inference risk in machine learning, deriving asymptotic MIS bounds for fixed and size-adaptive quantizers and proposing a scalable metric, $r_{\mathcal{Q}}^n$, to rank quantizers by privacy. The authors provide a practical estimation framework and validate it through synthetic data and molecular-property tasks, showing that higher sparsity can improve privacy but may degrade performance, especially in regression. The results offer a principled, asymptotically grounded approach to privacy-aware quantization design and highlight trade-offs that matter for deploying quantized models on privacy-sensitive data. Overall, the framework enables robust privacy benchmarking of PTQ techniques with tangible implications for secure model sharing and deployment.
Abstract
Quantizing machine learning models has demonstrated its effectiveness in lowering memory and inference costs while maintaining performance levels comparable to the original models. In this work, we investigate the impact of quantization procedures on the privacy of data-driven models, specifically focusing on their vulnerability to membership inference attacks. We derive an asymptotic theoretical analysis of Membership Inference Security (MIS), characterizing the privacy implications of quantized algorithm weights against the most powerful (and possibly unknown) attacks. Building on these theoretical insights, we propose a novel methodology to empirically assess and rank the privacy levels of various quantization procedures. Using synthetic datasets, we demonstrate the effectiveness of our approach in assessing the MIS of different quantizers. Furthermore, we explore the trade-off between privacy and performance using real-world data and models in the context of molecular modeling.
