Mitigating Privacy Risk in Membership Inference by Convex-Concave Loss
Zhenlong Liu, Lei Feng, Huiping Zhuang, Xiaofeng Cao, Hongxin Wei
TL;DR
This paper tackles privacy leakage from membership inference attacks by showing that convex loss functions inherently reduce training loss variance, which can amplify vulnerability to MIAs. It proposes Convex-Concave Loss (CCL), a general framework that adds a concave term to a convex loss (e.g., cross-entropy), driving higher loss variance during training and thus reducing attack advantage. The authors provide theoretical justification showing how convexity suppresses variance while concavity can boost it, and they derive gradient bounds ensuring convergence. Empirical results across five datasets (including CIFAR-10/100 and ImageNet) demonstrate that CCL achieves a state-of-the-art balance in the privacy-utility trade-off, improving robustness to multiple MIA types while preserving or improving accuracy. The work offers a practical defense that can be tuned via a single hyperparameter $\alpha$ and contributes insight into how loss landscape shaping affects privacy in neural networks.
Abstract
Machine learning models are susceptible to membership inference attacks (MIAs), which aim to infer whether a sample is in the training set. Existing work utilizes gradient ascent to enlarge the loss variance of training data, alleviating the privacy risk. However, optimizing toward a reverse direction may cause the model parameters to oscillate near local minima, leading to instability and suboptimal performance. In this work, we propose a novel method -- Convex-Concave Loss, which enables a high variance of training loss distribution by gradient descent. Our method is motivated by the theoretical analysis that convex losses tend to decrease the loss variance during training. Thus, our key idea behind CCL is to reduce the convexity of loss functions with a concave term. Trained with CCL, neural networks produce losses with high variance for training data, reinforcing the defense against MIAs. Extensive experiments demonstrate the superiority of CCL, achieving state-of-the-art balance in the privacy-utility trade-off.
