Be Careful What You Smooth For: Label Smoothing Can Be a Privacy Shield but Also a Catalyst for Model Inversion Attacks
Lukas Struppek, Dominik Hintersdorf, Kristian Kersting
TL;DR
The paper investigates how label smoothing (LS) impacts privacy in deep classifiers under model inversion attacks (MIAs). By formalizing LS as $\mathbf{y}^{\text{LS}}=(1-\alpha)\mathbf{y}+\frac{\alpha}{C}\mathbf{1}$ with $\alpha\in(-\infty,1]$, it demonstrates that positive LS increases privacy leakage, especially with limited data, while negative LS mitigates MIAs and can outperform existing defenses in the utility-privacy trade-off. Through high-resolution face-recognition experiments using Plug & Play Attacks (PPA), embedding-space analyses, and ablations, the study shows that negative LS not only reduces leakage but also yields more robust defenses without major utility loss. This reveals a practical, parameterizable defense strategy against MIAs and highlights the broader need to account for regularization choices when evaluating model privacy in real-world deployments.
Abstract
Label smoothing -- using softened labels instead of hard ones -- is a widely adopted regularization method for deep learning, showing diverse benefits such as enhanced generalization and calibration. Its implications for preserving model privacy, however, have remained unexplored. To fill this gap, we investigate the impact of label smoothing on model inversion attacks (MIAs), which aim to generate class-representative samples by exploiting the knowledge encoded in a classifier, thereby inferring sensitive information about its training data. Through extensive analyses, we uncover that traditional label smoothing fosters MIAs, thereby increasing a model's privacy leakage. Even more, we reveal that smoothing with negative factors counters this trend, impeding the extraction of class-related information and leading to privacy preservation, beating state-of-the-art defenses. This establishes a practical and powerful novel way for enhancing model resilience against MIAs.
