Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks
Yanbo Wang, Jian Liang, Ran He
TL;DR
This paper tackles gradient inversion attacks under realistic soft-label training where label smoothing and mixup obscure one-hot labels. It introduces an analytical method that recovers augmented labels and the last-layer input features from a single input gradient by solving for a scalar multiplier $\lambda_r$ so that the pseudo-label $\hat{y}$ aligns with a variance-based loss $\mathcal{L}_{label}$, and proves a normalization constraint via Theorem 1 that $\sum_i \hat{y}_i = 1$. Empirically, the method yields label-recovery accuracy above 95% across CIFAR-100, Flowers-17, and ImageNet, and enhances FCN and CNN image reconstruction compared to prior attacks. The results imply that soft-label regimes do not fully prevent gradient inversion in federated learning, highlighting privacy implications and motivating defenses.
Abstract
Gradient inversion attacks aim to reconstruct local training data from intermediate gradients exposed in the federated learning framework. Despite successful attacks, all previous methods, starting from reconstructing a single data point and then relaxing the single-image limit to batch level, are only tested under hard label constraints. Even for single-image reconstruction, we still lack an analysis-based algorithm to recover augmented soft labels. In this work, we change the focus from enlarging batchsize to investigating the hard label constraints, considering a more realistic circumstance where label smoothing and mixup techniques are used in the training process. In particular, we are the first to initiate a novel algorithm to simultaneously recover the ground-truth augmented label and the input feature of the last fully-connected layer from single-input gradients, and provide a necessary condition for any analytical-based label recovery methods. Extensive experiments testify to the label recovery accuracy, as well as the benefits to the following image reconstruction. We believe soft labels in classification tasks are worth further attention in gradient inversion attacks.
