Gradient Inversion Transcript: Leveraging Robust Generative Priors to Reconstruct Training Data from Gradient Leakage
Xinping Chen, Chen Liu
TL;DR
This work tackles the privacy risk of gradient leakage in distributed and federated learning by proposing Gradient Inversion Transcript (GIT), a generative attack whose architecture adapts to the leaked model and can be trained offline to reconstruct training data from gradients. GIT comes in two variants—Exact-GIT and Coarse-GIT—offering a principled, architecture-aware inversion that can also serve as a priors for iterative gradient matching, accelerating convergence and improving reconstruction quality. Across CIFAR-10, ImageNet, and facial datasets with LeNet, ResNet, and ViT backbones, GIT outperforms existing baselines in direct inference and enhances optimization-based reconstructions when used as priors, while remaining robust under inaccurate gradients, distribution shifts, and parameter discrepancies. The results underscore practical privacy implications for federated learning and provide a flexible, offline-trained framework for gradient-to-input inversion that can adapt to diverse network architectures while maintaining efficiency and robustness.
Abstract
We propose Gradient Inversion Transcript (GIT), a novel generative approach for reconstructing training data from leaked gradients. GIT employs a generative attack model, whose architecture is tailored to align with the structure of the leaked model based on theoretical analysis. Once trained offline, GIT can be deployed efficiently and only relies on the leaked gradients to reconstruct the input data, rendering it applicable under various distributed learning environments. When used as a prior for other iterative optimization-based methods, GIT not only accelerates convergence but also enhances the overall reconstruction quality. GIT consistently outperforms existing methods across multiple datasets and demonstrates strong robustness under challenging conditions, including inaccurate gradients, data distribution shifts and discrepancies in model parameters.
