DPDR: Gradient Decomposition and Reconstruction for Differentially Private Deep Learning
Yixuan Liu, Li Xiong, Yuhan Liu, Yujie Gu, Ruixuan Liu, Hong Chen
TL;DR
DPDR tackles the privacy-utility trade-off in DP-SGD by separating gradients into a reusable common-knowledge direction and an incremental component, and then privatizing mainly the incremental part. By applying a Gradient Decomposition and Reconstruction (GDR) at early training steps and leveraging a mixed strategy to switch to DP-SGD later, DPDR achieves higher information gain per unit privacy budget and faster convergence. The authors provide formal privacy guarantees, non-convex convergence analysis, and extensive experiments showing DPDR improves accuracy and convergence over DP-SGD and related methods across multiple datasets and models. This approach offers a principled way to exploit gradient coherence in private deep learning, with practical implications for privacy-preserving training in real-world systems.
Abstract
Differentially Private Stochastic Gradients Descent (DP-SGD) is a prominent paradigm for preserving privacy in deep learning. It ensures privacy by perturbing gradients with random noise calibrated to their entire norm at each training step. However, this perturbation suffers from a sub-optimal performance: it repeatedly wastes privacy budget on the general converging direction shared among gradients from different batches, which we refer as common knowledge, yet yields little information gain. Motivated by this, we propose a differentially private training framework with early gradient decomposition and reconstruction (DPDR), which enables more efficient use of the privacy budget. In essence, it boosts model utility by focusing on incremental information protection and recycling the privatized common knowledge learned from previous gradients at early training steps. Concretely, DPDR incorporates three steps. First, it disentangles common knowledge and incremental information in current gradients by decomposing them based on previous noisy gradients. Second, most privacy budget is spent on protecting incremental information for higher information gain. Third, the model is updated with the gradient reconstructed from recycled common knowledge and noisy incremental information. Theoretical analysis and extensive experiments show that DPDR outperforms state-of-the-art baselines on both convergence rate and accuracy.
