Forward Learning with Differential Privacy
Mingqian Feng, Zeliang Zhang, Jinyang Jiang, Yijie Peng, Chenliang Xu
TL;DR
This work targets differential privacy in deep learning by exploiting forward-learning perturbations rather than backpropagation-based gradient noise. It introduces DP-ULR, a privatized forward-learning algorithm that combines a sampling-with-rejection strategy, likelihood-ratio gradient proxies, and a dynamic privacy controller to bound DP costs. The authors provide a theoretical DP analysis using SRGM and $(\alpha,\gamma)$-RDP, demonstrating that the privacy impact of rejection sampling is negligible and that DP-ULR can achieve DP guarantees with competitive utility relative to DP-SGD. Empirically, DP-ULR is shown to perform well on MNIST with MLPs and CIFAR-10 with CNNs, particularly at larger batch sizes, while offering advantages in parallelization and applicability to non-differentiable or black-box components.
Abstract
Differential privacy (DP) in deep learning is a critical concern as it ensures the confidentiality of training data while maintaining model utility. Existing DP training algorithms provide privacy guarantees by clipping and then injecting external noise into sample gradients computed by the backpropagation algorithm. Different from backpropagation, forward-learning algorithms based on perturbation inherently add noise during the forward pass and utilize randomness to estimate the gradients. Although these algorithms are non-privatized, the introduction of noise during the forward pass indirectly provides internal randomness protection to the model parameters and their gradients, suggesting the potential for naturally providing differential privacy. In this paper, we propose a \blue{privatized} forward-learning algorithm, Differential Private Unified Likelihood Ratio (DP-ULR), and demonstrate its differential privacy guarantees. DP-ULR features a novel batch sampling operation with rejection, of which we provide theoretical analysis in conjunction with classic differential privacy mechanisms. DP-ULR is also underpinned by a theoretically guided privacy controller that dynamically adjusts noise levels to manage privacy costs in each training step. Our experiments indicate that DP-ULR achieves competitive performance compared to traditional differential privacy training algorithms based on backpropagation, maintaining nearly the same privacy loss limits.
