Differentially Private Gradient-Tracking-Based Distributed Stochastic Optimization over Directed Graphs
Jialong Chen, Jimin Wang, Ji-Feng Zhang
TL;DR
The paper introduces a differentially private gradient-tracking algorithm for distributed stochastic optimization over directed graphs, using two schemes that couple step-size schedules with adaptive subsampling to achieve a finite cumulative privacy budget even over infinite iterations. It proves almost sure and mean-square convergence for nonconvex objectives, with polynomial rates under PL conditions for Scheme (S1) and exponential rates for Scheme (S2). The work also provides a rigorous privacy analysis, showing finite DP budgets and detailing the privacy–convergence trade-off, supported by numerical experiments on MNIST and CIFAR-10 that illustrate practical performance and privacy guarantees. This framework broadens the applicability of differentially private distributed optimization to directed networks and nonconvex objectives while quantifying the privacy成本與效益的折衷.
Abstract
This paper proposes a differentially private gradient-tracking-based distributed stochastic optimization algorithm over directed graphs. In particular, privacy noises are incorporated into each agent's state and tracking variable to mitigate information leakage, after which the perturbed states and tracking variables are transmitted to neighbors. We design two novel schemes for the step-sizes and the sampling number within the algorithm. The sampling parameter-controlled subsampling method employed by both schemes enhances the differential privacy level, and ensures a finite cumulative privacy budget even over infinite iterations. The algorithm achieves both almost sure and mean square convergence for nonconvex objectives. Furthermore, when nonconvex objectives satisfy the Polyak-Lojasiewicz (PL) condition, Scheme (S1) achieves a polynomial mean square convergence rate, and Scheme (S2) achieves an exponential mean square convergence rate. The trade-off between privacy and convergence is presented. The effectiveness of the algorithm and its superior performance compared to existing works are illustrated through numerical examples of distributed training on the benchmark datasets "MNIST" and "CIFAR-10".
