Table of Contents
Fetching ...

Private and Federated Stochastic Convex Optimization: Efficient Strategies for Centralized Systems

Roie Reshef, Kfir Y. Levy

TL;DR

This work tackles differential privacy in federated learning under the stochastic convex optimization framework in centralized systems, addressing both trusted and untrusted servers. It builds on the $\mu^2$-SGD methodology by introducing Noisy-$\mu^2$-SGD and a parallel minibatch FL template that preserves linear computational complexity in the total data size while enabling DP. The authors derive optimal population-loss convergence bounds for both untrusted ($O\left(\frac{1}{\sqrt{nM}}+\frac{\sqrt{d}}{\epsilon n \sqrt{M}}\right)$) and trusted ($O\left(\frac{1}{\sqrt{nM}}+\frac{\sqrt{d}}{\epsilon n M}\right)$) server scenarios, along with gradient-error and sensitivity analyses, and provide empirical validation on MNIST. The approach significantly improves the practicality of DP in FL by balancing privacy, efficiency, and robustness across server-trust environments.

Abstract

This paper addresses the challenge of preserving privacy in Federated Learning (FL) within centralized systems, focusing on both trusted and untrusted server scenarios. We analyze this setting within the Stochastic Convex Optimization (SCO) framework, and devise methods that ensure Differential Privacy (DP) while maintaining optimal convergence rates for homogeneous and heterogeneous data distributions. Our approach, based on a recent stochastic optimization technique, offers linear computational complexity, comparable to non-private FL methods, and reduced gradient obfuscation. This work enhances the practicality of DP in FL, balancing privacy, efficiency, and robustness in a variety of server trust environment.

Private and Federated Stochastic Convex Optimization: Efficient Strategies for Centralized Systems

TL;DR

This work tackles differential privacy in federated learning under the stochastic convex optimization framework in centralized systems, addressing both trusted and untrusted servers. It builds on the -SGD methodology by introducing Noisy--SGD and a parallel minibatch FL template that preserves linear computational complexity in the total data size while enabling DP. The authors derive optimal population-loss convergence bounds for both untrusted () and trusted () server scenarios, along with gradient-error and sensitivity analyses, and provide empirical validation on MNIST. The approach significantly improves the practicality of DP in FL by balancing privacy, efficiency, and robustness across server-trust environments.

Abstract

This paper addresses the challenge of preserving privacy in Federated Learning (FL) within centralized systems, focusing on both trusted and untrusted server scenarios. We analyze this setting within the Stochastic Convex Optimization (SCO) framework, and devise methods that ensure Differential Privacy (DP) while maintaining optimal convergence rates for homogeneous and heterogeneous data distributions. Our approach, based on a recent stochastic optimization technique, offers linear computational complexity, comparable to non-private FL methods, and reduced gradient obfuscation. This work enhances the practicality of DP in FL, balancing privacy, efficiency, and robustness in a variety of server trust environment.
Paper Structure (44 sections, 16 theorems, 106 equations, 2 tables, 2 algorithms)

This paper contains 44 sections, 16 theorems, 106 equations, 2 tables, 2 algorithms.

Key Result

Lemma 2.2

Let $P\sim\mathcal{N}(\mu,I\sigma^2)$ and $Q\sim\mathcal{N}(\mu+\Delta,I\sigma^2)$, two Gaussian distributions. Then, $\mathbb{D}_\alpha\left(P\|Q\right)=\frac{\alpha\|\Delta\|^2}{2\sigma^2}$.

Theorems & Definitions (29)

  • Definition 2.1: Rényi Divergence renyi
  • Lemma 2.2
  • Definition 2.3: Differential Privacy odo_dpcalib_dp
  • Definition 2.4: Rényi Differential Privacy rdp
  • Lemma 2.5: rdp
  • Lemma 4.1
  • Lemma 4.2
  • Lemma 4.3
  • Theorem 5.1: Privacy Guarantees for \ref{['alg:untrust']}
  • proof : Proof Sketch
  • ...and 19 more