Table of Contents
Fetching ...

Differentially Private Federated Learning With Time-Adaptive Privacy Spending

Shahrzad Kiani, Nupur Kulkarni, Adam Dziedzic, Stark Draper, Franziska Boenisch

TL;DR

This paper tackles the DP-FL privacy-utility tradeoff by introducing a time-adaptive spend-as-you-go framework that saves privacy budget in early rounds and spends more in later rounds, enabling better learning of fine-grained features without increasing privacy risk. It derives comprehensive privacy accounting using Rényi DP, proves that per-client privacy spending is non-decreasing under the schedule, and shows how optimally permuting saving-based sampling rates can reduce clipping bias. The authors provide theoretical bounds and practical algorithms for selecting when to save vs. spend and how to allocate sampling rates, complemented by experiments on FMNIST, MNIST, Adult, and CIFAR10 that demonstrate improved utility over DP-FedAvg and IDP-FedAvg under heterogeneous budgets. The work has practical implications for deploying DP-FL in real-world settings with diverse privacy constraints, offering a principled approach to allocate privacy budget where it yields the most accuracy gains while maintaining rigorous privacy guarantees.

Abstract

Federated learning (FL) with differential privacy (DP) provides a framework for collaborative machine learning, enabling clients to train a shared model while adhering to strict privacy constraints. The framework allows each client to have an individual privacy guarantee, e.g., by adding different amounts of noise to each client's model updates. One underlying assumption is that all clients spend their privacy budgets uniformly over time (learning rounds). However, it has been shown in the literature that learning in early rounds typically focuses on more coarse-grained features that can be learned at lower signal-to-noise ratios while later rounds learn fine-grained features that benefit from higher signal-to-noise ratios. Building on this intuition, we propose a time-adaptive DP-FL framework that expends the privacy budget non-uniformly across both time and clients. Our framework enables each client to save privacy budget in early rounds so as to be able to spend more in later rounds when additional accuracy is beneficial in learning more fine-grained features. We theoretically prove utility improvements in the case that clients with stricter privacy budgets spend budgets unevenly across rounds, compared to clients with more relaxed budgets, who have sufficient budgets to distribute their spend more evenly. Our practical experiments on standard benchmark datasets support our theoretical results and show that, in practice, our algorithms improve the privacy-utility trade-offs compared to baseline schemes.

Differentially Private Federated Learning With Time-Adaptive Privacy Spending

TL;DR

This paper tackles the DP-FL privacy-utility tradeoff by introducing a time-adaptive spend-as-you-go framework that saves privacy budget in early rounds and spends more in later rounds, enabling better learning of fine-grained features without increasing privacy risk. It derives comprehensive privacy accounting using Rényi DP, proves that per-client privacy spending is non-decreasing under the schedule, and shows how optimally permuting saving-based sampling rates can reduce clipping bias. The authors provide theoretical bounds and practical algorithms for selecting when to save vs. spend and how to allocate sampling rates, complemented by experiments on FMNIST, MNIST, Adult, and CIFAR10 that demonstrate improved utility over DP-FedAvg and IDP-FedAvg under heterogeneous budgets. The work has practical implications for deploying DP-FL in real-world settings with diverse privacy constraints, offering a principled approach to allocate privacy budget where it yields the most accuracy gains while maintaining rigorous privacy guarantees.

Abstract

Federated learning (FL) with differential privacy (DP) provides a framework for collaborative machine learning, enabling clients to train a shared model while adhering to strict privacy constraints. The framework allows each client to have an individual privacy guarantee, e.g., by adding different amounts of noise to each client's model updates. One underlying assumption is that all clients spend their privacy budgets uniformly over time (learning rounds). However, it has been shown in the literature that learning in early rounds typically focuses on more coarse-grained features that can be learned at lower signal-to-noise ratios while later rounds learn fine-grained features that benefit from higher signal-to-noise ratios. Building on this intuition, we propose a time-adaptive DP-FL framework that expends the privacy budget non-uniformly across both time and clients. Our framework enables each client to save privacy budget in early rounds so as to be able to spend more in later rounds when additional accuracy is beneficial in learning more fine-grained features. We theoretically prove utility improvements in the case that clients with stricter privacy budgets spend budgets unevenly across rounds, compared to clients with more relaxed budgets, who have sufficient budgets to distribute their spend more evenly. Our practical experiments on standard benchmark datasets support our theoretical results and show that, in practice, our algorithms improve the privacy-utility trade-offs compared to baseline schemes.

Paper Structure

This paper contains 32 sections, 7 theorems, 23 equations, 5 figures, 11 tables, 6 algorithms.

Key Result

Lemma 1

Given any $n\in [N], t\in [T]$, and $T_n\in [T]$, we have

Figures (5)

  • Figure 1: Our framework improves accuracy in later rounds compared to the baseline. We plot the global test accuracy vs. rounds for (a) the FMNIST dataset, and (b) the MNIST dataset. In (a), $({\epsilon}_{\text{group},1}, {\epsilon}_{\text{group},2}, {\epsilon}_{\text{group},3})=(10,20,30)$, and in (b) it equals $(10,15,20)$.
  • Figure 2: While both adhere to privacy budgets, our framework follows spend-as-you-go, whereas IDP-FedAvg uses uniform privacy spending. The blue solid curves correspond to clients' privacy spending in our framework, while the red dashed curves show IDP-FedAvg. The curves of clients with budgets of $30$, $20$, and $10$ are marked with rectangles, circles, and squares, respectively.
  • Figure 3: Average Training loss of clients in our time-adaptive DP-FL scheme plotted versus the IDP-FedAvg baseline with FMNIST dataset in training rounds T = 25. We set $({\epsilon}_{\text{group},1}, {\epsilon}_{\text{group},2}, {\epsilon}_{\text{group},3})=(10,20,30)$ in our scheme and IDP-FedAvg.
  • Figure 4: Global test accuracy for increasing number of communication rounds. In this figure, we use the FMNIST dataset, $N=100$ clients, $L=30$ local iterations, $({\epsilon}_{\text{group},1}, {\epsilon}_{\text{group},2}, {\epsilon}_{\text{group},3})=(20, 20, 20)$, $c=250$, and $q=0.8$.
  • Figure 5: Test accuracy for our time-adaptive DP-FL framework vs. IDP-FedAvg, using stricter privacy budgets $({\epsilon}_{\text{group},1},{\epsilon}_{\text{group},2},{\epsilon}_{\text{group},3})=(2,5,10)$. In this figure, we use $N=100$ clients, $T=50$ global rounds, $L=30$ local iterations, $c=250$, and $q=0.8$.

Theorems & Definitions (11)

  • Lemma 1
  • Theorem 1
  • Remark 1
  • Theorem 2
  • Theorem 3
  • Lemma 2
  • Definition 1: $({\epsilon}, \delta)$-DP dwork2014algorithmic
  • Definition 2: Rényi DP (RDP) mironov2017renyi
  • Lemma 3
  • Definition 3: SGM mironov2019r
  • ...and 1 more