Provable Mutual Benefits from Federated Learning in Privacy-Sensitive Domains

Nikita Tsoy; Anna Mihalkova; Teodora Todorova; Nikola Konstantinov

Provable Mutual Benefits from Federated Learning in Privacy-Sensitive Domains

Nikita Tsoy, Anna Mihalkova, Teodora Todorova, Nikola Konstantinov

TL;DR

This paper provides necessary and sufficient conditions for the existence of mutually beneficial protocols in the context of mean estimation and convex stochastic optimization and designs protocols that maximize the total clients' utility, given symmetric privacy preferences.

Abstract

Cross-silo federated learning (FL) allows data owners to train accurate machine learning models by benefiting from each others private datasets. Unfortunately, the model accuracy benefits of collaboration are often undermined by privacy defenses. Therefore, to incentivize client participation in privacy-sensitive domains, a FL protocol should strike a delicate balance between privacy guarantees and end-model accuracy. In this paper, we study the question of when and how a server could design a FL protocol provably beneficial for all participants. First, we provide necessary and sufficient conditions for the existence of mutually beneficial protocols in the context of mean estimation and convex stochastic optimization. We also derive protocols that maximize the total clients' utility, given symmetric privacy preferences. Finally, we design protocols maximizing end-model accuracy and demonstrate their benefits in synthetic experiments.

Provable Mutual Benefits from Federated Learning in Privacy-Sensitive Domains

TL;DR

Abstract

Paper Structure (67 sections, 24 theorems, 155 equations, 1 figure, 1 algorithm)

This paper contains 67 sections, 24 theorems, 155 equations, 1 figure, 1 algorithm.

INTRODUCTION
Contributions
RELATED WORK
Accuracy-Privacy Trade-off in FL
Privacy-Related Incentives in FL
Other Incentives in Federated Learning
GENERAL FRAMEWORK
Federated Learning Protocol
Clients' Evaluations
$\texttt{err}_i\colon P \to [0, \infty)$
$\texttt{leak}_i\colon P \to [0, \infty)$
Clients' Utilities and Rationality
Parametric Utility Form
Participation Constraints
Server's Objectives
...and 52 more sections

Key Result

Theorem 4.1

$\hat{\mu}_i$ has the following property where $\rho \coloneqq \frac{n}{\sigma^2}$, $\beta_i \coloneqq \frac{1}{\frac{1}{\rho} + \alpha_i^2}$, $\gamma_i \coloneqq \sum_{k \neq i} \beta_k$.

Figures (1)

Figure 1: Personalized vs symmetric protocols. Error bars depict the standard deviation of the average effectiveness.

Theorems & Definitions (33)

Definition 3.1
Definition 3.2
Theorem 4.1: Proof in \ref{['sec:dp-mean-predictor-proof']}
Theorem 4.2: Proof in \ref{['sec:dp-mean-existence-proof']}
Theorem 4.3: Proof in \ref{['sec:dp-sgd-final-acc-proof']}
Theorem 4.4: Proof in \ref{['sec:dp-sgd-privacy-proof']}
Theorem 4.5: Proof in \ref{['sec:dp-sgd-existence-proof']}
Corollary 4.6: Proof in \ref{['sec:dp-sgd-existence-simple-proof']}
Theorem 4.7: Proof in \ref{['sec:b-mean-accuracy-proof']}
Theorem 4.8: Proof in \ref{['sec:b-mean-privacy-proof']}
...and 23 more

Provable Mutual Benefits from Federated Learning in Privacy-Sensitive Domains

TL;DR

Abstract

Provable Mutual Benefits from Federated Learning in Privacy-Sensitive Domains

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (33)