FedLog: Personalized Federated Classification with Less Communication and More Flexibility

Haolin Yu; Guojun Zhang; Pascal Poupart

FedLog: Personalized Federated Classification with Less Communication and More Flexibility

Haolin Yu, Guojun Zhang, Pascal Poupart

TL;DR

This work proposes to share sufficient data summaries instead of raw model parameters to reduce the overhead of FRL algorithms, and extends it with differential privacy framework to ensure formal privacy guarantee.

Abstract

Federated representation learning (FRL) aims to learn personalized federated models with effective feature extraction from local data. FRL algorithms that share the majority of the model parameters face significant challenges with huge communication overhead. This overhead stems from the millions of neural network parameters and slow aggregation progress of the averaging heuristic. To reduce the overhead, we propose to share sufficient data summaries instead of raw model parameters. The data summaries encode minimal sufficient statistics of an exponential family, and Bayesian inference is utilized for global aggregation. It helps to reduce message sizes and communication frequency. To further ensure formal privacy guarantee, we extend it with differential privacy framework. Empirical results demonstrate high learning accuracy with low communication overhead of our method.

FedLog: Personalized Federated Classification with Less Communication and More Flexibility

TL;DR

Abstract

Paper Structure (31 sections, 9 theorems, 17 equations, 2 figures, 5 tables, 1 algorithm)

This paper contains 31 sections, 9 theorems, 17 equations, 2 figures, 5 tables, 1 algorithm.

Introduction
Related Works
Federated Representation Learning
Method
Exponential Family
Model Construction
Notation.
FedLog
Interpretation and FedLog-C
Differential Privacy
Experiments
Synthetic
Communication Cost
Celeba
Flexible Architecture
...and 16 more sections

Key Result

Theorem 2

If $({\boldsymbol{\phi}_1}, y_1), ({\boldsymbol{\phi}_2}, y_2), \cdots, ({\boldsymbol{\phi}_n}, y_n)$ are i.i.d. samples from the exponential family defined with Eq. likeli, then ${\bf T}(({\boldsymbol{\phi}_1}, y_1), \cdots, ({\boldsymbol{\phi}_n}, y_n)) = \sum_{i=1}^n {\boldsymbol{\phi}_i}\otimes{

Figures (2)

Figure 1: Synthetic experiments. Dots are data points or local representations: dark green: client 0 class 0; light green: client 1 class 0; Dark blue: client 0 class 1; light blue: client 1 class 1. Dashed lines are linear separators. Accuracy results are averaged over 6 seeds.
Figure 2: From top to bottom: Celeba, flexible architecture, and differential privacy results. Colored area shows mean $\pm$ standard deviation of the accuracy.

Theorems & Definitions (21)

Definition 1
Theorem 2
proof
Definition 3
Theorem 4
proof
Definition A.5: 6.2.1 in casella2024statistical
Theorem A.6: 6.2.10 in casella2024statistical
Definition A.7: 6.2.11 in casella2024statistical
Theorem A.8: 6.2.13 in casella2024statistical
...and 11 more

FedLog: Personalized Federated Classification with Less Communication and More Flexibility

TL;DR

Abstract

FedLog: Personalized Federated Classification with Less Communication and More Flexibility

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (21)