Compressed Private Aggregation for Scalable and Robust Federated Learning over Massive Networks

Natalie Lang; Nir Shlezinger; Rafael G. L. D'Oliveira; Salim El Rouayheb

Compressed Private Aggregation for Scalable and Robust Federated Learning over Massive Networks

Natalie Lang, Nir Shlezinger, Rafael G. L. D'Oliveira, Salim El Rouayheb

TL;DR

This paper tackles privacy, robustness, and communication bottlenecks in Federated Learning over massive networks by introducing Compressed Private Aggregation (CPA). CPA uses nested lattice quantizers and random codebooks to encode local updates into very few bits, while applying randomized response to meet $\varepsilon$-LD and constructing a $k$-anonymous representation that resists Byzantine poisoning. The authors prove that CPA preserves the FL convergence rate at order $O(1/t)$ in the amortized sense, with a per-round distortion that diminishes as the number of participating users $K$ grows, and validate these properties with extensive experiments on MNIST and CIFAR-10 showing competitive accuracy against non-private baselines and robustness to malicious participants. Overall, CPA offers a scalable, privacy-preserving, and Byzantine-robust framework for federated learning in large-scale deployments, enabling practical deployments in massive networks with minimal communication overhead.

Abstract

Federated learning (FL) is an emerging paradigm that allows a central server to train machine learning models using remote users' data. Despite its growing popularity, FL faces challenges in preserving the privacy of local datasets, its sensitivity to poisoning attacks by malicious users, and its communication overhead. The latter is additionally considerably dominant in large-scale networks. These limitations are often individually mitigated by local differential privacy (LDP) mechanisms, robust aggregation, compression, and user selection techniques, which typically come at the cost of accuracy. In this work, we present compressed private aggregation (CPA), that allows massive deployments to simultaneously communicate at extremely low bit rates while achieving privacy, anonymity, and resilience to malicious users. CPA randomizes a codebook for compressing the data into a few bits using nested lattice quantizers, while ensuring anonymity and robustness, with a subsequent perturbation to hold LDP. The proposed CPA is proven to result in FL convergence in the same asymptotic rate as FL without privacy, compression, and robustness considerations, while satisfying both anonymity and LDP requirements. These analytical properties are empirically confirmed in a numerical study, where we demonstrate the performance gains of CPA compared with separate mechanisms for compression and privacy for training different image classification models, as well as its robustness in mitigating the harmful effects of malicious users.

Compressed Private Aggregation for Scalable and Robust Federated Learning over Massive Networks

TL;DR

-LD and constructing a

-anonymous representation that resists Byzantine poisoning. The authors prove that CPA preserves the FL convergence rate at order

in the amortized sense, with a per-round distortion that diminishes as the number of participating users

grows, and validate these properties with extensive experiments on MNIST and CIFAR-10 showing competitive accuracy against non-private baselines and robustness to malicious participants. Overall, CPA offers a scalable, privacy-preserving, and Byzantine-robust framework for federated learning in large-scale deployments, enabling practical deployments in massive networks with minimal communication overhead.

Abstract

Paper Structure (26 sections, 8 theorems, 39 equations, 10 figures, 6 tables, 1 algorithm)

This paper contains 26 sections, 8 theorems, 39 equations, 10 figures, 6 tables, 1 algorithm.

Introduction
System Model and Preliminaries
Federated Learning
Quantization Preliminaries
Privacy Preliminaries
Problem Description
Threat Model
Problem Formulation
Compressed Private Aggregation
1-Bit CPA
Initialization
Encoding
Decoding
Nested CPA
Initialization
...and 11 more sections

Key Result

Proposition 1

cpa is $\varepsilon$-ldp with respect to $\mathcal{D}_r$, per communication round.

Figures (10)

Figure 1: Nested self-similar lattice quantizers for $L=2$ (top) and $L=1$ (bottom).
Figure 2: Overview of cpa. The left dashed box represents the $r$-th user encoding while the right describes the server decoding.
Figure 3: Example: a scalar input is mapped into a point that corresponds to the continues vale of $3\Delta$, which is $9$-th quantization point. Accordingly, as the $9$-th entry of the randomized vector is $-1$, so does the output.
Figure 4: Overview of nested cpa. The upper solid box represents the $1$-bit CPA with a coarse quantizer while the bottom describes that with the nested one.
Figure 5: Example: the input is mapped to $3\Delta$ and $0$ by the coarse and nested quantizers respectively; that are the third and second quantization points in each, correspond to $1$ and $-1$ in the randomized vectors, as the outputs.
...and 5 more figures

Theorems & Definitions (14)

Definition 2.1: Lattice Quantizer
Definition 2.2: Uniform Quantizer
Definition 2.3: Nested Lattice Codebook polyanskiy2014lecture
Definition 2.4: $\varepsilon$-ldp wang2020comprehensive
Definition 2.5: $k$-anonymity sweeney2002k
Claim 1
Proposition 1
Proposition 2
Corollary 1
Theorem 4.1
...and 4 more

Compressed Private Aggregation for Scalable and Robust Federated Learning over Massive Networks

TL;DR

Abstract

Compressed Private Aggregation for Scalable and Robust Federated Learning over Massive Networks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (10)

Theorems & Definitions (14)