Byzantine Resilient Federated Multi-Task Representation Learning

Tuan Le; Shana Moothedath

Byzantine Resilient Federated Multi-Task Representation Learning

Tuan Le, Shana Moothedath

TL;DR

This work tackles personalization, transferability, and Byzantine resilience in federated learning by introducing BR-MTRL, which learns a shared representation $\phi$ across clients while keeping per-client heads $h_i$ locally adaptable. It optimizes via alternating gradient descent between updating $h_i$ with fixed $\phi$ and updating $\phi$ with fixed $h_i$, and employs robust aggregation using the Geometric Median and Krum to mitigate malicious updates. The approach is validated on CIFAR-10 and FEMNIST in an AWS-based testbed, showing improved accuracy under Byzantine attacks and successful transfer of the shared representation to unseen clients. The results suggest BR-MTRL provides robust, transferable personalization in heterogeneous federated environments with practical implications for privacy-preserving collaborative learning.

Abstract

In this paper, we propose BR-MTRL, a Byzantine-resilient multi-task representation learning framework that handles faulty or malicious agents. Our approach leverages representation learning through a shared neural network model, where all clients share fixed layers, except for a client-specific final layer. This structure captures shared features among clients while enabling individual adaptation, making it a promising approach for leveraging client data and computational power in heterogeneous federated settings to learn personalized models. To learn the model, we employ an alternating gradient descent strategy: each client optimizes its local model, updates its final layer, and sends estimates of the shared representation to a central server for aggregation. To defend against Byzantine agents, we employ two robust aggregation methods for client-server communication, Geometric Median and Krum. Our method enables personalized learning while maintaining resilience in distributed settings. We implemented the proposed algorithm in a federated testbed built using Amazon Web Services (AWS) platform and compared its performance with various benchmark algorithms and their variations. Through experiments using real-world datasets, including CIFAR-10 and FEMNIST, we demonstrated the effectiveness and robustness of our approach and its transferability to new unseen clients with limited data, even in the presence of Byzantine adversaries.

Byzantine Resilient Federated Multi-Task Representation Learning

TL;DR

Abstract

Byzantine Resilient Federated Multi-Task Representation Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (2)

Theorems & Definitions (1)