Table of Contents
Fetching ...

Byzantine Resilient Federated Multi-Task Representation Learning

Tuan Le, Shana Moothedath

TL;DR

This work tackles personalization, transferability, and Byzantine resilience in federated learning by introducing BR-MTRL, which learns a shared representation $\phi$ across clients while keeping per-client heads $h_i$ locally adaptable. It optimizes via alternating gradient descent between updating $h_i$ with fixed $\phi$ and updating $\phi$ with fixed $h_i$, and employs robust aggregation using the Geometric Median and Krum to mitigate malicious updates. The approach is validated on CIFAR-10 and FEMNIST in an AWS-based testbed, showing improved accuracy under Byzantine attacks and successful transfer of the shared representation to unseen clients. The results suggest BR-MTRL provides robust, transferable personalization in heterogeneous federated environments with practical implications for privacy-preserving collaborative learning.

Abstract

In this paper, we propose BR-MTRL, a Byzantine-resilient multi-task representation learning framework that handles faulty or malicious agents. Our approach leverages representation learning through a shared neural network model, where all clients share fixed layers, except for a client-specific final layer. This structure captures shared features among clients while enabling individual adaptation, making it a promising approach for leveraging client data and computational power in heterogeneous federated settings to learn personalized models. To learn the model, we employ an alternating gradient descent strategy: each client optimizes its local model, updates its final layer, and sends estimates of the shared representation to a central server for aggregation. To defend against Byzantine agents, we employ two robust aggregation methods for client-server communication, Geometric Median and Krum. Our method enables personalized learning while maintaining resilience in distributed settings. We implemented the proposed algorithm in a federated testbed built using Amazon Web Services (AWS) platform and compared its performance with various benchmark algorithms and their variations. Through experiments using real-world datasets, including CIFAR-10 and FEMNIST, we demonstrated the effectiveness and robustness of our approach and its transferability to new unseen clients with limited data, even in the presence of Byzantine adversaries.

Byzantine Resilient Federated Multi-Task Representation Learning

TL;DR

This work tackles personalization, transferability, and Byzantine resilience in federated learning by introducing BR-MTRL, which learns a shared representation across clients while keeping per-client heads locally adaptable. It optimizes via alternating gradient descent between updating with fixed and updating with fixed , and employs robust aggregation using the Geometric Median and Krum to mitigate malicious updates. The approach is validated on CIFAR-10 and FEMNIST in an AWS-based testbed, showing improved accuracy under Byzantine attacks and successful transfer of the shared representation to unseen clients. The results suggest BR-MTRL provides robust, transferable personalization in heterogeneous federated environments with practical implications for privacy-preserving collaborative learning.

Abstract

In this paper, we propose BR-MTRL, a Byzantine-resilient multi-task representation learning framework that handles faulty or malicious agents. Our approach leverages representation learning through a shared neural network model, where all clients share fixed layers, except for a client-specific final layer. This structure captures shared features among clients while enabling individual adaptation, making it a promising approach for leveraging client data and computational power in heterogeneous federated settings to learn personalized models. To learn the model, we employ an alternating gradient descent strategy: each client optimizes its local model, updates its final layer, and sends estimates of the shared representation to a central server for aggregation. To defend against Byzantine agents, we employ two robust aggregation methods for client-server communication, Geometric Median and Krum. Our method enables personalized learning while maintaining resilience in distributed settings. We implemented the proposed algorithm in a federated testbed built using Amazon Web Services (AWS) platform and compared its performance with various benchmark algorithms and their variations. Through experiments using real-world datasets, including CIFAR-10 and FEMNIST, we demonstrated the effectiveness and robustness of our approach and its transferability to new unseen clients with limited data, even in the presence of Byzantine adversaries.

Paper Structure

This paper contains 12 sections, 7 equations, 2 figures, 2 tables, 1 algorithm.

Figures (2)

  • Figure 1: The system model of proposed MTRL.
  • Figure 2: FedRep, Byzantine FedRep, BR-MTRL+GM (proposed), BR-MTRL+Krum (proposed) Naive. Average test accuracy across all clients: Figure \ref{['fig:cfsr']} and \ref{['fig:cfml']} show the results for CIFAR-10 with 100 clients with 20 malicious clients under SR and ML attacks respectively. Figure \ref{['fig:emsr']} and \ref{['fig:emml']} present the results for FEMNIST with 150 clients with 50 malicious clients under SR and ML attacks respectively. We compared our proposed BR-MTRL with the FedRep algorithm under benign conditions (i.e., no malicious client) and FedRep subjected to Byzantine attacks (Byzantine FedRep). Meta test for new clients: Figures \ref{['fig:metacfsr']}, \ref{['fig:metaemsr']}, \ref{['fig:metaml']} and \ref{['fig:metasr']} present the average test accuracy across 10 new clients (CIFAR-10 and EMNIST) after fine-tuning the local heads for 10 epochs using the shared model. We first learned the shared model using source clients and then used the shared model to optimize the local head of new clients.

Theorems & Definitions (1)

  • Remark 3.1