FedAGHN: Personalized Federated Learning with Attentive Graph HyperNetworks

Jiarui Song; Yunheng Shen; Chengbin Hou; Pengyu Wang; Jinbao Wang; Ke Tang; Hairong Lv

TL;DR

FedAGHN tackles statistical heterogeneity in Federated Learning by learning client-specific, layer-wise collaboration graphs via Attentive Graph HyperNetworks to generate personalized initial models for each client. It introduces two trainable scalars per layer to adapt collaboration patterns and employs cosine-based priors on previous updates to compute attention weights, enabling end-to-end optimization of personalized aggregation. Empirical results across CIFAR-10/100 and Tiny-ImageNet under various non-IID settings demonstrate state-of-the-art performance, with analyses and visualizations confirming the learned graphs reflect data distribution similarities and dynamic collaboration across rounds. The approach yields a lightweight, scalable server-side mechanism for fine-grained personalization with practical implications for real-world FL deployments.

Abstract

Personalized Federated Learning (PFL) aims to address the statistical heterogeneity of data across clients by learning a personalized model for each client. Among the various PFL approaches, personalized aggregation-based methods generate personalized models during the server-side aggregation phase and focus on learning appropriate collaborative relationships among clients for that aggregation. However, these collaborative relationships vary across scenarios and even across stages of the FL process. To this end, we propose Personalized Federated Learning with Attentive Graph HyperNetworks (FedAGHN), which employs Attentive Graph HyperNetworks (AGHNs) to dynamically capture fine-grained collaborative relationships and generate client-specific personalized initial models. Specifically, AGHNs use graphs to explicitly model client-specific collaborative relationships: they construct collaboration graphs and introduce a tunable attentive mechanism to derive the collaboration weights, so that the personalized initial models are obtained by aggregating parameters over the collaboration graphs. Extensive experiments demonstrate the superiority of FedAGHN. Moreover, a series of visualizations explores the effectiveness of the collaboration graphs learned by FedAGHN.
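
To make the aggregation idea concrete, the following is a minimal sketch of attention-weighted aggregation for a single layer and a single target client. It is not the authors' implementation: the exact AGHN formulation is not given here, so the function names, the roles assigned to the two per-layer scalars (`sim_scale` scaling the cosine similarity of previous updates, `self_bias` boosting the client's own weight), and the softmax normalization are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two flat vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def collaboration_weights(updates, target, sim_scale, self_bias):
    """Weights of client `target` over all clients for one layer.

    `updates[j]` is client j's previous update for this layer (flattened).
    `sim_scale` and `self_bias` stand in for the two trainable per-layer
    scalars; their exact roles here are an assumption of this sketch.
    """
    scores = []
    for j, update in enumerate(updates):
        score = sim_scale * cosine(updates[target], update)
        if j == target:
            score += self_bias  # hypothetical extra weight on the client itself
        scores.append(score)
    return softmax(scores)


def personalized_layer(params, updates, target, sim_scale, self_bias):
    """Aggregate one layer's parameters over the target's collaboration weights."""
    weights = collaboration_weights(updates, target, sim_scale, self_bias)
    dim = len(params[0])
    return [sum(w * p[k] for w, p in zip(weights, params)) for k in range(dim)]


# Toy example: clients 0 and 1 moved in similar directions, client 2 did not.
updates = [[1.0, 0.0], [1.0, 0.1], [-1.0, 0.0]]
params = [[0.0], [3.0], [6.0]]
weights = collaboration_weights(updates, target=0, sim_scale=2.0, self_bias=0.5)
initial_model = personalized_layer(params, updates, 0, 2.0, 0.5)
```

In this sketch, setting both scalars to zero makes every weight equal, so the aggregation reduces to a plain average of the clients' parameters; larger values of `sim_scale` concentrate the weights on clients whose previous updates point in a similar direction.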
