FedHCA$^2$: Towards Hetero-Client Federated Multi-Task Learning
Yuxiang Lu, Suizhi Huang, Yuwen Yang, Shalayiding Sirejiding, Yue Ding, Hongtao Lu
TL;DR
This work tackles HC-FMTL by relaxing model congruity in federated multi-task learning and proposing FedHCA2, which decouples encoder and decoder aggregation into Hyper Conflict-Averse and Hyper Cross Attention schemes, respectively, supplemented by learnable Hyper Aggregation Weights. Theoretical analysis links MTL and FL optimization and motivates conflict mitigation during encoder updates, while layer wise cross attention enables fine grained decoder interactions across heterogeneous tasks. Empirical results on PASCAL-Context and NYUD-v2 demonstrate that FedHCA2 outperforms traditional FL and FMTL baselines, with ablations confirming the necessity of both encoder and decoder aggregations and the adaptability provided by the hyper weights. The approach broadens the applicability of federated multi-task learning to realistic settings with diverse task setups and data domains, offering a flexible, scalable framework for personalized yet collaborative models across heterogeneous clients.
Abstract
Federated Learning (FL) enables joint training across distributed clients using their local data privately. Federated Multi-Task Learning (FMTL) builds on FL to handle multiple tasks, assuming model congruity that identical model architecture is deployed in each client. To relax this assumption and thus extend real-world applicability, we introduce a novel problem setting, Hetero-Client Federated Multi-Task Learning (HC-FMTL), to accommodate diverse task setups. The main challenge of HC-FMTL is the model incongruity issue that invalidates conventional aggregation methods. It also escalates the difficulties in accurate model aggregation to deal with data and task heterogeneity inherent in FMTL. To address these challenges, we propose the FedHCA$^2$ framework, which allows for federated training of personalized models by modeling relationships among heterogeneous clients. Drawing on our theoretical insights into the difference between multi-task and federated optimization, we propose the Hyper Conflict-Averse Aggregation scheme to mitigate conflicts during encoder updates. Additionally, inspired by task interaction in MTL, the Hyper Cross Attention Aggregation scheme uses layer-wise cross attention to enhance decoder interactions while alleviating model incongruity. Moreover, we employ learnable Hyper Aggregation Weights for each client to customize personalized parameter updates. Extensive experiments demonstrate the superior performance of FedHCA$^2$ in various HC-FMTL scenarios compared to representative methods. Our code will be made publicly available.
