Flow: Per-Instance Personalized Federated Learning Through Dynamic Routing
Kunjal Panchal, Sunav Choudhary, Nisarg Parikh, Lijun Zhang, Hui Guan
TL;DR
Flow addresses non-IID heterogeneity in federated learning by introducing per-instance routing between a global model and a client-specific local model. It constructs a dynamic personalized model $w_p$ per client using a routing module $\psi_g$ that decides, for each input, whether to use $w_g$ or $w_\ell$, with data split into $\zeta_{m,\ell}$ and $\zeta_{m,g}$ and FedAvg-based server aggregation. The approach includes an explicit convergence analysis for both global and personalized models and demonstrates, through extensive cross-domain experiments on language and vision tasks, that Flow improves both generalized and personalized accuracy while remaining scalable and friendly to new clients. Together, these results indicate that per-instance dynamic routing can meaningfully enhance personalization in large-scale, cross-device FL with practical deployment benefits.
Abstract
Personalization in Federated Learning (FL) aims to modify a collaboratively trained global model according to each client. Current approaches to personalization in FL are at a coarse granularity, i.e. all the input instances of a client use the same personalized model. This ignores the fact that some instances are more accurately handled by the global model due to better generalizability. To address this challenge, this work proposes Flow, a fine-grained stateless personalized FL approach. Flow creates dynamic personalized models by learning a routing mechanism that determines whether an input instance prefers the local parameters or its global counterpart. Thus, Flow introduces per-instance routing in addition to leveraging per-client personalization to improve accuracies at each client. Further, Flow is stateless which makes it unnecessary for a client to retain its personalized state across FL rounds. This makes Flow practical for large-scale FL settings and friendly to newly joined clients. Evaluations on Stackoverflow, Reddit, and EMNIST datasets demonstrate the superiority in prediction accuracy of Flow over state-of-the-art non-personalized and only per-client personalized approaches to FL.
