Invariant Federated Learning for Edge Intelligence: Mitigating Heterogeneity and Asynchrony via Exit Strategy and Invariant Penalty
Ziruo Hao, Zhenhua Cui, Tao Yang, Bo Hu, Xiaofeng Wu, Hui Feng
TL;DR
This work tackles heterogeneity and asynchrony in edge federated learning by introducing an exit strategy for aberrant clients and a FedIPG framework that enforces parameter invariance via a penalty term grounded in parameter orthogonality. By decomposing client parameters into invariant $w^I$ and heterogeneous $w^e_i$ components and minimizing $R_i(w) + \lambda \|\langle \nabla R_i(w), w \rangle\|^2$, the approach promotes stable global learning and improved OOD generalization. The authors provide theoretical analyses on contributions, generalization bounds, and convergence, and validate the method on multiple datasets with varying scales, demonstrating enhanced ID and OOD performance and hints of causal robustness. The proposed exit mechanism also reallocates communication toward better-equipped clients, improving convergence speed without increasing communication overhead. Overall, FedIPG offers a practical, theory-backed solution for robust edge FL under heterogeneity and asynchronous environments.
Abstract
This paper provides an invariant federated learning system for resource-constrained edge intelligence. This framework can mitigate the impact of heterogeneity and asynchrony via exit strategy and invariant penalty. We introduce parameter orthogonality into edge intelligence to measure the contribution or impact of heterogeneous and asynchronous clients. It is proved in this paper that the exit of abnormal edge clients can guarantee the effect of the model on most clients. Meanwhile, to ensure the models' performance on exited abnormal clients and those who lack training resources, we propose Federated Learning with Invariant Penalty for Generalization (FedIPG) by constructing the approximate orthogonality of the invariant parameters and the heterogeneous parameters. Theoretical proof shows that FedIPG reduces the Out-Of-Distribution prediction loss without increasing the communication burden. The performance of FedIPG combined with an exit strategy is tested empirically in multiple scales using four datasets. It shows our system can enhance In-Distribution performance and outperform the state-of-the-art algorithm in Out-Of-Distribution generalization while maintaining model convergence. Additionally, the results of the visual experiment prove that FedIPG contains preliminary causality in terms of ignoring confounding features.
