FedLion: Faster Adaptive Federated Optimization with Fewer Communication
Zhiwei Tang, Tsung-Hui Chang
TL;DR
FedLion addresses slow convergence and high communication costs in federated learning by adapting the centralized Lion optimizer to FL with periodic averaging. It achieves faster convergence than prior adaptive FL methods and reduces uplink data through sign-based local updates, underpinned by a new bounded heterogeneity assumption and a nonasymptotic convergence rate of $O(T^{-1/2})$ in suitable settings. Empirically, on EMNIST and CIFAR-10, FedLion outperforms FAFED and FedDA while incurring only marginal uplink overhead, with especially strong gains when gradients are dense. The approach offers practical communication savings and accelerated training in large-scale FL deployments, particularly under moderate heterogeneity and high gradient density.
Abstract
In Federated Learning (FL), a framework to train machine learning models across distributed data, well-known algorithms like FedAvg tend to have slow convergence rates, resulting in high communication costs during training. To address this challenge, we introduce FedLion, an adaptive federated optimization algorithm that seamlessly incorporates key elements from the recently proposed centralized adaptive algorithm, Lion (Chen et al. 2o23), into the FL framework. Through comprehensive evaluations on two widely adopted FL benchmarks, we demonstrate that FedLion outperforms previous state-of-the-art adaptive algorithms, including FAFED (Wu et al. 2023) and FedDA. Moreover, thanks to the use of signed gradients in local training, FedLion substantially reduces data transmission requirements during uplink communication when compared to existing adaptive algorithms, further reducing communication costs. Last but not least, this work also includes a novel theoretical analysis, showcasing that FedLion attains faster convergence rate than established FL algorithms like FedAvg.
