Adaptive Federated Learning via Dynamical System Model
Aayushya Agarwal, Larry Pileggi, Gauri Joshi
TL;DR
This work addresses the sensitivity to hyperparameters in heterogeneous federated learning by reframing FL as a continuous-time dynamical system with a central agent and client states connected by coupling flows. It introduces Adaptive FedECADO, which jointly adapts client learning dynamics and server aggregation through a circuit-inspired design controlled by a single tolerance $\gamma$, leveraging Thevenin-impedance-based critical damping to set the momentum $L_i$ and Backward-Euler-based adaptive time stepping for stability. The method ensures stable convergence despite non-IID data and variable client compute, with time-step adaptation driven by the local truncation error (LTE) guiding both client and server updates. Empirical results show robust performance across datasets and models, outperforming state-of-the-art adaptive methods that require careful hyperparameter tuning, while incurring modest overhead due to the dynamical-system formulation.
Abstract
Hyperparameter selection is critical for stable and efficient convergence of heterogeneous federated learning, where clients differ in computational capabilities and data distributions are non-IID. Tuning hyperparameters is a manual and computationally expensive process, as the hyperparameter space grows combinatorially with the number of clients. To address this, we introduce an end-to-end adaptive federated learning method in which both the clients and the central agent adaptively select their local learning rates and momentum parameters. Our approach models federated learning as a dynamical system, allowing us to draw on principles from numerical simulation and physical design. Through this perspective, selecting momentum parameters equates to critically damping the system for fast, stable convergence, while learning rates for the clients and the central server are adaptively selected to satisfy accuracy properties from numerical simulation. The result is an adaptive, momentum-based federated learning algorithm in which the learning rates for clients and server are dynamically adjusted and controlled by a single, global hyperparameter. By designing a fully integrated solution for both adaptive client updates and central agent aggregation, our method is capable of handling key challenges of heterogeneous federated learning, including objective inconsistency and client drift. Importantly, our approach achieves fast convergence while being insensitive to the choice of the global hyperparameter, making it well-suited for rapid prototyping and scalable deployment. Compared to state-of-the-art adaptive methods, our framework is shown to deliver superior convergence for heterogeneous federated learning while eliminating the need to tune hyperparameters for both client and server updates.
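To make the idea of learning rates chosen "to satisfy accuracy properties from numerical simulation" concrete, the following is a minimal single-node sketch, not the paper's algorithm. It integrates the gradient flow $\dot{x} = -\nabla f(x)$ of a hypothetical quadratic objective with Backward Euler and picks each time step so that an estimate of the local truncation error stays below a tolerance $\gamma$. The function name, the quadratic objective, the error estimate $\tfrac{h}{2}\lVert \dot{x}_{k+1} - \dot{x}_k \rVert$, and the halve/grow step rule are all illustrative assumptions. Client coupling, momentum, and server aggregation are omitted.

```python
import numpy as np

def adaptive_backward_euler(A, b, x0, gamma=1e-2, h0=0.1, tol=1e-8, max_iters=10_000):
    """Illustrative sketch (assumed, not the paper's method).

    Integrates dx/dt = -(A x - b), the gradient flow of
    f(x) = 0.5 x^T A x - b^T x with A symmetric positive definite,
    using Backward Euler with a step size controlled by gamma.
    Returns the final state and the number of accepted steps.
    """
    x = np.asarray(x0, dtype=float)
    identity = np.eye(x.size)
    h = h0
    accepted = 0
    for _ in range(max_iters):
        grad = A @ x - b
        if np.linalg.norm(grad) < tol:
            break  # steady state of the flow = minimizer of f
        dx_old = -grad
        # Backward Euler on a linear flow: (I + h A) x_new = x + h b
        x_new = np.linalg.solve(identity + h * A, x + h * b)
        dx_new = -(A @ x_new - b)
        # Assumed local truncation error estimate for Backward Euler
        lte = 0.5 * h * np.linalg.norm(dx_new - dx_old)
        if lte > gamma:
            h *= 0.5  # reject the step and retry with a smaller one
            continue
        x = x_new     # accept the step
        accepted += 1
        h *= 1.5      # try a larger step next time
    return x, accepted
```

In this sketch the step size plays the role of the learning rate: it shrinks while the trajectory changes quickly and grows as the state approaches steady state, with $\gamma$ as the only accuracy knob.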
