AMSFL: Adaptive Multi-Step Federated Learning via Gradient Difference-Based Error Modeling
Ganglou Xu
TL;DR
Federated learning faces a trade-off between communication efficiency and model accuracy, especially under heterogeneous client data. The paper proposes Gradient Difference Approximation (GDA), a first-order method that estimates local update drift without computing Hessians, and embeds it into Adaptive Multi-Step Federated Learning (AMSFL) to dynamically choose the number of local steps under resource budgets. Theoretical analysis provides an error propagation framework, convergence guarantees, and a greedy time-budgeted step-allocation algorithm. Empirical results on NSL-KDD show AMSFL achieves higher global accuracy (around $0.9023$) and faster convergence (about $49$ seconds) with significantly reduced per-round time compared to baselines, validating the practical value of GDA-based adaptive scheduling in resource-constrained FL.
Abstract
Federated learning faces critical challenges in balancing communication efficiency and model accuracy. One key issue lies in the approximation of update errors without incurring high computational costs. In this paper, we propose a lightweight yet effective method called Gradient Difference Approximation (GDA), which leverages first-order information to estimate local error trends without computing the full Hessian matrix. The proposed method forms a key component of the Adaptive Multi-Step Federated Learning (AMSFL) framework and provides a unified error modeling strategy for large-scale multi-step adaptive training environments.
