SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD Learning
Paul Mangold, Sergey Samsonov, Safwan Labbi, Ilya Levin, Reda Alami, Alexey Naumov, Eric Moulines
TL;DR
The paper analyzes how heterogeneity affects FedLSA and introduces SCAFFLSA, a bias-corrected variant using control variates to reduce communication. It provides a refined stochastic expansion that separates transient dynamics, heterogeneity bias, and fluctuations, and proves that SCAFFLSA can attain logarithmic communication complexity while preserving linear speed-up in sample complexity. The authors extend the framework to both i.i.d. and Markovian observation models and instantiate the results for federated TD learning with linear function approximation. Empirical results on Garnet environments corroborate the theory, showing that SCAFFLSA eliminates bias in heterogeneous settings and achieves faster convergence with fewer communications.
Abstract
In this paper, we analyze the sample and communication complexity of the federated linear stochastic approximation (FedLSA) algorithm. We explicitly quantify the effects of local training with agent heterogeneity. We show that the communication complexity of FedLSA scales polynomially with the inverse of the desired accuracy $ε$. To overcome this, we propose SCAFFLSA a new variant of FedLSA that uses control variates to correct for client drift, and establish its sample and communication complexities. We show that for statistically heterogeneous agents, its communication complexity scales logarithmically with the desired accuracy, similar to Scaffnew. An important finding is that, compared to the existing results for Scaffnew, the sample complexity scales with the inverse of the number of agents, a property referred to as linear speed-up. Achieving this linear speed-up requires completely new theoretical arguments. We apply the proposed method to federated temporal difference learning with linear function approximation and analyze the corresponding complexity improvements.
