Decentralized Personalized Federated Learning for Min-Max Problems
Ekaterina Borodich, Aleksandr Beznosikov, Abdurakhmon Sadiev, Vadim Sushko, Nikolay Savelyev, Martin Takáč, Alexander Gasnikov
TL;DR
The paper tackles decentralized personalized federated learning for saddle point problems by introducing a mixing objective that couples local models through a gossip matrix $W$ and a personalization parameter $\lambda$. It develops three algorithmic strategies—Sliding Opt Comm (for small $\lambda$), Sliding Big Lambda (for large $\lambda$), and a Local Variance-Reduction method—to achieve near-optimal communication and local computation complexities in both deterministic and stochastic settings. It also establishes tight lower bounds for both communications and local oracle calls, showing the proposed methods are optimal in key regimes. Theoretical results are complemented by experiments on bilinear problems and neural networks with adversarial noise, demonstrating favorable communication efficiency and robustness, and highlighting practical applicability to decentralized federated learning with personalization.
Abstract
Personalized Federated Learning (PFL) has witnessed remarkable advancements, enabling the development of innovative machine learning applications that preserve the privacy of training data. However, existing theoretical research in this field has primarily focused on distributed optimization for minimization problems. This paper is the first to study PFL for saddle point problems encompassing a broader range of optimization problems, that require more than just solving minimization problems. In this work, we consider a recently proposed PFL setting with the mixing objective function, an approach combining the learning of a global model together with locally distributed learners. Unlike most previous work, which considered only the centralized setting, we work in a more general and decentralized setup that allows us to design and analyze more practical and federated ways to connect devices to the network. We proposed new algorithms to address this problem and provide a theoretical analysis of the smooth (strongly) convex-(strongly) concave saddle point problems in stochastic and deterministic cases. Numerical experiments for bilinear problems and neural networks with adversarial noise demonstrate the effectiveness of the proposed methods.
