FedSOL: Stabilized Orthogonal Learning with Proximal Restrictions in Federated Learning
Gihun Lee, Minchan Jeong, Sangmook Kim, Jaehoon Oh, Se-Young Yun
TL;DR
FedSOL addresses non-IID data in Federated Learning by introducing an orthogonal learning principle that hides behind proximal restrictions. It uses a SAM-inspired adversarial perturbation of weights to compute local gradients that are effectively orthogonal to proximal gradients, thereby preserving global knowledge while enabling local learning. Theoretical analysis links FedSOL updates to proximal orthogonality and local objective equivalence, supported by extensive experiments across multiple datasets, heterogeneity levels, and model architectures, showing state-of-the-art performance and smoother loss landscapes. The method also demonstrates robustness to different proximal objectives and offers efficient partial perturbation strategies, making FedSOL a practical approach for scalable, privacy-preserving FL.
Abstract
Federated Learning (FL) aggregates locally trained models from individual clients to construct a global model. While FL enables learning a model with data privacy, it often suffers from significant performance degradation when clients have heterogeneous data distributions. This data heterogeneity causes the model to forget the global knowledge acquired from previously sampled clients after being trained on local datasets. Although the introduction of proximal objectives in local updates helps to preserve global knowledge, it can also hinder local learning by interfering with local objectives. To address this problem, we propose a novel method, Federated Stabilized Orthogonal Learning (FedSOL), which adopts an orthogonal learning strategy to balance the two conflicting objectives. FedSOL is designed to identify gradients of local objectives that are inherently orthogonal to directions affecting the proximal objective. Specifically, FedSOL targets parameter regions where learning on the local objective is minimally influenced by proximal weight perturbations. Our experiments demonstrate that FedSOL consistently achieves state-of-the-art performance across various scenarios.
