FedORGP: Guiding Heterogeneous Federated Learning with Orthogonality Regularization on Global Prototypes
Fucheng Guo, Zeyu Luan, Qing Li, Dan Zhao, Yong Jiang
TL;DR
FedORGP tackles the dual challenges of data and model heterogeneity in federated learning by introducing orthogonality regularization on global class prototypes to create an angular margin that complements cross-entropy loss. The method defines intra-class prototype similarity and inter-class separation via a learned prototype system and optimizes a regularized loss L_OR to maximize intra-class compactness while expanding inter-class margins, then aligns local embeddings with the global prototypes through L_k. The authors provide convergence guarantees under non-convex conditions and demonstrate through extensive experiments on CIFAR-10/100, Flowers102, and TinyImagenet that FedORGP achieves up to 10.12 percentage points higher accuracy than strong baselines, with robustness to both statistical and model heterogeneity and varying participation rates. The results indicate that angular-prototype separation integrated with CE loss offers a practical, scalable improvement for heterogeneous FL systems.
Abstract
Federated Learning (FL) has emerged as an essential framework for distributed machine learning, especially with its potential for privacy-preserving data processing. However, existing FL frameworks struggle to address statistical and model heterogeneity, which severely impacts model performance. While Heterogeneous Federated Learning (HtFL) introduces prototype-based strategies to address the challenges, current approaches face limitations in achieving optimal separation of prototypes. This paper presents FedORGP, a novel HtFL algorithm designed to improve global prototype separation through orthogonality regularization, which not only encourages intra-class prototype similarity but also significantly expands the inter-class angular separation. With the guidance of the global prototype, each client keeps its embeddings aligned with the corresponding prototype in the feature space, promoting directional independence that integrates seamlessly with the cross-entropy (CE) loss. We provide theoretical proof of FedORGP's convergence under non-convex conditions. Extensive experiments demonstrate that FedORGP outperforms seven state-of-the-art baselines, achieving up to 10.12\% accuracy improvement in scenarios where statistical and model heterogeneity coexist.
