GC-Fed: Gradient Centralized Federated Learning with Partial Client Participation

Jungwon Seo; Ferhat Ozgur Catak; Chunming Rong; Kibeom Hong; Minhoe Kim

GC-Fed: Gradient Centralized Federated Learning with Partial Client Participation

Jungwon Seo, Ferhat Ozgur Catak, Chunming Rong, Kibeom Hong, Minhoe Kim

TL;DR

GC-Fed introduces gradient centralization into federated learning as a reference-free means to reduce client drift under heterogeneous data and partial participation. By applying Local GC to feature extraction layers and Global GC to classifier layers, the approach centralizes updates through a shared hyperplane reference without extra communication. Theoretical analysis shows the projected-gradient updates reduce the optimality gap more efficiently than FedAvg, and extensive experiments across EMNIST, CIFAR, and TinyImageNet demonstrate substantial accuracy gains and faster convergence. This layer-aware, projection-based method provides a practical enhancement for cross-device FL, with strong empirical support and flexible hyperparameters for adapting to diverse architectures and data distributions.

Abstract

Federated Learning (FL) enables privacy-preserving multi-source information fusion (MSIF) but is challenged by client drift in highly heterogeneous data settings. Many existing drift-mitigation strategies rely on reference-based techniques--such as gradient adjustments or proximal loss--that use historical snapshots (e.g., past gradients or previous global models) as reference points. When only a subset of clients participates in each training round, these historical references may not accurately capture the overall data distribution, leading to unstable training. In contrast, our proposed Gradient Centralized Federated Learning (GC-Fed) employs a hyperplane as a historically independent reference point to guide local training and enhance inter-client alignment. GC-Fed comprises two complementary components: Local GC, which centralizes gradients during local training, and Global GC, which centralizes updates during server aggregation. In our hybrid design, Local GC is applied to feature-extraction layers to harmonize client contributions, while Global GC refines classifier layers to stabilize round-wise performance. Theoretical analysis and extensive experiments on benchmark FL tasks demonstrate that GC-Fed effectively mitigates client drift and achieves up to a 20% improvement in accuracy under heterogeneous and partial participation conditions.

GC-Fed: Gradient Centralized Federated Learning with Partial Client Participation

TL;DR

Abstract

GC-Fed: Gradient Centralized Federated Learning with Partial Client Participation

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (10)

Theorems & Definitions (6)