Personalized Federated Learning with Bidirectional Communication Compression via One-Bit Random Sketching
Jiacheng Cheng, Xu Zhang, Guanghui Qiu, Yifang Zhang, Yinchuan Li, Kaiyuan Feng
TL;DR
This work tackles the dual challenge of personalization and extreme communication efficiency in federated learning by introducing pFed1BS, a framework that uses one-bit random sketches for both uplink updates and downlink consensus. It formulates a principled bilevel optimization with a sign-based regularizer and employs a fast structured projection via the Fast Hadamard Transform to enable scalable, bidirectional one-bit communication. Theoretical analysis shows convergence to a stationary neighborhood of a global potential under standard FL assumptions, while empirical results on MNIST, FMNIST, CIFAR-10/100, and SVHN demonstrate substantial communication cost reductions (over 96%) with competitive accuracy compared to state-of-the-art baselines. The approach significantly advances practical personalized FL in bandwidth-limited environments, offering a robust balance between personalization quality and communication efficiency.
Abstract
Federated Learning (FL) enables collaborative training across decentralized data, but faces key challenges of bidirectional communication overhead and client-side data heterogeneity. To address communication costs while embracing data heterogeneity, we propose pFed1BS, a novel personalized federated learning framework that achieves extreme communication compression through one-bit random sketching. In personalized FL, the goal shifts from training a single global model to creating tailored models for each client. In our framework, clients transmit highly compressed one-bit sketches, and the server aggregates and broadcasts a global one-bit consensus. To enable effective personalization, we introduce a sign-based regularizer that guides local models to align with the global consensus while preserving local data characteristics. To mitigate the computational burden of random sketching, we employ the Fast Hadamard Transform for efficient projection. Theoretical analysis guarantees that our algorithm converges to a stationary neighborhood of the global potential function. Numerical simulations demonstrate that pFed1BS substantially reduces communication costs while achieving competitive performance compared to advanced communication-efficient FL algorithms.
