FedP3: Federated Personalized and Privacy-friendly Network Pruning under Model Heterogeneity
Kai Yi, Nidham Gazagnadou, Peter Richtárik, Lingjuan Lyu
TL;DR
FedP3 addresses federated learning under pronounced data and model heterogeneity by integrating both global and local pruning with layer-wise, privacy-friendly communication. It introduces per-client pruning and aggregation schemes, and a local differential privacy variant (0.90LDP-FedP3), supported by convergence analyses showing favorable communication costs relative to unpruned baselines. The framework is validated on CIFAR-10/100, EMNIST-L, and FashionMNIST, with ResNet18 experiments highlighting scalable applicability to larger architectures. Key results include substantial reductions in communication (up to 60% in some setups) with minimal accuracy loss under non-IID distributions, and robust performance across various pruning strategies and aggregation methods. Overall, FedP3 offers a practical, theory-backed path for personalized, privacy-conscious model pruning in heterogeneous FL settings, with implications for efficient deployment in large-scale models and LLM-style architectures.
Abstract
The interest in federated learning has surged in recent research due to its unique ability to train a global model using privacy-secured information held locally on each client. This paper pays particular attention to the issue of client-side model heterogeneity, a pervasive challenge in the practical implementation of FL that escalates its complexity. Assuming a scenario where each client possesses varied memory storage, processing capabilities and network bandwidth - a phenomenon referred to as system heterogeneity - there is a pressing need to customize a unique model for each client. In response to this, we present an effective and adaptable federated framework FedP3, representing Federated Personalized and Privacy-friendly network Pruning, tailored for model heterogeneity scenarios. Our proposed methodology can incorporate and adapt well-established techniques to its specific instances. We offer a theoretical interpretation of FedP3 and its locally differential-private variant, DP-FedP3, and theoretically validate their efficiencies.
