DPMM-CFL: Clustered Federated Learning via Dirichlet Process Mixture Model Nonparametric Clustering
Mariona Jaramillo-Civill, Peng Wu, Pau Closas
TL;DR
The paper tackles the limitation of clustered federated learning (CFL) requiring a pre-specified number of clusters $K$ by introducing DPMM-CFL, which uses a Dirichlet Process prior to perform nonparametric clustering of clients. It jointly infers the number of clusters and client assignments while optimizing per-cluster federated objectives through an iterative, round-based process that couples federated updates with Bayesian clustering. The core contributions include (i) formulating CFL with a DP prior over cluster parameters, (ii) employing a split–merge MCMC strategy to sample client partitions under a CRP, and (iii) validating the method on Fashion-MNIST and CIFAR-10 under Dirichlet and class-split non-IID partitions, showing close alignment between inferred and ground-truth cluster counts or peak performance clusters and stable convergence. The approach enables automatic structure learning in CFL, reducing the need for hyperparameter sweeps and enhancing personalization in heterogeneous FL settings.
Abstract
Clustered Federated Learning (CFL) improves performance under non-IID client heterogeneity by clustering clients and training one model per cluster, thereby balancing between a global model and fully personalized models. However, most CFL methods require the number of clusters K to be fixed a priori, which is impractical when the latent structure is unknown. We propose DPMM-CFL, a CFL algorithm that places a Dirichlet Process (DP) prior over the distribution of cluster parameters. This enables nonparametric Bayesian inference to jointly infer both the number of clusters and client assignments, while optimizing per-cluster federated objectives. This results in a method where, at each round, federated updates and cluster inferences are coupled, as presented in this paper. The algorithm is validated on benchmark datasets under Dirichlet and class-split non-IID partitions.
