$k$-Center Clustering in Distributed Models
Leyla Biabani, Ami Paz
TL;DR
This work initiates the study of k-center in distributed models where the network graph defines the metric via shortest paths, examining LOCAL, CONGEST, and CLIQUE settings. It develops a spectrum of results: a simple (2+ε)k-approximation in the LOCAL model and a tight linear-time barrier for better-than-(k−1) performance, a 2-approximation in CONGEST using BFS-based greedy techniques, and a near-all-pairs-shortest-path–driven 2-approximation in the CLIQUE model with varying trade-offs dependent on distance computation accuracy. The paper also establishes strong lower bounds in CONGEST via reductions from disjointness, extended to k-center, and discusses the challenges of lower bounds in the CLIQUE model. Collectively, the results delineate the distributed complexity landscape of graph-metric k-center, highlighting the role of communication constraints and metric computation in shaping feasible approximations and runtimes.
Abstract
The $k$-center problem is a central optimization problem with numerous applications for machine learning, data mining, and communication networks. Despite extensive study in various scenarios, it surprisingly has not been thoroughly explored in the traditional distributed setting, where the communication graph of a network also defines the distance metric. We initiate the study of the $k$-center problem in a setting where the underlying metric is the graph's shortest path metric in three canonical distributed settings: the LOCAL, CONGEST, and CLIQUE models. Our results encompass constant-factor approximation algorithms and lower bounds in these models, as well as hardness results for the bi-criteria approximation setting.
