Improved fixed-parameter bounds for Min-Sum-Radii and Diameters $k$-clustering and their fair variants
Sandip Banerjee, Yair Bartal, Lee-Ad Gottlieb, Alon Hovav
TL;DR
The paper advances the algorithmic theory of MSR and MSD clustering by delivering tight fixed-parameter bounds: it shows an exact $n^{O(k)}$-time MSD algorithm (matching the known $n^{O(k)}$ bound for MSR) and ETH-based lower bounds, while providing $(1+\epsilon)$-approximation schemes in metrics of doubling dimension $d$ with run-time $O(kn) + (1/\epsilon)^{O(dk)}$ (and $\left(\alpha/\epsilon\right)^{O(dk)}$ for $\alpha$-MSR). These results extend to fairness and mergeable clustering variants, as well as to outliers, preserving the same asymptotic guarantees, with additional combinatorial factors where necessary. A decomposition-based framework underpins the approximations, leveraging nets and bounded-diameter components to achieve deterministic PTAS-like performance in low-dimension spaces. ETH-based hardness results establish tight barriers for several variants, clarifying when efficient FPT algorithms are possible. Overall, the work significantly strengthens the practical and theoretical foundations for fixed-parameter clustering in general metric spaces.
Abstract
We provide improved upper and lower bounds for the Min-Sum-Radii (MSR) and Min-Sum-Diameters (MSD) clustering problems with a bounded number of clusters $k$. In particular, we propose an exact MSD algorithm with running-time $n^{O(k)}$. We also provide $(1+ε)$ approximation algorithms for both MSR and MSD with running-times of $O(kn) +(1/ε)^{O(dk)}$ in metrics spaces of doubling dimension $d$. Our algorithms extend to $k$-center, improving upon previous results, and to $α$-MSR, where radii are raised to the $α$ power for $α>1$. For $α$-MSD we prove an exponential time ETH-based lower bound for $α>\log 3$. All algorithms can also be modified to handle outliers. Moreover, we can extend the results to variants that observe fairness constraints, as well as to the general framework of mergeable clustering, which includes many other popular clustering variants. We complement these upper bounds with ETH-based lower bounds for these problems, in particular proving that $n^{O(k)}$ time is tight for MSR and $α$-MSR even in doubling spaces, and that $2^{o(k)}$ bounds are impossible for MSD.
