Hybrid k-Clustering: Blending k-Median and k-Center
Fedor V. Fomin, Petr A. Golovach, Tanmay Inamdar, Saket Saurabh, Meirav Zehavi
TL;DR
The paper introduces Hybrid $k$-Clustering, a problem that interpolates between $k$-Center and $k$-Median by selecting $k$ balls of radius $r$ and minimizing the sum of distances outside these balls. It presents a randomized $(1+\varepsilon,1+\varepsilon)$-bicriteria approximation with radius $(1+\varepsilon)r$ and objective cost within $(1+\varepsilon)$ of ${\sf OPT}_r$, running in time $2^{(kd/\varepsilon)^{O(1)}}n^{O(1)}$, supported by preprocessing and a recursive sampling strategy that blends $k$-Center and $k$-Median techniques. The approach combines grid-based discretization, component-wise analysis, and a sampling-driven recursion inspired by Kumar, Sabharwal, and Sen to handle mixed cluster types, achieving strong theoretical guarantees under Euclidean assumptions. The work highlights theoretical optimality relative to known lower bounds and points to future directions in dimensionality reduction, coresets, and metric-space generalizations to broaden applicability and efficiency.
Abstract
We propose a novel clustering model encompassing two well-known clustering models: k-center clustering and k-median clustering. In the Hybrid k-Clusetring problem, given a set P of points in R^d, an integer k, and a non-negative real r, our objective is to position k closed balls of radius r to minimize the sum of distances from points not covered by the balls to their closest balls. Equivalently, we seek an optimal L_1-fitting of a union of k balls of radius r to a set of points in the Euclidean space. When r=0, this corresponds to k-median; when the minimum sum is zero, indicating complete coverage of all points, it is k-center. Our primary result is a bicriteria approximation algorithm that, for a given ε>0, produces a hybrid k-clustering with balls of radius (1+ε)r. This algorithm achieves a cost at most 1+εof the optimum, and it operates in time 2^{(kd/ε)^{O(1)}} n^{O(1)}. Notably, considering the established lower bounds on k-center and k-median, our bicriteria approximation stands as the best possible result for Hybrid k-Clusetring.
