Table of Contents
Fetching ...

An improved approximation algorithm for k-Median

Neal E. Young

TL;DR

The paper tackles the k-Median problem in a bicriteria setting, aiming to output a center set of size at most αk with cost no larger than the best size-k solution. It introduces a capped-cost function and a two-phase approach, first reducing the capped cost to below 1 and then polishing with a small number of extra centers to achieve a cost bound, all without solving the standard LP. The main theoretical contribution is a poly-time α_{kn}-approximation with α_{kn} ≤ 1 + 2 ln(n/k) (and ≤ 2H_Δ), matching Set Cover bounds within a factor of two, and a fast LP-free variant that runs in O(k m log(n/k) log m) time. The work also provides a dual-interpretation via implicit LP dual solutions and discusses comparisons to prior bicriteria methods and open problems for tightening the bounds and extending to related problems. Overall, the results advance efficient, near-optimal bicriteria guarantees for k-Median in non-metric settings with practical implications for large-scale clustering and facility location-like problems.

Abstract

We give a polynomial-time approximation algorithm for the (not necessarily metric) $k$-Median problem. The algorithm is an $α$-size-approximation algorithm for $α< 1 + 2 \ln(n/k)$. That is, it guarantees a solution having size at most $α\times k$, and cost at most the cost of any size-$k$ solution. This is the first polynomial-time approximation algorithm to match the well-known bounds of $H_Δ$ and $1 + \ln(n/k)$ for unweighted Set Cover (a special case) within a constant factor. It matches these bounds within a factor of 2. The algorithm runs in time $O(k m \log(n/k) \log m)$, where $n$ is the number of customers and $m$ is the instance size.

An improved approximation algorithm for k-Median

TL;DR

The paper tackles the k-Median problem in a bicriteria setting, aiming to output a center set of size at most αk with cost no larger than the best size-k solution. It introduces a capped-cost function and a two-phase approach, first reducing the capped cost to below 1 and then polishing with a small number of extra centers to achieve a cost bound, all without solving the standard LP. The main theoretical contribution is a poly-time α_{kn}-approximation with α_{kn} ≤ 1 + 2 ln(n/k) (and ≤ 2H_Δ), matching Set Cover bounds within a factor of two, and a fast LP-free variant that runs in O(k m log(n/k) log m) time. The work also provides a dual-interpretation via implicit LP dual solutions and discusses comparisons to prior bicriteria methods and open problems for tightening the bounds and extending to related problems. Overall, the results advance efficient, near-optimal bicriteria guarantees for k-Median in non-metric settings with practical implications for large-scale clustering and facility location-like problems.

Abstract

We give a polynomial-time approximation algorithm for the (not necessarily metric) -Median problem. The algorithm is an -size-approximation algorithm for . That is, it guarantees a solution having size at most , and cost at most the cost of any size- solution. This is the first polynomial-time approximation algorithm to match the well-known bounds of and for unweighted Set Cover (a special case) within a constant factor. It matches these bounds within a factor of 2. The algorithm runs in time , where is the number of customers and is the instance size.

Paper Structure

This paper contains 12 sections, 9 theorems, 18 equations, 3 figures, 1 table.

Key Result

Lemma 1

$\alpha_{kn} < 2\ln(n/k) + 2- 2\ln 2 + 1/(2k) + 1/(4k^2) < 1 + 2\ln (n/k) < 2H_{\Delta}$

Figures (3)

  • Figure 1: The standard $k$-Median linear-program relaxation and its dual.
  • Figure 1: Faster replacement for first phase.
  • Figure 1: Rounding to a partial assignment by sampling.

Theorems & Definitions (18)

  • Lemma 1
  • Definition 2: capped cost
  • Lemma 3
  • Theorem 1: first phase
  • Proof 1
  • Theorem 2: second phase
  • Corollary 3
  • Proof 2: Proof of \ref{['thm: second phase']}
  • Theorem 1
  • Corollary 2
  • ...and 8 more