Breaching the 2 LMP Approximation Barrier for Facility Location with Applications to k-Median

Vincent Cohen-Addad; Fabrizio Grandoni; Euiwoong Lee; Chris Schwiegelshohn

Breaching the 2 LMP Approximation Barrier for Facility Location with Applications to k-Median

Vincent Cohen-Addad, Fabrizio Grandoni, Euiwoong Lee, Chris Schwiegelshohn

TL;DR

The paper addresses the fundamental clustering problems UFL and $k$-Median, targeting improvements over the classic LMP 2-approximation barrier. It advances a hybrid approach that couples Dual-Fitting (for LMP analysis) with Local Search, using a bipoint framework to translate gains in UFL into improved $k$-Median guarantees. The authors establish a near-2 LMP bound for general facility costs, and a substantially tighter bound for uniform opening costs that yields a concrete $k$-Median improvement to $2.67059$. This work demonstrates that seeding local search with LP-based solutions can surpass traditional limits, providing theoretically tighter guarantees and a practical pathway to improved clustering performance in related applications.

Abstract

The Uncapacitated Facility Location (UFL) problem is one of the most fundamental clustering problems: Given a set of clients $C$ and a set of facilities $F$ in a metric space $(C \cup F, dist)$ with facility costs $open : F \to \mathbb{R}^+$, the goal is to find a set of facilities $S \subseteq F$ to minimize the sum of the opening cost $open(S)$ and the connection cost $d(S) := \sum_{p \in C} \min_{c \in S} dist(p, c)$. An algorithm for UFL is called a Lagrangian Multiplier Preserving (LMP) $α$ approximation if it outputs a solution $S\subseteq F$ satisfying $open(S) + d(S) \leq open(S^*) + αd(S^*)$ for any $S^* \subseteq F$. The best-known LMP approximation ratio for UFL is at most $2$ by the JMS algorithm of Jain, Mahdian, and Saberi based on the Dual-Fitting technique. We present a (slightly) improved LMP approximation algorithm for UFL. This is achieved by combining the Dual-Fitting technique with Local Search, another popular technique to address clustering problems. From a conceptual viewpoint, our result gives a theoretical evidence that local search can be enhanced so as to avoid bad local optima by choosing the initial feasible solution with LP-based techniques. Using the framework of bipoint solutions, our result directly implies a (slightly) improved approximation for the $k$-Median problem from 2.6742 to 2.67059.

Breaching the 2 LMP Approximation Barrier for Facility Location with Applications to k-Median

TL;DR

The paper addresses the fundamental clustering problems UFL and

-Median, targeting improvements over the classic LMP 2-approximation barrier. It advances a hybrid approach that couples Dual-Fitting (for LMP analysis) with Local Search, using a bipoint framework to translate gains in UFL into improved

-Median guarantees. The authors establish a near-2 LMP bound for general facility costs, and a substantially tighter bound for uniform opening costs that yields a concrete

-Median improvement to

. This work demonstrates that seeding local search with LP-based solutions can surpass traditional limits, providing theoretically tighter guarantees and a practical pathway to improved clustering performance in related applications.

Abstract

The Uncapacitated Facility Location (UFL) problem is one of the most fundamental clustering problems: Given a set of clients

and a set of facilities

in a metric space

with facility costs

, the goal is to find a set of facilities

to minimize the sum of the opening cost

and the connection cost

. An algorithm for UFL is called a Lagrangian Multiplier Preserving (LMP)

approximation if it outputs a solution

satisfying

for any

. The best-known LMP approximation ratio for UFL is at most

by the JMS algorithm of Jain, Mahdian, and Saberi based on the Dual-Fitting technique. We present a (slightly) improved LMP approximation algorithm for UFL. This is achieved by combining the Dual-Fitting technique with Local Search, another popular technique to address clustering problems. From a conceptual viewpoint, our result gives a theoretical evidence that local search can be enhanced so as to avoid bad local optima by choosing the initial feasible solution with LP-based techniques. Using the framework of bipoint solutions, our result directly implies a (slightly) improved approximation for the

-Median problem from 2.6742 to 2.67059.

Paper Structure (39 sections, 32 theorems, 134 equations)

This paper contains 39 sections, 32 theorems, 134 equations.

Introduction
Our Results
Related Work
$k$-Median.
Facility Location.
Preliminaries
Facility Location and JMS Algorithm.
Bipoint solution.
Organization.
An Improved LMP Bound for Small Facility Cost
A Local-Search-Based Bound
Matched vs. Lonely facilities: a key notion.
High-level overview.
An Improved Approximation for $k$-Median
Proof of Lemma \ref{['lem:S2bound']}
...and 24 more sections

Key Result

Lemma 1

Let $OPT$ be a solution to a UFL instance and $S$ be the JMS solution on the same instance. Then

Theorems & Definitions (63)

Lemma 1
Lemma 2
proof : Proof sketch
Lemma 3
Lemma 4
proof : Proof sketch
Lemma 5
proof
Corollary 1
Theorem 1
...and 53 more

Breaching the 2 LMP Approximation Barrier for Facility Location with Applications to k-Median

TL;DR

Abstract

Breaching the 2 LMP Approximation Barrier for Facility Location with Applications to k-Median

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (63)