Fully Dynamic $k$-Clustering with Fast Update Time and Small Recourse

Sayan Bhattacharya; Martín Costa; Naveen Garg; Silvio Lattanzi; Nikos Parotsidis

Fully Dynamic $k$-Clustering with Fast Update Time and Small Recourse

Sayan Bhattacharya, Martín Costa, Naveen Garg, Silvio Lattanzi, Nikos Parotsidis

TL;DR

This work advances fully dynamic clustering by integrating two classical techniques—local search and Lagrangian relaxation—into a unified framework that achieves near-optimal trade-offs among approximation ratio, recourse, and update time for dynamic k-median, k-means, and k-center. The authors introduce a hierarchical, lazy-rebuild scheme and a randomized local-search protocol to maintain near-constant-factor approximations while bounding recourse to tilde-polynomial in k; they further adapt a dynamic LMP/UFL approach to maintain estimates of the optimal value with constant-factor guarantees and near-linear update time. A central contribution is a dynamic, sparse representation of the input (via sparsification) that preserves solution quality while reducing problem size to ~O(k), enabling efficient implementations. The results yield state-of-the-art guarantees for fully dynamic k-clustering, including an O(1/ε)-approximation with update-time at most tilde O(k^{1+ε}) and recourse tilde O(k^{ε}) for k-median, with analogous bounds for k-means and k-center in general metrics, and a value-version achieving O(1) approximation with tilde O(k) updates. Overall, the framework offers practical dynamic algorithms with strong theoretical guarantees across a broad class of clustering objectives, and introduces versatile techniques (randomized local search, hierarchy-based consistency, dynamic sparsification) that are likely to influence future dynamic clustering work.

Abstract

In the dynamic metric $k$-median problem, we wish to maintain a set of $k$ centers $S \subseteq V$ in an input metric space $(V, d)$ that gets updated via point insertions/deletions, so as to minimize the objective $\sum_{x \in V} \min_{y \in S} d(x, y)$. The quality of a dynamic algorithm is measured in terms of its approximation ratio, "recourse" (the number of changes in $S$ per update) and "update time" (the time it takes to handle an update). The ultimate goal in this line of research is to obtain a dynamic $O(1)$ approximation algorithm with $\tilde{O}(1)$ recourse and $\tilde{O}(k)$ update time. Dynamic $k$-median is a canonical example of a class of problems known as dynamic $k$-clustering, that has received significant attention in recent years. To the best of our knowledge, however, previous papers either attempt to minimize the algorithm's recourse while ignoring its update time, or minimize the algorithm's update time while ignoring its recourse. For dynamic $k$-median, we come arbitrarily close to resolving the main open question on this topic, with the following results. (I) We develop a new framework of randomized local search that is suitable for adaptation in a dynamic setting. For every $ε> 0$, this gives us a dynamic $k$-median algorithm with $O(1/ε)$ approximation ratio, $\tilde{O}(k^ε)$ recourse and $\tilde{O}(k^{1+ε})$ update time. This framework also generalizes to dynamic $k$-clustering with $\ell^p$-norm objectives, giving similar bounds for the dynamic $k$-means and a new trade-off for dynamic $k$-center. (II) If it suffices to maintain only an estimate of the value of the optimal $k$-median objective, then we obtain a $O(1)$ approximation algorithm with $\tilde{O}(k)$ update time. We achieve this result via adapting the Lagrangian Relaxation framework to the dynamic setting.

Fully Dynamic $k$-Clustering with Fast Update Time and Small Recourse

TL;DR

Abstract

In the dynamic metric

-median problem, we wish to maintain a set of

centers

in an input metric space

that gets updated via point insertions/deletions, so as to minimize the objective

. The quality of a dynamic algorithm is measured in terms of its approximation ratio, "recourse" (the number of changes in

per update) and "update time" (the time it takes to handle an update). The ultimate goal in this line of research is to obtain a dynamic

approximation algorithm with

recourse and

update time. Dynamic

-median is a canonical example of a class of problems known as dynamic

-clustering, that has received significant attention in recent years. To the best of our knowledge, however, previous papers either attempt to minimize the algorithm's recourse while ignoring its update time, or minimize the algorithm's update time while ignoring its recourse. For dynamic

-median, we come arbitrarily close to resolving the main open question on this topic, with the following results. (I) We develop a new framework of randomized local search that is suitable for adaptation in a dynamic setting. For every

, this gives us a dynamic

-median algorithm with

approximation ratio,

recourse and

update time. This framework also generalizes to dynamic

-clustering with

-norm objectives, giving similar bounds for the dynamic

-means and a new trade-off for dynamic

-center. (II) If it suffices to maintain only an estimate of the value of the optimal

-median objective, then we obtain a

approximation algorithm with

update time. We achieve this result via adapting the Lagrangian Relaxation framework to the dynamic setting.

Paper Structure (75 sections, 72 theorems, 154 equations, 2 tables, 11 algorithms)

This paper contains 75 sections, 72 theorems, 154 equations, 2 tables, 11 algorithms.

Extended Abstract
Introduction
Our Contributions
Our Techniques
Preliminaries
Organization
Our Algorithm for Dynamic $k$-Median (Theorem \ref{['thm:solution:main']})
Basic Building Blocks
A Dynamic Algorithm with Small Recourse
Towards Simultaneously Achieving Good Recourse and Update Time
Approximation Guarantee of Local Search for Restricted $k$-Median
Achieving Fast Update Time via Randomized Local Search
Getting Rid of the Assumption Regarding Improper Dynamic $k$-Median
Our Algorithm for Dynamic $k$-Median Value (Theorem \ref{['thm:value:main']})
Our Algorithm for Dynamic $k$-Clustering
...and 60 more sections

Key Result

Theorem 1.1

For any $\epsilon > 0$, there is a dynamic $k$-clustering algorithm for the $\ell^p$-norm objective, with $O(p/\epsilon)$ approximation ratio, $\tilde{O}(k^{1 + \epsilon}p/\epsilon^3)$ update time and $\tilde{O}(k^\epsilon/\epsilon)$ recourse.

Theorems & Definitions (132)

Theorem 1.1
Corollary 1.2
Corollary 1.3
Corollary 1.4
Theorem 1.5
Definition 2.1: Informal
Theorem 2.2
Theorem 2.3
Lemma 3.1: Projection Lemma
proof
...and 122 more

Fully Dynamic $k$-Clustering with Fast Update Time and Small Recourse

TL;DR

Abstract

Fully Dynamic $k$-Clustering with Fast Update Time and Small Recourse

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (132)