On connections between k-coloring and Euclidean k-means

Enver Aman; Karthik C. S.; Sharath Punna

On connections between k-coloring and Euclidean k-means

Enver Aman, Karthik C. S., Sharath Punna

TL;DR

This work builds a bridge between graph coloring and Euclidean clustering, delivering reductions that transfer hardness and algorithmic ideas between $k$-coloring, Balanced Max-Cut, and Euclidean $k$-means (as well as $k$-min-sum). It provides a simple linear-size reduction from $k$-coloring on $d$-regular graphs to Euclidean $k$-means for any fixed $k\ge 3$, yielding an alternate NP-hardness proof for $k$-means, and extends to a structured max-cut variant to Euclidean $2$-means. On the algorithmic side, the paper presents an exact $1.7297^n\cdot \mathrm{poly}(n,d)$ time algorithm for 2-means by reducing to Weighted 2-CSP and employing fast matrix-multiplication-based CSP solvers, with a conditional improvement to $1.59^n$ under $\omega=2$. A parallel fine-grained analysis shows a computational equivalence between 2-min-sum and Balanced Max-Cut in $\ell_p$-metrics, and proves a matching $1.7297^n$-time algorithm for 2-min-sum via Max-Cut reductions. Together, these results illuminate the transfer of complexity and technique between combinatorial graph problems and geometric clustering, and open avenues for leveraging colorability tools to accelerate clustering problems. The findings have potential implications for both hardness proofs and practical algorithm design in high-dimensional clustering and related optimization tasks.

Abstract

In the Euclidean $k$-means problems we are given as input a set of $n$ points in $\mathbb{R}^d$ and the goal is to find a set of $k$ points $C\subseteq \mathbb{R}^d$, so as to minimize the sum of the squared Euclidean distances from each point in $P$ to its closest center in $C$. In this paper, we formally explore connections between the $k$-coloring problem on graphs and the Euclidean $k$-means problem. Our results are as follows: $\bullet$ For all $k\ge 3$, we provide a simple reduction from the $k$-coloring problem on regular graphs to the Euclidean $k$-means problem. Moreover, our technique extends to enable a reduction from a structured max-cut problem (which may be considered as a partial 2-coloring problem) to the Euclidean $2$-means problem. Thus, we have a simple and alternate proof of the NP-hardness of Euclidean 2-means problem. $\bullet$ In the other direction, we mimic the $O(1.7297^n)$ time algorithm of Williams [TCS'05] for the max-cut of problem on $n$ vertices to obtain an algorithm for the Euclidean 2-means problem with the same runtime, improving on the naive exhaustive search running in $2^n\cdot \text{poly}(n,d)$ time. $\bullet$ We prove similar results and connections as above for the Euclidean $k$-min-sum problem.

On connections between k-coloring and Euclidean k-means

TL;DR

This work builds a bridge between graph coloring and Euclidean clustering, delivering reductions that transfer hardness and algorithmic ideas between

-coloring, Balanced Max-Cut, and Euclidean

-means (as well as

-min-sum). It provides a simple linear-size reduction from

-coloring on

-regular graphs to Euclidean

-means for any fixed

, yielding an alternate NP-hardness proof for

-means, and extends to a structured max-cut variant to Euclidean

-means. On the algorithmic side, the paper presents an exact

time algorithm for 2-means by reducing to Weighted 2-CSP and employing fast matrix-multiplication-based CSP solvers, with a conditional improvement to

under

. A parallel fine-grained analysis shows a computational equivalence between 2-min-sum and Balanced Max-Cut in

-metrics, and proves a matching

-time algorithm for 2-min-sum via Max-Cut reductions. Together, these results illuminate the transfer of complexity and technique between combinatorial graph problems and geometric clustering, and open avenues for leveraging colorability tools to accelerate clustering problems. The findings have potential implications for both hardness proofs and practical algorithm design in high-dimensional clustering and related optimization tasks.

Abstract

In the Euclidean

-means problems we are given as input a set of

points in

and the goal is to find a set of

points

, so as to minimize the sum of the squared Euclidean distances from each point in

to its closest center in

. In this paper, we formally explore connections between the

-coloring problem on graphs and the Euclidean

-means problem. Our results are as follows:

For all

, we provide a simple reduction from the

-coloring problem on regular graphs to the Euclidean

-means problem. Moreover, our technique extends to enable a reduction from a structured max-cut problem (which may be considered as a partial 2-coloring problem) to the Euclidean

-means problem. Thus, we have a simple and alternate proof of the NP-hardness of Euclidean 2-means problem.

In the other direction, we mimic the

time algorithm of Williams [TCS'05] for the max-cut of problem on

vertices to obtain an algorithm for the Euclidean 2-means problem with the same runtime, improving on the naive exhaustive search running in

time.

We prove similar results and connections as above for the Euclidean

-min-sum problem.

Paper Structure (36 sections, 9 theorems, 57 equations)

This paper contains 36 sections, 9 theorems, 57 equations.

Introduction
Fine-Grained Complexity.
Connection to Max-Cut Problem.
Connection to $k$-Coloring Problem.
Organization of the Paper.
Preliminaries
Problem Definitions
$k$-means clustering problem.
$k$-min-sum clustering problem.
Balanced Max-Cut problem.
$k$-coloring problem (on regular graphs).
NAE-3-SAT problem.
Weighted 2-CSP.
Computational Hardness of Graph Problems
Reduction from Balanced Max-Cut to 2-means
...and 21 more sections

Key Result

Theorem 1.1

There is an exact algorithm for the 2-means problem running in time $1.7297^n\cdot \poly(n,d)$, where $n$ is the number of input pointsIn this theorem, we assume that the coordinate entries of all points in the input are integral and that the absolute value of any coordinate is bounded by $2^{o(n)}$

Theorems & Definitions (19)

Theorem 1.1
Theorem 2.1: dailey1980uniqueness
Theorem 2.2
Claim 3.1
proof
Remark 3.2
Theorem 5.1
Corollary 5.2
proof
Theorem 5.3: Theorem 6.4.1 in williams2007algorithms
...and 9 more

On connections between k-coloring and Euclidean k-means

TL;DR

Abstract

On connections between k-coloring and Euclidean k-means

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (19)