Quantum Clustering with k-Means: a Hybrid Approach

Alessandro Poggiali; Alessandro Berti; Anna Bernasconi; Gianna M. Del Corso; Riccardo Guidotti

TL;DR

This work aims to speed up the cluster assignment step of $k$-Means with three hybrid quantum algorithms that perform distance computations in parallel. It introduces $q_{1:1}$-$k$-Means, $q_{1:k}$-$k$-Means, and $q_{M:k}$-$k$-Means, which rely on quantum distance circuits, Inverse Stereographic Projection (ISP) for data normalization, and FF-QRAM data loading, and it analyzes their post-selection and shot requirements. Empirical results on synthetic and real datasets show that, given sufficient shots, the quantum variants achieve clustering quality comparable to $\delta$-$k$-Means and classical $k$-Means, though practical benefits are constrained by post-selection costs and data-loading overhead. Real hardware experiments on tiny instances confirm feasibility but reveal considerable noise and overhead, indicating that substantial advances in quantum hardware and data-loading techniques are needed before quantum clustering offers a practical advantage.
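As an illustration of the ISP normalization mentioned above, here is a minimal sketch of the textbook inverse stereographic projection, which maps a point of $\mathbb{R}^N$ onto the unit sphere in $\mathbb{R}^{N+1}$. The function name and the choice of projection pole are assumptions made for this example; the paper's own convention may differ.

```python
import numpy as np

def inverse_stereographic_projection(X):
    """Map each row of X (shape M x N) onto the unit sphere in R^(N+1).

    Hypothetical helper using the standard formula
        x -> (2x / (|x|^2 + 1), (|x|^2 - 1) / (|x|^2 + 1)),
    not necessarily the exact convention used in the paper.
    """
    X = np.asarray(X, dtype=float)
    sq_norm = np.sum(X ** 2, axis=1, keepdims=True)  # |x|^2 for each record
    denom = sq_norm + 1.0
    return np.hstack([2.0 * X / denom, (sq_norm - 1.0) / denom])

# Every projected record has unit norm, so it can be amplitude-encoded
# without a separate per-record rescaling step.
points = np.array([[0.0, 0.0], [1.0, 0.0], [3.0, 4.0]])
projected = inverse_stereographic_projection(points)
norms = np.linalg.norm(projected, axis=1)
```

With this formula the origin lands on the pole $(0, \dots, 0, -1)$ and points with $\|x\| = 1$ land on the equator of the sphere.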

Abstract

Quantum computing is a promising paradigm based on quantum theory for performing fast computations. Quantum algorithms are expected to surpass their classical counterparts in terms of computational complexity for certain tasks, including machine learning. In this paper, we design, implement, and evaluate three hybrid quantum k-Means algorithms that exploit different degrees of parallelism. Each algorithm incrementally leverages quantum parallelism to reduce the complexity of the cluster assignment step, down to a constant cost. In particular, we exploit quantum phenomena to speed up the computation of distances. The core idea is that the distances between records and centroids can be computed simultaneously, thus saving time, especially for large datasets. We show that our hybrid quantum k-Means algorithms can be more efficient than the classical version while still obtaining comparable clustering results.
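For reference, the classical cluster assignment step that the hybrid algorithms target can be sketched as follows. This is a plain NumPy baseline, not the quantum procedure from the paper, and the function names are hypothetical. It computes all $M \times k$ record-to-centroid distances explicitly, which is the work the quantum variants aim to parallelize.

```python
import numpy as np

def assign_clusters(records, centroids):
    """Classical assignment step: label each record with its nearest centroid."""
    records = np.asarray(records, dtype=float)      # shape (M, N)
    centroids = np.asarray(centroids, dtype=float)  # shape (k, N)
    # Pairwise squared Euclidean distances, shape (M, k).
    diffs = records[:, None, :] - centroids[None, :, :]
    sq_dists = np.sum(diffs ** 2, axis=2)
    return np.argmin(sq_dists, axis=1)

def update_centroids(records, labels, centroids):
    """Classical update step: move each centroid to the mean of its members."""
    records = np.asarray(records, dtype=float)
    new_centroids = np.array(centroids, dtype=float)  # copy of the input
    for j in range(len(new_centroids)):
        members = records[labels == j]
        if len(members) > 0:  # leave an empty cluster's centroid unchanged
            new_centroids[j] = members.mean(axis=0)
    return new_centroids

# One k-Means iteration on a toy dataset with k = 2.
data = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]])
start = np.array([[0.0, 0.0], [10.0, 10.0]])
labels = assign_clusters(data, start)
moved = update_centroids(data, labels, start)
```

In a hybrid scheme of the kind the abstract describes, only the distance computation inside the assignment step would be delegated to a quantum circuit, while the centroid update stays classical.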

Paper Structure (17 sections, 8 equations, 9 figures, 7 tables, 5 algorithms)

Introduction
Related Works
Setting the Stage
k-Means and $\delta$-k-Means
Quantum Distance Estimate
Inverse Stereographic Projection
The q-k-Means Clustering Algorithms
One Record vs One Centroid: $q_{1:1}$-$k$-Means
One Record vs $k$ Centroids: $q_{1:k}$-$k$-Means
$M$ Records vs $k$ Centroids: $q_{M:k}$-$k$-Means
Reducing the Number of Shots
Experiments
Experimental Setting
Results on Synthetic Datasets
Results on Real Datasets
...and 2 more sections

Figures (9)

Figure 1: Quantum circuit for the Euclidean distance
Figure 2: Original data (a), normalized (b), normalized with ISP (c).
Figure 3: QC1: quantum Euclidean distance with FF-QRAM.
Figure 4: QC2: quantum kNN classifier with FF-QRAM.
Figure 5: QC3: quantum circuit for cluster assignment.
...and 4 more figures
