Multilayer Correlation Clustering

Atsushi Miyauchi; Florian Adriaens; Francesco Bonchi; Nikolaj Tatti

Multilayer Correlation Clustering

Atsushi Miyauchi, Florian Adriaens, Francesco Bonchi, Nikolaj Tatti

TL;DR

This work introduces Multilayer Correlation Clustering, which extends Correlation Clustering to multiple layers by minimizing the $\ell_p$-norm of layer-wise disagreements. It develops an $O(L\log n)$-approximation via a region-growing technique on a convex-relaxation-derived pseudometric, and studies a probability-constrained variant with two constant-factor algorithms, including a 4-approximation. Theoretical guarantees are complemented by extensive experiments on real multilayer networks, showing practical effectiveness and favorable comparisons to baselines. The results advance multilayer network analysis by providing scalable, rigorous clustering that respects multiple, potentially uncertain similarity views. The work also bridges clustering, metric-embedding reductions, and fairness considerations in a unified multilayer framework.

Abstract

In this paper, we establish Multilayer Correlation Clustering, a novel generalization of Correlation Clustering (Bansal et al., FOCS '02) to the multilayer setting. In this model, we are given a series of inputs of Correlation Clustering (called layers) over the common set $V$. The goal is then to find a clustering of $V$ that minimizes the $\ell_p$-norm ($p\geq 1$) of the disagreements vector, which is defined as the vector (with dimension equal to the number of layers), each element of which represents the disagreements of the clustering on the corresponding layer. For this generalization, we first design an $O(L\log n)$-approximation algorithm, where $L$ is the number of layers, based on the well-known region growing technique. We then study an important special case of our problem, namely the problem with the probability constraint. For this case, we first give an $(α+2)$-approximation algorithm, where $α$ is any possible approximation ratio for the single-layer counterpart. For instance, we can take $α=2.5$ in general (Ailon et al., JACM '08) and $α=1.73+ε$ for the unweighted case (Cohen-Addad et al., FOCS '23). Furthermore, we design a $4$-approximation algorithm, which improves the above approximation ratio of $α+2=4.5$ for the general probability-constraint case. Computational experiments using real-world datasets demonstrate the effectiveness of our proposed algorithms.

Multilayer Correlation Clustering

TL;DR

This work introduces Multilayer Correlation Clustering, which extends Correlation Clustering to multiple layers by minimizing the

-norm of layer-wise disagreements. It develops an

-approximation via a region-growing technique on a convex-relaxation-derived pseudometric, and studies a probability-constrained variant with two constant-factor algorithms, including a 4-approximation. Theoretical guarantees are complemented by extensive experiments on real multilayer networks, showing practical effectiveness and favorable comparisons to baselines. The results advance multilayer network analysis by providing scalable, rigorous clustering that respects multiple, potentially uncertain similarity views. The work also bridges clustering, metric-embedding reductions, and fairness considerations in a unified multilayer framework.

Abstract

. The goal is then to find a clustering of

that minimizes the

-norm (

) of the disagreements vector, which is defined as the vector (with dimension equal to the number of layers), each element of which represents the disagreements of the clustering on the corresponding layer. For this generalization, we first design an

-approximation algorithm, where

is the number of layers, based on the well-known region growing technique. We then study an important special case of our problem, namely the problem with the probability constraint. For this case, we first give an

-approximation algorithm, where

is any possible approximation ratio for the single-layer counterpart. For instance, we can take

in general (Ailon et al., JACM '08) and

for the unweighted case (Cohen-Addad et al., FOCS '23). Furthermore, we design a

-approximation algorithm, which improves the above approximation ratio of

for the general probability-constraint case. Computational experiments using real-world datasets demonstrate the effectiveness of our proposed algorithms.

Paper Structure (16 sections, 8 theorems, 38 equations, 3 tables, 3 algorithms)

This paper contains 16 sections, 8 theorems, 38 equations, 3 tables, 3 algorithms.

Introduction
Our contribution
Related Work
Problem Formulation
Algorithm for Problem \ref{['prob:general']}
The proposed algorithm
Analysis of Algorithm \ref{['alg:general']}
Algorithms for Problem \ref{['prob:general']} with Probability Constraint
The $(\alpha+2)$-approximation algorithm
Analysis of Algorithm \ref{['alg:general_metric']}
The 4-approximation algorithm
Analysis of Algorithm \ref{['alg:threshold']}
Experimental Evaluation
Experimental setup
Results
...and 1 more sections

Key Result

Lemma 1

In Algorithm alg:general, for any $t=1,\dots, |\mathcal{B}|$, it holds that and moreover, $B_{V^{(t)}}(i^{(t)},r^*_{(t)})$ can be computed in polynomial time.

Theorems & Definitions (16)

Lemma 1
proof
Theorem 1
proof
Lemma 2
proof
Theorem 2
proof
Corollary 1
proof
...and 6 more

Multilayer Correlation Clustering

TL;DR

Abstract

Multilayer Correlation Clustering

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (16)