Table of Contents
Fetching ...

Hierarchical Correlation Clustering and Tree Preserving Embedding

Morteza Haghir Chehreghani, Mostafa Haghir Chehreghani

TL;DR

This work addresses clustering with signed pairwise dissimilarities by introducing Hierarchical Correlation Clustering (HCC), which yields multi-level clusters within an agglomerative framework. It then develops two representation-learning routes: a tree-preserving embedding of HCC dendrograms to produce vector features, and the use of minimax dissimilarities extended to correlation clustering to capture transitive, elongated structures with reduced computational complexity. Key contributions include the definitional groundwork for HCC, a level-based ultrametric distance enabling MDS-based embeddings, theoretical results on minimax-based clustering with shift invariance, and extensive experiments showing robustness to noise and superior downstream performance (e.g., with GMM) across UCI datasets, Fashion-MNIST, and other corpora. Overall, the approach provides practical tools for unsupervised learning with signed similarities, delivering both hierarchical clustering capabilities and meaningful feature representations that improve downstream clustering performance and interpretability.

Abstract

We propose a hierarchical correlation clustering method that extends the well-known correlation clustering to produce hierarchical clusters applicable to both positive and negative pairwise dissimilarities. Then, in the following, we study unsupervised representation learning with such hierarchical correlation clustering. For this purpose, we first investigate embedding the respective hierarchy to be used for tree preserving embedding and feature extraction. Thereafter, we study the extension of minimax distance measures to correlation clustering, as another representation learning paradigm. Finally, we demonstrate the performance of our methods on several datasets.

Hierarchical Correlation Clustering and Tree Preserving Embedding

TL;DR

This work addresses clustering with signed pairwise dissimilarities by introducing Hierarchical Correlation Clustering (HCC), which yields multi-level clusters within an agglomerative framework. It then develops two representation-learning routes: a tree-preserving embedding of HCC dendrograms to produce vector features, and the use of minimax dissimilarities extended to correlation clustering to capture transitive, elongated structures with reduced computational complexity. Key contributions include the definitional groundwork for HCC, a level-based ultrametric distance enabling MDS-based embeddings, theoretical results on minimax-based clustering with shift invariance, and extensive experiments showing robustness to noise and superior downstream performance (e.g., with GMM) across UCI datasets, Fashion-MNIST, and other corpora. Overall, the approach provides practical tools for unsupervised learning with signed similarities, delivering both hierarchical clustering capabilities and meaningful feature representations that improve downstream clustering performance and interpretability.

Abstract

We propose a hierarchical correlation clustering method that extends the well-known correlation clustering to produce hierarchical clusters applicable to both positive and negative pairwise dissimilarities. Then, in the following, we study unsupervised representation learning with such hierarchical correlation clustering. For this purpose, we first investigate embedding the respective hierarchy to be used for tree preserving embedding and feature extraction. Thereafter, we study the extension of minimax distance measures to correlation clustering, as another representation learning paradigm. Finally, we demonstrate the performance of our methods on several datasets.

Paper Structure

This paper contains 22 sections, 4 theorems, 13 equations, 5 figures, 5 tables, 1 algorithm.

Key Result

Proposition 1

Single linkage, complete linkage and average linkage methods are invariant w.r.t. the shift of the pairwise dissimilarities by an arbitrary real number $\alpha$.

Figures (5)

  • Figure 1: Illustration of ultrametric property of $\mathbf X$.
  • Figure 2: MI score of different hierarchical clustering methods applied to UCI datasets, where x-axis shows the parameter $\eta$.
  • Figure 3: The datasets with arbitrarily shaped clusters, where 'minimax + correlation clustering' acheives perfect clustering.
  • Figure 4: Embeddings of vehicle motion trajectories computed via dynamic time wrapping and t-SNE HoseiniRC21DemetriouARC23.
  • Figure 5: Rand score of hierarchical clustering methods applied to the UCI datasets (the x-axis shows the flip noise parameter $\eta$). Similar to the MI measure, HCC provides the best scores, even when the datasets are difficult to cluster.

Theorems & Definitions (4)

  • Proposition 1
  • Lemma 1
  • Theorem 1
  • Theorem 2