Hyperbolic Continuous Structural Entropy for Hierarchical Clustering
Guangjie Zeng, Hao Peng, Angsheng Li, Li Sun, Chunyang Liu, Shengze Li, Yicheng Pan, Philip S. Yu
TL;DR
The paper tackles the lack of a global objective and reliance on fixed graphs in hierarchical clustering by introducing HypCSE, a differentiable framework that optimizes a continuous structural entropy objective in hyperbolic space. It defines CSE via hyperbolic lowest common ancestors, encodes graphs with hyperbolic graph neural networks, and decodes hierarchical structures from embeddings. A graph structure learning module with contrastive learning adaptively refines the graph during training, yielding superior hierarchical clustering performance across seven datasets. The work highlights the effectiveness of hyperbolic geometry for representing hierarchies and offers a scalable, end-to-end method for structure-enhanced clustering.
Abstract
Hierarchical clustering is a fundamental machine-learning technique for grouping data points into dendrograms. However, existing hierarchical clustering methods encounter two primary challenges: 1) Most methods specify dendrograms without a global objective. 2) Graph-based methods often neglect the significance of graph structure, optimizing objectives on complete or static predefined graphs. In this work, we propose Hyperbolic Continuous Structural Entropy neural networks, namely HypCSE, for structure-enhanced continuous hierarchical clustering. Our key idea is to map data points in the hyperbolic space and minimize the relaxed continuous structural entropy (SE) on structure-enhanced graphs. Specifically, we encode graph vertices in hyperbolic space using hyperbolic graph neural networks and minimize approximate SE defined on graph embeddings. To make the SE objective differentiable for optimization, we reformulate it into a function using the lowest common ancestor (LCA) on trees and then relax it into continuous SE (CSE) by the analogy of hyperbolic graph embeddings and partitioning trees. To ensure a graph structure that effectively captures the hierarchy of data points for CSE calculation, we employ a graph structure learning (GSL) strategy that updates the graph structure during training. Extensive experiments on seven datasets demonstrate the superior performance of HypCSE.
