Table of Contents
Fetching ...

Adaptive Graph Auto-Encoder for General Data Clustering

Xuelong Li, Hongyuan Zhang, Rui Zhang

TL;DR

AdaGAE tackles general data clustering by generatively constructing a weighted graph from data and learning embeddings with a graph auto-encoder, while adaptively updating the graph to reveal high-level structure. A key insight is that naively updating a fixed sparsity graph collapses the clustering quality; AdaGAE mitigates this by increasing the sparsity parameter $k$ over iterations and by using a distance-based decoder with KL regularization. Theoretical results explain degeneration and provide an alternative interpretation of the decoder, and spectral analysis shows the adaptive updates smooth the Laplacian, contributing to stable, scalable clustering. Empirically, AdaGAE outperforms baselines across a diverse set of text and image datasets and demonstrates robustness to initialization and data scale.

Abstract

Graph-based clustering plays an important role in the clustering area. Recent studies about graph convolution neural networks have achieved impressive success on graph type data. However, in general clustering tasks, the graph structure of data does not exist such that the strategy to construct a graph is crucial for performance. Therefore, how to extend graph convolution networks into general clustering tasks is an attractive problem. In this paper, we propose a graph auto-encoder for general data clustering, which constructs the graph adaptively according to the generative perspective of graphs. The adaptive process is designed to induce the model to exploit the high-level information behind data and utilize the non-Euclidean structure sufficiently. We further design a novel mechanism with rigorous analysis to avoid the collapse caused by the adaptive construction. Via combining the generative model for network embedding and graph-based clustering, a graph auto-encoder with a novel decoder is developed such that it performs well in weighted graph used scenarios. Extensive experiments prove the superiority of our model.

Adaptive Graph Auto-Encoder for General Data Clustering

TL;DR

AdaGAE tackles general data clustering by generatively constructing a weighted graph from data and learning embeddings with a graph auto-encoder, while adaptively updating the graph to reveal high-level structure. A key insight is that naively updating a fixed sparsity graph collapses the clustering quality; AdaGAE mitigates this by increasing the sparsity parameter over iterations and by using a distance-based decoder with KL regularization. Theoretical results explain degeneration and provide an alternative interpretation of the decoder, and spectral analysis shows the adaptive updates smooth the Laplacian, contributing to stable, scalable clustering. Empirically, AdaGAE outperforms baselines across a diverse set of text and image datasets and demonstrates robustness to initialization and data scale.

Abstract

Graph-based clustering plays an important role in the clustering area. Recent studies about graph convolution neural networks have achieved impressive success on graph type data. However, in general clustering tasks, the graph structure of data does not exist such that the strategy to construct a graph is crucial for performance. Therefore, how to extend graph convolution networks into general clustering tasks is an attractive problem. In this paper, we propose a graph auto-encoder for general data clustering, which constructs the graph adaptively according to the generative perspective of graphs. The adaptive process is designed to induce the model to exploit the high-level information behind data and utilize the non-Euclidean structure sufficiently. We further design a novel mechanism with rigorous analysis to avoid the collapse caused by the adaptive construction. Via combining the generative model for network embedding and graph-based clustering, a graph auto-encoder with a novel decoder is developed such that it performs well in weighted graph used scenarios. Extensive experiments prove the superiority of our model.

Paper Structure

This paper contains 21 sections, 4 theorems, 32 equations, 4 figures, 4 tables, 1 algorithm.

Key Result

Theorem 1

Let $p^{(k)}(\cdot | v_i)$ be the $k$-largest $p(\cdot | v_i)$ and $\hat{d}_{ij} = \|z_i - z_j\|_2$ where $z_i$ is generated by GAE with sparsity $k$. Let $\hat{p}(\cdot | v_i) = \arg \min _{p(\cdot | v_i)} \sum _{i=1}^n \mathbb{E}_{v_j \sim p(\cdot | v_i)} \hat{d}_{i j} + \gamma_i \|\bm p_i\|_2^2$ Therefore, with $\varepsilon \rightarrow 0$, $\hat{p}(\cdot | v_i)$ will degenerate into a sparse a

Figures (4)

  • Figure 1: Framework of AdaGAE. $k_0$ is the initial sparsity. First, we construct a sparse graph via the generative model defined in Eq. (\ref{['obj_CAN']}). The learned graph is employed to apply the GAE designed for the weighted graphs. After training the GAE, we update the graph from the learned embedding with a larger sparsity, $k$. With the new graph, we re-train the GAE. These steps are repeated until the convergence.
  • Figure 2: Visualization of the learning process of AdaGAE on USPS. Figure \ref{['subfigure_epoch_1']}-\ref{['subfigure_epoch_8']} show the embedding learned by AdaGAE at the $i$-th epoch, while the raw features and the final results are shown in Figure \ref{['subfigure_epoch_0']} and \ref{['subfigure_epoch_9']}, respectively. An epoch corresponds to an update of the graph.
  • Figure 3: t-SNE visualization on UMIST and USPS: The first and second line illustrate results on UMIST and USPS, respectively. Clearly, AdaGAE projects most semblable samples into the analogous embedding. Note that the cohesive embedding is preferable for clustering.
  • Figure 4: Parameter sensitivity of $\lambda$ on UMIST and USPS. On UMIST, the second term improves the performance distinctly. Besides, if $\lambda$ is not too large, AdaGAE will obtain good results.

Theorems & Definitions (7)

  • Theorem 1
  • Theorem 2
  • Theorem 3
  • proof
  • proof
  • Lemma 1
  • proof