Table of Contents
Fetching ...

Revisiting Self-Supervised Heterogeneous Graph Learning from Spectral Clustering Perspective

Yujie Mo, Zhihe Lu, Runpeng Yu, Xiaofeng Zhu, Xinchao Wang

TL;DR

The paper tackles noise in self-supervised heterogeneous graph learning and underutilization of cluster-level information by reframing SHGL through spectral clustering. It introduces SCHOOL, a framework that employs rank-constrained spectral clustering to produce an affinity matrix with exactly $c$ connected components and uses a projection-based, eigendecomposition–free route to obtain a cluster assignment matrix that aligns with spectral structure via a spectral loss. It further enhances representations with node- and cluster-level dual consistency constraints, yielding a final objective $\mathcal{J} = \mathcal{L}_{sp} + \mu \mathcal{L}_{nc} + \delta \mathcal{L}_{cc}$ and concatenated node representations for downstream tasks. Theoretical results show partitions into $c$ classes with improved generalization, and experiments on both heterogeneous and homogeneous graphs demonstrate consistent improvements over state-of-the-art SHGL methods in node classification and clustering. Overall, the work provides a principled, scalable bridge between SHGL and spectral clustering with practical gains for diverse graph learning tasks.

Abstract

Self-supervised heterogeneous graph learning (SHGL) has shown promising potential in diverse scenarios. However, while existing SHGL methods share a similar essential with clustering approaches, they encounter two significant limitations: (i) noise in graph structures is often introduced during the message-passing process to weaken node representations, and (ii) cluster-level information may be inadequately captured and leveraged, diminishing the performance in downstream tasks. In this paper, we address these limitations by theoretically revisiting SHGL from the spectral clustering perspective and introducing a novel framework enhanced by rank and dual consistency constraints. Specifically, our framework incorporates a rank-constrained spectral clustering method that refines the affinity matrix to exclude noise effectively. Additionally, we integrate node-level and cluster-level consistency constraints that concurrently capture invariant and clustering information to facilitate learning in downstream tasks. We theoretically demonstrate that the learned representations are divided into distinct partitions based on the number of classes and exhibit enhanced generalization ability across tasks. Experimental results affirm the superiority of our method, showcasing remarkable improvements in several downstream tasks compared to existing methods.

Revisiting Self-Supervised Heterogeneous Graph Learning from Spectral Clustering Perspective

TL;DR

The paper tackles noise in self-supervised heterogeneous graph learning and underutilization of cluster-level information by reframing SHGL through spectral clustering. It introduces SCHOOL, a framework that employs rank-constrained spectral clustering to produce an affinity matrix with exactly connected components and uses a projection-based, eigendecomposition–free route to obtain a cluster assignment matrix that aligns with spectral structure via a spectral loss. It further enhances representations with node- and cluster-level dual consistency constraints, yielding a final objective and concatenated node representations for downstream tasks. Theoretical results show partitions into classes with improved generalization, and experiments on both heterogeneous and homogeneous graphs demonstrate consistent improvements over state-of-the-art SHGL methods in node classification and clustering. Overall, the work provides a principled, scalable bridge between SHGL and spectral clustering with practical gains for diverse graph learning tasks.

Abstract

Self-supervised heterogeneous graph learning (SHGL) has shown promising potential in diverse scenarios. However, while existing SHGL methods share a similar essential with clustering approaches, they encounter two significant limitations: (i) noise in graph structures is often introduced during the message-passing process to weaken node representations, and (ii) cluster-level information may be inadequately captured and leveraged, diminishing the performance in downstream tasks. In this paper, we address these limitations by theoretically revisiting SHGL from the spectral clustering perspective and introducing a novel framework enhanced by rank and dual consistency constraints. Specifically, our framework incorporates a rank-constrained spectral clustering method that refines the affinity matrix to exclude noise effectively. Additionally, we integrate node-level and cluster-level consistency constraints that concurrently capture invariant and clustering information to facilitate learning in downstream tasks. We theoretically demonstrate that the learned representations are divided into distinct partitions based on the number of classes and exhibit enhanced generalization ability across tasks. Experimental results affirm the superiority of our method, showcasing remarkable improvements in several downstream tasks compared to existing methods.

Paper Structure

This paper contains 40 sections, 9 theorems, 73 equations, 5 figures, 9 tables, 1 algorithm.

Key Result

Theorem 2.2

Assume the learned representations $\mathbf{H}$ are orthogonal, optimizing previous meta-path-based and adaptive-graph-based SHGL methods is equivalent to performing spectral clustering with additional regularization, i.e., where $R(\cdot)$ indicates the regularization term, $\hat{\mathbf{L}}$ indicates the Laplacian matrix of the meta-path-based graph or the adaptive graph structure.

Figures (5)

  • Figure 1: The flowchart of SCHOOL, which first employs the Multi-Layer Perception $g_{\phi}$ to derive semantic representations $\mathbf{H}$, followed by obtaining orthogonal cluster assignment matrix $\mathbf{Y}$ and orthogonal ${\mathbf{H}}$. Subsequently, SCHOOL filters noisy connections by deriving the rank-constrained affinity matrix $\mathbf{S}$, which is further used to multiply with $\mathbf{H}$ and then obtain node representations $\mathbf{Z}$. Meanwhile, SCHOOL employs a heterogeneous encoder $f_{\theta}$ to aggregate information across node types, yielding heterogeneous representations $\tilde{\mathbf{Z}}$. Finally, SCHOOL incorporates spectral loss $\mathcal{L}_{sp}$ to optimize $\mathbf{Y}$ to fit eigenvectors of the Laplacian matrix of $\mathbf{S}$. Moreover, SCHOOL designs node-level ( i.e., $\mathcal{L}_{nc}$) and cluster-level ( i.e., $\mathcal{L}_{cc}$) consistency constraints on projected representations ( i.e., $\mathbf{Q}$ and $\tilde{\mathbf{Q}}$) and cluster representations $\hat{\mathbf{Q}}$ to capture the invariant and clustering information, respectively.
  • Figure 2: Visualization of the affinity matrix $\mathbf{S}$ and t-SNE on DBLP and Aminer datasets.
  • Figure 3: Visualization plotted by t-SNE and the corresponding silhouette scores (SIL) of node representations of the proposed SCHOOL and other SHGL comparison methods on the DBLP dataset.
  • Figure 4: Classification performance (i.e., Macro-F1) of the proposed method under different clusters.
  • Figure 5: The classification performance of the proposed method at different parameter settings ( i.e., $\mu$, and $\delta$) on all heterogeneous graph datasets.

Theorems & Definitions (17)

  • Definition 2.1
  • Theorem 2.2
  • Theorem 2.3
  • Lemma 2.4
  • Theorem 2.5
  • Theorem 2.6
  • Theorem C.1
  • proof
  • Definition C.2
  • Theorem C.3
  • ...and 7 more