Table of Contents
Fetching ...

Can Self Supervision Rejuvenate Similarity-Based Link Prediction?

Chenhan Zhang, Weiqi Wang, Zhiyi Tian, James Jianqiao Yu, Mohamed Ali Kaafar, An Liu, Shui Yu

TL;DR

3SLP is suitable for the unsupervised condition of similarity-based LP without the assistance of known link labels, and introduces a dual-view contrastive node representation learning (DCNRL) with crafted data augmentation and node representation learning.

Abstract

Although recent advancements in end-to-end learning-based link prediction (LP) methods have shown remarkable capabilities, the significance of traditional similarity-based LP methods persists in unsupervised scenarios where there are no known link labels. However, the selection of node features for similarity computation in similarity-based LP can be challenging. Less informative node features can result in suboptimal LP performance. To address these challenges, we integrate self-supervised graph learning techniques into similarity-based LP and propose a novel method: Self-Supervised Similarity-based LP (3SLP). 3SLP is suitable for the unsupervised condition of similarity-based LP without the assistance of known link labels. Specifically, 3SLP introduces a dual-view contrastive node representation learning (DCNRL) with crafted data augmentation and node representation learning. DCNRL is dedicated to developing more informative node representations, replacing the node attributes as inputs in the similarity-based LP backbone. Extensive experiments over benchmark datasets demonstrate the salient improvement of 3SLP, outperforming the baseline of traditional similarity-based LP by up to 21.2% (AUC).

Can Self Supervision Rejuvenate Similarity-Based Link Prediction?

TL;DR

3SLP is suitable for the unsupervised condition of similarity-based LP without the assistance of known link labels, and introduces a dual-view contrastive node representation learning (DCNRL) with crafted data augmentation and node representation learning.

Abstract

Although recent advancements in end-to-end learning-based link prediction (LP) methods have shown remarkable capabilities, the significance of traditional similarity-based LP methods persists in unsupervised scenarios where there are no known link labels. However, the selection of node features for similarity computation in similarity-based LP can be challenging. Less informative node features can result in suboptimal LP performance. To address these challenges, we integrate self-supervised graph learning techniques into similarity-based LP and propose a novel method: Self-Supervised Similarity-based LP (3SLP). 3SLP is suitable for the unsupervised condition of similarity-based LP without the assistance of known link labels. Specifically, 3SLP introduces a dual-view contrastive node representation learning (DCNRL) with crafted data augmentation and node representation learning. DCNRL is dedicated to developing more informative node representations, replacing the node attributes as inputs in the similarity-based LP backbone. Extensive experiments over benchmark datasets demonstrate the salient improvement of 3SLP, outperforming the baseline of traditional similarity-based LP by up to 21.2% (AUC).

Paper Structure

This paper contains 35 sections, 12 equations, 6 figures, 4 tables, 1 algorithm.

Figures (6)

  • Figure 1: Schematic of 3SLP.
  • Figure 2: Utility of predicted links on node classification tasks.
  • Figure 3: Graph assortativity coefficients versus LP performance (using cosine similarity in PSC). AAC measures graph's attribute homophily. DAC measures graph's topology homophily.
  • Figure 4: Learning performance comparison between datasets Citesser (homophilic) and Reddit (heterophilic). Training loss per epoch (blue) and validation performance per epoch (orange).
  • Figure 5: Spectrum alignment of adjacency matrix $\mathbf{A}$ and relation matrix $\mathbf{R}$. The realms of $\hat{\mathbf{U}}_{a}$ and $\hat{\mathbf{U}}_{r}$ are illustrated with blue and orange shadows, respectively. $\mathcal{A}(\hat{\mathbf{U}}_a, \hat{\mathbf{U}}_r)$ numerically indicates the level of spectrum alignment.
  • ...and 1 more figures