Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classification

Yutong Xia; Runpeng Yu; Yuxuan Liang; Xavier Bresson; Xinchao Wang; Roger Zimmermann

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classification

Yutong Xia, Runpeng Yu, Yuxuan Liang, Xavier Bresson, Xinchao Wang, Roger Zimmermann

TL;DR

The paper interrogates graph data augmentation through a spectral lens, addressing graph-property distortion and limited structural changes in existing methods. It introduces the Dual-Prism (DP) augmentation, with DP-Noise and DP-Mask, which selectively perturb high-frequency Laplacian eigenvalues while preserving low-frequency components to maintain core graph properties. Through extensive experiments across supervised, semi-supervised, unsupervised, and transfer learning on 21 real-world datasets, DP methods achieve state-of-the-art or near-state-of-the-art performance in many settings, demonstrating improved generalization and diversity. The work highlights the practical significance of spectral-aware augmentation for robust graph classification and suggests a new direction for leveraging graph spectra in data augmentation.

Abstract

Graph Neural Networks have become the preferred tool to process graph data, with their efficacy being boosted through graph data augmentation techniques. Despite the evolution of augmentation methods, issues like graph property distortions and restricted structural changes persist. This leads to the question: Is it possible to develop more property-conserving and structure-sensitive augmentation methods? Through a spectral lens, we investigate the interplay between graph properties, their augmentation, and their spectral behavior, and observe that keeping the low-frequency eigenvalues unchanged can preserve the critical properties at a large scale when generating augmented graphs. These observations inform our introduction of the Dual-Prism (DP) augmentation methods, including DP-Noise and DP-Mask, which retain essential graph properties while diversifying augmented graphs. Extensive experiments validate the efficiency of our approach, providing a new and promising direction for graph data augmentation.

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classification

TL;DR

Abstract

Paper Structure (21 sections, 10 figures, 9 tables, 1 algorithm)

This paper contains 21 sections, 10 figures, 9 tables, 1 algorithm.

Introduction
Related Work
A Spectral Lens on Graph Data Augmentations
Spectral Analysis Insights
Methodology
Proposed Augmentation Methods
Empirical Evidence
Theoretical Backing and Insights
Experiments
Supervised Learning
Semi-supervised Learning
Unsupervised Representation Learning
Transfer Learning
Conclusion
Acknowledgements
...and 6 more sections

Figures (10)

Figure 1: Visualization of (a) a graph from the IMDB-BINARY dataset and its augmented graphs via (b) DropEdge rong2019dropedge, (c) DP-Noise (ours), and (d) DP-Mask (ours). Dashed line: Dropped edge. Red line: Added edge. (e) Five properties of these graphs. $r$: radius. $d$: diameter. conn.: connectivity. ASPL: average shortest path length. #peri: number of periphery. Ori.: Original. D.E.: DropEdge. DP-N: DP-Noise. DP-M: DP-Mask. (f) The eigenvalues of these four graphs.
Figure 2: (a) A toy graph $\mathcal{G}$ consisting of eight nodes. (b) Absolute variation in eigenvalues of $\mathcal{G}$ when adding an edge at diverse positions. The red and blue rectangles represent when adding the corresponding edges in $\mathcal{G}$ and the change of the eigenvalues. (c) A real-world case in the REDDIT-BINARY dataset where, when dropping 20% and 50%, the high frequency is more vulnerable.
Figure 3: (a) Training loss of GIN model on REDDIT-BINARY when graphs are augmented by masking different eigenvalues. (b) Graph classification performance on IMDB-BINARY under various hyperparameters. The lines represent the average accuracy, while the shaded areas indicate the error margins.
Figure 4: (a) Diameter and (b) radius distributions of different classes in REDD-M12. When (c) adding or (d) removing an edge, variation of the spectral domain $\Delta L_2$, $1/\lambda_1$ of $\mathcal{G}'$, ASPL and diameter $d$ of $\mathcal{G}'$.
Figure 5: The framework of our Dual-Prism (DP) for graph data augmentation.
...and 5 more figures

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classification

TL;DR

Abstract

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classification

Authors

TL;DR

Abstract

Table of Contents

Figures (10)