Table of Contents
Fetching ...

Data Augmentation for Supervised Graph Outlier Detection via Latent Diffusion Models

Kay Liu, Hengrui Zhang, Ziqing Hu, Fangxin Wang, Philip S. Yu

TL;DR

GODM, a novel data augmentation for mitigating class imbalance in supervised Graph Outlier detection via latent Diffusion Models is introduced and encapsulate into a plug-and-play package and released at PyPI.

Abstract

A fundamental challenge confronting supervised graph outlier detection algorithms is the prevalent problem of class imbalance, where the scarcity of outlier instances compared to normal instances often results in suboptimal performance. Recently, generative models, especially diffusion models, have demonstrated their efficacy in synthesizing high-fidelity images. Despite their extraordinary generation quality, their potential in data augmentation for supervised graph outlier detection remains largely underexplored. To bridge this gap, we introduce GODM, a novel data augmentation for mitigating class imbalance in supervised Graph Outlier detection via latent Diffusion Models. Extensive experiments conducted on multiple datasets substantiate the effectiveness and efficiency of GODM. The case study further demonstrated the generation quality of our synthetic data. To foster accessibility and reproducibility, we encapsulate GODM into a plug-and-play package and release it at PyPI: https://pypi.org/project/godm/.

Data Augmentation for Supervised Graph Outlier Detection via Latent Diffusion Models

TL;DR

GODM, a novel data augmentation for mitigating class imbalance in supervised Graph Outlier detection via latent Diffusion Models is introduced and encapsulate into a plug-and-play package and released at PyPI.

Abstract

A fundamental challenge confronting supervised graph outlier detection algorithms is the prevalent problem of class imbalance, where the scarcity of outlier instances compared to normal instances often results in suboptimal performance. Recently, generative models, especially diffusion models, have demonstrated their efficacy in synthesizing high-fidelity images. Despite their extraordinary generation quality, their potential in data augmentation for supervised graph outlier detection remains largely underexplored. To bridge this gap, we introduce GODM, a novel data augmentation for mitigating class imbalance in supervised Graph Outlier detection via latent Diffusion Models. Extensive experiments conducted on multiple datasets substantiate the effectiveness and efficiency of GODM. The case study further demonstrated the generation quality of our synthetic data. To foster accessibility and reproducibility, we encapsulate GODM into a plug-and-play package and release it at PyPI: https://pypi.org/project/godm/.
Paper Structure (24 sections, 10 equations, 2 figures, 3 tables, 3 algorithms)

This paper contains 24 sections, 10 equations, 2 figures, 3 tables, 3 algorithms.

Figures (2)

  • Figure 3: Visualization of single node feature density and edge type frequency of real data and synthetic data on DGraph.
  • Figure 4: Running time and GPU memory consumption.