HGOE: Hybrid External and Internal Graph Outlier Exposure for Graph Out-of-Distribution Detection
Junwei He, Qianqian Xu, Yangbangyan Jiang, Zitai Wang, Yuchen Sun, Qingming Huang
TL;DR
This work tackles graph-level OOD detection with Hybrid External and Internal Graph Outlier Exposure (HGOE), which combines diverse external outliers from cross-domain sources with internal outliers synthesized within in-distribution (ID) subgroups through graphon-based mixing. A boundary-aware OE loss $\ell_{ba}$ guides learning by adaptively emphasizing outliers near the ID boundary while suppressing invalid intrusions, and the framework can be integrated with existing detectors. Key innovations include ID-mixup, which generates internal outliers via graphon mixtures $\mathcal{M}=\lambda W_i+(1-\lambda)W_j$, together with feature alignment to external outliers. Empirical results on 8 real-world datasets show that HGOE consistently improves graph OOD detection performance and that both external and internal outliers contribute to the gains; interpretable visualizations and comprehensive ablations support the design choices.
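The graphon mixture $\mathcal{M}=\lambda W_i+(1-\lambda)W_j$ can be illustrated with a minimal NumPy sketch. It assumes each graphon is already available as a $k \times k$ step-function matrix of edge probabilities; how HGOE estimates the graphons of the ID subgroups and aligns node features is not covered here. The function names `mix_graphons` and `sample_graph` and the toy matrices are hypothetical, chosen only for illustration.

```python
import numpy as np


def mix_graphons(w_i, w_j, lam):
    """Return the convex combination M = lam * W_i + (1 - lam) * W_j."""
    if w_i.shape != w_j.shape:
        raise ValueError("graphons must share the same resolution")
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return lam * w_i + (1.0 - lam) * w_j


def sample_graph(graphon, num_nodes, rng):
    """Sample an undirected simple graph from a step-function graphon."""
    k = graphon.shape[0]
    # Draw a latent position in [0, 1) per node and map it to a graphon block.
    latent = rng.random(num_nodes)
    blocks = np.minimum((latent * k).astype(int), k - 1)
    # Edge probability for every node pair, read from the graphon blocks.
    probs = graphon[np.ix_(blocks, blocks)]
    # Bernoulli draws on the strict upper triangle, mirrored for symmetry.
    upper = np.triu(rng.random((num_nodes, num_nodes)) < probs, k=1)
    return (upper | upper.T).astype(np.int64)


# Toy graphons (assumed values): an assortative and a disassortative pattern.
w_a = np.array([[0.9, 0.1], [0.1, 0.9]])
w_b = np.array([[0.2, 0.8], [0.8, 0.2]])

rng = np.random.default_rng(0)
mixed = mix_graphons(w_a, w_b, lam=0.5)
adj = sample_graph(mixed, num_nodes=12, rng=rng)
```

Because the mixture is a convex combination, every entry of `mixed` stays a valid edge probability, so graphs can be sampled from it exactly as from either parent graphon.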
Abstract
With the progressive advancements in deep graph learning, out-of-distribution (OOD) detection for graph data has emerged as a critical challenge. While the efficacy of auxiliary datasets in enhancing OOD detection has been extensively studied for image and text data, such approaches have not yet been explored for graph data. Unlike Euclidean data, graph data exhibits greater diversity but lower robustness to perturbations, which complicates the integration of outliers. To tackle these challenges, we propose \textbf{H}ybrid External and Internal \textbf{G}raph \textbf{O}utlier \textbf{E}xposure (HGOE) to improve graph OOD detection performance. Our framework uses realistic external graph data from various domains and synthesizes internal outliers within in-distribution (ID) subgroups, addressing both the poor robustness and the presence of OOD samples within the ID class. Furthermore, we develop a boundary-aware OE loss that adaptively assigns weights to outliers, maximizing the use of high-quality OOD samples while minimizing the impact of low-quality ones. The proposed HGOE framework is model-agnostic and designed to enhance the effectiveness of existing graph OOD detection models. Experimental results demonstrate that HGOE can significantly improve the performance of existing OOD detection models across all 8 real-world datasets.
