Explainable Graph Representation Learning via Graph Pattern Analysis
Xudong Wang, Ziheng Sun, Chris Ding, Jicong Fan
TL;DR
This work addresses the gap in representation-level explainability for graph representations by introducing PXGL-EGK and PXGL-GNN, which analyze and learn graph representations through pattern analysis of substructures. The ensemble graph kernel framework combines multiple pattern kernels with learnable weights to deliver both accurate similarity measures and interpretable pattern contributions, while the GNN-based extension learns pattern representations from sampled subgraphs and fuses them into an explainable ensemble representation. The authors provide robustness and generalization guarantees and demonstrate superior performance and interpretability on benchmark datasets for both supervised and unsupervised tasks. Overall, the approach offers concrete insights into which graph patterns drive representations, enabling more trustworthy and domain-aligned graph learning.
Abstract
Explainable artificial intelligence (XAI) is an important area in the AI community, and interpretability is crucial for building robust and trustworthy AI models. While previous work has explored model-level and instance-level explainable graph learning, there has been limited investigation into explainable graph representation learning. In this paper, we focus on representation-level explainable graph learning and ask a fundamental question: What specific information about a graph is captured in graph representations? Our approach is inspired by graph kernels, which evaluate graph similarities by counting substructures within specific graph patterns. Although the pattern counting vector can serve as an explainable representation, it has limitations such as ignoring node features and being high-dimensional. To address these limitations, we introduce a framework (PXGL-GNN) for learning and explaining graph representations through graph pattern analysis. We start by sampling graph substructures of various patterns. Then, we learn the representations of these patterns and combine them using a weighted sum, where the weights indicate the importance of each graph pattern's contribution. We also provide theoretical analyses of our methods, including robustness and generalization. In our experiments, we show how to learn and explain graph representations for real-world data using pattern analysis. Additionally, we compare our method against multiple baselines in both supervised and unsupervised learning tasks to demonstrate its effectiveness.
