Global Concept Explanations for Graphs by Contrastive Learning
Jonas Teufel, Pascal Friederich
TL;DR
This work develops Megan2, an extension of the MEGAN framework, to generate global concept explanations for graph property prediction tasks by learning a subgraph-centric latent space and clustering it into interpretable concepts. A contrastive learning objective, channel-wise projection networks, and a prototype-optimization pipeline (via a genetic algorithm) enable compact, representative prototypes and concept-level interpretations, with optional GPT-4 hypothesis generation for causal insights. Across synthetic datasets (e.g., BA2Motifs, RbMotifs) and real-world molecular datasets (Mutagenicity, AqSolDB), Megan2 both faithfully recovers ground-truth motifs and uncovers diverse, chemistry-aligned concepts, providing finer-grained explanations than prior global explainers. The results demonstrate the potential of global concept explanations to reveal underlying structure–property relationships in graph domains, while highlighting limitations related to Megan’s assumptions and the reliability of language-model hypotheses for certain tasks.
Abstract
Beyond improving trust and validating model fairness, xAI practices also have the potential to recover valuable scientific insights in application domains where little to no prior human intuition exists. To that end, we propose a method to extract global concept explanations from the predictions of graph neural networks to develop a deeper understanding of the tasks underlying structure-property relationships. We identify concept explanations as dense clusters in the self-explaining Megan models subgraph latent space. For each concept, we optimize a representative prototype graph and optionally use GPT-4 to provide hypotheses about why each structure has a certain effect on the prediction. We conduct computational experiments on synthetic and real-world graph property prediction tasks. For the synthetic tasks we find that our method correctly reproduces the structural rules by which they were created. For real-world molecular property regression and classification tasks, we find that our method rediscovers established rules of thumb. More specifically, our results for molecular mutagenicity prediction indicate more fine-grained resolution of structural details than existing explainability methods, consistent with previous results from chemistry literature. Overall, our results show promising capability to extract the underlying structure-property relationships for complex graph property prediction tasks.
