Discrete Diffusion-Based Model-Level Explanation of Heterogeneous GNNs with Node Features

Pallabee Das; Stefan Heindorf

Discrete Diffusion-Based Model-Level Explanation of Heterogeneous GNNs with Node Features

Pallabee Das, Stefan Heindorf

TL;DR

DiGNNExplainer introduces a model-level explanation framework for heterogeneous GNNs that generates explanation graphs with authentic node features using discrete diffusion. It couples DiGress for graph structure with a novel DiTabDDPM for discrete node features, enforcing metagraph consistency and selecting a top explanation per class based on GNN predictions. Across real and synthetic datasets, the method achieves superior realism (via distributional similarity) and faithfulness (PF and GF) compared to state-of-the-art baselines, demonstrating the value of incorporating actual node features in explanations. The approach is scalable to varied graph sizes, extensible to directed and broader heterogeneous domains, and offers practical insights for understanding complex HGNN decision-making.

Abstract

Many real-world datasets, such as citation networks, social networks, and molecular structures, are naturally represented as heterogeneous graphs, where nodes belong to different types and have additional features. For example, in a citation network, nodes representing "Paper" or "Author" may include attributes like keywords or affiliations. A critical machine learning task on these graphs is node classification, which is useful for applications such as fake news detection, corporate risk assessment, and molecular property prediction. Although Heterogeneous Graph Neural Networks (HGNNs) perform well in these contexts, their predictions remain opaque. Existing post-hoc explanation methods lack support for actual node features beyond one-hot encoding of node type and often fail to generate realistic, faithful explanations. To address these gaps, we propose DiGNNExplainer, a model-level explanation approach that synthesizes heterogeneous graphs with realistic node features via discrete denoising diffusion. In particular, we generate realistic discrete features (e.g., bag-of-words features) using diffusion models within a discrete space, whereas previous approaches are limited to continuous spaces. We evaluate our approach on multiple datasets and show that DiGNNExplainer produces explanations that are realistic and faithful to the model's decision-making, outperforming state-of-the-art methods.

Discrete Diffusion-Based Model-Level Explanation of Heterogeneous GNNs with Node Features

TL;DR

Abstract

Discrete Diffusion-Based Model-Level Explanation of Heterogeneous GNNs with Node Features

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)