Generating Human Understandable Explanations for Node Embeddings
Zohair Shafi, Ayan Chatterjee, Tina Eliassi-Rad
TL;DR
This work tackles the opacity of node embeddings by introducing XM, a framework that augments existing embedding methods with human-understandable sense features. It defines an Explain matrix $E \in \mathbb{R}^{d\times f}$ that links embedding dimensions to features and minimizes the nuclear norm $||E||_*$ via sparsity and orthogonality constraints, producing denoised, interpretable explanations without sacrificing performance. XM is validated across multiple algorithms and real-world graphs, showing reduced Explain-norms and comparable AUC in link prediction, with ablation indicating the orthogonality constraint as a key driver of explainability. The approach offers a task-agnostic, extensible path to interpretable embeddings by enabling per-dimension explanations linked to structural graph properties, suitable for cross-domain graphs and large-scale networks.
Abstract
Node embedding algorithms produce low-dimensional latent representations of nodes in a graph. These embeddings are often used for downstream tasks, such as node classification and link prediction. In this paper, we investigate the following two questions: (Q1) Can we explain each embedding dimension with human-understandable graph features (e.g. degree, clustering coefficient and PageRank). (Q2) How can we modify existing node embedding algorithms to produce embeddings that can be easily explained by human-understandable graph features? We find that the answer to Q1 is yes and introduce a new framework called XM (short for eXplain eMbedding) to answer Q2. A key aspect of XM involves minimizing the nuclear norm of the generated explanations. We show that by minimizing the nuclear norm, we minimize the lower bound on the entropy of the generated explanations. We test XM on a variety of real-world graphs and show that XM not only preserves the performance of existing node embedding methods, but also enhances their explainability.
