Disentangled and Self-Explainable Node Representation Learning
Simone Piaggesi, André Panisson, Megha Khosla
TL;DR
DiSeNE introduces a disentangled, self-explainable framework for unsupervised node embeddings in graphs, where each embedding dimension is tied to a distinct topological substructure. It combines a random-walk connectivity objective with a dimension-wise disentanglement mechanism and an entropy-based regularizer, producing per-dimension explanation subgraphs via edge-attribution masks. The authors propose five novel metrics (Topological Alignment, Sparsity, Overlap Consistency, Positional Coherence, Plausibility) to quantify interpretability alongside representation quality, and demonstrate that DiSeNE achieves state-of-the-art interpretability across real and synthetic graphs while remaining competitive on downstream tasks. The work shifts explainability from post-hoc analysis of embeddings to inherently interpretable representations, enabling human-centric insights and potential downstream benefits in transparent graph learning applications.
Abstract
Node representations, or embeddings, are low-dimensional vectors that capture node properties, typically learned through unsupervised structural similarity objectives or supervised tasks. While recent efforts have focused on explaining graph model decisions, the interpretability of unsupervised node embeddings remains underexplored. To bridge this gap, we introduce DiSeNE (Disentangled and Self-Explainable Node Embedding), a framework that generates self-explainable embeddings in an unsupervised manner. Our method employs disentangled representation learning to produce dimension-wise interpretable embeddings, where each dimension is aligned with distinct topological structure of the graph. We formalize novel desiderata for disentangled and interpretable embeddings, which drive our new objective functions, optimizing simultaneously for both interpretability and disentanglement. Additionally, we propose several new metrics to evaluate representation quality and human interpretability. Extensive experiments across multiple benchmark datasets demonstrate the effectiveness of our approach.
