From Latent to Lucid: Transforming Knowledge Graph Embeddings into Interpretable Structures with KGEPrisma
Christoph Wehner, Chrysa Iliopoulou, Ute Schmid, Tarek R. Besold
TL;DR
Knowledge Graph Embeddings (KGEs) power link prediction but suffer from opaque decision processes. KGEPrisma provides post-hoc, local explanations by decoding latent embeddings into symbolic clauses drawn from the subgraph neighborhoods of similar embeddings, using a five-step workflow that includes kNN search, positive/negative pair construction, clause mining, surrogate-model-based ranking, and grounding into rule-, instance-, and analogy-based explanations. The method yields faithful explanations without retraining, scales to large graphs, and demonstrates state-of-the-art faithfulness across multiple benchmarks (FB15k-237, WN18RR, Kinship) while delivering competitive runtimes. This approach enables transparent, human-understandable insights into KGE predictions and is adaptable to diverse user needs and domains, including potential biomedical applications.
Abstract
In this paper, we introduce a post-hoc and local explainable AI method tailored for Knowledge Graph Embedding (KGE) models. These models are essential to Knowledge Graph Completion yet criticized for their opaque, black-box nature. Despite their significant success in capturing the semantics of knowledge graphs through high-dimensional latent representations, their inherent complexity poses substantial challenges to explainability. While existing methods like Kelpie use resource-intensive perturbation to explain KGE models, our approach directly decodes the latent representations encoded by KGE models, leveraging the smoothness of the embeddings, which follows the principle that similar embeddings reflect similar behaviours within the Knowledge Graph, meaning that nodes are similarly embedded because their graph neighbourhood looks similar. This principle is commonly referred to as smoothness. By identifying symbolic structures, in the form of triples, within the subgraph neighborhoods of similarly embedded entities, our method identifies the statistical regularities on which the models rely and translates these insights into human-understandable symbolic rules and facts. This bridges the gap between the abstract representations of KGE models and their predictive outputs, offering clear, interpretable insights. Key contributions include a novel post-hoc and local explainable AI method for KGE models that provides immediate, faithful explanations without retraining, facilitating real-time application on large-scale knowledge graphs. The method's flexibility enables the generation of rule-based, instance-based, and analogy-based explanations, meeting diverse user needs. Extensive evaluations show the effectiveness of our approach in delivering faithful and well-localized explanations, enhancing the transparency and trustworthiness of KGE models.
