LinkLogic: A New Method and Benchmark for Explainable Knowledge Graph Predictions
Niraj Kumar-Singh, Gustavo Polleti, Saee Paliwal, Rachel Hodos-Nkhereanye
TL;DR
LinkLogic addresses the interpretability gap in knowledge graph link prediction by surfacing and ranking explanation paths through perturbation-based features, fitted by a non-negative surrogate to explain the KGE score. Using FB13 and FB14 family structures, the authors construct the Parents Benchmark to enable rigorous evaluation of explanation fidelity ($R^2$), selectivity (path count), and relevance ($NDCG@k$). Empirical results show higher fidelity for true facts, controlled explanation complexity, and meaningful qualitative insights from family-tree experiments (tautologies, removal of direct links, and sibling vocabulary). The work establishes a principled evaluation framework and a reproducible explanation method that can guide future benchmarking of KGE explainability approaches and motivate extensions to negative explanations and more expressive surrogates.
Abstract
While there are a plethora of methods for link prediction in knowledge graphs, state-of-the-art approaches are often black box, obfuscating model reasoning and thereby limiting the ability of users to make informed decisions about model predictions. Recently, methods have emerged to generate prediction explanations for Knowledge Graph Embedding models, a widely-used class of methods for link prediction. The question then becomes, how well do these explanation systems work? To date this has generally been addressed anecdotally, or through time-consuming user research. In this work, we present an in-depth exploration of a simple link prediction explanation method we call LinkLogic, that surfaces and ranks explanatory information used for the prediction. Importantly, we construct the first-ever link prediction explanation benchmark, based on family structures present in the FB13 dataset. We demonstrate the use of this benchmark as a rich evaluation sandbox, probing LinkLogic quantitatively and qualitatively to assess the fidelity, selectivity and relevance of the generated explanations. We hope our work paves the way for more holistic and empirical assessment of knowledge graph prediction explanation methods in the future.
