Feature Propagation on Knowledge Graphs using Cellular Sheaves
John Cobb, Thomas Gebhart
TL;DR
The paper proposes a theoretically grounded approach for inductive knowledge graph reasoning by modeling KG embeddings as global sections of a cellular sheaf and propagating known embeddings to unseen nodes via a sheaf Laplacian-driven diffusion. It derives a closed-form harmonic extension for the optimal interior embeddings and provides a convergent Euler-based iterative scheme with explicit rate guarantees, enabling scalable inference without retraining. Empirically, the method yields competitive or superior performance on semi-inductive logical query reasoning and inductive KG completion across large benchmarks, sometimes matching or exceeding specialized inductive models. The approach offers a simple, interpretable, and robust baseline for extending transductive KG embeddings to inductive settings, with strong theoretical guarantees and practical scalability.
Abstract
Many inference tasks on knowledge graphs, including relation prediction, operate on knowledge graph embeddings -- vector representations of the vertices (entities) and edges (relations) that preserve task-relevant structure encoded within the underlying combinatorial object. Such knowledge graph embeddings can be modeled as an approximate global section of a cellular sheaf, an algebraic structure over the graph. Using the diffusion dynamics encoded by the corresponding sheaf Laplacian, we optimally propagate known embeddings of a subgraph to inductively represent new entities introduced into the knowledge graph at inference time. We implement this algorithm via an efficient iterative scheme and show that on a number of large-scale knowledge graph embedding benchmarks, our method is competitive with -- and in some scenarios outperforms -- more complex models derived explicitly for inductive knowledge graph reasoning tasks.
