Distributed Representations of Entities in Open-World Knowledge Graphs
Lingbing Guo, Zhuo Chen, Jiaoyan Chen, Yichi Zhang, Zequn Sun, Zhongpo Bo, Yin Fang, Xiaoze Liu, Huajun Chen, Wen Zhang
TL;DR
This work tackles open-world knowledge graphs where new entities continually emerge. It introduces Decentralized Attention Network (DAN), which distributes relational information over neighbor embeddings and uses neighbor context as the query in a second-order attention mechanism, reducing dependence on self-embeddings and enabling induction for unseen entities. A self-distillation objective is proposed to align input and decentralized embeddings via mutual information, supported by theoretical results. Empirically, the approach, implemented in the decentRL framework, achieves state-of-the-art results on conventional entity alignment and entity prediction benchmarks and shows strong gains in open-world settings, with good efficiency and generality across GNN backbones.
Abstract
Graph neural network (GNN)-based methods have demonstrated remarkable performance in various knowledge graph (KG) tasks. However, most existing approaches rely on observing all entities during training, posing a challenge in real-world knowledge graphs where new entities emerge frequently. To address this limitation, we introduce Decentralized Attention Network (DAN). DAN leverages neighbor context as the query vector to score the neighbors of an entity, thereby distributing the entity semantics only among its neighbor embeddings. To effectively train a DAN, we introduce self-distillation, a technique that guides the network in generating desired representations. Theoretical analysis validates the effectiveness of our approach. We implement an end-to-end framework and conduct extensive experiments to evaluate our method, showcasing competitive performance on conventional entity alignment and entity prediction tasks. Furthermore, our method significantly outperforms existing methods in open-world settings.
