ReaLitE: Enrichment of Relation Embeddings in Knowledge Graphs using Numeric Literals
Antonis Klironomos, Baifan Zhou, Zhuoxun Zheng, Gad-Elrab Mohamed, Heiko Paulheim, Evgeny Kharlamov
TL;DR
ReaLitE introduces a relation-centric enrichment of knowledge graph embeddings by aggregating numeric literals from head and tail entities per relation and fusing these summaries into the relation embeddings. The approach decouples literal processing from the base KGE model, offering linear or gated fusion variants and a flexible aggregation mechanism (including a learnable combination of aggregations). Empirical results show ReaLitE improves link prediction and node classification across multiple datasets and base models, with the largest gains for long-tail relations and numerically correlated literals. The method demonstrates strong compatibility with existing KGE frameworks and highlights the value of leveraging numeric attributes to capture nuanced relational patterns in real-world graphs.
Abstract
Most knowledge graph embedding (KGE) methods tailored for link prediction focus on the entities and relations in the graph, giving little attention to other literal values, which might encode important information. Therefore, some literal-aware KGE models attempt to either integrate numerical values into the embeddings of the entities or convert these numerics into entities during preprocessing, leading to information loss. Other methods concerned with creating relation-specific numerical features assume completeness of numerical data, which does not apply to real-world graphs. In this work, we propose ReaLitE, a novel relation-centric KGE model that dynamically aggregates and merges entities' numerical attributes with the embeddings of the connecting relations. ReaLitE is designed to complement existing conventional KGE methods while supporting multiple variations for numerical aggregations, including a learnable method. We comprehensively evaluated the proposed relation-centric embedding using several benchmarks for link prediction and node classification tasks. The results showed the superiority of ReaLitE over the state of the art in both tasks.
