RECIPE-TKG: From Sparse History to Structured Reasoning for LLM-based Temporal Knowledge Graph Completion
Ömer Faruk Akgül, Feiyu Zhu, Yuxin Yang, Rajgopal Kannan, Viktor Prasanna
TL;DR
RECIPE-TKG tackles the challenge of forecasting in temporal knowledge graphs with sparse historical evidence by integrating three targeted components: rule-based multi-hop history sampling to enrich grounding, contrastive fine-tuning of lightweight LoRA adapters to encode relational semantics, and test-time semantic filtering to enforce contextual consistency. This framework yields stronger relational grounding and reduces hallucinations, especially in low-context queries, outperforming both embedding-based and prior LL-based methods across four benchmarks with up to $30.6\%$ relative gains in $Hits@10$. By systematically analyzing grounding, generalization, and evaluation, the work demonstrates that carefully designed retrieval, training objectives, and inference-time checks can significantly boost the reliability and plausibility of LLM-based TKG forecasters without large-scale retraining. The approach advances practical structured reasoning in foundation models for temporally dynamic knowledge, with implications for forecasting and decision support in domains where historical evidence is sparse or indirect.
Abstract
Temporal Knowledge Graphs (TKGs) represent dynamic facts as timestamped relations between entities. TKG completion involves forecasting missing or future links, requiring models to reason over time-evolving structure. While LLMs show promise for this task, existing approaches often overemphasize supervised fine-tuning and struggle particularly when historical evidence is limited or missing. We introduce RECIPE-TKG, a lightweight and data-efficient framework designed to improve accuracy and generalization in settings with sparse historical context. It combines (1) rule-based multi-hop retrieval for structurally diverse history, (2) contrastive fine-tuning of lightweight adapters to encode relational semantics, and (3) test-time semantic filtering to iteratively refine generations based on embedding similarity. Experiments on four TKG benchmarks show that RECIPE-TKG outperforms previous LLM-based approaches, achieving up to 30.6\% relative improvement in Hits@10. Moreover, our proposed framework produces more semantically coherent predictions, even for the samples with limited historical context.
