Chain-of-History Reasoning for Temporal Knowledge Graph Forecasting
Yuwei Xia, Ding Wang, Qiang Liu, Liang Wang, Shu Wu, Xiaoyu Zhang
TL;DR
This work addresses the limitations of purely graph-based or single-step LLM-based TKG forecasting by introducing Chain-of-History (CoH) reasoning. CoH interrogates high-order histories step-by-step with an LLM to reveal semantically rich temporal patterns, while remaining a plug-and-play module that fuses LLM-derived insights with graph-based predictors via a simple score fusion rule. Empirical results on ICEWS datasets show that CoH improves both pure LLM predictions and, more notably, the predictive power of graph-based TKG models, with ablations confirming the value of high-order histories, stepwise inference, and proper candidate-scoring order. The approach highlights a practical path to integrate semantic reasoning into structured temporal graphs, balancing interpretability and performance, and points to future work on adaptive fusion and efficiency improvements.
Abstract
Temporal Knowledge Graph (TKG) forecasting aims to predict future facts based on given histories. Most recent graph-based models excel at capturing structural information within TKGs but lack semantic comprehension abilities. Nowadays, with the surge of LLMs, the LLM-based TKG prediction model has emerged. However, the existing LLM-based model exhibits three shortcomings: (1) It only focuses on the first-order history for prediction while ignoring high-order historical information, resulting in the provided information for LLMs being extremely limited. (2) LLMs struggle with optimal reasoning performance under heavy historical information loads. (3) For TKG prediction, the temporal reasoning capability of LLM alone is limited. To address the first two challenges, we propose Chain-of-History (CoH) reasoning which explores high-order histories step-by-step, achieving effective utilization of high-order historical information for LLMs on TKG prediction. To address the third issue, we design CoH as a plug-and-play module to enhance the performance of graph-based models for TKG prediction. Extensive experiments on three datasets and backbones demonstrate the effectiveness of CoH.
