Large Language Models as Interpolated and Extrapolated Event Predictors
Libo Zhang, Yue Ning
TL;DR
LEAP reframes sociopolitical event prediction as language-understanding and reasoning by leveraging quintuple-based data and fine-tuned language models. It combines two OP pathways—a ranking-based encoder-structure with ConvTransE and a QA-based generative approach using FLAN-T5BASE—and a MEF pathway that uses RoBERTaLARGE encodings with self-attention to predict future relations. Across ICEWS DVN/28075_2015 datasets, LEAPOP1, LEAPOP2, and LEAPMEF achieve strong accuracy and recall improvements over baselines, with open-source LLMs offering favorable cost-performance trade-offs relative to commercial APIs. Limitations include dataset scope and the need for retrieval-augmented or chain-of-thought prompting to further enhance temporal reasoning, suggesting a clear direction for future work.
Abstract
Salient facts of sociopolitical events are distilled into quadruples following a format of subject, relation, object, and timestamp. Machine learning methods, such as graph neural networks (GNNs) and recurrent neural networks (RNNs), have been built to make predictions and infer relations on the quadruple-based knowledge graphs (KGs). In many applications, quadruples are extended to quintuples with auxiliary attributes such as text summaries that describe the quadruple events. In this paper, we comprehensively investigate how large language models (LLMs) streamline the design of event prediction frameworks using quadruple-based or quintuple-based data while maintaining competitive accuracy. We propose LEAP, a unified framework that leverages large language models as event predictors. Specifically, we develop multiple prompt templates to frame the object prediction (OP) task as a standard question-answering (QA) task, suitable for instruction fine-tuning with an encoder-decoder LLM. For multi-event forecasting (MEF) task, we design a simple yet effective prompt template for each event quintuple. This novel approach removes the need for GNNs and RNNs, instead utilizing an encoder-only LLM to generate fixed intermediate embeddings, which are processed by a customized downstream head with a self-attention mechanism to predict potential relation occurrences in the future. Extensive experiments on multiple real-world datasets using various evaluation metrics validate the effectiveness of our approach.
