Language-TPP: Integrating Temporal Point Processes with Language Models for Event Analysis
Quyu Kong, Yixuan Zhang, Yang Liu, Panrong Tong, Enqi Liu, Feng Zhou
TL;DR
Language-TPP presents a unified framework that integrates Temporal Point Processes with Large Language Models by introducing a novel temporal byte-token encoding to represent time intervals within standard LLM tokenizers. Using a decoder-only Transformer backbone, it models sequences of events described by time, type, and natural language descriptions, enabling next-event time, type, intensity, and description generation. The method achieves state-of-the-art results on multiple TPP tasks and demonstrates superior quality in generated event descriptions, particularly on datasets with textual descriptions. This approach offers a practical pathway for jointly modeling temporal dynamics and textual multimodal event data, with implications for more expressive predictive analytics and explanation generation in real-world sequences.
Abstract
Temporal Point Processes (TPPs) have been widely used for event sequence modeling, but they often struggle to incorporate rich textual event descriptions effectively. Conversely, while Large Language Models (LLMs) have been shown remarkable capabilities in processing textual data, they lack mechanisms for handling temporal dynamics. To bridge this gap, we introduce Language-TPP, a unified framework that integrates TPPs with LLMs for enhanced event sequence modeling. Language-TPP introduces a novel temporal encoding mechanism that converts continuous time intervals into specialized byte-tokens, enabling seamless integration with standard LLM architectures. This approach allows Language-TPP to achieve state-of-the-art performance across multiple TPP tasks, including event time prediction, type prediction, and intensity estimation, on five datasets. Additionally, we demonstrate that incorporating temporal information significantly improves the quality of generated event descriptions.
