Conditional Generative Modeling for High-dimensional Marked Temporal Point Processes
Zheng Dong, Zekai Fan, Shixiang Zhu
TL;DR
The paper tackles the challenge of modeling temporal point processes with high-dimensional marks by replacing explicit conditional intensity modeling with a conditional event generator (CEG) that samples future events from history. It introduces a conditional denoising diffusion model (CDDM) as the flagship generator, trained via score matching and optionally guided by classifier-free guidance to produce quality event sequences and marks without thinning. The framework is extended with non-parametric KDE and variational CVAE variants, broadening applicability across data regimes, including low- and high-dimensional marks. Empirical results on synthetic, semi-synthetic (image marks), and real data (earthquakes, crime) show superior performance in both generation quality and efficiency, outperforming strong baselines such as DNSK and ETAS. The approach enables thinning-free, scalable modeling of complex spatio-temporal-content dynamics with practical implications for real-time decision-making and downstream analytics.
Abstract
Point processes offer a versatile framework for sequential event modeling. However, the computational challenges and constrained representational power of the existing point process models have impeded their potential for wider applications. This limitation becomes especially pronounced when dealing with event data that is associated with multi-dimensional or high-dimensional marks such as texts or images. To address this challenge, this study proposes a novel event-generation framework for modeling point processes with high-dimensional marks. We aim to capture the distribution of events without explicitly specifying the conditional intensity or probability density function. Instead, we use a conditional generator that takes the history of events as input and generates the high-quality subsequent event that is likely to occur given the prior observations. The proposed framework offers a host of benefits, including considerable representational power to capture intricate dynamics in multi- or even high-dimensional event space, as well as exceptional efficiency in learning the model and generating samples. Our numerical results demonstrate superior performance compared to other state-of-the-art baselines.
