Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations
Yujee Song, Donghyun Lee, Rui Meng, Won Hwa Kim
TL;DR
This work tackles the challenge of modeling marked temporal point processes by decoupling the influence of individual events and learning their continuous dynamics with Neural Ordinary Differential Equations. Each past event e_i contributes an independently evolving hidden state h(t;e_i) that generates a per-event influence μ(t;e_i)used to form the ground intensity λ_g^*(t|H_t) and the mark distribution f^*(k|t) through linear, parallelizable aggregations. The Linear Dec-ODE instantiation enables efficient, fixed-step trajectory computation and exact likelihood-based training, achieving competitive or superior results on multiple real-world datasets while offering interpretable visualizations of event-specific dynamics. The framework's parallel training scheme and explicability open avenues for downstream tasks such as survival analysis and out-of-distribution detection in asynchronous event data.
Abstract
A Marked Temporal Point Process (MTPP) is a stochastic process whose realization is a set of event-time data. MTPP is often used to understand complex dynamics of asynchronous temporal events such as money transaction, social media, healthcare, etc. Recent studies have utilized deep neural networks to capture complex temporal dependencies of events and generate embedding that aptly represent the observed events. While most previous studies focus on the inter-event dependencies and their representations, how individual events influence the overall dynamics over time has been under-explored. In this regime, we propose a Decoupled MTPP framework that disentangles characterization of a stochastic process into a set of evolving influences from different events. Our approach employs Neural Ordinary Differential Equations (Neural ODEs) to learn flexible continuous dynamics of these influences while simultaneously addressing multiple inference problems, such as density estimation and survival rate computation. We emphasize the significance of disentangling the influences by comparing our framework with state-of-the-art methods on real-life datasets, and provide analysis on the model behavior for potential applications.
