Table of Contents
Fetching ...

Point processes with event time uncertainty

Xiuyuan Cheng, Tingnan Gong, Yao Xie

TL;DR

The paper develops a principled framework for point processes with event-time uncertainty, combining a continuous-time Hawkes model with uncertain observation windows and a discrete-time grid that yields a learnable kernel matrix or tensor. It provides two efficient inference mechanisms, gradient descent for MLE and a monotone variational-inequality approach with an $O(1/k)$ recovery-rate guarantee, applicable to both time-only and networked settings. The framework accommodates non-stationary kernels and offers low-rank or time-invariant structures to reduce complexity, with VI shown to be more numerically stable in practice. Empirical results on synthetic data and real datasets (SADs and Atlanta crime) demonstrate accurate kernel recovery, interpretable dynamic influence networks, and superior predictive performance over standard GLMs and Hawkes approaches. This work enables flexible, time-uncertainty-aware modeling of complex event dynamics in domains ranging from healthcare to criminology.

Abstract

Point processes are widely used statistical models for continuous-time discrete event data, such as medical records, crime reports, and social network interactions, to capture the influence of historical events on future occurrences. In many applications, however, event times are not observed exactly, motivating the need to incorporate time uncertainty into point process modeling. In this work, we introduce a framework for modeling time-uncertain self-exciting point processes, known as Hawkes processes, possibly defined over a network. We begin by formulating the model in continuous time under assumptions motivated by real-world scenarios. By imposing a time grid, we obtain a discrete-time model that facilitates inference and enables computation via first-order optimization methods such as gradient descent and variational inequality (VI). We establish a parameter recovery guarantee for VI inference with an $O(1/k)$ convergence rate using $k$ steps. Our framework accommodates non-stationary processes by representing the influence kernel as a matrix (or tensor on a network), while also encompassing stationary processes, such as the classical Hawkes process, as a special case. Empirically, we demonstrate that the proposed approach outperforms existing baselines on both simulated and real-world datasets, including the sepsis-associated derangement prediction challenge and the Atlanta Police Crime Dataset.

Point processes with event time uncertainty

TL;DR

The paper develops a principled framework for point processes with event-time uncertainty, combining a continuous-time Hawkes model with uncertain observation windows and a discrete-time grid that yields a learnable kernel matrix or tensor. It provides two efficient inference mechanisms, gradient descent for MLE and a monotone variational-inequality approach with an recovery-rate guarantee, applicable to both time-only and networked settings. The framework accommodates non-stationary kernels and offers low-rank or time-invariant structures to reduce complexity, with VI shown to be more numerically stable in practice. Empirical results on synthetic data and real datasets (SADs and Atlanta crime) demonstrate accurate kernel recovery, interpretable dynamic influence networks, and superior predictive performance over standard GLMs and Hawkes approaches. This work enables flexible, time-uncertainty-aware modeling of complex event dynamics in domains ranging from healthcare to criminology.

Abstract

Point processes are widely used statistical models for continuous-time discrete event data, such as medical records, crime reports, and social network interactions, to capture the influence of historical events on future occurrences. In many applications, however, event times are not observed exactly, motivating the need to incorporate time uncertainty into point process modeling. In this work, we introduce a framework for modeling time-uncertain self-exciting point processes, known as Hawkes processes, possibly defined over a network. We begin by formulating the model in continuous time under assumptions motivated by real-world scenarios. By imposing a time grid, we obtain a discrete-time model that facilitates inference and enables computation via first-order optimization methods such as gradient descent and variational inequality (VI). We establish a parameter recovery guarantee for VI inference with an convergence rate using steps. Our framework accommodates non-stationary processes by representing the influence kernel as a matrix (or tensor on a network), while also encompassing stationary processes, such as the classical Hawkes process, as a special case. Empirically, we demonstrate that the proposed approach outperforms existing baselines on both simulated and real-world datasets, including the sepsis-associated derangement prediction challenge and the Atlanta Police Crime Dataset.

Paper Structure

This paper contains 99 sections, 14 theorems, 161 equations, 20 figures, 8 tables, 2 algorithms.

Key Result

Lemma 3.1

The log-likelihood of $\mathcal{L}$ as in eq:def-calL-time-uncert has the expression

Figures (20)

  • Figure 1: Illustration of a real-world medical application scenario leading to discrete event data with time uncertainty.
  • Figure 2: Illustration of the model: (a) Event times without uncertainty in the standard point process model. (b) Continuous-time model with arbitrary-length event window, see Section \ref{['subsec:cont-time-time-only']}. (c) Discrete-time model with arbitrary-sized event uncertainty, see Section \ref{['subsec:discrete-general-uncertainty']}. (d) Discrete-time model with unit-sized event uncertainty, see Section \ref{['subsec:bernoulli-time-only']}.
  • Figure 3: Kernel matrix $K$ in \ref{['eq:def-Kij-discrete-time']} and the reparameterized form $\Psi$ in \ref{['eq:def-Psi-matrix']}. We set $N' = 5$ and $N=15$. The blue (including light and dark blue) elements represent $j > i$, the entries of the kernel matrix that are estimable in $\theta_K$, see \ref{['eq:def-theta-timeonly-Omega']}; The white elements are non-causal (i.e., $j\leq i$) and thus are excluded from $\theta_K$. We apply the low-rank constraint to the submatrix formed by the dark blue entries in $\Psi$.
  • Figure 4: Recovered kernels on simulated time-only event data with kernel function defined in \ref{['eq:time-only-kernel']}, $N'=8$ and $N=32$. (a) The true kernel. (b) The kernel recovered by TULIK-VI. (c) The kernel recovered by TULIK-GD. The white lines indicate the estimable parameters.
  • Figure 5: The probability predictions on three randomly selected testing sequences. Data is the simulated time-only event data with kernel function defined in \ref{['eq:time-only-kernel']}, $N'=8$ and $N=32$.
  • ...and 15 more figures

Theorems & Definitions (33)

  • Lemma 3.1
  • Remark 3.1: Consistency of conditional intensity function definition
  • Remark 3.2: Time-invariant kernel
  • Lemma 3.2
  • Lemma 3.3
  • Proposition 4.1
  • Lemma 4.2
  • Lemma 4.3
  • Lemma 4.4: Strong monotonicity of $G(z)$
  • Theorem 4.5: Kernel recovery by VI
  • ...and 23 more