Likelihood-Free Estimation for Spatiotemporal Hawkes processes with missing data and application to predictive policing
Pramit Das, Moulinath Banerjee, Yuekai Sun
TL;DR
The paper tackles parameter estimation for spatiotemporal Hawkes processes in predictive policing when crime data are incomplete due to non-reporting. It proposes a likelihood-free, Wasserstein GAN framework with exact, interpretable Hawkes generators to recover the underlying parameters $\theta=(\mu,\alpha,\beta,\sigma^2)$ from distorted observations. Applied to a Bogotá-based crime simulation, the method improves parameter recovery and hotspot forecasts despite substantial missingness, demonstrated through interarrival-based goodness-of-fit tests and hotspot overlap metrics. This work advances accountability in predictive policing by enabling robust inference under missing data and discusses identifiability and future extensions to richer spatiotemporal generators and unknown missingness mechanisms.
Abstract
With the growing use of AI technology, many police departments use forecasting software to predict probable crime hotspots and allocate patrolling resources effectively for crime prevention. The clustered nature of crime data makes self-exciting Hawkes processes a popular modeling choice. However, one significant challenge in fitting such models is the inherent missingness in crime data due to non-reporting, which can bias the estimated parameters of the predictive model, leading to inaccurate downstream hotspot forecasts, often resulting in over or under-policing in various communities, especially the vulnerable ones. Our work introduces a Wasserstein Generative Adversarial Networks (WGAN) driven likelihood-free approach to account for unreported crimes in Spatiotemporal Hawkes models. We demonstrate through empirical analysis how this methodology improves the accuracy of parametric estimation in the presence of data missingness, leading to more reliable and efficient policing strategies.
