Score Matching for Estimating Finite Point Processes
Haoqun Cao, Yixuan Zhang, Feng Zhou
TL;DR
This work provides a mathematically rigorous framework for score matching on finite point processes using Janossy measures, revealing fundamental limitations of prior implicit/Autoregressive SM approaches in finite domains. It introduces a weighted score matching (WSM) and an autoregressive variant (AWSM) with provable consistency and finite-sample guarantees in parametric settings, and shows non-identifiability issues in nonparametric models. To address normalization ambiguities, the authors add a survival-classification augmentation that yields an entirely integration-free training objective applicable to intensity-based nonparametric models for spatio-temporal data. Empirically, AWSM achieves accuracy comparable to maximum likelihood estimation while offering significant computational efficiency across synthetic and real temporal and spatio-temporal datasets, including deep point-process models. The framework thus enables scalable, provably sound training of both classical and deep point processes, with practical benefits for diverse applications.
Abstract
Score matching estimators have garnered significant attention in recent years because they eliminate the need to compute normalizing constants, thereby mitigating the computational challenges associated with maximum likelihood estimation (MLE).While several studies have proposed score matching estimators for point processes, this work highlights the limitations of these existing methods, which stem primarily from the lack of a mathematically rigorous analysis of how score matching behaves on finite point processes -- special random configurations on bounded spaces where many of the usual assumptions and properties of score matching no longer hold. To this end, we develop a formal framework for score matching on finite point processes via Janossy measures and, within this framework, introduce an (autoregressive) weighted score-matching estimator, whose statistical properties we analyze in classical parametric settings. For general nonparametric (e.g., deep) point process models, we show that score matching alone does not uniquely identify the ground-truth distribution due to subtle normalization issues, and we propose a simple survival-classification augmentation that yields a complete, integration-free training objective for any intensity-based point process model for spatio-temporal case. Experiments on synthetic and real-world temporal and spatio-temporal datasets, demonstrate that our method accurately recovers intensities and achieves performance comparable to MLE with better efficiency.
