PCA for Point Processes
Franck Picard, Vincent Rivoirard, Angelina Roche, Victor Panaretos
TL;DR
PCA for Point Processes develops a population-level, functional PCA framework for replicated point processes by embedding each realization as a random measure via its cumulative mass function. It establishes a Karhunen–Loève expansion for these measures and a Mercer representation for the covariance measure, introducing principal measures that govern latent dynamics. The approach yields explicit eigenstructure results for Poisson and Hawkes processes and provides a fully data-driven, smoothing-free estimator with parametric convergence rates, validated through simulations and diverse applications. The methodology enables interpretable dimension reduction and visualization of variability in replicated point patterns, with broad applicability to seismology, single-cell epigenomics, and neuroscience, and is implemented in the R package pppca.
Abstract
We introduce a novel statistical framework for the analysis of replicated point processes that allows for the study of point pattern variability at a population level. By treating point process realizations as random measures, we adopt a functional analysis perspective and propose a form of functional Principal Component Analysis (fPCA) for point processes. The originality of our method is to base our analysis on the cumulative mass functions of the random measures which gives us a direct and interpretable analysis. Key theoretical contributions include establishing a Karhunen-Loève expansion for the random measures and a Mercer Theorem for covariance measures. We establish convergence in a strong sense, and introduce the concept of principal measures, which can be seen as latent processes governing the dynamics of the observed point patterns. We propose an easy-to-implement estimation strategy of eigenelements for which parametric rates are achieved. We fully characterize the solutions of our approach to Poisson and Hawkes processes and validate our methodology via simulations and diverse applications in seismology, single-cell biology and neurosiences, demonstrating its versatility and effectiveness. Our method is implemented in the pppca R-package.
