A unified construction for series representations and finite approximations of completely random measures
Juho Lee, Xenia Miscouridou, François Caron
TL;DR
This work unifies and extends the construction of series representations and finite-dimensional approximations for infinite-activity completely random measures by embedding Poisson random measures in an augmented space and applying arrival-time kernels. The framework encompasses inverse-Lévy and size-biased representations as special cases, and yields new series and iid representations for important CRMs such as the generalized gamma process and stable beta process, along with systematic truncation-error analysis. It provides concrete instances (deterministic, exponential, gamma, inverse gamma, and generalized Pareto arrival times) and derives tractable, conjugate-like distributions (e.g., etBFRY, gBFRY) that facilitate simulation and posterior inference. The truncation results quantify approximation accuracy for functionals and marginal likelihoods, enabling reliable finite-dimensional approximations for Bayesian nonparametric modeling. Overall, the paper offers a practical, theoretically grounded toolkit for scalable inference with infinite-activity CRMs and their common instantiations.
Abstract
Infinite-activity completely random measures (CRMs) have become important building blocks of complex Bayesian nonparametric models. They have been successfully used in various applications such as clustering, density estimation, latent feature models, survival analysis, and network science. Popular infinite-activity CRMs include the (generalized) gamma process and the (stable) beta process. However, except in some specific cases, exact simulation or scalable inference with these models is challenging, and finite-dimensional approximations are often considered. In this work, we propose a general and unified framework to derive both series representations and finite-dimensional approximations of CRMs. Our framework can be seen as an extension of constructions based on size-biased sampling of Poisson point processes [Perman1992]. It includes as special cases several known series representations as well as novel ones. In particular, we show that one can get novel series representations for the generalized gamma process and the stable beta process. We also provide some analysis of the truncation error.
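
To make the notions of a series representation and its truncation error concrete, the sketch below simulates a classical Bondesson-type series for the gamma process with Lévy measure alpha * w^{-1} e^{-w} dw. It is an illustration of the general idea only and is not the construction proposed in this work. The function names, the parameter values (alpha = 2, K = 50), and the choice to simulate jump sizes without atom locations are assumptions made for the example. Under this representation the k-th jump is V_k * exp(-Gamma_k / alpha), where Gamma_k are the arrival times of a unit-rate Poisson process and V_k are iid Exp(1), so the expected mass discarded after K terms is alpha * (alpha / (alpha + 1))^K.

```python
import numpy as np


def truncated_gamma_process_weights(alpha, K, n_samples, rng):
    """Draw jump sizes from a K-term Bondesson-type series for a gamma process.

    Illustrative sketch: the Levy measure is assumed to be
    alpha * w^{-1} * exp(-w) dw, and atom locations are omitted.
    Returns an array of shape (n_samples, K).
    """
    # Arrival times Gamma_1 < Gamma_2 < ... of a unit-rate Poisson process.
    arrivals = np.cumsum(rng.exponential(1.0, size=(n_samples, K)), axis=1)
    # Independent Exp(1) marks attached to each arrival.
    marks = rng.exponential(1.0, size=(n_samples, K))
    return marks * np.exp(-arrivals / alpha)


def expected_discarded_mass(alpha, K):
    """Expected total mass of the jumps with index greater than K."""
    return alpha * (alpha / (alpha + 1.0)) ** K


rng = np.random.default_rng(0)
alpha, K = 2.0, 50  # hypothetical values chosen for illustration
weights = truncated_gamma_process_weights(alpha, K, n_samples=20000, rng=rng)
totals = weights.sum(axis=1)

# The untruncated total mass is Gamma(alpha, 1), with mean and variance alpha.
print("empirical mean of total mass:", totals.mean())
print("empirical variance of total mass:", totals.var())
print("expected discarded mass:", expected_discarded_mass(alpha, K))
```

With these settings the expected discarded mass is negligible, so the empirical mean and variance of the truncated total mass should both be close to alpha. Smaller K or larger alpha makes the geometric decay slower and the truncation error correspondingly larger.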
