Beyond the Chinese Restaurant and Pitman-Yor processes: Statistical Models with Double Power-law Behavior
Fadhel Ayed, Juho Lee, François Caron
TL;DR
This work introduces a novel class of doubly regularly-varying CRMs whose normalized forms produce random partitions with a two-regime power-law in frequencies, addressing empirical evidence of double power-law behavior in language and networks. It develops two concrete instantiations, the generalized BFRY (GBFRY) and beta prime (BP) processes, and provides scalable posterior inference via MCMC augmented with latent variables to handle intractable likelihood components. The authors demonstrate that these models fit large-frequency data better than the Pitman–Yor process and related normalized CRMs across synthetic and real datasets, including word frequencies and Twitter networks. The results offer a flexible, theoretically grounded approach to modeling two-regime power-laws with potential extensions to hierarchical or graph-based Bayesian nonparametrics, advancing both theory and practical modeling of heavy-tailed phenomena.
Abstract
Bayesian nonparametric approaches, in particular the Pitman-Yor process and the associated two-parameter Chinese Restaurant process, have been successfully used in applications where the data exhibit a power-law behavior. Examples include natural language processing, natural images or networks. There is also growing empirical evidence that some datasets exhibit a two-regime power-law behavior: one regime for small frequencies, and a second regime, with a different exponent, for high frequencies. In this paper, we introduce a class of completely random measures which are doubly regularly-varying. Contrary to the Pitman-Yor process, we show that when completely random measures in this class are normalized to obtain random probability measures and associated random partitions, such partitions exhibit a double power-law behavior. We discuss in particular three models within this class: the beta prime process (Broderick et al. (2015, 2018), a novel process called generalized BFRY process, and a mixture construction. We derive efficient Markov chain Monte Carlo algorithms to estimate the parameters of these models. Finally, we show that the proposed models provide a better fit than the Pitman-Yor process on various datasets.
