A Martingale Approach to Large-$θ$ Ewens-Pitman Model
Rodrigo Ribeiro
TL;DR
The paper addresses the asymptotic behavior of the number of blocks $K_n$ in the Ewens--Pitman partition under the linear growth regime $\theta=\lambda n$. It develops a martingale-based framework via the Generalized Chinese Restaurant Process to obtain concise LLN and CLT proofs, with refined Berry--Esseen-type bounds. A sharp CLT is established: for $\alpha=0$ the rate is $O(n^{-1/2})$, while for $\alpha\in(0,1)$ the rate is $O(n^{-1/5+\varepsilon})$, addressing a gap for the reinforced regime. The approach yields shorter proofs (compared to prior work) and sharper convergence rates, providing a unified method that highlights the role of martingale structure and conditional variances in large-sample behavior of partition models.
Abstract
We investigate the asymptotic behavior of the number of parts $K_n$ in the Ewens--Pitman partition model under the regime where the diversity parameter is scaled linearly with the sample size, that is, $θ= λn$ for some~$λ> 0$. While recent work has established a law of large numbers (LLN) and a central limit theorem (CLT) for $K_n$ in this regime, we revisit these results through a martingale-based approach. Our method yields significantly shorter proofs, and leads to sharper convergence rates in the CLT, including improved Berry--Esseen bounds in the case $α= 0$, and a new result for the regime $α\in (0,1)$, filling a gap in the literature.
