Genealogical processes of sequential Monte Carlo methods and other non-neutral population models under rapid mutation
Jere Koskela, Paul A. Jenkins, Adam M. Johansen, Dario Spano
TL;DR
The paper addresses how genealogies in non-neutral population models and SMC algorithms converge to the Kingman coalescent under a specific time rescaling, clarifying when and how different resampling schemes affect coalescence. It introduces a restated convergence theorem with explicit holding-time and merger-scale quantities $c_N(\xi,\ell,j;k)$ and $\tau_N(\xi,\ell,j;t)$, and proves convergence under four strong but verifiable mixing assumptions. The authors provide a substantially simplified proof framework via holding-time analysis and non-Markovian-to-Markovian stitching, and correct prior errors in the conditional transition descriptions. The results have practical implications for designing SMC algorithms, showing how edge-case resampling choices influence genealogical degeneracy and the effective speed of coalescence across multinomial, stratified, and stochastic rounding schemes, with a careful note on the limits of comparing schemes through simple moment orders.
Abstract
We show that genealogical trees arising from a broad class of non-neutral models of population evolution converge to the Kingman coalescent under a suitable rescaling of time. As well as non-neutral biological evolution, our results apply to genetic algorithms encompassing the prominent class of sequential Monte Carlo (SMC) methods. The time rescaling we need differs slightly from that used in classical results for convergence to the Kingman coalescent, which has implications for the performance of different resampling schemes in SMC algorithms. In addition, our work substantially simplifies earlier proofs of convergence to the Kingman coalescent, and corrects an error common to several earlier results.
