A master equation approach to the n-coalescent problem
Bahram Houchmandzadeh
TL;DR
The paper addresses the n-coalescent problem by reframing it in terms of the joined state $(n,t)$ and the probability $P(n,t)$, from which the MRCA distribution and tree-width statistics can be derived. It develops a master equation with a triangular jump generator $W$, enabling exact solutions for both continuous-time Moran and discrete-time Wright-Fisher models, and provides explicit expressions for $P(n,t)$, the MRCA CDF $F_1(t)=P(1,t)$, and moments of $t$ and $n(t)$ via recursion on an amplitude matrix $A$. The key contribution is a general, efficient framework that unifies continuous and discrete coalescent analyses and extends naturally to multi-step models, improving theoretical tractability and numerical computability. This approach offers a robust foundation for Coalescent computations and opens avenues for incorporating structure, such as the structured coalescent, using the same master-equation formalism.
Abstract
Given an evolutionary model, such as Wright--Fisher (WF) or Moran, the n-coalescent problem consists of going backward in time to find for example the time to the most recent common ancestor (MRCA) and the topology of the tree. In the literature, this problem is tackled mostly by computing directly the random variable t, time to reach the MRCA. I show here that by shifting the focus from the random variable t to the joined variable (n,t), where n is the number of ancestors at time t, the problem is greatly simplified. Indeed, P(n,t), the probability of this variable, obeys a simpler master equation that can be solved in a straightforward way for the most general model. This probability can then be used to compute relevant information of the n-coalescent, for both random variables $t_{n}$ (random time to reach a given state n) and $n_{t}$ (random number of ancestors at a given time t). The cumulative distribution function for $t_{1}$ for example is $P(1,t)$. I give in this article the general solution for continuous time models such as Moran and discrete time ones such as WF.
