Maximum likelihood estimation for spinal-structured trees
Romain Azaïs, Benoît Henry
TL;DR
The paper addresses the challenge of estimating birth distributions in spinal-structured multi-type Galton-Watson trees with unobserved types, introducing a two-type model with a spine governed by a latent function $f$ and a normal distribution $\mu$. It develops a maximum-likelihood-like approach within the SST framework, employing an 'Ugly Duckling' spine selector to identify the spine and then corrects $\mu$ to estimate $f$, proving almost sure convergence under the condition $\log m(\mu)-\mathfrak{D}(\mathcal{B}\mu,\nu)<0$. Theoretical contributions include rate-function analysis for large deviations in spine-sample selection and continuity results, extending identifiability across subcritical, critical, and supercritical growth regimes. The paper also provides extensive simulations validating consistency and proposes an asymptotic test to distinguish populations conditioned on survival from those that are not, with practical implications for inference in conditioned branching processes.
Abstract
We investigate some aspects of the problem of the estimation of birth distributions (BD) in multi-type Galton-Watson trees (MGW) with unobserved types. More precisely, we consider two-type MGW called spinal-structured trees. This kind of tree is characterized by a spine of special individuals whose BD $ν$ is different from the other individuals in the tree (called normal whose BD is denoted $μ$). In this work, we show that even in such a very structured two-type population, our ability to distinguish the two types and estimate $μ$ and $ν$ is constrained by a trade-off between the growth-rate of the population and the similarity of $μ$ and $ν$. Indeed, if the growth-rate is too large, large deviations events are likely to be observed in the sampling of the normal individuals preventing us to distinguish them from special ones. Roughly speaking, our approach succeeds if $r<\mathfrak{D}(μ,ν)$ where $r$ is the exponential growth-rate of the population and $\mathfrak{D}$ is a divergence measuring the dissimilarity between $μ$ and $ν$.
