Statistics of leaves in growing random trees
Harrison Hartle, P. L. Krapivsky
TL;DR
This work introduces leaf degree as a natural statistic for sparse growing trees and develops a coherent leaf-based framework for two growth paradigms: random recursive trees (RRTs) and leaf-based preferential attachment (leaf-PA). Using generating functions and age-stratified analyses, the authors derive exact and asymptotic forms for the leaf-degree distribution in RRTs, including a factorial decay $m_\ell$ and a Poisson primordial-leaf-degree law, together with precise results for the distribution and cumulants of the total leaf count $\mathcal{L}_N$. In leaf-PA models, a rich set of tail behaviours emerges: a power-law tail for $0<a<1$, a stretched exponential at the critical $a=1$, and exponential tails with algebraic prefactors for $a>1$, with detailed expressions for the primordial and age-stratified leaf statistics; a conjectured equivalence between leaf-degree and degree-tail exponents in the scale-free regime is proposed and partially validated against degree-based PA with matched parameters. The results provide tractable,Analytics-friendly tools for leaf-based statistics in sparse networks and suggest broad avenues for extending leaf-centric analyses to real data and broader graph models.
Abstract
Leaves, i.e., vertices of degree one, can play a significant role in graph structure, especially in sparsely connected settings in which leaves often constitute the largest fraction of vertices. We consider a leaf-based counterpart of the degree, namely, the leaf degree -- the number of leaves a vertex is connected to -- and the associated leaf degree distribution, analogous to the degree distribution. We determine the leaf degree distribution of random recursive trees (RRTs) and trees grown via a leaf-based preferential attachment mechanism that we introduce. The RRT leaf degree distribution decays factorially, in contrast with its purely geometric degree distribution. In the one-parameter leaf-based growth model, each new vertex attaches to an existing vertex with rate $\ell$ + a, where $\ell$ is the leaf degree of the existing vertex, and a > 0. The leaf degree distribution has a powerlaw tail when 0 < a < 1 and an exponential tail (with algebraic prefactor) for a > 1. The critical case of a = 1 has a leaf degree distribution with stretched exponential tail. We compute a variety of additional characteristics in these models and conjecture asymptotic equivalence of degree and leaf degree powerlaw tail exponent in the scale free regime. We highlight several avenues of possible extension for future studies.
