Maximally probable tree topologies with $r$-furcation
Emily H. Dickey, Noah A. Rosenberg
TL;DR
This work identifies the unique maximally probable unlabeled topology for rooted $r$-furcating trees with $n=w(r-1)+1$ leaves by leveraging a deep link to Huffman trees. It shows that the $H$-tree constructed from a uniform weight vector uniquely minimizes the sum of $\log(m(v)-1)$ over internal nodes, thereby maximizing the number of labeled histories and generalizing the Harding--Hammersley--Grimmett result for bifurcating trees. The approach combines majorization theory, Schur-convexity, and the $r$-merge operation to derive an explicit recursive shape for $U_n^*$. It also extends the framework to simultaneous branching, providing conjectures and data for trifurcating cases, and suggests a broad information-theoretic lens on phylogenetic topology selection.
Abstract
For a specific rooted labeled tree topology, a labeled history is a sequence of branchings that give rise to that labeled topology as it unfolds over time. Here, for $r$-furcating trees, we use a connection with Huffman trees from information theory to identify maximally probable rooted trees -- unlabeled $r$-furcating topologies whose labelings each have a number of labeled histories greater than or equal to those of all other labeled topologies. Our characterization of the unique maximally probable $r$-furcating unlabeled topology generalizes the Harding--Hammersley--Grimmett result identifying the maximally probable bifurcating unlabeled topology, and it provides a new proof for that result. We present a conjecture for the maximally probable $r$-furcating unlabeled topology if labeled histories are tabulated allowing for simultaneous branching events across multiple internal nodes of a tree.
