Variational Bayesian Phylogenetic Inference with Semi-implicit Branch Length Distributions
Tianyu Xie, Frederick A. Matsen, Marc A. Suchard, Cheng Zhang
TL;DR
This work tackles the inefficiency of exploring large, multimodal tree spaces in Bayesian phylogenetics by adopting variational inference with a flexible semi-implicit branch length model. It introduces VBPI-SIBranch, which uses graph neural networks to produce permutation-invariant, semi-implicit branch length posteriors conditioned on topology, and couples this with two surrogate lower bounds, MSILB and MIWLB, to train over both topology and branch lengths. The approach yields improved marginal likelihood estimates and tighter branch-length posterior approximations across eight benchmark datasets, with MIWLB providing the strongest performance in many cases. Overall, the method demonstrates that rich variational families coupled with principled lower-bound surrogates can offer scalable, accurate alternatives to MCMC for phylogenetic inference and opens avenues for conditioning mixing distributions on topology through learned graph representations.
Abstract
Reconstructing the evolutionary history relating a collection of molecular sequences is the main subject of modern Bayesian phylogenetic inference. However, the commonly used Markov chain Monte Carlo methods can be inefficient due to the complicated space of phylogenetic trees, especially when the number of sequences is large. An alternative approach is variational Bayesian phylogenetic inference (VBPI) which transforms the inference problem into an optimization problem. While effective, the default diagonal lognormal approximation for the branch lengths of the tree used in VBPI is often insufficient to capture the complexity of the exact posterior. In this work, we propose a more flexible family of branch length variational posteriors based on semi-implicit hierarchical distributions using graph neural networks. We show that this semi-implicit construction emits straightforward permutation equivariant distributions, and therefore can handle the non-Euclidean branch length space across different tree topologies with ease. To deal with the intractable marginal probability of semi-implicit variational distributions, we develop several alternative lower bounds for stochastic optimization. We demonstrate the effectiveness of our proposed method over baseline methods on benchmark data examples, in terms of both marginal likelihood estimation and branch length posterior approximation.
