Phylogenetic Network Diversity Parameterized by Reticulation Number and Beyond
Leo van Iersel, Mark Jones, Jannik Schestag, Celine Scornavacca, Mathias Weller
TL;DR
This work analyzes Network-PD, the diversity score on rooted phylogenetic networks with inheritance probabilities, focusing on Max-Network-PD. It proves fixed-parameter tractability with respect to the reticulation number $r$ for binary networks via an efficient branching algorithm, while showing NP-hardness even for level-1 networks, thereby limiting the effectiveness of level-based parameterization. The hardness results are established through a chain of reductions, notably Subset Product to Penalty Sum (first with irrational numbers, then via rationalization) and a reduction from unit-cost-NAP to Max-Network-PD on level-1 networks using leaf gadgets. Together, these results delineate the boundary between tractable and intractable instances and highlight the need for alternative approaches or parameterizations in practice.
Abstract
Network Phylogenetic Diversity (Network-PD) is a measure for the diversity of a set of species based on a rooted phylogenetic network (with branch lengths and inheritance probabilities on the reticulation edges) describing the evolution of those species. We consider the Max-Network-PD problem: Given such a network, find k species with maximum Network-PD score. We show that this problem is fixed-parameter tractable (FPT) for binary networks, by describing an optimal algorithm running in O(2r log(k)(n + r)) time, with n the total number of species in the network and r its reticulation number. Furthermore, we show that Max-Network-PD is NP-hard for level-1 networks, proving that, unless P=NP, the FPT approach cannot be extended by using the level as parameter instead of the reticulation number.
