Network Representation and Modular Decomposition of Combinatorial Structures: A Galled-Tree Perspective
Anna Lindeberg, Guillaume E. Scholz, Marc Hellmuth
TL;DR
The paper addresses how to explain symbolic dating maps beyond trees by introducing strudigrams and leveraging modular decomposition to replace prime vertices with networks. It develops a general framework of prime-vertex replacement (pvr) networks and then focuses on GaTEx strudigrams, which are exactly those explainable by strong, elementary, quasi-discriminating galled-trees. The authors provide a complete characterization via polar-cats, prove polynomial-time recognition and construction algorithms (Check_polar-cat and pvr), and show that GaTEx networks can explain a broad class of strudigrams with manageable size. This work enables compact, explainable phylogenetic networks for symbolic dating data and suggests future directions toward level-k networks and forbidden-substructure characterizations.
Abstract
In phylogenetics, reconstructing rooted trees from distances between taxa is a common task. Böcker and Dress generalized this concept by introducing symbolic dated maps $δ:X \times X \to Υ$, where distances are replaced by symbols, and showed that there is a one-to-one correspondence between symbolic ultrametrics and labeled rooted phylogenetic trees. Many combinatorial structures fall under the umbrella of symbolic dated maps, such as 2-dissimilarities, symmetric labeled 2-structures, or edge-colored complete graphs, and are here referred to as strudigrams. Strudigrams have a unique decomposition into non-overlapping modules, which can be represented by a modular decomposition tree (MDT). In the absence of prime modules, strudigrams are equivalent to symbolic ultrametrics, and the MDT fully captures the relationships $δ(x,y)$ between pairs of vertices $x,y$ in $X$ through the label of their least common ancestor in the MDT. However, in the presence of prime vertices, this information is generally hidden. To provide this missing structural information, we aim to locally replace the prime vertices in the MDT to obtain networks that capture full information about the strudigrams. While starting with the general framework of prime-vertex replacement networks, we then focus on a specific type of such networks obtained by replacing prime vertices with so-called galls, resulting in labeled galled-trees. We introduce the concept of galled-tree explainable (GATEX) strudigrams, provide their characterization, and demonstrate that recognizing these structures and reconstructing the labeled networks that explain them can be achieved in polynomial time.
