Distance-based Learning of Hypertrees
Shaun Fallat, Kamyar Khodamoradi, David Kirkpatrick, Valerii Maliuk, S. Ahmad Mojallal, Sandra Zilles
TL;DR
This work defines orderly hypertrees—hypertrees whose skeleton graphs form a tree and fit between known acyclicity classes in the Fagin hierarchy—and shows they can be learned efficiently with shortest-path queries. It develops an online SP-query learning algorithm that inserts vertices incrementally, achieving $O(n\Delta\log_\Delta m)$ query complexity (with an $h^2$ overhead for the initial one-edge phase) and proves this is asymptotically optimal via adversarial lower bounds. The paper also analyzes offline learning (removing the $h^2$ cost) and extends the framework to bounded-distance queries, deriving tight bounds for various $d$-bounded settings. Skeleton graphs enable a concise representation and efficient reconstruction of orderly hypertrees, and the results offer practical insights for streaming or partially revealed data in domains like phylogenetics and database theory.
Abstract
We study the problem of learning hypergraphs with shortest-path queries (SP-queries), and present the first provably optimal online algorithm for a broad and natural class of hypertrees that we call orderly hypertrees. Our online algorithm can be transformed into a provably optimal offline algorithm. Orderly hypertrees can be positioned within the Fagin hierarchy of acyclic hypergraph (well-studied in database theory), and strictly encompass the broadest class in this hierarchy that is learnable with subquadratic SP-query complexity. Recognizing that in some contexts, such as evolutionary tree reconstruction, distance measurements can degrade with increased distance, we also consider a learning model that uses bounded distance queries. In this model, we demonstrate asymptotically tight complexity bounds for learning general hypertrees.
