Learning bounded-degree polytrees with known skeleton
Davin Choo, Joy Qiping Yang, Arnab Bhattacharyya, Clément L. Canonne
TL;DR
This work proves finite-sample, computationally efficient learnability for bounded-degree polytrees with a known skeleton, extending tree-based results to $d$-polytrees. It introduces a three-phase orientation algorithm guided by mutual information and conditional mutual information testers, achieving KL-error $\le\varepsilon$ with sample complexity $m = \tilde{\Omega}\left( \frac{n \cdot |\Sigma|^{d+1}}{\varepsilon} \log \frac{1}{\delta} \right)$ and running in polynomial time in $m$, $|\Sigma|^d$, and $n^d$. A matching information-theoretic lower bound shows that dependencies on $n$ and $\varepsilon$ are near-optimal, even when the skeleton is known; a Chow-Liu-based skeleton-recovery condition further supports practical applicability. The results rely on an efficient MI/CMI tester with carefully calibrated thresholds, and Meek rules to propagate orientations while preserving ground-truth structure. This work advances PAC-learning of structured high-dimensional distributions, enabling scalable learning of bounded-degree polytrees under realistic assumptions.
Abstract
We establish finite-sample guarantees for efficient proper learning of bounded-degree polytrees, a rich class of high-dimensional probability distributions and a subclass of Bayesian networks, a widely-studied type of graphical model. Recently, Bhattacharyya et al. (2021) obtained finite-sample guarantees for recovering tree-structured Bayesian networks, i.e., 1-polytrees. We extend their results by providing an efficient algorithm which learns $d$-polytrees in polynomial time and sample complexity for any bounded $d$ when the underlying undirected graph (skeleton) is known. We complement our algorithm with an information-theoretic sample complexity lower bound, showing that the dependence on the dimension and target accuracy parameters are nearly tight.
