Learnability of Parameter-Bounded Bayes Nets
Arnab Bhattacharyya, Davin Choo, Sutanu Gayen, Dimitrios Myrisiotis
TL;DR
This work proves that learning parameter-bounded Bayesian networks remains NP-hard even when a distribution is guaranteed to be Markov to some network with a fixed parameter budget. It extends prior hardness results and furnishes a finite-sample guarantee: given samples from a distribution that is Markov to a $p$-parameter Bayes net, one can efficiently select a network and construct a close approximation $\mathbb{Q}$ with $d_{TV}(\mathbb{P}, \mathbb{Q}) \le \varepsilon$, using a bound that scales with $p$, $n$, and alphabet size $|\Sigma|$. The methodological core combines independence-oracle-based reductions, epsilon-net constructions over network structures, and a Scheffé-style tournament to pick a near-best candidate from a finite set, generalizing degree-bounded results to the parameter-bounded regime. These results illuminate both the computational hardness and the learnability landscape for compact probabilistic models, with implications for structure learning under tight resource constraints.
Abstract
Bayes nets are extensively used in practice to efficiently represent joint probability distributions over a set of random variables and capture dependency relations. In a seminal paper, Chickering et al. (JMLR 2004) showed that given a distribution $\mathbb{P}$, that is defined as the marginal distribution of a Bayes net, it is $\mathsf{NP}$-hard to decide whether there is a parameter-bounded Bayes net that represents $\mathbb{P}$. They called this problem LEARN. In this work, we extend the $\mathsf{NP}$-hardness result of LEARN and prove the $\mathsf{NP}$-hardness of a promise search variant of LEARN, whereby the Bayes net in question is guaranteed to exist and one is asked to find such a Bayes net. We complement our hardness result with a positive result about the sample complexity that is sufficient to recover a parameter-bounded Bayes net that is close (in TV distance) to a given distribution $\mathbb{P}$, that is represented by some parameter-bounded Bayes net, generalizing a degree-bounded sample complexity result of Brustle et al. (EC 2020).
