Table of Contents
Fetching ...

The Complexity of Bayesian Network Learning: Revisiting the Superstructure

Robert Ganian, Viktoriia Korchemna

TL;DR

The paper advances the parameterized complexity landscape of Bayesian Network Structure Learning (BNSL) by analyzing superstructure-based parameters. It establishes fixed-parameter tractability for BNSL under edge-deletion distance via a polynomial kernel with at most $16k$ variables when parameterized by the feedback edge number $\operatorname{fen}$, and also proves FPT results for the localized variant $\operatorname{lfen}$ through dynamic programming, while providing tight hardness results for treecut width and related parameters. It further distinguishes between the non-zero and additive score representations, showing that the additive representation yields FPT results with treewidth, and extends these insights to Polytree Learning (PL), including NP-hardness in the additive setting and a polynomial-time solution for PL$^+$. Overall, the work maps the tractability boundaries across a broad set of graph-structural parameters and input representations, offering a framework that blends kernelization, Courcelle-type reasoning, and matroid-based methods for tractable learning of graphical models.

Abstract

We investigate the parameterized complexity of Bayesian Network Structure Learning (BNSL), a classical problem that has received significant attention in empirical but also purely theoretical studies. We follow up on previous works that have analyzed the complexity of BNSL w.r.t. the so-called superstructure of the input. While known results imply that BNSL is unlikely to be fixed-parameter tractable even when parameterized by the size of a vertex cover in the superstructure, here we show that a different kind of parameterization - notably by the size of a feedback edge set - yields fixed-parameter tractability. We proceed by showing that this result can be strengthened to a localized version of the feedback edge set, and provide corresponding lower bounds that complement previous results to provide a complexity classification of BNSL w.r.t. virtually all well-studied graph parameters. We then analyze how the complexity of BNSL depends on the representation of the input. In particular, while the bulk of past theoretical work on the topic assumed the use of the so-called non-zero representation, here we prove that if an additive representation can be used instead then BNSL becomes fixed-parameter tractable even under significantly milder restrictions to the superstructure, notably when parameterized by the treewidth alone. Last but not least, we show how our results can be extended to the closely related problem of Polytree Learning.

The Complexity of Bayesian Network Learning: Revisiting the Superstructure

TL;DR

The paper advances the parameterized complexity landscape of Bayesian Network Structure Learning (BNSL) by analyzing superstructure-based parameters. It establishes fixed-parameter tractability for BNSL under edge-deletion distance via a polynomial kernel with at most variables when parameterized by the feedback edge number , and also proves FPT results for the localized variant through dynamic programming, while providing tight hardness results for treecut width and related parameters. It further distinguishes between the non-zero and additive score representations, showing that the additive representation yields FPT results with treewidth, and extends these insights to Polytree Learning (PL), including NP-hardness in the additive setting and a polynomial-time solution for PL. Overall, the work maps the tractability boundaries across a broad set of graph-structural parameters and input representations, offering a framework that blends kernelization, Courcelle-type reasoning, and matroid-based methods for tractable learning of graphical models.

Abstract

We investigate the parameterized complexity of Bayesian Network Structure Learning (BNSL), a classical problem that has received significant attention in empirical but also purely theoretical studies. We follow up on previous works that have analyzed the complexity of BNSL w.r.t. the so-called superstructure of the input. While known results imply that BNSL is unlikely to be fixed-parameter tractable even when parameterized by the size of a vertex cover in the superstructure, here we show that a different kind of parameterization - notably by the size of a feedback edge set - yields fixed-parameter tractability. We proceed by showing that this result can be strengthened to a localized version of the feedback edge set, and provide corresponding lower bounds that complement previous results to provide a complexity classification of BNSL w.r.t. virtually all well-studied graph parameters. We then analyze how the complexity of BNSL depends on the representation of the input. In particular, while the bulk of past theoretical work on the topic assumed the use of the so-called non-zero representation, here we prove that if an additive representation can be used instead then BNSL becomes fixed-parameter tractable even under significantly milder restrictions to the superstructure, notably when parameterized by the treewidth alone. Last but not least, we show how our results can be extended to the closely related problem of Polytree Learning.
Paper Structure (9 sections, 19 theorems, 5 equations, 5 figures)

This paper contains 9 sections, 19 theorems, 5 equations, 5 figures.

Key Result

Proposition 1

For every graph $G$, $\operatorname{tcw}(G)\leq \operatorname{lfen}(G)+1$ and $\operatorname{lfen}(G)\leq \operatorname{fen}(G)$.

Figures (5)

  • Figure 1: The complexity landscape of BNSL with respect to parameterizations of the superstructure. Arrows point from more restrictive parameters to less restrictive ones. Results for the three graph parameters on the left side follow from this paper, while all other W$[1]$-hardness results follow from the reduction by Ordyniak and Szeider OrdyniakS13.
  • Figure 2: Example of a superstructure graph (on the top) and a suitable solution DAG (on the bottom) when: $f_a (\{b\})=f_a (\{c\})=1$, $f_a(\{b,c\})=2$; $f_b (\{a\})=f_b (\{c\})=1$, $f_b(\{a,c\})=3$; $f_c (\{a\})=3$, $f_c(\{b\})=2$; $f_d(\{b,c\})=1$; $l=6.$ Note that in the depicted DAG the scores of $b$ and $c$ are equal to 3, the score of $d$ is equal to 1. Parent set of $a$ is empty; as we assume the non-zero representation and $f_a(\emptyset)$ is not specified in the input, we conclude that $f_a(\emptyset)=0$. Therefore the total score is $0+3+3+1=7\geq6=l$.
  • Figure 3: Top: The six possible scenarios that give rise to the values of $\ell_{\emph{max}}$ (Cases 1-4) and $\ell_{\emph{noPath}}$ (Cases 5-6). Bottom: The corresponding arcs in the gadget after the application of Reduction Rule \ref{['redruletwo']}.
  • Figure 4: An example of our main gadget encoding the vector $s=(1,3,2)$ with $k=3$. Heavy arcs are marked in green, while purple and blue arcs are light. Intuitively, the reduction forces a choice between using the blue edge or all the purple edges; the latter case provides a total score that is $1$ greater than the former, but is constrained by the upper bound $r$ on the in-degrees of $v^1$, $v^2$, $v^3$.
  • Figure 5: Top: The eight possible scenarios for solutions to $\mathcal{I}$. Bottom: The corresponding arcs in the gadget after the application of Reduction Rule 2' (the scores of $b$ are specified below).

Theorems & Definitions (44)

  • Proposition 1
  • proof
  • Theorem 2
  • proof
  • Theorem 3
  • Lemma 4
  • proof
  • Lemma 5
  • proof
  • proof : Proof of Theorem \ref{['thm:kernel']}
  • ...and 34 more