Table of Contents
Fetching ...

Are Bayesian networks typically faithful?

Philip Boeken, Patrick Forré, Joris M. Mooij

TL;DR

This work investigates whether faithfulness is a typical property for Bayesian networks. It proves that faithful distributions are dense and open in the total variation topology for any Markov distribution relative to a fixed DAG $G$, with unfaithful distributions forming a nowhere dense set; it further shows that, under mild regularity, faithful parameters are dense with Lebesgue measure zero for conditional exponential-family parametrisations, extending known results beyond linear-Gaussian and discrete models. The results also extend to networks with latent variables via latent projections to ADMGs. Collectively, the findings provide a topological notion of typicality for faithfulness, offering robustness guarantees for causal discovery across broad model classes, while noting limitations due to the chosen notion of typicality and the potential separation between topological and statistical notions of faithfulness.

Abstract

Faithfulness is a ubiquitous assumption in causal inference, often motivated by the fact that the faithful parameters of linear Gaussian and discrete Bayesian networks are typical, and the folklore belief that this should also hold for other classes of Bayesian networks. We address this open question by showing that among all Bayesian networks over a given DAG, the faithful Bayesian networks are indeed `typical': they constitute a dense, open set with respect to the total variation metric. However, this does not imply that faithfulness is typical in restricted classes of Bayesian networks, as are often considered in statistical applications. To this end we consider the class of Bayesian networks parametrised by conditional exponential families, for which we show that under mild regularity conditions, the faithful parameters constitute a dense, open set and the unfaithful parameters have Lebesgue measure zero, extending the existing results for linear Gaussian and discrete Bayesian networks. Finally, we show that the aforementioned results also hold for Bayesian networks with latent variables.

Are Bayesian networks typically faithful?

TL;DR

This work investigates whether faithfulness is a typical property for Bayesian networks. It proves that faithful distributions are dense and open in the total variation topology for any Markov distribution relative to a fixed DAG , with unfaithful distributions forming a nowhere dense set; it further shows that, under mild regularity, faithful parameters are dense with Lebesgue measure zero for conditional exponential-family parametrisations, extending known results beyond linear-Gaussian and discrete models. The results also extend to networks with latent variables via latent projections to ADMGs. Collectively, the findings provide a topological notion of typicality for faithfulness, offering robustness guarantees for causal discovery across broad model classes, while noting limitations due to the chosen notion of typicality and the potential separation between topological and statistical notions of faithfulness.

Abstract

Faithfulness is a ubiquitous assumption in causal inference, often motivated by the fact that the faithful parameters of linear Gaussian and discrete Bayesian networks are typical, and the folklore belief that this should also hold for other classes of Bayesian networks. We address this open question by showing that among all Bayesian networks over a given DAG, the faithful Bayesian networks are indeed `typical': they constitute a dense, open set with respect to the total variation metric. However, this does not imply that faithfulness is typical in restricted classes of Bayesian networks, as are often considered in statistical applications. To this end we consider the class of Bayesian networks parametrised by conditional exponential families, for which we show that under mild regularity conditions, the faithful parameters constitute a dense, open set and the unfaithful parameters have Lebesgue measure zero, extending the existing results for linear Gaussian and discrete Bayesian networks. Finally, we show that the aforementioned results also hold for Bayesian networks with latent variables.

Paper Structure

This paper contains 10 sections, 17 theorems, 23 equations, 4 figures.

Key Result

Theorem 1

With respect to Lebesgue measure over $\Theta_\mathcal{N}$, the set of parameters whose distribution is unfaithful to $G$ is measure-zero.

Figures (4)

  • Figure 1: DAGs of the Bayesian networks that are given in Example \ref{['ex:unfaithful']}.
  • Figure 2: Graphs relating to different mixtures of Bayesian networks with graph $G$.
  • Figure 3: Mixtures of dependent variables might become independent.
  • Figure 4: DAG $G$ and latent projection $G^{\mathrm{p}}$ onto $\{A, B, C\}$.

Theorems & Definitions (43)

  • Theorem 1: spirtes1993causation
  • Theorem 2: meek1995strong
  • Definition 1
  • Theorem 3: verma1990causal
  • Definition 2
  • Example 1
  • Definition 3
  • Theorem 4: lauritzen2024total
  • Theorem 5
  • proof
  • ...and 33 more