Bayesian learning of Causal Structure and Mechanisms with GFlowNets and Variational Bayes
Mizu Nishikawa-Toomey, Tristan Deleu, Jithendaraa Subramanian, Yoshua Bengio, Laurent Charlin
TL;DR
The paper addresses uncertainty quantification in causal structure learning by jointly inferring DAGs and causal mechanisms. It combines Variational Bayes with a GFlowNet (VBG) to model the posterior $P(G,\theta|\mathcal{D})$, enabling sampling of acyclic graphs while estimating mechanism parameters under a linear-Gaussian model. Empirically, VBG is competitive with baseline Bayesian methods on synthetic graphs and real protein-signalling data, and it offers acyclicity guarantees and a clear path to extending to non-linear mechanisms. This approach advances practical causal inference by providing a scalable, uncertainty-aware framework for joint graph and mechanism learning with potential for active intervention design.
Abstract
Bayesian causal structure learning aims to learn a posterior distribution over directed acyclic graphs (DAGs), and the mechanisms that define the relationship between parent and child variables. By taking a Bayesian approach, it is possible to reason about the uncertainty of the causal model. The notion of modelling the uncertainty over models is particularly crucial for causal structure learning since the model could be unidentifiable when given only a finite amount of observational data. In this paper, we introduce a novel method to jointly learn the structure and mechanisms of the causal model using Variational Bayes, which we call Variational Bayes-DAG-GFlowNet (VBG). We extend the method of Bayesian causal structure learning using GFlowNets to learn not only the posterior distribution over the structure, but also the parameters of a linear-Gaussian model. Our results on simulated data suggest that VBG is competitive against several baselines in modelling the posterior over DAGs and mechanisms, while offering several advantages over existing methods, including the guarantee to sample acyclic graphs, and the flexibility to generalize to non-linear causal mechanisms.
