On the Role of Priors in Bayesian Causal Learning
Bernhard C. Geiger, Roman Kern
TL;DR
The paper analyzes how priors shape Bayesian causal learning of independent causal mechanisms, showing that unlabeled cause realizations do not directly improve learning of the mechanism parameter while a factorized prior on $(\theta,\psi)$ yields a factorized posterior. It proves that cause data influence only the cause parameter in the posterior and that any effect on the mechanism parameter must arise via the prior coupling between parameters. Empirical results with Gaussian models reveal that correlated priors can slow learning across unsupervised, fully supervised, and semi-supervised settings, while factorized priors readily support posterior separation and reduce unintended information transfer from causes to mechanisms. The work offers principled guidance for prior design in Bayesian causal learning and informs semi-supervised and Bayesian deep learning approaches that incorporate causal modules.
Abstract
In this work, we investigate causal learning of independent causal mechanisms from a Bayesian perspective. Confirming previous claims from the literature, we show in a didactically accessible manner that unlabeled data (i.e., cause realizations) do not improve the estimation of the parameters defining the mechanism. Furthermore, we observe the importance of choosing an appropriate prior for the cause and mechanism parameters, respectively. Specifically, we show that a factorized prior results in a factorized posterior, which resonates with Janzing and Schölkopf's definition of independent causal mechanisms via the Kolmogorov complexity of the involved distributions and with the concept of parameter independence of Heckerman et al.
