Incorporating Expert Knowledge into Bayesian Causal Discovery of Mixtures of Directed Acyclic Graphs
Zachris Björkman, Jorge Loría, Sophie Wharrie, Samuel Kaski
TL;DR
The paper tackles causal discovery under model misspecification due to heterogeneity by enabling expert-guided, Bayesian inference over mixtures of causal DAGs (CBNs). It introduces an informative latent graph prior, derives a BED-based elicitation strategy to query experts efficiently, and extends DiBS to VaMSL for joint inference over graph structure and parameters across multiple components, including nonlinear BNs. Empirical results on synthetic and breast cancer data show that incorporating expert information improves structure learning (lower ESHD) and downstream classification in heterogeneous settings, achieving performance close to supervised baselines in some cases. The approach is practical for domains with limited data but rich domain expertise, enabling robust, interpretable causal discovery with explicit handling of heterogeneity and expert input.
Abstract
Bayesian causal discovery benefits from prior information elicited from domain experts, and in heterogeneous domains any prior knowledge would be badly needed. However, so far prior elicitation approaches have assumed a single causal graph and hence are not suited to heterogeneous domains. We propose a causal elicitation strategy for heterogeneous settings, based on Bayesian experimental design (BED) principles, and a variational mixture structure learning (VaMSL) method -- extending the earlier differentiable Bayesian structure learning (DiBS) method -- to iteratively infer mixtures of causal Bayesian networks (CBNs). We construct an informative graph prior incorporating elicited expert feedback in the inference of mixtures of CBNs. Our proposed method successfully produces a set of alternative causal models (mixture components or clusters), and achieves an improved structure learning performance on heterogeneous synthetic data when informed by a simulated expert. Finally, we demonstrate that our approach is capable of capturing complex distributions in a breast cancer database.
