Causal Bayesian Optimization with Unknown Graphs
Jean Durand, Yashas Annadani, Stefan Bauer, Sonali Parbhoo
TL;DR
This work introduces CBO-U, a scalable causal Bayesian optimization framework for settings where the causal graph is unknown. By learning a Bayesian posterior over the direct parents of the target variable $Y$ and intervening on these parents, the method achieves optimization performance comparable to full-graph methods while scaling to graphs with up to 100 nodes. It provides a closed-form posterior in the linear case and a GP-based random Fourier feature approximation for the nonlinear case, coupled with a prior over parent sets derived from doubly robust causal feature selection. The approach uses interventional data and do-calculus to refine both the surrogate model and the causal-parent posterior, and shows competitive results across synthetic, semi-synthetic, and real-world-like networks. The framework broadens the applicability of CBO to real-world domains where causal structure is incomplete or uncertain, with a clear path for future extensions to broader intervention types and partial discovery.
Abstract
Causal Bayesian Optimization (CBO) is a methodology designed to optimize an outcome variable by leveraging known causal relationships through targeted interventions. Traditional CBO methods require a fully and accurately specified causal graph, which is a limitation in many real-world scenarios where such graphs are unknown. To address this, we propose a new method for the CBO framework that operates without prior knowledge of the causal graph. Consistent with causal bandit theory, we demonstrate through theoretical analysis that focusing on the direct causal parents of the target variable is sufficient for optimization, and we provide empirical validation of this result in the context of CBO. Furthermore, we introduce a method that learns a Bayesian posterior over the direct parents of the target variable. This allows us to optimize the outcome variable while simultaneously learning the causal structure. Our contributions include a derivation of the closed-form posterior distribution for the linear case. In the nonlinear case, where the posterior is not tractable, we present a Gaussian Process (GP) approximation that still enables CBO by inferring the parents of the outcome variable. The proposed method performs competitively with existing benchmarks and scales well to larger graphs, making it a practical tool for real-world applications where causal information is incomplete.
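To make the idea of a posterior over the direct parents of $Y$ concrete, the following is a minimal Python sketch for a linear-Gaussian setting. It is not the paper's derivation. It assumes a zero-mean Gaussian prior on the regression weights, a known noise variance, a uniform prior over parent sets of bounded size, and exhaustive enumeration of those sets. The names `log_marginal_likelihood`, `parent_set_posterior`, `inclusion_probabilities`, `noise_var`, `prior_var`, and `max_parents` are hypothetical and chosen only for illustration. The sketch also treats regression dependence as if it reflected causal parenthood, which in general requires interventional data of the kind the abstract describes.

```python
import itertools

import numpy as np


def log_marginal_likelihood(X, y, noise_var=0.25, prior_var=1.0):
    """Log evidence of y under y = X w + eps, with w ~ N(0, prior_var * I)
    and eps ~ N(0, noise_var * I). Integrating out w gives
    y ~ N(0, noise_var * I + prior_var * X X^T)."""
    n = y.shape[0]
    cov = noise_var * np.eye(n) + prior_var * (X @ X.T)
    _, logdet = np.linalg.slogdet(cov)
    quad = y @ np.linalg.solve(cov, y)
    return -0.5 * (n * np.log(2.0 * np.pi) + logdet + quad)


def parent_set_posterior(X, y, max_parents=2, noise_var=0.25, prior_var=1.0):
    """Posterior over candidate parent sets of size <= max_parents,
    assuming a uniform prior over the enumerated sets (an assumption)."""
    d = X.shape[1]
    candidates = [
        s
        for k in range(max_parents + 1)
        for s in itertools.combinations(range(d), k)
    ]
    log_post = np.array([
        log_marginal_likelihood(
            X[:, np.array(s, dtype=int)], y, noise_var, prior_var
        )
        for s in candidates
    ])
    log_post -= log_post.max()  # stabilise before exponentiating
    post = np.exp(log_post)
    post /= post.sum()
    return candidates, post


def inclusion_probabilities(candidates, post, d):
    """Marginal probability that each variable is a parent of Y."""
    probs = np.zeros(d)
    for s, p in zip(candidates, post):
        for j in s:
            probs[j] += p
    return probs


# Toy data (an assumption for illustration): Y depends on X1 and X3 only.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(200, 5))
y_demo = 2.0 * X_demo[:, 1] - 1.5 * X_demo[:, 3] + 0.5 * rng.normal(size=200)

sets_demo, post_demo = parent_set_posterior(X_demo, y_demo, max_parents=2)
print("MAP parent set:", sets_demo[int(np.argmax(post_demo))])
print("Inclusion probabilities:", inclusion_probabilities(sets_demo, post_demo, 5))
```

Exhaustive enumeration is only feasible for small graphs and small `max_parents`. The abstract's claim of scaling to larger graphs implies a more economical treatment of the parent-set posterior than this sketch uses.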
