Large Language Models are Effective Priors for Causal Graph Discovery
Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi
TL;DR
This work investigates using Large Language Models (LLMs) as soft priors to guide causal graph discovery under limited data. It introduces a probabilistic expert interaction model, a set of priors-evaluation metrics, and a flexible integration framework with CD-UCT, then empirically shows that 3-Way prompting and the combination of mutual information priors with LLM priors yield improved causal graphs, especially when computational budgets are tight. The main contributions are (i) a principled way to quantify and compare LLM-derived priors for causal discovery, (ii) a prompting Design space demonstrating when LLMs provide reliable directionality signals, and (iii) a practical integration strategy that outperforms hard priors and baseline MI priors on common-sense benchmarks. The findings highlight the potential of LLMs as soft background knowledge for causal structure learning while acknowledging limitations related to domain-specific knowledge, scalability, and possible data leakage, pointing to future work on richer interactive setups and domain-tailored LLMs.
Abstract
Causal structure discovery from observations can be improved by integrating background knowledge provided by an expert to reduce the hypothesis space. Recently, Large Language Models (LLMs) have begun to be considered as sources of prior information given the low cost of querying them relative to a human expert. In this work, firstly, we propose a set of metrics for assessing LLM judgments for causal graph discovery independently of the downstream algorithm. Secondly, we systematically study a set of prompting designs that allows the model to specify priors about the structure of the causal graph. Finally, we present a general methodology for the integration of LLM priors in graph discovery algorithms, finding that they help improve performance on common-sense benchmarks and especially when used for assessing edge directionality. Our work highlights the potential as well as the shortcomings of the use of LLMs in this problem space.
