Theoretical Guarantees for Causal Discovery on Large Random Graphs
Mathieu Chevalley, Arash Mehrjou, Patrick Schwab
TL;DR
The paper tackles the problem of orientation errors in causal discovery under single-variable random interventions with latent confounding, formalized via $\epsilon$-interventional faithfulness. It develops finite-dimension deviation bounds for the false-negative rate (FNR) and establishes dimension-adaptive concentration results across three graph families: dense and sparse Erdős–Rényi DAGs and generalized Barabási–Albert (BA) graphs, with rates that decay as the graph grows. The main technical contributions are concentration results for the topological error $D_{top}$ and the normalized FNR $g$, derived using McDiarmid’s inequality and related bounds, and they reveal how network structure can regularize causal discovery. Empirical simulations corroborate the theory, showing concentration and often vanishing FNR as dimension increases, thus challenging the intuition that high dimensionality and heterogeneity hinder reliable causal orientation. The work provides principled guidance for designing interventional studies in large-scale systems and lays groundwork for extending guarantees to adaptive interventions and broader network models.
Abstract
We investigate theoretical guarantees for the false-negative rate (FNR) -- the fraction of true causal edges whose orientation is not recovered, under single-variable random interventions and an $ε$-interventional faithfulness assumption that accommodates latent confounding. For sparse Erdős--Rényi directed acyclic graphs, where the edge probability scales as $p_e = Θ(1/d)$, we show that the FNR concentrates around its mean at rate $O(\frac{\log d}{\sqrt d})$, implying that large deviations above the expected error become exponentially unlikely as dimensionality increases. This concentration ensures that derived upper bounds hold with high probability in large-scale settings. Extending the analysis to generalized Barabási--Albert graphs reveals an even stronger phenomenon: when the degree exponent satisfies $γ> 3$, the deviation width scales as $O(d^{β- \frac{1}{2}})$ with $β= 1/(γ- 1) < \frac{1}{2}$, and hence vanishes in the limit. This demonstrates that realistic scale-free topologies intrinsically regularize causal discovery, reducing variability in orientation error. These finite-dimension results provide the first dimension-adaptive, faithfulness-robust guarantees for causal structure recovery, and challenge the intuition that high dimensionality and network heterogeneity necessarily hinder accurate discovery. Our simulation results corroborate these theoretical predictions, showing that the FNR indeed concentrates and often vanishes in practice as dimensionality grows.
