Since Faithfulness Fails: The Performance Limits of Neural Causal Discovery

Mateusz Olko; Mateusz Gajewski; Joanna Wojciechowska; Mikołaj Morzy; Piotr Sankowski; Piotr Miłoś

TL;DR

The paper investigates the limits of neural causal discovery in the finite-sample regime. It argues that neural networks cannot reliably distinguish true causal links from absent ones, and identifies violations of $\lambda$-strong faithfulness as the key bottleneck. It introduces a unified benchmarking protocol and an empirical framework that uses $\hat{\lambda}$ to quantify task difficulty, evaluated on synthetic nonlinear SCMs. Results show that convergence and accuracy improve with larger $\hat{\lambda}$ but degrade as graph size and density increase, in line with the theoretical result that the fraction of $\lambda$-strong faithful distributions shrinks as graphs grow. The findings point to fundamental constraints in the current neural discovery paradigm and motivate a shift toward new data regimes or modeling assumptions beyond standard neural function approximators.
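
For reference, $\lambda$-strong faithfulness is commonly stated in the causal discovery literature in terms of partial correlations, as in the linear-Gaussian setting. The formulation below is a sketch under that assumption: the symbols $V$, $S$, $\rho$, and the d-connection notation $\not\perp_G$ are introduced here for illustration, and the paper's own definition and its estimator $\hat{\lambda}$ may differ. A distribution over variables $\{X_i\}_{i \in V}$ is $\lambda$-strong faithful to a DAG $G$ if

$$
\min_{\substack{i \neq j,\; S \subseteq V \setminus \{i,j\} \\ X_i \not\perp_G X_j \mid X_S}} \left| \rho(X_i, X_j \mid X_S) \right| > \lambda, \qquad \lambda \in (0, 1),
$$

where the minimum runs over all pairs that are d-connected in $G$ given $X_S$. Under this reading, a larger $\lambda$ demands a stronger dependence wherever the graph permits one, so fewer distributions satisfy the condition.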

Abstract

Neural causal discovery methods have recently improved in terms of scalability and computational efficiency. However, our systematic evaluation highlights significant room for improvement in their accuracy when uncovering causal structures. We identify a fundamental limitation: neural networks cannot reliably distinguish between existing and non-existing causal relationships in the finite sample regime. Our experiments reveal that neural networks, as used in contemporary causal discovery approaches, lack the precision needed to recover ground-truth graphs, even for small graphs and relatively large sample sizes. Furthermore, we identify the faithfulness property as a critical bottleneck: (i) it is likely to be violated across any reasonable dataset size range, and (ii) its violation directly undermines the performance of neural discovery methods. These findings lead us to conclude that progress within the current paradigm is fundamentally constrained, necessitating a paradigm shift in this domain.

Paper Structure

Table of Contents

Figures (12)

Theorems & Definitions (3)