Table of Contents
Fetching ...

Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise Models

Sujai Hiremath, Jacqueline R. M. A. Maasch, Mengxiao Gao, Promit Ghosal, Kyra Gan

TL;DR

The paper tackles scalable causal discovery from observational data by combining local-to-global reasoning with top-down and nonparametric edge pruning. It introduces two hierarchical topological sort algorithms: LHTS for linear non-Gaussian ANMs and NHTS for nonlinear ANMs, each exploiting local ancestral or parental relationships to reduce regressions and conditioning set sizes. A nonparametric constraint-based edge discovery (ED) step then prunes spurious edges using targeted conditioning, with theoretical guarantees and polynomial-time complexities. Empirically, the methods yield higher topological accuracy and edge-recovery rates than state-of-the-art baselines on synthetic data, particularly in sparse graphs, while offering favorable runtimes for nonlinear settings. Collectively, the work advances efficient, flexible causal discovery across linear and nonlinear regimes with measurable practical impact for high-dimensional applications.

Abstract

Learning the unique directed acyclic graph corresponding to an unknown causal model is a challenging task. Methods based on functional causal models can identify a unique graph, but either suffer from the curse of dimensionality or impose strong parametric assumptions. To address these challenges, we propose a novel hybrid approach for global causal discovery in observational data that leverages local causal substructures. We first present a topological sorting algorithm that leverages ancestral relationships in linear structural causal models to establish a compact top-down hierarchical ordering, encoding more causal information than linear orderings produced by existing methods. We demonstrate that this approach generalizes to nonlinear settings with arbitrary noise. We then introduce a nonparametric constraint-based algorithm that prunes spurious edges by searching for local conditioning sets, achieving greater accuracy than current methods. We provide theoretical guarantees for correctness and worst-case polynomial time complexities, with empirical validation on synthetic data.

Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise Models

TL;DR

The paper tackles scalable causal discovery from observational data by combining local-to-global reasoning with top-down and nonparametric edge pruning. It introduces two hierarchical topological sort algorithms: LHTS for linear non-Gaussian ANMs and NHTS for nonlinear ANMs, each exploiting local ancestral or parental relationships to reduce regressions and conditioning set sizes. A nonparametric constraint-based edge discovery (ED) step then prunes spurious edges using targeted conditioning, with theoretical guarantees and polynomial-time complexities. Empirically, the methods yield higher topological accuracy and edge-recovery rates than state-of-the-art baselines on synthetic data, particularly in sparse graphs, while offering favorable runtimes for nonlinear settings. Collectively, the work advances efficient, flexible causal discovery across linear and nonlinear regimes with measurable practical impact for high-dimensional applications.

Abstract

Learning the unique directed acyclic graph corresponding to an unknown causal model is a challenging task. Methods based on functional causal models can identify a unique graph, but either suffer from the curse of dimensionality or impose strong parametric assumptions. To address these challenges, we propose a novel hybrid approach for global causal discovery in observational data that leverages local causal substructures. We first present a topological sorting algorithm that leverages ancestral relationships in linear structural causal models to establish a compact top-down hierarchical ordering, encoding more causal information than linear orderings produced by existing methods. We demonstrate that this approach generalizes to nonlinear settings with arbitrary noise. We then introduce a nonparametric constraint-based algorithm that prunes spurious edges by searching for local conditioning sets, achieving greater accuracy than current methods. We provide theoretical guarantees for correctness and worst-case polynomial time complexities, with empirical validation on synthetic data.
Paper Structure (60 sections, 39 theorems, 8 equations, 15 figures, 4 algorithms)

This paper contains 60 sections, 39 theorems, 8 equations, 15 figures, 4 algorithms.

Key Result

Lemma 3.0

Each pair of distinct nodes $x_i,x_j \in V$ can be in one of four possible active causal ancestral path relations: AP1) no active path exists between $x_i,x_j$; AP2) there exists an active backdoor path between $x_i,x_j$, but there is no active frontdoor path between them; AP3) there exists an activ

Figures (15)

  • Figure 1: Illustrative DAG, where $x_1$ is a root, $x_3$ is a leaf, $x_3 \in \text{Ch}(x_2), x_3 \in \text{De}(x_1).$
  • Figure 2: Enumeration of active causal path relation types between a pair of nodes $x_i$ and $x_j$. Dashed arrows indicate ancestorship.
  • Figure 3: Enumeration of the potential active causal paths among a fixed variable $x_j$, one of its potential parents $x_i$, and $C=\text{PA}(x_j)\setminus{x_i}$. Solid arrows denote parenthood relations, and undirected dashed connections indicate the existence of active paths.
  • Figure 4: DAG corresponding to Lemma \ref{['lemma:edge']}, which tests whether $x_i \in \text{Pa}(x_j)$ (i.e., whether the red arrow exists).
  • Figure 5: Performance of LHTS on synthetic data. Top row: $n=500$ with varying dimension $d$. Bottom row: $d=10$ with varying sample size $n$. See Appendix \ref{['appendix: lhtsruntime']} for runtime results.
  • ...and 10 more figures

Theorems & Definitions (61)

  • Definition 2.1: Topological Orderings
  • Definition 2.2: ANMs
  • Lemma 3.0: Active Causal Ancestral Path Relation Enumeration
  • Lemma 3.0
  • Lemma 3.0: AP1
  • Lemma 3.0: AP2
  • Lemma 3.0: AP3
  • Lemma 3.0: AP4
  • Theorem 3.1
  • Theorem 3.2
  • ...and 51 more