Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise Models

Sujai Hiremath; Jacqueline R. M. A. Maasch; Mengxiao Gao; Promit Ghosal; Kyra Gan

Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise Models

Sujai Hiremath, Jacqueline R. M. A. Maasch, Mengxiao Gao, Promit Ghosal, Kyra Gan

TL;DR

The paper tackles scalable causal discovery from observational data by combining local-to-global reasoning with top-down and nonparametric edge pruning. It introduces two hierarchical topological sort algorithms: LHTS for linear non-Gaussian ANMs and NHTS for nonlinear ANMs, each exploiting local ancestral or parental relationships to reduce regressions and conditioning set sizes. A nonparametric constraint-based edge discovery (ED) step then prunes spurious edges using targeted conditioning, with theoretical guarantees and polynomial-time complexities. Empirically, the methods yield higher topological accuracy and edge-recovery rates than state-of-the-art baselines on synthetic data, particularly in sparse graphs, while offering favorable runtimes for nonlinear settings. Collectively, the work advances efficient, flexible causal discovery across linear and nonlinear regimes with measurable practical impact for high-dimensional applications.

Abstract

Learning the unique directed acyclic graph corresponding to an unknown causal model is a challenging task. Methods based on functional causal models can identify a unique graph, but either suffer from the curse of dimensionality or impose strong parametric assumptions. To address these challenges, we propose a novel hybrid approach for global causal discovery in observational data that leverages local causal substructures. We first present a topological sorting algorithm that leverages ancestral relationships in linear structural causal models to establish a compact top-down hierarchical ordering, encoding more causal information than linear orderings produced by existing methods. We demonstrate that this approach generalizes to nonlinear settings with arbitrary noise. We then introduce a nonparametric constraint-based algorithm that prunes spurious edges by searching for local conditioning sets, achieving greater accuracy than current methods. We provide theoretical guarantees for correctness and worst-case polynomial time complexities, with empirical validation on synthetic data.

Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise Models

TL;DR

Abstract

Paper Structure (60 sections, 39 theorems, 8 equations, 15 figures, 4 algorithms)

This paper contains 60 sections, 39 theorems, 8 equations, 15 figures, 4 algorithms.

Introduction
Preliminaries
Linear Setting
Nonlinear Setting
Edge Discovery
Experiments
Assumptions, Proofs, and Walkthrough for LHTS
Identifiability in Linear Setting
Proof of Lemma \ref{['lemma:causal_path_partition_ancestors']}
Proof of Lemma \ref{['lemma:path_ances_relation']}
Proof of Lemma \ref{['lemma:AP1']}
Proof of Lemma \ref{['lemma:AP2']}
Proof of Lemma \ref{['lemma:AP3']}
Proof of Lemma \ref{['lemma:AP4']}
Ancestor Sort
...and 45 more sections

Key Result

Lemma 3.0

Each pair of distinct nodes $x_i,x_j \in V$ can be in one of four possible active causal ancestral path relations: AP1) no active path exists between $x_i,x_j$; AP2) there exists an active backdoor path between $x_i,x_j$, but there is no active frontdoor path between them; AP3) there exists an activ

Figures (15)

Figure 1: Illustrative DAG, where $x_1$ is a root, $x_3$ is a leaf, $x_3 \in \text{Ch}(x_2), x_3 \in \text{De}(x_1).$
Figure 2: Enumeration of active causal path relation types between a pair of nodes $x_i$ and $x_j$. Dashed arrows indicate ancestorship.
Figure 3: Enumeration of the potential active causal paths among a fixed variable $x_j$, one of its potential parents $x_i$, and $C=\text{PA}(x_j)\setminus{x_i}$. Solid arrows denote parenthood relations, and undirected dashed connections indicate the existence of active paths.
Figure 4: DAG corresponding to Lemma \ref{['lemma:edge']}, which tests whether $x_i \in \text{Pa}(x_j)$ (i.e., whether the red arrow exists).
Figure 5: Performance of LHTS on synthetic data. Top row: $n=500$ with varying dimension $d$. Bottom row: $d=10$ with varying sample size $n$. See Appendix \ref{['appendix: lhtsruntime']} for runtime results.
...and 10 more figures

Theorems & Definitions (61)

Definition 2.1: Topological Orderings
Definition 2.2: ANMs
Lemma 3.0: Active Causal Ancestral Path Relation Enumeration
Lemma 3.0
Lemma 3.0: AP1
Lemma 3.0: AP2
Lemma 3.0: AP3
Lemma 3.0: AP4
Theorem 3.1
Theorem 3.2
...and 51 more

Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise Models

TL;DR

Abstract

Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise Models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (15)

Theorems & Definitions (61)