Table of Contents
Fetching ...

Decentralized Causal Discovery using Judo Calculus

Sridhar Mahadevan

TL;DR

This work generalizes causal discovery by introducing judo calculus, an intuitionistic, topos-based framework that yields local, regime-specific causal truths across covers of contexts. By modeling interventions and regime variation with a Lawvere–Tierney operator j on a presheaf of local causal models, the approach certifies edges as j-stable within each cover and glues them to produce globally valid conclusions. The authors adapt popular causal discovery methods (GES, psi-FCI, DCDI) into j-stable variants and demonstrate computational efficiency from decentralized, parallelizable per-regime learning, along with improved accuracy and robustness under regime shifts across synthetic DAGs, Sachs signaling data, LINCS perturbations, and OECD PISA datasets. The methodology unifies likelihood, topology, and invariance into TCES and related TCMS/DCDI-TCM variants, enabling stable, transportable causal inferences across heterogeneous environments. The work also presents a formal adjunction between causal and statistical models, enabling functorial, consistent updates to causal structures as CI information evolves, and outlines a path toward public implementation and broader validation.

Abstract

We describe a theory and implementation of an intuitionistic decentralized framework for causal discovery using judo calculus, which is formally defined as j-stable causal inference using j-do-calculus in a topos of sheaves. In real-world applications -- from biology to medicine and social science -- causal effects depend on regime (age, country, dose, genotype, or lab protocol). Our proposed judo calculus formalizes this context dependence formally as local truth: a causal claim is proven true on a cover of regimes, not everywhere at once. The Lawvere-Tierney modal operator j chooses which regimes are relevant; j-stability means the claim holds constructively and consistently across that family. We describe an algorithmic and implementation framework for judo calculus, combining it with standard score-based, constraint-based, and gradient-based causal discovery methods. We describe experimental results on a range of domains, from synthetic to real-world datasets from biology and economics. Our experimental results show the computational efficiency gained by the decentralized nature of sheaf-theoretic causal discovery, as well as improved performance over classical causal discovery methods.

Decentralized Causal Discovery using Judo Calculus

TL;DR

This work generalizes causal discovery by introducing judo calculus, an intuitionistic, topos-based framework that yields local, regime-specific causal truths across covers of contexts. By modeling interventions and regime variation with a Lawvere–Tierney operator j on a presheaf of local causal models, the approach certifies edges as j-stable within each cover and glues them to produce globally valid conclusions. The authors adapt popular causal discovery methods (GES, psi-FCI, DCDI) into j-stable variants and demonstrate computational efficiency from decentralized, parallelizable per-regime learning, along with improved accuracy and robustness under regime shifts across synthetic DAGs, Sachs signaling data, LINCS perturbations, and OECD PISA datasets. The methodology unifies likelihood, topology, and invariance into TCES and related TCMS/DCDI-TCM variants, enabling stable, transportable causal inferences across heterogeneous environments. The work also presents a formal adjunction between causal and statistical models, enabling functorial, consistent updates to causal structures as CI information evolves, and outlines a path toward public implementation and broader validation.

Abstract

We describe a theory and implementation of an intuitionistic decentralized framework for causal discovery using judo calculus, which is formally defined as j-stable causal inference using j-do-calculus in a topos of sheaves. In real-world applications -- from biology to medicine and social science -- causal effects depend on regime (age, country, dose, genotype, or lab protocol). Our proposed judo calculus formalizes this context dependence formally as local truth: a causal claim is proven true on a cover of regimes, not everywhere at once. The Lawvere-Tierney modal operator j chooses which regimes are relevant; j-stability means the claim holds constructively and consistently across that family. We describe an algorithmic and implementation framework for judo calculus, combining it with standard score-based, constraint-based, and gradient-based causal discovery methods. We describe experimental results on a range of domains, from synthetic to real-world datasets from biology and economics. Our experimental results show the computational efficiency gained by the decentralized nature of sheaf-theoretic causal discovery, as well as improved performance over classical causal discovery methods.

Paper Structure

This paper contains 207 sections, 20 theorems, 97 equations, 20 figures, 10 tables, 10 algorithms.

Key Result

Lemma 1

Let be an adjoint pair with unit $\eta : \textup{id}_{\mathbf{StatCI}} \Rightarrow \mathsf{Sem}\!\circ\!\mathsf{Syn}$ and counit $\epsilon : \mathsf{Syn}\!\circ\!\mathsf{Sem} \Rightarrow \textup{id}_{\mathbf{TCM}_\mathcal{E}}.$ Then, for every CI structure $\mathcal{I}\in \mathbf{StatCI}$ and every TCM Probabilistic reading. Sketch of proof. The identities follow from the universal bijection $

Figures (20)

  • Figure 1: Interference with overlapping covers. Edge frequencies $f(E_1\!\to Y), f(E_2\!\to Y)$ by cover (left); per-chart coefficients on intersections (right). Local claims are j-stable on each cover and persist on intersections.
  • Figure 2: Computational advantages of $j$-stable discovery.Left: Per-iteration wall-clock (seconds per 10k iterations) is on par with or lower than vanilla across $d\!\in\!\{10,20,40\}$; the aggregation+$\pi$ step adds negligible overhead. Right: Seed ensembles are highly parallel: wall-clock for 10 seeds drops substantially with 4 workers at $d\!=\!40$.
  • Figure 4: $j$-stable DCDI scales significantly better than regular DCDI on a synthetic DAG benchmark used originally in dcdi.
  • Figure 5: The Three Rules of Judo Calculus.
  • Figure 6: $j$-stability. Cells are (env, variable) mechanisms. Red cells are intervened (free). Colored blocks in a column are non-intervened and thus tied (penalized to agree).
  • ...and 15 more figures

Theorems & Definitions (51)

  • Definition 1: Adjunction between TCM and Statistical Independence
  • Lemma 1: Triangle identities and probabilistic interpretation
  • Corollary 1: Realization via synthesis
  • Definition 2
  • Definition 3
  • Lemma 2: Families vs. sieves
  • Definition 4
  • Definition 5
  • Definition 6: Lawvere--Tierney causal topology
  • Theorem 1
  • ...and 41 more