Table of Contents
Fetching ...

DeCaFlow: A deconfounding causal generative model

Alejandro Almodóvar, Adrián Javaloy, Juan Parras, Santiago Zazo, Isabel Valera

TL;DR

DeCaFlow presents a scalable, end-to-end causal generative framework that extends causal normalizing flows to handle hidden confounding via a variational encoder and proxy variables. It provides correct estimates for a broad class of interventional and counterfactual queries identifiable through do-calculus and proximal identifiability, using a single training per dataset. The approach demonstrates strong empirical performance on semi-synthetic Sachs and Ecoli70 graphs and a real-world fairness use case, outperforming several baselines and matching oracle-like performance on identifiable queries. By integrating structural causal constraints, proxy informativeness, and a principled training objective, DeCaFlow offers a practical tool for causal inference in complex, confounded settings with continuous variables. The work also discusses limitations related to proxy quality and graph correctness and outlines directions for future extensions to time-varying treatments and broader applicability.

Abstract

We introduce DeCaFlow, a deconfounding causal generative model. Training once per dataset using just observational data and the underlying causal graph, DeCaFlow enables accurate causal inference on continuous variables under the presence of hidden confounders. Specifically, we extend previous results on causal estimation under hidden confounding to show that a single instance of DeCaFlow provides correct estimates for all causal queries identifiable with do-calculus, leveraging proxy variables to adjust for the causal effects when do-calculus alone is insufficient. Moreover, we show that counterfactual queries are identifiable as long as their interventional counterparts are identifiable, and thus are also correctly estimated by DeCaFlow. Our empirical results on diverse settings (including the Ecoli70 dataset, with 3 independent hidden confounders, tens of observed variables and hundreds of causal queries) show that DeCaFlow outperforms existing approaches, while demonstrating its out-of-the-box applicability to any given causal graph. An implementation can be found in https://github.com/aalmodovares/DeCaFlow

DeCaFlow: A deconfounding causal generative model

TL;DR

DeCaFlow presents a scalable, end-to-end causal generative framework that extends causal normalizing flows to handle hidden confounding via a variational encoder and proxy variables. It provides correct estimates for a broad class of interventional and counterfactual queries identifiable through do-calculus and proximal identifiability, using a single training per dataset. The approach demonstrates strong empirical performance on semi-synthetic Sachs and Ecoli70 graphs and a real-world fairness use case, outperforming several baselines and matching oracle-like performance on identifiable queries. By integrating structural causal constraints, proxy informativeness, and a principled training objective, DeCaFlow offers a practical tool for causal inference in complex, confounded settings with continuous variables. The work also discusses limitations related to proxy quality and graph correctness and outlines directions for future extensions to time-varying treatments and broader applicability.

Abstract

We introduce DeCaFlow, a deconfounding causal generative model. Training once per dataset using just observational data and the underlying causal graph, DeCaFlow enables accurate causal inference on continuous variables under the presence of hidden confounders. Specifically, we extend previous results on causal estimation under hidden confounding to show that a single instance of DeCaFlow provides correct estimates for all causal queries identifiable with do-calculus, leveraging proxy variables to adjust for the causal effects when do-calculus alone is insufficient. Moreover, we show that counterfactual queries are identifiable as long as their interventional counterparts are identifiable, and thus are also correctly estimated by DeCaFlow. Our empirical results on diverse settings (including the Ecoli70 dataset, with 3 independent hidden confounders, tens of observed variables and hundreds of causal queries) show that DeCaFlow outperforms existing approaches, while demonstrating its out-of-the-box applicability to any given causal graph. An implementation can be found in https://github.com/aalmodovares/DeCaFlow

Paper Structure

This paper contains 93 sections, 16 theorems, 28 equations, 31 figures, 8 tables, 7 algorithms.

Key Result

proposition 1

If induces the same causal graph and observational distribution as the underlying (confounded) SCM generating the data. Then, recovers the SCM for every variable not in , up to an element-wise transformation of their exogenous distributions.

Figures (31)

  • Figure 1: can be effortlessly applied to highly complex causal graphs, as that of the Ecoli70 dataset schafer2005ecoli, with multiple hidden confounders and dozens of variables. We dash hidden confounders, and highlight direct hidden-confounded effects as identifiable (and thus correctly estimated by ), or unidentifiable.
  • Figure 1: Test RMSE on prediction and MMD of inter-group predictive distributions.
  • Figure 2: Example of computations for the causal graph in \ref{['app:fig:miao-block-graph']}. Circles represent input/output variables of the masked conditional normalizing flows, and black dots conditional inputs. Steps performed during training (\ref{['eq:elbo']}), where $\epsilonb$ is a non-causal random variable needed to model $\varhidden$ with . Steps performed to compute an interventional or counterfactual query with CNFs javaloy2024causal, see \ref{['app:sec:do-operator']}, where $\varexo_\vartreat'$ is the value for which $\vartreat = \alpha$ always, , $\varexo_\vartreat' = \flowz(\flowz^{-1}(\varexo)_{\varobs\setminus\vartreat}, \alpha)_\vartreat$.
  • Figure 3: Illustrative causal graph where the presence or absence of some parts render $p(\giventhat{\varoutcome_1}{\doop(\vartreat)})$ identifiable using do-calculus. Else, \ref{['prop:informal']} yields identifiability if and are informative proxies.
  • Figure 4: Twin SCM with observed factual nodes grayed out and edges severed to compute $p(\giventhat{\cfactual\varoutcome}{\doop(\cfactual\vartreat), \factual\varobs})$ in red.
  • ...and 26 more figures

Theorems & Definitions (32)

  • definition 1
  • definition 1
  • definition 2
  • proposition 1: Informal
  • corollary 1: Informal
  • proposition 2: Informal
  • corollary 2
  • corollary 3
  • proposition 3: Informal
  • corollary 4
  • ...and 22 more