Learning Causal Response Representations through Direct Effect Analysis

Homer Durand; Gherardo Varando; Gustau Camps-Valls

Learning Causal Response Representations through Direct Effect Analysis

Homer Durand, Gherardo Varando, Gustau Camps-Valls

TL;DR

The paper addresses learning a representation of the direct causal effect of $X$ on a multivariate outcome $Y$ under conditioning on $Z$. It introduces a CIT-driven framework that projects $Y$ with $\mathbf{w}$ to maximize a CIT statistic, solving a generalized eigenvalue problem with losses $T_S$, $T_F$, and $T_D$. The authors provide theoretical guarantees linking the learned direction to a signal-to-noise ratio and Fisher information, and they derive an $F$-distribution based test (with an upper bound for the direct-effect statistic) to enable conditional independence testing. Empirically, the method recovers the direct-effect subspace in simulations and demonstrates practical climate-attribution benefits, including separating internal climate variability from forced responses and assessing multiple external forcings.

Abstract

We propose a novel approach for learning causal response representations. Our method aims to extract directions in which a multidimensional outcome is most directly caused by a treatment variable. By bridging conditional independence testing with causal representation learning, we formulate an optimisation problem that maximises the evidence against conditional independence between the treatment and outcome, given a conditioning set. This formulation employs flexible regression models tailored to specific applications, creating a versatile framework. The problem is addressed through a generalised eigenvalue decomposition. We show that, under mild assumptions, the distribution of the largest eigenvalue can be bounded by a known $F$-distribution, enabling testable conditional independence. We also provide theoretical guarantees for the optimality of the learned representation in terms of signal-to-noise ratio and Fisher information maximisation. Finally, we demonstrate the empirical effectiveness of our approach in simulation and real-world experiments. Our results underscore the utility of this framework in uncovering direct causal effects within complex, multivariate settings.

Learning Causal Response Representations through Direct Effect Analysis

TL;DR

The paper addresses learning a representation of the direct causal effect of

on a multivariate outcome

under conditioning on

. It introduces a CIT-driven framework that projects

with

to maximize a CIT statistic, solving a generalized eigenvalue problem with losses

, and

. The authors provide theoretical guarantees linking the learned direction to a signal-to-noise ratio and Fisher information, and they derive an

-distribution based test (with an upper bound for the direct-effect statistic) to enable conditional independence testing. Empirically, the method recovers the direct-effect subspace in simulations and demonstrates practical climate-attribution benefits, including separating internal climate variability from forced responses and assessing multiple external forcings.

Abstract

-distribution, enabling testable conditional independence. We also provide theoretical guarantees for the optimality of the learned representation in terms of signal-to-noise ratio and Fisher information maximisation. Finally, we demonstrate the empirical effectiveness of our approach in simulation and real-world experiments. Our results underscore the utility of this framework in uncovering direct causal effects within complex, multivariate settings.

Learning Causal Response Representations through Direct Effect Analysis

TL;DR

Abstract

Learning Causal Response Representations through Direct Effect Analysis

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (15)

Theorems & Definitions (27)