Learning Causal Response Representations through Direct Effect Analysis
Homer Durand, Gherardo Varando, Gustau Camps-Valls
TL;DR
The paper addresses learning a representation of the direct causal effect of $X$ on a multivariate outcome $Y$ under conditioning on $Z$. It introduces a CIT-driven framework that projects $Y$ with $\mathbf{w}$ to maximize a CIT statistic, solving a generalized eigenvalue problem with losses $T_S$, $T_F$, and $T_D$. The authors provide theoretical guarantees linking the learned direction to a signal-to-noise ratio and Fisher information, and they derive an $F$-distribution based test (with an upper bound for the direct-effect statistic) to enable conditional independence testing. Empirically, the method recovers the direct-effect subspace in simulations and demonstrates practical climate-attribution benefits, including separating internal climate variability from forced responses and assessing multiple external forcings.
Abstract
We propose a novel approach for learning causal response representations. Our method aims to extract directions in which a multidimensional outcome is most directly caused by a treatment variable. By bridging conditional independence testing with causal representation learning, we formulate an optimisation problem that maximises the evidence against conditional independence between the treatment and outcome, given a conditioning set. This formulation employs flexible regression models tailored to specific applications, creating a versatile framework. The problem is addressed through a generalised eigenvalue decomposition. We show that, under mild assumptions, the distribution of the largest eigenvalue can be bounded by a known $F$-distribution, enabling testable conditional independence. We also provide theoretical guarantees for the optimality of the learned representation in terms of signal-to-noise ratio and Fisher information maximisation. Finally, we demonstrate the empirical effectiveness of our approach in simulation and real-world experiments. Our results underscore the utility of this framework in uncovering direct causal effects within complex, multivariate settings.
