Table of Contents
Fetching ...

Beyond Conditional Averages: Estimating The Individual Causal Effect Distribution

Richard Post, Edwin van den Heuvel

TL;DR

The paper tackles the challenge that variability in individual causal effects (ICEs) can render average effects uninformative for individuals. It identifies conditions under which the ICE distribution is identifiable from cross-sectional data, notably a conditional independence assumption $Y^1-Y^0 \perp Y^0 \mid \boldsymbol{X}$, and develops latent-variable causal mixed models to recover the ICE distribution via latent components $N_Y$ and $U_1$. By reframing ICE estimation as a deconvolution-like problem and using flexible Gaussian mixture random effects, the authors provide a practical methodology to quantify the ICE distribution and its tail properties. The Framingham case study on Hepatic Steatosis and LVFP demonstrates that about 20.6% of individuals may experience harm exceeding twice the average effect, underscoring the added value of distributional causal analysis for precision medicine and risk stratification.

Abstract

In recent years, the field of causal inference from observational data has emerged rapidly. The literature has focused on (conditional) average causal effect estimation. When (remaining) variability of individual causal effects (ICEs) is considerable, average effects may be uninformative for an individual. The fundamental problem of causal inference precludes estimating the joint distribution of potential outcomes without making assumptions. In this work, we show that the ICE distribution is identifiable under (conditional) independence of the individual effect and the potential outcome under no exposure, in addition to the common assumptions of consistency, positivity, and conditional exchangeability. Moreover, we present a family of flexible latent variable models that can be used to study individual effect modification and estimate the ICE distribution from cross-sectional data. How such latent variable models can be applied and validated in practice is illustrated in a case study on the effect of Hepatic Steatosis on a clinical precursor to heart failure. Under the assumptions presented, we estimate that 20.6% (95% Bayesian credible interval: 8.9%, 33.6%) of the population has a harmful effect greater than twice the average causal effect.

Beyond Conditional Averages: Estimating The Individual Causal Effect Distribution

TL;DR

The paper tackles the challenge that variability in individual causal effects (ICEs) can render average effects uninformative for individuals. It identifies conditions under which the ICE distribution is identifiable from cross-sectional data, notably a conditional independence assumption , and develops latent-variable causal mixed models to recover the ICE distribution via latent components and . By reframing ICE estimation as a deconvolution-like problem and using flexible Gaussian mixture random effects, the authors provide a practical methodology to quantify the ICE distribution and its tail properties. The Framingham case study on Hepatic Steatosis and LVFP demonstrates that about 20.6% of individuals may experience harm exceeding twice the average effect, underscoring the added value of distributional causal analysis for precision medicine and risk stratification.

Abstract

In recent years, the field of causal inference from observational data has emerged rapidly. The literature has focused on (conditional) average causal effect estimation. When (remaining) variability of individual causal effects (ICEs) is considerable, average effects may be uninformative for an individual. The fundamental problem of causal inference precludes estimating the joint distribution of potential outcomes without making assumptions. In this work, we show that the ICE distribution is identifiable under (conditional) independence of the individual effect and the potential outcome under no exposure, in addition to the common assumptions of consistency, positivity, and conditional exchangeability. Moreover, we present a family of flexible latent variable models that can be used to study individual effect modification and estimate the ICE distribution from cross-sectional data. How such latent variable models can be applied and validated in practice is illustrated in a case study on the effect of Hepatic Steatosis on a clinical precursor to heart failure. Under the assumptions presented, we estimate that 20.6% (95% Bayesian credible interval: 8.9%, 33.6%) of the population has a harmful effect greater than twice the average causal effect.
Paper Structure (22 sections, 4 theorems, 30 equations, 22 figures, 8 tables)

This paper contains 22 sections, 4 theorems, 30 equations, 22 figures, 8 tables.

Key Result

Proposition 1

If Assumptions CH2A2, CH2A3 and CH2A1 apply, and $Y^1-Y^0 \mid \boldsymbol{X}=\boldsymbol{x}$ is degenerate, then

Figures (22)

  • Figure 1: Examples of effect distributions where the (conditional) ATE equals $-15$. ICEs follow a Gaussian distribution with $\sigma=2$ (a) and $\sigma{=}15$ (b) respectively, a log-normal distribution ($\mu{=}4, \sigma{=}\sqrt{0.5}),$ c), or a Gaussian mixture distribution with $p{=}0.6, \mu_{1}{=}-31, \sigma_{1}{=}10, \mu_{2}{=}9$ and $\sigma_{2}{=}5$ (d).
  • Figure 2: Estimated $Z_{1}$ distributions for $100$ simulations for underlying ICE distributions $\tau+U_{1x}$, where $\tau=-15$, $U_{1x}~{\sim}~\mathcal{N}(0,10^2)$ (left), or $U_{1x} ~{\sim}~ \text{LN}(4,\sqrt{0.5}^{2})-\exp\left(4+0.5 \sqrt{0.5}^{2}\right)$ (middle) or a two-component Gaussian mixture such that with probability $p=0.6$$U_{1}~{\sim}~ \mathcal{N}(-16,10^{2})$ and otherwise $U_{1}~{\sim}~ \mathcal{N}(24,5^{2})$ (right). The actual density for $\tau+U_{1}$(yellow) and the mean of the estimated $Z_{1}$ densities (black) are presented. Data for individuals were simulated using Equation \ref{['CH3eq:truth']}, where $N_{Yx} \sim \mathcal{N}(0, 50)$, $f_{Y0}(\boldsymbol{x})= \theta_{0}+ x\theta_{1}$, $f_{Y1}(\boldsymbol{x})=\tau$, where $\theta_{0}=120$ and the effect of the confounder was $\theta_{1}=5$. The confounder $X$ had the value $-0.3$ with probability $0.7$ and value $0.7$ with probability $0.3$ and thus a zero mean. The probability of exposure equaled $\text{logit}(-3+3 X_{i})$ for individual $i$. Finally, $N_{Yx}, U_{1x} \mathrel{\hbox{$\perp$}\mkern2mu{\perp}} A$ and $U_{1x} \mathrel{\hbox{$\perp$}\mkern2mu{\perp}} N_{Yx}$.
  • Figure 3: Estimated $Z_{1}$ distributions for $100$ simulations for underlying ICE distributions $\tau+U_{1x}$, where $\tau=-15$ and $(U_{1x}, N_{Yx})$ is bivariate normal distributed with zero means, variance $10^2$ and $50$ respectively and correlation $\rho=-0.75$ (a) or $\rho=0.75$ (b). The actual density for $\tau+U_{1x}$ (yellow) and the mean of the estimated $Z_{1}$ densities (black) are presented. Data for individuals were simulated using Equation \ref{['CH3eq:truth']}, where $N_{Yx} \sim \mathcal{N}(0, 50)$, $f_{Y0}(\boldsymbol{x})= \theta_{0}+ x\theta_{1}$, $f_{Y1}(\boldsymbol{x})=\tau$, where $\theta_{0}=120$ and the effect of the confounder was $\theta_{1}=5$. The confounder $X$ had the value $-0.3$ with probability $0.7$ and value $0.7$ with probability $0.3$ and thus a zero mean. The probability of exposure equaled $\text{logit}(-3+3 X_{i})$ for individual $i$ and $N_{Yx}, U_{1x} \mathrel{\hbox{$\perp$}\mkern2mu{\perp}} A$.
  • Figure 4: Estimated $Z_{1}$ distributions after fitting model \ref{['CH3LVmodel']} (a) or \ref{['CH3LVmodelfix']} (b) for $100$ simulations for underlying ICE distributions $\tau+U_{1x}$, where $\tau=-15$ and $U_{1x}~{\sim}~\mathcal{N}(0,10^2)$. The actual density for $\tau+U_{1}$(yellow) and the mean of the estimated $Z_{1}$ densities (black) are presented. Data for individuals were simulated using Equation \ref{['CH3eq:truth']}, where $N_{Yx}$ Gaussian distributed, $f_{Y0}(\boldsymbol{x})= \theta_{0}+ x\theta_{1}$, $f_{Y1}(\boldsymbol{x})=\tau$, where $\theta_{0}=120$ and the effect of the confounder was $\theta_{1}=5$. The confounder $X$ had the value $-0.3$ with probability $0.7$ and value $0.7$ with probability $0.3$ and thus a zero mean. The variance of $N_{Yx}$ was equal to $50$ for $X=0.7$ and $50 + 10^2$ for $X=-0.3$ and the probability of exposure equaled $\text{logit}(-3+3 X_{i})$ for individual $i$ and $N_{Yx}, U_{1x} \mathrel{\hbox{$\perp$}\mkern2mu{\perp}} A$.
  • Figure 5: Posterior predictive distribution of the LVFP for the model \ref{['CH3LVmodel3']}, with corresponding $95 \%$ Bayesian credible intervals and the kernel density of the observed data for unexposed (left) and exposed individuals (right).
  • ...and 17 more figures

Theorems & Definitions (4)

  • Proposition 1
  • Proposition 2
  • Lemma 1
  • Proposition 3