Table of Contents
Fetching ...

Interaction Testing in Variation Analysis

Drago Plecko

Abstract

Relationships of cause and effect are of prime importance for explaining scientific phenomena. Often, rather than just understanding the effects of causes, researchers also wish to understand how a cause $X$ affects an outcome $Y$ mechanistically -- i.e., what are the causal pathways that are activated between $X$ and $Y$. For analyzing such questions, a range of methods has been developed over decades under the rubric of causal mediation analysis. Traditional mediation analysis focuses on decomposing the average treatment effect (ATE) into direct and indirect effects, and therefore focuses on the ATE as the central quantity. This corresponds to providing explanations for associations in the interventional regime, such as when the treatment $X$ is randomized. Commonly, however, it is of interest to explain associations in the observational regime, and not just in the interventional regime. In this paper, we introduce \text{variation analysis}, an extension of mediation analysis that focuses on the total variation (TV) measure between $X$ and $Y$, written as $\mathrm{E}[Y \mid X=x_1] - \mathrm{E}[Y \mid X=x_0]$. The TV measure encompasses both causal and confounded effects, as opposed to the ATE which only encompasses causal (direct and mediated) variations. In this way, the TV measure is suitable for providing explanations in the natural regime and answering questions such as ``why is $X$ associated with $Y$?''. Our focus is on decomposing the TV measure, in a way that explicitly includes direct, indirect, and confounded variations. Furthermore, we also decompose the TV measure to include interaction terms between these different pathways. Subsequently, interaction testing is introduced, involving hypothesis tests to determine if interaction terms are significantly different from zero. If interactions are not significant, more parsimonious decompositions of the TV measure can be used.

Interaction Testing in Variation Analysis

Abstract

Relationships of cause and effect are of prime importance for explaining scientific phenomena. Often, rather than just understanding the effects of causes, researchers also wish to understand how a cause affects an outcome mechanistically -- i.e., what are the causal pathways that are activated between and . For analyzing such questions, a range of methods has been developed over decades under the rubric of causal mediation analysis. Traditional mediation analysis focuses on decomposing the average treatment effect (ATE) into direct and indirect effects, and therefore focuses on the ATE as the central quantity. This corresponds to providing explanations for associations in the interventional regime, such as when the treatment is randomized. Commonly, however, it is of interest to explain associations in the observational regime, and not just in the interventional regime. In this paper, we introduce \text{variation analysis}, an extension of mediation analysis that focuses on the total variation (TV) measure between and , written as . The TV measure encompasses both causal and confounded effects, as opposed to the ATE which only encompasses causal (direct and mediated) variations. In this way, the TV measure is suitable for providing explanations in the natural regime and answering questions such as ``why is associated with ?''. Our focus is on decomposing the TV measure, in a way that explicitly includes direct, indirect, and confounded variations. Furthermore, we also decompose the TV measure to include interaction terms between these different pathways. Subsequently, interaction testing is introduced, involving hypothesis tests to determine if interaction terms are significantly different from zero. If interactions are not significant, more parsimonious decompositions of the TV measure can be used.

Paper Structure

This paper contains 31 sections, 7 theorems, 111 equations, 9 figures, 3 tables, 2 algorithms.

Key Result

Proposition 1

Counterfactual and factual contrasts admit the following structural basis expansions, respectively:

Figures (9)

  • Figure 1: Causal diagrams for Ex. \ref{['ex:intro-decompositions']}.
  • Figure 2: (a) Overview of the building blocks appearing in Sec. \ref{['sec:ia-analysis']}; (b) Markovian causal diagram with mediators $W$ and confounders $Z$ used in the paper.
  • Figure 3: Causal diagrams used in Ex. \ref{['ex:str-ia-crit']}.
  • Figure 4: Distribution of p-values for interaction testing.
  • Figure 5: Assessing structural interaction of direct and spurious effects (Str-DE-SE).
  • ...and 4 more figures

Theorems & Definitions (30)

  • Example 1: Total Effect and Total Variation Decompositions
  • Definition 1: Structural Causal Model (SCM) pearl:2k
  • Definition 2: Observational Distribution bareinboim2020on
  • Definition 3: Submodel pearl:2k
  • Definition 4: Potential Outcome / Response rubin1974estimatingpearl:2k
  • Definition 5: Counterfactual Distributions bareinboim2020on
  • Definition 6: Causal Diagram pearl:2kbareinboim2020on
  • Definition 7: Contrasts plevcko2024causal
  • Proposition 1: Structural Basis Expansion plevcko2024causal
  • Definition 8: Structural Interaction Criteria
  • ...and 20 more