Table of Contents
Fetching ...

Network Causal Effect Estimation In Graphical Models Of Contagion And Latent Confounding

Yufeng Wu, Rohit Bhattacharya

TL;DR

We address distinguishing contagion from latent confounding in networks under full interference using segregated graphs (SGs) to represent mechanisms. We show identifiability of the target E[Y_i | do(A=a)] and its variants, and develop coding-likelihood based likelihood ratio tests to distinguish contagion versus latent confounding in each layer. We extend auto-g-computation to handle latent confounding, with Gibbs-sampler-based generation of latent variables and a local-structure parameterization for p(L) and p(Y | A, L), ensuring unbiased and consistent network-effect estimates when mechanism inference is correct. We provide network-asymptotic guarantees and validate the methods on synthetic, semi-synthetic, and real networks, highlighting practical conditions under which six-degree-separated samples support reliable testing and estimation.

Abstract

A key question in many network studies is whether the observed correlations between units are primarily due to contagion or latent confounding. Here, we study this question using a segregated graph (Shpitser, 2015) representation of these mechanisms, and examine how uncertainty about the true underlying mechanism impacts downstream computation of network causal effects, particularly under full interference -- settings where we only have a single realization of a network and each unit may depend on any other unit in the network. Under certain assumptions about asymptotic growth of the network, we derive likelihood ratio tests that can be used to identify whether different sets of variables -- confounders, treatments, and outcomes -- across units exhibit dependence due to contagion or latent confounding. We then propose network causal effect estimation strategies that provide unbiased and consistent estimates if the dependence mechanisms are either known or correctly inferred using our proposed tests. Together, the proposed methods allow network effect estimation in a wider range of full interference scenarios that have not been considered in prior work. We evaluate the effectiveness of our methods with synthetic data and the validity of our assumptions using real-world networks.

Network Causal Effect Estimation In Graphical Models Of Contagion And Latent Confounding

TL;DR

We address distinguishing contagion from latent confounding in networks under full interference using segregated graphs (SGs) to represent mechanisms. We show identifiability of the target E[Y_i | do(A=a)] and its variants, and develop coding-likelihood based likelihood ratio tests to distinguish contagion versus latent confounding in each layer. We extend auto-g-computation to handle latent confounding, with Gibbs-sampler-based generation of latent variables and a local-structure parameterization for p(L) and p(Y | A, L), ensuring unbiased and consistent network-effect estimates when mechanism inference is correct. We provide network-asymptotic guarantees and validate the methods on synthetic, semi-synthetic, and real networks, highlighting practical conditions under which six-degree-separated samples support reliable testing and estimation.

Abstract

A key question in many network studies is whether the observed correlations between units are primarily due to contagion or latent confounding. Here, we study this question using a segregated graph (Shpitser, 2015) representation of these mechanisms, and examine how uncertainty about the true underlying mechanism impacts downstream computation of network causal effects, particularly under full interference -- settings where we only have a single realization of a network and each unit may depend on any other unit in the network. Under certain assumptions about asymptotic growth of the network, we derive likelihood ratio tests that can be used to identify whether different sets of variables -- confounders, treatments, and outcomes -- across units exhibit dependence due to contagion or latent confounding. We then propose network causal effect estimation strategies that provide unbiased and consistent estimates if the dependence mechanisms are either known or correctly inferred using our proposed tests. Together, the proposed methods allow network effect estimation in a wider range of full interference scenarios that have not been considered in prior work. We evaluate the effectiveness of our methods with synthetic data and the validity of our assumptions using real-world networks.

Paper Structure

This paper contains 29 sections, 4 theorems, 30 equations, 12 figures, 2 tables, 4 algorithms.

Key Result

Lemma 1

Given a friendship network ${\mathcal{F}}$ and a corresponding partially determined SG ${\mathcal{G}}_{\mathop{\mathrm{\circ---\circ}}\nolimits}$, the following independences hold in each layer under the null hypothesis of contagion but not under the alternative hypothesis of latent confounding, reg

Figures (12)

  • Figure 1: (a) Example of a friendship network ${\mathcal{F}}$ on 10 units. (b) Partially determined SG pattern ${\mathcal{G}}_{\mathop{\mathrm{\circ---\circ}}\nolimits}$ on 3 units (lighter edge colors in the figure are only used to improve readability).
  • Figure 2: Performance of coding likelihood ratio tests with varying sample sizes.
  • Figure 3: Network effect estimation results for our method and auto-g-computation.
  • Figure 4: The graphs shown in (a) and (b) are examples of segregated graphs. The graphs shown in (c), (d), and (e) are not segregated graphs.
  • Figure 5: Gibbs sampler $L$
  • ...and 7 more figures

Theorems & Definitions (4)

  • Lemma 1
  • Lemma 2
  • Theorem 3
  • Lemma 4