Table of Contents
Fetching ...

Partially Functional Dynamic Backdoor Diffusion-based Causal Model

Xinwen Liu, Lei Qian, Song Xi Chen, Niansheng Tang

Abstract

Causal inference in spatio-temporal settings is critically hindered by unmeasured confounders with complex spatio-temporal dynamics and the prevalence of multi-resolution data. While diffusion models present a promising avenue for estimating structural causal models, existing approaches are limited by assumptions of causal sufficiency or static confounding, failing to capture the region-specific, temporally dependent nature of real-world latent variables or to directly handle functional variables. We bridge this gap by introducing the Partially Functional Dynamic Backdoor Diffusion-based Causal Model (PFD-BDCM), a unified generative framework designed to simultaneously tackle causal inference with dynamic confounding and functional data. Our approach formalizes a novel structural causal model that captures spatio-temporal dependencies in latent confounders through conditional autoregressive processes, represents functional variables via basis expansion coefficients treated as standard graph nodes, and integrates valid backdoor adjustment into a diffusion-based generative process. We provide theoretical guarantees on the preservation of causal effects under basis expansion and derive error bounds for counterfactual estimates. Experiments on synthetic data and a real-world air pollution case study demonstrate that PFD-BDCM outperforms existing methods across observational, interventional, and counterfactual queries. This work provides a rigorous and practical tool for robust causal inference in complex spatio-temporal systems characterized by non-stationarity and multi-resolution data.

Partially Functional Dynamic Backdoor Diffusion-based Causal Model

Abstract

Causal inference in spatio-temporal settings is critically hindered by unmeasured confounders with complex spatio-temporal dynamics and the prevalence of multi-resolution data. While diffusion models present a promising avenue for estimating structural causal models, existing approaches are limited by assumptions of causal sufficiency or static confounding, failing to capture the region-specific, temporally dependent nature of real-world latent variables or to directly handle functional variables. We bridge this gap by introducing the Partially Functional Dynamic Backdoor Diffusion-based Causal Model (PFD-BDCM), a unified generative framework designed to simultaneously tackle causal inference with dynamic confounding and functional data. Our approach formalizes a novel structural causal model that captures spatio-temporal dependencies in latent confounders through conditional autoregressive processes, represents functional variables via basis expansion coefficients treated as standard graph nodes, and integrates valid backdoor adjustment into a diffusion-based generative process. We provide theoretical guarantees on the preservation of causal effects under basis expansion and derive error bounds for counterfactual estimates. Experiments on synthetic data and a real-world air pollution case study demonstrate that PFD-BDCM outperforms existing methods across observational, interventional, and counterfactual queries. This work provides a rigorous and practical tool for robust causal inference in complex spatio-temporal systems characterized by non-stationarity and multi-resolution data.

Paper Structure

This paper contains 21 sections, 5 theorems, 13 equations, 3 figures, 1 table, 1 algorithm.

Key Result

Theorem 1

Under Assumption ass:smoothness, the causal effect of the functional variable $\textnormal{z}(t)$ on any endogenous variable $\textnormal{x}$ in the infinite-dimensional SCM is equivalent to the causal effect of the finite-dimensional basis coefficient vector $\mathbf{x}^{(z)}$ on $\textnormal{x}$ i

Figures (3)

  • Figure 1: DAG with three nodes (left) and SCM with three exogenous and endogenous nodes (right)
  • Figure 2: PFST-DSCM with 11 exogenous and endogenous nodes (where nodes $\textnormal{x}_{3,ij}$, $\textnormal{x}_{4,ij}$ and $\textnormal{x}_{9,ij}$ are unmeasured confounder nodes with spatial heterogeneity and temporal dependencies, and $\mathbf{x}_{2,ij}$ is functional node corresponding base expansion vector node)
  • Figure 3: MMD Results for All Experiment Types-Quantile Intervention ($10\%,90\%$)

Theorems & Definitions (8)

  • Theorem 1
  • Theorem 2
  • Theorem 3
  • Theorem 4
  • Theorem 5
  • Remark 1
  • Remark 2
  • Remark 3