Identifying Linearly-Mixed Causal Representations from Multi-Node Interventions
Simon Bing, Urmi Ninad, Jonas Wahl, Jakob Runge
TL;DR
The paper tackles identifiability in causal representation learning when latent variables are linearly mixed and subjected to multi-node interventions across environments. It introduces a variance-density sparsity criterion on the trace left by interventions and proves that, under diverse interventional coverage and injective mixing, latent factors are identifiable up to permutation and rescaling. A practical SGD-based algorithm implements this idea via a five-term loss that enforces variance sparsity, environment- and dimension-wise variance constraints, diagonal sparsity, and norm stabilization, enabling recovery of the ground-truth causal factors in synthetic data. Empirically, the method achieves near-perfect recovery across varying latent dimensions, graph densities, and even nonlinear SCMs, demonstrating robustness and potential applicability as a modular component in causal representation pipelines.
Abstract
The task of inferring high-level causal variables from low-level observations, commonly referred to as causal representation learning, is fundamentally underconstrained. As such, recent works to address this problem focus on various assumptions that lead to identifiability of the underlying latent causal variables. A large corpus of these preceding approaches consider multi-environment data collected under different interventions on the causal model. What is common to virtually all of these works is the restrictive assumption that in each environment, only a single variable is intervened on. In this work, we relax this assumption and provide the first identifiability result for causal representation learning that allows for multiple variables to be targeted by an intervention within one environment. Our approach hinges on a general assumption on the coverage and diversity of interventions across environments, which also includes the shared assumption of single-node interventions of previous works. The main idea behind our approach is to exploit the trace that interventions leave on the variance of the ground truth causal variables and regularizing for a specific notion of sparsity with respect to this trace. In addition to and inspired by our theoretical contributions, we present a practical algorithm to learn causal representations from multi-node interventional data and provide empirical evidence that validates our identifiability results.
