On the Identifiability of Causal Abstractions
Xiusi Li, Sékou-Oumar Kaba, Siamak Ravanbakhsh
TL;DR
This work addresses identifiability in causal representation learning when only counterfactual data pairs are available and interventions target arbitrary subsets of latent variables. It introduces a theoretical framework to identify latent causal models up to abstractions by leveraging non-descendant structures and graph quotient constructions, yielding an acyclic quotient graph and identifiable latent blocks. The key contributions include (i) proving identifiability up to an SCM abstraction determined by non-descendant intervention sets, and (ii) showing additional latents can be disentangled under singleton non-descendant conditions, all under mild assumptions like faithfulness and absolute continuity. The results provide a principled route to learn causally meaningful abstractions from weak supervision, with implications for scalable, interpretable reasoning in CRL-informed systems, while acknowledging practical limitations and future work on scalability and empirical methodology.
Abstract
Causal representation learning (CRL) enhances machine learning models' robustness and generalizability by learning structural causal models associated with data-generating processes. We focus on a family of CRL methods that uses contrastive data pairs in the observable space, generated before and after a random, unknown intervention, to identify the latent causal model. (Brehmer et al., 2022) showed that this is indeed possible, given that all latent variables can be intervened on individually. However, this is a highly restrictive assumption in many systems. In this work, we instead assume interventions on arbitrary subsets of latent variables, which is more realistic. We introduce a theoretical framework that calculates the degree to which we can identify a causal model, given a set of possible interventions, up to an abstraction that describes the system at a higher level of granularity.
