Linear causal disentanglement via higher-order cumulants
Paula Leyes Carreno, Chiara Meroni, Anna Seigal
TL;DR
The paper studies identifiability in linear causal disentanglement (LCD), where observed variables are a linear mix of latent variables with causal relations. It develops a constructive approach based on coupled tensor decompositions of higher-order cumulants across multiple intervention contexts, proving that one perfect intervention per latent node suffices (and is sometimes necessary) to identify the latent DAG and parameters via a linear system; soft interventions yield only a compatibility class for the latent graph. A practical two-step algorithm recovers intervention targets, permutation, and scaling, then recovers latent parameters, with additional simplifications in the injective setting ($q\le p$). The results rely on non-Gaussianity of latent errors to enable identifiability and illustrate that complete identifiability under soft interventions is impossible in general, though transitive closures can be recovered. Overall, the work extends identifiability theory for causal representations and provides a concrete pipeline for recovering latent causal structure from interventional data.
Abstract
Linear causal disentanglement is a recent method in causal representation learning to describe a collection of observed variables via latent variables with causal dependencies between them. It can be viewed as a generalization of both independent component analysis and linear structural equation models. We study the identifiability of linear causal disentanglement, assuming access to data under multiple contexts, each given by an intervention on a latent variable. We show that one perfect intervention on each latent variable is sufficient and in the worst case necessary to recover parameters under perfect interventions, generalizing previous work to allow more latent than observed variables. We give a constructive proof that computes parameters via a coupled tensor decomposition. For soft interventions, we find the equivalence class of latent graphs and parameters that are consistent with observed data, via the study of a system of polynomial equations. Our results hold assuming the existence of non-zero higher-order cumulants, which implies non-Gaussianity of variables.
