Causal Representation Learning Made Identifiable by Grouping of Observational Variables
Hiroshi Morioka, Aapo Hyvärinen
TL;DR
This paper addresses identifiability in Causal Representation Learning (CRL) by introducing a grouping of observational variables, enabling identifiability without temporal structure, supervision, or interventions. It proposes Grouped Causal Representation Learning (G-CaRL), a self-supervised framework that learns group-wise inverse mappings to recover latent causal variables and jointly estimate inter-group causal weights, with provable consistency under suitable assumptions. Theoretical results guarantee identifiability of latent variables up to permutations and variable-wise invertible transformations, and identifiability (up to scaling and transpose) of inter-group causal graphs under directed, nondegenerate structures. Empirically, G-CaRL outperforms state-of-the-art baselines across synthetic DAGs and cycles, gene-regulatory-like networks, and high-dimensional image data, while showing robustness to latent confounders and model misspecification. The work broadens CRL applicability by enabling instantaneous, nonlinear, and potentially cyclic interactions, without supervision, interventions, or strict dynamics, making it impactful for multimodal sensing, biology, and neuroscience.
Abstract
A topic of great current interest is Causal Representation Learning (CRL), whose goal is to learn a causal model for hidden features in a data-driven manner. Unfortunately, CRL is severely ill-posed since it is a combination of the two notoriously ill-posed problems of representation learning and causal discovery. Yet, finding practical identifiability conditions that guarantee a unique solution is crucial for its practical applicability. Most approaches so far have been based on assumptions on the latent causal mechanisms, such as temporal causality, or existence of supervision or interventions; these can be too restrictive in actual applications. Here, we show identifiability based on novel, weak constraints, which requires no temporal structure, intervention, nor weak supervision. The approach is based on assuming the observational mixing exhibits a suitable grouping of the observational variables. We also propose a novel self-supervised estimation framework consistent with the model, prove its statistical consistency, and experimentally show its superior CRL performances compared to the state-of-the-art baselines. We further demonstrate its robustness against latent confounders and causal cycles.
