Towards Identifiable Latent Additive Noise Models
Yuhang Liu, Zhen Zhang, Dong Gong, Erdun Gao, Biwei Huang, Mingming Gong, Anton van den Hengel, Kun Zhang, Javen Qinfeng Shi
TL;DR
This paper tackles identifiability in causal representation learning by exploiting changes in latent causal influences across environments via a surrogate variable $\\mathbf{u}$. It develops a general framework based on latent additive noise models (with exponential-family noise) and proves complete identifiability up to permutation and scaling under a lambda constraint, plus a partial identifiability regime when only a subset of influences changes; it further extends these results to latent post-nonlinear models. A practical ELBO-based learning method is proposed to recover latent causal graphs, enforcing a causal order and sparsity to reveal structure, with theoretical guarantees guiding inference. Empirical validation spans synthetic data, semi-synthetic fMRI data, and real human motion datasets, demonstrating accurate latent graphs, meaningful interventions, and superior performance of MLPl-based approaches over polynomial or linear baselines. Collectively, the work broadens identifiable CRL capabilities to more realistic nonlinear and heterogeneous settings, with clear implications for neuroscience and biomechanics datasets where task- or environment-driven shifts occur.
Abstract
Causal representation learning (CRL) offers the promise of uncovering the underlying causal model by which observed data was generated, but the practical applicability of existing methods remains limited by the strong assumptions required for identifiability and by challenges in applying them to real-world settings. Most current approaches are applicable only to relatively restrictive model classes, such as linear or polynomial models, which limits their flexibility and robustness in practice. One promising approach to this problem seeks to address these issues by leveraging changes in causal influences among latent variables. In this vein we propose a more general and relaxed framework than typically applied, formulated by imposing constraints on the function classes applied. Within this framework, we establish partial identifiability results under weaker conditions, including scenarios where only a subset of causal influences change. We then extend our analysis to a broader class of latent post-nonlinear models. Building on these theoretical insights, we develop a flexible method for learning latent causal representations. We demonstrate the effectiveness of our approach on synthetic and semi-synthetic datasets, and further showcase its applicability in a case study on human motion analysis, a complex real-world domain that also highlights the potential to broaden the practical reach of identifiable CRL models.
