Table of Contents
Fetching ...

Identifiability of latent causal graphical models without pure children

Seunghyun Lee, Yuqi Gu

TL;DR

This work tackles identifiability of causal graphs with binary latent variables under a broad, nonparametric measurement model. It introduces a double triangular condition on the latent-to-observed graph $\Gamma$ that guarantees identifiability of the latent dimension $K$, the latent graph $\Lambda$, and the conditional distributions $\mathbb{P}(H)$ and $\mathbb{P}(X\mid H)$ without relying on pure children or parametric forms. It also derives necessary conditions—three observed children per latent and a subset condition on $\Gamma$—to delineate fundamental limits, and provides a tensor-decomposition-based proof framework (Kruskal's theorem) to establish identifiability under the proposed sufficient conditions. Simulations demonstrate accurate recovery of latent structure under realistic sample sizes, validating the theory. Overall, the results substantially relax prior identifiability requirements and broaden the applicability of latent causal discovery to complex, real-world data with mixed-type observations.

Abstract

This paper considers a challenging problem of identifying a causal graphical model under the presence of latent variables. While various identifiability conditions have been proposed in the literature, they often require multiple pure children per latent variable or restrictions on the latent causal graph. Furthermore, it is common for all observed variables to exhibit the same modality. Consequently, the existing identifiability conditions are often too stringent for complex real-world data. We consider a general nonparametric measurement model with arbitrary observed variable types and binary latent variables, and propose a double triangular graphical condition that guarantees identifiability of the entire causal graphical model. The proposed condition significantly relaxes the popular pure children condition. We also establish necessary conditions for identifiability and provide valuable insights into fundamental limits of identifiability. Simulation studies verify that latent structures satisfying our conditions can be accurately estimated from data.

Identifiability of latent causal graphical models without pure children

TL;DR

This work tackles identifiability of causal graphs with binary latent variables under a broad, nonparametric measurement model. It introduces a double triangular condition on the latent-to-observed graph that guarantees identifiability of the latent dimension , the latent graph , and the conditional distributions and without relying on pure children or parametric forms. It also derives necessary conditions—three observed children per latent and a subset condition on —to delineate fundamental limits, and provides a tensor-decomposition-based proof framework (Kruskal's theorem) to establish identifiability under the proposed sufficient conditions. Simulations demonstrate accurate recovery of latent structure under realistic sample sizes, validating the theory. Overall, the results substantially relax prior identifiability requirements and broaden the applicability of latent causal discovery to complex, real-world data with mixed-type observations.

Abstract

This paper considers a challenging problem of identifying a causal graphical model under the presence of latent variables. While various identifiability conditions have been proposed in the literature, they often require multiple pure children per latent variable or restrictions on the latent causal graph. Furthermore, it is common for all observed variables to exhibit the same modality. Consequently, the existing identifiability conditions are often too stringent for complex real-world data. We consider a general nonparametric measurement model with arbitrary observed variable types and binary latent variables, and propose a double triangular graphical condition that guarantees identifiability of the entire causal graphical model. The proposed condition significantly relaxes the popular pure children condition. We also establish necessary conditions for identifiability and provide valuable insights into fundamental limits of identifiability. Simulation studies verify that latent structures satisfying our conditions can be accurately estimated from data.

Paper Structure

This paper contains 33 sections, 11 theorems, 60 equations, 3 figures, 5 tables, 1 algorithm.

Key Result

Theorem 1

Suppose that the true $\Gamma$-matrix is double triangular. Then, the number of latent variables $K$ is identifiable.

Figures (3)

  • Figure 1: Example of an identifiable causal graphical structure. Here, $H = \{H_1, H_2, H_3\}$ denotes the unobserved hidden/latent variables, and $X = \{X_1, \ldots, X_{8}\}$ denotes the observed responses. The edges of the DAG are partitioned into the latent graph $\Lambda$ (indicated by the dotted red arrows) and the latent-to-observed bipartite graph $\Gamma$ (indicated by the solid black arrows).
  • Figure 2: Counterexamples of three measurements per latent.
  • Figure S.1: Box plots of the SHD values corresponding to \ref{['tab:shd']}, where $N = 5000$. Smaller is better.

Theorems & Definitions (38)

  • Definition 1: BLCM
  • Remark 1: Comparison with assumptions in the literature
  • Definition 2: Identifiability
  • Definition 3: Triangular $\Gamma$-matrix
  • Example 1
  • Remark 2: Comparison with the pure children condition
  • Remark 3: Verifying the double triangular condition
  • Theorem 1: Identifying $K$
  • Lemma 1
  • Remark 4: Allowing alternative models with more latent variables
  • ...and 28 more