Learning Discrete Latent Variable Structures with Tensor Rank Conditions
Zhengming Chen, Ruichu Cai, Feng Xie, Jie Qiao, Anpeng Wu, Zijian Li, Zhifeng Hao, Kun Zhang
TL;DR
The paper addresses learning causal structure among unobserved discrete variables by establishing a tensor rank condition that ties the rank of observed joint contingency tensors to d-separation in the latent graph. It defines the Discrete Latent Structure Model (Discrete LSM) with measurement and structure submodels and key assumptions, then develops a two-stage identification algorithm: first locate latent variables via tensor-rank–driven causal clusters to identify the measurement model, then recover the latent-structure graph using a PC-style algorithm (PC-TENSOR-RANK) based on tensor-rank conditional independence tests. Practical procedures for estimating latent-dimension and testing tensor rank are provided via the CR statistic and a CP-decomposition-based goodness-of-fit test. Simulation studies and real-data analyses demonstrate improved latent-cluster recovery and structure discovery over baselines, confirming the method’s ability to identify non-tree, discrete latent-structure models and extend causal discovery with latent variables. This approach offers a principled, algebraic route to identifiability in discrete latent-variable causal discovery with pragmatic testing procedures.
Abstract
Unobserved discrete data are ubiquitous in many scientific disciplines, and how to learn the causal structure of these latent variables is crucial for uncovering data patterns. Most studies focus on the linear latent variable model or impose strict constraints on latent structures, which fail to address cases in discrete data involving non-linear relationships or complex latent structures. To achieve this, we explore a tensor rank condition on contingency tables for an observed variable set $\mathbf{X}_p$, showing that the rank is determined by the minimum support of a specific conditional set (not necessary in $\mathbf{X}_p$) that d-separates all variables in $\mathbf{X}_p$. By this, one can locate the latent variable through probing the rank on different observed variables set, and further identify the latent causal structure under some structure assumptions. We present the corresponding identification algorithm and conduct simulated experiments to verify the effectiveness of our method. In general, our results elegantly extend the identification boundary for causal discovery with discrete latent variables and expand the application scope of causal discovery with latent variables.
