Unfolding Tensors to Identify the Graph in Discrete Latent Bipartite Graphical Models
Yuqi Gu
TL;DR
This work addresses identifiability of the loading graph in discrete latent bipartite graphical models, including Noisy-OR networks and Restricted Boltzmann Machines. It introduces a constructive identifiability framework based on unfolding the population tensor into matrices and leveraging rank properties to recover the graph structure and the number of latent variables $K$, under a sparsity condition that each latent is linked to at least two pure observed variables. The approach is agnostic to edge direction, latent dependence, and nonlinearities, and yields a practical certificate for identifiability that extends beyond tree-structured models. It also analyzes identifiability of other model parameters and outlines future work toward finite-sample algorithms and broader model classes, with implications for interpretable science and trustworthy ML.
Abstract
We use a tensor unfolding technique to prove a new identifiability result for discrete bipartite graphical models, which have a bipartite graph between an observed and a latent layer. This model family includes popular models such as Noisy-Or Bayesian networks for medical diagnosis and Restricted Boltzmann Machines in machine learning. These models are also building blocks for deep generative models. Our result on identifying the graph structure enjoys the following nice properties. First, our identifiability proof is constructive, in which we innovatively unfold the population tensor under the model into matrices and inspect the rank properties of the resulting matrices to uncover the graph. This proof itself gives a population-level structure learning algorithm that outputs both the number of latent variables and the bipartite graph. Second, we allow various forms of nonlinear dependence among the variables, unlike many continuous latent variable graphical models that rely on linearity to show identifiability. Third, our identifiability condition is interpretable, only requiring each latent variable to connect to at least two "pure" observed variables in the bipartite graph. The new result not only brings novel advances in algebraic statistics, but also has useful implications for these models' trustworthy applications in scientific disciplines and interpretable machine learning.
