Causal Discovery of Linear Non-Gaussian Causal Models with Unobserved Confounding
Daniela Schkoda, Elina Robeva, Mathias Drton
TL;DR
This work tackles causal discovery in linear non-Gaussian acyclic models with latent confounding, where the full causal effects are not uniquely identifiable but the latent structure is. It introduces ReLVLiNGAM, a recursive, cumulant-based algorithm that identifies a source and its latent parents, estimates their effects on descendants via polynomial equations, and iteratively removes their influence to recover the entire graph. The approach relies on rank conditions on higher-order cumulant-based matrices to infer the number of latents and to recover a compatible path matrix $B$, with asymptotic correctness when locally $\ell\le p$. In simulations, ReLVLiNGAM achieves competitive performance with overcomplete ICA techniques without requiring the latent count in advance, and it demonstrates robustness to misspecification of the latent bound, offering a practical tool for causal effect identification in the presence of latent confounding.
Abstract
We consider linear non-Gaussian structural equation models that involve latent confounding. In this setting, the causal structure is identifiable, but, in general, it is not possible to identify the specific causal effects. Instead, a finite number of different causal effects result in the same observational distribution. Most existing algorithms for identifying these causal effects use overcomplete independent component analysis (ICA), which often suffers from convergence to local optima. Furthermore, the number of latent variables must be known a priori. To address these issues, we propose an algorithm that operates recursively rather than using overcomplete ICA. The algorithm first infers a source, estimates the effect of the source and its latent parents on their descendants, and then eliminates their influence from the data. For both source identification and effect size estimation, we use rank conditions on matrices formed from higher-order cumulants. We prove asymptotic correctness under the mild assumption that locally, the number of latent variables never exceeds the number of observed variables. Simulation studies demonstrate that our method achieves comparable performance to overcomplete ICA even though it does not know the number of latents in advance.
