DAG DECORation: Continuous Optimization for Structure Learning under Hidden Confounding
Samhita Pal, James O'quinn, Kaveh Aryan, Heather Pua, James P. Long, Amir Asiaee
TL;DR
This work addresses learning causal structure from observational data in linear Gaussian SEMs with latent confounding by proposing DECOR, a single differentiable estimator that jointly learns a DAG and a correlated noise model. DECOR relies on identifiability under bow-free structure with a uniform eigenvalue margin on the noise covariance, and optimizes a smooth, likelihood-aligned objective with acyclicity and bow constraints, coupled with a post-hoc bow reconciliation. The method alternates between a NOTEARS-style directed-edge update and a convex noise-update (covariance or precision route), enabling a practical, scalable approach that handles non-pervasive confounding. Empirical results on synthetic benchmarks show DECOR matches or outperforms strong baselines, with adaptive regularization providing robust performance as confounding density grows, highlighting the method’s potential for reliable structure learning in realistic settings where latent confounders are present but not pervasive.
Abstract
We study structure learning for linear Gaussian SEMs in the presence of latent confounding. Existing continuous methods excel when errors are independent, while deconfounding-first pipelines rely on pervasive factor structure or nonlinearity. We propose \textsc{DECOR}, a single likelihood-based and fully differentiable estimator that jointly learns a DAG and a correlated noise model. Our theory gives simple sufficient conditions for global parameter identifiability: if the mixed graph is bow free and the noise covariance has a uniform eigenvalue margin, then the map from $(\B,\OmegaMat)$ to the observational covariance is injective, so both the directed structure and the noise are uniquely determined. The estimator alternates a smooth-acyclic graph update with a convex noise update and can include a light bow complementarity penalty or a post hoc reconciliation step. On synthetic benchmarks that vary confounding density, graph density, latent rank, and dimension with $n<p$, \textsc{DECOR} matches or outperforms strong baselines and is especially robust when confounding is non-pervasive, while remaining competitive under pervasiveness.
