Parameter Estimation in DAGs from Incomplete Data via Optimal Transport
Vy Vo, Trung Le, Tung-Long Vuong, He Zhao, Edwin Bonilla, Dinh Phung
TL;DR
This work introduces OTP-DAG, an optimal-transport-based framework for parameter learning in directed acyclic graphical models with latent variables. By recasting learning as minimizing a Wasserstein distance between data and model distributions and employing backward maps from observed nodes to their parents, the approach yields a tractable, end-to-end objective that extends Wasserstein auto-encoders to general DAGs. The authors provide theoretical justification (via a key OT theorem) and extensive empirical evidence across LDA, HMMs, and discrete representation learning, showing robust parameter recovery and competitive downstream performance versus EM and VI baselines. OTP-DAG offers a scalable, flexible alternative to likelihood-based methods, with potential for broader applicability to complex graphical models and future structure-learning tasks.
Abstract
Estimating the parameters of a probabilistic directed graphical model from incomplete data is a long-standing challenge. This is because, in the presence of latent variables, both the likelihood function and posterior distribution are intractable without assumptions about structural dependencies or model classes. While existing learning methods are fundamentally based on likelihood maximization, here we offer a new view of the parameter learning problem through the lens of optimal transport. This perspective licenses a general framework that operates on any directed graphs without making unrealistic assumptions on the posterior over the latent variables or resorting to variational approximations. We develop a theoretical framework and support it with extensive empirical evidence demonstrating the versatility and robustness of our approach. Across experiments, we show that not only can our method effectively recover the ground-truth parameters but it also performs comparably or better than competing baselines on downstream applications.
