Optimal Transport for Probabilistic Circuits
Adrian Ciotinga, YooJung Choi
TL;DR
This work tackles the challenge of computing transport-based distances between probabilistic circuits (PCs) by introducing a CW_p distance that restricts couplings to a coupling circuit, enabling exact, quadratic-time computation for compatible PCs. It develops an efficient recursive algorithm to compute CW_p and extract the associated transport plan, and shows that CW_p upper-bounds the true Wasserstein distance while remaining tractable where classical OT solvers fail on PCs. To enable learning, the paper proposes Empirical Circuit Wasserstein (ECW) and an iterative Wasserstein-Minimization (WM) method that alternates between optimizing the coupling and updating PC parameters, offering a practical alternative to maximum-likelihood training. Empirical results on synthetic PCs, MNIST-classified circuits, and a color-transfer task demonstrate CW_p’s scalability, its utility as a proxy for true OT, and the viability of Wasserstein-based PC parameter learning, with stochastic variants improving optimization in larger models.
Abstract
We introduce a novel optimal transport framework for probabilistic circuits (PCs). While it has been shown recently that divergences between distributions represented as certain classes of PCs can be computed tractably, to the best of our knowledge, there is no existing approach to compute the Wasserstein distance between probability distributions given by PCs. We propose a Wasserstein-type distance that restricts the coupling measure of the associated optimal transport problem to be a probabilistic circuit. We then develop an algorithm for computing this distance by solving a series of small linear programs and derive the circuit conditions under which this is tractable. Furthermore, we show that we can easily retrieve the optimal transport plan between the PCs from the solutions to these linear programs. Lastly, we study the empirical Wasserstein distance between a PC and a dataset, and show that we can estimate the PC parameters to minimize this distance through an efficient iterative algorithm.
