Polynomial Semantics of Tractable Probabilistic Circuits

Oliver Broadrick; Honghua Zhang; Guy Van den Broeck

Polynomial Semantics of Tractable Probabilistic Circuits

Oliver Broadrick, Honghua Zhang, Guy Van den Broeck

TL;DR

The paper addresses how different polynomial encodings of probability distributions in probabilistic circuits relate for distributions over binary variables $X_1,\dots,X_n$. It demonstrates that network $p(x, xbar)$, likelihood $p(x)$, generating $g(x)$, and Fourier hat p(x) representations are polynomial-time transform-equivalent, preserving tractable marginal inference. It leverages Strassen style division-elimination to convert division-containing circuits to division-free forms, enabling efficient marginals across semantics and establishing a unified view of $p(x,xbar)$, $p(x)$, $g(x)$, and $\hat{p}(x)$. The work extends to categorical distributions, showing that for $k \ge 4$ categories inference on PGCs is #P-hard, signaling fundamental limits and guiding future cross-representation methods. Overall, the results unify previously separate marginal-inference approaches and enable learning in one representation to be transformed into others with only polynomial overhead.

Abstract

Probabilistic circuits compute multilinear polynomials that represent multivariate probability distributions. They are tractable models that support efficient marginal inference. However, various polynomial semantics have been considered in the literature (e.g., network polynomials, likelihood polynomials, generating functions, and Fourier transforms). The relationships between circuit representations of these polynomial encodings of distributions is largely unknown. In this paper, we prove that for distributions over binary variables, each of these probabilistic circuit models is equivalent in the sense that any circuit for one of them can be transformed into a circuit for any of the others with only a polynomial increase in size. They are therefore all tractable for marginal inference on the same class of distributions. Finally, we explore the natural extension of one such polynomial semantics, called probabilistic generating circuits, to categorical random variables, and establish that inference becomes #P-hard.

Polynomial Semantics of Tractable Probabilistic Circuits

TL;DR

The paper addresses how different polynomial encodings of probability distributions in probabilistic circuits relate for distributions over binary variables

. It demonstrates that network

, likelihood

, generating

, and Fourier hat p(x) representations are polynomial-time transform-equivalent, preserving tractable marginal inference. It leverages Strassen style division-elimination to convert division-containing circuits to division-free forms, enabling efficient marginals across semantics and establishing a unified view of

, and

. The work extends to categorical distributions, showing that for

categories inference on PGCs is #P-hard, signaling fundamental limits and guiding future cross-representation methods. Overall, the results unify previously separate marginal-inference approaches and enable learning in one representation to be transformed into others with only polynomial overhead.

Abstract

Paper Structure (11 sections, 12 theorems, 27 equations, 3 figures)

This paper contains 11 sections, 12 theorems, 27 equations, 3 figures.

Introduction
Background
Network and likelihood polynomials
Network Polynomials
Likelihood Polynomials
Generating Polynomials
Fourier Transforms
Decomposability
Categorical Distributions
Conclusion
Proofs

Key Result

Proposition 1

Computing marginals on a circuit of size $s$ representing a network polynomial takes $O(s)$ time. For the random variable assignment $X_i=1$, set $x_i=1$ and $\bar{x}_i=0$; for $X_i=0$, set $x_i=0$ and $\bar{x}_i=1$; marginalize out $X_i$ by setting $x_i=\bar{x}_i=1$. Evaluating the circuit now comp

Figures (3)

Figure 1: Polynomial time circuit transformations between polynomial semantics including: likelihood $p(x)$, network $p(x,\bar{x})$, generating $g(x)$, and Fourier $\hat{p}(x)$ polynomials. Previously known transformations are displayed on the left; (2) is given in pgcs, and (3) is implicit in roth. The results in this paper are shown on the right. Edges labeled by * correspond to transformations which map circuits of size $s$ to circuits of size $O(sn^2)$; other edges correspond to transformations which map circuits of size $s$ to circuits of size $O(s)$.
Figure 2: An example transforming a circuit representing a likelihood polynomial $p(x)=0.08x_1x_2+0.16x_1+0.12x_2+0.09$ to a circuit representing a network polynomial. First, (b) gadgets using division nodes are introduced at the leaves (as well as a multiplying factor) to obtain a rational function equivalent to the network polynomial. Then (c:top) all divisions are pushed to a single division node at the root so $p(x,\bar{x})=A/B$, and (c:bottom) a sum over necessary homogeneous parts of $A$ and $B$ is formed.
Figure 3: An example of the permanent-preserving operation used to make $M$ sparse. The new row and column are shaded in blue. The newly-added nonzero entries that preserve the permanent of the matrix are singly-underlined. The two nonzero entries that moved from their original column (highlighted in orange) to the new one are doubly-underlined. The number of nonzero entries in the second column has decreased by one.

Theorems & Definitions (21)

Definition 1
Proposition 1
Proposition 2
proof
Theorem 1: Strassen
Theorem 2
Theorem 3
proof
Lemma 1
Proposition 3
...and 11 more

Polynomial Semantics of Tractable Probabilistic Circuits

TL;DR

Abstract

Polynomial Semantics of Tractable Probabilistic Circuits

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (21)