Sum-Product-Set Networks: Deep Tractable Models for Tree-Structured Graphs

Milan Papež; Martin Rektoris; Tomáš Pevný; Václav Šmídl

Sum-Product-Set Networks: Deep Tractable Models for Tree-Structured Graphs

Milan Papež, Martin Rektoris, Tomáš Pevný, Václav Šmídl

TL;DR

This work addresses probabilistic modeling over tree-structured graphs such as JSON/XML by introducing Sum-Product-Set Networks (SPSNs), which extend probabilistic circuits with Random Finite Sets to handle variable-sized trees and heterogeneity. SPSNs define a density $p(T)$ over tree-structured data via a parameterized computational graph $\mathcal{G}$ and a scope function $\psi$, incorporating a set unit to model finite random sets and enabling exact marginal inference. The authors establish tractability under structural constraints, analyze exchangeability properties, and demonstrate competitive performance against intractable neural-network–based models, while offering robustness to missing data through tractable marginalization. This methodology provides a principled, scalable, and exact-inference alternative for learning and querying distributions over structured graphs, with practical impact on domains requiring reliable probabilistic reasoning over JSON/XML-like data. Key steps include constructing SPSN blocks from data schemas, enforcing smoothness and decomposability to guarantee closed-form inference, and leveraging finite random-set cardinalities to capture variable subtree counts, all while proving partial exchangeability and providing empirical evidence on real-world datasets.

Abstract

Daily internet communication relies heavily on tree-structured graphs, embodied by popular data formats such as XML and JSON. However, many recent generative (probabilistic) models utilize neural networks to learn a probability distribution over undirected cyclic graphs. This assumption of a generic graph structure brings various computational challenges, and, more importantly, the presence of non-linearities in neural networks does not permit tractable probabilistic inference. We address these problems by proposing sum-product-set networks, an extension of probabilistic circuits from unstructured tensor data to tree-structured graph data. To this end, we use random finite sets to reflect a variable number of nodes and edges in the graph and to allow for exact and efficient inference. We demonstrate that our tractable model performs comparably to various intractable models based on neural networks.

Sum-Product-Set Networks: Deep Tractable Models for Tree-Structured Graphs

TL;DR

over tree-structured data via a parameterized computational graph

and a scope function

, incorporating a set unit to model finite random sets and enabling exact marginal inference. The authors establish tractability under structural constraints, analyze exchangeability properties, and demonstrate competitive performance against intractable neural-network–based models, while offering robustness to missing data through tractable marginalization. This methodology provides a principled, scalable, and exact-inference alternative for learning and querying distributions over structured graphs, with practical impact on domains requiring reliable probabilistic reasoning over JSON/XML-like data. Key steps include constructing SPSN blocks from data schemas, enforcing smoothness and decomposability to guarantee closed-form inference, and leveraging finite random-set cardinalities to capture variable subtree counts, all while proving partial exchangeability and providing empirical evidence on real-world datasets.

Abstract

Paper Structure (24 sections, 5 theorems, 21 equations, 12 figures, 2 tables, 2 algorithms)

This paper contains 24 sections, 5 theorems, 21 equations, 12 figures, 2 tables, 2 algorithms.

Introduction
Tree-Structured Data
Sum-Product-Set Networks
Tractability
Exchangebility
Related Work
Non-probabilistic models (NPMs).
Intractable probabilistic models (IPMs).
Tractable probabilistic models (TPMs).
Experiments
Conclusion
Probabilistic Circuits
Random Finite Sets
Exchangeability
Proof of \ref{['prop:constrained-permutations']}
...and 9 more sections

Key Result

Proposition 1

(Tractability of SPSNs). Let $p(T)$ be an SPSN satisfying ass:requirements-on-set-node and def:structural-constraints, and let $f(T)$ be a function satisfying def:tractable-function-graph. Then, the integral eq:spsns_integral is tractable and can be computed recursively as follows: where the measure $\nu_u(d\psi_u)$ instantiates itself either as the Lebesgue measure or the counting measure, depen

Figures (12)

Figure 1: Tree-structured graphs. Left: an example of a molecule from the Mutagenesis dataset debnath1991structure encoded in the JSON format. Right: the corresponding, tree-structured graph, $T$ (\ref{['def:data-graph']}), describing relations between the atoms and their properties, and its schema (dashed line), $S$ (\ref{['def:schema']}). Here, $\bigtriangleup$ is the heterogeneous node (green), $\bigtriangledown$ is the homogeneous node (orange), and $\mathbf{x}$ is the leaf node (blue).
Figure 2: Sum-product-set networks. (a) The schema (\ref{['def:schema']}) from the example in \ref{['fig:trees']}. (b) The SPSN network comprises SPSN blocks that correspondingly model the heterogeneous nodes depicted in (a). (c) The SPSN block embodies the computational (sub)graph, $\mathcal{G}$, (\ref{['def:computational-graph']}), where $+$, $\times$, $\lbrace\cdot\rbrace$ and $p$ are the sum unit, product unit, set unit and input unit of $\mathcal{G}$, respectively. The elements of the heterogeneous node in (a) are modeled by the corresponding parts of $\mathcal{G}$ in (c), as displayed in green, orange, and blue.
Figure 3: Missing values. The test accuracy (higher is better) versus the fraction of missing values for the MLP, GRU, LSTM, HMIL, and SPSN networks. It is displayed for the best model, which was selected based on the validation accuracy. The results are averaged over five runs with different initial conditions.
Figure 4: Sum-product-set networks. (a) The tree-structured, heterogeneous, data graph, $T$, (\ref{['def:data-graph']}), and the schema (dashed line), $S$, (\ref{['def:schema']}). Here, $\bigtriangleup$ is the heterogeneous node, $\bigtriangledown$ is the homogeneous node, and $\mathbf{x}$ denotes the leaf node of $T$. (b) The computational graph, $\mathcal{G}$, of an SPSN (\ref{['def:computational-graph']}) designed based on $S$, where $+$, $\times$, $\lbrace\cdot\rbrace$ and $p$ are the sum unit, product unit, set unit and leaf unit of $\mathcal{G}$, respectively. The subtrees of $T$ in (a) are modeled by the corresponding parts of $\mathcal{G}$ in (b), as displayed in green, orange, and blue. The dashed arrow in (b) (the second child in the top sum units) represents the same sub-network as in the first child. In this example, we consider $n_l=1$, $n_s=2$, and $n_p=2$.
Figure 5: Schema. The schema of the dataset.
...and 7 more figures

Theorems & Definitions (23)

Definition 1
Definition 2
Definition 3
Definition 4
Definition 5
Definition 6
Proposition 1
proof
Definition 7
Definition 8
...and 13 more

Sum-Product-Set Networks: Deep Tractable Models for Tree-Structured Graphs

TL;DR

Abstract

Sum-Product-Set Networks: Deep Tractable Models for Tree-Structured Graphs

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (12)

Theorems & Definitions (23)