GraphSPNs: Sum-Product Networks Benefit From Canonical Orderings
Milan Papež, Martin Rektoris, Václav Šmídl, Tomáš Pevný
TL;DR
GraphSPNs extend sum-product networks to graphs by padding to a fixed size and enforcing permutation invariance, enabling exact and efficient probabilistic inference over graphs G=(X,A) with varying node counts. They explore exact, canonical (sorting), and approximate invariance via $k$-ary subgraphs and random sampling, plus virtual node-padding to manage variable graph sizes. Empirical results on QM9 show GraphSPNs can generate chemically valid and novel molecules, with the canonical ordering variant delivering the strongest practical performance by reducing multimodality. The work provides a scalable framework for exact probabilistic queries on graphs and offers a principled approach to molecular graph generation and conditional inference.
Abstract
Deep generative models have recently made a remarkable progress in capturing complex probability distributions over graphs. However, they are intractable and thus unable to answer even the most basic probabilistic inference queries without resorting to approximations. Therefore, we propose graph sum-product networks (GraphSPNs), a tractable deep generative model which provides exact and efficient inference over (arbitrary parts of) graphs. We investigate different principles to make SPNs permutation invariant. We demonstrate that GraphSPNs are able to (conditionally) generate novel and chemically valid molecular graphs, being competitive to, and sometimes even better than, existing intractable models. We find out that (Graph)SPNs benefit from ensuring the permutation invariance via canonical ordering.
