On the (Non) Injectivity of Piecewise Linear Janossy Pooling
Ilai Reshef, Nadav Dym
TL;DR
The paper investigates whether continuous piecewise linear (CPwL) $k$-ary Janossy pooling can yield injective and bi-Lipschitz multiset-to-vector mappings. It proves a negative result in full generality: for $n>k$ and a domain $C$ containing a line segment, CPwL $k$-ary Janossy pooling is not injective on multisets. It simultaneously provides a positive result on restricted domains with all elements distinct (compact $D$): there exists a CPwL construction for which 1-ary Janossy pooling is injective and bi-Lipschitz with respect to the Wasserstein distance, with complexity depending on the separation constant $R(D)$. The work clarifies the limits of CPwL pooling for injectivity and bi-Lipschitzness, guiding when standard deepsets suffice and when alternative methods (e.g., sorting-based or higher-order pooling) are warranted, particularly depending on how close multiset elements can be. It also highlights practical implications for molecular-like datasets versus highly clustered point clouds.
Abstract
Multiset functions, which are functions that map multisets to vectors, are a fundamental tool in the construction of neural networks for multisets and graphs. To guarantee that the vector representation of the multiset is faithful, it is often desirable to have multiset mappings that are both injective and bi-Lipschitz. Currently, there are several constructions of multiset functions achieving both these guarantees, leading to improved performance in some tasks but often also to higher compute time than standard constructions. Accordingly, it is natural to inquire whether simpler multiset functions achieving the same guarantees are available. In this paper, we make a large step towards giving a negative answer to this question. We consider the family of k-ary Janossy pooling, which includes many of the most popular multiset models, and prove that no piecewise linear Janossy pooling function can be injective. On the positive side, we show that when restricted to multisets without multiplicities, even simple deep-sets models suffice for injectivity and bi-Lipschitzness.
