Compositional Structures in Neural Embedding and Interaction Decompositions
Matthew Trager, Alessandro Achille, Pramuditha Perera, Luca Zancato, Stefano Soatto
TL;DR
This work addresses the lack of a formal grounding for emergent linear structure in neural embeddings by establishing a precise correspondence between probabilistic dependencies and geometric patterns in embeddings via interaction decompositions. The authors define pure interaction spaces $E_I$ and show that conditional independencies in $P(Y|X)$ are equivalent to orthogonality conditions $\langle {\bm{u}}_I, {\bm{v}}_J\rangle = 0$ for relevant $I,J$, providing necessary and sufficient conditions. Key contributions include a general decomposition framework, projections $Q_I$, and a main theorem that extends prior work to arbitrary factorizations of inputs and outputs, with qualitative examples and a synthetic validation. The framework offers a principled approach for interpretable and controllable representations, connecting to exponential-family geometry and graphical models, with potential impact on understanding and guiding compositional structure in transformers and multimodal models.
Abstract
We describe a basic correspondence between linear algebraic structures within vector embeddings in artificial neural networks and conditional independence constraints on the probability distributions modeled by these networks. Our framework aims to shed light on the emergence of structural patterns in data representations, a phenomenon widely acknowledged but arguably still lacking a solid formal grounding. Specifically, we introduce a characterization of compositional structures in terms of "interaction decompositions," and we establish necessary and sufficient conditions for the presence of such structures within the representations of a model.
