Surveying the space of descriptions of a composite system with machine learning
Kieran A. Murphy, Yujing Zhang, Dani S. Bassett
TL;DR
The paper reframes multivariate information theory by exploring the continuous space of per-component descriptions of a composite system, treating each description as a channel $U_i$ that encodes information about $X_i$ and optimizing these channels to expose global structure via $TC$ and $\Omega$. It develops a neural-network framework that constrains the total transmitted information $\sum_i I(U_i;X_i)$ while using InfoNCE to approximate remaining mutual-information terms, and employs an adversarial setup to minimize or maximize $I(\mathbf{U};\mathbf{X})$ as required. Through case studies on a 5-spin Ising system, a 4x4 Sudoku board, and 4-gram language statistics, the method identifies extremal descriptions that reveal how system-wide variation arises from component-level variation and can be interpreted via hardened, discrete descriptions. The approach scales to real-world data, remains flexible for continuous variables and diverse information-theoretic targets, and provides a practical toolkit for probing the structure of complex systems by navigating the space of descriptions, with potential to extend beyond $TC$ and $\Omega$ to other information-theoretic quantities.
Abstract
Multivariate information theory provides a general and principled framework for understanding how the components of a complex system are connected. Existing analyses are coarse in nature -- built up from characterizations of discrete subsystems -- and can be computationally prohibitive. In this work, we propose to study the continuous space of possible descriptions of a composite system as a window into its organizational structure. A description consists of specific information conveyed about each of the components, and the space of possible descriptions is equivalent to the space of lossy compression schemes of the components. We introduce a machine learning framework to optimize descriptions that extremize key information theoretic quantities used to characterize organization, such as total correlation and O-information. Through case studies on spin systems, sudoku boards, and letter sequences from natural language, we identify extremal descriptions that reveal how system-wide variation emerges from individual components. By integrating machine learning into a fine-grained information theoretic analysis of composite random variables, our framework opens a new avenues for probing the structure of real-world complex systems.
