Information decomposition in complex systems via machine learning
Kieran A. Murphy, Dani S. Bassett
TL;DR
The paper addresses identifying microscale variation most predictive of macroscale behavior in complex systems by leveraging mutual information to relate multiple observables across scales. It introduces a practical distributed information bottleneck (DIB) framework that learns per-input lossy encodings $U_i$ and optimizes $\mathcal{L}_\textnormal{DIB} = \beta \sum_{i=1}^N I(U_i; X_i) - I(\boldsymbol{U}; Y)$ using neural encoders and variational bounds to estimate mutual information. The authors demonstrate the approach on a Boolean circuit and on an amorphous material under shear, showing that the method identifies which microvariables carry macroscale relevance and yields a spectrum of compression schemes that reveal the structure of information flow; for example, in the circuit the most informative inputs emerge in a predictable order, while in glass the most informative density measurements concentrate in inner radial shells, with predictive accuracy improving from ~72% with one bit to over 90% with ~20 bits. Overall, the work provides a scalable, interpretable, information-theoretic tool for connecting microstructure to macroscopic behavior in complex systems, complementing PID with a tractable, ML-driven decomposition that scales to hundreds of inputs.
Abstract
One of the fundamental steps toward understanding a complex system is identifying variation at the scale of the system's components that is most relevant to behavior on a macroscopic scale. Mutual information provides a natural means of linking variation across scales of a system due to its independence of functional relationship between observables. However, characterizing the manner in which information is distributed across a set of observables is computationally challenging and generally infeasible beyond a handful of measurements. Here we propose a practical and general methodology that uses machine learning to decompose the information contained in a set of measurements by jointly optimizing a lossy compression of each measurement. Guided by the distributed information bottleneck as a learning objective, the information decomposition identifies the variation in the measurements of the system state most relevant to specified macroscale behavior. We focus our analysis on two paradigmatic complex systems: a Boolean circuit and an amorphous material undergoing plastic deformation. In both examples, the large amount of entropy of the system state is decomposed, bit by bit, in terms of what is most related to macroscale behavior. The identification of meaningful variation in data, with the full generality brought by information theory, is made practical for studying the connection between micro- and macroscale structure in complex systems.
