How do correlations shape the landscape of information?
Ching-Peng Huang
TL;DR
The paper investigates how correlations shape the landscape of information about a stimulus $S$ when two observations $X$ and $Y$ have fixed marginals, focusing on the mutual information $I(S:X,Y)$ and its decomposition into synergistic and non-synergistic components. It adopts an algebraic statistics and information-geometry approach, introducing the correlation domain $oldsymbol{ riangle}_P$, the shuffle distribution $Q^0$, and Segre-variety geometry to classify when the information is minimised inside the domain or on its boundary, aided by discriminants and toric-chamber considerations. A key contribution is showing how a series-expansion-based decomposition of $I$ aligns with the BROJA Partial Information Decomposition, providing a translation between these viewpoints and clarifying the roles of $I_{ci}$, $I_{cd}$, and $I_{Q^*}$. The work also uses Gaussian-mixture-model intuition to illustrate the correspondence between covariance structure and information, and outlines open questions and broader implications for interdisciplinary dialogue between mathematics and theoretical neuroscience.
Abstract
We explore a few common models on how correlations affect information. The main model considered is the Shannon mutual information $I(S:R_1,\cdots, R_i)$ over distributions with marginals $P_{S,R_i}$ fixed for each $i$, with the analogy in which $S$ is the stimulus and $R_i$'s are neurons. We work out basic models in details, using algebro-geometric tools to write down discriminants that separate distributions with distinct qualitative behaviours in the probability simplex into toric chambers and evaluate the volumes of them algebraically. As a byproduct, we provide direct translation between a decomposition of mutual information inspired by a series expansion and one from partial information decomposition (PID) problems, characterising the synergistic terms of the former. We hope this paper serves for communication between communities especially mathematics and theoretical neuroscience on the topic. KEYWORDS: information theory, algebraic statistics, mathematical neuroscience, partial information decomposition
