Information-Theoretic Measures on Lattices for High-Order Interactions
Zhaolu Liu, Mauricio Barahona, Robert L. Peach
TL;DR
This work addresses the challenge of measuring higher-order interactions in multivariate data, where traditional pairwise or limited-factorisation measures fall short for $d>3$. It develops a lattice-theoretic framework that unifies existing measures and introduces Streitberg Information (SI) defined on the full partition lattice, using the Tsallis-Alpha divergence $D_{\alpha}$ with $\alpha\in(0,1)$ to preserve complex interaction structure; a $k$NN-based estimator provides a consistent, nonparametric approach. The paper shows that SI vanishes exactly when the joint distribution factorises through lower-order terms, validates SI on synthetic datasets including MVGs, XOR, and COPY gates, and demonstrates applicability to stock market interactions, neural decoding, and feature selection. Overall, SI offers a scalable, permutation-invariant tool for detecting and quantifying higher-order statistical dependencies with potential implications for causality and model interpretation.
Abstract
Traditional measures based solely on pairwise associations often fail to capture the complex statistical structure of multivariate data. Existing approaches for identifying information shared among $d>3$ variables are frequently computationally intractable, asymmetric with respect to a target variable, or unable to account for all the ways in which the joint probability distribution can be factorised. Here we present a systematic framework based on lattice theory to derive higher-order information-theoretic measures for multivariate data. Our construction uses lattice and operator function pairs, whereby an operator function is applied over a lattice that represents the algebraic relationships among variables. We show that many commonly used measures can be derived within this framework, yet they fail to capture all interactions for $d>3$, either because they are defined on restricted sublattices, or because the use of the KL divergence as an operator function, a typical choice, leads to undesired disregard of groups of interactions. To fully characterise all interactions among $d$ variables, we introduce the Streitberg Information, which is defined over the full partition lattice and uses generalised divergences (beyond KL) as operator functions. We validate the Streitberg Information on synthetic data, and illustrate its application in detecting complex interactions among stocks, decoding neural signals, and performing feature selection in machine learning.
