Interpretive Efficiency: Information-Geometric Foundations of Data Usefulness
Ronald Katende
TL;DR
This work introduces Interpretive Efficiency, $E(\varphi;N)$, a principled, axiomatic measure of how well task-relevant information passes through an interpretive channel. By linking $E$ to mutual information and Fisher information, the authors establish a rigorous framework with five guiding axioms, provide finite-sample estimation guarantees, and connect to the variational information bottleneck via V-GIB compatibility. Theoretical results (including a local Fisher--geometric expansion) are complemented by controlled synthetic examples and empirical validation on digits and spectral signals, illustrating when compression preserves or degrades interpretive usefulness beyond raw accuracy. Practically, $E$ serves as a diagnostic tool for representation design, highlighting redundancy, robustness, and the geometry of information flow that underpins reliable, interpretable reasoning.
Abstract
Interpretability is central to trustworthy machine learning, yet existing metrics rarely quantify how effectively data support an interpretive representation. We propose Interpretive Efficiency, a normalized, task-aware functional that measures the fraction of task-relevant information transmitted through an interpretive channel. The definition is grounded in five axioms ensuring boundedness, Blackwell-style monotonicity, data-processing stability, admissible invariance, and asymptotic consistency. We relate the functional to mutual information and derive a local Fisher-geometric expansion, then establish asymptotic and finite-sample estimation guarantees using standard empirical-process tools. Experiments on controlled image and signal tasks demonstrate that the measure recovers theoretical orderings, exposes representational redundancy masked by accuracy, and correlates with robustness, making it a practical, theory-backed diagnostic for representation design.
