The Fisher score on the closed simplex
Giovanni Pistone, Fabio Rapallo, Eva Riccomagno
TL;DR
This work extends Fisher score and related information-geometric tools from the open to the closed probability simplex, enabling smooth one-parameter exponential models to hit boundary points where zeros occur. It achieves this by developing an algebraic-statistics framework centered on a fibre bundle of $p$-contrasts and a statistical bundle that encodes velocity via the score, and by introducing exponential transports and natural gradients on the boundary. Key contributions include a boundary-compatible generalization of the Fisher score, a reduced statistical-bundle construction, and a coherent theory of transports and geodesics that preserve or properly modify support. The resulting formalism provides computable, algebraic representations for gradient flows, variational methods, and geometric reasoning in contingency tables and compositional data with zeros, with potential applications to dynamic models and physics-inspired statistical methods.
Abstract
We extend classical analytic tools for finite-state statistical models to allow zero probabilities. Using methods from algebraic statistics and information geometry, we develop a framework in which a smooth statistical model could hit the boundary of the simplex, for example, in contingency tables with non-structural zeros. The central object of our approach is the vector bundle whose fibres are the $p$-contrasts associated to each probability distribution $p$. In this framework, Fisher score and other key statistical concepts, such as entropy for one-dimensional statistical models, admit an algebraic representation also on the boundary of the simplex.
