Markov bases: a 25 year update
Félix Almendra-Hernández, Jesús A. De Loera, Sonja Petrović
TL;DR
This paper assesses the Markov bases framework for sampling from conditional distributions in discrete exponential families, clarifying 25 years of development since the Fundamental Theorem of Markov Bases. It connects algebraic constructs (Markov bases, toric ideals, and Graver bases) to statistical fibers, the sets of non-negative integer vectors $F(b)=\{u\in\mathbb{Z}^n: Au=b,\ u\ge0\}$, and explores both positive (existence, connectivity) and negative (complexity, restricted fibers, non-negativity relaxations) results. New contributions include results on unbounded relaxation of fibers, the persistence of move complexity under relaxations, polynomial bounds for restricted fibers in certain hierarchical models, and limitations of incomplete move sets, especially in the no-three-way interaction model. The discussion situates algebraic advances within classical statistics and highlights practical strategies (dynamic Markov bases, SIS hybrids, mixing considerations) and software resources for implementing exact conditional tests on contingency tables and related models.
Abstract
In this paper, we evaluate the challenges and best practices associated with the Markov bases approach to sampling from conditional distributions. We provide insights and clarifications 25 years after the publication of the fundamental theorem for Markov bases by Diaconis and Sturmfels. In addition to a literature review, we prove three new results: on the complexity of Markov bases in hierarchical models, on relaxations of the fibers in log-linear models, and on the limitations of partial sets of moves in providing an irreducible Markov chain.
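To make the sampling idea concrete, here is a minimal sketch (not taken from the paper) of a random walk on the fiber of a two-way contingency table with fixed row and column sums. It assumes the independence model, for which the "basic moves" (add 1 to two diagonal cells of a 2 x 2 minor and subtract 1 from the other two, or the reverse) are known to form a Markov basis. The function names `basic_move_step`, `margins`, and `walk` are hypothetical, and the chain below targets the uniform distribution on the fiber rather than the hypergeometric distribution used in exact tests.

```python
import random


def basic_move_step(table, rng):
    """Propose one basic move; stay put if it would leave the fiber."""
    r, c = len(table), len(table[0])  # assumes r >= 2 and c >= 2
    i, k = rng.sample(range(r), 2)    # two distinct rows
    j, l = rng.sample(range(c), 2)    # two distinct columns
    sign = rng.choice((1, -1))
    # The move sign * (e_ij + e_kl - e_il - e_kj) leaves all margins unchanged.
    new = [row[:] for row in table]
    new[i][j] += sign
    new[k][l] += sign
    new[i][l] -= sign
    new[k][j] -= sign
    if min(min(row) for row in new) < 0:
        return table  # reject: a negative cell means we left the fiber
    return new


def margins(table):
    """Row and column sums, i.e. the sufficient statistic b = Au."""
    rows = [sum(row) for row in table]
    cols = [sum(col) for col in zip(*table)]
    return rows, cols


def walk(table, steps, seed=0):
    """Run the chain for a fixed number of steps from the given table."""
    rng = random.Random(seed)
    for _ in range(steps):
        table = basic_move_step(table, rng)
    return table


start = [[3, 1, 2], [0, 4, 1]]
end = walk(start, 1000)
print(end, margins(end) == margins(start))
```

Every table visited has the same margins as the starting table, so the walk never leaves $F(b)$. For models where a full Markov basis is too large to compute, the strategies surveyed in the paper (for example dynamic Markov bases) replace the fixed move set used here.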
