Direct sampling from conditional distributions by sequential maximum likelihood estimations
Shuhei Mano
TL;DR
This work addresses direct sampling from the conditional distribution of discrete log-affine models by revealing that the sampling transition equals the UMVUE ratio to the total count, a computation dominated by $Z_A(b;x)$. It introduces an approximate direct sampling algorithm that substitutes the UMVUE with the MLE, enabling practical sampling without solving holonomic or Gröbner-basis problems. It proves that the UMVUE and the rational MLE coincide iff the model is decomposable (in which case the MLE is rational), but not in general, and demonstrates the approach on two-way and three-way contingency-table models. Numerical experiments show the approximate method can outperform Metropolis in certain decomposable cases and remains feasible when exact direct sampling is prohibitive, with asymptotically vanishing bias as sample size grows.
Abstract
We can directly sample from the conditional distribution of any log-affine model. The algorithm is a Markov chain on a bounded integer lattice, and its transition probability is the ratio of the UMVUE (uniformly minimum variance unbiased estimator) of the expected counts to the total number of counts. The computation of the UMVUE accounts for most of the computational cost, which makes the implementation challenging. Here, we investigated an approximate algorithm that replaces the UMVUE with the MLE (maximum likelihood estimator). Although it is generally not exact, it is efficient and easy to implement; no prior study is required, such as about the connection matrices of the holonomic ideal in the original algorithm.
