Optimality of the Half-Order Exponent in the Turing-Good Identities for Bayes Factors
Kensuke Okada
TL;DR
The paper identifies the half-order exponent $t=\tfrac{1}{2}$ as the unique, minimax-stable choice for validating Bayes-factor computations via the Turing--Good identities, by equalizing and bounding the second moments of the two diagnostic sides under mutual absolute continuity. It derives exact variance expressions in terms of the overlap $\rho=I(\tfrac{1}{2})$ and proves that the two-sided, split-sample half-order diagnostic has uniformly finite variance and, in the small-overlap regime, outperforms the standard one-sided Turing check. The work extends the half-order idea to a geometric bridge for normalizing-constant ratios and to importance-sampling diagnostics, where $\rho$ provides a distribution-free, interpretable measure of weight concentration. Simulations on binomial Bayes-factor workflows illustrate stable finite-sample behavior and sensitivity to simulator--evaluator mismatches, and the results motivate a practical default design: a symmetric two-sided half-order check with overlap estimation for robust, cost-aware diagnostics in Bayesian computation. Overall, the half-order framework offers a principled, stable basis for validating and designing ratio-estimation procedures under tail-robustness constraints.
Abstract
Bayes factors are widely computed by Monte Carlo, yet heavy-tailed sampling distributions can make numerical validation unreliable. The Turing--Good identities provide exact moment equalities for powers of a Bayes factor (a density ratio). When these identities are used as Good-check diagnostics, the power choice becomes a statistical design parameter. We develop a nonasymptotic variance theory for Monte Carlo evaluation of the identities and show that the half-order (square-root) power is uniquely minimax-stable: it equalizes variability across the two model orientations and is the only choice that guarantees finite second moments in a distribution-free worst-case sense over all mutually absolutely continuous model pairs. This yields a balanced two-sample half-order diagnostic that is symmetric in model labeling and has a uniform variance bound at fixed computational budget; in small-overlap regimes it is guaranteed to be no less efficient than the standard one-sided Turing check. Simulations for binomial Bayes factor workflows illustrate stable finite-sample behavior and sensitivity to simulator--evaluator mismatches. We further connect the half-order overlap viewpoint to stable primitives for normalizing-constant ratios and importance-sampling degeneracy summaries.
