Criterion for the resemblance between the mother and the model distribution
Yo Sheena
TL;DR
The paper addresses how to quantify closeness between a mother distribution and a model distribution when the model density is intractable, by introducing a discretization-based criterion using the $Hellinger\ distance$ between sample-derived multinomial bins. It grounds the approach in $f$-divergences and their connection to the Bayes error rate, and develops moving-region discretization to form bins from model or sample data, enabling direct comparison without explicit density forms. A key contribution is a concrete, implementable criterion: if $D[\hat m^{(1)}:\hat m^{(2)}] + \frac{p'}{2n_1} + \sqrt{\frac{8p'}{n_2}} < 8\epsilon^2$ given a target Bayes error rate $0.5-\epsilon$, then the distributions are sufficiently close; the framework also provides bias bounds and asymptotic results to guide sample-size choices. The approach is particularly applicable to evaluating complex generative or Bayesian models (e.g., deep generative methods) where exact densities are unavailable, offering a practical, theoretically grounded tool for model evaluation and selection based on distributional closeness.
Abstract
If the probability distribution model aims to approximate the hidden mother distribution, it is imperative to establish a useful criterion for the resemblance between the mother and the model distributions. This study proposes a criterion that measures the Hellinger distance between discretized (quantized) samples from both distributions. Unlike information criteria such as AIC, this criterion does not require the probability density function of the model distribution, which cannot be explicitly obtained for a complicated model such as a deep learning machine. Second, it can draw a positive conclusion (i.e., both distributions are sufficiently close) under a given threshold, whereas a statistical hypothesis test, such as the Kolmogorov-Smirnov test, cannot genuinely lead to a positive conclusion when the hypothesis is accepted. In this study, we establish a reasonable threshold for the criterion deduced from the Bayes error rate and also present the asymptotic bias of the estimator of the criterion. From these results, a reasonable and easy-to-use criterion is established that can be directly calculated from the two sets of samples from both distributions.
