Foundation of Calculating Normalized Maximum Likelihood for Continuous Probability Models
Atsushi Suzuki, Kota Fukuzawa, Kenji Yamanishi
TL;DR
This work resolves a long-standing gap by proving that the normalized maximum likelihood (NML) model complexity for continuous probabilistic models can be computed via the same estimator-based integration method previously used in discrete settings. The authors introduce a novel decomposition rooted in the coarea formula from geometric measure theory, replacing ill-suited Lebesgue-based fiber integrals with Hausdorff-measure-based integrals and a non-square Jacobian to account for dimension mismatch between data and parameter spaces. They derive a precise estimator-PDF representation $p[oldsymbol{ heta}_{lat} mu_{ heta}]$ and prove that the parametric complexity $ ext{Comp}_{v}( mu_{ Theta})$ equals $ int p[oldsymbol{ heta}_{lat} mu_{ heta}](oldsymbol{ heta}) v(oldsymbol{ heta}) doldsymbol{L}^{K}(oldsymbol{ heta})$, validating the continuous-case MC calculation. A concrete example with the exponential model illustrates the practical computation of the MC, and the results generalize prior continuous MDL approaches by providing a complete, rigorous proof. The findings have significant implications for model selection in continuous settings, underpinning the reliability of MDL-based criteria in a broad class of problems.
Abstract
The normalized maximum likelihood (NML) code length is widely used as a model selection criterion based on the minimum description length principle, where the model with the shortest NML code length is selected. A common method to calculate the NML code length is to use the sum (for a discrete model) or integral (for a continuous model) of a function defined by the distribution of the maximum likelihood estimator. While this method has been proven to correctly calculate the NML code length of discrete models, no proof has been provided for continuous cases. Consequently, it has remained unclear whether the method can accurately calculate the NML code length of continuous models. In this paper, we solve this problem affirmatively, proving that the method is also correct for continuous cases. Remarkably, completing the proof for continuous cases is non-trivial in that it cannot be achieved by merely replacing the sums in discrete cases with integrals, as the decomposition trick applied to sums in the discrete model case proof is not applicable to integrals in the continuous model case proof. To overcome this, we introduce a novel decomposition approach based on the coarea formula from geometric measure theory, which is essential to establishing our proof for continuous cases.
