Convergence rates for estimating multivariate scale mixtures of uniform densities
Arlene K. H. Kim, Gil Kur, Adityanand Guntuboyina
TL;DR
This work extends the Grenander density-estimation paradigm to multivariate scale mixtures of uniforms, proving that the multivariate SMU-MLE $\hat p_{n,d}^{\mathrm{SMU}}$ attains a Hellinger risk of order $n^{-2/3}$ up to a dimension-dependent log factor $\gamma_d=4d-2$, under compact support, an upper bound, and a lower-bound relaxing assumption on $p_0$. A general Hellinger-accuracy theorem for MLEs over convex density classes (Theorem hellexp) reduces estimation risk to empirical-process bounds and entropy, a core methodological advance. Bracketing-entropy bounds for SMU subclasses, derived via distribution-function entropies of nonnegative measures, enable the main rates, a minimax lower bound $\inf_{\tilde p_n}\sup_{p_0} \mathbb E h^2(p_0,\tilde p_n) \ge c_d n^{-2/3}(\log n)^{(d-1)/3}$, and adaptive results when $p_0$ is piecewise constant on rectangles. Computationally, the authors provide exact and approximate SMU-MLE algorithms and demonstrate their use on real and simulated data, including leukemia gene-expression p-values, illustrating a mixture of a Uniform component with mass near the origin.
Abstract
The Grenander estimator is a well-studied procedure for univariate nonparametric density estimation. It is usually defined as the Maximum Likelihood Estimator (MLE) over the class of all non-increasing densities on the positive real line. It can also be seen as the MLE over the class of all scale mixtures of uniform densities. Using the latter viewpoint, Pavlides and Wellner~\cite{pavlides2012nonparametric} proposed a multivariate extension of the Grenander estimator as the nonparametric MLE over the class of all multivariate scale mixtures of uniform densities. We prove that this multivariate estimator achieves the univariate cube root rate of convergence with only a logarithmic multiplicative factor that depends on the dimension. The usual curse of dimensionality is therefore avoided to some extent for this multivariate estimator. This result positively resolves a conjecture of Pavlides and Wellner~\cite{pavlides2012nonparametric} under an additional lower bound assumption. Our proof proceeds via a general accuracy result for the Hellinger accuracy of MLEs over convex classes of densities. We also provide algorithms for computing the estimator, and illustrate performance on real and simulated datasets.
