Optimal rates for density and mode estimation with expand-and-sparsify representations
Kaushik Sinha, Christopher Tosh
TL;DR
This work studies expand-and-sparsify representations, where a dense input is projected to a high-dimensional space and sparsified to a $k$-sparse binary vector, for non-parametric density and mode estimation on ${\mathcal{S}}^{d-1}$. It introduces a linear-in-representation density estimator ${\hat f}_n$ derived from random projections and a sparse mask, and proves minimax-optimal $\ell_{\infty}$ rates $O\left((\log n / n)^{\frac{\beta}{2\beta+(d-1)}}\right)$ for $(L,\beta)$-smooth densities, with matching lower bounds. For mode estimation, simple algorithms atop the density estimator recover the mode of a unimodal density at rate $\tilde{O}\left(n^{-1/(d+3)}\right)$ and, in the multi-modal setting under mild separation, recover salient modes at the same rate up to logarithmic factors. Empirical results on mixtures of von Mises-Fisher distributions illustrate the method's behavior relative to KDE and kNN-based density estimators as the expansion increases. Overall, the paper connects sparse, high-dimensional representations with classical non-parametric estimation, offering minimax-optimal guarantees and practical procedures for spherical data.
Abstract
Expand-and-sparsify representations are a class of theoretical models that capture sparse representation phenomena observed in the sensory systems of many animals. At a high level, these representations map an input $x \in \mathbb{R}^d$ to a much higher dimension $m \gg d$ via random linear projections before zeroing out all but the $k \ll m$ largest entries. The result is a $k$-sparse vector in $\{0,1\}^m$. We study the suitability of this representation for two fundamental statistical problems: density estimation and mode estimation. For density estimation, we show that a simple linear function of the expand-and-sparsify representation produces an estimator with minimax-optimal $\ell_{\infty}$ convergence rates. In mode estimation, we provide simple algorithms on top of our density estimator that recover single or multiple modes at optimal rates up to logarithmic factors under mild conditions.
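The mapping described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the paper's implementation: the function name `expand_and_sparsify`, the choice of Gaussian projection rows, and the values of `d`, `m`, and `k` are assumptions made here for the example.

```python
import numpy as np


def expand_and_sparsify(x, projections, k):
    """Map x in R^d to a k-sparse binary vector in {0,1}^m.

    `projections` is an (m, d) matrix of random projection directions
    (hypothetical name; the distribution of its rows is an assumption).
    """
    activations = projections @ x  # expand: m linear projections of x
    # Indices of the k largest activations (order within the top k is arbitrary).
    top_k = np.argpartition(activations, -k)[-k:]
    z = np.zeros(projections.shape[0], dtype=np.uint8)
    z[top_k] = 1  # sparsify: keep only the k winners, as binary indicators
    return z


rng = np.random.default_rng(0)
d, m, k = 3, 200, 10  # illustrative sizes with m >> d and k << m

# Assumption: i.i.d. standard Gaussian projection rows.
projections = rng.standard_normal((m, d))

# A point on the unit sphere S^{d-1}.
x = rng.standard_normal(d)
x /= np.linalg.norm(x)

z = expand_and_sparsify(x, projections, k)
```

Here `z` has exactly `k` ones, located at the coordinates whose projections onto `x` are largest. Using `np.argpartition` avoids a full sort, which matters when `m` is large relative to `k`.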
