Dimension-independent rates for structured neural density estimation

Robert A. Vandermeulen; Wai Ming Tai; Bryon Aragam

Dimension-independent rates for structured neural density estimation

Robert A. Vandermeulen, Wai Ming Tai, Bryon Aragam

TL;DR

A novel justification for deep learning's ability to circumvent the curse of dimensionality is provided, demonstrating dimension-independent convergence rates in these contexts of image, sound, video, and text data.

Abstract

We show that deep neural networks achieve dimension-independent rates of convergence for learning structured densities such as those arising in image, audio, video, and text applications. More precisely, we demonstrate that neural networks with a simple $L^2$-minimizing loss achieve a rate of $n^{-1/(4+r)}$ in nonparametric density estimation when the underlying density is Markov to a graph whose maximum clique size is at most $r$, and we provide evidence that in the aforementioned applications, this size is typically constant, i.e., $r=O(1)$. We then establish that the optimal rate in $L^1$ is $n^{-1/(2+r)}$ which, compared to the standard nonparametric rate of $n^{-1/(2+d)}$, reveals that the effective dimension of such problems is the size of the largest clique in the Markov random field. These rates are independent of the data's ambient dimension, making them applicable to realistic models of image, sound, video, and text data. Our results provide a novel justification for deep learning's ability to circumvent the curse of dimensionality, demonstrating dimension-independent convergence rates in these contexts.

Dimension-independent rates for structured neural density estimation

TL;DR

Abstract

-minimizing loss achieve a rate of

in nonparametric density estimation when the underlying density is Markov to a graph whose maximum clique size is at most

, and we provide evidence that in the aforementioned applications, this size is typically constant, i.e.,

. We then establish that the optimal rate in

which, compared to the standard nonparametric rate of

, reveals that the effective dimension of such problems is the size of the largest clique in the Markov random field. These rates are independent of the data's ambient dimension, making them applicable to realistic models of image, sound, video, and text data. Our results provide a novel justification for deep learning's ability to circumvent the curse of dimensionality, demonstrating dimension-independent convergence rates in these contexts.

Dimension-independent rates for structured neural density estimation

TL;DR

Abstract

Dimension-independent rates for structured neural density estimation

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (33)