Exponential expressivity in deep neural networks through transient chaos

Ben Poole; Subhaneil Lahiri; Maithra Raghu; Jascha Sohl-Dickstein; Surya Ganguli

Exponential expressivity in deep neural networks through transient chaos

Ben Poole, Subhaneil Lahiri, Maithra Raghu, Jascha Sohl-Dickstein, Surya Ganguli

TL;DR

The paper tackles how depth affects neural-network expressivity by exposing an order-to-chaos transition in signal propagation of random deep nets. It develops a mean-field–geometric framework that tracks length and curvature across layers via a length map $q^l$ and a correlation map $C$, predicting exponential growth of global curvature in depth within the chaotic regime. It shows that deep networks can disentangle curved input manifolds into flat hidden representations and that curvature grows exponentially with depth, while shallow networks cannot match this expressivity. This provides a quantitative null model and a geometric lens for understanding deep-function expressivity across arbitrary nonlinearities.

Abstract

We combine Riemannian geometry with the mean field theory of high dimensional chaos to study the nature of signal propagation in generic, deep neural networks with random weights. Our results reveal an order-to-chaos expressivity phase transition, with networks in the chaotic phase computing nonlinear functions whose global curvature grows exponentially with depth but not width. We prove this generic class of deep random functions cannot be efficiently computed by any shallow network, going beyond prior work restricted to the analysis of single functions. Moreover, we formalize and quantitatively demonstrate the long conjectured idea that deep networks can disentangle highly curved manifolds in input space into flat manifolds in hidden space. Our theoretical analysis of the expressive power of deep networks broadly applies to arbitrary nonlinearities, and provides a quantitative underpinning for previously abstract notions about the geometry of deep functions.

Exponential expressivity in deep neural networks through transient chaos

TL;DR

Abstract

Exponential expressivity in deep neural networks through transient chaos

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (1)