ReLU Networks as Random Functions: Their Distribution in Probability Space
Shreyas Chaudhari, José M. F. Moura
TL;DR
This work treats ReLU networks as conditional affine maps whose realized form depends on the random input distribution. By deriving the activation-pattern PMF via Gaussian orthant probabilities and showing that outputs form a mixture of truncated Gaussians, the authors provide explicit, numerically tractable expressions for both function- and output-level distributions under input uncertainty. They propose a sample-free activation-pattern support approximation to efficiently identify the most probable affine regions, enabling scalable robustness and reliability analyses. The approach is validated on synthetic moons data and real-world datasets (MNIST, Fashion-MNIST), with additional experiments on Jacobian spectra under noisy inputs, highlighting practical applicability to interpretability and uncertainty quantification in piecewise linear networks.
Abstract
This paper presents a novel framework for understanding trained ReLU networks as random, affine functions, where the randomness is induced by the distribution over the inputs. By characterizing the probability distribution of the network's activation patterns, we derive the discrete probability distribution over the affine functions realizable by the network. We extend this analysis to describe the probability distribution of the network's outputs. Our approach provides explicit, numerically tractable expressions for these distributions in terms of Gaussian orthant probabilities. Additionally, we develop approximation techniques to identify the support of affine functions a trained ReLU network can realize for a given distribution of inputs. Our work provides a framework for understanding the behavior and performance of ReLU networks corresponding to stochastic inputs, paving the way for more interpretable and reliable models.
