Probabilistic computation and uncertainty quantification with emerging covariance
Hengyuan Ma, Yang Qi, Li Zhang, Wenlian Lu, Jianfeng Feng
TL;DR
This work introduces the moment neural network (MNN) framework to perform probabilistic computation by propagating only the first two cumulants (mean and covariance) through neural layers, enabling uncertainty quantification without explicit covariance supervision. Central to the approach is the SMUC training rule, which backpropagates only through the mean, letting covariance emerge from nonlinear mean–covariance coupling; theory ties SMUC to stochastic Riemannian gradient descent and establishes convergence and mean/variance learning. Empirically, MNNs accurately match the mean and covariance of stochastic networks and provide informative uncertainty measures (entropy) for in-distribution, out-of-distribution, and adversarial scenarios on MNIST/CIFAR-10, with mixed MNNs offering lower computational cost while preserving uncertainty signaling. The framework suggests a scalable, brain-inspired path to robust, uncertainty-aware AI systems, supported by explicit moment activations and variants for common neuron models, including ReLU, Heaviside, and LIF implementations.
Abstract
Building robust, interpretable, and secure AI system requires quantifying and representing uncertainty under a probabilistic perspective to mimic human cognitive abilities. However, probabilistic computation presents significant challenges for most conventional artificial neural network, as they are essentially implemented in a deterministic manner. In this paper, we develop an efficient probabilistic computation framework by truncating the probabilistic representation of neural activation up to its mean and covariance and construct a moment neural network that encapsulates the nonlinear coupling between the mean and covariance of the underlying stochastic network. We reveal that when only the mean but not the covariance is supervised during gradient-based learning, the unsupervised covariance spontaneously emerges from its nonlinear coupling with the mean and faithfully captures the uncertainty associated with model predictions. Our findings highlight the inherent simplicity of probabilistic computation by seamlessly incorporating uncertainty into model prediction, paving the way for integrating it into large-scale AI systems.
