uGMM-NN: Univariate Gaussian Mixture Model Neural Network
Zakeria Sharif Ali
TL;DR
The paper introduces uGMM-NN, a feedforward architecture in which every neuron outputs the log-density of a univariate Gaussian mixture, enabling multimodal representations and uncertainty quantification within deep networks. It demonstrates competitive discriminative performance on MNIST and Iris, and shows how probabilistic, interpretable activations can be integrated into CNNs by replacing dense layers with uGMM units. The approach links neural computation with probabilistic modeling, drawing connections to probabilistic circuits and interpretability frameworks, while highlighting potential for generative-inference extensions and uncertainty-aware design. Future work focuses on scalable MPE inference, extensions to sequential architectures, and sparsity-driven efficiency to enable larger-scale and multimodal applications.
Abstract
This paper introduces the Univariate Gaussian Mixture Model Neural Network (uGMM-NN), a novel neural architecture that embeds probabilistic reasoning directly into the computational units of deep networks. Unlike traditional neurons, which apply weighted sums followed by fixed non-linearities, each uGMM-NN node parameterizes its activations as a univariate Gaussian mixture, with learnable means, variances, and mixing coefficients. This design enables richer representations by capturing multimodality and uncertainty at the level of individual neurons, while retaining the scalability of standard feed-forward networks. We demonstrate that uGMM-NN can achieve competitive discriminative performance compared to conventional multilayer perceptrons, while additionally offering a probabilistic interpretation of activations. The proposed framework provides a foundation for integrating uncertainty-aware components into modern neural architectures, opening new directions for both discriminative and generative modeling.
