Including local feature interactions in deep non-negative matrix factorization networks improves performance
Mahbod Nouri, David Rotermund, Alberto Garcia-Ortiz, Klaus R. Pawelzik
TL;DR
This paper argues that incorporating biologically inspired, non-negative factorization with local feature mixing enhances deep vision models. By embedding CNMF modules and 1×1 convolutions, the approach captures positive long-range interactions while modeling local inhibition, and it employs an approximate back-propagation scheme to manage the iterative NMF computations. Empirical results on CIFAR-10 show that CNMF combined with 1×1 mixing can outperform similarly sized CNNs, with peak performance achieved when CNN and NMF parameter counts are balanced. The work bridges biological and artificial neural computation, offering a plausible, more interpretable alternative that preserves performance while aligning with cortical processing principles.
Abstract
The brain uses positive signals as a means of signaling. Forward interactions in the early visual cortex are also positive, realized by excitatory synapses. Only local interactions also include inhibition. Non-negative matrix factorization (NMF) captures the biological constraint of positive long-range interactions and can be implemented with stochastic spikes. While NMF can serve as an abstract formalization of early neural processing in the visual system, the performance of deep convolutional networks with NMF modules does not match that of CNNs of similar size. However, when the local NMF modules are each followed by a module that mixes the NMF's positive activities, the performances on the benchmark data exceed that of vanilla deep convolutional networks of similar size. This setting can be considered a biologically more plausible emulation of the processing in cortical (hyper-)columns with the potential to improve the performance of deep networks.
