Nonlinear dynamics of localization in neural receptive fields
Leon Lufkin, Andrew M. Saxe, Erin Grant
TL;DR
This work tackles the problem of why neural receptive fields localize in early processing stages by deriving an analytical, data-driven learning dynamics for a minimal nonlinear neuron trained on naturalistic inputs. The authors develop an early-time gradient-flow model, where the localization amplifier $oldsymbol{c}$ depends on the marginal statistics of the input, and show that sufficient negative excess kurtosis in the marginals promotes localized receptive fields, while high positive kurtosis suppresses localization. They further show that elliptical distributions produce nonlocalized, sinusoidal weight states, highlighting the limits of non-Gaussianity as a localization driver. Extending to multi-neuron networks and ICA, they demonstrate both the generality and the constraints of their mechanism, illustrating a data-statistics–driven route to localization that does not require explicit efficiency constraints.
Abstract
Localized receptive fields -- neurons that are selective for certain contiguous spatiotemporal features of their input -- populate early sensory regions of the mammalian brain. Unsupervised learning algorithms that optimize explicit sparsity or independence criteria replicate features of these localized receptive fields, but fail to explain directly how localization arises through learning without efficient coding, as occurs in early layers of deep neural networks and might occur in early sensory regions of biological systems. We consider an alternative model in which localized receptive fields emerge without explicit top-down efficiency constraints -- a feedforward neural network trained on a data model inspired by the structure of natural images. Previous work identified the importance of non-Gaussian statistics to localization in this setting but left open questions about the mechanisms driving dynamical emergence. We address these questions by deriving the effective learning dynamics for a single nonlinear neuron, making precise how higher-order statistical properties of the input data drive emergent localization, and we demonstrate that the predictions of these effective dynamics extend to the many-neuron setting. Our analysis provides an alternative explanation for the ubiquity of localization as resulting from the nonlinear dynamics of learning in neural circuits.
