Deep Continuous Networks
Nergis Tomen, Silvia L. Pintea, Jan C. van Gemert
TL;DR
The paper addresses the mismatch between conventional CNNs and biological vision by introducing Deep Continuous Networks (DCNs) that couple spatially continuous filters with depthwise continuous dynamics via neural ODEs. Filters are defined as Gaussian derivative SRFs with trainable coefficients $\alpha$ and scale $\sigma$, enabling end-to-end learning of both filter shape and spatial extent, while the network depth is treated as a continuous dimension $t$. Empirically, DCNs achieve competitive CIFAR-10 performance with fewer parameters, exhibit data efficiency in small-data settings, and demonstrate improved reconstruction quality; they also reveal biologically plausible scale distributions and support pattern completion, with potential computational savings through contrast-driven time scaling. The work provides a principled bridge between neuroscience-inspired continuous models and modern CNNs, offering a framework for brain-like receptive-field dynamics, efficient parameterization, and new avenues for neuroscientific investigations and energy-efficient deep learning.
Abstract
CNNs and computational models of biological vision share some fundamental principles, which opened new avenues of research. However, fruitful cross-field research is hampered by conventional CNN architectures being based on spatially and depthwise discrete representations, which cannot accommodate certain aspects of biological complexity such as continuously varying receptive field sizes and dynamics of neuronal responses. Here we propose deep continuous networks (DCNs), which combine spatially continuous filters, with the continuous depth framework of neural ODEs. This allows us to learn the spatial support of the filters during training, as well as model the continuous evolution of feature maps, linking DCNs closely to biological models. We show that DCNs are versatile and highly applicable to standard image classification and reconstruction problems, where they improve parameter and data efficiency, and allow for meta-parametrization. We illustrate the biological plausibility of the scale distributions learned by DCNs and explore their performance in a neuroscientifically inspired pattern completion task. Finally, we investigate an efficient implementation of DCNs by changing input contrast.
