The Cooperative Network Architecture: Learning Structured Networks as Representation of Sensory Patterns
Pascal J. Sager, Jan M. Deriu, Benjamin F. Grewe, Thilo Stadelmann, Christoph von der Malsburg
TL;DR
The Cooperative Network Architecture (CNA) addresses robust object representation by representing sensory inputs as coherent nets $ \mathcal{N}^*$ assembled from overlapping net fragments $\mathcal{F}$ learned through input statistics and Hebbian plasticity. It implements a two-stage, time-evolving system: a fixed-feature Stage 1 and a dynamic Stage 2 that learns and composes net fragments with competitive neurons and global attenuation, enabling figure completion and noise filtering. Empirical results on simple line patterns show compositional generalization to unseen structures and strong noise robustness (up to 59% Gaussian noise) with favorable comparison to autoencoders in out-of-distribution scenarios. This work presents a biologically inspired neural-coding framework that couples local feature processing with global, coherent structure formation, offering a potential path toward invariant object recognition and scalable, network-based perception, with future extensions to multi-area architectures.
Abstract
We introduce the Cooperative Network Architecture (CNA), a model that represents sensory signals using structured, recurrently connected networks of neurons, termed "nets." Nets are dynamically assembled from overlapping net fragments, which are learned based on statistical regularities in sensory input. This architecture offers robustness to noise, deformation, and generalization to out-of-distribution data, addressing challenges in current vision systems from a novel perspective. We demonstrate that net fragments can be learned without supervision and flexibly recombined to encode novel patterns, enabling figure completion and resilience to noise. Our findings establish CNA as a promising paradigm for developing neural representations that integrate local feature processing with global structure formation, providing a foundation for future research on invariant object recognition.
