Supervised and Unsupervised protocols for hetero-associative neural networks
Andrea Alessandrelli, Adriano Barra, Andrea Ladiana, Andrea Lepre, Federico Ricci-Tersenghi
TL;DR
This work develops a three-directional associative memory (TAM) framework with three binary layers and $K$ archetypes, learned under supervised and unsupervised Hebbian protocols. It leverages replica-symmetric analysis and Guerra interpolation to derive exact self-consistency equations for order parameters and to map retrieval vs. noise phase diagrams, validated by Monte Carlo simulations on random and structured data. A key finding is layer cooperativity: information-rich layers can bolster poorer ones, expanding the retrieval region beyond layer-local noise constraints. The results illuminate principled design guidelines for hetero-associative architectures and connect to neurobiological concepts of pattern completion and pattern separation, with potential cross-disciplinary insights for AI and neuroscience.
Abstract
This paper introduces a learning framework for Three-Directional Associative Memory (TAM) models, extending the classical Hebbian paradigm to both supervised and unsupervised protocols within an hetero-associative setting. These neural networks consist of three interconnected layers of binary neurons interacting via generalized Hebbian synaptic couplings that allow learning, storage and retrieval of structured triplets of patterns. By relying upon glassy statistical mechanical techniques (mainly replica theory and Guerra interpolation), we analyze the emergent computational properties of these networks, at work with random (Rademacher) datasets and at the replica-symmetric level of description: we obtain a set of self-consistency equations for the order parameters that quantify the critical dataset sizes (i.e. their thresholds for learning) and describe the retrieval performance of these networks, highlighting the differences between supervised and unsupervised protocols. Numerical simulations validate our theoretical findings and demonstrate the robustness of the captured picture about TAMs also at work with structured datasets. In particular, this study provides insights into the cooperative interplay of layers, beyond that of the neurons within the layers, with potential implications for optimal design of artificial neural network architectures.
