Emergence of Functionally Differentiated Structures via Mutual Information Minimization in Recurrent Neural Networks
Yuki Tomoda, Ichiro Tsuda, Yutaka Yamaguti
TL;DR
This work investigates how functional differentiation and modularity emerge in recurrent neural networks under a global information-theoretic constraint. By minimizing mutual information between two predefined neural subgroups using a MINE-based estimator, the authors demonstrate that functional modularity arises early in learning while structural modularity develops more gradually, across two tasks: a 2-bit working memory benchmark and chaotic signal separation of Lorenz and Rössler dynamics. Across both tasks, networks achieve high task performance with distinct functional modules; output and input weight specializations align with the functional roles, and functional separation often precedes structural reorganization. The findings offer a principled perspective on how information-theoretic constraints can shape brain-like differentiation and modularity, with implications for biological understanding and the design of modular, multi-task AI systems, especially when guided by sparsity constraints to translate function into structure.
Abstract
Functional differentiation in the brain emerges as distinct regions specialize and is key to understanding brain function as a complex system. Previous research has modeled this process using artificial neural networks with specific constraints. Here, we propose a novel approach that induces functional differentiation in recurrent neural networks by minimizing mutual information between neural subgroups via mutual information neural estimation. We apply our method to a 2-bit working memory task and a chaotic signal separation task involving Lorenz and Rössler time series. Analysis of network performance, correlation patterns, and weight matrices reveals that mutual information minimization yields high task performance alongside clear functional modularity and moderate structural modularity. Importantly, our results show that functional differentiation, which is measured through correlation structures, emerges earlier than structural modularity defined by synaptic weights. This suggests that functional specialization precedes and probably drives structural reorganization within developing neural networks. Our findings provide new insights into how information-theoretic principles may govern the emergence of specialized functions and modular structures during artificial and biological brain development.
