What should a neuron aim for? Designing local objective functions based on information theory

Andreas C. Schneider; Valentin Neuhaus; David A. Ehrlich; Abdullah Makkeh; Alexander S. Ecker; Viola Priesemann; Michael Wibral

What should a neuron aim for? Designing local objective functions based on information theory

Andreas C. Schneider, Valentin Neuhaus, David A. Ehrlich, Abdullah Makkeh, Alexander S. Ecker, Viola Priesemann, Michael Wibral

TL;DR

The paper tackles the opacity of neuron-level learning in globally trained networks by introducing infomorphic neurons that optimize local objectives derived from Partial Information Decomposition (PID). By structuring inputs into feedforward $F$, context $C$, and lateral $L$ signals, and formulating a per-neuron objective $G = \bm{\gamma}^T \mathbf{\Pi}$ over PID atoms, the approach enables self-organized learning with interpretable information-processing goals. The authors demonstrate both bivariate and trivariate instantiations, showing that trivariate networks can achieve MNIST-level classification performance close to backpropagation, while providing insights into which PID atoms drive learning via hyperparameter optimization. The work advances a principled, information-theoretic foundation for local learning with practical, interpretable dynamics, and releases code to reproduce the results.

Abstract

In modern deep neural networks, the learning dynamics of the individual neurons is often obscure, as the networks are trained via global optimization. Conversely, biological systems build on self-organized, local learning, achieving robustness and efficiency with limited global information. We here show how self-organization between individual artificial neurons can be achieved by designing abstract bio-inspired local learning goals. These goals are parameterized using a recent extension of information theory, Partial Information Decomposition (PID), which decomposes the information that a set of information sources holds about an outcome into unique, redundant and synergistic contributions. Our framework enables neurons to locally shape the integration of information from various input classes, i.e. feedforward, feedback, and lateral, by selecting which of the three inputs should contribute uniquely, redundantly or synergistically to the output. This selection is expressed as a weighted sum of PID terms, which, for a given problem, can be directly derived from intuitive reasoning or via numerical optimization, offering a window into understanding task-relevant local information processing. Achieving neuron-level interpretability while enabling strong performance using local learning, our work advances a principled information-theoretic foundation for local learning strategies.

What should a neuron aim for? Designing local objective functions based on information theory

TL;DR

Abstract

What should a neuron aim for? Designing local objective functions based on information theory

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)