A Computational Framework for Modeling Emergence of Color Vision in the Human Brain
Atsunobu Kotani, Ren Ng
TL;DR
This work tackles how the human cortex could infer the true color dimensionality from optic nerve signals by proposing a computational framework that jointly simulates the eye and cortex. Color is represented as an $N$-dimensional cortical space, with a learnable dimensionality $K$ that emerges from self-supervised temporal prediction of retinal signal fluctuations during fixational eye movements. The framework combines a biologically grounded eye model, a three-part cortical decoding/translation/re-encoding pipeline with neural buckets for retinal encoding properties, and quantitative (CMF-SIM) and qualitative (NS) measures to demonstrate mono-, di-, tri-, and tetrachromatic emergence, including gene-therapy-like boosts from $3$D to $4$D. The results offer a novel perspective on color perception as an emergent property of cortical inference, with potential implications for computational imaging, neurobiology, and vision augmentation.
Abstract
It is a mystery how the brain decodes color vision purely from the optic nerve signals it receives, with a core inferential challenge being how it disentangles internal perception with the correct color dimensionality from the unknown encoding properties of the eye. In this paper, we introduce a computational framework for modeling this emergence of human color vision by simulating both the eye and the cortex. Existing research often overlooks how the cortex develops color vision or represents color space internally, assuming that the color dimensionality is known a priori; however, we argue that the visual cortex has the capability and the challenge of inferring the color dimensionality purely from fluctuations in the optic nerve signals. To validate our theory, we introduce a simulation engine for biological eyes based on established vision science and generate optic nerve signals resulting from looking at natural images. Further, we propose a bio-plausible model of cortical learning based on self-supervised prediction of optic nerve signal fluctuations under natural eye motions. We show that this model naturally learns to generate color vision by disentangling retinal invariants from the sensory signals. When the retina contains N types of color photoreceptors, our simulation shows that N-dimensional color vision naturally emerges, verified through formal colorimetry. Using this framework, we also present the first simulation work that successfully boosts the color dimensionality, as observed in gene therapy on squirrel monkeys, and demonstrates the possibility of enhancing human color vision from 3D to 4D.
