Learning Approximate and Exact Numeral Systems via Reinforcement Learning
Emil Carlsson, Devdatt Dubhashi, Fredrik D. Johansson
TL;DR
The paper addresses how efficient numeral systems can emerge through learning by having two agents engage in a reinforcement-learning driven Lewis signaling game to convey numeral concepts. The approach yields exact and approximate numeral systems that are near-optimal under information-theoretic costs and resemble human systems of comparable complexity, providing a mechanistic learning-based explanation for prior findings. The results demonstrate that non-recursive numeral systems can be learned and aligned with Gaussian Weber-model representations, suggesting broad applicability to other semantic domains. The work offers a principled framework linking reward structures to communicative efficiency and points to future directions such as larger ranges, approximate arithmetic, recursion, and pragmatic reasoning enhancements.
Abstract
Recent work (Xu et al., 2020) has suggested that numeral systems in different languages are shaped by a functional need for efficient communication in an information-theoretic sense. Here we take a learning-theoretic approach and show how efficient communication emerges via reinforcement learning. In our framework, two artificial agents play a Lewis signaling game where the goal is to convey a numeral concept. The agents gradually learn to communicate using reinforcement learning and the resulting numeral systems are shown to be efficient in the information-theoretic framework of Regier et al. (2015); Gibson et al. (2017). They are also shown to be similar to human numeral systems of same type. Our results thus provide a mechanistic explanation via reinforcement learning of the recent results in Xu et al. (2020) and can potentially be generalized to other semantic domains.
