Harmonizing Program Induction with Rate-Distortion Theory
Hanqi Zhou, David G. Nagy, Charley M. Wu
TL;DR
This work addresses how Rate Distortion Theory (RDT) can be integrated with program induction to model resource-bounded learning of structured representations. It introduces a three-way trade among description length $R_l$, distortion $D$, and search budget $R_s$, and applies it to melody learning using Bayesian program induction with a typed combinatory logic, a PCFG prior, and adaptor grammars (AGs) that maintain a shared library across tasks. The study shows that a shared library enables more efficient, compact representations and better generalization, while revealing curriculum sensitivity; it further demonstrates that partial information decomposition (PID) can guide the design of synergistic curricula that enhance library usefulness. These results provide a normative, resource-bounded account of how compositional programs might be learned and reused across tasks, with implications for understanding human learning and for building curriculum-aware AI systems that leverage reusable subprograms.
Abstract
Many aspects of human learning have been proposed as a process of constructing mental programs: from acquiring symbolic number representations to intuitive theories about the world. In parallel, there is a long-tradition of using information processing to model human cognition through Rate Distortion Theory (RDT). Yet, it is still poorly understood how to apply RDT when mental representations take the form of programs. In this work, we adapt RDT by proposing a three way trade-off among rate (description length), distortion (error), and computational costs (search budget). We use simulations on a melody task to study the implications of this trade-off, and show that constructing a shared program library across tasks provides global benefits. However, this comes at the cost of sensitivity to curricula, which is also characteristic of human learners. Finally, we use methods from partial information decomposition to generate training curricula that induce more effective libraries and better generalization.
