Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus via Model-Based RL
Jihwan Lee, Woochang Sim, Sejin Kim, Sundong Kim
TL;DR
This work addresses the challenge of analogical reasoning in ARC by evaluating model-based RL against model-free baselines. Using DreamerV3, a latent-world-model agent, and PPO on restricted ARC tasks, the study demonstrates that internal models enable more data-efficient learning and better generalization to similar tasks, including effective adaptation from pre-trained tasks. Notably, DreamerV3 shows strong performance on $3 \times 3$ diagonal flips and benefits from high-quality pre-training, though learning dynamics include interim drops that may reflect conceptual consolidation. The findings suggest that internal world models enhance analogical reasoning in structured environments and point to promising directions in meta-learning and transfer learning for robust generalization to untrained ARC tasks.
Abstract
This paper demonstrates that model-based reinforcement learning (model-based RL) is a suitable approach for the task of analogical reasoning. We hypothesize that model-based RL can solve analogical reasoning tasks more efficiently through the creation of internal models. To test this, we compared DreamerV3, a model-based RL method, with Proximal Policy Optimization, a model-free RL method, on the Abstraction and Reasoning Corpus (ARC) tasks. Our results indicate that model-based RL not only outperforms model-free RL in learning and generalizing from single tasks but also shows significant advantages in reasoning across similar tasks.
