Toward a Metrology for Artificial Intelligence: Hidden-Rule Environments and Reinforcement Learning
Christo Mathew, Wentian Wang, Jacob Feldman, Lazaros K. Gallos, Paul B. Kantor, Vladimir Menkov, Hao Wang
TL;DR
This work probes AI metrology by using the GOHR environment to study how agents infer hidden rules under partial observations. It compares Feature-Centric and Object-Centric state encodings fed into a Transformer-based A2C policy/critic, assessing learning efficiency, transfer, and generalization across diverse rule sets. Through independent-rule, transfer, and generalization experiments, the study reveals that OC representations yield more stable cross-metric difficulty and better generalization, while FC representations exhibit metric-dependent variability; transfer effects indicate that learning component rules speeds up compound-rule acquisition when pretraining is comprehensive. The results motivate a formal notion of a difficulty space for hidden-rule tasks and suggest a path toward a Cognodynamics-like science for measuring and teaching artificial intelligence.
Abstract
We investigate reinforcement learning in the Game Of Hidden Rules (GOHR) environment, a complex puzzle in which an agent must infer and execute hidden rules to clear a 6$\times$6 board by placing game pieces into buckets. We explore two state representation strategies, namely Feature-Centric (FC) and Object-Centric (OC), and employ a Transformer-based Advantage Actor-Critic (A2C) algorithm for training. The agent has access only to partial observations and must simultaneously infer the governing rule and learn the optimal policy through experience. We evaluate our models across multiple rule-based and trial-list-based experimental setups, analyzing transfer effects and the impact of representation on learning efficiency.
