Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics
Charlotte Beylier, Simon M. Hofmann, Nico Scherf
TL;DR
This work addresses the opacity of RL learning beyond reward signals by introducing ATOMs, attention-oriented metrics derived from $LRP$ saliency maps, to track how agents allocate attention during training. It defines two metrics, hierarchical-attention and combinatorial-attention, and validates them through three Pong variations and a Dual Ball Discrimination Test to link attention with behaviour. Results show game-specific attention patterns that align with observed actions and reveal a phased learning trajectory where paddle-focused attention emerges alongside performance gains. ATOMs offer a quantitative, controllable framework to understand attention-learning dynamics and could aid in diagnosing observational overfitting and guiding exploration in RL systems, with broader implications for neuroscience-inspired perspectives on learning.
Abstract
The learning process of a reinforcement learning (RL) agent remains poorly understood beyond the mathematical formulation of its learning algorithm. To address this gap, we introduce attention-oriented metrics (ATOMs) to investigate the development of an RL agent's attention during training. In a controlled experiment, we tested ATOMs on three variations of a Pong game, each designed to teach the agent distinct behaviours, complemented by a behavioural assessment. ATOMs successfully delineate the attention patterns of an agent trained on each game variation, and that these differences in attention patterns translate into differences in the agent's behaviour. Through continuous monitoring of ATOMs during training, we observed that the agent's attention developed in phases, and that these phases were consistent across game variations. Overall, we believe that ATOM could help improve our understanding of the learning processes of RL agents and better understand the relationship between attention and learning.
