Quick and Accurate Affordance Learning
Fedor Scholz, Erik Ayari, Johannes Bertram, Martin V. Butz
TL;DR
The paper tackles how agents can efficiently learn environmental affordances through active exploration by coupling an allocentric cognitive map with local affordance learning inside a deep architecture. An end-to-end framework combines a fixed look-up map, a local affordance encoder, and a transition predictor to forecast positional changes, trained via negative log-likelihood. The study compares three uncertainty-guided exploration signals—aleatoric mean, epistemic SD, and Jensen-Shannon divergence across ensemble predictions—finding Jensen-Shannon divergence offers the most balanced and data-efficient learning. Key findings show that encoding affordances locally, coordinating navigation with local motor control, and leveraging density-based ensemble disagreements yield faster, more robust knowledge gain, with implications for robotic curricula and insights into developmental learning processes.
Abstract
Infants learn actively in their environments, shaping their own learning curricula. They learn about their environments' affordances, that is, how local circumstances determine how their behavior can affect the environment. Here we model this type of behavior by means of a deep learning architecture. The architecture mediates between global cognitive map exploration and local affordance learning. Inference processes actively move the simulated agent towards regions where they expect affordance-related knowledge gain. We contrast three measures of uncertainty to guide this exploration: predicted uncertainty of a model, standard deviation between the means of several models (SD), and the Jensen-Shannon Divergence (JSD) between several models. We show that the first measure gets fooled by aleatoric uncertainty inherent in the environment, while the two other measures focus learning on epistemic uncertainty. JSD exhibits the most balanced exploration strategy. From a computational perspective, our model suggests three key ingredients for coordinating the active generation of learning curricula: (1) Navigation behavior needs to be coordinated with local motor behavior for enabling active affordance learning. (2) Affordances need to be encoded locally for acquiring generalized knowledge. (3) Effective active affordance learning mechanisms should use density comparison techniques for estimating expected knowledge gain. Future work may seek collaborations with developmental psychology to model active play in children in more realistic scenarios.
