Learning Affordances from Interactive Exploration using an Object-level Map
Paula Wulkop, Halil Umut Özdemir, Antonia Hüfner, Jen Jen Chung, Roland Siegwart, Lionel Ott
TL;DR
This work tackles robot-centric affordance learning in unknown environments by integrating an object-level map into an interactive exploration loop. It combines a reinforcement-learning-driven exploration policy with a TSDF++ object-level map to re-identify object instances and propagate interaction labels across viewpoints, while periodically retraining a U-Net affordance predictor on episode-generated data. A key contribution is the explicit integration of object-level mapping into the exploration loop, which yields higher interaction success rates and faster, more accurate affordance predictions, as evidenced by improved Affordance IoU and object-level accuracy over baselines. The approach offers a data-efficient pathway toward robot-specific affordance understanding in realistic scenes and lays groundwork for real-world transfer and extension to more complex object interactions.
Abstract
Many robotic tasks in real-world environments require physical interactions with an object such as pick up or push. For successful interactions, the robot needs to know the object's affordances, which are defined as the potential actions the robot can perform with the object. In order to learn a robot-specific affordance predictor, we propose an interactive exploration pipeline which allows the robot to collect interaction experiences while exploring an unknown environment. We integrate an object-level map in the exploration pipeline such that the robot can identify different object instances and track objects across diverse viewpoints. This results in denser and more accurate affordance annotations compared to state-of-the-art methods, which do not incorporate a map. We show that our affordance exploration approach makes exploration more efficient and results in more accurate affordance prediction models compared to baseline methods.
