ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca, Shikhar Bahl, Deepak Pathak
TL;DR
ALAN tackles scalable real-world robotic learning by coupling an environment-change signal with agent-centric uncertainty to drive autonomous exploration, using a world model (RSSM) and planning via Cross-Entropy Method. It also leverages Detic-based visual priors and a kNN-driven goal-reaching mechanism to achieve zero-shot manipulation goals from goal images, demonstrated on two real play kitchens with a Franka Panda in under 150 trajectories. The key contributions are the dual-signal exploration framework, the integration of offline visual priors to focus data collection, and empirical validation showing improved exploration data quality and robust goal attainment in real-world manipulation. This work advances practical autonomous robotics by reducing reliance on task-specific rewards and human demonstrations while enabling efficient data collection and transfer to user-defined goals.
Abstract
Robotic agents that operate autonomously in the real world need to continuously explore their environment and learn from the data collected, with minimal human supervision. While it is possible to build agents that can learn in such a manner without supervision, current methods struggle to scale to the real world. Thus, we propose ALAN, an autonomously exploring robotic agent, that can perform tasks in the real world with little training and interaction time. This is enabled by measuring environment change, which reflects object movement and ignores changes in the robot position. We use this metric directly as an environment-centric signal, and also maximize the uncertainty of predicted environment change, which provides agent-centric exploration signal. We evaluate our approach on two different real-world play kitchen settings, enabling a robot to efficiently explore and discover manipulation skills, and perform tasks specified via goal images. Website at https://robo-explorer.github.io/
