Curiosity-Driven Imagination: Discovering Plan Operators and Learning Associated Policies for Open-World Adaptation
Pierrick Lorang, Hong Lu, Matthias Scheutz
TL;DR
Open-world robotics require fast adaptation to unforeseen dynamics; this work proposes a bi-level neuro-symbolic framework that merges symbolic planning with a curiosity-driven neural model. Lifted symbolic operators learned from interactions are used to build LTL based reward machines, while the imaginary planning domain guides exploration and planning. An adapting executor and a mechanism to refine or create new operators enable rapid accommodation of novelties. In RoboSuite pick and place with sequential novelties, Bi-Model delivers faster convergence and higher asymptotic success than state-of-the-art hybrids, demonstrating improved sample efficiency and robustness.
Abstract
Adapting quickly to dynamic, uncertain environments-often called "open worlds"-remains a major challenge in robotics. Traditional Task and Motion Planning (TAMP) approaches struggle to cope with unforeseen changes, are data-inefficient when adapting, and do not leverage world models during learning. We address this issue with a hybrid planning and learning system that integrates two models: a low level neural network based model that learns stochastic transitions and drives exploration via an Intrinsic Curiosity Module (ICM), and a high level symbolic planning model that captures abstract transitions using operators, enabling the agent to plan in an "imaginary" space and generate reward machines. Our evaluation in a robotic manipulation domain with sequential novelty injections demonstrates that our approach converges faster and outperforms state-of-the-art hybrid methods.
